JP5004877B2

JP5004877B2 - Image encoder, image decoder, image encoding method, image decoding method, and program

Info

Publication number: JP5004877B2
Application number: JP2008147407A
Authority: JP
Inventors: 孝之仲地; 喜秀外村; 竜也藤井
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2008-06-04
Filing date: 2008-06-04
Publication date: 2012-08-22
Anticipated expiration: 2028-06-04
Also published as: JP2009296277A

Description

本発明は、画像符号化器及び画像復号化器及び画像符号化方法及び画像復号化方法及びプログラムに係り、特に、軽量符号化や多視点画像の符号化、すなわち、互いを独立に符号化する必要のあるシステム、及び、無線やインターネット等の損失が発生するネットワーク通信システムの画像符号化器及び画像復号化器及び画像符号化方法及び画像復号化方法及びプログラムに関する。 The present invention relates to an image encoder, an image decoder, an image encoding method, an image decoding method, and a program, and in particular, lightweight encoding and multi-view image encoding, that is, encoding each other independently. The present invention relates to an image encoder, an image decoder, an image encoding method, an image decoding method, and a program of a necessary system and a network communication system in which loss such as wireless or Internet occurs.

近年、アナログ信号システムからディジタル信号システムへと移行しており、ディジタル画像の需要が増加している。しかし、ディジタル画像はそのままではデータ量が膨大になることから、画像を効率的に圧縮する符号化技術が重要なものになっており、国際標準アルゴリズムが広く用いられている。例としてティジタルＴＶにはMPEG-2が用いられており、Blu-ray Discの規格にはH.264/AVCが用いられている。これらの画像圧縮アルゴリズムの具体的な圧縮アルゴリズムは、まず、エンコーダ側では動き補償（ＭＣ）が行われ、時間的な信号の冗長性が除去され、その後、離散コサイン変換（ＤＣＴ）等の周波数変換により空間的な信号の冗長性が除去された後に、エントロピー符号化が行われる。デコーダ側は真逆の処理を行うことにより復号される。 In recent years, there has been a shift from analog signal systems to digital signal systems, and the demand for digital images has increased. However, since a digital image has a huge amount of data as it is, an encoding technique for efficiently compressing the image is important, and an international standard algorithm is widely used. As an example, MPEG-2 is used for digital TV, and H.264 / AVC is used for the Blu-ray Disc standard. The specific compression algorithm of these image compression algorithms is such that motion compensation (MC) is first performed on the encoder side, temporal signal redundancy is removed, and then frequency conversion such as discrete cosine transform (DCT) is performed. Thus, entropy coding is performed after the redundancy of the spatial signal is removed. The decoder side performs decoding by performing the reverse process.

一方、近年、Distributed Video Coding (DVC)と呼ばれる新しいパラダイムの映像符号化方式が注目されている。DVCは、Distributed Source Coding (DSC)の原理をビデオ符号化に応用したものである。DSCは、図８に示すように、複数の相関のある情報源に対して互いを観測することなく分散して符号化し、受信側ではそれらの各データを一括して復号するシステムである。DSCはもともと、１９７０年代にSlepianとWolfにより確立されたSlepian-Wolf定理（例えば、非特許文献１参照）及び、WynerとZivにより確立されたWyner-Ziv定理（例えば、非特許文献２参照）に基づいている。Slepian-Wolf定理は、２つの情報源を分散符号化した場合における無歪状態で復号できる許容圧縮レート領域を与えたものであり、Wyner-Ziv定理は２つの情報源において１つの情報源に歪が発生した場合についてレート歪領域を与えたものである。これらの定理により、分散して符号化する場合の圧縮限界は、Slepian-Wolf定理及びWyner-Ziv定理の条件の範囲内にてではあるが、分散しなかった場合と等しいことが証明されている。近年では、Turbo符号を用いてSlepian-Wolf定理やWyner-Ziv定理で与えられた限界にどの程度迫れるかが研究されている（例えば、非特許文献３、非特許文献４参照）。 On the other hand, in recent years, a new paradigm video encoding method called Distributed Video Coding (DVC) has attracted attention. DVC is an application of the principle of Distributed Source Coding (DSC) to video coding. As shown in FIG. 8, the DSC is a system in which a plurality of correlated information sources are distributed and encoded without observing each other, and each data is decoded on the receiving side. DSC was originally developed in the 1970s by the Slepian-Wolf theorem established by Slepian and Wolf (for example, see Non-Patent Document 1) and the Wyner-Ziv theorem established by Wyner and Ziv (for example, Non-Patent Document 2). Is based. The Slepian-Wolf theorem gives an allowable compression rate region that can be decoded without distortion when two information sources are distributedly encoded. The Wyner-Ziv theorem distorts one information source in two information sources. A rate distortion region is given in the case of occurrence of From these theorems, it is proved that the compression limit when encoding in a distributed manner is within the range of the conditions of the Slepian-Wolf theorem and the Wyner-Ziv theorem, but is the same as the case without the dispersion. . In recent years, research has been conducted on how close to the limits given by the Slepian-Wolf theorem and Wyner-Ziv theorem using Turbo codes (see, for example, Non-Patent Document 3 and Non-Patent Document 4).

DSCの理論を映像符号化に応用したDVCは、その分散性から、軽量エンコーダ、ロバスト符号化及び多視点映像符号化などへの応用が期待されている。例えば、近年標準化された動画像の圧縮アルゴリズムは、圧縮効率こそ高くはなっているが、それに伴い演算量も膨大になっている。H.264では大量の演算処理が必要であり、MPEG-2に比べて演算処理量は５〜１０倍程度増加することが知られている。この大量の演算処理は複雑なフレーム間予測や可変マクロブックサイズの最適化等に必要な演算であり、エンコーダ側において必要となる。これに対して、たとえデコーダ側の演算処理は増えたとしてもエンコーダ側の処理が少ない方が好ましいという要求がセンサカメラや携帯電話の動画処理などのアプリケーションでは考えられる。DVCでは、エンコーダ側の高負荷演算処理をデコーダ側に移行した符号化方式であり、軽量エンコーダが実現できる。 DVC, which applies DSC theory to video coding, is expected to be applied to lightweight encoders, robust coding, and multi-view video coding because of its dispersibility. For example, a compression algorithm for moving images that has been standardized in recent years has a high compression efficiency, but the amount of computation has also become enormous. H.264 requires a large amount of arithmetic processing, and it is known that the arithmetic processing amount increases by about 5 to 10 times compared to MPEG-2. This large amount of arithmetic processing is necessary for complicated inter-frame prediction, variable macro book size optimization, and the like, and is necessary on the encoder side. On the other hand, even if the arithmetic processing on the decoder side increases, there is a demand for applications such as sensor camera and mobile phone video processing that it is preferable that the processing on the encoder side be less. DVC is an encoding method in which heavy load calculation processing on the encoder side is shifted to the decoder side, and a lightweight encoder can be realized.

従来、MPEGに代表される符号化方式では予測誤差信号を情報源符号器を用いて圧縮／伸張していたが、DVCではKeyフレームからの補足情報を作成することによりWyner-Zivフレームの予測信号を生成し、その予測誤差信号をある種のエラーと捉え、通信路符号器でエラーを訂正することで圧縮／伸張を行う。しかし、Keyフレームからの補助情報の作成において、補足情報が外れた場合にパリティビットの再送等を行う必要があり、符号化効率も低下する。したがって、DVCにおいて、符号化効率を向上させるためには、予測精度の高い補助情報を作成することが重要となる。 Conventionally, in an encoding method typified by MPEG, a prediction error signal is compressed / decompressed using an information source encoder. In DVC, a prediction signal of a Wyner-Ziv frame is created by creating supplementary information from a Key frame. , The prediction error signal is regarded as a certain type of error, and the channel encoder corrects the error to perform compression / decompression. However, in the creation of auxiliary information from the Key frame, it is necessary to retransmit parity bits and the like when supplementary information is lost, and the coding efficiency also decreases. Therefore, in order to improve coding efficiency in DVC, it is important to create auxiliary information with high prediction accuracy.

DVCの実際の構成方法は種々あげられるが、図９に示すように、Wyner-Zivフレームの符号化及び復号にターボ符号を用いた方法が提案されている（例えば、非特許文献５参照）。さらに、符号化効率を改善するために、上記の非特許文献５の技術における画素領域DVCを拡張した変換領域DVCが提案されている（例えば、非特許文献６参照）。この変換領域DVCでは、DCTを用いることより信号の空間的冗長性を除去し、画素領域DVCと比較して符号化効率を改善している。 Although there are various methods for configuring DVC, as shown in FIG. 9, a method using a turbo code for encoding and decoding a Wyner-Ziv frame has been proposed (see, for example, Non-Patent Document 5). Furthermore, in order to improve encoding efficiency, a transform area DVC in which the pixel area DVC in the technique of Non-Patent Document 5 described above is extended has been proposed (see Non-Patent Document 6, for example). In this transform area DVC, the spatial redundancy of the signal is removed by using DCT, and the coding efficiency is improved as compared with the pixel area DVC.

また、DCTに代わりウェーブレット変換を用いたウェーブレット領域DVCも提案されている（例えば、非特許文献７参照）。ウェーブレット領域DVCは、画素領域DVCと比較して符号化効率に優れることが報告されており、DCT領域DVCにはない解像度スケーラビリティやＳＮＲスケーラビリティといった優れた特徴も有している。
D. Slepian and J. K. Wolf, "Noiseless coding of correlated information sources," IEEE Trans. Inform. Theory, vol. 19, no.4, pp. 471-480, 1973） A. Wyner and J. Ziv, "The rate-distortion function for source coding with side information at the decoder," IEEE Trans. Inform. Theory, vol. 22, no. 1, pp1-19, 1976) J. Bajcsy and P. Mitran, "Coding for the Slepian - Wolf problem with turbo codes," in Proc. IEEE Global Communications Conf., vol.2, 2001, pp.1400-1404. A. Aaron and B. Girod, "Compression with side information using turbo codes," in Proc. IEEE Data Compression Conf., 2002, pp. 252-2561 B. Girod, A. Aaron, S. Rane and D. Rebollo-Menedero, "Distributed video coding," Proceedings of the IEEE, Special Issue on Video Coding and Delivery, vol. 93, no.1, pp.71-83, January 2005. A. Aaron, S. Rane, E. Setton and B. Girod, "Transform-domain Wyner-Ziv codec for video," VCIP-2004, San Jose, CA, Jan. 2004. Y. Tonomura, T. Nakachi and T. Fujii, "Distributed video coding using JPEG 2000 coding scheme," IEICE Trans. on Fundamentals, E90-A, pp. 581-589, March 2007. A wavelet domain DVC using wavelet transform instead of DCT has also been proposed (see, for example, Non-Patent Document 7). The wavelet region DVC has been reported to be superior in encoding efficiency compared to the pixel region DVC, and has excellent features such as resolution scalability and SNR scalability that the DCT region DVC does not have.
D. Slepian and JK Wolf, "Noiseless coding of correlated information sources," IEEE Trans. Inform. Theory, vol. 19, no.4, pp. 471-480, 1973) A. Wyner and J. Ziv, "The rate-distortion function for source coding with side information at the decoder," IEEE Trans. Inform. Theory, vol. 22, no. 1, pp1-19, 1976) J. Bajcsy and P. Mitran, "Coding for the Slepian-Wolf problem with turbo codes," in Proc.IEEE Global Communications Conf., Vol.2, 2001, pp.1400-1404. A. Aaron and B. Girod, "Compression with side information using turbo codes," in Proc. IEEE Data Compression Conf., 2002, pp. 252-2561 B. Girod, A. Aaron, S. Rane and D. Rebollo-Menedero, "Distributed video coding," Proceedings of the IEEE, Special Issue on Video Coding and Delivery, vol. 93, no.1, pp.71-83 , January 2005. A. Aaron, S. Rane, E. Setton and B. Girod, "Transform-domain Wyner-Ziv codec for video," VCIP-2004, San Jose, CA, Jan. 2004. Y. Tonomura, T. Nakachi and T. Fujii, "Distributed video coding using JPEG 2000 coding scheme," IEICE Trans. On Fundamentals, E90-A, pp. 581-589, March 2007.

上記で述べたようにDVCは多視点映像符号化やＩＰネットワーク等のベストエフォート型の通信システムに対する符号化として効果が期待されている。しかし、DVCはMPEGなどエンコーダ側で冗長性を除去する方式に比べるとまだまだ符号化効率が悪いという問題がある。 As described above, the DVC is expected to be effective as a coding for a best-effort communication system such as multi-view video coding or IP network. However, DVC has a problem that encoding efficiency is still lower than a method such as MPEG which removes redundancy on the encoder side.

本発明は、上記の点に鑑みなされたもので、高い符号化効率を実現可能な画像符号化器及び画像復号化器及び画像符号化方法及び画像復号化方法及びプログラムを提供することを目的とする。 The present invention has been made in view of the above points, and an object thereof is to provide an image encoder, an image decoder, an image encoding method, an image decoding method, and a program capable of realizing high encoding efficiency. To do.

図１は、本発明の原理構成図である。 FIG. 1 is a principle configuration diagram of the present invention.

本発明（請求項１）は、ウェーブレット領域Distributed Video Coding (DVC)を用いて動画像フレームを符号化する画像符号化器であって、
入力されたKeyフレームを符号化する符号化手段３５と、
入力されたWyner-Zivフレームをウェーブレット領域に変換するDWT１１０と、
ウェーブレット領域の信号を量子化し、最低周波数帯域のサブバンドをビットプレーンに分割する量子化手段１２０と、
復号器２００からのモード切替判定結果に基づいて、量子化手段１２０からの出力のエントロピー符号化を行うエントロピー符号化手段１５０と、
復号器か２００らのモード切替判定結果に基づいて、量子化手段１２０により取得したビットプレーンについてWyner-Ziv符号化を行うSlepian-Wolf符号化手段１４０と、
復号器２００から取得したモード判定結果に基づいて量子化手段１２０からの出力をエントロピー符号化手段１５０または、Slepian-Wolf符号化手段１４０のいずれかに振り分けるデマルチプレクサ１３０と、を有し、
Slepian-Wolf符号化手段１４０は、
復号器に、Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報の相関を表す指標であるSAD(Sum of Absolute Differences)を計算した結果、それらのSADの中で最も小さな値のSADが得られる補助情報を該復号器から取得して符号化を行う手段を含み、
デマルチプレクサ１３０は、
復号器２００から取得した、復号済みサブバンドLL_sのMAD（Mean Absolute Difference）とSADが、MAD＜μSAD（但し、μは定数）の場合は、エントロピー符号化手段１５０に符号化を指示し、MAD≧μSADの場合には、Wyner-Ziv符号化を行うSlepian-Wolf符号化手段１４０に符号化を指示する手段を含む。 The present invention (claim 1) is an image encoder for encoding a moving image frame using a wavelet domain Distributed Video Coding (DVC),
Encoding means 35 for encoding the input key frame;
A DWT 110 that converts the input Wyner-Ziv frame into a wavelet domain;
Quantizing means 120 for quantizing a wavelet domain signal and dividing subbands of the lowest frequency band into bit planes;
An entropy encoding unit 150 that performs entropy encoding of the output from the quantization unit 120 based on the mode switching determination result from the decoder 200;
A Slepian-Wolf encoding unit 140 that performs Wyner-Ziv encoding on the bit plane acquired by the quantization unit 120 based on the mode switching determination result from the decoder or 200;
A demultiplexer 130 that distributes the output from the quantization means 120 to either the entropy encoding means 150 or the Slepian-Wolf encoding means 140 based on the mode determination result obtained from the decoder 200,
Slepian-Wolf encoding means 140
Low frequency band divided into small rectangular blocks when the decoder has auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frames As a result of calculating SAD (Sum of Absolute Differences), which is an index representing the correlation between each decoded subband LL _s and each auxiliary information, the auxiliary information for obtaining the smallest SAD among those SADs is decoded. Means for obtaining and encoding from a container,
The demultiplexer 130 is
When MAD (Mean Absolute Difference) and SAD of the decoded subband LL _s obtained from the decoder 200 are MAD <μSAD (where μ is a constant), the entropy encoding means 150 is instructed to encode, In the case of MAD ≧ μSAD, there is included means for instructing encoding to the Slepian-Wolf encoding means 140 that performs Wyner-Ziv encoding.

本発明（請求項２）は、ウェーブレット領域DVCを用いて動画像フレームを復号化する画像復号化器であって、
入力されたKeyフレームを復号する復号手段４４と、
Wyner-Zivフレームについて、前向き予測、後ろ向き予測の両方向の予測による推定値をs回DWT変換することにより第sステージの最低周波数のサブバンドLL _L である補助情報を生成する補助情報生成手段２１０と、
Wyner-Zivフレームの最低周波数帯域のサブバンドを、補助情報を用いて再構成する再構成手段２６０と、
Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s，HH_sを、それぞれ矩形の小ブロックに分割し、小ブロック毎にエントロピー符号化または、Wyner-Ziv符号化を行うかの判定を行い、モード判定結果を符号化器に送るモード判定手段２７０と、
Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s、高周波数帯域のサブバンドHH_sのフレーム内符号化されたブロックをエントロピー復号化するエントロピー復号化手段２９０と、
Wyner-Ziv符号化されたブロックをSlepian-Wolf復号化するSlepian-Wolf復号化手段２４０と、を有し、
Slepian-Wolf復号化手段２４０は、
Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報の相関を表す指標であるSADを計算した結果、それらのSADの中で最も小さな値のSADが得られる補助情報を用いて復号化を行う手段を含み、
モード判定手段２７０は、
復号済みサブバンドLL_sのMADとSADが、MAD＜μSAD（但し、μは定数）の場合は、エントロピー符号化を行い、MAD≧μSADの場合には、Wyner-Ziv符号化を行うものと判定する手段を含む。 The present invention (Claim 2 ) is an image decoder for decoding a moving image frame using a wavelet domain DVC,
Decoding means 44 for decoding the input Key frame;
Auxiliary information generating means 210 for generating auxiliary information which is a subband LL _L of the lowest frequency of the s-th stage by performing s times DWT conversion on the estimated value of the forward prediction and the backward prediction for the Wyner-Ziv frame. ,
Reconstructing means 260 for reconstructing the subband of the lowest frequency band of the Wyner-Ziv frame using auxiliary information;
The subbands LH _s , HL _s , and HH _s in the middle and high frequency bands in the wavelet stage s of the Wyner-Ziv frame are divided into small rectangular blocks, and entropy coding or Wyner-Ziv coding is performed for each small block. Mode determination means 270 for determining whether to perform the determination and sending the mode determination result to the encoder;
Wyner-Ziv subbands high and high frequency bands in the wavelet first s stage frame LH _s, HL _s, the entropy decoding unit 290 for entropy decoding the intra-coded blocks of the subband HH _s in the high frequency band ,
And Slepian-Wolf decoding means 240 for Slepian-Wolf decoding the Wyner-Ziv encoded block,
Slepian-Wolf decoding means 240
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frame, the decoded sub-band of the low frequency band divided into rectangular small blocks As a result of calculating SAD, which is an index indicating the correlation between the band LL _s and each auxiliary information, including means for performing decoding using auxiliary information from which SAD having the smallest value among those SADs is obtained,
The mode determination means 270
If the MAD and SAD of the decoded subband LL _s are MAD <μSAD (where μ is a constant), entropy coding is performed. If MAD ≧ μSAD, Wyner-Ziv coding is determined. Means to do.

本発明（請求項３）は、ウェーブレット領域DVCを用いて動画像フレームを符号化する画像符号化器であって、
入力されたKeyフレームを符号化する符号化手段と、
入力されたWyner-Zivフレームをウェーブレット領域に変換するDWT(Discrete Wavelet Transform)と、
ウェーブレット領域の信号を量子化し、最低周波数帯域のサブバンドをビットプレーンに分割する量子化手段と、
復号器からのモード切替判定結果に基づいて、量子化手段からの出力のエントロピー符号化を行うエントロピー符号化手段と、
復号器からのモード切替判定結果に基づいて、量子化手段により取得したビットプレーンについてWyner-Ziv符号化を行うSlepian-Wolf符号化手段と、
復号器から取得したモード判定結果に基づいて量子化手段からの出力をエントロピー符号化手段または、Slepian-Wolf符号化手段のいずれかに振り分けるデマルチプレクサと、を有し、
Slepian-Wolf符号化手段は、
復号器に、Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報のMSEを計算した結果、それらのMSEの中で最も小さなMSE（Mean Square Error）が得られる補助情報を該復号器から取得して符号化を行う手段を含み、
デマルチプレクサは、
復号器から取得した、低周波数帯域の復号済みサブバンドLL_sの分散σとMSEが、σ＜μMSE（但し、μは定数）の場合は、エントロピー符号化手段に符号化を指示し、σ≧μMSEの場合には、Wyner-Ziv符号化を行うSlepian-Wolf符号化手段に符号化を指示する手段を含む。 The present invention (claim 3) is an image encoder for encoding a moving image frame using a wavelet domain DVC,
Encoding means for encoding the input Key frame;
DWT (Discrete Wavelet Transform) that transforms the input Wyner-Ziv frame into the wavelet domain,
Quantizing means for quantizing the wavelet domain signal and dividing the subband of the lowest frequency band into bit planes;
Entropy encoding means for performing entropy encoding of the output from the quantization means based on the mode switching determination result from the decoder;
Based on the mode switching determination result from the decoder, the Slepian-Wolf encoding means for performing Wyner-Ziv encoding on the bit plane acquired by the quantization means,
A demultiplexer that distributes the output from the quantization means to either the entropy encoding means or the Slepian-Wolf encoding means based on the mode determination result obtained from the decoder,
Slepian-Wolf encoding means
Low frequency band divided into small rectangular blocks when the decoder has auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frames As a result of calculating MSEs of the decoded subbands LL _s and the respective auxiliary information, auxiliary information from which the smallest MSE (Mean Square Error) among those MSEs is obtained is obtained from the decoder and encoded. Including means,
The demultiplexer
If the variance σ and MSE of the decoded subband LL _{s in} the low frequency band obtained from the decoder are σ <μMSE (where μ is a constant), the entropy coding means is instructed to encode, and σ ≧ In the case of μMSE, it includes means for instructing encoding to the Slepian-Wolf encoding means for performing Wyner-Ziv encoding.

本発明（請求項４）は、ウェーブレット領域DVCを用いて動画像フレームを復号化する画像復号化器であって、
入力されたKeyフレームを復号する復号手段と、
Wyner-Zivフレームについて、前向き予測、後ろ向き予測の両方向の予測による推定値をs回DWT変換することにより第sステージの最低周波数のサブバンドLL _L である補助情報を生成する補助情報生成手段と、
Wyner-Zivフレームの最低周波数帯域のサブバンドを、補助情報を用いて再構成する再構成手段と、
Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s，高周波数帯域のサブバンドHH_sを、それぞれ矩形の小ブロックに分割し、小ブロック毎にエントロピー符号化または、Wyner-Ziv符号化を行うかの判定を行い、モード判定結果を符号化器に送るモード判定手段と、
Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s，高周波数帯域のサブバンドHH_sのフレーム内符号化されたブロックをエントロピー復号化するエントロピー復号化手段と、
Wyner-Ziv符号化されたブロックをSlepian-Wolf復号化するSlepian-Wolf復号化手段と、を有し、
Slepian-Wolf復号化手段は、
Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報のMSEを計算した結果、それらのMSEの中で最も小さなMSEが得られる補助情報を用いて復号化を行う手段を含み、
モード判定手段は、
低周波数帯域の復号済みサブバンドLL_sの分散σとMSEが、σ＜μMSE（但し、μは定数）の場合は、エントロピー符号化を行い、σ≧μMSEの場合には、Wyner-Ziv符号化を行うものと判定する手段を含む。 The present invention (Claim 4 ) is an image decoder for decoding a moving image frame using a wavelet domain DVC,
Decryption means for decrypting the input Key frame;
For the Wyner-Ziv frame, auxiliary information generating means for generating auxiliary information that is the subband LL _L of the lowest frequency of the s-stage by performing s times DWT conversion on the estimated value by the forward prediction and the backward prediction in both directions ,
Reconstruction means for reconstructing the subband of the lowest frequency band of the Wyner-Ziv frame using auxiliary information;
The middle and high frequency band subbands LH _s and HL _s and the high frequency band subband HH _s in the wavelet stage s of the Wyner-Ziv frame are each divided into rectangular small blocks, and entropy coding is performed for each small block, or Mode determination means for determining whether to perform Wyner-Ziv encoding and sending a mode determination result to the encoder;
Wyner-Ziv subbands high and high frequency bands in the wavelet first s stage frame LH _s, and HL _s, entropy decoding unit entropy decodes the coded blocks in the frame of the sub-band HH _s in the high frequency band,
And Slepian-Wolf decoding means for Slepian-Wolf decoding the Wyner-Ziv encoded block,
Slepian-Wolf decoding means
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frame, the decoded sub-band of the low frequency band divided into rectangular small blocks As a result of calculating the MSE of the band LL _s and each auxiliary information, including means for decoding using the auxiliary information that can obtain the smallest MSE among those MSEs,
The mode judgment means is
Entropy coding is performed when the variance σ and MSE of the decoded subband LL _{s in} the low frequency band is σ <μMSE (where μ is a constant), and Wyner-Ziv coding when σ ≧ μMSE. Means for determining that the operation is to be performed.

図２は、本発明の原理を説明するための図である。 FIG. 2 is a diagram for explaining the principle of the present invention.

本発明（請求項５）は、ウェーブレット領域DVCを用いて動画像フレームを符号化する画像符号化方法であって、
符号化手段が、入力されたKeyフレームを符号化する符号化ステップ（ステップ１）と、
DWTが、入力されたWyner-Zivフレームをウェーブレット領域に変換するウェーブレット変換ステップ（ステップ２）と、
量子化手段が、ウェーブレット領域の信号を量子化し、最低周波数帯域のサブバンドをビットプレーンに分割する量子化ステップ（ステップ３）と、
デマルチプレクサが、復号器からのモード判定結果に基づいて量子化手段からの出力をエントロピー符号化手段または、Slepian-Wolf符号化手段のいずれかに振り分ける振り分けステップ（ステップ4、ステップ５）と、
エントロピー符号化手段が、量子化手段からの出力のエントロピー符号化を行うエントロピー符号化ステップ（ステップ６）と、
Slepian-Wolf符号化手段が、復号器からのモード切替判定結果に基づいて、量子化手段により取得したビットプレーンについてWyner-Ziv符号化を行うSlepian-Wolf符号化ステップ（ステップ７）と、を行い、
振り分けステップ（ステップ５）において、
復号器に、Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報の相関を表す指標であるSADを取得し、該復号済みサブバンドLLのMADと該SADが、MAD＜μSAD（但し、μは定数）の場合は、エントロピー符号化手段に符号化を指示し、MAD≧μSADの場合には、Wyner-Ziv符号化を行うSlepian-Wolf符号化手段に符号化を指示し、
Slepian-Wolf符号化ステップ（ステップ７）において、
SADの中で最も小さな値のSADが得られる補助情報を該復号器から取得して符号化を行う。 The present invention (Claim 5 ) is an image encoding method for encoding a moving image frame using a wavelet domain DVC,
An encoding step (step 1) in which the encoding means encodes the input key frame;
A wavelet transform step (step 2) in which the DWT transforms the input Wyner-Ziv frame into a wavelet domain;
A quantization step (step 3) in which the quantization means quantizes the wavelet domain signal and divides the subband of the lowest frequency band into bit planes;
A distribution step (steps 4 and 5) in which the demultiplexer distributes the output from the quantization unit to either the entropy encoding unit or the Slepian-Wolf encoding unit based on the mode determination result from the decoder;
An entropy encoding means for performing entropy encoding of the output from the quantization means (step 6);
The Slepian-Wolf encoding means performs a Slepian-Wolf encoding step (Step 7) for performing Wyner-Ziv encoding on the bit plane obtained by the quantization means based on the mode switching determination result from the decoder. ,
In the sorting step (step 5),
Low frequency band divided into small rectangular blocks when the decoder has auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frames get the decoded subband LL _s of SAD is an index representing a correlation of the respective auxiliary information, MAD and the SAD of the decoded sub-band LL is, MAD <μSAD (where, mu is a constant) in the case of Instructing encoding to the entropy encoding means, and if MAD ≧ μSAD, instructing encoding to the Slepian-Wolf encoding means for performing Wyner-Ziv encoding ,
In the Slepian-Wolf encoding step (Step 7),
Auxiliary information for obtaining the SAD having the smallest value among the SADs is obtained from the decoder and encoded .

本発明（請求項６）は、ウェーブレット領域DVCを用いて動画像フレームを復号化する画像復号化方法であって、
復号手段が、入力されたKeyフレームを復号する復号ステップと、
補助情報生成手段が、Wyner-Zivフレームについて、前向き予測、後ろ向き予測の両方向の予測による推定値をs回DWT変換することにより第sステージの最低周波数のサブバンドLL _L である補助情報を生成する補助情報生成ステップと、
再構成手段が、Wyner-Zivフレームの最低周波数帯域のサブバンドを、補助情報を用いて再構成する再構成ステップと、
モード判定手段が、Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s、高周波数帯域のサブバンドHH_sを、それぞれ矩形の小ブロックに分割し、小ブロック毎にエントロピー符号化または、Wyner-Ziv符号化を行うかの判定を行い、モード判定結果を符号化器に送るモード判定ステップと、
エントロピー復号化手段が、Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s、高周波数帯域のサブバンドHH_sのフレーム内符号化されたブロックをエントロピー復号化するエントロピー復号化ステップと、
Slepian-Wolf復号化手段が、Wyner-Ziv符号化されたブロックをSlepian-Wolf復号化するSlepian-Wolf復号化ステップと、を行い、
モード判定ステップにおいて、
Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報の相関を表す指標であるSADを取得し、該復号済みサブバンドLL_sのMADと該SADが、MAD＜μSAD（但し、μは定数）の場合は、エントロピー符号化を行い、MAD≧μSADの場合には、Wyner-Ziv符号化を行うものと判定し、
Slepian-Wolf復号化ステップにおいて、
SADの中で最も小さな値のSADが得られる補助情報を用いて復号化を行う。 The present invention (Claim 6) is an image decoding method for decoding a moving image frame using a wavelet domain DVC,
A decoding step in which the decoding means decodes the input Key frame;
Auxiliary information generation means generates auxiliary information that is the subband LL _L of the lowest frequency of the s-th stage by performing DWT transformation on the estimated value of the forward prediction and the backward prediction for the Wyner-Ziv frame s times. An auxiliary information generation step;
A reconstruction means for reconstructing the subband of the lowest frequency band of the Wyner-Ziv frame using auxiliary information;
The mode judging means divides the sub-bands LH _s and HL _s in the medium and high frequency band and the sub-band HH _s in the high frequency band in the wavelet stage s of the Wyner-Ziv frame into small rectangular blocks. A mode determination step of determining whether to perform entropy encoding or Wyner-Ziv encoding and sending a mode determination result to the encoder;
Entropy decoding means, to high and high frequency band of the subband LH _s, HL _s, entropy decoding the intra-coded blocks of the subband HH _s in the high frequency band in wavelet first s stage of Wyner-Ziv frames An entropy decoding step;
Slepian-Wolf decoding means performs a Slepian-Wolf decoding step of Slepian-Wolf decoding a Wyner-Ziv encoded block;
In the mode determination step,
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frame, the decoded sub-band of the low frequency band divided into rectangular small blocks get the band LL _s and SAD is an index representing a correlation of the respective auxiliary information, MAD and the SAD of the decoded subband LL _s is the case of MAD <μSAD (where, mu is a constant), entropy coding If MAD ≧ μSAD, it is determined that Wyner-Ziv encoding is performed ,
In the Slepian-Wolf decoding step,
Decoding is performed using auxiliary information that can obtain the SAD having the smallest value among the SADs .

本発明（請求項７）は、ウェーブレット領域DVCを用いて動画像フレームを符号化する画像符号化方法であって、
符号化手段が、入力されたKeyフレームを符号化する符号化ステップと、
DWTが、入力されたWyner-Zivフレームをウェーブレット領域に変換するウェーブレット変換ステップと、
量子化手段が、ウェーブレット領域の信号を量子化し、最低周波数帯域のサブバンドをビットプレーンに分割する量子化ステップと、
エントロピー符号化手段が、復号器からのモード切替判定結果に基づいて、量子化手段からの出力のエントロピー符号化を行うエントロピー符号化ステップと、
Slepian-Wolf復号化手段が、復号器からのモード切替判定結果に基づいて、量子化手段により取得したビットプレーンについてWyner-Ziv符号化を行うSlepian-Wolf符号化ステップと、
デマルチプレクサが、復号器から取得したモード判定結果に基づいて量子化手段からの出力をエントロピー符号化手段または、Slepian-Wolf符号化手段のいずれかに振り分ける振り分けステップと、を行い、
振り分けステップにおいて、
復号器に、Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、該復号器から矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報のMSEを計算した結果を取得し、該復号済みサブバンドLL_sの分散σと該MSEが、σ＜μMSE（但し、μは定数）の場合は、エントロピー符号化手段に符号化を指示し、σ≧μMSEの場合には、Wyner-Ziv符号化を行うSlepian-Wolf符号化手段に符号化を指示し、
Slepian-Wolf符号化ステップにおいて、
MSEの中で最も小さなMSE（Mean Square Error）が得られる補助情報を該復号器から取得して符号化を行う。 The present invention (Claim 7 ) is an image encoding method for encoding a moving image frame using a wavelet domain DVC,
An encoding step in which the encoding means encodes the input Key frame;
A wavelet transform step in which DWT transforms the input Wyner-Ziv frame into a wavelet domain;
A quantization means for quantizing a signal in the wavelet domain and dividing a subband of the lowest frequency band into bit planes;
An entropy encoding means for performing entropy encoding of the output from the quantization means based on the mode switching determination result from the decoder;
Slepian-Wolf decoding means, based on the mode switching determination result from the decoder, Slepian-Wolf encoding step for performing Wyner-Ziv encoding on the bit plane obtained by the quantization means;
A demultiplexer performs a distribution step of distributing the output from the quantization means to either the entropy encoding means or the Slepian-Wolf encoding means based on the mode determination result obtained from the decoder;
In the sorting step,
When the decoder has auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frames, it is divided into rectangular small blocks from the decoder. low frequency band of the decoded subband LL _s and each MSE of auxiliary information to get the result of the calculation, the variance sigma and the MSE of the decoded subband LL _s is, σ <μMSE (where, mu is a constant and ) Instruct the encoding to the entropy encoding means, and in the case of σ ≧ μMSE, instruct the encoding to the Slepian-Wolf encoding means to perform Wyner-Ziv encoding,
In the Slepian-Wolf encoding step,
Auxiliary information that provides the smallest MSE (Mean Square Error) among the MSEs is obtained from the decoder and encoded .

本発明（請求項８）は、ウェーブレット領域DVCを用いて動画像フレームを復号化する画像復号化方法であって、
復号手段が、入力されたKeyフレームを復号する復号ステップと、
補助情報生成手段が、Wyner-Zivフレームについて、前向き予測、後ろ向き予測の両方向の予測による推定値をs回DWT変換することにより第sステージの最低周波数のサブバンドLL _L である補助情報を生成する補助情報生成ステップと、
再構成手段が、Wyner-Zivフレームの最低周波数帯域のサブバンドを、補助情報を用いて再構成する再構成ステップと、
モード判定手段が、Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s、高周波数帯域のサブバンドHH_sを、それぞれ矩形の小ブロックに分割し、小ブロック毎にエントロピー符号化または、Wyner-Ziv符号化を行うかの判定を行い、モード判定結果を符号化器に送るモード判定ステップと、
エントロピー復号化手段が、Wyner-Zivフレームのウェーブレット第ｓステージにおける中高周波数帯域のサブバンドLH_s，HL_s、高周波数帯域のサブバンドHH_sのフレーム内符号化されたブロックをエントロピー復号化するエントロピー復号化ステップと、
Slepian-Wolf復号化手段が、Wyner-Ziv符号化されたブロックをSlepian-Wolf復号化するSlepian-Wolf復号化ステップと、を行い、
モード判定ステップにおいて、
Wyner-Zivフレームの前向き予測、後ろ向き予測、両方向予測を離散ウェーブレット変換して得られた４つのサブバンドである補助情報がある場合に、矩形の小ブロックに分割された低周波数帯域の復号済みサブバンドLL _ｓとそれぞれの補助情報のMSEを計算した結果を取得し、該復号済みサブバンドLL_sの分散σと該MSEが、σ＜μMSE（但し、μは定数）の場合は、エントロピー符号化を行い、σ≧μMSEの場合には、Wyner-Ziv符号化を行うものと判定し、
Slepian-Wolf復号化ステップにおいて、
MSEの中で最も小さなMSEが得られる補助情報を用いて復号化を行う、 The present invention (Claim 8 ) is an image decoding method for decoding a moving image frame using a wavelet domain DVC,
A decoding step in which the decoding means decodes the input Key frame;
Auxiliary information generation means generates auxiliary information that is the subband LL _L of the lowest frequency of the s-th stage by performing DWT transformation on the estimated value of the forward prediction and the backward prediction for the Wyner-Ziv frame s times. An auxiliary information generation step;
A reconstruction means for reconstructing the subband of the lowest frequency band of the Wyner-Ziv frame using auxiliary information;
The mode judging means divides the sub-bands LH _s and HL _s in the medium and high frequency band and the sub-band HH _s in the high frequency band in the wavelet stage s of the Wyner-Ziv frame into small rectangular blocks. A mode determination step of determining whether to perform entropy encoding or Wyner-Ziv encoding and sending a mode determination result to the encoder;
Entropy decoding means, to high and high frequency band of the subband LH _s, HL _s, entropy decoding the intra-coded blocks of the subband HH _s in the high frequency band in wavelet first s stage of Wyner-Ziv frames An entropy decoding step;
Slepian-Wolf decoding means performs a Slepian-Wolf decoding step of Slepian-Wolf decoding a Wyner-Ziv encoded block ;
In the mode determination step,
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction, and bidirectional prediction of Wyner-Ziv frame, the decoded sub-band of the low frequency band divided into rectangular small blocks Gets the results of calculating the MSE band LL _s and each of the auxiliary information, the variance σ and the MSE of the decoded subband LL _s is the case σ of <μMSE (where, mu is a constant), entropy coding When σ ≧ μMSE, it is determined that Wyner-Ziv encoding is performed ,
In the Slepian-Wolf decoding step,
Decodes using auxiliary information that gives the smallest MSE among MSEs,

本発明（請求項９）は、請求項１または３記載の符号化器を構成する各手段としてコンピュータを機能させるための画像符号化プログラムである。 The present invention (Claim 9 ) is an image encoding program for causing a computer to function as each means constituting the encoder according to Claim 1 or 3 .

本発明（請求項１０）は、請求項２または４記載の復号器を構成する手段としてコンピュータを機能させるための画像復号化プログラムである。 The present invention (Claim 10) is an image decoding program for causing a computer to function as means constituting the decoder according to Claim 2 or 4 .

上述のように、本発明によれば、複数の相関がある情報源（多視点画像等）に対して互いの情報源を分散符号化し、受信側で一括復号するDistributed Video Coding(DVC)において、映像フレームを構成するKeyフレームとWyner-Zivフレームのうち、Wyner-Zivフレームを符号化する際に、その中高周波数帯のサブバンドを矩形ブロックに分割し、その各々について２つの指標Mean Absolute Difference (MAD)、Sun of Absolute Difference(SAD)によってエントロピー符号化を行うか、Slepian-Wolf符号化を行うかを判定し、送信側に送出することにより、Wyner-Zivフレームの予測値の精度を向上させると共に、Distributed Video Coding(DVC)の高符号化効率を実現することが可能となる。 As described above, according to the present invention, in Distributed Video Coding (DVC) in which each information source is distributedly encoded with respect to a plurality of correlated information sources (such as multi-viewpoint images) and collectively decoded on the receiving side, Of the Key frame and Wyner-Ziv frame that composes the video frame, when encoding the Wyner-Ziv frame, the sub-band of the middle and high frequency band is divided into rectangular blocks, and each of them has two indices Mean Absolute Difference ( MAD), Sun of Absolute Difference (SAD) is used to determine whether entropy encoding or Slepian-Wolf encoding is to be performed, and the accuracy of the prediction value of the Wyner-Ziv frame is improved by sending it to the transmission side At the same time, it is possible to realize high encoding efficiency of Distributed Video Coding (DVC).

以下、図面と共に本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

最初に、画素領域DVCと従来のウェーブレット領域DVCの動作原理について説明する。 First, the operation principle of the pixel area DVC and the conventional wavelet area DVC will be described.

［画素領域DVC］
一般的に、DVCにおいて、映像フレームはKeyフレーム及びWyner-Zivフレームに分割される。ここでは、簡単化のために、Keyフレームには奇数フレームX_2n±1を割り当て、Wyner-Zivフレームには偶数フレームＸ_2nを割り当てる。 [Pixel area DVC]
In general, in DVC, a video frame is divided into a Key frame and a Wyner-Ziv frame. Here, for simplicity, assign the odd frame X _{2n ± 1} for Key frames, it allocates the even frame X _2n are the Wyner-Ziv frames.

以下、前述の図９の画素領域DVCの構成を用いて説明する。同図は、前述の非特許文献５において提案された画素領域DVCの例である。Keyフレームは、JPEGなど従来のフレーム内符号器１１により符号化される。Wyner-ZivフレームX_2ｎは、量子化器１２で一様量子化され、量子化された信号はビットプレーンに分割された後、Splepian-Wolf符号器１３に送られる。各ビットプレーンは最上位ビット(MSB: Most Significant Bit)からSlepian-Wolf符号器１３に送られ、それぞれのビットプレーンに対して十分なシンドロームビットが生成される。そして、それぞれのビットプレーンが順々にバッファ１３２に蓄えられ、デコーダ側からのリクエストにより上位ビットプレーンから順々にシンドロームビットを送信する。 Hereinafter, description will be made using the configuration of the pixel region DVC of FIG. 9 described above. This figure is an example of the pixel region DVC proposed in Non-Patent Document 5 described above. The Key frame is encoded by a conventional intraframe encoder 11 such as JPEG. The Wyner-Ziv frame X _2n is uniformly quantized by the quantizer 12, and the quantized signal is divided into bit planes and then sent to the Splepian-Wolf encoder 13. Each bit plane is sent from the most significant bit (MSB) to the Slepian-Wolf encoder 13, and sufficient syndrome bits are generated for each bit plane. Each bit plane is stored in the buffer 132 in order, and syndrome bits are transmitted in order from the upper bit plane in response to a request from the decoder side.

一方、デコーダ側では、KeyフレームＸ_2n±1から動き補償を用いてブロック単位の補助情報（Side Information）Ｙ_2nが次式（１）で作成される。 On the other hand, on the decoder side, block unit auxiliary information (Side Information) Y _2n is created from the Key frame X _{2n ± 1} using motion compensation by the following equation (1).

但し、式中MVはKeyフレームから作成された動きベクトルである。

In the expression, MV is a motion vector created from the Key frame.

次に、ビット尤度を推定し、Slepian-Wolf復号器１３３にそのビット尤度を与える。ビット尤度は、一般にWyner-ZivフレームＸ_2nと補助情報Ｙ_2nのウェーブレット領域間の差分信号、すなわち、予測誤差信号はラプラス分布で近似されることから次式を用いて推定される。 Next, the bit likelihood is estimated, and the bit likelihood is given to the Slepian-Wolf decoder 133. The bit likelihood is generally estimated using the following equation since the difference signal between the wavelet regions of the Wyner-Ziv frame X _2n and the auxiliary information Y _2n , that is, the prediction error signal is approximated by a Laplace distribution.

但し、αはラプラス分布のパラメータである。Slepian-Wolf復号器１３３では、ビット尤度とエンコーダ１３１から受信するシンドロームビットを用いて量子化されたビットプレーンq'_2nを復号する。

Where α is a parameter of the Laplace distribution. The Slepian-Wolf decoder 133 decodes the bit plane q ′ _2n quantized using the bit likelihood and the syndrome bits received from the encoder 131.

最後に、復号されたビットプレーンは再構成器２４に送られ、補助情報Ｙ_2nを用いて次式で再構成される。 Finally, the decoded bit plane is sent to the reconstructor 24 and reconstructed by the following equation using the auxiliary information Y _2n .

この画素領域DVCは、以後、数多く発明されるDVCの原型となった。

This pixel region DVC became the prototype of many DVCs that have been invented thereafter.

［ウェーブレット領域DVC］
次にウェーブレット領域DVCについて説明する。 [Wavelet domain DVC]
Next, the wavelet area DVC will be described.

符号化効率を改善するために、Girodらは前述の非特許文献５に示される技術の画素領域DVCを拡張した変換領域DVCを提案している（例えば、非特許文献６）。この変換領域DVCでは、DCTを用いることにより信号の空間的冗長性を除去し、画素領域DVCと比較して符号化効率を改善している。また、DCTの代わりにウェーブレット変換を用いたウェーブレット領域DVCも提案されている（例えば、非特許文献７）。ここでは、ウェーブレット領域DVCについて説明する。図３にウェーブレット領域DVCの原理構成を示す。画素領域DVCとの違いは、画素領域で行っていた通信路符号器によるエラー訂正を、ウェーブレット領域で行っている点である。 In order to improve the encoding efficiency, Girod et al. Have proposed a transform area DVC that is an extension of the pixel area DVC of the technique shown in Non-Patent Document 5 (for example, Non-Patent Document 6). In this transform area DVC, the spatial redundancy of the signal is removed by using DCT, and the coding efficiency is improved as compared with the pixel area DVC. A wavelet domain DVC using wavelet transform instead of DCT has also been proposed (for example, Non-Patent Document 7). Here, the wavelet area DVC will be described. FIG. 3 shows the principle configuration of the wavelet domain DVC. The difference from the pixel area DVC is that error correction by the channel encoder performed in the pixel area is performed in the wavelet area.

ウェーブレット領域DVCでは、Wyner-ZivフレームX_2nは、離散ウェーブレット変換DWT３１によりウェーブレット領域に変換された後、量子化器３２で各サブバンドの信号は２^M≦｛0,4,8,16,32,64,128,256,…｝のレベルに一様量子化される。量子化された信号は、サブビットプレーン分割部３３でビットプレーンに分割された後、各サブバンド毎にSlepian-Wolf符号器３４に送られる。 In the wavelet domain DVC, the Wyner-Ziv frame X _2n is converted into the wavelet domain by the discrete wavelet transform DWT 31, and then the signal of each subband is 2 ^M ≦ {0,4,8,16,32 by the quantizer 32. , 64, 128, 256, ...} are uniformly quantized. The quantized signal is divided into bit planes by the sub bit plane dividing unit 33 and then sent to the Slepian-Wolf encoder 34 for each sub band.

一方、デコーダ側では、KeyフレームＸ_2n±1から動き補償を用いてブロック単位に予測値Ｙ_2nが作成される。ここで動き補償処理は、画素領域DVCと同じ手法を用いる。 On the other hand, on the decoder side, a predicted value Y _2n is created for each block from the Key frame X _{2n ± 1} using motion compensation. Here, the motion compensation process uses the same technique as that of the pixel region DVC.

予測値Ｙ_2nは、離散ウェーブレット変換DWT４６によりウェーブレット領域に変換された後、補助情報として用いられる。次に、ビット尤度を推定しSlepian-Wolf復号器４１にビット尤度を与える。ビット尤度は、一般にWyner-ZivフレームX_2nと予測値Ｙ_2nのウェーブレット領域間の差分信号、すなわち、ラプラス分布で近似され、画素領域DVCと同様な方法で推定される。復号されたビットプレーンは再構成器４２に送られ、補助情報を用いて再構成される。再構成された信号は、IDWT４３において、最後に逆離散ウェーブレット変換IDWTが施され、出力される。 The predicted value Y _2n is used as auxiliary information after being converted into a wavelet region by the discrete wavelet transform DWT 46. Next, the bit likelihood is estimated and the bit likelihood is given to the Slepian-Wolf decoder 41. The bit likelihood is generally approximated by a difference signal between the wavelet regions of the Wyner-Ziv frame X _2n and the predicted value Y _2n , that is, a Laplace distribution, and is estimated by a method similar to that for the pixel region DVC. The decoded bit plane is sent to the reconstructor 42 and reconstructed using the auxiliary information. The reconstructed signal is finally subjected to inverse discrete wavelet transform IDWT in the IDWT 43 and output.

以下に、本発明のウェーブレット領域DVCについて説明する。 The wavelet region DVC of the present invention will be described below.

画素領域DVCからウェーブレット領域DVCかに関わらず、DVCの符号化効率は、復号器で生成される補足情報の予測制度に大きく依存する。DVCでは、Keyフレームより作成される補足情報をWyner-Zivフレームの予測信号とし、補助情報とWyner-Zivフレームの予測誤差信号をある種のエラーと捉え、通信路符号器でエラーを訂正することで圧縮／復号を行う。補足情報が外れた場合には、パリティビットの再送等を行う必要があり、符号化効率も低下する。従って、予測精度の高い補助情報を与えることが必要となる。この補助情報の生成において、上記の画素領域DVCやウェーブレット領域DVC並びにDCT領域のDVC（例えば、非特許文献６）など従来の変換領域DVCでは、補助情報をKeyフレームのみの情報によって生成している。それは、復号器では、Wyner-Zivフレームの情報がないことに起因している。 Regardless of whether the pixel area DVC or the wavelet area DVC, the encoding efficiency of the DVC depends largely on the prediction system of the supplementary information generated by the decoder. In DVC, supplementary information created from a Key frame is used as a Wyner-Ziv frame prediction signal, and the auxiliary information and the Wyner-Ziv frame prediction error signal are regarded as a certain type of error, and the channel encoder corrects the error. Compress / decode with. If the supplementary information is lost, it is necessary to retransmit the parity bit and the like, and the coding efficiency also decreases. Therefore, it is necessary to provide auxiliary information with high prediction accuracy. In the generation of the auxiliary information, in the conventional conversion area DVC such as the pixel area DVC, the wavelet area DVC, and the DCT area DVC (for example, Non-Patent Document 6), the auxiliary information is generated only by information of the Key frame. . This is due to the fact that there is no Wyner-Ziv frame information in the decoder.

本発明では、より推定精度の高い補助情報を与えるために、ウェーブレット領域の信号を低周波帯域のサブバンドから高周波帯域に向けて復号する過程で、低周波数帯域の復号されたサブバンドを利用し、より推定精度の高い補助情報を与える手法を示す。また、補助情報の推定制度が悪いと判断された場合には、ハフマン符号や算術符号化などのエントロピー符号化を用いる。 In the present invention, in order to provide auxiliary information with higher estimation accuracy, the decoded subband of the low frequency band is used in the process of decoding the signal in the wavelet domain from the subband of the low frequency band toward the high frequency band. A method for providing auxiliary information with higher estimation accuracy will be described. If it is determined that the auxiliary information estimation system is poor, entropy coding such as Huffman coding or arithmetic coding is used.

図４は、本発明の一実施の形態におけるウェーブレット領域DVCの構成を示す。 FIG. 4 shows the configuration of the wavelet region DVC in one embodiment of the present invention.

同図に示す符号器１００は、ＤＷＴ１１０、量子化器１２０、デマルチプレクサ１３０、Slepian-Wolf符号器１４０、エントロピー符号器１５０から構成される。また、復号器２００は、補助情報生成器２１０、ＤＷＴ２２０、Slepian-Wolf復号器２４０、マルチプレクサ２５０、再構成器２６０、モード判定器２７０、ＩＤＷＴ２８０、エントロピー復号器２９０から構成される。 The encoder 100 shown in the figure includes a DWT 110, a quantizer 120, a demultiplexer 130, a Slepian-Wolf encoder 140, and an entropy encoder 150. The decoder 200 includes an auxiliary information generator 210, a DWT 220, a Slepian-Wolf decoder 240, a multiplexer 250, a reconstructor 260, a mode determiner 270, an IDWT 280, and an entropy decoder 290.

同図に示すDVCと従来のDVCのウェーブレット領域DVCとの違いは、Wyner-Zivフレームの信号を符号化する際に、エントロピー符号化か、または、Slepian-Wolf符号化かを判断する機能が付加されている点と、Slepian-Wolfと判定された場合でも、複数の補助情報の中で、予測精度が高いと判定される補助情報を選択する機能を有するモード判定器２７０が設けられている点である。以下に具体的な動作手順を示す。 The difference between the DVC shown in the figure and the conventional DVC wavelet domain DVC is the addition of a function that determines whether entropy encoding or Slepian-Wolf encoding is used when encoding a Wyner-Ziv frame signal. And a mode determination unit 270 having a function of selecting auxiliary information determined to have high prediction accuracy from among a plurality of auxiliary information even when it is determined as Slepian-Wolf. It is. The specific operation procedure is shown below.

図６は、本発明の一実施の形態における符号化・復号化のシーケンスチャートである。 FIG. 6 is a sequence chart of encoding / decoding according to an embodiment of the present invention.

ステップ１０１）符号器２００側において、入力されたKeyフレームX_2ｎ±1をＪＰＥＧ2000など従来のフレーム内符号器を用いて符号化する。 Step 101) On the encoder 200 side, the input Key frame X _{2n ± 1} is encoded using a conventional intraframe encoder such as JPEG2000.

ステップ１０２）符号器２００側において、入力されたWyner-ZivフレームＸ_2nを離散ウェーブレット変換DWT１１０によりウェーブレット領域に変換する。図５は、２レベルのウェーブレット変換の例を示している。その後、それぞれウェーブレット領域の信号は、量子化器１２０で２^M≦（０，４，８，１６，３２，６４，１２８，２５６，…）のレベルに一様に量子化される。 Step 102) On the encoder 200 side, the input Wyner-Ziv frame X _2n is converted into a wavelet domain by the discrete wavelet transform DWT110. FIG. 5 shows an example of a two-level wavelet transform. Thereafter, each wavelet domain signal is uniformly quantized by the quantizer 120 to a level of 2 ^M ≦ (0, 4, 8, 16, 32, 64, 128, 256,...).

ステップ１０３）復号器２００側において、符号器１００から入力された符号化されたKeyフレームＸ_2ｎ±1を、フレーム内復号器を用いて復号する。 Step 103) On the decoder 200 side, the encoded Key frame X _{2n ± 1} input from the encoder 100 is decoded using an intra-frame decoder.

ステップ１０４）復号器２００側において、補助情報生成器２１０は、Wyner-Zivフレームの補助情報を作成する。そのために、前述の非特許文献６に記載されている技術などに示す動き補償補間、平均補間、あるいは外挿補間を用いて、ブロック単位にWyner-Zivフレーム信号Ｘ_2nの推定値 Step 104) On the decoder 200 side, the auxiliary information generator 210 creates auxiliary information of the Wyner-Ziv frame. Therefore, the estimated value of the Wyner-Ziv frame signal X _{2n in} units of blocks using motion compensation interpolation, average interpolation, or extrapolation shown in the technique described in Non-Patent Document 6 above.

を作成する。次式は式（１）に示す動きベクトルMVを用いて作成した推定値の例である。

Create The following expression is an example of an estimated value created using the motion vector MV shown in Expression (1).

それぞれＸ_2n-1，Ｘ_2n+1並びに両信号を用いて作成しており、前向き予測、後ろ向き予測、両方向予測と呼ぶ。

They are created using X _2n-1 , X _{2n + 1 and} both signals, respectively, and are called forward prediction, backward prediction and bidirectional prediction.

次に、DWT２２０において、 Next, in DWT220,

に、それぞれＬレベルの離散時間ウェーブレット変換

Respectively, L-level discrete-time wavelet transform

を施す。それぞれの変換後の第ｓステージにおけるサブバンド

Apply. Subband in the sth stage after each conversion

の値を、

The value of

と定義する。これらは、補助情報として用いられる。

It is defined as These are used as auxiliary information.

ステップ１０５）符号器１００において、Wyner-Zivフレームに関して、上記のステップ１０２において一様に量子化器１２０で量子化された最低周波数帯域のサブバンドLL_Lは、デマルチプレクサ１３０でビットプレーンに分割された後、Slepian-Wolf符号器１４０に送られる。 Step 105) In the encoder 100, for the Wyner-Ziv frame, the subband LL _L of the lowest frequency band that is uniformly quantized by the quantizer 120 in Step 102 is divided into bit planes by the demultiplexer 130. Then, it is sent to the Slepian-Wolf encoder 140.

ステップ１０６）復号器２００において、Wyner-zivフレームの最低周波数帯域のサブバンドLL_Lに関して、 Step 106) In the decoder 200, for the subband LL _L of the lowest frequency band of the Wyner-ziv frame,

を補助情報として、再構成器２６０でLL_Lを再構成する。このLL_Lが再構成されるまでのSlepian-Wolf符号器１４０／Slepian-Wolf復号器２４０の処理は、従来のウェーブレット領域DVCと同じである。ここで、再構成された信号をLL'_Lと定義する。s＝Lとする。

As the auxiliary information, to reconstruct the LL _L at reconstructor 260. Processing Slepian-Wolf encoder 140 / Slepian-Wolf decoder 240 until the LL _L is reconstructed is the same as the conventional wavelet domain DVC. Here, the reconstructed signal is defined as LL ′ _L. Let s = L.

ステップ１０７）復号器２００において、Wyner-Zivフレームの中高周波数帯域のサブバンドLH_ｓ，HL_ｓ，HH_sは、それぞれ矩形の小ブロックに分割され、小ブロック毎にエントロピー符号化、または、Wyner-Ziv符号化を行う。モード判定器２７０における、エントロピー符号化を行うか、Wyner-Ziv符号化を行うかのモード切替の判定は、低周波数帯域の復号済みサブバンドLL_s'と、補助情報生成器２１０及びDWT２２０で生成された補助情報 Step 107) In the decoder 200, the subbands LH _s , HL _s , and HH _{s in} the middle and high frequency bands of the Wyner-Ziv frame are divided into rectangular small blocks, and entropy coding or Wyner− is performed for each small block. Perform Ziv encoding. The mode switching unit 270 determines whether to perform entropy coding or Wyner-Ziv coding mode switching by the low-frequency band decoded subband LL _{s ′} , the auxiliary information generator 210, and the DWT 220. Auxiliary information

を用いて、以下に示す２つの指標により行う。

The following two indices are used.

モード判定器２７０における、モード切替の判定は、各ブロック毎に行われるが、これはウェーブレット変換の帯域相関を利用しており、サブバンドLL_ｓのある小ブロックが、隣接するKeyフレームのサブバンドLL_sの小ブロックと相関が高ければ（低ければ）、Wyner-ZivフレームのサブバンドLH_s、HL_s、HH_sの空間的に対応する小ブロックも、隣接するKeyフレームのサブバンドLH_s、HL_s、HH_sの空間的に対応する小ブロックとで相関が高い（低い）性質を利用している。図７に小ブロックの空間的対応関係を示す。 Mode switching in the mode determiner 270 is performed for each block, which uses band correlation of wavelet transform, and a small block with a subband LL _s is a subband of an adjacent Key frame. a higher correlation with small blocks LL _s (a low), Wyner-Ziv frames subbands LH _{_{_s,}} HL _s, HH _s also spatially corresponding small block of subband LH _s adjacent Key frame, It uses the property of high (low) correlation between small blocks corresponding to HL _s and HH _s in space. FIG. 7 shows the spatial correspondence of small blocks.

（ａ）MAD (Mean Absolute Difference)
復号済みサブバンドLL'_sのブロックＢ内の信号の空間的滑らかさを表す指標であり、次式により計算する。 (A) MAD (Mean Absolute Difference)
It is an index representing the spatial smoothness of the signal in the block B of the decoded subband LL _'s, is calculated by the following equation.

ここで、LL'_s（ｘ，ｙ）は、復号済みサブバンドで、（ｘ，ｙ）は小ブロック内の座標を表している。但し、

Here, LL ′ _s (x, y) is a decoded subband, and (x, y) represents the coordinates in the small block. However,

であり、Ｎは小ブロック内の信号の数を表す。

N represents the number of signals in the small block.

（ｂ）SAD (Sum of Absolute differences)
２つ目の指標は、復号済みサブバンドLL'sと補助情報との相関を表す指標であり、次式により計算する。 (B) SAD (Sum of Absolute differences)
The second index is an index representing the correlation between the decoded subband LL's and the auxiliary information, and is calculated by the following equation.

式（１１）は、前向き、後向き、両方向予測を基に生成された補助情報の中で、最も良い推定値を与える補助情報を判定する指標となる。

Expression (11) serves as an index for determining auxiliary information that gives the best estimated value among auxiliary information generated based on forward, backward, and bidirectional prediction.

モード判定器２７０において、以上の式（１０）と式（１１）の２つの指標を用いて、エントロピー符号器１５０でエントロピー符号化を行う、または、Slepian-Wolf符号器１４０でSlepian-Wolf符号化を行うかの判定は、以下のように行われる。 In the mode determiner 270, the entropy encoder 150 performs the entropy encoding using the two indexes of the above formulas (10) and (11), or the Slepian-Wolf encoder 140 performs the Slepian-Wolf encoding. The determination of whether or not to perform is performed as follows.

（ａ）MAD<μSAD（但し、μは定数）の場合は、エントロピー符号化を行う。 (A) If MAD <μSAD (where μ is a constant), entropy coding is performed.

（ｂ）mad≧Μsadの場合には、Slepian-Wolf符号化を行う。Slepian-Wolf符号化を行う際には、補助情報として、式（１１）を最小化する補助情報を用いる。例えば、SAD=SAD_fの場合には、LH_s,，HL_s，HH_sの補助情報として、それぞれ (B) When mad ≧ Μsad, Slepian-Wolf encoding is performed. When Slepian-Wolf encoding is performed, auxiliary information that minimizes Equation (11) is used as auxiliary information. For example, when SAD = SAD _f , as auxiliary information of LH _s , HL _s , HH _s , respectively

を用いる。

Is used.

得られたモード切替の判定結果を、フィードバックチャネルを通して、符号器１００に送る。 The obtained mode switching determination result is sent to the encoder 100 through the feedback channel.

ステップ１０８）符号器１００では、Wyner-Zivフレームの第ｓステージにおける高周波数帯域のサブバンドLH_s，HL_s，HH_sに関して、復号器２００より送られてきたモード切替の判定結果を元に、各ブロッ毎にエントロピー符号器１５０によるエントロピー符号化またはSlepian-Wolf符号器１４０によるSlepian-Wolf符号化を行う。エントロピー符号化として、ハフマン符号化や算術符号化を用いる。Slepian-Wolf符号化として、LDPCやターボ符号化を用いる。 Step 108) In the encoder 100, for the high frequency band subbands LH _s , HL _s and HH _s in the s-th stage of the Wyner-Ziv frame, based on the mode switching determination result sent from the decoder 200, Entropy encoding by the entropy encoder 150 or Slepian-Wolf encoding by the Slepian-Wolf encoder 140 is performed for each block. As entropy coding, Huffman coding or arithmetic coding is used. LDPC or turbo coding is used as Slepian-Wolf coding.

ステップ１０９）復号器２００において、Wyner-Zivフレームの第ｓステージにおける高周波帯域のサブバンドLH_s，HL_s，HH_sに関して、フレーム内符号化されたブロックは、エントロピー復号器２９０でエントロピー復号を行う。Wyner-Ziv符号化されたブロックは、Slepian-Wolf復号器２４０でSlepian-Wolf復号処理が実行される。このSlepian-Wolf復号処理は、従来のウェーブレット領域DVCと同じ方法を用いる。第ｓステージの全ての小ブロックに対して、以上の復号処理を行う。 Step 109) In the decoder 200, for the subbands LH _s , HL _s , and HH _s in the high frequency band in the s-th stage of the Wyner-Ziv frame, the intra-coded block is subjected to entropy decoding in the entropy decoder 290. . The Slypian-Wolf decoder 240 executes a Slepian-Wolf decoding process on the Wyner-Ziv encoded block. This Slepian-Wolf decoding process uses the same method as the conventional wavelet domain DVC. The above decoding process is performed on all the small blocks in the sth stage.

ステップ１１０） IDWT２８０で１レベルの逆離散ウェーブレット変換を施す。ｓ＝ｓ−１とする。 Step 110) The IDWT 280 performs one level inverse discrete wavelet transform. Let s = s−1.

ステップ１１１）上記のステップ１１０の処理を原画像が復号されるレベルまでステップ１０７以降の処理を繰り返す。 Step 111) Repeat the processing in step 110 and subsequent steps until the original image is decoded.

なお、上記の図４に示す符号器及び復号器の各構成要素の動作をプログラムとして構築し、符号化器及び復号器として利用されるコンピュータにインストールして実行させることが可能である。 Note that the operation of each component of the encoder and decoder shown in FIG. 4 can be constructed as a program, and can be installed and executed in a computer used as the encoder and decoder.

また、構築されたプログラムをハードディスクや、フレキシブルディスク・ＣＤ−ＲＯＭ等の可搬記憶媒体に格納し、コンピュータにインストールする、または、配布することが可能である。 Further, the constructed program can be stored in a portable storage medium such as a hard disk, a flexible disk, or a CD-ROM, and can be installed or distributed in a computer.

なお、本発明は、上記の実施の形態に限定されることなく、特許請求の範囲内において種々変更・応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications can be made within the scope of the claims.

本発明は、ウェーブレット領域を用いて動画像フレームを符号化／復号化するシステムに適用可能である。 The present invention can be applied to a system for encoding / decoding a moving image frame using a wavelet region.

本発明の原理構成図である。It is a principle block diagram of this invention. 本発明の原理を説明するための図である。It is a figure for demonstrating the principle of this invention. ウェーブレット領域ＤＶＣの構成図である。It is a block diagram of the wavelet area | region DVC. 本発明の一実施の形態におけるウェーブレット領域ＤＶＣの構成図である。It is a block diagram of the wavelet area | region DVC in one embodiment of this invention. 本発明の一実施の形態における符号化・復号化のシーケンスチャートである。It is a sequence chart of encoding and decoding in one embodiment of the present invention. 本発明の一実施の形態における２レベルのウェーブレット変換を示す図である。It is a figure which shows 2 level wavelet transformation in one embodiment of this invention. 本発明の一実施の形態における第ｓステージでのLL_sサブバンドに基づくモード判定である。It is mode determination based on LL _s subband in the s-th stage in one embodiment of the present invention. Distributed Source Coding の構成図である。It is a block diagram of Distributed Source Coding. 画素領域ＤＶＣの構成図である。It is a block diagram of pixel region DVC.

Explanation of symbols

１１コンベンショナルイントラフレーム符号器
１２量子化器
１３ Slepian-Wolf符号器
２１コンベンショナルイントラフレーム復号器
２２記述または補間部
２４再構成器
３１ＤＷＴ
３２量子化部
３３サブバンドビットプレーン分割部
３４ Slepian-Wolf符号器
３５符号化手段
４１ Slepian-Wolf復号器
４２再構成器
４３ＩＤＷＴ
４４復号手段
４５補助情報生成器
４６ＤＷＴ
１００符号器
１１０ＤＷＴ
１２０量子化器
１３０デマルチプレクサ
１３１ターボ符号器
１３２バッファ
１３３ターボ復号器（Slepian-Wolf復号器）
１４０ Slepian-Wolf符号化手段、Slepian-Wolf符号器
１５０エントロピー符号化手段、エントロピー符号器
２００復号器
２１０補助情報生成手段、補助情報生成器
２２０ＤＷＴ
２４０ Slepian-Wolf復号手段、Slepian-Wolf復号器
２５０マルチプレクサ
２６０再構成手段、再構成器
２７０モード判定手段、モード判定器
２８０ IＤＷＴ
２９０エントロピー復号器 11 Conventional Intra Frame Encoder 12 Quantizer 13 Slepian-Wolf Encoder 21 Conventional Intra Frame Decoder 22 Description or Interpolator 24 Reconstructor 31 DWT
32 Quantization unit 33 Subband bit plane division unit 34 Slepian-Wolf encoder 35 Encoding means 41 Slepian-Wolf decoder 42 Reconstructor 43 IDWT
44 Decoding means 45 Auxiliary information generator 46 DWT
100 Encoder 110 DWT
120 Quantizer 130 Demultiplexer 131 Turbo Encoder 132 Buffer 133 Turbo Decoder (Slepian-Wolf Decoder)
140 Slepian-Wolf encoding means, Slepian-Wolf encoder 150 Entropy encoding means, entropy encoder 200 Decoder 210 Auxiliary information generating means, Auxiliary information generator 220 DWT
240 Slepian-Wolf decoding means, Slepian-Wolf decoder 250 Multiplexer 260 Reconstruction means, reconstructor 270 Mode determination means, Mode determiner 280 IDWT
290 Entropy Decoder

Claims

An image encoder for encoding a moving image frame using a wavelet domain Distributed Video Coding (DVC),
Encoding means for encoding the input Key frame;
DWT (Discrete Wavelet Transform) that transforms the input Wyner-Ziv frame into the wavelet domain,
Quantizing means for quantizing the wavelet domain signal and dividing the subband of the lowest frequency band into bit planes;
Entropy encoding means for performing entropy encoding of the output from the quantization means based on the mode switching determination result from the decoder;
Based on the mode switching determination result from the decoder, Slepian-Wolf encoding means for performing Wyner-Ziv encoding on the bit plane acquired by the quantization means;
A demultiplexer that distributes the output from the quantization means to either the entropy encoding means or the Slepian-Wolf encoding means based on the mode determination result obtained from the decoder,
The Slepian-Wolf encoding means is
When the decoder has auxiliary information that is four subbands obtained by performing discrete wavelet transform on forward prediction, backward prediction, and bidirectional prediction of the Wyner-Ziv frame, As a result of calculating SAD (Sum of Absolute Differences), which is an index indicating the correlation between the decoded subband LL _{s of the} frequency band and each auxiliary information, auxiliary information that gives the smallest value of SAD among those SADs Means for obtaining and encoding from the decoder;
The demultiplexer
Obtained from the decoder, the SAD and MAD (Mean Absolute Difference) of the decoded subband LL _s is the case of MAD <μSAD (where, mu is a constant), instructs the encoding to the entropy coding means In the case of MAD ≧ μSAD, the image encoder includes means for instructing encoding to the Slepian-Wolf encoding means for performing Wyner-Ziv encoding.

An image decoder for decoding a moving image frame using a wavelet domain DVC,
Decryption means for decrypting the input Key frame;
For the Wyner-Ziv frame, auxiliary information generating means for generating auxiliary information that is the subband LL _L of the lowest frequency of the s-stage by performing s times DWT conversion on the estimated value by the forward prediction and the backward prediction in both directions ,
Reconstructing means for reconstructing the subband of the lowest frequency band of the Wyner-Ziv frame using the auxiliary information;
The subbands LH _s , HL _s , and HH _s in the middle and high frequency bands in the wavelet stage s of the Wyner-Ziv frame are divided into small rectangular blocks, and entropy coding or Wyner-Ziv coding is performed for each small block. Mode determination means for determining whether to perform the mode determination, and sending the mode determination result to the encoder;
The Wyner-Ziv subbands high and high frequency bands in the wavelet first s stage frame LH _s, and HL _s, entropy decoding unit entropy decodes the coded blocks in the frame of the sub-band HH _s in the high frequency band ,
And Slepian-Wolf decoding means for Slepian-Wolf decoding the Wyner-Ziv encoded block,
The Slepian-Wolf decoding means is:
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction and bi-directional prediction of the Wyner-Ziv frame, the low frequency band divided into small rectangular blocks has been decoded. As a result of calculating SAD, which is an index indicating the correlation between subband LL _s and each auxiliary information, including means for decoding using auxiliary information from which SAD having the smallest value among those SADs is obtained,
The mode determination means includes
When the MAD and the SAD of the decoded subband LL _s are MAD <μSAD (where μ is a constant), entropy coding is performed, and when MAD ≧ μSAD, Wyner-Ziv coding is performed. picture decoder characterized in that it comprises means for determining the.

An image encoder for encoding a moving image frame using a wavelet domain DVC,
Encoding means for encoding the input Key frame;
DWT (Discrete Wavelet Transform) that transforms the input Wyner-Ziv frame into the wavelet domain,
Quantizing means for quantizing the wavelet domain signal and dividing the subband of the lowest frequency band into bit planes;
Entropy encoding means for performing entropy encoding of the output from the quantization means based on the mode switching determination result from the decoder;
Based on the mode switching determination result from the decoder, Slepian-Wolf encoding means for performing Wyner-Ziv encoding on the bit plane acquired by the quantization means;
A demultiplexer that distributes the output from the quantization means to either the entropy encoding means or the Slepian-Wolf encoding means based on the mode determination result obtained from the decoder,
The Slepian-Wolf encoding means is
When the decoder has auxiliary information that is four subbands obtained by performing discrete wavelet transform on forward prediction, backward prediction, and bidirectional prediction of the Wyner-Ziv frame, As a result of calculating the decoded subband LL _{s of the} frequency band and the MSE of each auxiliary information, auxiliary information that obtains the smallest MSE (Mean Square Error) among those MSEs is obtained from the decoder and encoded. Including means for performing
The demultiplexer
If the variance σ of the decoded subband LL _{s in} the low frequency band obtained from the decoder and the MSE are σ <μMSE (where μ is a constant), the entropy encoding means is instructed to perform encoding. , Σ ≧ μMSE, an image encoder comprising means for instructing encoding to the Slepian-Wolf encoding means for performing Wyner-Ziv encoding.

An image decoder for decoding a moving image frame using a wavelet domain DVC,
Decryption means for decrypting the input Key frame;
For the Wyner-Ziv frame, auxiliary information generating means for generating auxiliary information that is the subband LL _L of the lowest frequency of the s-stage by performing s times DWT conversion on the estimated value by the forward prediction and the backward prediction in both directions ,
Reconstructing means for reconstructing the subband of the lowest frequency band of the Wyner-Ziv frame using the auxiliary information;
The middle and high frequency band subbands LH _s and HL _s and the high frequency band subband HH _s in the wavelet stage s of the Wyner-Ziv frame are each divided into rectangular small blocks, and entropy coding or , Mode determination means for determining whether to perform Wyner-Ziv encoding and sending a mode determination result to the encoder;
The Wyner-Ziv subbands high and high frequency bands in the wavelet first s stage frame LH _s, and HL _s, entropy decoding unit entropy decodes the coded blocks in the frame of the sub-band HH _s in the high frequency band ,
And Slepian-Wolf decoding means for Slepian-Wolf decoding the Wyner-Ziv encoded block,
The Slepian-Wolf decoding means is:
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction and bi-directional prediction of the Wyner-Ziv frame, the low frequency band divided into small rectangular blocks has been decoded. As a result of calculating the MSE of the subband LL _s and each auxiliary information, including means for decoding using the auxiliary information that can obtain the smallest MSE among those MSEs,
The mode determination means includes
If the variance σ of the decoded subband LL _{s in} the low frequency band and the MSE are σ <μMSE (where μ is a constant), entropy coding is performed, and if σ ≧ μMSE, Wyner-Ziv picture decoder characterized in that it comprises means for determining as to perform encoding.

An image encoding method for encoding a moving image frame using a wavelet domain DVC,
An encoding step in which the encoding means encodes the input Key frame;
A wavelet transform step in which DWT transforms the input Wyner-Ziv frame into a wavelet domain;
A quantization means for quantizing the signal in the wavelet domain and dividing a subband of the lowest frequency band into bit planes;
A demultiplexer that distributes the output from the quantization means to either the entropy encoding means or the Slepian-Wolf encoding means based on the mode determination result from the decoder;
An entropy encoding means for performing entropy encoding of the output from the quantization means;
Slepian-Wolf encoding means performs a Slepian-Wolf encoding step of performing Wyner-Ziv encoding on the bit plane acquired by the quantization means based on the mode switching determination result from the decoder. ,
In the sorting step,
When the decoder has auxiliary information that is four subbands obtained by performing discrete wavelet transform on forward prediction, backward prediction, and bidirectional prediction of the Wyner-Ziv frame, get the SAD is an index representing a correlation of the decoded subband LL _s and the respective auxiliary information frequency band, MAD and the SAD of the decoded sub-band LL is, MAD <μSAD (but, mu is a constant) In this case, the entropy encoding means is instructed to encode, and in the case of MAD ≧ μSAD, the Slepian-Wolf encoding means for performing Wyner-Ziv encoding is instructed to encode ,
In the Slepian-Wolf encoding step,
An image encoding method characterized in that auxiliary information for obtaining a SAD having the smallest value among the SADs is acquired from the decoder and encoded.

An image decoding method for decoding a moving image frame using a wavelet domain DVC,
A decoding step in which the decoding means decodes the input Key frame;
Auxiliary information generation means generates auxiliary information that is the subband LL _L of the lowest frequency of the s-th stage by performing DWT transformation on the estimated value of the forward prediction and the backward prediction for the Wyner-Ziv frame s times. An auxiliary information generation step;
Reconstructing means for reconstructing a subband of the lowest frequency band of the Wyner-Ziv frame using the auxiliary information;
The mode determination means divides the subbands LH _s and HL _s of the medium and high frequency band and the subband HH _s of the high frequency band in the wavelet s stage of the Wyner-Ziv frame into small rectangular blocks, and Determining whether to perform entropy encoding or Wyner-Ziv encoding, and sending a mode determination result to the encoder;
Entropy decoding means, the Wyner-Ziv subbands high and high frequency bands in the wavelet first s stage frame LH _s, HL _s, entropy decoding the intra-coded blocks of the subband HH _s in the high frequency band Entropy decoding step,
Slepian-Wolf decoding means performs a Slepian-Wolf decoding step of Slepian-Wolf decoding a Wyner-Ziv encoded block;
In the mode determination step,
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction and bi-directional prediction of the Wyner-Ziv frame, the low frequency band divided into small rectangular blocks has been decoded. get the subband LL _s and SAD is an index representing a correlation of the respective auxiliary information, MAD and the SAD of the decoded subband LL _s is, MAD <If μSAD (but, mu is a constant), entropy If MAD ≧ μSAD, it is determined that Wyner-Ziv encoding is performed .
In the Slepian-Wolf decoding step,
Decoding is performed using auxiliary information from which the SAD having the smallest value among the SADs is obtained.
An image decoding method characterized by the above.

An image encoding method for encoding a moving image frame using a wavelet domain DVC,
An encoding step in which the encoding means encodes the input Key frame;
A wavelet transform step in which DWT transforms the input Wyner-Ziv frame into a wavelet domain;
A quantization means for quantizing the signal in the wavelet domain and dividing a subband of the lowest frequency band into bit planes;
An entropy encoding unit performs entropy encoding of an output from the quantization unit based on a mode switching determination result from a decoder; and
Slepian-Wolf encoding means, based on the mode switching determination result from the decoder, Slepian-Wolf encoding step for performing Wyner-Ziv encoding on the bit plane acquired by the quantization means;
A demultiplexer performs a distribution step of distributing the output from the quantization means to either the entropy encoding means or the Slepian-Wolf encoding means based on the mode determination result acquired from the decoder,
In the sorting step,
When the decoder has auxiliary information, which is four subbands obtained by performing discrete wavelet transform on forward prediction, backward prediction, and bidirectional prediction of the Wyner-Ziv frame, a small rectangular block is output from the decoder. the MSE of the divided low frequency band decoded subband LL _s and each of the auxiliary information to obtain the result of calculation, the variance sigma and the MSE of the decoded subband LL _s is, σ <μMSE (where, mu Is a constant), the encoding is instructed to the entropy encoding means, and in the case of σ ≧ μMSE, the encoding is instructed to the Slepian-Wolf encoding means for performing Wyner-Ziv encoding,
In the Slepian-Wolf encoding step,
Auxiliary information for obtaining the smallest MSE (Mean Square Error) in the MSE is obtained from the decoder and encoded.
An image encoding method characterized by the above.

An image decoding method for decoding a moving image frame using a wavelet domain DVC,
A decoding step in which the decoding means decodes the input Key frame;
Auxiliary information generation means generates auxiliary information that is the subband LL _L of the lowest frequency of the s-th stage by performing DWT transformation on the estimated value of the forward prediction and the backward prediction for the Wyner-Ziv frame s times. An auxiliary information generation step;
Reconstructing means for reconstructing a subband of the lowest frequency band of the Wyner-Ziv frame using the auxiliary information;
The mode determination means divides the subbands LH _s and HL _s of the medium and high frequency band and the subband HH _s of the high frequency band in the wavelet s stage of the Wyner-Ziv frame into small rectangular blocks, and Determining whether to perform entropy encoding or Wyner-Ziv encoding, and sending a mode determination result to the encoder;
Entropy decoding means, the Wyner-Ziv subbands high and high frequency bands in the wavelet first s stage frame LH _s, HL _s, entropy decoding the intra-coded blocks of the subband HH _s in the high frequency band Entropy decoding step,
Slepian-Wolf decoding means performs a Slepian-Wolf decoding step of Slepian-Wolf decoding a Wyner-Ziv encoded block ;
In the mode determination step,
When there is auxiliary information that is four subbands obtained by discrete wavelet transform of forward prediction, backward prediction and bi-directional prediction of the Wyner-Ziv frame, the low frequency band divided into small rectangular blocks has been decoded. Gets the results of calculating the subband LL _s and MSE of each of the auxiliary information, the variance σ and the MSE of the decoded subband LL _s is the case σ of <μMSE (where, mu is a constant), entropy coding If σ ≧ μMSE, it is determined that Wyner-Ziv encoding is performed ,
In the Slepian-Wolf decoding step,
Decoding using auxiliary information that provides the smallest MSE among the MSEs,
An image decoding method characterized by the above.

An image encoding program for causing a computer to function as each means constituting the encoder according to claim 1 .

An image decoding program for causing a computer to function as means for constituting the decoder according to claim 2 .