JP6641149B2

JP6641149B2 - Video encoding device, video decoding device, and program

Info

Publication number: JP6641149B2
Application number: JP2015199563A
Authority: JP
Inventors: 俊枝三須; 井口　和久; 和久井口; 市ヶ谷　敦郎; 敦郎市ヶ谷; 康孝松尾; 境田　慎一; 慎一境田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2015-10-07
Filing date: 2015-10-07
Publication date: 2020-02-05
Anticipated expiration: 2035-10-07
Also published as: JP2017073668A

Description

本発明は、映像信号を圧縮符号化する映像符号化装置、映像信号を復号する映像復号装置、及びプログラムに関する。 The present invention relates to a video encoding device that compresses and encodes a video signal, a video decoding device that decodes a video signal, and a program.

従来、信号の符号化方式として、信号波形を予測し、予測残差を符号化する手法が知られている。特に、映像信号を符号化する場合においては、あるフレームを符号化する際に、映像をブロックに分割し、その符号化対象ブロック毎に既に符号化済みの別のフレームまたは当該フレーム内の画像から予測処理を行う。そして、予測処理の結果と符号化対象ブロックとの誤差に対して離散コサイン変換等の変換処理を行い、変換処理の結果である変換係数を量子化し、さらに量子化結果をエントロピー符号化する。このようにして、映像信号のデータが圧縮符号化される。 Conventionally, as a signal encoding method, a method of predicting a signal waveform and encoding a prediction residual has been known. In particular, when encoding a video signal, when encoding a certain frame, the video is divided into blocks, and for each of the encoding target blocks, another encoded frame or an image in the frame is used. Perform prediction processing. Then, transform processing such as discrete cosine transform is performed on the error between the result of the prediction processing and the block to be coded, the transform coefficients that are the result of the transform processing are quantized, and the quantization result is entropy-coded. Thus, the data of the video signal is compression-encoded.

〔従来の映像符号化装置〕
まず、従来の映像符号化装置について説明する。図６は、従来の映像符号化装置の構成例を示すブロック図である。この映像符号化装置１００は、ブロック分割手段１０、差分手段１１、直交変換手段１３、量子化手段１４、エントロピー符号化手段１５、逆量子化手段１６、逆直交変換手段１７、加算手段１９、ループフィルタ２０、メモリ２１、予測手段２２及び最適化手段２３を備えている。 [Conventional video encoding device]
First, a conventional video encoding device will be described. FIG. 6 is a block diagram illustrating a configuration example of a conventional video encoding device. The video encoding apparatus 100 includes a block dividing unit 10, a difference unit 11, an orthogonal transformation unit 13, a quantization unit 14, an entropy encoding unit 15, an inverse quantization unit 16, an inverse orthogonal transformation unit 17, an addition unit 19, a loop The apparatus includes a filter 20, a memory 21, a prediction unit 22, and an optimization unit 23.

映像符号化装置１００は、映像信号を入力し、映像信号に対し既存の符号化処理を行い符号化ビット列を生成し、符号化ビット列を後述する映像復号装置２００へ出力する。 The video encoding device 100 receives a video signal, performs an existing encoding process on the video signal, generates an encoded bit sequence, and outputs the encoded bit sequence to a video decoding device 200 described later.

ブロック分割手段１０は、映像信号を入力し、映像信号のフレーム毎に、当該フレームをブロック分割する。ブロック分割手段１０は、分割したブロックを差分手段１１、予測手段２２及び最適化手段２３に出力する。 The block dividing unit 10 receives a video signal and divides the frame into blocks for each frame of the video signal. The block division unit 10 outputs the divided blocks to the difference unit 11, the prediction unit 22, and the optimization unit 23.

ブロック分割は、一定の大きさの正方形で行われることもあれば、MPEG-4 AVC／H.264、MPEG−H HEVC／H.265等で規定されているように、様々な大きさの矩形で行われることもある。 The block division may be performed by a square of a fixed size, or a rectangle of various sizes as defined in MPEG-4 AVC / H.264, MPEG-H HEVC / H.265, etc. Sometimes it is done in.

尚、後述する差分手段１１、直交変換手段１３、逆直交変換手段１７及び加算手段１９の処理におけるブロック形状と、後述する予測手段２２及び最適化手段２３の処理におけるブロック形状とが異なる場合もあり得る。この場合、ブロック分割手段１０は、差分手段１１等用のブロック形状と予測手段２２等用のブロック形状とが異なるように、それぞれの分割形状を割り当てる。そして、ブロック分割手段１０は、映像信号のフレームを、それぞれのブロック形状のブロックに分割し、一方のブロックを差分手段１１に出力し、他方のブロックを予測手段２２及び最適化手段２３に出力する。 Note that the block shape in the processing of the later-described difference unit 11, orthogonal transformation unit 13, inverse orthogonal transformation unit 17, and addition unit 19 may be different from the block shape in the processing of the prediction unit 22 and the optimization unit 23 described later. obtain. In this case, the block dividing means 10 assigns each divided shape such that the block shape for the difference means 11 and the like and the block shape for the prediction means 22 and the like are different. Then, the block dividing unit 10 divides the frame of the video signal into blocks having respective block shapes, outputs one block to the difference unit 11, and outputs the other block to the prediction unit 22 and the optimization unit 23. .

差分手段１１は、ブロック分割手段１０からブロックを入力すると共に、予測手段２２から予測画像を入力し、当該ブロックの位置について、ブロックの画素値列から予測画像の画素値列を減算し、その減算結果である残差値列を求める。差分手段１１は、ブロックの残差値列を直交変換手段１３に出力する。 The difference unit 11 receives the block from the block division unit 10 and the prediction image from the prediction unit 22, and subtracts the pixel value sequence of the prediction image from the pixel value sequence of the block for the position of the block. Find the resulting residual value sequence. The difference means 11 outputs the residual value sequence of the block to the orthogonal transformation means 13.

直交変換手段１３は、差分手段１１からブロックの残差値列を入力し、残差値列に対し、所定の複数の基底関数にて直交変換を施し、変換係数列を生成する。直交変換手段１３は、ブロックの変換係数列を量子化手段１４に出力する。 The orthogonal transformation means 13 receives the residual value sequence of the block from the difference means 11 and performs an orthogonal transformation on the residual value sequence using a plurality of predetermined basis functions to generate a transformation coefficient sequence. The orthogonal transform unit 13 outputs a transform coefficient sequence of the block to the quantization unit 14.

基底関数には、例えば離散コサイン変換、その整数近似である整数変換が用いられる。また、MPEG−H HEVC／H.265においては、所定の条件下で離散サイン変換も用いられる。変換係数列とは、水平及び垂直の２次元空間周波数のそれぞれについて、異なる基底関数を適用した結果の変換係数を並べたものである。 For the basis function, for example, a discrete cosine transform or an integer approximation thereof is used. In MPEG-H HEVC / H.265, discrete sine transform is also used under predetermined conditions. The conversion coefficient sequence is obtained by arranging conversion coefficients obtained by applying different basis functions to each of the horizontal and vertical two-dimensional spatial frequencies.

量子化手段１４は、直交変換手段１３からブロックの変換係数列を入力し、変換係数列に対し、所定の量子化テーブルに基づき量子化（離散値化）し、量子化インデックス列を生成する。量子化手段１４は、ブロックの量子化インデックス列をエントロピー符号化手段１５及び逆量子化手段１６に出力する。 The quantization means 14 receives the transform coefficient sequence of the block from the orthogonal transform means 13 and quantizes (discretely converts) the transform coefficient sequence based on a predetermined quantization table to generate a quantization index sequence. The quantization means 14 outputs the quantization index sequence of the block to the entropy coding means 15 and the inverse quantization means 16.

量子化の幅は、量子化ステップ及び量子化パラメータ（ＱＰ：Quantization Parameter）により決定される。また、量子化の幅は、通例、空間周波数により異なるように設定され、空間周波数が高いほど広く（粗く）、空間周波数が低いほど狭く（細かく）設定される。量子化パラメータは、値が大きいほど変換係数列が粗く量子化され、情報圧縮の度合いが大きくなる反面、画質の劣化が大きくなる。 The width of quantization is determined by a quantization step and a quantization parameter (QP: Quantization Parameter). Further, the quantization width is generally set to be different depending on the spatial frequency, and is set wider (rougher) as the spatial frequency is higher, and narrower (finer) as the spatial frequency is lower. As the value of the quantization parameter increases, the transform coefficient sequence is coarsely quantized and the degree of information compression increases, but the image quality deteriorates greatly.

エントロピー符号化手段１５は、量子化手段１４からブロックの量子化インデックス列を入力し、量子化インデックス列に対し、変換係数列の統計的性質を考慮したモデルに基づきエントロピー符号化を行って符号を割り当て、符号化ビット列を生成する。エントロピー符号化手段１５により生成された符号化ビット列は、後述する映像復号装置２００へ出力される。 The entropy coding unit 15 inputs the block's quantization index sequence from the quantization unit 14 and performs entropy coding on the quantization index sequence based on a model that takes into account the statistical properties of the transform coefficient sequence to encode a code. Generate an allocation and coded bit string. The coded bit string generated by the entropy coding unit 15 is output to a video decoding device 200 described later.

エントロピー符号化には、例えば、ハフマン符号化、算術符号化、ランレングス符号化、ＣＡＶＬＣ（Context−adaptive Variable Length Coding）、ＣＡＢＡＣ（Context−adaptive Binary Arithmetic Coding）等が用いられる。 For the entropy coding, for example, Huffman coding, arithmetic coding, run-length coding, CAVLC (Context-adaptive Variable Length Coding), CABAC (Context-adaptive Binary Arithmetic Coding) and the like are used.

ここで、エントロピー符号化手段１５により、量子化インデックス列に加え、後述する予測手段２２にて求めた動きベクトル情報、後述する最適化手段２３にて求めた最適モード情報、映像の大きさ等の情報、その他、使用者等により付加された情報等もエントロピー符号化され、符号化ビット列として出力される。 Here, in addition to the quantization index sequence, the entropy encoding unit 15 adds motion vector information obtained by a prediction unit 22 described later, optimal mode information obtained by an optimization unit 23 described later, Information and other information added by a user or the like are also entropy-encoded and output as encoded bit strings.

逆量子化手段１６は、量子化手段１４からブロックの量子化インデックス列を入力し、量子化手段１４の逆の処理を行うことで、量子化インデックス列を逆量子化し、量子化インデックス列を変換係数列に戻す。逆量子化手段１６は、ブロックの変換係数列を逆直交変換手段１７に出力する。 The inverse quantization means 16 receives the quantization index sequence of the block from the quantization means 14 and performs inverse processing of the quantization means 14 to inversely quantize the quantization index sequence and transform the quantization index sequence. Return to coefficient sequence. The inverse quantization means 16 outputs the transform coefficient sequence of the block to the inverse orthogonal transform means 17.

量子化手段１４においては、ある量子化インデックス値を与える変換係数値には所定の範囲（幅）があるが、逆量子化手段１６は、入力した量子化インデックス値に対する変換係数値として、当該範囲の代表値（例えば、中央値）を出力する。 The quantization means 14 has a predetermined range (width) for the transform coefficient value giving a certain quantization index value, but the inverse quantization means 16 determines the transform coefficient value for the input quantization index value as the range. Is output (for example, a median value).

逆直交変換手段１７は、逆量子化手段１６からブロックの変換係数列を入力し、直交変換手段１３の逆の処理を行うことで、変換係数列に逆直交変換を施し、復号残差値列を生成する。逆直交変換手段１７は、ブロックの復号残差値列を加算手段１９に出力する。例えば、逆直交変換手段１７は、直交変換手段１３が離散コサイン変換を行う場合、逆離散コサイン変換を行う。 The inverse orthogonal transform unit 17 receives the transform coefficient sequence of the block from the inverse quantization unit 16, performs inverse processing of the orthogonal transform unit 13, performs inverse orthogonal transform on the transform coefficient sequence, and obtains a decoded residual value sequence. Generate The inverse orthogonal transform unit 17 outputs the decoded residual value sequence of the block to the adding unit 19. For example, the inverse orthogonal transform unit 17 performs the inverse discrete cosine transform when the orthogonal transform unit 13 performs the discrete cosine transform.

加算手段１９は、逆直交変換手段１７からブロックの復号残差値列を入力すると共に、後述する予測手段２２から予測画像を入力する。そして、加算手段１９は、当該ブロックの位置について、復号残差値列に予測画像の画素値列を加算し、その加算結果である復号画素値列を求める。加算手段１９は、復号画素値列をループフィルタ２０に出力する。 The adding means 19 inputs the decoded residual value sequence of the block from the inverse orthogonal transform means 17 and a predicted image from the predicting means 22 described later. Then, the adding means 19 adds the pixel value sequence of the predicted image to the decoded residual value sequence for the position of the block, and obtains a decoded pixel value sequence as a result of the addition. The adding means 19 outputs the decoded pixel value sequence to the loop filter 20.

ループフィルタ２０は、加算手段１９から復号画素値列を入力し、復号画素値列に対してフィルタ処理を行い、フィルタ処理後の復号画素値列をメモリ２１に格納すると共に、最適化手段２３に出力する。これにより、ブロック境界の歪み等を抑制することができる。ループフィルタ２０には、例えばデブロッキングフィルタ、ＳＡＯ（Sample Adaptive Offset）、ＡＬＦ（Adaptive Loop Filter）等が用いられる。 The loop filter 20 receives the decoded pixel value sequence from the adding unit 19, performs a filtering process on the decoded pixel value sequence, stores the decoded pixel value sequence after the filtering process in the memory 21, and Output. This makes it possible to suppress distortion or the like at the block boundary. As the loop filter 20, for example, a deblocking filter, SAO (Sample Adaptive Offset), ALF (Adaptive Loop Filter), or the like is used.

メモリ２１には、ループフィルタ２０によりフィルタ処理後の復号画素値列が格納される。ここで、ループフィルタ２０は、復号画素値列を、ブロックの位置を考慮しながら、メモリ２１に格納する。これにより、復号画像が順次形成される。順次というのは、ブロック分割手段１０により分割されたブロックの領域について順次処理するにあたり、ループフィルタ２０がその処理を終える毎に、当該ブロックの領域の復号画素値列が埋め込まれ、復号画像が形成される過程を意図している。 The decoded pixel value sequence after the filter processing by the loop filter 20 is stored in the memory 21. Here, the loop filter 20 stores the decoded pixel value sequence in the memory 21 in consideration of the position of the block. Thereby, decoded images are sequentially formed. The term “sequential” means that when sequentially processing the regions of the blocks divided by the block dividing means 10, each time the loop filter 20 completes the processing, the decoded pixel value sequence of the regions of the blocks is embedded, and the decoded image is formed. It is intended to be a process.

予測手段２２は、メモリ２１においてこれまでに形成された復号画像（これまでに復号された他の時刻のフレームであってもよい）の画素値列を読み出すと共に、必要に応じてブロック分割手段１０から映像信号のフレームのブロックを入力する。そして、予測手段２２は、これから処理すべきブロック内の画素値列を予測的に合成し、予測画像を生成する。予測手段２２は、予測画像を差分手段１１及び加算手段１９に出力する。 The prediction unit 22 reads out the pixel value sequence of the decoded image formed so far (may be a frame decoded at another time so far) in the memory 21 and, if necessary, the block dividing unit 10. To input a block of a frame of a video signal. Then, the prediction unit 22 predictively synthesizes a pixel value sequence in a block to be processed, and generates a predicted image. The prediction unit 22 outputs the predicted image to the difference unit 11 and the addition unit 19.

予測手段２２は、例えば、処理対象のブロックの領域において、メモリ２１内に既に形成されている他の時刻の復号画像の画素値列と、ブロック分割手段１０から入力したブロックにより構成された現フレームの画素値列とを比較し、その画素値列が最も整合する（誤差が小さくなる）空間的な相対位置関係を動きベクトルとして求める（動き探索により動きベクトルを求める）。そして、予測手段２２は、当該動きベクトルに基づいて、他の時刻の復号画像の画素値列を空間的に移動させる（動き補償する）ことで、予測画像を生成する。 The predicting unit 22 includes, for example, a pixel value sequence of a decoded image at another time that has already been formed in the memory 21 and a current frame formed by a block input from the block dividing unit 10 in an area of a block to be processed. Are compared with each other, and a spatial relative positional relationship where the pixel value sequence is most matched (the error is reduced) is obtained as a motion vector (a motion vector is obtained by motion search). Then, the prediction unit 22 generates a predicted image by spatially moving (motion-compensating) the pixel value sequence of the decoded image at another time based on the motion vector.

また、予測手段２２は、例えば、処理対象のブロックの領域に空間的に隣接し、かつメモリ２１内に既に形成されている画素値列に基づいて、その画素値列を空間的に一定方向に外挿したり、その画素値列間で演算を行った結果で埋めたりすることで、予測画像を生成する。 For example, the prediction unit 22 spatially adjoins the pixel value sequence spatially adjacent to the area of the block to be processed and based on the pixel value sequence already formed in the memory 21 in a fixed direction. A predicted image is generated by extrapolating or filling with a result obtained by performing an operation between the pixel value strings.

最適化手段２３は、ブロック分割手段１０、直交変換手段１３（これに対応する逆直交変換手段１７）、量子化手段１４（これに対応する逆量子化手段１６）、エントロピー符号化手段１５、ループフィルタ２０及び予測手段２２にて用いるパラメータを様々に変化させて試行する。そして、最適化手段２３は、レート歪特性（より少ない符号量、より少ない誤差を以てより良好とする）が最高または最高を近似するものとなるパラメータ（レート歪み特性を表す値が最高値から所定分低い値までの間に存在する場合のパラメータ）を特定する。 The optimizing unit 23 includes a block dividing unit 10, an orthogonal transform unit 13 (corresponding inverse orthogonal transform unit 17), a quantizing unit 14 (corresponding inverse quantizing unit 16), an entropy coding unit 15, a loop The experiment is performed by changing various parameters used in the filter 20 and the prediction unit 22. Then, the optimization unit 23 determines a parameter (the value representing the rate distortion characteristic is a predetermined value from the maximum value) that maximizes or approximates the maximum in the rate distortion characteristic (to be better with a smaller code amount and a smaller error). Parameters that exist between the lower values).

最適化手段２３は、特定したパラメータを最適モードのパラメータとし、最適モードのパラメータをブロック分割手段１０、直交変換手段１３（これに対応する逆直交変換手段１７）、量子化手段１４（これに対応する逆量子化手段１６）、エントロピー符号化手段１５、ループフィルタ２０及び予測手段２２に出力し、当該パラメータに従って動作させる。 The optimizing unit 23 uses the specified parameters as the parameters of the optimal mode, and uses the parameters of the optimal mode as the block dividing unit 10, the orthogonal transform unit 13 (the inverse orthogonal transform unit 17 corresponding thereto), and the quantization unit 14 (corresponding to this). To the inverse quantization means 16), the entropy coding means 15, the loop filter 20 and the prediction means 22, and operate according to the parameters.

これにより、ブロック分割手段１０、直交変換手段１３（これに対応する逆直交変換手段１７）、量子化手段１４（これに対応する逆量子化手段１６）、エントロピー符号化手段１５、ループフィルタ２０及び予測手段２２は、最適化手段２３から入力したパラメータに従った処理を行う。 Thereby, the block dividing means 10, the orthogonal transform means 13 (the corresponding inverse orthogonal transform means 17), the quantizing means 14 (the corresponding inverse quantizing means 16), the entropy encoding means 15, the loop filter 20, The prediction unit 22 performs a process according to the parameters input from the optimization unit 23.

また、最適化手段２３は、最適モードのパラメータを最適モード情報としてエントロピー符号化手段１５に出力する。 Further, the optimization unit 23 outputs the parameters of the optimal mode to the entropy encoding unit 15 as optimal mode information.

これにより、最適モード情報は、エントロピー符号化手段１５によりエントロピー符号化され、符号化ビット列として後述する映像復号装置２００へ出力される。 As a result, the optimal mode information is entropy-encoded by the entropy encoding means 15 and output to the video decoding device 200 described later as an encoded bit sequence.

このように、映像符号化装置１００の既存の符号化処理により、量子化インデックス列及び最適モード情報等が符号化ビット列に変換され、後述する映像復号装置２００へ出力される。 As described above, by the existing encoding processing of the video encoding device 100, the quantization index sequence, the optimal mode information, and the like are converted into encoded bit sequences, and output to the video decoding device 200 described later.

〔従来の映像復号装置〕
次に、従来の映像復号装置について説明する。図７は、従来の映像復号装置の構成例を示すブロック図である。この映像復号装置２００は、エントロピー復号手段３１、逆量子化手段３２、逆直交変換手段３３、加算手段３５、ループフィルタ３６、メモリ３７及び予測手段３８を備えている。 [Conventional video decoding device]
Next, a conventional video decoding device will be described. FIG. 7 is a block diagram illustrating a configuration example of a conventional video decoding device. The video decoding device 200 includes an entropy decoding unit 31, an inverse quantization unit 32, an inverse orthogonal transformation unit 33, an addition unit 35, a loop filter 36, a memory 37, and a prediction unit 38.

映像復号装置２００は、映像符号化装置１００から量子化インデックス列及び最適モード情報等の符号化ビット列を入力し、最適モード情報の示す最適モードのパラメータに従い既存の復号処理を行い、量子化インデックス列から元の映像信号のフレームを復元する。そして、映像復号装置２００は、復号映像信号を出力する。 The video decoding device 200 receives a quantization index sequence and an encoded bit sequence such as optimal mode information from the video encoding device 100, performs an existing decoding process according to the optimal mode parameter indicated by the optimal mode information, and performs quantization index sequence. From the original video signal. Then, the video decoding device 200 outputs the decoded video signal.

エントロピー復号手段３１は、映像符号化装置１００から符号化ビット列を入力し、図６に示したエントロピー符号化手段１５の逆の処理を行い、符号化ビット列をエントロピー復号することで、元の情報に可逆的に復号する。元の情報には、量子化インデックス列に加え、図１に示した予測手段２２にて求めた動きベクトル情報、最適化手段２３にて求めた最適モード情報、映像の大きさ等の情報、その他、使用者等により付加された情報等も含まれる。 The entropy decoding unit 31 receives the coded bit sequence from the video coding device 100, performs the reverse process of the entropy coding unit 15 shown in FIG. 6, and performs entropy decoding on the coded bit sequence to obtain the original information. Reversibly decode. The original information includes, in addition to the quantization index sequence, the motion vector information obtained by the prediction unit 22 shown in FIG. 1, the optimal mode information obtained by the optimization unit 23, the information such as the size of the video, and the like. , And information added by the user or the like.

エントロピー復号手段３１は、エントロピー復号した最適モード情報である最適モードのパラメータを、後述する逆量子化手段３２、逆直交変換手段３３、ループフィルタ３６及び予測手段３８に出力し、当該パラメータに従って動作させる。 The entropy decoding unit 31 outputs the parameters of the optimal mode, which are the entropy-decoded optimal mode information, to the below-described inverse quantization unit 32, inverse orthogonal transform unit 33, loop filter 36, and prediction unit 38, and operates according to the parameters. .

これにより、後述する逆量子化手段３２、逆直交変換手段３３、ループフィルタ３６及び予測手段３８は、エントロピー復号手段３１から入力した最適モードのパラメータに従った処理を行う。 Accordingly, the inverse quantization means 32, the inverse orthogonal transform means 33, the loop filter 36, and the prediction means 38, which will be described later, perform processing according to the parameters of the optimal mode input from the entropy decoding means 31.

また、エントロピー復号手段３１は、エントロピー復号したブロックの量子化インデックス列を逆量子化手段３２に出力する。 Further, the entropy decoding unit 31 outputs the quantization index sequence of the entropy-decoded block to the inverse quantization unit 32.

逆量子化手段３２は、エントロピー復号手段３１からブロックの量子化インデックス列を入力し、図６に示した逆量子化手段１６と同様の処理を行うことで、量子化インデックス列を変換係数列に変換する。逆量子化手段３２は、ブロックの変換係数列を逆直交変換手段３３に出力する。 The inverse quantization means 32 receives the block quantization index sequence from the entropy decoding means 31 and performs the same processing as the inverse quantization means 16 shown in FIG. 6 to convert the quantization index sequence into a transform coefficient sequence. Convert. The inverse quantization means 32 outputs the transform coefficient sequence of the block to the inverse orthogonal transform means 33.

逆直交変換手段３３は、逆量子化手段３２からブロックの変換係数列を入力し、図６に示した逆直交変換手段１７と同様の処理を行うことで、変換係数列に逆直交変換を施し、復号残差値列を生成する。逆直交変換手段３３は、ブロックの復号残差値列を加算手段３５に出力する。 The inverse orthogonal transform unit 33 receives the transform coefficient sequence of the block from the inverse quantization unit 32 and performs the same processing as the inverse orthogonal transform unit 17 shown in FIG. 6 to perform the inverse orthogonal transform on the transform coefficient sequence. , And generates a decoded residual value sequence. The inverse orthogonal transform unit 33 outputs the decoded residual value sequence of the block to the adding unit 35.

加算手段３５は、逆直交変換手段３３からブロックの復号残差値列を入力すると共に、後述する予測手段３８から予測画像を入力する。そして、加算手段３５は、図６に示した加算手段１９と同様の処理を行うことで、当該ブロックの位置について、復号残差値列に予測画像の画素値列を加算し、その加算結果である復号画素値列を求める。加算手段３５は、復号画素値列をループフィルタ３６に出力する。 The adding unit 35 receives the decoded residual value sequence of the block from the inverse orthogonal transforming unit 33 and also receives a predicted image from a predicting unit 38 described later. Then, the adding unit 35 performs the same processing as the adding unit 19 shown in FIG. 6 to add the pixel value sequence of the predicted image to the decoded residual value sequence for the position of the block, and obtains the result of the addition. A certain decoded pixel value sequence is obtained. The adder 35 outputs the decoded pixel value sequence to the loop filter 36.

ループフィルタ３６は、加算手段３５から復号画素値列を入力し、図６に示したループフィルタ２０と同様の処理を行うことで、復号画素値列に対してフィルタ処理を行い、フィルタ処理後の復号画素値列をメモリ３７に格納する。 The loop filter 36 receives the decoded pixel value sequence from the adding unit 35 and performs the same processing as the loop filter 20 shown in FIG. 6 to perform a filtering process on the decoded pixel value sequence. The decoded pixel value sequence is stored in the memory 37.

メモリ３７には、図６に示したメモリ２１と同様に、ループフィルタ３６によりフィルタ処理後の復号画素値列が、対応するブロックの位置に順次格納される。これにより、メモリ３７には、復号画像が順次形成される。 In the memory 37, similarly to the memory 21 illustrated in FIG. 6, the decoded pixel value sequence that has been subjected to the filtering process by the loop filter 36 is sequentially stored at the corresponding block position. Thus, decoded images are sequentially formed in the memory 37.

映像復号装置２００は、メモリ３７内に復号画像が完全に形成された後、メモリ３７から復号画像を読み出し、復号映像信号として外部へ出力する。 After the decoded image is completely formed in the memory 37, the video decoding device 200 reads the decoded image from the memory 37 and outputs the decoded image to the outside as a decoded video signal.

予測手段３８は、メモリ３７においてこれまでに形成された復号画像（これまでに復号された他の時刻のフレームであってもよい）の画素値列を読み出し、図６に示した予測手段２２と同様に、これから処理すべきブロック内の画素値列を予測的に合成し、予測画像を生成する。予測手段３８は、予測画像を加算手段３５に出力する。 The prediction unit 38 reads out the pixel value sequence of the decoded image formed so far (may be a frame decoded at another time so far) in the memory 37, and reads the pixel value sequence from the prediction unit 22 shown in FIG. Similarly, a pixel value sequence in a block to be processed is synthesized predictively to generate a predicted image. The prediction unit 38 outputs the predicted image to the addition unit 35.

予測手段３８は、例えば、動きベクトル情報を利用する場合、エントロピー復号手段３１からパラメータである動きベクトル情報を入力し、動きベクトル情報に基づいて、他の時刻の復号画像の画素値列を空間的に移動させる（動き補償する）ことで、予測画像を生成する。この場合、動き探索は行わない。 For example, when using the motion vector information, the prediction unit 38 inputs the motion vector information as a parameter from the entropy decoding unit 31 and spatially converts the pixel value sequence of the decoded image at another time based on the motion vector information. (A motion compensation) to generate a predicted image. In this case, no motion search is performed.

このように、映像復号装置２００の既存の復号処理により、符号化ビット列が量子化インデックス列及び最適モード情報等に変換され、元の映像信号のフレームが復元される。 As described above, by the existing decoding processing of the video decoding device 200, the coded bit sequence is converted into the quantization index sequence and the optimal mode information and the like, and the frame of the original video signal is restored.

図６及び図７にて説明した符号化方式は、ハイブリッド符号化またはＭＣ−ＤＣＴ（Motion Compensation／Discrete Cosine Transform）符号化と呼ばれている。例えば、ＭＰＥＧ−１、ＭＰＥＧ−２、ＭＰＥＧ−４、ＭＰＥＧ−４ＡＶＣ／Ｈ．２６４、及びＭＰＥＧ−ＨＨＥＶＣ／Ｈ．２６５の各映像符号化方式は、ＭＣ−ＤＣＴ符号化に属する方式である。 The coding scheme described in FIGS. 6 and 7 is called hybrid coding or MC-DCT (Motion Compensation / Discrete Cosine Transform) coding. For example, MPEG-1, MPEG-2, MPEG-4, MPEG-4 AVC / H. H.264, and MPEG-H HEVC / H. Each of the H.265 video coding schemes is a scheme belonging to MC-DCT coding.

また、ＭＣ−ＤＣＴ符号化を含む既存の映像符号化方式に、前処理及び後処理を付加することで、符号化効率及び主観画質を向上する手法が開示されている（例えば特許文献１を参照）。特許文献１の手法では、前処理に解像度削減処理を用いることで、既存の映像符号化方式における圧縮率を緩和し、一方、後処理に超解像技術を応用することで、ぼやけの少ない解像度復元を実現している。このとき、解像度復元の処理を調整可能とし、入力映像を最もよく再現できる最適な調整値を送信側で予め求めて、最適な調整値を復号側へ補助情報として伝送することで、高効率な圧縮伝送を可能とする Also, a technique has been disclosed in which pre-processing and post-processing are added to existing video coding methods including MC-DCT coding to improve coding efficiency and subjective image quality (for example, see Patent Document 1). ). In the method of Patent Literature 1, the compression ratio in the existing video encoding method is reduced by using the resolution reduction processing in the pre-processing, while the resolution is reduced by applying the super-resolution technique to the post-processing. The restoration has been realized. At this time, it is possible to adjust the resolution restoring process, obtain an optimal adjustment value that can reproduce the input video best on the transmission side in advance, and transmit the optimal adjustment value to the decoding side as auxiliary information, thereby achieving high efficiency. Enables compressed transmission

特許第５４１９７９５号公報Japanese Patent No. 5419795

しかしながら、特許文献１の手法は、前処理及び後処理が、既存の符号化処理の外側に付加される形態である。このため、既存の符号化処理において実行される最適化（例えば、レート歪最適化：Rate-Distortion Optimization（ＲＤＯ））と、前処理及び後処理において実行される最適化（解像度復元処理等において実行される最適化）とは、異なる処理であって互いに独立している。既存の符号化処理において最適化が行われ、かつ前処理及び後処理において最適化が行われたとしても、全体として最適化が行われるとは限らない。したがって、特許文献１の手法は、既存の符号化処理、前処理及び後処理の全体の大局的な観点において、必ずしも最適化に適しているとは限らない。 However, the method of Patent Document 1 is a form in which pre-processing and post-processing are added outside the existing encoding processing. For this reason, optimization performed in existing encoding processing (for example, Rate-Distortion Optimization (RDO)) and optimization performed in pre-processing and post-processing (performed in resolution restoration processing and the like) Optimization is a different process and is independent of each other. Even if optimization is performed in existing encoding processing and optimization is performed in pre-processing and post-processing, optimization is not necessarily performed as a whole. Therefore, the technique of Patent Document 1 is not always suitable for optimization from the overall viewpoint of the existing encoding processing, pre-processing, and post-processing.

例えば、前処理及び後処理は、既存の符号化処理におけるブロック分割とは無関係に実行される。このため、前処理の後の画像形状または標本化構造は、既存の符号化処理に入力可能な形態である必要が生じ、前処理として適用可能な変換処理が限定的であった。また、既存の符号化処理において、ブロック内の場所毎に異なる倍率の解像度変換を適用したり、標本化格子形状を縦横以外の方向に拡大または縮小したり、非線形的な標本化構造により再標本化することは、一般的に許容されていない。 For example, pre-processing and post-processing are performed independently of block division in existing encoding processing. For this reason, the image shape or the sampling structure after the preprocessing needs to be in a form that can be input to the existing encoding processing, and the conversion processing applicable as the preprocessing is limited. In addition, in existing encoding processing, resolution conversion of a different magnification is applied to each location in a block, the sampling grid shape is enlarged or reduced in directions other than the vertical and horizontal directions, and a resampling is performed by a non-linear sampling structure. Is generally not allowed.

このように、特許文献１の手法により、既存の符号化処理に対し前処理及び後処理を付加することで符号化効率を高めることはできるが、前述のとおり、既存の符号化処理と前処理及び後処理との間で十分に連係を図ることができないことから、全体として最適化が不十分であるという問題があった。このため、全体の大局的な観点で最適化を実現するべく、さらなる改善が望まれていた。 As described above, by adding the pre-processing and the post-processing to the existing encoding processing, the encoding efficiency can be increased by the technique of Patent Document 1, but as described above, the existing encoding processing and the pre-processing are performed. In addition, there is a problem that the optimization is insufficient as a whole because it is not possible to sufficiently cooperate with the post-processing. For this reason, further improvement has been desired in order to realize optimization from a global perspective.

そこで、本発明は前記課題を解決するためになされたものであり、その目的は、映像信号の符号化効率を改善可能な映像符号化装置、映像復号装置及びプログラムを提供することにある。 Then, this invention is made in order to solve the said subject, The objective is to provide the video encoding device which can improve the encoding efficiency of a video signal, a video decoding device, and a program.

前記課題を解決するために、請求項１の映像符号化装置は、映像信号のフレームをブロックに分割し、前記ブロックの各標本点の画素値、または前記画素値と予測画像の画素値との間の差である残差値に対し、直交変換及び量子化を施し、エントロピー符号化した符号化ビット列を出力する映像符号化装置において、前記ブロックを単位として、前記標本点の数、配置、及び前記画素値または前記残差値のうちの少なくとも１つを変更することで、前記標本点を再標本化する再標本化手段と、前記再標本化手段にて用いるパラメータ、並びに前記ブロックを分割する処理、前記直交変換の処理及び前記量子化の処理にて用いるそれぞれのパラメータを含む複数のパラメータを変化させ、符号化効率を表す評価値をそれぞれ求め、前記評価値が所定範囲の値のときの前記複数のパラメータを、最適な複数のパラメータとして特定する最適化手段と、前記再標本化手段により再標本化され、前記直交変換及び前記量子化が施された量子化インデックス列、及び、前記最適化手段により特定された前記複数のパラメータをエントロピー符号化し、前記符号化ビット列を生成するエントロピー符号化手段と、を備え、前記再標本化手段が、前記最適化手段により特定されたパラメータに対応した再標本化規則に従い、前記標本点を再標本化する、ことを特徴とする。 In order to solve the problem, the video encoding device according to claim 1 divides a frame of a video signal into blocks, and calculates a pixel value of each sample point of the block, or a pixel value of each of the sample values and a pixel value of a prediction image. In a video encoding device that performs orthogonal transformation and quantization on a residual value that is a difference between the two, and outputs an encoded bit sequence that is entropy-encoded, the number, arrangement, and Changing at least one of the pixel value or the residual value to resample the sample points, a parameter used in the resampler , and the block. Process, changing a plurality of parameters including respective parameters used in the orthogonal transformation process and the quantization process, respectively, to obtain evaluation values representing coding efficiency, Optimizing means for specifying the plurality of parameters at a fixed range of values as optimal plurality of parameters, and quantization performed by the orthogonal transformation and the quantization, which are resampled by the resampling means. An index sequence, and entropy-encoding means for entropy-encoding the plurality of parameters specified by the optimizing means to generate the encoded bit string, wherein the re-sampling means is configured by the optimizing means The sampling point is resampled according to a resampling rule corresponding to the specified parameter.

係る構成によれば、映像信号のフレームがブロックに分割された後に、当該ブロックを単位として再標本化が行われる。これにより、再標本化の処理とブロックを単位として行われる直交変換及び量子化の処理とを連動させることができるから、ブロックの単位とは無関係な前処理及び後処理が行われる従来技術に比べ、全体として効果的な最適化を実現できる。また、ブロックを単位として、最適なパラメータに対応した再標本化が行われるから、最適化を確実に実現することができる。結果として、符号化効率を改善することが可能となる。 According to such a configuration, after the frame of the video signal is divided into blocks, resampling is performed in units of the blocks. As a result, the resampling process and the orthogonal transform and quantization processes performed on a block-by-block basis can be linked, so that pre-processing and post-processing irrespective of the block unit are performed as compared with the related art. As a whole, effective optimization can be realized. In addition, since resampling corresponding to the optimal parameter is performed in units of blocks, optimization can be reliably realized. As a result, it is possible to improve coding efficiency.

また、請求項２の映像符号化装置は、請求項１に記載の映像符号化装置において、前記再標本化手段が入力する標本点を入力標本点とし、前記再標本化手段により再標本化された標本点を出力標本点とした場合に、前記再標本化規則が、前記ブロックを単位として、前記入力標本点の数を削減するように、または前記入力標本点の粗密を変化させるように、前記出力標本点の配置を設定し、前記出力標本点を基点とした所定範囲に存在する所定数の前記入力標本点の前記画素値または前記残差値に基づいて、前記出力標本点の前記画素値または前記残差値を算出する規則である、ことを特徴とする。 In the video encoding apparatus according to a second aspect of the present invention, in the video encoding apparatus according to the first aspect , the sample points input by the resampling unit are set as input sample points, and resampled by the resampling unit. When the sampled points are output sample points, the re-sampling rule, in units of the block, to reduce the number of the input sample points, or to change the density of the input sample points, Set the arrangement of the output sample points, based on the pixel value or the residual value of a predetermined number of the input sample points present in a predetermined range based on the output sample points, the pixel of the output sample points Value or a rule for calculating the residual value.

係る構成によれば、例えば、画素値または残差値が所定の広い範囲で類似値となる映像信号について、最適なパラメータに対応して広い範囲のブロックが分割された場合、再標本化手段により、最適なパラメータに対応して標本点の数が削減されることになり得る。これにより、標本点の数が少なくなるから、後段の直交変換処理の演算負荷を抑えつつ、等価的な直交変換処理のサポート範囲を拡大することができる。また、再標本化手段により、最適なパラメータに対応して、標本化格子をブロック内のテクスチャ、その残差の程度または複雑さ等に応じて粗密を変化させることができる。これにより、ブロック内で必要な箇所に多くの標本点を配置することができ、映像信号の近似精度を向上させることができる。結果として、符号化効率を改善することが可能となる。 According to such a configuration, for example, for a video signal in which pixel values or residual values have similar values in a predetermined wide range, when a wide range of blocks is divided corresponding to an optimal parameter, , The number of sample points may be reduced corresponding to the optimal parameters. As a result, the number of sample points is reduced, and the support range of the equivalent orthogonal transformation process can be expanded while suppressing the computational load of the subsequent orthogonal transformation process. Further, the resampling means can change the density of the sampling grid in accordance with the texture in the block, the degree or complexity of the residual, etc., in accordance with the optimal parameters. Thereby, many sample points can be arranged at necessary places in the block, and the approximation accuracy of the video signal can be improved. As a result, it is possible to improve coding efficiency.

さらに、請求項３の映像復号装置は、請求項１の映像符号化装置により出力された符号化ビット列を入力し、前記符号化ビット列をエントロピー復号し、逆量子化及び逆直交変換して元の映像信号のフレームを復元する映像復号装置において、前記符号化ビット列をエントロピー復号し、量子化インデックス列及び複数のパラメータを生成するエントロピー復号手段と、前記映像信号のフレームが分割されたブロックを単位として、前記映像符号化装置が標本点を再標本化する処理とは逆の処理にて、前記エントロピー復号手段により生成されたパラメータに対応した逆再標本化規則に従い、前記標本点の数、配置、及び前記量子化インデックス列が逆直交変換及び前記逆直交変換された復号画素値または復号残差値のうちの少なくとも１つを変更することで、前記標本点を逆再標本化する逆再標本化手段と、を備え、前記逆再標本化手段により逆再標本化された前記復号画素値または前記復号残差値に基づいて、前記元の映像信号のフレームを復元する、ことを特徴とする。 Furthermore, the image decoding apparatus according to claim 3, enter the encoded bit string output by the video encoding apparatus according to claim 1, wherein the encoded bit stream to the entropy decoding the original and the inverse quantization and inverse orthogonal transform In a video decoding device for restoring a frame of a video signal, entropy decoding of the coded bit sequence, entropy decoding means for generating a quantization index sequence and a plurality of parameters, the block of the frame of the video signal as a unit In a process reverse to the process in which the video encoding device resamples the sample points, the number, arrangement, and the number of the sample points according to a reverse resampling rule corresponding to the parameter generated by the entropy decoding means. And the quantization index sequence is at least one of an inverse orthogonal transform and a decoded pixel value or a decoded residual value subjected to the inverse orthogonal transform. By changing the sampling point, reverse resampling means for reverse resampling the sample points, based on the decoded pixel value or the decoded residual value inversely resampled by the reverse resampling means. And restoring the frame of the original video signal.

係る構成によれば、ブロックを単位として逆再標本化が行われる。これにより、ブロックを単位として行われる逆量子化及び逆直交変換の処理と逆標本化の処理とを連動させることができるから、ブロックの単位とは無関係な前処理及び後処理が行われる従来技術に比べ、全体として効果的な最適化を実現できる。また、ブロックを単位として、最適なパラメータに対応した逆再標本化が行われるから、最適化を確実に実現することができる。結果として、符号化効率を改善することが可能となる。 According to such a configuration, inverse resampling is performed for each block. This makes it possible to link the inverse quantization and inverse orthogonal transform processing performed in units of blocks with the inverse sampling processing, so that the prior art in which preprocessing and postprocessing independent of the block units are performed. As a whole, effective optimization can be realized. In addition , since inverse resampling corresponding to the optimal parameters is performed in units of blocks, optimization can be reliably achieved. As a result, it is possible to improve coding efficiency.

また、請求項４の映像復号装置は、請求項３に記載の映像復号装置において、前記逆再標本化手段が入力する標本点を入力標本点とし、前記逆再標本化手段により逆再標本化された標本点を出力標本点とした場合に、前記逆再標本化規則が、前記ブロックを単位として、前記標本点の数を増加させるように、または前記入力標本点の粗密を変化させるように、前記出力標本点の配置を設定し、前記出力標本点を基点とした所定範囲に存在する所定数の前記入力標本点の前記復号画素値または前記復号残差値に基づいて、前記出力標本点の前記復号画素値または前記復号残差値を算出する規則である、ことを特徴とする。 According to a fourth aspect of the present invention, in the video decoding apparatus according to the third aspect , the sample points input by the inverse resampling unit are set as input sample points, and the inverse resampling unit performs inverse resampling. When the sampled points are set as output sample points, the inverse re-sampling rule increases the number of the sample points in units of the block or changes the density of the input sample points. Setting the arrangement of the output sample points, based on the decoded pixel values or the decoded residual values of a predetermined number of the input sample points present in a predetermined range based on the output sample points, Is a rule for calculating the decoded pixel value or the decoded residual value.

係る構成によれば、例えば、画素値または残差値が所定の広い範囲で類似値となる映像信号について、映像符号化装置にて最適なパラメータに対応して広い範囲のブロックが分割され、映像符号化装置の再標本化手段により、最適なパラメータに対応して標本点の数が減少された場合、当該映像復号装置の逆再標本化手段により、最適なパラメータに対応して標本点の数を増加して元の標本点に戻すことができる。また、映像符号化装置の再標本化手段により、最適なパラメータに対応して、標本化格子をブロック内のテクスチャ、その残差の程度または複雑さ等に応じて粗密を変化させた場合、当該映像復号装置の逆再標本化手段により、最適なパラメータに対応して元の標本点に戻すことができる。結果として、符号化効率を改善することが可能となる。 According to such a configuration, for example, for a video signal whose pixel value or residual value is a similar value in a predetermined wide range, a wide range of blocks is divided according to the optimal parameter in the video encoding device, When the number of sample points corresponding to the optimal parameters is reduced by the resampling unit of the encoding device, the number of sample points corresponding to the optimal parameters is reduced by the inverse resampling unit of the video decoding device. Can be increased back to the original sample point. Further, when the sampling grid is changed in density according to the texture in the block, the degree or complexity of the residual, etc., in accordance with the optimal parameter by the resampling means of the video encoding device, By the inverse resampling means of the video decoding device, it is possible to return to the original sample points corresponding to the optimal parameters. As a result, it is possible to improve coding efficiency.

以上のように、本発明によれば、映像信号の符号化効率を改善することが可能となる。 As described above, according to the present invention, it is possible to improve the coding efficiency of a video signal.

本発明の実施形態による映像符号化装置の構成例を示すブロック図である。1 is a block diagram illustrating a configuration example of a video encoding device according to an embodiment of the present invention. 再標本化処理の例を説明する図である。FIG. 9 is a diagram illustrating an example of a resampling process. ４つの入力標本点の残差値（画素値）に基づいた双一次補間を行う数式（１）を説明する図である。FIG. 9 is a diagram illustrating Formula (1) for performing bilinear interpolation based on residual values (pixel values) of four input sample points. 逆再標本化処理の例を説明する図である。FIG. 14 is a diagram illustrating an example of a reverse resampling process. 本発明の実施形態による映像復号装置の構成例を示すブロック図である。It is a block diagram showing an example of composition of a picture decoding device by an embodiment of the present invention. 従来の映像符号化装置の構成例を示すブロック図である。FIG. 14 is a block diagram illustrating a configuration example of a conventional video encoding device. 従来の映像復号装置の構成例を示すブロック図である。FIG. 29 is a block diagram illustrating a configuration example of a conventional video decoding device.

以下、本発明を実施するための形態について図面を用いて詳細に説明する。
〔映像符号化装置〕
まず、本発明の実施形態による映像符号化装置について説明する。図１は、本発明の実施形態による映像符号化装置の構成例を示すブロック図である。この映像符号化装置１は、ブロック分割手段１０、差分手段１１、再標本化手段１２、直交変換手段１３、量子化手段１４、エントロピー符号化手段１５、逆量子化手段１６、逆直交変換手段１７、逆再標本化手段１８、加算手段１９、ループフィルタ２０、メモリ２１、予測手段２２及び最適化手段２４を備えている。 Hereinafter, embodiments for carrying out the present invention will be described in detail with reference to the drawings.
[Video encoding device]
First, a video encoding device according to an embodiment of the present invention will be described. FIG. 1 is a block diagram illustrating a configuration example of a video encoding device according to an embodiment of the present invention. The video encoding apparatus 1 includes a block dividing unit 10, a difference unit 11, a resampling unit 12, an orthogonal transform unit 13, a quantization unit 14, an entropy encoding unit 15, an inverse quantization unit 16, and an inverse orthogonal transform unit 17. , An inverse resampling unit 18, an adding unit 19, a loop filter 20, a memory 21, a predicting unit 22, and an optimizing unit 24.

図６に示した従来の映像符号化装置１００と、図１に示す本発明の実施形態による映像符号化装置１とを比較すると、両映像符号化装置１，１００は、ブロック分割手段１０、差分手段１１、直交変換手段１３、量子化手段１４、エントロピー符号化手段１５、逆量子化手段１６、逆直交変換手段１７、加算手段１９、ループフィルタ２０、メモリ２１及び予測手段２２を備えている点で同一である。 Comparing the conventional video encoding device 100 shown in FIG. 6 with the video encoding device 1 according to the embodiment of the present invention shown in FIG. Means 11, orthogonal transformation means 13, quantization means 14, entropy coding means 15, inverse quantization means 16, inverse orthogonal transformation means 17, addition means 19, loop filter 20, memory 21, and prediction means 22 Are the same.

一方、映像符号化装置１は、映像符号化装置１００の構成に加え、さらに再標本化手段１２及び逆再標本化手段１８を備え、映像符号化装置１００に備えた最適化手段２３とは異なる最適化手段２４を備えている点で、映像符号化装置１００と相違する。図１において、図６と共通する部分には図６と同一の符号を付し、その詳しい説明は省略する。 On the other hand, the video encoding device 1 further includes a resampling unit 12 and an inverse resampling unit 18 in addition to the configuration of the video encoding device 100, and is different from the optimizing unit 23 provided in the video encoding device 100. It differs from the video encoding device 100 in that it has an optimizing unit 24. In FIG. 1, portions common to FIG. 6 are denoted by the same reference numerals as in FIG. 6, and detailed description thereof will be omitted.

再標本化手段１２は、差分手段１１と直交変換手段１３の間に設けられ、差分手段１１からブロックの残差値列を入力し、ブロック単位に、標本点の数、配置及び残差値を変更することで標本点を再標本化する。そして、再標本化手段１２は、再標本化後のブロックの残差値列を直交変換手段１３に出力する。 The resampling unit 12 is provided between the difference unit 11 and the orthogonal transformation unit 13, receives a sequence of residual values of a block from the difference unit 11, and calculates the number, arrangement, and residual value of the sample points for each block. Resample the sampling points by changing. Then, the resampling unit 12 outputs the residual value sequence of the block after the resampling to the orthogonal transformation unit 13.

尚、再標本化手段１２は、標本点の数のみを変更することで再標本化してもよいし、標本点の配置のみを変更することで再標本化してもよいし、標本点の残差値のみを変更することで再標本化してもよい。要するに、再標本化手段１２は、標本点の数、配置及び残差値のうちの１つ、２つまたは全てを変更することで、標本点を再標本化すればよい。 The resampling unit 12 may resample by changing only the number of sample points, may resample by changing only the arrangement of the sample points, Re-sampling may be performed by changing only the value. In short, the resampling unit 12 may resample the sample points by changing one, two, or all of the number, arrangement, and residual values of the sample points.

図２は、再標本化手段１２による再標本化処理の例を説明する図である。図２において、○印は、再標本化手段１２に入力されるブロックの残差値列についての標本点（再標本化前の標本点（入力標本点））を示し、×印は、再標本化手段１２が出力するブロックの残差値列についての標本点（再標本化後の標本点（出力標本点））を示す。入力標本点の残差値列を出力標本点の残差値列に変換するための再標本化の規則（再標本化規則）は、図２に示す入力標本点及び出力標本点の数、配置が定義され、出力標本点の残差値を算出するための数式を用いた変換規則として、予め設定されているものとする。 FIG. 2 is a diagram illustrating an example of a resampling process by the resampling unit 12. In FIG. 2, a circle indicates a sample point (sample point before resampling (input sample point)) of a residual value sequence of a block input to the resampling unit 12, and a cross indicates resampling. 2 shows sample points (sample points after re-sampling (output sample points)) of the residual value sequence of the block output by the conversion means 12. The resampling rule (resampling rule) for converting the residual value sequence of the input sample points into the residual value sequence of the output sample points is based on the number and arrangement of the input sample points and the output sample points shown in FIG. Is defined, and is set in advance as a conversion rule using a mathematical expression for calculating a residual value of an output sample point.

再標本化手段１２は、後述する最適化手段２４から最適再標本化モードのパラメータを入力し、当該パラメータに基づいて、予め設定された複数の再標本化規則（例えば図２（ａ）〜（ｄ）の規則）の中から１つの再標本化規則を決定する。そして、再標本化手段１２は、決定した再標本化規則に従い、標本点を再標本化する。 The resampling unit 12 receives the parameters of the optimal resampling mode from the optimizing unit 24, which will be described later, and based on the parameters, sets a plurality of preset resampling rules (for example, FIGS. One resampling rule is determined from rule d)). Then, the resampling unit 12 resamples the sampling points according to the determined resampling rule.

図２（ａ）は、標本化格子を変更しない例であり、再標本化を行わないものと等価である。再標本化手段１２は、図２（ａ）の予め設定された再標本化規則に従い、図２（ａ）に示す入力標本点を再標本化し、図２（ａ）に示す出力標本点の残差値列を求める。この場合、再標本化手段１２は、再標本化を行わないで、入力した残差値列をそのまま出力する。この再標本化規則は、図２（ａ）に示すように、入力標本点及び出力標本点の数及び配置を変更せず、かつその残差値も変更しない規則である。 FIG. 2A shows an example in which the sampling grid is not changed, and is equivalent to a case in which resampling is not performed. The resampling means 12 resamples the input sample points shown in FIG. 2A according to the preset resampling rule shown in FIG. Find the difference sequence. In this case, the resampling unit 12 outputs the input residual value sequence without performing resampling. As shown in FIG. 2A, this resampling rule is a rule that does not change the number and arrangement of input sample points and output sample points, and does not change the residual value.

図２（ｂ）は、標本数を削減し（水平方向に１／２倍、垂直方向に１／２倍に削減し）、再標本化後の配置も均一な間隔とする例である。再標本化手段１２は、図２（ｂ）の予め設定された再標本化規則に従い、図２（ｂ）に示す入力標本点を再標本化し、図２（ｂ）に示す出力標本点の残差値列を求める。この再標本化規則は、図２（ｂ）に示すように、入力標本点の標本数を水平方向に１／２倍、垂直方向に１／２倍に削減し、入力標本点（の均一な間隔）とは異なる均一な間隔となるように、出力標本点の配置を設定すると共に、所定の数式にて出力標本点の残差値を算出する規則である。 FIG. 2B shows an example in which the number of samples is reduced (１／ in the horizontal direction and 倍 in the vertical direction), and the arrangement after re-sampling is also uniform. The resampling means 12 resamples the input sampling points shown in FIG. 2B according to the preset resampling rule shown in FIG. Find the difference sequence. As shown in FIG. 2B, the resampling rule reduces the number of input sample points by a factor of 1/2 in the horizontal direction and by a factor of 1/2 in the vertical direction. This is a rule that sets the arrangement of the output sample points so as to have a uniform interval different from the (interval), and calculates the residual value of the output sample points using a predetermined mathematical formula.

図２（ｃ）は、標本数を保ったまま、再標本化の位置を不均一な間隔に（標本点の粗密を変化させるように）再配置する例である。再標本化手段１２は、図２（ｃ）の予め設定された再標本化規則に従い、図２（ｃ）に示す入力標本点を再標本化し、図２（ｃ）に示す出力標本点の残差値列を求める。この再標本化規則は、図２（ｃ）に示すように、入力標本点及び出力標本点の標本数を同一とし、入力標本点と同じまたは異なる位置かつ不均一な間隔となるように、出力標本点の配置を設定すると共に、所定の数式にて出力標本点の残差値を算出する規則である。 FIG. 2C shows an example in which the positions of resampling are rearranged at non-uniform intervals (to change the density of the sample points) while maintaining the number of samples. The resampling unit 12 resamples the input sampling points shown in FIG. 2C according to the preset resampling rule shown in FIG. Find the difference sequence. As shown in FIG. 2C, the resampling rule sets the number of samples of the input sample points and the output sample points to be the same, and sets the output sample points to be the same or different from the input sample points and to have an uneven interval. This is a rule that sets the arrangement of the sample points and calculates the residual value of the output sample points using a predetermined mathematical formula.

図２（ｄ）は、標本数を削減し（水平方向に１／４倍、垂直方向に１／４倍に削減し）、かつ再標本化の位置を不均一な間隔に（標本点の粗密を変化させるように）再配置する例である。再標本化手段１２は、図２（ｄ）の予め設定された再標本化規則に従い、図２（ｄ）に示す入力標本点を再標本化し、図２（ｄ）に示す出力標本点の残差値列を求める。この再標本化規則は、図２（ｄ）に示すように、入力標本点の標本数を水平方向に１／４倍、垂直方向に１／４倍に削減し、入力標本点と同じまたは異なる位置かつ不均一な間隔となるように、出力標本点の配置を設定すると共に、所定の数式にて出力標本点の残差値を算出する規則である。 FIG. 2 (d) shows that the number of samples is reduced (1/4 times in the horizontal direction and 1/4 times in the vertical direction), and the resampling positions are set at uneven intervals (coarse and Is changed). The resampling means 12 resamples the input sampling points shown in FIG. 2D according to the preset resampling rule shown in FIG. Find the difference sequence. This resampling rule reduces the number of samples at the input sample points to 1/4 times in the horizontal direction and 1/4 times in the vertical direction as shown in FIG. This is a rule for setting the arrangement of output sample points so as to be at a position and non-uniform intervals, and calculating the residual value of the output sample points using a predetermined mathematical formula.

図２（ａ）〜（ｄ）に示すように、再標本化規則において、出力標本点の位置と入力標本点の位置とは、必ずしも重なる必要はない。また、再標本化規則において、出力標本点の残差値は、当該出力標本点の近傍に存在する１以上の入力標本点（当該出力標本点を基点とした所定範囲に存在する所定数の入力標本点）の残差値に基づいて算出されるように定義されている。 As shown in FIGS. 2A to 2D, in the resampling rule, the position of the output sample point and the position of the input sample point do not necessarily need to overlap. In the resampling rule, the residual value of the output sampling point is calculated by calculating one or more input sampling points near the output sampling point (a predetermined number of input sampling points existing in a predetermined range starting from the output sampling point). (Sample point) is defined to be calculated based on the residual value.

例えば、出力標本点の位置（ｘ，ｙ）における残差値は、当該位置（ｘ，ｙ）からの距離（例えば、ユークリッド距離）が最も近い入力標本点の残差値であると定義してもよい。また、例えば、出力標本点の位置（ｘ，ｙ）における残差値は、当該位置（ｘ，ｙ）に最も近い４つの入力標本点の残差値に基づいて、双一次補間（バイリニア補間）により算出されるように定義してもよい。 For example, the residual value at the position (x, y) of the output sample point is defined as the residual value of the input sample point whose distance (for example, the Euclidean distance) from the position (x, y) is closest. Is also good. Further, for example, the residual value at the position (x, y) of the output sample point is bilinear interpolation (bilinear interpolation) based on the residual values of the four input sample points closest to the position (x, y). May be defined as calculated by

出力標本点の位置（ｘ，ｙ）における残差値をＬ（ｘ，ｙ）、入力標本点の位置（ｘ，ｙ）における残差値をＨ（ｘ，ｙ）とし、入力標本点の位置における水平及び垂直座標がともに整数の格子点である場合、以下の数式が用いられる。この数式は、４つの入力標本点の残差値に基づいた双一次補間を行う式である。

The residual value at the position (x, y) of the output sample point is L (x, y), the residual value at the position (x, y) of the input sample point is H (x, y), and the position of the input sample point is When both the horizontal and vertical coordinates in are lattice points of integers, the following equation is used. This equation performs bilinear interpolation based on residual values of four input sample points.

図３は、４つの入力標本点の残差値に基づいた双一次補間を行う数式（１）を説明する図である。図３において、○印は入力標本点を示し、×印は出力標本点を示す。前記数式（１）のとおり、図３に示す４つの○印の入力標本点ａ１〜ａ４における残差値Ｈを用いて双一次補間を行い、×印の出力標本点ｂ１における残差値Ｌが算出される。 FIG. 3 is a diagram illustrating Expression (1) for performing bilinear interpolation based on residual values of four input sample points. In FIG. 3, a mark indicates an input sample point, and a mark indicates an output sample point. As shown in the above equation (1), bilinear interpolation is performed using the residual values H at the four input sample points a1 to a4 indicated by the circles in FIG. Is calculated.

尚、再標本化手段１２は、再標本化の前処理として、間引きフィルタ（例えば低域通過型フィルタ）を適用しても構わない。この場合、再標本化手段１２は、差分手段１１から残差値列を入力し、間引きフィルタを用いて残差値列にフィルタ処理を施し、フィルタ処理後の標本点を再標本化する。 Note that the resampling unit 12 may apply a thinning filter (for example, a low-pass filter) as preprocessing for resampling. In this case, the resampling unit 12 receives the residual value sequence from the difference unit 11, performs a filtering process on the residual value sequence using a thinning filter, and resamples the sample points after the filtering process.

図１に戻って、逆再標本化手段１８は、逆直交変換手段１７と加算手段１９の間に設けられている。逆再標本化手段１８は、逆直交変換手段１７からブロックの復号残差値列を入力し、再標本化手段１２の逆の処理を行い、ブロック単位に、標本点の数、配置及び復号残差値を変更することで、標本点を逆再標本化する。そして、逆再標本化手段１８は、逆標本化後のブロックの復号残差値列を加算手段１９に出力する。 Returning to FIG. 1, the inverse resampling unit 18 is provided between the inverse orthogonal transform unit 17 and the adding unit 19. The inverse resampling unit 18 receives the decoding residual value sequence of the block from the inverse orthogonal transforming unit 17, performs the inverse processing of the resampling unit 12, and performs the processing of the number, arrangement, and By changing the difference value, the sampling points are inversely resampled. Then, the inverse resampling unit 18 outputs the decoded residual value sequence of the block after the inverse sampling to the adding unit 19.

尚、逆再標本化手段１８は、標本点の数のみを変更することで逆再標本化してもよいし、標本点の配置のみを変更することで逆再標本化してもよいし、標本点の復号残差値のみを変更することで逆再標本化してもよい。要するに、逆再標本化手段１８は、標本点の数、配置及び復号残差値のうちの１つ、２つまたは全てを変更することで、標本点を逆再標本化すればよい。 The reverse resampling means 18 may perform reverse resampling by changing only the number of sample points, may perform reverse resampling by changing only the arrangement of sample points, May be inversely resampled by changing only the decoded residual value of. In short, the inverse re-sampling means 18 may inversely re-sample the sample points by changing one, two or all of the number, arrangement, and decoding residual value of the sample points.

図４は、逆再標本化手段１８による逆再標本化処理の例を説明する図である。図４において、×印は、逆再標本化手段１８に入力されるブロックの復号残差値列についての標本点（逆再標本化前の標本点（入力標本点））を示し、○印は、逆再標本化手段１８が出力するブロックの復号残差値列についての標本点（逆再標本化後の標本点（出力標本点））を示す。入力標本点の復号残差値列を出力標本点の復号残差値列に変換するための逆再標本化の規則（逆再標本化規則）は、図２に示した再標本化規則の逆であり、図４に示す入力標本点及び出力標本点の数と配置が定義され、出力標本点の復号残差値を算出するための所定の数式を用いた変換規則として、予め設定されているものとする。 FIG. 4 is a diagram for explaining an example of the reverse resampling process by the reverse resampling unit 18. In FIG. 4, crosses indicate sample points (sample points before reverse re-sampling (input sample points)) of the decoded residual value sequence of the block input to the inverse re-sampling means 18, and circles indicate Shows sample points (sample points after reverse resampling (output sample points)) of the decoded residual value sequence of the block output by the inverse resampling means 18. The inverse resampling rule (inverse resampling rule) for converting the decoded residual value sequence at the input sample points into the decoded residual value sequence at the output sample points is the inverse of the resampling rule shown in FIG. The number and arrangement of the input sample points and the output sample points shown in FIG. 4 are defined, and are set in advance as a conversion rule using a predetermined mathematical expression for calculating the decoding residual value of the output sample points. Shall be.

逆再標本化手段１８は、後述する最適化手段２４から最適逆再標本化モードのパラメータを入力し、当該パラメータに基づいて、予め設定された複数の再標本化規則（例えば図４（ａ）〜（ｄ）の規則）の中から１つの逆再標本化規則を決定する。そして、逆再標本化手段１８は、決定した逆再標本化規則に従い、標本点を再標本化する。 The inverse resampling unit 18 receives the parameters of the optimal inverse resampling mode from the optimizing unit 24 described later, and based on the parameters, sets a plurality of preset resampling rules (for example, FIG. 4A). To (d)), one reverse resampling rule is determined. Then, the reverse resampling means 18 resamples the sampling points in accordance with the determined reverse resampling rule.

図４（ａ）は、図２（ａ）に対応し、標本化格子を変更しない例であり、逆再標本化を行わないものと等価である。逆再標本化手段１８は、図４（ａ）の予め設定された逆再標本化規則に従い、図４（ａ）に示す入力標本点を再標本化し、図４（ａ）に示す出力標本点の復号残差値列を求める。この場合、逆再標本化手段１８は、逆再標本化を行わないで、入力した復号残差値列をそのまま出力する。この逆再標本化規則は、図４（ａ）に示すように、入力標本点及び出力標本点の数及び配置を変更せず、かつその復号残差値も変更しない規則である。 FIG. 4A corresponds to FIG. 2A and is an example in which the sampling grid is not changed, and is equivalent to the case where inverse resampling is not performed. The inverse resampling unit 18 resamples the input sample points shown in FIG. 4A according to the preset inverse resampling rule shown in FIG. 4A, and outputs the output sample points shown in FIG. Is obtained. In this case, the inverse resampling unit 18 outputs the input decoded residual value sequence without performing inverse resampling. As shown in FIG. 4A, this inverse resampling rule is a rule that does not change the number and arrangement of input sample points and output sample points, and does not change the decoding residual value.

図４（ｂ）は、図２（ｂ）に対応し、標本数を増加し（水平方向に２倍、垂直方向に２倍に増加し）、逆再標本化後の配置を均一な間隔に復元する例である。逆再標本化手段１８は、図４（ｂ）の予め設定された逆再標本化規則に従い、図４（ｂ）に示す入力標本点を逆再標本化し、図４（ｂ）に示す出力標本点の復号残差値列を求める。この逆再標本化規則は、図４（ｂ）に示すように、入力標本点の標本数を水平方向に２倍、垂直方向に２倍に増加し、元の均一な間隔となるように、出力標本点の配置を設定すると共に、所定の数式にて出力標本点の復号残差値を算出する規則である。 FIG. 4B corresponds to FIG. 2B, in which the number of samples is increased (doubled in the horizontal direction and doubled in the vertical direction), and the arrangement after the reverse resampling is made uniform. This is an example of restoration. The inverse resampling means 18 inversely resamples the input sampling points shown in FIG. 4B according to the preset inverse resampling rule shown in FIG. 4B, and outputs the output samples shown in FIG. Find a decoded residual value sequence for the point. As shown in FIG. 4B, this inverse resampling rule increases the number of samples at the input sampling points by a factor of two in the horizontal direction and by a factor of two in the vertical direction so that the original uniform intervals are obtained. This is a rule for setting the arrangement of output sample points and calculating the decoding residual value of the output sample points using a predetermined mathematical formula.

図４（ｃ）は、図２（ｃ）に対応し、標本数を保ったまま、逆再標本化後の配置を均一な間隔に復元する（標本点の粗密を元に戻す）例である。逆再標本化手段１８は、図４（ｃ）の予め設定された逆再標本化規則に従い、図４（ｃ）に示す入力標本点を逆再標本化し、図４（ｃ）に示す出力標本点の復号残差値列を求める。この逆再標本化規則は、図４（ｃ）に示すように、入力標本点及び出力標本点の標本数を同一とし、元の均一な間隔となるように、出力標本点の配置を設定すると共に、所定の数式にて出力標本点の復号残差値を算出する規則である。 FIG. 4 (c) corresponds to FIG. 2 (c), and is an example of restoring the arrangement after inverse re-sampling to uniform intervals (restoring the density of sampling points) while maintaining the number of samples. . The inverse resampling means 18 inversely resamples the input sampling points shown in FIG. 4C according to the preset inverse resampling rule shown in FIG. 4C, and outputs the output samples shown in FIG. Find a decoded residual value sequence for the point. In this inverse resampling rule, as shown in FIG. 4C, the arrangement of output sample points is set so that the number of input sample points and the number of output sample points are the same and the original uniform intervals are maintained. A rule for calculating the decoded residual value of the output sample point using a predetermined mathematical formula.

図４（ｄ）は、図２（ｄ）に対応し、標本数を増加し（水平方向に４倍、垂直方向に４倍に増加し）、かつ逆再標本化後の位置を均一な間隔に復元する（標本点の粗密を元に戻す）例である。逆再標本化手段１８は、図４（ｄ）の予め設定された逆再標本化規則に従い、図４（ｄ）に示す入力標本点を逆再標本化し、図４（ｄ）に示す出力標本点の復号残差値列を求める。この逆再標本化規則は、図４（ｄ）に示すように、入力標本点の標本数を水平方向に４倍、垂直方向に４倍に増加し、元の均一な間隔となるように、出力標本点の配置を設定すると共に、所定の数式にて出力標本点の復号残差値を算出する規則である。 FIG. 4 (d) corresponds to FIG. 2 (d), in which the number of samples is increased (4 times in the horizontal direction and 4 times in the vertical direction), and the positions after inverse re-sampling are evenly spaced. This is an example of restoring to (the original density of the sample points is restored). The inverse resampling means 18 inversely resamples the input sampling points shown in FIG. 4D according to the preset inverse resampling rule shown in FIG. 4D, and outputs the output samples shown in FIG. Find a decoded residual value sequence for the point. As shown in FIG. 4D, this inverse resampling rule increases the number of input sample points by four times in the horizontal direction and four times in the vertical direction so that the original uniform intervals are obtained. This is a rule for setting the arrangement of output sample points and calculating the decoding residual value of the output sample points using a predetermined mathematical formula.

図４（ａ）〜（ｄ）に示すように、逆再標本化規則において、出力標本点の位置と入力標本点の位置とは、必ずしも重なる必要はない。また、逆再標本化規則において、出力標本点の復号残差値は、当該出力標本点の近傍に存在する１以上の入力標本点の復号残差値に基づいて算出されるように定義されている。 As shown in FIGS. 4A to 4D, in the reverse resampling rule, the position of the output sample point and the position of the input sample point do not necessarily have to overlap. In the inverse resampling rule, the decoding residual value of the output sample point is defined to be calculated based on the decoding residual value of one or more input sample points existing near the output sample point. I have.

例えば、出力標本点の位置（ｘ，ｙ）における復号残差値は、当該位置（ｘ，ｙ）からの距離（例えば、ユークリッド距離）が最も近い入力標本点の復号残差値であるとしてもよい。また、例えば、出力標本点の位置（ｘ，ｙ）における復号残差値は、当該位置（ｘ，ｙ）に最も近い４つの入力標本点の復号残差値に基づいて、双一次補間により算出されるようにしてもよい。 For example, the decoding residual value at the position (x, y) of the output sample point may be the decoding residual value of the input sample point having the closest distance (for example, the Euclidean distance) from the position (x, y). Good. Also, for example, the decoding residual value at the position (x, y) of the output sample point is calculated by bilinear interpolation based on the decoding residual values of the four input sample points closest to the position (x, y). May be performed.

尚、逆再標本化手段１８は、逆再標本化の前処理として、間引きフィルタ（例えば低域通過型フィルタ）を適用しても構わない。この場合、逆再標本化手段１８は、逆直交変換手段１７から復号残差値列を入力し、間引きフィルタを用いて復号残差値列にフィルタ処理を施し、フィルタ処理後の標本点を逆再標本化する。 Note that the inverse resampling unit 18 may apply a thinning filter (for example, a low-pass filter) as preprocessing for inverse resampling. In this case, the inverse re-sampling unit 18 receives the decoded residual value sequence from the inverse orthogonal transform unit 17, performs a filtering process on the decoded residual value sequence using a thinning filter, and reverses the sample points after the filtering process. Resample.

また、逆再標本化手段１８は、出力標本点の復号残差値を算出する際に、当該出力標本点の近傍に存在する１以上の入力標本点の復号残差値、または必要に応じて以前に処理を行った入力標本点の復号残差値に基づいて、再標本化手段１２が再標本化を行ったときに失った高域情報を、超解像技術により復元または合成しても構わない。 When calculating the decoding residual value of the output sample point, the inverse re-sampling unit 18 calculates the decoding residual value of one or more input sample points present in the vicinity of the output sample point or, if necessary, Based on the decoded residual values of the input sample points that have been processed previously, the high-frequency information lost when the re-sampling unit 12 performs re-sampling may be restored or synthesized by super-resolution technology. I do not care.

図１に戻って、最適化手段２４は、ブロック分割手段１０、直交変換手段１３（これに対応する逆直交変換手段１７）、量子化手段１４（これに対応する逆量子化手段１６）、エントロピー符号化手段１５、ループフィルタ２０及び予測手段２２に加え、再標本化手段１２（これに対応する逆再標本化手段１８）にて用いるパラメータを様々に変化させて試行する。例えば、最適化手段２４は、再標本化手段１２に対し、図２（ａ）〜（ｄ）に対応するパラメータを変化させて試行する。また、最適化手段２４は、再標本化手段１２に対応する逆再標本化手段１８に対し、図４（ａ）〜（ｄ）に対応するパラメータを変化させて試行する。 Returning to FIG. 1, the optimizing means 24 includes the block dividing means 10, the orthogonal transform means 13 (corresponding inverse orthogonal transform means 17), the quantizing means 14 (corresponding inverse quantizing means 16), the entropy. In addition to the encoding means 15, the loop filter 20, and the prediction means 22, the parameters used in the resampling means 12 (corresponding to the inverse resampling means 18) are variously changed and the experiment is performed. For example, the optimization unit 24 tries the resampling unit 12 by changing the parameters corresponding to FIGS. In addition, the optimizing unit 24 tries the reverse resampling unit 18 corresponding to the resampling unit 12 by changing the parameters corresponding to FIGS.

最適化手段２４は、図６に示した最適化手段２３と同様に、レート歪特性が最高または最高を近似するものとなるパラメータ（レート歪み特性を表す値が最高値から所定分低い値までの間に存在する場合のパラメータ）を特定する。特定されたパラメータは、最高または最高を近似するものとなる符号化効率を実現可能なデータである。 The optimizing unit 24 is, like the optimizing unit 23 shown in FIG. 6, a parameter for which the rate distortion characteristic is the highest or approximates the highest (the value representing the rate distortion characteristic from the highest value to a value lower than the predetermined value by a predetermined amount). Parameters if they exist between them). The specified parameter is data capable of achieving the highest or a coding efficiency that approximates the highest.

すなわち、最適化手段２４は、ブロック分割手段１０、再標本化手段１２等のパラメータを様々に変化させ、符号化効率を表す評価値（例えば、画質の良さの程度及び符号量の少なさの程度を表した値）を算出し、当該評価値が所定範囲に入るときのパラメータを特定する。特定されたパラメータは、例えば、所定の画質を実現する場合に符号量が最も少なくて済み、かつ所定の符号量を用いる場合に画質が最も良くなるパラメータである。 That is, the optimization unit 24 variously changes the parameters of the block division unit 10, the resampling unit 12, and the like, and evaluates the coding efficiency (for example, the degree of good image quality and the degree of small code amount). Is calculated, and a parameter when the evaluation value falls within a predetermined range is specified. The specified parameter is, for example, a parameter that requires the least amount of code when realizing a predetermined image quality and that provides the best image quality when a predetermined amount of code is used.

最適化手段２４は、特定したパラメータと最適モードのパラメータとし、最適モードのパラメータをブロック分割手段１０、再標本化手段１２等に出力し、当該パラメータに従って動作させる。これにより、ブロック分割手段１０、再標本化手段１２等が最適モードのパラメータに従って動作することで、全体としてバランスのとれた所望の符号化効率を実現することができる。 The optimizing unit 24 sets the specified parameters and the parameters of the optimal mode, outputs the parameters of the optimal mode to the block dividing unit 10, the resampling unit 12, and the like, and operates according to the parameters. Thereby, the block dividing unit 10, the resampling unit 12, and the like operate according to the parameters of the optimal mode, so that a desired encoding efficiency that is balanced as a whole can be realized.

例えば、最適化手段２４は、再標本化手段１２について、レート歪特性が最高または最高を近似するものとなる再標本化モードのパラメータを、最適再標本化モードのパラメータとして特定する。また、最適化手段２４は、逆再標本化手段１８について、レート歪特性が最高または最高を近似するものとなる逆再標本化モードのパラメータを、最適逆再標本化モードのパラメータとして特定する。 For example, the optimization unit 24 specifies, for the resampling unit 12, a parameter of the resampling mode in which the rate distortion characteristic is the highest or approximates the highest as a parameter of the optimal resampling mode. In addition, the optimization unit 24 specifies, for the inverse resampling unit 18, a parameter of the inverse resampling mode in which the rate distortion characteristic is the highest or approximates the maximum as a parameter of the optimal inverse resampling mode.

ブロック分割手段１０、再標本化手段１２等は、最適化手段２４から入力したパラメータに従った処理を行う。例えば、再標本化手段１２は、最適化手段２４から最適再標本化モードのパラメータを入力し、パラメータが示す再標本化規則（例えば図２（ａ）〜（ｄ）のうちのいずれかの再標本化規則）に従った処理を行う。逆再標本化手段１８は、最適化手段２４から最適逆再標本化モードのパラメータを入力し、パラメータが示す逆再標本化規則（例えば図４（ａ）〜（ｄ）のうちのいずれかの再標本化規則）に従った処理を行う。 The block dividing unit 10, the resampling unit 12, and the like perform processing according to the parameters input from the optimizing unit 24. For example, the resampling unit 12 inputs the parameters of the optimal resampling mode from the optimizing unit 24, and executes the resampling rule indicated by the parameter (for example, any one of the resampling rules shown in FIGS. 2A to 2D). (Sampling rules). The inverse resampling unit 18 receives the parameter of the optimal inverse resampling mode from the optimizing unit 24, and executes the inverse resampling rule indicated by the parameter (for example, any one of the rules shown in FIGS. 4A to 4D). (Re-sampling rule).

また、最適化手段２４は、最適モードのパラメータを最適モード情報としてエントロピー符号化手段１５に出力する。 The optimizing unit 24 outputs the parameters of the optimal mode to the entropy encoding unit 15 as optimal mode information.

これにより、最適モード情報は、エントロピー符号化手段１５によりエントロピー符号化され、符号化ビット列として後述する映像復号装置２へ出力される。 As a result, the optimal mode information is entropy-encoded by the entropy encoding unit 15 and output to the video decoding device 2 described later as an encoded bit sequence.

このように、映像符号化装置１の符号化処理により、量子化インデックス列及び最適モード情報等が符号化ビット列に変換され、後述する映像復号装置２へ出力される。 As described above, the encoding process of the video encoding device 1 converts the quantization index sequence and the optimal mode information into an encoded bit sequence and outputs the encoded bit sequence to the video decoding device 2 described later.

尚、最適化手段２４は、再標本化手段１２及び逆再標本化手段１８を最適化の対象に含めるようにしたが、含めないようにしても構わない。この場合、再標本化手段１２は、予め設定された再標本化モードのパラメータが示す再標本化規則に従い、再標本化を行う。また、逆再標本化手段１８は、予め設定された逆再標本化モードのパラメータが示す逆再標本化規則に従い、逆再標本化を行う。 The optimizing unit 24 includes the resampling unit 12 and the inverse resampling unit 18 in the optimization target, but may not include them. In this case, the resampling unit 12 performs resampling according to a resampling rule indicated by a parameter of a preset resampling mode. The reverse resampling unit 18 performs reverse resampling according to a reverse resampling rule indicated by a preset reverse resampling mode parameter.

以上のように、本発明の実施形態の映像符号化装置１によれば、最適化手段２４は、ブロック分割手段１０、再標本化手段１２等にて用いるパラメータを様々に変化させて試行し、レート歪特性が最高または最高を近似するものとなるパラメータを特定し、特定したパラメータ（最適モードのパラメータ）を、ブロック分割手段１０、再標本化手段１２等に出力し、当該パラメータに従って動作させる。最適化手段２４により特定された最適モードのパラメータは、エントロピー符号化手段１５にて量子化インデックス列と共にエントロピー符号化され、符号化ビット列として後述する映像復号装置２へ出力される。 As described above, according to the video encoding device 1 of the embodiment of the present invention, the optimizing unit 24 trials by changing various parameters used in the block dividing unit 10, the resampling unit 12, and the like, A parameter whose rate-distortion characteristic is the highest or approximates the highest is specified, and the specified parameter (parameter in the optimal mode) is output to the block dividing means 10, the resampling means 12, and the like, and the operation is performed according to the parameter. The parameters of the optimal mode specified by the optimizing unit 24 are entropy-encoded by the entropy encoding unit 15 together with the quantization index sequence, and output to the video decoding device 2 described later as an encoded bit sequence.

再標本化手段１２は、ブロックの画素値列から予測画像の画素値列を減算した結果であるブロックの残差値列を入力し、最適化手段２４から入力したパラメータ（最適再標本化モードのパラメータ）が示す再標本化規則に従い、入力標本点を再標本化して出力標本点の残差値列を求める。再標本化手段１２により、ブロック単位に、標本点の数、配置及び残差値が変更される。 The re-sampling unit 12 inputs the residual value sequence of the block, which is the result of subtracting the pixel value sequence of the predicted image from the pixel value sequence of the block, and inputs the parameters (optimal resampling mode According to the resampling rule indicated by the parameter (parameter), the input sample points are resampled to obtain a residual value sequence of the output sample points. The number, arrangement, and residual value of the sampling points are changed by the resampling unit 12 for each block.

これにより、最適化手段２４により特定された最適モードのパラメータは、最高の符号化効率を実現可能なデータであるから、この最適モードのパラメータが示す再標本化規則も、最高の符号化効率を実現可能な規則となる。つまり、再標本化手段１２は、最高の符号化効率を実現するように、ブロック分割手段１０により分割されたブロックに対する符号化処理の中で、ブロック単位に、標本点の数、配置及び残差値を変更することができる。 Thereby, the parameter of the optimal mode specified by the optimizing means 24 is data capable of realizing the highest encoding efficiency. Therefore, the resampling rule indicated by the parameter of the optimal mode also has the highest encoding efficiency. It is a feasible rule. In other words, the re-sampling unit 12 performs the encoding process on the blocks divided by the block dividing unit 10 so as to realize the highest encoding efficiency. You can change the value.

例えば、映像信号の残差値が所定の広い範囲で類似値となる場合に、最適化手段２４が、最適モードのパラメータとして、ブロック分割手段１０について所定の広い範囲でブロック分割を行うパラメータを特定し、再標本化手段１２について標本点の数を減少させる最適再標本化モードのパラメータを特定した場合を想定する。この場合、再標本化手段１２は、所定の広い範囲のブロックについて、標本点の数を減少させた残差値列を求めることになる。これにより、標本点の数が少なくなるから、後続の処理の演算負荷を低減することができ、符号化効率を改善することができる。 For example, when the residual value of the video signal is a similar value in a predetermined wide range, the optimizing unit 24 specifies a parameter for performing block division in a predetermined wide range for the block dividing unit 10 as an optimal mode parameter. Then, it is assumed that the parameter of the optimal resampling mode for reducing the number of sampling points for the resampling unit 12 is specified. In this case, the resampling unit 12 obtains a residual value sequence in which the number of sampling points is reduced for a predetermined wide range of blocks. As a result, the number of sample points is reduced, so that the calculation load of the subsequent processing can be reduced, and the coding efficiency can be improved.

また、例えば、最適化手段２４が、最適モードのパラメータとして、再標本化手段１２についてブロック内の所定の箇所につき標本点の数を増加させる最適再標本化モードのパラメータを特定した場合、再標本化手段１２は、ブロック内の所定の箇所につき標本点の数を増加させた残差値列を求めることになる。これにより、図２（ｃ）及び（ｄ）に示したように、ブロック内の必要な箇所に多くの標本点を配置したり、ブロック内の他の箇所に少ない標本点を配置したり、すなわち標本点の粗密を変化させることができるから、映像信号の近似精度を向上させることができ、符号化効率を改善することができる。 Further, for example, when the optimizing unit 24 specifies a parameter of the optimal resampling mode for increasing the number of sampling points for a predetermined position in the block for the resampling unit 12 as the parameter of the optimal mode, The converting means 12 obtains a residual value sequence in which the number of sample points is increased for a predetermined location in the block. As a result, as shown in FIGS. 2C and 2D, a large number of sample points are arranged at necessary positions in the block, or a small number of sample points are arranged at other positions in the block. Since the density of the sample points can be changed, the approximation accuracy of the video signal can be improved, and the coding efficiency can be improved.

本発明の実施形態は、符号化処理の内部に再標本化手段１２の処理を挿入し、再標本化手段１２に対し、ブロック分割手段１０によるブロック分割処理、直交変換手段１３による直交変換処理及び量子化手段１４による量子化処理と連動して、ブロック単位に、標本点の数、配置及び残差値を変更させるものである。 According to the embodiment of the present invention, the processing of the resampling unit 12 is inserted into the encoding process, and the resampling unit 12 performs block division processing by the block division unit 10, orthogonal transformation processing by the orthogonal transformation unit 13, and In conjunction with the quantization processing by the quantization means 14, the number, arrangement, and residual value of sample points are changed in block units.

本発明の実施形態では、既存の符号化処理の外側に前処理及び後処理を付加する従来技術とは異なり、ブロック単位に再標本化を行うことができる。つまり、本発明の実施形態における再標本化処理は、ブロック分割後の他の処理（直交変換処理、量子化処理等）と連動して、同じフレームの他のブロック及び他のフレームのブロックとの間で符号化効率を考慮したバランスを図ることができ、最適な符号化処理を実現することができる。結果として、映像信号の符号化効率を改善し、映像信号を高画質に圧縮符号化及び復号することが可能となる。 In the embodiment of the present invention, unlike the related art in which the pre-processing and the post-processing are added outside the existing coding processing, the re-sampling can be performed in block units. In other words, the resampling process in the embodiment of the present invention is performed in conjunction with other processes (orthogonal transform process, quantization process, etc.) after the block division and other blocks of the same frame and blocks of other frames. A balance can be achieved in consideration of coding efficiency, and an optimum coding process can be realized. As a result, the encoding efficiency of the video signal can be improved, and the video signal can be compression-encoded and decoded with high image quality.

〔映像復号装置〕
次に、本発明の実施形態による映像復号装置について説明する。図５は、本発明の実施形態による映像復号装置の構成例を示すブロック図である。この映像復号装置２は、エントロピー復号手段３９、逆量子化手段３２、逆直交変換手段３３、逆再標本化手段３４、加算手段３５、ループフィルタ３６、メモリ３７及び予測手段３８を備えている。 [Video decoding device]
Next, a video decoding device according to an embodiment of the present invention will be described. FIG. 5 is a block diagram illustrating a configuration example of the video decoding device according to the embodiment of the present invention. The video decoding device 2 includes an entropy decoding unit 39, an inverse quantization unit 32, an inverse orthogonal transformation unit 33, an inverse resampling unit 34, an addition unit 35, a loop filter 36, a memory 37, and a prediction unit 38.

図７に示した従来の映像復号装置２００と、図５に示す本発明の実施形態による映像復号装置２とを比較すると、両映像復号装置２，２００は、逆量子化手段３２、逆直交変換手段３３、加算手段３５、ループフィルタ３６、メモリ３７及び予測手段３８を備えている点で同一である。 Comparing the conventional video decoding device 200 shown in FIG. 7 with the video decoding device 2 according to the embodiment of the present invention shown in FIG. 5, both video decoding devices 2 and 200 have the inverse quantization means 32 and the inverse orthogonal transform. They are the same in that they have a means 33, an adding means 35, a loop filter 36, a memory 37 and a predicting means 38.

一方、映像復号装置２は、映像復号装置２００の構成に加え、さらに逆再標本化手段３４を備え、映像復号装置２００に備えたエントロピー復号手段３１とは異なるエントロピー復号手段３９を備えている点で、映像復号装置２００と相違する。図５において、図７と共通する部分には図７と同一の符号を付し、その詳しい説明は省略する。 On the other hand, the video decoding device 2 further includes an inverse resampling unit 34 in addition to the configuration of the video decoding device 200, and an entropy decoding unit 39 different from the entropy decoding unit 31 provided in the video decoding device 200. This is different from the video decoding device 200. In FIG. 5, portions common to FIG. 7 are denoted by the same reference numerals as in FIG. 7, and detailed description thereof will be omitted.

映像復号装置２は、映像符号化装置１から量子化インデックス列及び最適モード情報等の符号化ビット列を入力し、最適モード情報のパラメータに従い、逆再標本化を含む復号処理を行い、量子化インデックス列から元の映像信号のフレームを復元する。そして、映像復号装置２は、復号映像信号を出力する。 The video decoding device 2 receives a quantization index sequence and an encoded bit sequence such as optimal mode information from the video encoding device 1, performs decoding processing including inverse re-sampling according to parameters of the optimal mode information, and Restore the original video signal frame from the column. Then, the video decoding device 2 outputs a decoded video signal.

エントロピー復号手段３９は、映像符号化装置１から符号化ビット列を入力し、図１に示したエントロピー符号化手段１５の逆の処理を行い、符号化ビット列をエントロピー復号することで、元の情報に可逆的に復号する。元の情報には、量子化インデックス列に加え、図１に示した予測手段２２にて求めた動きベクトル情報、最適化手段２３にて求めた最適モード情報、映像の大きさ等の情報、その他、使用者等により付加された情報等も含まれる。最適モード情報には、最適逆再標本化モードのパラメータが含まれる。 The entropy decoding unit 39 receives the encoded bit sequence from the video encoding device 1, performs a process reverse to that of the entropy encoding unit 15 shown in FIG. 1, and performs entropy decoding on the encoded bit sequence to obtain the original information. Reversibly decode. The original information includes, in addition to the quantization index sequence, the motion vector information obtained by the prediction unit 22 shown in FIG. 1, the optimal mode information obtained by the optimization unit 23, the information such as the size of the video, and the like. , And information added by the user or the like. The optimal mode information includes parameters of the optimal inverse resampling mode.

エントロピー復号手段３９は、エントロピー復号した最適モード情報である最適モードのパラメータを、逆量子化手段３２、逆直交変換手段３３、逆再標本化手段３４、ループフィルタ３６及び予測手段３８に出力し、当該パラメータに従って動作させる。この場合、エントロピー復号手段３９は、最適逆再標本化モードのパラメータを逆再標本化手段３４に出力する。 The entropy decoding unit 39 outputs the parameters of the optimal mode, which are the entropy-decoded optimal mode information, to the inverse quantization unit 32, the inverse orthogonal transform unit 33, the inverse resampling unit 34, the loop filter 36, and the prediction unit 38, Operate according to the parameters. In this case, the entropy decoding unit 39 outputs the parameters of the optimal inverse resampling mode to the inverse resampling unit 34.

これにより、逆量子化手段３２、逆直交変換手段３３、逆再標本化手段３４、ループフィルタ３６及び予測手段３８は、エントロピー復号手段３９から入力した最適モードのパラメータに従った処理を行う。 Accordingly, the inverse quantization means 32, the inverse orthogonal transform means 33, the inverse resampling means 34, the loop filter 36, and the prediction means 38 perform processing according to the parameters of the optimal mode input from the entropy decoding means 39.

また、エントロピー復号手段３９は、エントロピー復号したブロックの量子化インデックス列を逆量子化手段３２に出力する。 Further, the entropy decoding unit 39 outputs the quantization index sequence of the block subjected to the entropy decoding to the inverse quantization unit 32.

逆再標本化手段３４は、逆直交変換手段３３と加算手段３５の間に設けられる。逆再標本化手段３４は、逆直交変換手段３３からブロックの復号残差値列を入力し、図１に示した逆再標本化手段１８と同様の処理を行い、所定の逆再標本化規則に従って、ブロック単位に、標本点の数、配置及び復号残差値を変更することで、標本点を逆再標本化する。そして、逆再標本化手段３４は、逆標本化後のブロックの復号残差値列を加算手段３５に出力する。 The inverse resampling means 34 is provided between the inverse orthogonal transform means 33 and the adding means 35. The inverse re-sampling unit 34 receives the decoded residual value sequence of the block from the inverse orthogonal transform unit 33 and performs the same processing as the inverse re-sampling unit 18 shown in FIG. , The sample points are inversely resampled by changing the number, arrangement, and decoding residual value of the sample points in block units. Then, the inverse resampling unit 34 outputs the decoded residual value sequence of the block after the inverse sampling to the adding unit 35.

尚、逆再標本化手段３４は、標本点の数のみを変更することで逆再標本化してもよいし、標本点の配置のみを変更することで逆再標本化してもよいし、標本点の復号残差値のみを変更することで逆再標本化してもよい。要するに、逆再標本化手段３４は、標本点の数、配置及び復号残差値のうちの１つ、２つまたは全てを変更することで、標本点を逆再標本化すればよい。 The reverse resampling means 34 may perform reverse resampling by changing only the number of sample points, may perform reverse resampling by changing only the arrangement of sample points, May be inversely resampled by changing only the decoded residual value of. In short, the inverse re-sampling unit 34 may inversely re-sample the sample points by changing one, two, or all of the number, arrangement, and decoding residual value of the sample points.

このように、映像復号装置２の復号処理により、符号化ビット列が量子化インデックス列及び最適モード情報等に変換され、元の映像信号のフレームが復元される。 As described above, by the decoding process of the video decoding device 2, the coded bit sequence is converted into the quantization index sequence and the optimal mode information and the like, and the frame of the original video signal is restored.

尚、映像復号装置２が映像符号化装置１から最適逆再標本化モードのパラメータを含む符号化ビット列を入力する場合について説明したが、最適逆再標本化モードのパラメータを含む符号化ビット列を入力しない場合もあり得る。この場合、逆再標本化手段３４は、予め設定された逆再標本化モードのパラメータが示す逆再標本化規則に従い、逆再標本化を行う。 Note that the case has been described where the video decoding device 2 inputs a coded bit sequence including the parameters of the optimal inverse resampling mode from the video encoding device 1, but the coded bit sequence including the parameters of the optimal inverse resampling mode is input. It may not be. In this case, the inverse resampling unit 34 performs inverse resampling according to the inverse resampling rule indicated by the parameter of the preset inverse resampling mode.

以上のように、本発明の実施形態の映像復号装置２によれば、エントロピー復号手段３９は、映像符号化装置１から入力した符号化ビット列をエントロピー復号し、量子化インデックス列及び最適モードのパラメータを生成し、当該パラメータを、逆量子化手段３２、逆再標本化手段３４等に出力し、当該パラメータに従って動作させる。 As described above, according to the video decoding device 2 of the embodiment of the present invention, the entropy decoding unit 39 performs entropy decoding on the coded bit sequence input from the video coding device 1, and performs the quantization index sequence and the parameter of the optimal mode. Is generated, and the parameters are output to the inverse quantization means 32, the inverse resampling means 34, and the like, and are operated according to the parameters.

逆再標本化手段３４は、ブロックの画素値列から予測画像の画素値列が減算された結果であるブロックの復号残差値列を入力し、エントロピー復号手段３９から入力したパラメータ（最適逆再標本化モードのパラメータ）が示す逆再標本化規則に従い、入力標本点を逆再標本化して出力標本点の復号残差値列を求める。逆再標本化手段３４により、ブロック単位に、標本点の数、配置及び復号残差値が変更され、元に戻る。そして、元の映像信号が復号される。 The inverse re-sampling unit 34 inputs the decoded residual value sequence of the block obtained by subtracting the pixel value sequence of the prediction image from the pixel value sequence of the block, and inputs the parameter (optimum inverse re-sampling) input from the entropy decoding unit 39. According to the inverse resampling rule shown by the sampling mode parameter), the input sample points are inversely resampled to obtain a decoded residual value sequence of the output sample points. The inverse resampling unit changes the number, arrangement, and decoding residual value of the sample points in block units, and returns to the original. Then, the original video signal is decoded.

これにより、映像符号化装置１から入力してエントロピー復号手段３９により復号された最適モードのパラメータは、最高の符号化効率を実現可能なデータであるから、この最適モードのパラメータが示す逆再標本化規則も、最高の符号化効率を実現可能な規則となる。つまり、逆再標本化手段３４は、最高の符号化効率を実現するように、映像復号装置２における復号処理の中で、ブロック単位に、標本点の数、配置及び復号残差値を、元に戻すように変更することができる。 Thereby, the parameters of the optimal mode input from the video encoding device 1 and decoded by the entropy decoding means 39 are data capable of realizing the highest encoding efficiency. The coding rule is also a rule that can achieve the highest coding efficiency. That is, the inverse re-sampling unit 34 calculates the number, arrangement, and decoding residual value of the sample points in units of blocks in the decoding process in the video decoding device 2 so as to realize the highest coding efficiency. Can be changed to return to

例えば、映像信号の残差値が所定の広い範囲で類似値となる場合に、映像符号化装置１の最適化手段２４が、最適モードのパラメータとして、ブロック分割手段１０について所定の広い範囲でブロック分割を行うパラメータを特定し、再標本化手段１２について標本点の数を減少させる最適再標本化モードのパラメータを特定した場合を想定する。この場合、映像復号装置２のエントロピー復号手段３９は、最適モードのパラメータとして、所定の広い範囲でブロック分割が行われたブロックに対するパラメータを復号し、逆再標本化手段３４について標本点の数を増加させる最適逆再標本化モードのパラメータを復号する。そして、逆再標本化手段３４は、所定の広い範囲のブロックについて、映像符号化装置１の再標本化手段１２及び逆再標本化手段１８に対応して、標本点の数を増加させた復号残差値列を求めることになる。これにより、標本点の数を元に戻すことができ、符号化効率を改善することができる。 For example, when the residual value of the video signal has a similar value in a predetermined wide range, the optimizing unit 24 of the video encoding device 1 uses the block dividing unit 10 as a parameter of the optimal mode to block the block in the predetermined wide range. It is assumed that a parameter to be divided is specified, and a parameter of the optimal resampling mode for the resampling unit 12 that reduces the number of sampling points is specified. In this case, the entropy decoding unit 39 of the video decoding device 2 decodes, as the parameter of the optimal mode, the parameter for the block that has been divided into blocks in a predetermined wide range, and determines the number of sampling points for the inverse re-sampling unit 34. Decode the increasing optimal inverse resampling mode parameters. Then, the inverse re-sampling unit 34 decodes the blocks having a larger number of sample points corresponding to the re-sampling unit 12 and the inverse re-sampling unit 18 of the video encoding device 1 for a predetermined wide range of blocks. A sequence of residual values will be obtained. As a result, the number of sample points can be restored, and coding efficiency can be improved.

また、例えば、映像符号化装置１の最適化手段２４が、最適モードのパラメータとして、再標本化手段１２についてブロック内の所定の箇所につき標本点の数を増加させる最適再標本化モードのパラメータを特定した場合を想定する。この場合、映像復号装置２のエントロピー復号手段３９は、逆再標本化手段３４についてブロック内の所定の箇所につき標本点の数を減少させる最適逆再標本化モードのパラメータを復号する。そして、逆再標本化手段３４は、映像符号化装置１の再標本化手段１２及び逆再標本化手段１８に対応して、図４（ｃ）及び（ｄ）に示したように、ブロック内の所定の箇所につき標本点の数を減少させた復号残差値列を求めることになる。これにより、標本点の数を元に戻すことができ、符号化効率を改善することができる。 Further, for example, the optimizing unit 24 of the video encoding device 1 sets the parameter of the optimal resampling mode for increasing the number of sampling points for a predetermined location in the block for the resampling unit 12 as the parameter of the optimal mode. Suppose that it is specified. In this case, the entropy decoding unit 39 of the video decoding device 2 decodes the parameter of the optimal re-resampling mode for the inverse re-sampling unit 34 to reduce the number of sampling points for a predetermined position in the block. Then, the inverse resampling unit 34 corresponds to the resampling unit 12 and the inverse resampling unit 18 of the video encoding device 1 and, as shown in FIGS. For each predetermined location, a sequence of decoded residual values in which the number of sample points is reduced is obtained. As a result, the number of sample points can be restored, and the coding efficiency can be improved.

本発明の実施形態は、復号処理の内部に逆再標本化手段３４の処理を挿入し、逆再標本化手段３４に対し、逆量子化手段３２による逆量子化処理及び逆直交変換手段３３による逆直交変換処理と連動して、ブロック単位に、標本点の数、配置及び復号残差値を元に戻すように変更させるものである。 In the embodiment of the present invention, the processing of the inverse re-sampling unit 34 is inserted into the decoding process, and the inverse re-sampling unit 34 performs the inverse quantization process by the inverse quantization unit 32 and the inverse orthogonal transform unit 33. In conjunction with the inverse orthogonal transform processing, the number, arrangement, and decoding residual value of the sample points are changed back to the original values in block units.

本発明の実施形態では、従来技術とは異なり、ブロック単位に逆再標本化を行うことができる。つまり、本発明の実施形態における逆再標本化処理は、他の処理（逆量子化処理、逆直交変換処理等）と連動して、同じフレームの他のブロック及び他のフレームのブロックとの間で符号化効率を考慮したバランスを図ることができ、最適な符号化処理に対応した復号処理を実現することができる。結果として、映像信号の符号化効率を改善し、映像信号を高画質に圧縮符号化及び復号することが可能となる。 In the embodiment of the present invention, unlike the related art, inverse resampling can be performed in block units. That is, the inverse resampling process in the embodiment of the present invention interlocks with other processes (inverse quantization process, inverse orthogonal transform process, and the like) to interoperate with other blocks of the same frame and blocks of other frames. Thus, a balance can be achieved in consideration of the encoding efficiency, and a decoding process corresponding to an optimal encoding process can be realized. As a result, the encoding efficiency of the video signal can be improved, and the video signal can be compression-encoded and decoded with high image quality.

以上、実施形態を挙げて本発明を説明したが、本発明は前記実施形態に限定されるものではなく、その技術思想を逸脱しない範囲で種々変形可能である。例えば、前記実施形態において、図１に示した映像符号化装置１はループフィルタ２０を備えているが、必ずしもループフィルタ２０を備える必要はない。映像符号化装置１がループフィルタ２０を備えていない場合、メモリ２１には、加算手段１９により求められた復号画素値列が格納される。ここで、加算手段１９は、復号画素値列を、ブロックの位置を考慮しながら、メモリ２１に格納する。これにより、メモリ２１には、復号画像が順次形成される。 As described above, the present invention has been described with reference to the embodiment. However, the present invention is not limited to the above embodiment, and can be variously modified without departing from the technical idea thereof. For example, in the above embodiment, the video encoding device 1 illustrated in FIG. 1 includes the loop filter 20, but does not necessarily need to include the loop filter 20. When the video encoding device 1 does not include the loop filter 20, the decoded pixel value sequence obtained by the adding unit 19 is stored in the memory 21. Here, the adding unit 19 stores the decoded pixel value sequence in the memory 21 while considering the position of the block. Thus, decoded images are sequentially formed in the memory 21.

また、図５に示した映像復号装置２はループフィルタ３６を備えているが、必ずしもループフィルタ３６を備える必要はない。映像復号装置２がループフィルタ３６を備えていない場合、メモリ３７には、加算手段３５により求められた復号画素値列が、対応するブロックの位置に順次格納される。これにより、メモリ３７には、復号画像が順次形成される。 Although the video decoding device 2 shown in FIG. 5 includes the loop filter 36, the video decoding device 2 does not necessarily need to include the loop filter 36. When the video decoding device 2 does not include the loop filter 36, the decoded pixel value sequence obtained by the adding unit 35 is sequentially stored in the memory 37 at the position of the corresponding block. Thus, decoded images are sequentially formed in the memory 37.

また、前記実施形態では、図１に示した映像符号化装置１の再標本化手段１２、直交変換手段１３、量子化手段１４、エントロピー符号化手段１５、逆量子化手段１６、逆直交変換手段１７及び逆再標本化手段１８は、ブロックの各標本点について、映像信号の画素値から予測画像の画素値を減算した結果である残差値（復号残差値）を対象にして処理を行うようにした。これに対し、再標本化手段１２、直交変換手段１３、量子化手段１４、エントロピー符号化手段１５、逆量子化手段１６、逆直交変換手段１７及び逆再標本化手段１８は、標本点の残差値（復号残差値）の代わりに、標本点の画素値（復号画素値）を対象にして処理を行うようにしてもよい。図５に示した映像復号装置２の逆量子化手段３２、逆直交変換手段３３、逆再標本化手段３４等についても同様である。 In the above embodiment, the resampling unit 12, the orthogonal transform unit 13, the quantization unit 14, the entropy encoding unit 15, the inverse quantization unit 16, and the inverse orthogonal transform unit of the video encoding device 1 shown in FIG. 17 and the inverse resampling unit 18 perform processing on each sample point of the block with respect to a residual value (decoding residual value) obtained by subtracting the pixel value of the predicted image from the pixel value of the video signal. I did it. On the other hand, the resampling unit 12, the orthogonal transformation unit 13, the quantization unit 14, the entropy encoding unit 15, the inverse quantization unit 16, the inverse orthogonal transformation unit 17, and the inverse resampling unit 18 perform The processing may be performed on the pixel value (decoded pixel value) of the sample point instead of the difference value (decoded residual value). The same applies to the inverse quantization means 32, the inverse orthogonal transform means 33, the inverse resampling means 34, etc. of the video decoding device 2 shown in FIG.

また、前記実施形態では、図１に示した映像符号化装置１の最適化手段２４は、ブロック分割手段１０、再標本化手段１２（これに対応する逆再標本化手段１８）、直交変換手段１３（これに対応する逆直交変換手段１７）、量子化手段１４（これに対応する逆量子化手段１６）、エントロピー符号化手段１５、ループフィルタ２０及び予測手段２２にて用いるパラメータを様々に変化させて試行し、最適なパラメータを特定するようにした。これに対し、最適化手段２４は、これらの構成部のうちの１以上の構成部にて用いるパラメータを様々に変化させて試行するようにしてもよい。試行の対象でない構成部は、予め設定されたパラメータに従い、処理を行う。 In the embodiment, the optimizing unit 24 of the video encoding device 1 shown in FIG. 1 includes the block dividing unit 10, the resampling unit 12 (the corresponding inverse resampling unit 18), the orthogonal transform unit. 13 (corresponding inverse orthogonal transform means 17), quantizing means 14 (corresponding inverse quantizing means 16), entropy coding means 15, loop filter 20 and prediction means 22 vary various parameters. And tried to identify the optimal parameters. On the other hand, the optimizing unit 24 may change the parameters used in one or more of these components to perform the experiment. The component that is not the target of the trial performs processing according to the preset parameters.

尚、本発明の実施形態による映像符号化装置１及び映像復号装置２のハードウェア構成としては、通常のコンピュータを使用することができる。映像符号化装置１及び映像復号装置２は、ＣＰＵ、ＲＡＭ等の揮発性の記憶媒体、ＲＯＭ等の不揮発性の記憶媒体、及びインターフェース等を備えたコンピュータによって構成される。 As a hardware configuration of the video encoding device 1 and the video decoding device 2 according to the embodiment of the present invention, an ordinary computer can be used. The video encoding device 1 and the video decoding device 2 are configured by a computer including a CPU, a volatile storage medium such as a RAM, a nonvolatile storage medium such as a ROM, an interface, and the like.

映像符号化装置１に備えたブロック分割手段１０、差分手段１１、再標本化手段１２、直交変換手段１３、量子化手段１４、エントロピー符号化手段１５、逆量子化手段１６、逆直交変換手段１７、逆再標本化手段１８、加算手段１９、ループフィルタ２０、メモリ２１、予測手段２２及び最適化手段２３の各機能は、これらの機能を記述したプログラムをＣＰＵに実行させることによりそれぞれ実現される。 Block dividing means 10, difference means 11, resampling means 12, orthogonal transform means 13, quantization means 14, entropy encoding means 15, inverse quantization means 16, inverse orthogonal transform means 17 provided in video encoding apparatus 1. , The inverse resampling unit 18, the adding unit 19, the loop filter 20, the memory 21, the predicting unit 22, and the optimizing unit 23 are realized by causing a CPU to execute a program describing these functions. .

また、映像復号装置２に備えたエントロピー復号手段３９、逆量子化手段３２、逆直交変換手段３３、逆再標本化手段３４、加算手段３５、ループフィルタ３６、メモリ３７及び予測手段３８の各機能は、これらの機能を記述したプログラムをＣＰＵに実行させることによりそれぞれ実現される。 Also, each function of the entropy decoding means 39, the inverse quantization means 32, the inverse orthogonal transform means 33, the inverse resampling means 34, the addition means 35, the loop filter 36, the memory 37 and the prediction means 38 provided in the video decoding device 2 Is realized by causing a CPU to execute a program describing these functions.

これらのプログラムは、前記記憶媒体に格納されており、ＣＰＵに読み出されて実行される。また、これらのプログラムは、磁気ディスク（フロッピー（登録商標）ディスク、ハードディスク等）、光ディスク（ＣＤ−ＲＯＭ、ＤＶＤ等）、半導体メモリ等の記憶媒体に格納して頒布することもでき、ネットワークを介して送受信することもできる。 These programs are stored in the storage medium, and are read and executed by the CPU. These programs can also be stored and distributed on storage media such as magnetic disks (floppy (registered trademark) disks, hard disks, etc.), optical disks (CD-ROMs, DVDs, etc.), semiconductor memories, etc., and can be distributed via a network. You can also send and receive.

１，１００映像符号化装置
２，２００映像復号装置
１０ブロック分割手段
１１差分手段
１２再標本化手段
１３直交変換手段
１４量子化手段
１５エントロピー符号化手段
１６，３２逆量子化手段
１７，３３逆直交変換手段
１８，３４逆再標本化手段
１９，３５加算手段
２０，３６ループフィルタ
２１，３７メモリ
２２，３８予測手段
２３，２４最適化手段
３１，３９エントロピー復号手段 1,100 video encoding device 2,200 video decoding device 10 block dividing means 11 difference means 12 resampling means 13 orthogonal transform means 14 quantization means 15 entropy encoding means 16, 32 inverse quantization means 17, 33 inverse orthogonal Conversion means 18, 34 Inverse resampling means 19, 35 Addition means 20, 36 Loop filter 21, 37 Memory 22, 38 Prediction means 23, 24 Optimization means 31, 39 Entropy decoding means

Claims

A frame of a video signal is divided into blocks, and a pixel value of each sample point of the block or a residual value that is a difference between the pixel value and a pixel value of a predicted image is subjected to orthogonal transformation and quantization. , A video encoding device that outputs an encoded bit sequence entropy encoded,
Re-sampling means for re-sampling the sample points by changing at least one of the number, arrangement, and the pixel value or the residual value of the sample points in units of the block,
Changing the parameters used in the resampling unit, and a plurality of parameters including the parameters used in the process of dividing the block, the process of the orthogonal transform, and the process of the quantization, and representing the coding efficiency Optimizing means for determining a value, respectively, the plurality of parameters when the evaluation value is a value in a predetermined range, as an optimal plurality of parameters,
The resampling means resampled, the orthogonal index and the quantized index sequence subjected to the quantization, and the plurality of parameters specified by the optimizing means are entropy-encoded, and the encoded bit sequence And entropy encoding means for generating
The resampling means,
A video encoding device, wherein the sampling points are resampled in accordance with a resampling rule corresponding to the parameter specified by the optimizing means.

The video encoding device according to claim 1 ,
When the sample points input by the resampling unit are input sample points, and the sample points resampled by the resampling unit are output sample points,
The resampling rule sets the arrangement of the output sample points so as to reduce the number of the input sample points or to change the density of the input sample points in units of the block, A rule for calculating the pixel value or the residual value of the output sample point based on the pixel value or the residual value of a predetermined number of the input sample points existing in a predetermined range based on the sample point. , A video encoding device.

A video decoding apparatus for receiving a coded bit string output by the video coding apparatus according to claim 1 , entropy decoding the coded bit string, performing inverse quantization and inverse orthogonal transform, and restoring a frame of an original video signal. ,
Entropy decoding means for entropy decoding the coded bit sequence and generating a quantization index sequence and a plurality of parameters;
In a process reverse to the process in which the video encoding device resamples the sample points in units of blocks obtained by dividing the frame of the video signal, a reverse recoding corresponding to the parameter generated by the entropy decoding unit is performed. According to the sampling rule, the number of the sample points, the arrangement, and the quantization index sequence by changing at least one of the inverse orthogonal transform and the inverse orthogonal transformed decoded pixel value or decoding residual value, Reverse resampling means for reverse resampling the sample points,
A video decoding apparatus, wherein a frame of the original video signal is restored based on the decoded pixel value or the decoded residual value inversely resampled by the inverse resampling unit.

The video decoding device according to claim 3 ,
When the sample points input by the inverse re-sampling means are input sample points, and the sample points inversely re-sampled by the inverse re-sampling means are output sample points,
The inverse resampling rule sets the arrangement of the output sample points so as to increase the number of the sample points or to change the density of the input sample points in units of the block, and Calculating the decoded pixel value or the decoded residual value of the output sample point based on the decoded pixel value or the decoded residual value of a predetermined number of the input sample points existing in a predetermined range based on the sample point. A video decoding device, characterized in that: