JP4164903B2

JP4164903B2 - Video code string conversion apparatus and method

Info

Publication number: JP4164903B2
Application number: JP22016798A
Authority: JP
Inventors: 賢二杉山
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 1998-08-04
Filing date: 1998-08-04
Publication date: 2008-10-15
Anticipated expiration: 2018-08-04
Also published as: JP2000059789A

Description

【０００１】
【発明の属する技術分野】
画像を効率的に伝送、蓄積、表示するために、画像情報をより少ない符号量でディジタル信号にする高能率符号化において、符号列の変換に関する。特に符号化方式がブロック単位で動き補償を行う画像間予測符号化であり、異なった画素数の画像に変換する場合に関する。
【０００２】
【従来の技術】
＜動画像符号列の変換＞
ＭＰＥＧ等に代表される動画像高能率符号化で符号化された符号列を、異なったデータレートに変換、または可変転送レートを固定転送レートに変換する必要がある。この場合、画像を完全に復号化して異なったレートで再符号化するのが原則となるが、基本的な符号化処理が同じならば情報の一部はそのまま使うことが出来る。
具体的には、動きベクトル（ＭＶ）情報はそのまま再符号化で用いられ、多くの演算を必要とする動きベクトル検出を省略出来る。また、動き補償画像間予測処理が変わらないので、再符号化による劣化は量子化の違いのみとなり、最小限で済む。この様な処理は１９９３年画像符号化シンポジウム予稿集１−６「画像の再符号化における符号化制御の検討」やH.Sun,W.Kwok and J.Zdepski,“Architectures for MPEG Compressed Bitstream Scaling"(IEEE Transaction on Circuit and System for Video Technology, Vol.6, No.2, April 1996)に記載されている。
【０００３】
＜従来例動画像符号列の変換装置＞
図5は従来の動画像符号列の変換装置の一構成例を示したものである。
符号列入力端子１より入来する動き補償画像間予測符号化された符号列は、予測残差の符号列とＭＶの符号列が可変長復号化器２で固定長の符号に戻される。固定長符号として得られたＤＣＴ(離散コサイン変換)係数は逆量子化器３で係数値となり、逆ＤＣＴ４に与えられる。逆ＤＣＴ４は８×８個の係数を再生予測残差信号に変換し、加算器５に与える。加算器５では再生予測残差信号に後述の予測信号が加算され、再生画像となる。一方、可変長復号化器２から出力されるＭＶ情報は、動き補償予測器７と１４に与えられる。
【０００４】
この様にして得られた再生画像信号は、画像メモリ８と予測減算器９に与えられる。動き補償予測器７は、画像メモリ８に蓄積されている画像をＭＶに基づいて動き補償し、予測信号を形成する。得られた予測信号は加算器５に与えられる。つぎに再符号化系について説明する。加算器５から得られる再生画像信号は、減算器９において動き補償予測器１４から与えられる予測信号が減算され、予測残差となってＤＣＴ１０に与えられる。
【０００５】
ＤＣＴ１０はＤＣＴの変換処理を行い、得られた係数を量子化器１１に与える。量子化器１１は所定のステップ幅で係数を量子化し、固定長の符号となった係数を可変長符号化器１２と逆量子化器１８に与える。量子化ステップ幅は、転送レート変更に対応して逆量子化器３の量子化ステップ幅と異なったものとなる。可変長符号化器１２は、固定長の予測残差を可変長符号で圧縮し、さらにＭＶを可変長符号化し、出来た符号列を符号列出力端子１３より出力する。
【０００６】
一方、逆量子化器１８及び逆ＤＣＴ１７ではＤＣＴ１０及び量子化器１１の逆処理が行われ、画像間予測残差を再生する。得られた再生画像間予測残差は加算器１６で画像間予測信号が加算され再生画像となり、画像メモリ１５に与えられる。画像メモリ１５に蓄えられている再生画像は、動き補償予測器１４に与えられる。動き補償予測器１４は、可変長復号化器２から与えられるＭＶに従って画像間予測信号を作り、減算器９と加算器１６に与える。
ここで、動き補償画像間予測処理は復号化部と符号化部で同一であるため、加算器５と減算器９を相殺して、画像内処理のみ行えば良いようにも見える。さらにＤＣＴ４は逆ＤＣＴ１１に対する可逆変換処理なので相殺して、再量子化のみ行えばよいようにも見える。
【０００７】
しかし、復号系の画像メモリ８に蓄えられている再生画像と、再符号化系の画像メモリ１５に蓄えられている再生画像は、量子化処理が異なるので量子化誤差が異なった画像であり、予測信号が多少異なることになる。従って、画像間予測処理を省略すると１回の予測処理では大きな誤差とならないが、巡回型予測処理で誤差が累積し、大きなずれを生じる。すなわち、予測処理等を省略すると大きな画質劣化を生じることになる。
【０００８】
【発明が解決しようとする課題】
従来の動画像符号列変換装置は、変換の前後で同じ画素数の画像の場合にしか対応していない。同じ画素数の場合、動きベクトルはそのまま用いることが出来るが、変換前と変換後で画素数が異なると、動き補償ブロックの境界が変わるので、動きベクトルをそのまま使用することは出来ない。画素数の変換を伴う変換では、動きベクトル検出を再度行う必要があり、処理量や画質劣化が問題となっていた。
本発明は以上の点に着目してなされたもので、入来した符号列を復号すると共に、符号列から取り出した動きベクトルを用いて異なった画素数の画像に対応した動きベクトルを再構成し、新たな動きベクトルを用いて画素数が変換された復号画像を再符号化することで、処理量も画質劣化も少ない動画像符号列変換装置を提供することを目的とする。
【０００９】
【課題を解決するための手段】
本発明は、動き補償画像間予測された動画像の符号列の変換において、第１の動画像符号列を受け、符号列から第１の動きベクトルを分離し、前記第1の動画像符号列を第１の動き補償ブロックに対応した前記第１の動きベクトルを用いて画像間予測復号化を行い、第１の復号画像信号を得て、その画素数を変換し、異なった画素数の第２の復号画像信号を得る。一方、前記第２の復号画像を第２の動き補償ブロックで再符号化するための第２の動きベクトルを、実画像上で第２の動き補償ブロックの中央位置に存在する第１の動き補償ブロックを特定し、前記第１の動き補償ブロック用の第１の動きベクトルを前記画素数変換手段での画素数の変換比率に応じてスケーリングすることにより得て、それを用いて、前記第２の復号画像信号を前記第２の動き補償ブロックで画像間予測符号化して符号列を得、前記第２の動きベクトルの情報と多重化し第2の動画像符号列を得る動画像符号列変換装置及びその方法である。
【００１１】
（作用）
本発明では、入来した符号列のすべてまたは一部を復号すると共に、符号列から取り出した動きベクトル(ＭＶ)を用いて異なった画素数の画像の動き補償ブロックに対応したＭＶを再構成し、そのＭＶを用いて画素数が変換された復号画像を再符号化する。
再符号化する画像は画素数の変換により、再符号化の転送レートに対して解像度と量子化誤差のバランスを最適にすることが出来る。再構成で得たＭＶは、画素数が変換された画像での動き補償ブロックに対応したものとなっているので、再符号化での動き補償誤差は最小となる。ＭＶの再構成は動きベクトル検出を行う場合と比較して僅かな処理である。動きベクトルも入来したものと類似するので、画像間予測残差の変化も最小限であり、画質劣化も少なくなる。
【００１２】
【発明の実施の形態】
＜実施動画像符号列変換装置＞
本発明の動画像符号列変換装置の一実施例について、以下に説明する。
図１は、その構成を示したもので、図５の従来例と同一構成要素には同一番号を付してある。図１には、図５と比較してＭＶ再構成器６、画素数変換器１９が追加されている。このＭＶ再構成器６の内部構成は図２及び図３に示されている。実施例において従来例と異なるのは、再符号化画像の画素数と、それに伴う動きベクトル(ＭＶ)である。ＤＣＴや量子化等の各処理部分の処理動作は、従来例と基本的に同じである。
【００１３】
復号系から説明する。符号列入力端子１より入来する符号は、予測残差の符号列と前記符号列の動き補償ブロックに対応した第１ＭＶの符号列が可変長復号化器２で固定長の符号に戻される。固定長符号として得られたＤＣＴ係数は逆量子化器３で係数値となり、逆ＤＣＴ４に与えられる。逆ＤＣＴ４は８×８個の係数を再生予測残差信号に変換し、加算器５に与える。加算器５では再生予測残差信号に下記の動き補償予測器７から得られた予測信号が加算され、再生画像となる。一方、可変長復号化器２から取り出した第１ＭＶは動き補償予測器７とＭＶ再構成器６に与えられる。
【００１４】
この様にして得られた再生画像信号は、画像メモリ８と画素数変換器１９に与えられる。画素数変換器１９は再生画像を水平方向にリサンプルし、画素数の変換された画像を得る。リサンプル処理の詳細に付いては後述する。
リサンプルされた再生画像信号は、予測減算器９、ＭＶ再構成器６に与えられる。ＭＶ再構成器６は、可変長復号化器２で得られた第１ＭＶを用いて、再符号化で必要となる異なった画素数の画像に対応した第２ＭＶを再構成する。ＭＶの再構成方法に付いては後述する。
【００１５】
つぎに再符号化系について説明する。画素数変換器１９で画素数変換された再生画像信号は、減算器９において動き補償予測器１４から与えられる予測信号が減算され、予測残差となってＤＣＴ１０に与えられる。
ＤＣＴ１０はＤＣＴ変換処理を行い、得られた係数を量子化器１１に与える。量子化器１１は所定のステップ幅で係数を量子化し、固定長の符号となった係数を可変長符号化器１２と逆量子化器１８に与える。
可変長符号化器１２は、予測残差を可変長符号で圧縮された符号とし、第２ＭＶも可変長符号化し、両者を多重化して出来た符号列を符号出力端子１３より出力する。
【００１６】
一方、逆量子化器１８及び逆ＤＣＴ１７ではＤＣＴ１０及び量子化器１１の逆処理が行われ、画像間予測残差を再生する。得られた再生画像間予測残差は加算器１６で画像間予測信号が加算され再生画像となり、画像メモリ１５に与えられる。画像メモリ１５に蓄えられている再生画像は、動き補償予測器１４に与えられる。動き補償画像間予測器１４は、ＭＶ再構成器６から与えられる第２ＭＶに従って画像間予測信号を作り、減算器９と加算器１６に与えることで、画像間予測符号化が行われる。
なお、再符号化の各部分の処理内容は、図５の従来例と基本的に同じであるが、画素数の違いに合わせて具体的パラメータ等は異なる。
【００１７】
＜画素数変換器＞
画素数変換器１９は、再生画像に対して通常水平方向にリサンプルを行う。具体的には、通常７２０画素である再生画像を、３／４倍して５４０画素、２／３倍して４８０画素、１／２倍して３６０画素等に変換する。その際の画素の様子(１ラインのみ)を図６に示す。リサンプル処理は、ディジタルフィルタでタップ係数を切替えながら異なったサンプル位置の画素を作り出し、必要な画素のみ取り出すことで行われる。
なお、符号化画素数は動き補償ブロックが１６×１６画素なので、その倍数である必要があり、５４０画素では４画素ダミー画素を追加して５４４画素、３６０画素では８画素削除して３５２画素とする。
【００１８】
この様な画素数の変換は、通常転送レートの大幅な変更に伴って行われる。すなわち、７２０画素で８Ｍｂｐｓ程度の符号列を、７２０画素のまま５Ｍｂｐｓ以下に変換しようとすると、ＭＶ等の動き補償予測情報はそのままで、予測残差の情報のみを削減することになり、極端に画質が劣化する。
符号化対象画素数を削減すると、解像度は低下するが、動き補償予測情報も減るので、情報のバランスは適切に出来る。画素数を減らす程度は、再符号化の転送レートに依存し、よりレートを下げたい場合は、画素数もより少なくする。
なお、垂直方向の画素数変換はインターレース信号では問題があるが、プログレッシブ画像の場合は可能である。また、元の転送レートより再符号化の転送レートを上げても画質は改善されないので、画素数を増やすことは考え難い。
【００１９】
もう一つの例は、７２０画素の右端と左端の合計１６画素を削除し７０４画素とする場合である。ディジタル標準画像として使われるRec.601フォーマットは、水平有効画素数が７２０となっているが、実際に画像はアナログ画像信号規格との関係で７１１画素程度しか存在せず、左右にブランクサンプルが存在する。そこでブランクサンプルは排除して７０４画素のみを符号化する。この場合、リサンプルは行われず、画素が削除されるのみである。また、解像度低下はないが画素数の削減効果も僅かである。
【００２０】
＜ＭＶ再構成器＞
ＭＶは、それを用いる動き補償ブロックに適合したものである必要がある。本実施例では、入来する符号列の画像と再符号化され出力される画像で画素数が異なり、実画像上の動き補償ブロックの大きさ及びブロック境界位置が変化する。
その様子を図４に示すが、点線で示されたのは変換前の７２０画素でのブロック(第１ブロック)である。画素数変換で画素数が削減されると、同じ１６×１６画素でもブロック(第２ブロック)は実画像上で大きくなる。それに伴いブロックの境界位置も変化する。７２０が７０４に変換された場合は、ブロックの大きさは変化しないが、境界位置は変化する。
また、ＭＶの値は水平と垂直の画素移動量で示されるので、第１ＭＶと第２ＭＶが実画像上で同一な動きでも、画素数の変換に伴って画素移動量は変化するので、ＭＶ値を変換する必要がある。
【００２１】
ＭＶ再構成器６の構成は、図２または図３のようになる。
図２では入来する第１ＭＶはＭＶバッファ２１で蓄えられ、第１ブロックと第２ブロックの位置関係から、対象第２ブロックに一部でも含まれる第１ブロックを特定し、その第１ＭＶをＭＶ探索器２２に与えられる。
ＭＶ探索器２２は、そのＭＶを垂直成分値に垂直方向の画素数の変換比を乗じ、水平成分値に水平方向の画素数の変換比を乗じる。具体的には水平７２０が４８０に変換されるなら、垂直成分値はそのままで、水平成分値は２／３倍される。このように変換された第１ＭＶは第２ＭＶの候補として、画素数変換された再生画像を用いて再探索し、最も誤差の少なくなるＭＶを出力する。その際、ＭＶ候補に変換比を乗じた第１ＭＶの周辺値を追加しても良い。
【００２２】
ＭＶ再構成器は、図３に示したように簡易的な処理が可能なものでもよい。
図３では入来する第１ＭＶはＭＶバッファ２１で蓄えられ、その第１ＭＶがＭＶ選択器２３に与えられる。
ＭＶ選択器２３は、第１ブロックと第２ブロックの位置関係から、対象第２ブロックの中央にある第１ブロックの第１ＭＶを選択する。これを前記と同様に画素数の変換比を乗じて第２ＭＶとする。この場合は、探索は行われないので再生画像は必要ない。
【００２３】
【発明の効果】
本発明では、入来した符号列のすべてまたは一部を復号すると共に、符号列から取り出した動きベクトルを用いて異なった画素数の画像の動き補償ブロックに対応した動きベクトルを再構成し、その動きベクトルを用いて画素数が変換された復号画像を再符号化する。
画素数が変換された画像を再符号化することにより、再符号化の転送レートに対して解像度と量子化誤差のバランスを最適に出来、変換された符号列の再生画像の品質が改善される。
【００２４】
ＭＶ再構成はＭＶ検出と比較して僅かな処理であり、完全に復号して再符号化する場合に対して、装置規模を大幅に軽減出来る。また、再符号化でＭＶ検出を全く新たに行うと、ＭＶ検出で用いる画像は再生画像で劣化しているので、本来の動きと異なった動きを検出しやすくなるが、本手法では入来したＭＶを基準にした再探索であるので誤検出が起こりにくい。ＭＶも入来したものと類似するので、再符号化による画像間予測残差の変化も最小限であり、画質劣化も少ない。
【図面の簡単な説明】
【図１】本発明の動画像符号列変換装置の一実施例の構成を示す図である。
【図２】本発明の動画像符号列変換装置のＭＶ再構成器の第１実施例を示す図である。
【図３】本発明の動画像符号列変換装置のＭＶ再構成器の第２実施例を示す図である。
【図４】本発明の動き補償ブロックの様子を示す図である。
【図５】従来の動画像符号列変換装置の一構成例を示す図である。
【図６】本発明の画素数変換の様子を示す図である。
【符号の説明】
１符号列入力端子
２可変長復号化器（分離手段）
３、１８量子化器
４、１７ＤＣＴ
５、１６加算器
６ＭＶ再編成器
７、１４動補償予測器
８、１５画像メモリ
９減算器
１０ＤＣＴ
１１量子化器
１２可変長符号化器（多重化手段）
１３符号列出力端子
２１ＭＶバッファ
２２ＭＶ探索器
２３ＭＶ変換器[0001]
BACKGROUND OF THE INVENTION
The present invention relates to code string conversion in high-efficiency coding for converting image information into a digital signal with a smaller code amount in order to efficiently transmit, store, and display an image. In particular, the present invention relates to a case where the coding method is inter-picture predictive coding in which motion compensation is performed in units of blocks, and conversion into images having different numbers of pixels.
[0002]
[Prior art]
<Conversion of video code sequence>
It is necessary to convert a code string encoded by moving image high-efficiency encoding represented by MPEG or the like to a different data rate or convert a variable transfer rate to a fixed transfer rate. In this case, in principle, the image is completely decoded and re-encoded at a different rate. However, if the basic encoding process is the same, a part of the information can be used as it is.
Specifically, the motion vector (MV) information is used as it is in re-encoding, and the motion vector detection that requires many operations can be omitted. In addition, since the motion compensation inter-picture prediction process does not change, deterioration due to re-encoding is only a difference in quantization and can be minimized. Such processing is described in 1993 Image Encoding Symposium Proceedings 1-6 “Examination of encoding control in image re-encoding” and H.Sun, W. Kwok and J. Zdepski, “Architectures for MPEG Compressed Bitstream Scaling” (IEEE Transaction on Circuit and System for Video Technology, Vol. 6, No. 2, April 1996).
[0003]
<Conventional Video Code Sequence Converter>
FIG. 5 shows an example of the configuration of a conventional moving image code string converter.
The code sequence of the motion compensated inter-picture predictive encoding that comes from the code sequence input terminal 1 is returned to a fixed length code by the variable length decoder 2 in which the code sequence of the prediction residual and the MV code sequence are returned. A DCT (discrete cosine transform) coefficient obtained as a fixed length code becomes a coefficient value in the inverse quantizer 3 and is given to the inverse DCT 4. The inverse DCT 4 converts 8 × 8 coefficients into a reproduction prediction residual signal and supplies it to the adder 5. The adder 5 adds a prediction signal, which will be described later, to the reproduction prediction residual signal to form a reproduction image. On the other hand, the MV information output from the variable length decoder 2 is given to the motion compensation predictors 7 and 14.
[0004]
The reproduced image signal obtained in this way is supplied to the image memory 8 and the prediction subtracter 9. The motion compensation predictor 7 performs motion compensation on the image stored in the image memory 8 based on the MV, and forms a prediction signal. The obtained prediction signal is given to the adder 5. Next, the re-encoding system will be described. The reproduced image signal obtained from the adder 5 is subtracted from the prediction signal given from the motion compensation predictor 14 in the subtracter 9 and is given to the DCT 10 as a prediction residual.
[0005]
The DCT 10 performs DCT conversion processing and supplies the obtained coefficient to the quantizer 11. The quantizer 11 quantizes the coefficient with a predetermined step width, and supplies the coefficient that has become a fixed-length code to the variable-length encoder 12 and the inverse quantizer 18. The quantization step width is different from the quantization step width of the inverse quantizer 3 corresponding to the transfer rate change. The variable-length encoder 12 compresses the fixed-length prediction residual with the variable-length code, further variably encodes the MV, and outputs the resulting code string from the code string output terminal 13.
[0006]
On the other hand, the inverse quantizer 18 and the inverse DCT 17 perform the inverse processing of the DCT 10 and the quantizer 11 to reproduce the inter-picture prediction residual. The obtained inter-reproduced image prediction residual is added to the inter-image prediction signal by the adder 16 to form a regenerated image, which is given to the image memory 15. The reproduced image stored in the image memory 15 is given to the motion compensation predictor 14. The motion compensated predictor 14 creates an inter-picture prediction signal according to the MV given from the variable length decoder 2, and provides it to the subtracter 9 and the adder 16.
Here, since the motion compensation inter-picture prediction process is the same in the decoding unit and the encoding unit, it seems that the adder 5 and the subtracter 9 are canceled out and only the intra-image processing is performed. Furthermore, since DCT4 is a reversible conversion process for inverse DCT11, it seems to cancel and only requantization is performed.
[0007]
However, the reproduced image stored in the decoding-system image memory 8 and the reproduced image stored in the re-encoding-system image memory 15 are images having different quantization errors because of different quantization processes. The prediction signal will be slightly different. Therefore, if the inter-image prediction process is omitted, a large error does not occur in one prediction process, but errors accumulate in the cyclic prediction process, resulting in a large shift. That is, if the prediction process or the like is omitted, the image quality is greatly deteriorated.
[0008]
[Problems to be solved by the invention]
The conventional moving image code string conversion apparatus is compatible only with images having the same number of pixels before and after conversion. In the case of the same number of pixels, the motion vector can be used as it is. However, if the number of pixels before and after conversion is different, the boundary of the motion compensation block changes, so that the motion vector cannot be used as it is. In conversion involving conversion of the number of pixels, it is necessary to perform motion vector detection again, and the amount of processing and image quality degradation have been problems.
The present invention has been made paying attention to the above points. In addition to decoding an incoming code string, a motion vector corresponding to an image having a different number of pixels is reconstructed using a motion vector extracted from the code string. Another object of the present invention is to provide a moving image code string conversion device that re-encodes a decoded image in which the number of pixels is converted using a new motion vector, thereby reducing processing amount and image quality degradation.
[0009]
[Means for Solving the Problems]
The present invention relates to the conversion of a motion image code sequence predicted between motion compensated images, receives a first motion image code sequence, separates a first motion vector from the code sequence, and outputs the first motion image code sequence. Is subjected to inter-picture predictive decoding using the first motion vector corresponding to the first motion compensation block to obtain a first decoded image signal, the number of pixels is converted, and the number of pixels having a different number of pixels is converted. 2 decoded image signals are obtained. On the other hand, the second motion vector for re-encoding the second decoded image with the second motion compensation block is the first motion compensation existing at the center position of the second motion compensation block on the actual image. obtained by specifying the block, scales in accordance with the first motion vector for the first motion compensation block to the conversion ratio of the number of pixels in the pixel number conversion means, therewith, the second The decoded image signal is inter-picture prediction encoded by the second motion compensation block to obtain a code sequence, and is multiplexed with the information of the second motion vector to obtain a second video code sequence conversion apparatus And its method.
[0011]
(Work)
In the present invention, all or a part of the incoming code sequence is decoded, and a MV corresponding to a motion compensation block of an image having a different number of pixels is reconstructed using a motion vector (MV) extracted from the code sequence. The decoded image whose number of pixels is converted is re-encoded using the MV.
An image to be re-encoded can optimize the balance between resolution and quantization error with respect to the re-encoding transfer rate by converting the number of pixels. Since the MV obtained by the reconstruction corresponds to the motion compensation block in the image in which the number of pixels is converted, the motion compensation error in the re-encoding is minimized. The MV reconstruction is a slight process compared to the case where motion vector detection is performed. Since the motion vector is similar to the incoming one, the change in the inter-picture prediction residual is also minimal, and the image quality deterioration is reduced.
[0012]
DETAILED DESCRIPTION OF THE INVENTION
<Execution video code string converter>
An embodiment of the moving picture code string converter according to the present invention will be described below.
FIG. 1 shows the configuration, and the same reference numerals are given to the same components as those of the conventional example of FIG. In FIG. 1, an MV reconstructor 6 and a pixel number converter 19 are added as compared with FIG. The internal configuration of the MV reconstructor 6 is shown in FIGS. What is different from the conventional example in the embodiment is the number of pixels of the re-encoded image and the motion vector (MV) associated therewith. The processing operation of each processing part such as DCT and quantization is basically the same as the conventional example.
[0013]
The decoding system will be described. The code coming from the code string input terminal 1 is converted into a fixed-length code by the variable length decoder 2 from the code string of the prediction residual and the first MV code string corresponding to the motion compensation block of the code string. The DCT coefficient obtained as a fixed length code is converted into a coefficient value by the inverse quantizer 3 and is given to the inverse DCT 4. The inverse DCT 4 converts 8 × 8 coefficients into a reproduction prediction residual signal and supplies it to the adder 5. The adder 5 adds a prediction signal obtained from the motion compensation predictor 7 described below to the reproduction prediction residual signal, and a reproduction image is obtained. On the other hand, the first MV taken out from the variable length decoder 2 is supplied to the motion compensation predictor 7 and the MV reconstructor 6.
[0014]
The reproduced image signal obtained in this way is supplied to the image memory 8 and the pixel number converter 19. The pixel number converter 19 resamples the reproduced image in the horizontal direction to obtain an image in which the number of pixels is converted. Details of the resample processing will be described later.
The resampled reproduced image signal is supplied to the prediction subtracter 9 and the MV reconstructor 6. The MV reconstructor 6 uses the first MV obtained by the variable length decoder 2 to reconstruct a second MV corresponding to an image having a different number of pixels necessary for re-encoding. The MV reconstruction method will be described later.
[0015]
Next, the re-encoding system will be described. The reproduced image signal whose number of pixels has been converted by the number of pixels converter 19 is subtracted from the prediction signal given from the motion compensated predictor 14 in the subtracter 9 and given to the DCT 10 as a prediction residual.
The DCT 10 performs a DCT conversion process, and gives the obtained coefficient to the quantizer 11. The quantizer 11 quantizes the coefficient with a predetermined step width, and supplies the coefficient that has become a fixed-length code to the variable-length encoder 12 and the inverse quantizer 18.
The variable length encoder 12 converts the prediction residual into a code compressed with the variable length code, the second MV is also variable length encoded, and a code string formed by multiplexing both is output from the code output terminal 13.
[0016]
On the other hand, the inverse quantizer 18 and the inverse DCT 17 perform the inverse processing of the DCT 10 and the quantizer 11 to reproduce the inter-picture prediction residual. The obtained inter-reproduced image prediction residual is added to the inter-image prediction signal by the adder 16 to form a regenerated image, which is given to the image memory 15. The reproduced image stored in the image memory 15 is given to the motion compensation predictor 14. The motion-compensated inter-picture predictor 14 generates an inter-picture prediction signal according to the second MV given from the MV reconstructor 6 and provides it to the subtracter 9 and the adder 16 to perform inter-picture prediction coding.
The processing content of each part of re-encoding is basically the same as that of the conventional example of FIG. 5, but specific parameters and the like differ depending on the number of pixels.
[0017]
<Pixel number converter>
The pixel number converter 19 resamples the reproduced image in the normal horizontal direction. Specifically, a reproduction image that is normally 720 pixels is converted by 3/4 to 540 pixels, 2/3 to 480 pixels, 1/2 to 360 pixels, and the like. The state of the pixel at that time (only one line) is shown in FIG. The resampling process is performed by creating pixels at different sample positions while switching tap coefficients with a digital filter and extracting only necessary pixels.
Since the motion compensation block is 16 × 16 pixels, the number of encoded pixels needs to be a multiple thereof. For 540 pixels, 4 pixel dummy pixels are added, 544 pixels, 360 pixels are deleted, 8 pixels are deleted, and 352 pixels are obtained. To do.
[0018]
Such conversion of the number of pixels is usually performed with a significant change in the transfer rate. In other words, if an attempt is made to convert a code string of about 8 Mbps with 720 pixels to 5 Mbps or less with 720 pixels, motion compensation prediction information such as MV remains unchanged, and only prediction residual information is reduced. Image quality deteriorates.
If the number of encoding target pixels is reduced, the resolution is lowered, but the motion compensation prediction information is also reduced, so that the information can be appropriately balanced. The extent to which the number of pixels is reduced depends on the transfer rate of re-encoding, and if it is desired to lower the rate, the number of pixels is also reduced.
Note that the conversion of the number of pixels in the vertical direction has a problem with an interlace signal, but it is possible with a progressive image. Further, since the image quality is not improved even if the re-encoding transfer rate is increased from the original transfer rate, it is difficult to consider increasing the number of pixels.
[0019]
Another example is a case where a total of 16 pixels at the right end and the left end of 720 pixels are deleted to make 704 pixels. The Rec.601 format used as a digital standard image has a horizontal effective pixel count of 720, but in reality there are only about 711 pixels in relation to the analog image signal standard, and there are blank samples on the left and right. To do. Therefore, blank samples are excluded and only 704 pixels are encoded. In this case, resampling is not performed and only the pixels are deleted. Further, although the resolution is not lowered, the effect of reducing the number of pixels is slight.
[0020]
<MV reconstructor>
The MV needs to be compatible with the motion compensation block that uses it. In this embodiment, the number of pixels is different between the image of the incoming code sequence and the image that is re-encoded and output, and the size of the motion compensation block and the block boundary position on the actual image change.
This state is shown in FIG. 4, and a dotted line indicates a block (first block) of 720 pixels before conversion. When the number of pixels is reduced by pixel number conversion, the block (second block) becomes larger on the actual image even with the same 16 × 16 pixels. Accordingly, the boundary position of the block also changes. When 720 is converted to 704, the block size does not change, but the boundary position changes.
Further, since the MV value is indicated by the horizontal and vertical pixel movement amounts, even if the first MV and the second MV move in the same manner on the actual image, the pixel movement amount changes with the conversion of the number of pixels. Need to be converted.
[0021]
The configuration of the MV reconstructor 6 is as shown in FIG.
In FIG. 2, the incoming first MV is stored in the MV buffer 21, and the first block included in the target second block is specified from the positional relationship between the first block and the second block, and the first MV is MV It is given to the searcher 22.
The MV searcher 22 multiplies the MV by the vertical component value conversion ratio of the number of pixels in the vertical direction, and multiplies the horizontal component value by the conversion ratio of the number of pixels in the horizontal direction. Specifically, if the horizontal 720 is converted to 480, the vertical component value is left as it is, and the horizontal component value is multiplied by 2/3. The first MV converted in this way is re-searched as a second MV candidate using the reproduced image with the converted number of pixels, and the MV with the smallest error is output. At that time, a peripheral value of the first MV obtained by multiplying the MV candidate by the conversion ratio may be added.
[0022]
The MV reconstructor may be capable of simple processing as shown in FIG.
In FIG. 3, the incoming first MV is stored in the MV buffer 21, and the first MV is given to the MV selector 23.
The MV selector 23 selects the first MV of the first block at the center of the target second block from the positional relationship between the first block and the second block. This is multiplied by the conversion ratio of the number of pixels in the same manner as described above to obtain the second MV. In this case, since no search is performed, a reproduced image is not necessary.
[0023]
【The invention's effect】
In the present invention, all or part of the incoming code string is decoded, and a motion vector corresponding to a motion compensation block of an image having a different number of pixels is reconstructed using the motion vector extracted from the code string. The decoded image in which the number of pixels is converted is re-encoded using the motion vector.
By re-encoding the image with the converted number of pixels, the balance between resolution and quantization error can be optimized for the re-encoding transfer rate, and the quality of the reconstructed image of the converted code string can be improved. .
[0024]
MV reconstruction is a little processing compared to MV detection, and the scale of the apparatus can be greatly reduced as compared with the case of complete decoding and re-encoding. In addition, when MV detection is completely performed by re-encoding, an image used for MV detection is deteriorated in a reproduced image, so that it becomes easy to detect a motion different from the original motion. Since the re-search is based on MV, erroneous detection is unlikely to occur. Since the MV is similar to the incoming one, the change in the inter-picture prediction residual due to re-encoding is also minimal, and the image quality degradation is small.
[Brief description of the drawings]
FIG. 1 is a diagram illustrating a configuration of an embodiment of a moving image code string conversion apparatus according to the present invention.
FIG. 2 is a diagram illustrating a first embodiment of an MV reconstructor of a video code string converter according to the present invention.
FIG. 3 is a diagram illustrating a second embodiment of the MV reconstructor of the video code string converter according to the present invention.
FIG. 4 is a diagram showing a state of a motion compensation block of the present invention.
FIG. 5 is a diagram illustrating a configuration example of a conventional moving image code string conversion device.
FIG. 6 is a diagram illustrating a state of pixel number conversion according to the present invention.
[Explanation of symbols]
1 Code string input terminal 2 Variable length decoder (separation means)
3, 18 Quantizer 4, 17 DCT
5, 16 Adder 6 MV reorganizer 7, 14 dynamic compensation predictor 8, 15 Image memory 9 Subtractor 10 DCT
11 Quantizer 12 Variable Length Encoder (Multiplexing Unit)
13 Code string output terminal 21 MV buffer 22 MV searcher 23 MV converter

Claims

In a moving image code string conversion apparatus that converts a code string of a moving image predicted between motion compensated images,
Separating means for receiving a first moving image code string and obtaining a first motion vector from the code string;
Decoding means for performing inter-picture prediction using the first motion vector corresponding to the first motion compensation block and decoding the first moving picture code string to obtain a first decoded picture signal;
A pixel number converting means for converting the number of pixels of the first decoded image signal to obtain a second decoded image signal having a different number of pixels;
A first motion compensation block in which a second motion vector for re-encoding the second decoded image signal with a second motion compensation block is present at the center position of the second motion compensation block on the actual image And a motion vector reconstruction unit obtained by scaling the first motion vector for the first motion compensation block according to a conversion ratio of the number of pixels in the pixel number conversion unit;
Re-encoding means for inter-predicting the second decoded image signal with the second motion compensation block using the second motion vector to obtain a code string;
Video code stream conversion apparatus characterized by having a multiplexing means for obtaining the second motion vector video code stream of information and the multiplexes the code string obtained by the re-encoding means in the second.

In a moving image code sequence conversion method for converting a code sequence of a moving image predicted between motion compensated images,
Receiving a first video sequence and separating a first motion vector from the sequence;
The first moving image code string is subjected to inter-picture prediction decoding using the first motion vector corresponding to the first motion compensation block, to obtain a first decoded image signal,
Converting the number of pixels of the first decoded image signal to obtain a second decoded image signal having a different number of pixels;
A first motion compensation block in which a second motion vector for re-encoding the second decoded image signal with a second motion compensation block is present at the center position of the second motion compensation block on the actual image And the first motion vector for the first motion compensation block is determined according to the conversion ratio of the number of pixels of the first decoded image signal and the number of pixels of the second decoded image signal. Obtained by scaling
Using the second motion vector, the second decoded image signal is subjected to inter-picture prediction encoding with the second motion compensation block to obtain a code string,
Video code stream conversion method characterized by obtaining the second video code stream by multiplexing the information and the code sequence of said second motion vectors.