JP4100067B2

JP4100067B2 - Image information conversion method and image information conversion apparatus

Info

Publication number: JP4100067B2
Application number: JP2002195016A
Authority: JP
Inventors: 安弘橋本
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-07-03
Filing date: 2002-07-03
Publication date: 2008-06-11
Anticipated expiration: 2022-07-03
Also published as: JP2004040494A

Description

【０００１】
【発明の属する技術分野】
本発明は、画像情報変換方法及び画像情報変換装置に関し、特に、離散コサイン変換等の直交変換と動き補償によって圧縮された画像情報（ビットストリーム）を、衛星放送、ケーブルＴＶ、インターネット等のネットワークを介して受信する際、或いは、光ディスク、磁気ディスク、フラッシュメモリ等の記憶媒体上で処理する際に用いられる画像情報変換方法及び画像情報変換装置に関する。
【０００２】
【従来の技術】
近年、画像情報をディジタルデータとして取り扱う際に、画像情報特有の冗長性を利用して、効率の高い情報の伝送及び蓄積を実現した画像情報変換方法及び装置が放送局と一般家庭との間の情報配信等において普及しつつある。
【０００３】
このような画像情報変換装置は、例えば、離散コサイン変換等の直交変換と動き補償により画像データを圧縮する方式に則っている。特に、ＭＰＥＧ（Moving Picture Experts Group：動画像符号化専門家会合）によって標準化されている画像符号化方式は、汎用画像符号化方式としてＩＳＯ／ＩＥＣ１３８１８−２に定義されており、飛び越し走査画像及び順次走査画像の双方、並びに標準解像度画像及び高精細画像を網羅している。そのためＭＰＥＧは、プロフェッショナル用途からコンシューマ用途まで、広範なアプリケーションに今後とも用いられるものと予想される。このＭＰＥＧ方式を用いると、例えば、７２０×４８０画素を持つ標準解像度の飛び越し走査画像であれば、４〜８Ｍｂｐｓの符号量（以下、ビットレートと記す。）に、１９２０×１０８８画素を持つ高解像度の飛び越し走査画像であれば、１８〜２２Ｍｂｐｓのビットレートに割り当てられるため、良好な画質を保って高い圧縮率が実現できる。
【０００４】
ＭＰＥＧ方式のように、動き補償及び離散コサイン変換によって画像データを圧縮する画像情報変換装置では、画像データにおける符号化単位としての各マクロブロックにおいて、画像内符号化画像を用いるか、画像間符号化画像を用いるかの判定、参照画像フレームとして、前方予測符号化画像を用いるか、後方予測符号化画像を用いるか、双方向予測符号化画像を用いるかの判定、さらには、動き補償予測としてフレーム予測を用いるか、フィールド予測を用いるかの判定を行っている。
【０００５】
例えば、ＭＰＥＧ２に代表される動画像圧縮符号化方式において、フレーム予測を用いるかフィールド予測を用いるかの判断方法については、規格として特に定められてはいない。
【０００６】
ただし、従来の画像情報変換装置では、一般的に演算量或いは回路規模を削減する等の目的から、輝度信号を用いてフレームベクトルとフィールドベクトルとを求め、さらに輝度信号を用いてフィールドとフレームとの間の予測残差を求め、これらの予測残差を比較して、圧縮に有利であると判断される予測画像が選択されている。
【０００７】
実際は、フレーム予測を用いると符号化されるベクトルの数が半分になり、圧縮効率がよくなることが知られている。そのため、一般的にはフレーム予測が選ばれやすいような判断基準が設定されている。また、動きベクトルを求める前に、どちらの予測方式を用いるかを判断することにより、動きベクトルの演算量を削減し、演算に関わる回路規模を削減する方法も知られている。
【０００８】
【発明が解決しようとする課題】
ＭＰＥＧ方式に則った従来の画像情報変換装置、例えば、ＭＰＥＧ２に代表される動画像圧縮符号化方式に対応した画像情報変換装置において、フレーム予測を用いるかフィールド予測を用いるかの判断方法の一例として、それぞれの予測方式を用いた場合の動きベクトルの二乗予測誤差を算出し、これらを比較して最小となる方式を選択するなどの方法があげられる。
【０００９】
例えば、動画像等の一連の画像データを圧縮する際、特に４：２：０フォーマットの入力画像に対して、画像データのピクチャ間予測方式としてフレーム予測が選択される場合について説明する。
【００１０】
通常、予測方式としてフレーム予測が選択されると圧縮効率がよいことは既述した。しかし、４：２：０フォーマットの場合、ピクチャ間の予測方式としてフレーム予測が選択されると、図５に示すように、動きベクトルの垂直成分の位相によっては、選択される色差信号が輝度信号のフィールドとは逆のフィールドの画素になる場合がある。特に、動きが大きい場合、輝度信号の動きベクトルと色差信号の動きベクトルとが互いに逆フィールドを指すと、トップフィールドとボトムフィールドとの間で色差信号の相関が低くなることが知られている。この結果、色差信号における圧縮効率が悪くなり、画質劣化を引き起こすことがあった。
【００１１】
そこで本発明は、このような従来の実情に鑑みて提案されたものであり、画質の劣化を招く予測方式か否かを判別し、低ビットレートで符号化効率及び画質の向上を実現する予測方式を選択する画像情報変換方法及び画像情報変換装置を提供することを目的とする。
【００１２】
【課題を解決するための手段】
上述した目的を達成するために、本発明に係る画像情報変換方法は、直交変換と動き補償を適用して画像データを圧縮して画像圧縮情報を得る画像情報変換方法において、画像データの符号化単位の動き補償予測としてトップフィールドとボトムフィールドとが合成されたフレームで動き補償予測を行う際、色差信号における動きベクトルが輝度信号における動きベクトルが指す画素が存在するフィールドとは反対のフィールドの画素を指す場合、符号化単位の動き補償予測をフィールド予測に変更することを特徴とする。
【００１３】
ここで、輝度信号の動きベクトルの大きさが所定の閾値を超えている場合に符号化モードの判定をフィールド予測に変更することもできる。
【００１４】
特に、画像圧縮情報は、ＭＰＥＧ（Moving Picture Experts Group）規格に準拠していることが好ましい。
【００１５】
また、本発明に係る画像情報変換装置は、直交変換と動き補償を適用して画像データを圧縮して画像圧縮情報を得る画像情報変換装置において、画像データの符号化単位の動き補償予測としてトップフィールドとボトムフィールドとが合成されたフレームで動き補償予測を行う際、色差信号における動きベクトルが輝度信号における動きベクトルが指す画素が存在するフィールドとは反対のフィールドの画素を指すか否かを判別する判別手段と、動きベクトルに応じて符号化単位の動き補償予測をフレーム予測にするかフィールド予測にするかを制御する予測モード制御手段とを備え、判別手段において、色差信号における動きベクトルが輝度信号における動きベクトルが指す画素が存在するフィールドとは反対のフィールドの画素を指すと判断された場合、予測モード制御手段は、符号化単位の動き補償予測をフィールド予測に変更することを特徴としている。
【００１６】
ここで、動きベクトルの大きさが所定の閾値を超えている場合、符号化モードの判定をフィールド予測に変更することもできる。
【００１７】
特に、画像圧縮情報は、ＭＰＥＧ（Moving Picture Experts Group）規格に準拠していることが好ましい。
【００１８】
【発明の実施の形態】
本発明の具体例として示す画像情報変換装置は、動き補償及び離散コサイン変換によって画像データを圧縮する画像情報変換装置であって、画像データにおける符号化単位としての各マクロブロックにおいて、画像データの動き補償予測としてフレーム予測を用いるか、或いはフィールド予測を用いるかの判定に輝度の情報のみならず、動きベクトルの値や位相等の情報を用いることによって、低ビットレートで符号化効率及び画質の向上を実現した装置である。
【００１９】
以下、本発明の実施の形態について、図面を参照して詳細に説明する。図１を用いて、本発明の具体例として示す画像情報変換装置の構成を説明する。
【００２０】
図１に示す画像情報変換装置１は、ＭＰＥＧ（Moving Picture Experts Group：動画像符号化専門家会合）方式、特に、ＭＰＥＧ２方式に準拠する画像情報変換装置であって、主として、画像データを入力して前処理する構成と、動き予測及び動き補償によって符号化モードを決定する構成と、離散コサイン変換（ＤＣＴ）及び量子化を施す構成と、これら変換処理によって圧縮された画像データを出力する構成とを備えている。
【００２１】
具体的に、画像情報変換装置１は、画像データを入力して前処理する構成として、画像データの入力端子１１と、画面並換回路１２とを備える。
【００２２】
この画像データは、輝度（Ｙ）と色差信号（Ｃｂ、Ｃｒ）とから構成されたビデオ信号であって、図示しないが、例えば、入力端子１１より入力される前段階でディジタル化され、この後の符号化で用いる空間解像度にフォーマット変換されている。
【００２３】
この画像データは、画面並換回路１２において、Ｉピクチャ（フレーム内予測符号化画像）、Ｐピクチャ（フレーム間順方向予測符号化画像）、Ｂピクチャ（フレーム間双方向予測符号化画像）の３つのピクチャタイプ毎に並び換えられる。入力した画像データの各ピクチャは、後述する動き補償回路２４、ＤＣＴ回路１４において、この符号化順に、動き補償並びにＤＣＴ符号化される。
【００２４】
画像情報変換装置１は、さらに、動き予測及び動き補償によって符号化モードを決定する構成と、離散コサイン変換（ＤＣＴ）及び量子化を施す構成として、具体的に、差分器１３と、ＤＣＴ回路１４と、量子化回路１５と、動き補償回路２４と、動き予測回路２５とを備えている。動き補償回路２４及び動き予測回路２５は、一体化されていてもよい。
【００２５】
差分器１３は、画像並換回路１２から供給されたＩピクチャ又はＰピクチャと、予測により得た画像との差分をとって予測誤差を算出し、この値をＤＣＴ回路１４に供給する。ＤＣＴ回路１４は、画像データをＤＣＴ符号化係数へと変換する。量子化回路１５は、ここでのＤＣＴ符号化係数を、レート制御回路１６で決定された量子化スケールに基づいて量子化する。量子化されたデータは、動きベクトル及び符号化モード情報とともに、可変長符号化回路１７において可変長符号化された後、バッファ１８に蓄積され、目標のビットレートに合わせてＭＰＥＧビデオビットストリームとして出力端子１９より出力される。
【００２６】
また、画像データがＩピクチャ及びＰピクチャであれば、後の処理でこのＤＣＴ符号化係数を動き補償予測の参照画面として用いる必要があるため、量子化された情報は、逆量子化回路２０及び逆ＤＣＴ回路２１によって局所的に復号され、図示しない復号器で得られるものと同一の画像として復元され、加算器２２を通過して画像メモリ２３に蓄積される。
【００２７】
続いて、図１に示す画像情報変換装置１における符号化予測処理の流れを図２を用いて説明する。図２は、画像情報変換装置１における一連の予測モード判定処理を処理ブロックとして示したものであり、図２に示す動きベクトル演算ブロック４１及びフレーム予測／フィールド予測判定ブロック４２は、図１の動き予測回路２４における処理に相当している。また、符号化ブロック４３は、図１の入力端子１１、出力端子１９、画面並換回路１２、動き予測回路２５を除いた構成における処理に相当する。
【００２８】
入力された色差信号（入力画像色差信号３０）と輝度信号（入力画像輝度信号３１）とを含む画像データは、符号化ブロック４３へ送られる。このとき入力画像輝度信号３１は、動きベクトル演算ブロック４１にも送られる。
【００２９】
動きベクトル演算ブロック４１にて入力画像輝度信号３１から算出されたフィールド予測動きベクトル３２及びフレーム予測動きベクトル３３は、符号化ブロック４３へ送られる。また、動きベクトル演算ブロック４１にて算出されたフィールド予測残差３４とフレーム予測残差３５がフレーム予測／フィールド予測判定ブロック４２に送られる。このときフィールド予測動きベクトル３２及びフレーム予測動きベクトル３３は、フレーム予測／フィールド予測判定ブロック４２にも送られる。フレーム予測／フィールド予測判定ブロック４２では、フィールド予測動きベクトル３２、フレーム予測動きベクトル３３、フィールド予測残差３４、フレーム予測残差３５から、フレーム予測を用いるかフィールド予測を用いるかが選択される。
【００３０】
動き補償回路２４は、例えば、ＤＳＰ（ディジタルシグナルプロセッサ）であって、一手法として、フィールド予測の場合とフレーム予測の場合とで動きベクトルの二乗予測誤差を算出し、これらを比較して最小となる予測方式を決定している。またここでは、圧縮効率がよいフレーム予測が選ばれやすくなるように、一方の予測残差にオフセット分を加算して大小比較を実行してもよい。
【００３１】
フレーム予測／フィールド予測判定ブロック４２におけるフレーム／フィールド予測の判定結果は、フレーム／フィールド予測選択信号３６として符号化ブロック４３に送られ、符号化ブロック４３では、選択された予測方法の動きベクトルを用いて符号化する。そして、図１に示す可変長符号化回路１７からビットストリーム３８が出力される。
【００３２】
続いて、動き補償回路２４、動き予測回路２５における符号化予測モード判別処理の一例を図３に基づいて説明する。本発明の具体例として示す画像情報変換装置１では、符号化予測モードを判別する際、輝度情報から判定された符号化予測モードに対して、動きベクトルの値の位相や大きさを用いることで理想的な符号化効率で且つ画像劣化が少ない最適な予測モードを選択している。
【００３３】
動き補償回路２４は、まずステップＳ１において、従来汎用の手法に基づいて、輝度情報から画像データの符号化単位としてフレーム予測を用いるかフィールド予測を用いるかを判定する。
【００３４】
ステップＳ１においてフィールド予測が選択されると、このマクロブロックに関しては予測方法を変更することなく、次のマクロブロックに対して上述した符号化予測方法を繰り返す。
【００３５】
一方、ステップＳ１においてフレーム予測が選択されると、ステップＳ２において、フレーム予測の参照画像において、色差信号の動きベクトルが輝度信号の動きベクトルが示す画素と反対のフィールド上の画素を指していないかを判別する。ここでは例えば、次のような条件に基づいて判別する。すなわち、フレーム予測動きベクトルの垂直成分を４で割った余りが２のとき、選択される色差信号が反対のフィールド（逆相）上の画素になるとする。
ｉｆ（（フレーム予測動きベクトルの大きさの垂直成分）％４＝＝２）このとき、色差信号が逆相となる。
【００３６】
ｅｌｓｅ逆相とならない。
【００３７】
ステップＳ２において、選択される色差信号が反対のフィールド（逆相）上の画素になると判別された場合、フレーム予測では圧縮効率が低下するため、ステップＳ３においてフィールド予測に変更する。また、色差信号の動きベクトルが逆相の画素を指すことが検出されなければ、フレーム予測であっても圧縮効率が低下せず、良好な圧縮効率が得られるためフレーム予測を適用する。
【００３８】
ステップＳ４において、このフレームにおける全マクロブロックタイプの決定が完了すれば、ステップＳ５に進み、次のフレームの符号化予測モードを判定する。
【００３９】
上述した処理によれば、輝度信号に加え、動きベクトルの値の位相や大きさを用いて符号化予測モードを決定することにより、色差信号の動きベクトルが逆相の画素を示して圧縮効率の低下を招く場合を回避できるため、高画質化を実現するとともに演算量を低減できる。そのため回路規模を縮小できる。
【００４０】
ところで、動き量が少ない場合、動きベクトルの値は、色差信号のトップフィールドとボトムフィールドとでほぼ同じ値になると考えられる。そこで、動き量が少ない場合、符号化モードの判別に動きベクトルの動き量を導入する。続いて、この手法について説明する。
【００４１】
図４でのステップＳ１１、Ｓ１２、Ｓ１４、Ｓ１５及びＳ１６は、それぞれ図３に示すステップＳ１、Ｓ２、Ｓ３、Ｓ４、Ｓ５に対応している。この手法では、ステップＳ１２で色差信号の動きベクトルが逆相の画素を指すと判別されると、ステップＳ１３において、動きベクトルの大きさ（動き量）を用いて圧縮効率が低下するか否かを判別している。
【００４２】
例えば、輝度信号の動きベクトルの大きさの垂直成分と水平成分とがそれぞれ所定の閾値より小さければ、動き量が小であるとして、色差信号の動きベクトルが逆相の画素を指しても圧縮効率の大幅な低下がないと判別し、フィールド予測に変更することなくフレーム予測をそのまま適用する。
【００４３】
一方、動きベクトルの大きさの垂直成分と水平成分とが所定の閾値を超える場合、選択される色差信号が反対のフィールド（逆相）上の画素になり画質の劣化が顕著化するため、予測モードをフィールド予測へ変更する。
【００４４】
したがって、上述した画像情報変換装置１によれば、輝度情報に加えて、輝度情報の動きベクトル値、位相及び大きさを用いて、符号化単位の予測モードとしてフレーム予測を用いるかフィールド予測を用いるかを判定することにより、低ビットレートで画像劣化が最小限に抑えられた最適な符号化予測モードが選択できる。
【００４５】
なお、本発明は上述した実施の形態のみに限定されるものではなく、本発明の要旨を逸脱しない範囲において種々の変更が可能であることは勿論である。
【００４６】
【発明の効果】
以上詳細に説明したように、本発明に係る画像情報変換方法は、画像データの符号化単位の動き補償予測としてトップフィールドとボトムフィールドとが合成されたフレームで動き補償予測を行う際、色差信号における動きベクトルが輝度信号における動きベクトルが指す画素が存在するフィールドとは反対のフィールドの画素を指す場合、符号化単位の動き補償予測をフィールド予測に変更することにより、圧縮効率の低下を回避し高画質化を実現できる。また、演算量を削減でき、回路規模が縮小できる。
【００４７】
特に、判別工程では、輝度信号の動きベクトルの大きさに所定の閾値を設け、この閾値を超えている場合、符号化モードの判定をフィールド予測に変更することによって、圧縮効率の低下を回避し高画質化を実現できる。また、演算量を削減でき、回路規模が縮小できる。
【００４８】
また、本発明に係る画像情報変換装置は、画像データの符号化単位の動き補償予測としてトップフィールドとボトムフィールドとが合成されたフレームで動き補償予測を行う際、色差信号における動きベクトルが輝度信号における動きベクトルが指す画素が存在するフィールドとは反対のフィールドの画素を指すか否かを判別する判別手段と、動きベクトルに応じて符号化単位の動き補償予測をフレーム予測にするかフィールド予測にするかを制御する予測モード制御手段とを備え、判別手段において、色差信号における動きベクトルが輝度信号における動きベクトルが指す画素が存在するフィールドとは反対のフィールドの画素を指すと判断された場合、予測モード制御手段は、符号化単位の動き補償予測をフィールド予測に変更することにより、圧縮効率の低下を回避し高画質化を実現できる。また、演算量を削減でき、回路規模が縮小できる。
【００４９】
特に、輝度信号の動きベクトルの大きさに所定の閾値を設け、この閾値を超えている場合、符号化モードの判定をフィールド予測に変更することによって、圧縮効率の低下を回避し高画質化を実現できる。また、演算量を削減でき、回路規模が縮小できる。
【図面の簡単な説明】
【図１】本発明の具体例として示す画像情報変換装置の構成を説明するブロック図である。
【図２】図１に示す画像情報変換装置における処理ブロックを説明する図である。
【図３】図１に示す画像情報変換装置におけるフィールド／フレーム予測判別処理を説明するフローチャートである。
【図４】図１に示す画像情報変換装置におけるフィールド／フレーム予測判別処理の別の例を説明するフローチャートである。
【図５】フレーム予測において色差信号の動きベクトルが輝度信号とは反対のフィールド上の画素を示す場合を説明する模式図である。
【符号の説明】
１画像情報変換装置、１１入力端子、１２画面並換回路、１３差分器、１４ＤＣＴ回路、１５量子化回路、１６レート制御回路、１７可変長符号化回路、１８バッファ、１９出力端子、２０逆量子化回路、２１逆ＤＣＴ回路、２２加算器、２３画像メモリ、２４動き補償回路、２５動き予測回路、３０入力画像色差信号、３１入力画像輝度信号、３２フィールド予測動きベクトル、３３フレーム予測動きベクトル、３４フレーム予測残差、３５フィールド予測残差、３６フレーム予測／フィールド予測選択信号、３７参照画像データ、３８ビットストリーム、４１動きベクトル演算ブロック、４２フレーム予測／フィールド予測判定ブロック、４３符号化ブロック[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an image information conversion method and an image information conversion apparatus, and more particularly, to image information (bitstream) compressed by orthogonal transform such as discrete cosine transform and motion compensation, for a network such as satellite broadcasting, cable TV, and the Internet. The present invention relates to an image information conversion method and an image information conversion apparatus that are used when receiving data via a network or processing on a storage medium such as an optical disk, a magnetic disk, or a flash memory.
[0002]
[Prior art]
In recent years, when image information is handled as digital data, an image information conversion method and apparatus that realizes efficient transmission and storage of information using redundancy unique to image information has been developed between a broadcasting station and a general household. It is spreading in information distribution.
[0003]
Such an image information conversion apparatus conforms to a method of compressing image data by orthogonal transform such as discrete cosine transform and motion compensation, for example. In particular, an image coding method standardized by the Moving Picture Experts Group (MPEG) is defined in ISO / IEC 13818-2 as a general-purpose image coding method. It covers both progressively scanned images, standard resolution images and high definition images. Therefore, MPEG is expected to be used in a wide range of applications from professional use to consumer use. Using this MPEG system, for example, a standard resolution interlaced scanned image having 720 × 480 pixels has a high resolution having 1920 × 1088 pixels in a code amount of 4 to 8 Mbps (hereinafter referred to as a bit rate). Since the interlaced scanned image is assigned to a bit rate of 18 to 22 Mbps, a high compression rate can be realized while maintaining good image quality.
[0004]
In an image information conversion apparatus that compresses image data by motion compensation and discrete cosine transform as in the MPEG system, an intra-image encoded image is used in each macroblock as an encoding unit in image data, or inter-image encoding is performed. Determination of whether to use an image, determination of whether to use a forward predictive encoded image, a backward predictive encoded image, or a bidirectional predictive encoded image as a reference image frame, and a frame as motion compensated prediction It is determined whether to use prediction or field prediction.
[0005]
For example, in a moving image compression encoding system represented by MPEG2, a method for determining whether to use frame prediction or field prediction is not particularly defined as a standard.
[0006]
However, in the conventional image information conversion apparatus, generally, for the purpose of reducing the calculation amount or the circuit scale, a frame vector and a field vector are obtained using a luminance signal, and further, a field and a frame are obtained using the luminance signal. A prediction image that is determined to be advantageous for compression is selected by calculating prediction residuals between the two and comparing these prediction residuals.
[0007]
Actually, it is known that when frame prediction is used, the number of encoded vectors is halved and compression efficiency is improved. Therefore, in general, a determination criterion is set so that frame prediction is easily selected. There is also known a method of reducing the amount of motion vector computation and reducing the circuit scale involved in computation by determining which prediction method is used before obtaining a motion vector.
[0008]
[Problems to be solved by the invention]
As an example of a method for determining whether to use frame prediction or field prediction in a conventional image information conversion apparatus conforming to the MPEG system, for example, an image information conversion apparatus corresponding to a moving image compression encoding system typified by MPEG2. For example, a method of calculating a square prediction error of a motion vector when using each prediction method and comparing these is selected.
[0009]
For example, when a series of image data such as a moving image is compressed, a case will be described in which frame prediction is selected as an inter-picture prediction method for image data, particularly for an input image in 4: 2: 0 format.
[0010]
As described above, compression efficiency is usually good when frame prediction is selected as the prediction method. However, in the 4: 2: 0 format, when frame prediction is selected as a prediction method between pictures, as shown in FIG. 5, depending on the phase of the vertical component of the motion vector, the selected color difference signal may be a luminance signal. In some cases, the pixel of the field opposite to the field of. In particular, when the motion is large, it is known that when the motion vector of the luminance signal and the motion vector of the chrominance signal indicate opposite fields, the correlation of the chrominance signal decreases between the top field and the bottom field. As a result, the compression efficiency of the color difference signal is deteriorated, which may cause image quality deterioration.
[0011]
Therefore, the present invention has been proposed in view of such conventional situations, and it is determined whether or not the prediction method causes deterioration in image quality, and prediction that realizes improvement in coding efficiency and image quality at a low bit rate. An object of the present invention is to provide an image information conversion method and an image information conversion apparatus for selecting a method.
[0012]
[Means for Solving the Problems]
In order to achieve the above-described object, an image information conversion method according to the present invention is an image information conversion method in which image data is compressed by applying orthogonal transform and motion compensation to obtain compressed image information. When performing motion compensation prediction in a frame in which the top field and the bottom field are combined as unit motion compensation prediction, a pixel in a field opposite to the field in which the motion vector in the chrominance signal has a pixel pointed to by the motion vector in the luminance signal exists , The motion compensated prediction of the coding unit is changed to field prediction.
[0013]
Here, when the magnitude of the motion vector of the luminance signal exceeds a predetermined threshold, the determination of the encoding mode can be changed to field prediction.
[0014]
In particular, the image compression information preferably conforms to the MPEG (Moving Picture Experts Group) standard.
[0015]
Also, the image information conversion apparatus according to the present invention is the top as motion compensation prediction in the encoding unit of image data in the image information conversion apparatus that obtains image compression information by compressing image data by applying orthogonal transform and motion compensation. When motion compensation prediction is performed in a frame in which the field and the bottom field are combined, it is determined whether or not the motion vector in the color difference signal indicates a pixel in a field opposite to the field in which the pixel indicated by the motion vector in the luminance signal exists And a prediction mode control means for controlling whether the motion compensated prediction of the coding unit is to be frame prediction or field prediction according to the motion vector, wherein the motion vector in the chrominance signal is a luminance When the pixel of the field opposite to the field where the pixel indicated by the motion vector in the signal exists If it is the cross-sectional, the prediction mode control means is characterized in that changing the motion compensation prediction of the coding unit in the field prediction.
[0016]
Here, when the magnitude of the motion vector exceeds a predetermined threshold, the determination of the encoding mode can be changed to field prediction.
[0017]
In particular, the image compression information preferably conforms to the MPEG (Moving Picture Experts Group) standard.
[0018]
DETAILED DESCRIPTION OF THE INVENTION
An image information conversion apparatus shown as a specific example of the present invention is an image information conversion apparatus that compresses image data by motion compensation and discrete cosine transform, and in each macroblock as a coding unit in the image data, the motion of the image data Enhancing coding efficiency and image quality at a low bit rate by using not only luminance information but also information such as motion vector values and phases in determining whether to use frame prediction or field prediction as compensation prediction It is a device that realized.
[0019]
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. The configuration of an image information conversion apparatus shown as a specific example of the present invention will be described with reference to FIG.
[0020]
An image information conversion apparatus 1 shown in FIG. 1 is an image information conversion apparatus that conforms to the MPEG (Moving Picture Experts Group) method, particularly the MPEG2 method, and mainly receives image data. A configuration for pre-processing, a configuration for determining an encoding mode by motion prediction and motion compensation, a configuration for performing discrete cosine transform (DCT) and quantization, and a configuration for outputting image data compressed by these transform processing It has.
[0021]
Specifically, the image information conversion apparatus 1 includes an image data input terminal 11 and a screen rearrangement circuit 12 as a configuration for inputting and preprocessing image data.
[0022]
This image data is a video signal composed of luminance (Y) and color difference signals (Cb, Cr), and although not shown, for example, it is digitized before being input from the input terminal 11, and thereafter The format has been converted to the spatial resolution used in the encoding.
[0023]
In the screen rearrangement circuit 12, this image data is converted into three pictures, i.e., I picture (intra-frame predictive encoded image), P picture (inter-frame forward predictive encoded image), and B picture (inter-frame bi-directional predictive encoded image). Sorted by each picture type. Each picture of the input image data is subjected to motion compensation and DCT encoding in this encoding order in a motion compensation circuit 24 and a DCT circuit 14 described later.
[0024]
The image information conversion apparatus 1 further includes, as a configuration for determining an encoding mode by motion prediction and motion compensation, and a configuration for performing discrete cosine transform (DCT) and quantization, specifically, a differencer 13 and a DCT circuit 14. A quantization circuit 15, a motion compensation circuit 24, and a motion prediction circuit 25. The motion compensation circuit 24 and the motion prediction circuit 25 may be integrated.
[0025]
The subtractor 13 calculates a prediction error by taking the difference between the I picture or P picture supplied from the image rearrangement circuit 12 and the image obtained by the prediction, and supplies this value to the DCT circuit 14. The DCT circuit 14 converts the image data into DCT coding coefficients. The quantization circuit 15 quantizes the DCT coding coefficient here based on the quantization scale determined by the rate control circuit 16. The quantized data is variable-length encoded in the variable-length encoding circuit 17 together with the motion vector and encoding mode information, and then stored in the buffer 18 and output as an MPEG video bit stream in accordance with the target bit rate. Output from terminal 19.
[0026]
Also, if the image data is an I picture and a P picture, it is necessary to use this DCT coding coefficient as a reference screen for motion compensation prediction in later processing. The image is locally decoded by the inverse DCT circuit 21, restored as the same image as that obtained by a decoder (not shown), passes through the adder 22, and is stored in the image memory 23.
[0027]
Next, the flow of the encoding prediction process in the image information conversion apparatus 1 shown in FIG. 1 will be described with reference to FIG. FIG. 2 shows a series of prediction mode determination processes in the image information conversion apparatus 1 as processing blocks. The motion vector calculation block 41 and the frame prediction / field prediction determination block 42 shown in FIG. This corresponds to the processing in the prediction circuit 24. The encoding block 43 corresponds to the processing in the configuration excluding the input terminal 11, the output terminal 19, the screen rearrangement circuit 12, and the motion prediction circuit 25 of FIG.
[0028]
The image data including the input color difference signal (input image color difference signal 30) and the luminance signal (input image luminance signal 31) is sent to the encoding block 43. At this time, the input image luminance signal 31 is also sent to the motion vector calculation block 41.
[0029]
The field prediction motion vector 32 and the frame prediction motion vector 33 calculated from the input image luminance signal 31 by the motion vector calculation block 41 are sent to the encoding block 43. The field prediction residual 34 and the frame prediction residual 35 calculated by the motion vector calculation block 41 are sent to the frame prediction / field prediction determination block 42. At this time, the field prediction motion vector 32 and the frame prediction motion vector 33 are also sent to the frame prediction / field prediction determination block 42. In the frame prediction / field prediction determination block 42, whether to use frame prediction or field prediction is selected from the field prediction motion vector 32, the frame prediction motion vector 33, the field prediction residual 34, and the frame prediction residual 35.
[0030]
The motion compensation circuit 24 is, for example, a DSP (digital signal processor). As one method, the motion compensation circuit 24 calculates a motion vector square prediction error in the case of field prediction and frame prediction, and compares them to determine the minimum. The prediction method is determined. Here, the size comparison may be performed by adding an offset to one of the prediction residuals so that frame prediction with good compression efficiency is easily selected.
[0031]
The determination result of the frame / field prediction in the frame prediction / field prediction determination block 42 is sent to the encoding block 43 as the frame / field prediction selection signal 36, and the encoding block 43 uses the motion vector of the selected prediction method. To encode. Then, the bit stream 38 is output from the variable length coding circuit 17 shown in FIG.
[0032]
Next, an example of the encoded prediction mode determination process in the motion compensation circuit 24 and the motion prediction circuit 25 will be described with reference to FIG. In the image information conversion apparatus 1 shown as a specific example of the present invention, when determining the encoding prediction mode, the phase or magnitude of the value of the motion vector is used for the encoding prediction mode determined from the luminance information. An optimal prediction mode with ideal encoding efficiency and little image degradation is selected.
[0033]
First, in step S1, the motion compensation circuit 24 determines whether to use frame prediction or field prediction as a coding unit of image data from luminance information based on a conventional general-purpose method.
[0034]
When field prediction is selected in step S1, the encoding prediction method described above is repeated for the next macroblock without changing the prediction method for this macroblock.
[0035]
On the other hand, if frame prediction is selected in step S1, whether or not the motion vector of the chrominance signal points to a pixel on the opposite field to the pixel indicated by the motion vector of the luminance signal in the frame prediction reference image in step S2 Is determined. Here, for example, the determination is made based on the following conditions. That is, when the remainder of dividing the vertical component of the frame prediction motion vector by 4 is 2, the selected color difference signal is a pixel on the opposite field (reverse phase).
if ((vertical component of the magnitude of the frame prediction motion vector)% 4 == 2) At this time, the color difference signal is in reverse phase.
[0036]
else Not in reverse phase.
[0037]
If it is determined in step S2 that the selected color difference signal is a pixel on the opposite field (reverse phase), the compression efficiency is reduced in the frame prediction, so the field prediction is changed in step S3. In addition, if it is not detected that the motion vector of the color difference signal indicates a pixel in reverse phase, the compression efficiency is not lowered even in the frame prediction, and the frame compression is applied because good compression efficiency is obtained.
[0038]
If the determination of all macroblock types in this frame is completed in step S4, the process proceeds to step S5, and the encoding prediction mode of the next frame is determined.
[0039]
According to the above-described processing, by determining the encoding prediction mode using the phase and magnitude of the motion vector value in addition to the luminance signal, the motion vector of the color difference signal indicates a pixel having an opposite phase and the compression efficiency is improved. Since it is possible to avoid the case where the deterioration is caused, it is possible to realize high image quality and reduce the amount of calculation. Therefore, the circuit scale can be reduced.
[0040]
By the way, when the amount of motion is small, the value of the motion vector is considered to be substantially the same in the top field and the bottom field of the color difference signal. Therefore, when the amount of motion is small, the amount of motion vector motion is introduced to determine the coding mode. Next, this method will be described.
[0041]
Steps S11, S12, S14, S15, and S16 in FIG. 4 correspond to steps S1, S2, S3, S4, and S5 shown in FIG. 3, respectively. In this method, if it is determined in step S12 that the motion vector of the color difference signal indicates a pixel in reverse phase, in step S13, it is determined whether or not the compression efficiency is reduced by using the magnitude (motion amount) of the motion vector. Judging.
[0042]
For example, if the vertical component and the horizontal component of the magnitude of the motion vector of the luminance signal are each smaller than a predetermined threshold, the amount of motion is assumed to be small, and even if the motion vector of the color difference signal points to a pixel in reverse phase, the compression efficiency Therefore, the frame prediction is applied as it is without changing to the field prediction.
[0043]
On the other hand, if the vertical and horizontal components of the motion vector size exceed a predetermined threshold, the selected color difference signal becomes a pixel on the opposite field (reverse phase) and the deterioration of image quality becomes noticeable. Change mode to field prediction.
[0044]
Therefore, according to the image information conversion apparatus 1 described above, using the motion vector value, the phase, and the magnitude of the luminance information in addition to the luminance information, the frame prediction is used as the prediction mode of the encoding unit or the field prediction is used. Therefore, it is possible to select an optimal encoding prediction mode in which image degradation is minimized at a low bit rate.
[0045]
It should be noted that the present invention is not limited to the above-described embodiments, and various modifications can be made without departing from the scope of the present invention.
[0046]
【The invention's effect】
As described above in detail, the image information conversion method according to the present invention performs the color difference signal when performing the motion compensation prediction in the frame in which the top field and the bottom field are combined as the motion compensation prediction of the coding unit of the image data. If the motion vector in the field indicates a pixel in a field opposite to the field in which the pixel pointed to by the motion vector in the luminance signal exists, the reduction in compression efficiency is avoided by changing the motion compensation prediction in the coding unit to field prediction. High image quality can be achieved. In addition, the amount of calculation can be reduced and the circuit scale can be reduced.
[0047]
In particular, in the determination step, a predetermined threshold is set for the magnitude of the motion vector of the luminance signal, and when this threshold is exceeded, the encoding mode determination is changed to field prediction to avoid a reduction in compression efficiency. High image quality can be achieved. In addition, the amount of calculation can be reduced and the circuit scale can be reduced.
[0048]
In addition, the image information conversion apparatus according to the present invention performs motion compensation prediction on a frame in which a top field and a bottom field are combined as motion compensation prediction of a coding unit of image data. A determination unit for determining whether or not a pixel in a field opposite to a field in which a pixel indicated by a motion vector exists is present, and whether motion compensation prediction of a coding unit is set to frame prediction or field prediction according to the motion vector. Prediction mode control means for controlling whether or not, and in the discrimination means, it is determined that the motion vector in the color difference signal indicates a pixel in a field opposite to the field in which the pixel indicated by the motion vector in the luminance signal exists. The prediction mode control means changes the motion compensation prediction of the coding unit to field prediction. More can be realized avoiding image quality deterioration of compression efficiency. In addition, the amount of calculation can be reduced and the circuit scale can be reduced.
[0049]
In particular, a predetermined threshold is set for the magnitude of the motion vector of the luminance signal, and when this threshold is exceeded, the encoding mode determination is changed to field prediction, thereby avoiding a reduction in compression efficiency and improving the image quality. realizable. In addition, the amount of calculation can be reduced and the circuit scale can be reduced.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration of an image information conversion apparatus shown as a specific example of the present invention.
FIG. 2 is a diagram illustrating processing blocks in the image information conversion apparatus shown in FIG. 1;
FIG. 3 is a flowchart for explaining a field / frame prediction determination process in the image information conversion apparatus shown in FIG. 1;
FIG. 4 is a flowchart for explaining another example of the field / frame prediction determination process in the image information conversion apparatus shown in FIG. 1;
FIG. 5 is a schematic diagram for explaining a case where a motion vector of a color difference signal indicates a pixel on a field opposite to a luminance signal in frame prediction.
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 1 Image information converter, 11 Input terminal, 12 Screen rearrangement circuit, 13 Differentiator, 14 DCT circuit, 15 Quantization circuit, 16 Rate control circuit, 17 Variable length encoding circuit, 18 Buffer, 19 Output terminal, 20 Reverse Quantization circuit, 21 inverse DCT circuit, 22 adder, 23 image memory, 24 motion compensation circuit, 25 motion prediction circuit, 30 input image chrominance signal, 31 input image luminance signal, 32 field prediction motion vector, 33 frame prediction motion vector , 34 frame prediction residual, 35 field prediction residual, 36 frame prediction / field prediction selection signal, 37 reference image data, 38 bit stream, 41 motion vector operation block, 42 frame prediction / field prediction determination block, 43 encoding block

Claims

In an image information conversion method for obtaining image compression information by compressing image data by applying orthogonal transformation and motion compensation,
When motion compensation prediction is performed in a frame in which a top field and a bottom field are combined as motion compensation prediction in a coding unit of image data, a field in which a motion vector in a color difference signal includes a pixel indicated by a motion vector in a luminance signal An image information conversion method characterized by changing motion-compensated prediction of a coding unit to field prediction when referring to a pixel in the opposite field.

2. The image information conversion method according to claim 1, wherein when the magnitude of the motion vector of the luminance signal exceeds a predetermined threshold, the motion compensated prediction of the coding unit is changed to field prediction.

2. The image information conversion method according to claim 1, wherein the image compression information conforms to an MPEG (Moving Picture Experts Group) standard.

In an image information conversion device that obtains image compression information by compressing image data by applying orthogonal transformation and motion compensation,
When motion compensation prediction is performed in a frame in which a top field and a bottom field are combined as motion compensation prediction in a coding unit of image data, a field in which a motion vector in a color difference signal includes a pixel indicated by a motion vector in a luminance signal A discriminating means for discriminating whether or not it points to a pixel in the opposite field;
Prediction mode control means for controlling whether motion compensation prediction of a coding unit is to be frame prediction or field prediction according to a motion vector;
When the determination unit determines that the motion vector in the color difference signal indicates a pixel in a field opposite to the field in which the pixel indicated by the motion vector in the luminance signal exists, the prediction mode control unit An image information conversion apparatus characterized by changing compensation prediction to field prediction.

5. The image information conversion apparatus according to claim 4, wherein when the magnitude of the motion vector of the luminance signal exceeds a predetermined threshold, the motion compensated prediction of the coding unit is changed to field prediction.

5. The image information conversion apparatus according to claim 4, wherein the image compression information conforms to an MPEG (Moving Picture Experts Group) standard.