JP3629826B2

JP3629826B2 - Information signal encoding apparatus, encoding method, and information signal decoding method

Info

Publication number: JP3629826B2
Application number: JP19965996A
Authority: JP
Inventors: 哲二郎近藤; 泰弘藤森; 健治高橋; 邦雄川口
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1995-07-28
Filing date: 1996-07-10
Publication date: 2005-03-16
Anticipated expiration: 2016-07-10
Also published as: JPH09102743A

Description

【０００１】
【発明の属する技術分野】
この発明は、例えばディジタルオーディオ信号、ディジタル画像信号等のディジタル情報信号の発生データ量を低減するようにした情報信号符号化装置、符号化方法、および情報信号復号方法に関する。
【０００２】
【従来の技術】
ディジタルオーディオ信号、ディジタル画像信号等の伝送情報量を低減するために、予測符号化が知られている。例えば１次元ＤＰＣＭは、時間または空間方向において、入力サンプル値と予測値との差分（残差）を形成し、２次元ＤＰＣＭは、空間方向において入力サンプル値と予測値との残差を形成する。ディジタル情報信号は、時間方向、空間方向の相関を有しているので、残差信号のレベルがサンプル値よりも小さいものとなる。従って、残差信号を元の量子化ビット数より少ないビット数により再量子化することが可能で、それによって、情報量を圧縮できる。
【０００３】
残差信号を対象とする量子化装置としては、０付近の量子化ステップ幅を細かくし、レベルが大きいほど、量子化ステップ幅を粗くする非線形量子化装置が周知である。この非線形量子化を含めて従来の量子化装置は、発生しうる残差信号の全てのレベルを量子化対象としている。例えばディジタル画像信号の１サンプル（１画素）が８ビットに量子化されている場合、残差信号としては、（−２５５〜＋２５５）の値が存在しうる。従来の量子化装置は、この全範囲を量子化の対象としている。
【０００４】
【発明が解決しようとする課題】
従来の量子化装置は、発生しうる残差信号の全範囲を量子化の対象とするので、量子化ビット数を少なくした場合には、量子化精度が低下し、量子化ビット数を多くすると発生データ量が多くなる問題があった。その結果、復号した時に、充分な品質のオーディオ信号、画像信号が得られない問題があった。
【０００５】
従って、この発明の目的は、残差信号を量子化する時に、さらに、量子化出力のデータ量の削減が可能な情報信号符号化装置、符号化方法、並びに復号方法を提供することにある。すなわち、残差信号をブロック化し、ブロック毎の残差信号のレベル範囲がある条件を満たしている場合では、量子化ビット数をより少なくするように、量子化モードを切り替えるものである。
【０００６】
【課題を解決するための手段】
請求項１の発明は、入力ディジタル情報信号を発生データ量を少なくするように符号化する情報信号符号化装置において、
入力ディジタル情報信号と入力ディジタル情報信号の予測値との残差信号を生成する手段と、
残差信号をブロック化する手段と、
ブロック毎に、最大値および最小値を検出する手段と、
最大値および最小値からブロック毎の残差信号の存在範囲が０を跨ぐものかどうかを決定し、存在範囲が０を跨ぐ場合に第１の量子化モードを指示し、存在範囲が０を跨がない場合に第２の量子化モードを指示するモード決定手段と、
第１の量子化モードでは、元のビット数より少ない所定のビット数で、残差信号を量子化し、第２の量子化モードでは、元のビット数より少ないビット数で残差信号を量子化するとともに、コード変換によりビット数をより少なくするようになされた量子化手段と、
第１および第２の量子化モードを識別する情報と、
量子化手段の出力とを伝送する伝送手段と
からなることを特徴とする情報信号符号化装置である。
【０００７】
請求項３の発明は、入力ディジタル情報信号を発生データ量を少なくするように符号化する情報信号符号化方法において、
入力ディジタル情報信号と上記入力ディジタル情報信号の予測値との残差信号を生成するステップと、
残差信号をブロック化するステップと、
ブロック毎に、最大値および最小値を検出するステップと、
最大値および最小値からブロック毎の残差信号の存在範囲が０を跨ぐものかどうかを決定し、存在範囲が０を跨ぐ場合に第１の量子化モードを指示し、存在範囲が０を跨がない場合に第２の量子化モードを指示するモード決定のステップと、
第１の量子化モードでは、元のビット数より少ない所定のビット数で、残差信号を量子化し、第２の量子化モードでは、元のビット数より少ないビット数で残差信号を量子化するとともに、コード変換によりビット数をより少なくする量子化のステップと、
第１および第２の量子化モードを識別する情報と、
量子化手段の出力とを伝送するステップと
からなることを特徴とする情報信号符号化方法である。
【０００８】
請求項４の発明は、入力ディジタル情報信号から少なくとも第１および第２の階層データを形成し、第１および第２の階層データを符号化して伝送するようにした情報信号符号化装置において、
第１の階層データより解像度がより低い第２の階層データを形成する手段と、
第２の階層データから第１の階層データを予測する手段と、
予測されたデータと第１の階層データとの残差信号を形成する手段と、
残差信号をブロック化する手段と、
ブロック毎に、最大値および最小値を検出する手段と、
最大値および最小値からブロック毎の残差信号の存在範囲が０を跨ぐものかどうかを決定し、存在範囲が０を跨ぐ場合に第１の量子化モードを指示し、存在範囲が０を跨がない場合に第２の量子化モードを指示するモード決定手段と、
第１の量子化モードでは、元のビット数より少ない所定のビット数で、残差信号を量子化し、第２の量子化モードでは、元のビット数より少ないビット数で残差信号を量子化するとともに、コード変換によりビット数をより少なくするようになされた量子化手段と、
第１および第２の量子化モードを識別する情報と、
量子化手段の出力とを伝送する伝送手段と
からなることを特徴とする情報信号符号化装置である。
【０００９】
請求項５の発明は、入力ディジタル情報信号から少なくとも第１および第２の階層データを形成し、第１および第２の階層データを符号化して伝送するようにした情報信号符号化方法において、
第１の階層データより解像度がより低い第２の階層データを形成するステップと、
第２の階層データから第１の階層データを予測するステップと、
予測されたデータと第１の階層データとの残差信号を形成するステップと、
残差信号をブロック化するステップと、
ブロック毎に、最大値および最小値を検出するステップと、
最大値および最小値からブロック毎の残差信号の存在範囲が０を跨ぐものかどうかを決定し、存在範囲が０を跨ぐ場合に第１の量子化モードを指示し、存在範囲が０を跨がない場合に第２の量子化モードを指示するモード決定のステップと、
第１の量子化モードでは、元のビット数より少ない所定のビット数で、残差信号を量子化し、第２の量子化モードでは、元のビット数より少ないビット数で残差信号を量子化するとともに、コード変換によりビット数をより少なくする量子化のステップと、
第１および第２の量子化モードを識別する情報と、
量子化出力とを伝送する伝送のステップと
からなることを特徴とする情報信号符号化方法である。
【００１０】
請求項６の発明は、入力ディジタル情報信号と入力ディジタル情報信号の予測値との残差信号を生成され、残差信号をブロック化され、ブロック毎に、最大値および最小値が検出され、最大値および最小値からブロック毎の残差信号の存在範囲が０を跨ぐものかどうかが決定され、存在範囲が０を跨ぐ場合に第１の量子化モードが指示され、存在範囲が０を跨がない場合に第２の量子化モードが指示され、第１の量子化モードでは、残差信号が元のビット数より少ないビット数で量子化され、第２の量子化モードでは、残差信号が元のビット数より少ないビット数で量子化されるとともに、コード変換によりビット数をより少なくするように符号化され、第１および第２の量子化モードを識別する情報と、残差信号の情報とが伝送される情報信号符号化方法に対する復号方法において、
識別情報に基づいて、第１の量子化モードでは、データを逆量子化し、第２の量子化モードでは、データをコード変換するとともに、逆量子化する逆量子化のステップと、
逆量子化された残差信号をブロック分解し、元の順序へ変換するステップとからなることを特徴とする情報信号復号方法である。
【００１１】
残差信号のレベル分布の集中度を高めることによって、残差信号を元の量子化ビット数より少ないビット数で再量子化することができる。また、残差信号の分布がある条件を満足する時には、残差信号をより少ないビット数で量子化するが可能となる。
【００１２】
【発明の実施の形態】
以下、この発明の一実施例について図面を参照して説明する。この一実施例では、ビデオ信号が所定のサンプリング周波数でサンプリングされ、各サンプルが所定の量子化ビット数へ変換されたディジタル画像信号に対して、この発明が適用される。図１は、この発明の一実施例のシステムの構成を全体的に示す。
【００１３】
図１において、１２１で示す入力端子にディジタルビデオ信号が供給される。入力信号が減算器１２３に供給され、減算器１２３の出力（残差信号）がブロック化回路１２４および予測器１２２に供給され、予測器１２２で生成された予測信号が減算器１２３に供給される。減算器１２３は、入力信号から予測信号を減算し、予測残差を発生する。この残差信号がブロック化回路１２４に供給され、ラスター走査の順序からブロックの順序のデータへ変換される。ブロック化された残差信号が符号化ユニット１２５に供給される。
【００１４】
符号化ユニット１２５中の量子化回路は、後述するように、ブロック化された残差信号の度数分布に応じて、第１の量子化モードと第２の量子化モードとが切り替えられるものである。例えば残差信号のビット数が８ビットの場合、第１の量子化モードが選択される時には、符号化ユニット１２５から３ビットの量子化出力が発生する。第２の量子化モードが選択される時には、符号化ユニット１２５内のコード変換テーブルによって、２ビットに変換された量子化出力が発生する。
【００１５】
符号化ユニット１２５の符号化出力がエラー訂正符号エンコーダ１２６に供給され、エラー訂正符号の冗長コードが付加される。エラー訂正符号エンコーダ１２６の出力が変調部１２７に供給される。変調部１２７は、記録、伝送等に適した形態にディジタル信号を変調する。変調部１２７からの出力信号が記録ユニット１２８に供給され、記録ユニット１２８によって記録信号が情報信号記録媒体１２９に記録される。また、伝送路１３０を介してデータを伝送することも可能で、その場合では、記録ユニット１２８の代わりに伝送ユニットが使用される。量子化モードを指示するモード信号も符号化残差信号とともに、記録、あるいは伝送される。情報信号記録媒体１２９は、磁気、光磁気、相変化等を利用したディスク状、あるいはテープ状の記録媒体である。半導体メモリも一種の記録媒体である。
【００１６】
記録媒体１２９からデータを再生ユニット１３１が再生し、または伝送路１３０を介して伝送されたデータが受信される。再生ユニット１３１により再生されたデータが復調部１３２により復調され、復調出力がエラー訂正符号のデコーダ１３３に供給される。このデコーダ１３３は、冗長コードを利用してエラーを訂正し、また、訂正できないで残ったエラーを目立たないように修整する。
【００１７】
エラー訂正デコーダ１３３の出力が復号化ユニット１３４に供給される。復号化ユニット１３４は、後述するように、符号化ユニット１２５と逆に量子化出力を代表値（量子化復元値）に変換する、逆量子化の処理を行う。復号化ユニット１３４中の逆量子化回路は、モード信号により指示される量子化モードに従って逆量子化の処理を行う。復号化ユニット１３４から復号された残差信号が発生する。この復号残差信号がブロック分解回路１３５に供給される。ブロック分解回路１３５では、ブロック構造がラスター走査の順序に戻される。
【００１８】
復号残差信号が加算回路１３６に供給される。加算回路１３６により復号画像信号が形成され、出力端子１３７に取り出される。また、この復号画像信号が予測器１３８に供給され、予測信号が生成される。予測信号が加算回路１３６に供給される。
【００１９】
図２は、符号化ユニット１２５の一例を示す。ブロック化回路１２４からのブロック化された残差信号が入力端子１に供給される。図３は、残差信号の形成を概略的に示すものである。図３における一つの矩形の領域が１つの画素と対応している。ａ〜ｈのそれぞれは、局部復号された画素値を示し、Ａ〜Ｐは、符号化される前の画素値を示す。画素値Ａに対しての予測値Ａ’は、近傍の局部復号画素値を使用して予測器１２２により生成される。例えば予測値Ａ’は、Ａ’＝４ｃ−３（ｂ−ｆ）、Ａ’＝ｆ＋ｃ−ｂ等の予測式に従って形成される。画素値Ｂ、Ｃ、・・・に対する予測値も同様の予測式によって計算される。一般式で表すと、予測値は、（αａ＋βｂ＋γｆ、但し、α、β、γは定数）により生成される。
【００２０】
減算器１２３では、画素値（例えばＡ）から予測値（例えばＡ’）が減算され、残差信号Δａが生成される。同様に、残差信号Δｂ、Δｃ、・・・が生成される。ブロック化回路１２４では、生成された残差信号がブロック構造に変換される。例えばブロック化回路１２４によって、図３Ａの太線の枠で示すように、（４×４）のブロックと対応する残差信号Δａ〜Δｐのブロックのデータが形成される。なお、ディジタルオーディオ信号を扱う場合には、時間方向の予測値が形成され、１次元の残差信号のブロックが形成される。
【００２１】
残差信号のレベル範囲は、ブロック化することによって、集中度を高めることが可能である。１画素が８ビットのデータの場合では、１画面の全体の残差信号の発生度数の分布は、０を中心として（−２５５〜＋２５５）の範囲のものであり、残差が０の度数が最大となる。しかしながら、ブロックに分割した場合には、残差の度数分布がもとの分布に比してより集中したものとなる。
【００２２】
これは、１画面と比較して小空間のブロック内の残差は、大きな値となるものが確率的に少なく、また、ブロック内では残差の相関が強いことに因る。従って、ブロック化された残差信号を元の量子化ビット数より少ないビット数でもって再量子化することができる。また、ブロック化された残差信号の分布において、必ずしも０の値の度数が最大とならない。例えば、ブロック内で輝度のレベルが例えば対角線方向に除々に変化する場合等では、０の値の度数が最大とならない分布が生じる。なお、残差信号の分布の集中度を高める方法は、ブロック化が一例であって、これ以外の方法も可能である。
【００２３】
図２に戻って、符号化ユニット１２５について説明する。入力端子１からの残差信号が最大値検出回路２、最小値検出回路３、遅延回路４に供給される。最大値検出回路２では、ブロック毎の最大値ＭＡＸが検出され、最小値検出回路３では、ブロック毎の最小値ＭＩＮが検出される。検出された最大値ＭＡＸおよび最小値ＭＩＮがモード決定回路５へ供給される。
【００２４】
モード決定回路５には、ブロック化された残差信号、最大値ＭＡＸおよび最小値ＭＩＮが供給され、そのブロックの残差信号のレベル範囲に基づいて、量子化モードが決定され、モードを指示する信号ＭＯＤＥ（例えば１ビット）が生成される。モード信号ＭＯＤＥが量子化回路６に供給され、また、量子化回路６に対して、入力端子１からの残差信号が位相合わせのための遅延回路４を介して供給される。量子化回路６では、残差信号が元の量子化ビット数（例えば８ビット）より少ない量子化ビット数により再量子化される。量子化回路６では、第１および第２の量子化モードがモード信号ＭＯＤＥに従って選択される。
【００２５】
第１の量子化モードは、８ビットの残差信号を再量子化して、より少ない量子化ビット数例えば３ビットの量子化出力を発生する、通常のものである。第２の量子化モードは、ブロック化された残差信号のレベル範囲が０を跨がない場合に適用される。この一実施例では、残差信号の最小値ＭＩＮが（Δ／２）よりも大きい場合には、第２の量子化モードが適用される。ここで、Δは、再量子化ビット数をｎとするときに、ＤＲ／２^ｎで計算される量子化ステップである。ＤＲは、ダイナミックレンジで、（ＭＡＸ−ＭＩＮ）により計算できる。
【００２６】
第２の量子化モードでは、第１の量子化モードと同様に再量子化で発生したコード信号（例えば３ビット）が所定の規則に従って変換され、より少ないビット数例えば２ビットの量子化出力に変換される。３ビットのコードを２ビットの量子化出力に変換するコード変換の規則を下記に示す。
１００ − ００
１０１ − ０１
１１０ − １０
１１１ − １１
【００２７】
量子化処理について図４を参照して説明する。上述したように予測符号化により発生した残差信号は、１フレーム全体で見ると、０の値の度数が最大となるが、ブロック化したものは、図４Ａに示すように、そのブロックによってさまざまな偏りを持っている。図４Ａは、負側に偏った残差信号の分布、最大度数が０の値と一致した残差信号の分布、正側に偏った残差信号の分布を示す。
【００２８】
図４Ｂは、０の値を跨がるような残差信号の度数分布を示している。残差信号を量子化する場合では、例えばＭＡＸおよびＭＩＮの間が２^３＝８等分され、その範囲に属する残差信号に同一のコードが割り当てられる。そのコードは、復元されると、範囲の中央の代表値へ変換される。一般的には、コードのビット数がｎの場合、２^ｎの個数の範囲にＭＡＸおよびＭＩＮの間が分割される。このような量子化が第１の量子化モードである。
【００２９】
図４Ｃは、ブロック化された残差信号の度数分布の例を示す。すなわち、図４Ｃには、最小値ＭＩＮがΔ／２に等しいような分布３０ａ、図４Ｂのものと同様に、０を跨がる分布３０ｂ（破線で示す）、最大値ＭＡＸが−Δ／２に等しい分布３０ｃ（破線で示す）が示されている。この一実施例では、度数分布３０ａのように、（ＭＩＮ≧Δ／２）の条件を満たす場合では、第２の量子化モードが適用され、第２の量子化モードにおいて、上述したようなコード変換がなされる。復号側においては、２ビットから３ビットへの逆方向のコード変換がなされ、そして、３ビットのコードが逆量子化される。
【００３０】
例えば図４Ｃにおける度数分布３０ａを有する残差信号を第２の量子化モードで量子化すると、ＭＩＮ〜ＭＡＸの範囲の１００〜１１１の量子化出力が００〜１１の量子化出力に変換される。復号側では、逆に００〜１１の量子化出力が１００〜１１１のコードに変換される。この変換された３ビットのコードを第１の量子化モードで発生した３ビットの量子化出力と同様に、代表値に変換することによっで、逆量子化を行うことができる。
【００３１】
この発明の一実施例と異なり、ブロックの残差信号の最大値ＭＡＸが−Δ／２より小さい場合（例えば図４Ｃにおける度数分布３０ｃ）に、第２の量子化モードを適用するようにしても良い。さらに、最小値ＭＩＮが（ＭＩＮ≧０）の場合、あるいは最大値ＭＡＸが（ＭＡＸ≦０）の場合に、第２の量子化モードを適用するようにしても良い。
【００３２】
量子化回路６の一例を図５に示す。１５で示す入力端子に対して、ブロック化された残差信号が遅延回路４から供給される。モード信号ＭＯＤＥによって切り替えられるスイッチング回路１６の入力端子に入力残差信号が供給される。スイッチング回路１６の出力端子ａおよびｂには、例えば３ビットの出力を発生する量子化器１７ａおよび１７ｂがそれぞれ接続される。量子化器１７ｂに対してコード変換回路１８が接続される。コード変換回路１８は、上述したような３ビットを２ビットに変換する回路である。量子化器１７ａまたは１７ｂの出力が出力端子１９に取り出される。
【００３３】
モード決定回路５によって、そのブロックの残差信号に関して、（ＭＩＮ≧Δ／２）である場合には、モード信号ＭＯＤＥが第２の量子化モードを指示し、一方、それ以外の場合には、モード信号ＭＯＤＥが第１の量子化モードを指示する。第１の量子化モードでは、スイッチング回路１６の出力端子ａが選択され、第２の量子化モードでは、スイッチング回路１６の出力端子ｂが選択される。従って、第１の量子化モードでは、３ビットの量子化出力が出力端子１９に発生し、第２の量子化モードでは、２ビットの量子化出力が出力端子１９に発生する。なお、量子化器１７ａ、１７ｂを共通としても良い。
【００３４】
図２に戻って説明すると、量子化回路６の出力がビットプレーン符号化回路７に供給される。ビットプレーン符号化回路７は、ｎビット例えば２ビットのコードｑをＭＳＢ（最上位ビット）プレーンとＬＳＢ（最下位ビット）プレーンに分割する。ＭＳＢプレーンは、供給される２ビット量子化値のＭＳＢの集合であり、ＬＳＢプレーンは、ＬＳＢの集合である。図６Ａは、簡単のため、１画面が（４×３＝１２ブロック）で構成され、各ブロックに（４×４）の残差信号のコードｑが含まれる場合を示している。
【００３５】
図６Ａにおいて、例として示されるコードｑの値の０，１，２，３は、それぞれ２ビットで（００），（０１），（１０），（１１）を意味する。図６Ａの例では、ビットプレーン符号化回路７が図６Ｂに示すように、例えば１画面のコードｑをＭＳＢプレーンおよびＬＳＢプレーンへ分割する。量子化ビット数が３ビットの場合では、さらに、中間のビットのプレーンが形成される。この発明の一実施例では、二つの量子化モードによって２ビットあるいは３ビットの量子化出力が発生する。ビットプレーンを形成する場合では、量子化出力のビット数が分かっている必要がある。そのため、モード信号ＭＯＤＥがビットプレーン符号化回路７に供給されている。
【００３６】
ビットプレーン符号化回路７で生成された各ビットプレーンが可変長符号化回路８に供給され、各ビットプレーンが可変長符号化される。可変長符号化回路８では、ビットプレーン毎にランレングス符号化、例えばＭＭＲ（ＭｏｄｅｆｉｅｄＭＲ）が行われる。可変長符号化回路８の出力がフレーミング回路９に供給される。フレーミング回路９には、上述したモード信号ＭＯＤＥも供給される。フレーミング回路９の出力端子１０には、これらのモード信号ＭＯＤＥと可変長符号化されたコードが所定のフレーム構造のデータとして出力される。
【００３７】
次に、図７を参照して、図１中の復号化ユニット１３４の一例について説明する。エラー訂正デコーダ１３３からの再生、あるいは受信データが復号化ユニット１３４の入力端子３１に供給される。フレーム分解回路３２によって、可変長符号化された残差信号のコードと、モード信号ＭＯＤＥとが分離される。分離された残差信号のコードが可変長復号化回路３３により復号される。可変長復号化回路３３は、可変長符号化回路１３と対応している。可変長復号化回路３３の復号出力がビットプレーン復号化回路３４に供給され、ビットプレーンが合成される。この場合、分離されたモード信号ＭＯＤＥによって、ビットプレーンが正しく合成される。
【００３８】
ビットプレーン復号化回路３４の出力がスイッチング回路３５の入力端子に供給される。スイッチング回路３５は、モード信号ＭＯＤＥにより制御される。モード信号ＭＯＤＥが第１の量子化モードを指示する場合では、スイッチング回路３５の出力端子ａが選択され、これが第２の量子化モードを指示する場合では、スイッチング回路３５の出力端子ｂが選択される。出力端子ａには、３ビットの量子化コードを逆量子化して、量子化復元値を発生する逆量子化器３６ａが接続されている。出力端子ｂには、２ビットの量子化出力を３ビットのコードに変換するコード変換回路３７が接続され、コード変換回路３７に対して逆量子化器３６ｂが接続されている。逆量子化器３６ｂは、逆量子化器３６ａと同様に、３ビットのコードを復元値へ変換するものである。
【００３９】
逆量子化器３６ａ、３６ｂにより生成された復元値、すなわち、ブロック化された復号残差信号が出力端子３８に取り出される。出力端子３８に取り出された復号残差信号がブロック分解回路１３５に供給される（図１参照）。なお、逆量子化器３６ａ、３６ｂを共通としても良い。
【００４０】
次に、この発明を階層符号化に対して適用した他の実施例について説明する。ここで説明する階層符号化装置は、階層間で予測を行ない、また、階層間データに対し単純な算術式を用いることで、符号化対象画素数の増加を防止することができるものである。
【００４１】
図８を参照してこの階層符号化方法について説明する。図８は、一例として第１階層を最下位階層（原画）とし、第４階層を最上位階層とする４階層からなる階層間の模式図を示している。例えば、上位階層データ生成法として、空間的に対応する４画素の下位階層データの平均値を採用する場合、伝送画素数が増加しないようにできる。
【００４２】
すなわち、上位階層データをＭ、下位階層画素値をｘ_０、ｘ_１、ｘ_２、ｘ_３とすると、
Ｍ＝１／４・（ｘ_０＋ｘ_１＋ｘ_２＋ｘ_３）
によりデータＭが形成される。そして、データＭと、４個のデータの内の例えばｘ_３以外の３個のデータを伝送する。受信あるいは再生側では、
ｘ_３＝４・Ｍ−（ｘ_０＋ｘ_１＋ｘ_２）
という単純な算術式により非伝送画素ｘ_３を容易に復元することができる。図８において、斜線の矩形は、非伝送画素を示している。
【００４３】
図９は、上述した平均化を使用する例えば５階層の階層符号化の構成を示す。第１階層が入力画像の解像度レベルであるとする。この第１階層のブロックサイズは、（１×１）である。第２階層データは、第１階層データの４画素平均により生成される。この例では、第１階層データＸ_１（０）〜Ｘ_１（３）の平均値により、第２階層データＸ_２（０）が生成される。Ｘ_２（０）に隣接する第２階層データＸ_２（１）〜Ｘ_２（３）も同様に第１階層データの４画素平均により生成される。この第２階層のブロックサイズは、（１／２×１／２）である。
【００４４】
さらに、第３階層データは、空間的に対応する第２階層データの４画素平均により生成される。この第３階層のブロックサイズは、（１／４×１／４）である。また、第４階層のデータも同様に第３階層のデータから生成される。この第４階層のブロックサイズは、（１／８×１／８）である。最後に、最上位階層である第５階層データＸ_５（０）が第４階層データＸ_４（０）〜Ｘ_４（３）の平均値により生成される。この第５階層のブロックサイズは、（１／１６×１／１６）である。
【００４５】
上述した符号化対象画素数の増加を防止した階層構造データに対し、上位階層データにクラス分類適応予測を適用することで、下位階層データを予測し、下位階層データとその予測値との差分（すなわち、残差信号）を生成することで伝送データ量の削減を図ることができる。図１０は、そのような符号化ユニットを示す。入力端子４１を介して第１階層データｄ０が入力画像データｄ０として平均化回路４２および減算器４６へ供給される。第１階層データが元の解像度の画像データである。
【００４６】
入力画素データｄ０は、平均化回路４２において、１／４平均処理が実行され、階層データｄ１が生成される。この階層データｄ１は、図９に示す第２階層データに対応する。生成された階層データｄ１は、平均化回路４３および減算器４７へ供給される。
【００４７】
階層データｄ１に対して、平均化回路４３では、平均化回路４２と同様な処理が施され、階層データｄ２が生成される。この階層データｄ２は、第３階層データに対応する。生成された階層データｄ２は、平均化回路４４および減算器４８へ供給される。また、平均化回路４４でも同様に階層データｄ２に対して１／４平均処理がなされ、階層データｄ３が生成される。この階層データｄ３は、第４階層データに対応する。生成された階層データｄ３は、平均化回路４５および減算器４９へ供給される。さらに、平均化回路４５でも同様に階層データｄ３に対して１／４平均処理がなされ、階層データｄ４が生成される。この階層データｄ４は、第５階層データに対応する。生成された階層データｄ４は、量子化器５４へ供給される。
【００４８】
そして、これら５つの階層データについて階層間で予測が行われる。先ず、第５階層においてなされる圧縮のための量子化処理は、量子化器５４によりなされる。この量子化器５４の出力データｄ２１が可変長符号のエンコーダ７１に供給されると共に、逆量子化器５８へも供給される。エンコーダ７１の出力が出力端子７６に第５階層のデータとして取り出される。符号化データｄ２１が供給された逆量子化器５８の出力データｄ１６がクラス分類適応予測回路６２へ供給される。
【００４９】
クラス分類適応予測回路６２では、データｄ１６を使用して予測処理がなされ、第４階層データの予測値ｄ１２が生成され、この予測値ｄ１２が減算器４９へ供給される。この減算器４９では、平均化回路４４から供給される階層データｄ３と予測値ｄ１２との差分値が求められ、その差分値ｄ８が量子化器５３へ供給される。
【００５０】
差分値ｄ８が供給された量子化器５３では、量子化器５４と同様に量子化ビット数を低減するように、再量子化がなされる。この量子化器５３の出力データが演算器６６および逆量子化器５７へ供給される。この演算器６６では、４画素から１画素を間引く処理が行われる。演算器６６から出力されるデータｄ２０が可変長符号のエンコーダ７０で符号化され、エンコーダ７０の出力が出力端子７５に第４階層の出力データとして取り出される。
【００５１】
クラス分類適応予測回路６２により予測された第４階層データｄ１２と、逆量子化器５７の出力データ（復号残差信号）ｄ１５がクラス分類適応予測回路６１へ供給される。クラス分類適応予測回路６１では、データｄ１２に対してデータｄ１５を加算することによって、第４階層のローカル復号データを形成し、このローカル復号データを使用して予測処理がなされ、第３階層データの予測値ｄ１１が生成され、この予測値ｄ１１が減算器４８へ供給される。この減算器４８では、平均化回路４３から供給されるデータｄ２と予測値ｄ１１との差分値が求められ、その差分値ｄ７が量子化器５２へ供給される。
【００５２】
差分値ｄ７が供給された量子化器５２の出力データが演算器６５および逆量子化器５６へ供給される。この演算器６５では、４画素から１画素を間引く処理が行われる。演算器６５から出力される第３階層データｄ１９が可変長符号のエンコーダ６９に供給され、エンコーダ６９の出力が出力端子７４に第３階層のデータとして取り出される。
【００５３】
クラス分類適応予測回路６１により予測された第３階層データｄ１１と、量子化器５２から符号化データが供給された逆量子化器５６の出力データｄ１４がクラス分類適応予測回路６０へ供給される。クラス分類適応予測回路６０では、データｄ１１に対してデータｄ１４を加算することによって、第３階層のローカル復号データを形成し、このローカル復号データを使用して予測処理がなされ、第２階層データの予測値ｄ１０が生成され、予測値ｄ１０が減算器４７へ供給される。この減算器４７では、平均化回路４２から供給されるデータｄ１と予測値ｄ１０との差分値が求められ、その差分値ｄ６が量子化器５１へ供給される。
【００５４】
量子化器５１の出力データは、演算器６４および逆量子化器５５へ供給される。この演算器６４では、４画素から１画素を間引く処理が行われる。演算器６４から出力される第２階層データｄ１８が可変長符号のエンコーダ６８に供給され、エンコーダ６８の出力が出力端子７３に第２階層のデータとして取り出される。
【００５５】
クラス分類適応予測回路６０により予測された第２階層データｄ１０と、量子化器５１から符号化データが供給された逆量子化器５５の出力データｄ１３がクラス分類適応予測回路５９へ供給される。クラス分類適応予測回路５９では、データｄ１０に対してデータｄ１３を加算することによって、第２階層のローカル復号データを形成し、このローカル復号データを使用して予測処理がなされ、第１階層データの予測値ｄ９が生成され、予測値ｄ９が減算器４６へ供給される。この減算器４６では、入力端子４１から供給される入力画素データｄ０と予測値ｄ９との差分値が求められ、その差分値ｄ５が量子化器５０へ供給される。
【００５６】
差分値ｄ５が供給された量子化器５０の出力データは、演算器６３へ供給される。この演算器６３では、４画素から１画素を間引く処理が行われる。演算器６３から出力される第１階層データｄ１７が可変長符号のエンコーダ６７に供給され、エンコーダ６７の出力が出力端子７２に第１階層のデータとして取り出される。
【００５７】
クラス分類適応予測回路５９、６０、６１、６２のそれぞれは、予測しようとする下位階層の画素をその空間的に近傍の複数の画素（上位階層に含まれる）のレベル分布に基づいて予測するものである。図１２は、クラス分類適応予測回路の一例を示す。入力端子１４１からのローカル復号データが周辺コード値形成部１４２に供給される。周辺コード値形成部１４２は、予測しようとする下位階層の画素の近傍に位置する複数のデータｘ_１、ｘ_２、・・・・、ｘ_ｎを同時化する。周辺コード値がクラス分類部１４３および遅延部１４５に供給される。クラス分類部１４３は、周辺コード値ｘ_１〜ｘ_ｎのレベル分布のパターンと対応したクラスコードを出力する。クラスコードとしては、周辺コード値それ自体を使用しても良いが、クラス数が膨大となるので、周辺コードのそれぞれのビット数をＡＤＲＣ等により例えば１ビットに圧縮したものが使用される。クラス分類部１４３から発生したクラスコードが予測係数メモリ１４４に対してアドレス信号として供給される。
【００５８】
予測係数メモリ１４４には、予め学習により獲得された予測係数ｗ_１〜ｗ_ｎがアドレス毎に格納されている。すなわち、教師信号（例えば第４階層のデータ）と、入力信号（第４階層のデータから平均化処理で形成された第５階層のデータ）とを使用し、入力信号の複数のデータと係数との線形１次結合により予測値を求め、この予測値と教師信号の真値との誤差の自乗和を最小とするような係数がクラス毎に最小自乗法により求められる。クラスコードに対応して予測係数メモリ１４４から読出された予測係数ｗ１〜ｗ_ｎと遅延部１４５からの周辺コード値ｘ_１〜ｘ_ｎとが予測演算部１４６に供給される。
【００５９】
予測演算部１４６では、下記の線形１次結合式によって、予測値ｙが計算される。
ｙ＝ｗ_１ｘ_１＋ｗ_２ｘ_２＋・・・・・＋ｗ_ｎｘ_ｎ
予測演算部１４６により求められた予測値ｙが出力端子１４７に取り出される。なお、クラス分類のために使用される周辺コード値と、予測演算のために使用される周辺コード値とが異なったものでも良い。
【００６０】
上述した一実施例における符号化ユニット１２５（図２参照）と同様の構成が階層符号化のエンコーダ側にも設けられている。つまり、量子化回路６迄の前段の構成と同様の構成を量子化器５０、５１、５２、５３、５４がそれぞれ有し、ビットプレーン符号化回路７より後段の構成と同様の構成を可変長エンコーダ６７、６８、６９、７０、７１がそれぞれ有する。
【００６１】
次に、上述のエンコーダと対応する階層符号化のデコーダ側の構成例を図１１に示す。エンコーダ側で生成された各階層データは、ｄ３０〜ｄ３４として入力端子８１、８２、８３、８４、８５にそれぞれ供給される。そして、可変長符号のデコーダ８６、８７、８８、８９、９０にて可変長符号の復号がなされる。これらのデコーダに対して、逆量子化器９１、９２、９３、９４、９５がそれぞれ接続される。
【００６２】
先ず、第５階層入力データｄ３４は、逆量子化器９５において、エンコーダで施された量子化に対応する復号処理が行われ、画像データｄ３９となり、クラス分類適応予測回路１０７および演算器１０３へ供給される。また画像データｄ３９は、第５階層の画像出力として、出力端子１１２から取り出される。
【００６３】
クラス分類適応予測回路１０７では、第４階層の画像データに対してクラス分類適応予測が施され、第４階層データの予測値ｄ４７が生成される。逆量子化器９４からのデータｄ３８（すなわち、差分値）と予測値ｄ４７とが加算器９９で加算される。加算器９９から画像データｄ４３が演算器１０３へ供給され、演算器１０３では、非伝送画素の値を求めるために、上述した演算が実行され、逆量子化器９５から供給された画像データｄ３９と画像データｄ４３から第４階層の全画素値が復元される。この演算器１０３において、復元された全画素値は、画像データｄ５１として、クラス分類適応予測回路１０６および演算器１０２へ供給される。また画像データｄ５１は、第４階層の出力として、出力端子１１１から取り出される。
【００６４】
クラス分類適応予測回路１０６では、第３階層の画像データに対してクラス分類適応予測が施され、第３階層データの予測値ｄ４６が生成される。逆量子化器９３からのデータｄ３７と予測値ｄ４６とが加算器９８で加算される。加算器９８から画像データｄ４２が演算器１０２へ供給され、演算器１０２により非伝送画素の値が求められ、演算器１０３から供給された画像データｄ５１と画像データｄ４２から第３階層の全画素値が復元される。この演算器１０２において、復元された全画素値は、画像データｄ５０として、クラス分類適応予測回路１０５および演算器１０１へ供給される。また画像データｄ５０は、第３階層の出力として、出力端子１１０から取り出される。
【００６５】
また、クラス分類適応予測回路１０５では、第２階層の画像データに対してクラス分類適応予測が施され、第２階層データの予測値ｄ４５が生成される。逆量子化器９２からのデータｄ３６と予測値ｄ４５とが加算器９７で加算される。加算器９７から画像データｄ４１が演算器１０１へ供給され、演算器１０１により非伝送画素の値が求められ、演算器１０２から供給された画像データｄ５０と画像データｄ４１から第２階層の全画素値が復元される。この演算器１０１において、復元された全画素値は、画像データｄ４９として、クラス分類適応予測回路１０４および演算器１００へ供給される。また画像データｄ４９は、第２階層の出力として、出力端子１０９から取り出される。
【００６６】
さらに、クラス分類適応予測回路１０４では、第１階層の画像データに対してクラス分類適応予測が施され、第１階層データの予測値ｄ４４が生成される。逆量子化器９１からのデータｄ３５と予測値ｄ４４とが加算器９６で加算される。加算器９６から画像データｄ４０が演算器１００へ供給され、演算器１００により非伝送画素の値が求められ、演算器１０１から供給された画像データｄ４９と画像データｄ４０から第１階層の全画素値が復元される。この演算器１００において、復元された全画素値は、画像データｄ４８として、第１階層の出力として、出力端子１０８から取り出される。クラス分類適応予測回路１０４、１０５、１０６、１０７のそれぞれは、図１２に示し、上述した具体的構成を有している。こうして、符号化対象画素数の増加を防止した階層符号化において、クラス分類適応予測を導入することで符号化効率の向上を図ることが可能となる。
【００６７】
上述した一実施例の復号化ユニット１３４と同様の構成を可変長符号のデコーダ８６、８７、８８、８９、９０と、逆量子化器９１、９２、９３、９４、９５がそれぞれ有する。従って、上述した階層符号化に対してこの発明を適用した他の実施例によっても、上述した一実施例と同様に、ブロック化された残差信号のレベル範囲が所定の条件を満たす場合に、量子化ビット数をより少なくすることができる。
【００６８】
この発明において発生した、モード信号ＭＯＤＥと符号化残差信号との伝送方法としては、種々のものが可能である。例えば１フレーム分のモード信号を先にまとめて伝送し、その後に符号化残差信号を伝送することができる。また、この発明は、３以上の量子化モードを設定することができる。例えばブロックの残差信号のレベル範囲のみならず、そのダイナミックレンジＤＲも考慮して、ダイナミックレンジＤＲが小さい時には、第２の量子化モードにおけるビット数より少ないビット数の第３の量子化モードが設定されるようにしても良い。さらに、第１の量子化モードの量子化出力のレベル範囲に基づいて、量子化モードを決定するようにしても良い。
【００６９】
よりさらに、この発明は、上述した予測符号化以外の予測符号化で発生した残差信号の量子化に対しても適用できる。また、この発明は、量子化ステップ幅を制御することによって、発生データ量を制御するバッファリングの構成を有するシステムに対しても適用することができる。
【００７０】
【発明の効果】
この発明に依れば、ブロック化された残差信号の度数分布が０を跨がない場合を検出し、その場合では、量子化ビット数をより少なくし、伝送データ量を一層低減することができる。
【図面の簡単な説明】
【図１】この発明を適用できる記録／再生、あるいは伝送システムの一例のブロック図である。
【図２】この発明の一実施例のブロック図である。
【図３】この発明の一実施例中の残差信号の生成と、そのブロック化の説明に用いる略線図である。
【図４】この発明の一実施例における量子化処理の説明に用いる略線図である。
【図５】この発明の一実施例中の量子化回路の一例のブロック図である。
【図６】この発明の一実施例中のビットプレーンの説明に用いる略線図である。
【図７】この発明の一実施例における復号化ユニットのブロック図である。
【図８】階層符号化の一例の説明に用いる略線図である。
【図９】階層符号化の一例の説明に用いる略線図である。
【図１０】階層符号化に対してこの発明を適用した他の実施例のエンコード側の構成の一例を示すブロック図である。
【図１１】この発明の他の実施例のデコード側の構成の一例を示すブロック図である。
【図１２】この発明の他の実施例のクラス分類適応予測回路の構成の一例を示すブロック図である。
【符号の説明】
２・・・最大値検出回路，３・・・最小値検出回路，５・・・モード決定回路
６・・・量子化回路[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an information signal encoding device, an encoding method, and an information signal decoding method that reduce the amount of generated data of digital information signals such as digital audio signals and digital image signals.
[0002]
[Prior art]
Predictive coding is known to reduce the amount of transmission information such as digital audio signals and digital image signals. For example, a one-dimensional DPCM forms a difference (residual) between an input sample value and a predicted value in the time or spatial direction, and a two-dimensional DPCM forms a residual between the input sample value and a predicted value in the spatial direction. . Since the digital information signal has a correlation in the time direction and the spatial direction, the level of the residual signal is smaller than the sample value. Therefore, the residual signal can be requantized with a smaller number of bits than the original number of quantization bits, thereby reducing the amount of information.
[0003]
As a quantization device for a residual signal, a nonlinear quantization device is known in which the quantization step width near 0 is made finer and the quantization step width becomes coarser as the level increases. In the conventional quantization apparatus including this nonlinear quantization, all levels of the residual signal that can be generated are targeted for quantization. For example, when one sample (one pixel) of a digital image signal is quantized to 8 bits, a value of (−255 to +255) may exist as a residual signal. The conventional quantization apparatus uses the entire range as the object of quantization.
[0004]
[Problems to be solved by the invention]
Since the conventional quantization apparatus targets the entire range of residual signals that can be generated, if the number of quantization bits is reduced, the quantization accuracy decreases, and the number of quantization bits increases. There was a problem that the amount of generated data increased. As a result, there has been a problem that audio signals and image signals with sufficient quality cannot be obtained when decoding.
[0005]
Accordingly, an object of the present invention is to provide an information signal encoding apparatus, encoding method, and decoding method capable of further reducing the data amount of the quantized output when the residual signal is quantized. In other words, when the residual signal is blocked and the level range of the residual signal for each block satisfies a certain condition, the quantization mode is switched so as to reduce the number of quantization bits.
[0006]
[Means for Solving the Problems]
The invention of claim 1 is an information signal encoding apparatus for encoding an input digital information signal so as to reduce the amount of generated data.
Input digital information signalAnd the predicted value of the input digital information signalMeans for generating a residual signal of
Means for blocking the residual signal;
Means for detecting the maximum and minimum values for each block;
The residual signal for each block from the maximum and minimum valuesExistenceDetermine if the range is crossing zero,ExistenceInstruct the first quantization mode when the range crosses 0,ExistenceMode determining means for instructing the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual signal is quantized with a number of bits smaller than the original number of bits. And quantization means adapted to reduce the number of bits by code conversion,
Information identifying first and second quantization modes;
Transmission means for transmitting the output of the quantization means;
An information signal encoding device characterized by comprising:
[0007]
According to a third aspect of the present invention, there is provided an information signal encoding method for encoding an input digital information signal so as to reduce the amount of generated data.
Input digital information signalAnd the predicted value of the input digital information signalGenerating a residual signal of
Blocking the residual signal;
Detecting the maximum and minimum values for each block;
The residual signal for each block from the maximum and minimum valuesExistenceDetermine if the range is crossing zero,ExistenceInstruct the first quantization mode when the range crosses 0,ExistenceA mode determining step for indicating the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual signal is quantized with a number of bits smaller than the original number of bits. And a quantization step to reduce the number of bits by code conversion,
Information identifying first and second quantization modes;
Transmitting the output of the quantizing means;
An information signal encoding method characterized by comprising:
[0008]
According to a fourth aspect of the present invention, there is provided an information signal encoding apparatus in which at least first and second hierarchical data are formed from an input digital information signal, and the first and second hierarchical data are encoded and transmitted.
Means for forming second hierarchical data having a lower resolution than the first hierarchical data;
Means for predicting the first hierarchical data from the second hierarchical data;
Means for forming a residual signal between the predicted data and the first tier data;
Means for blocking the residual signal;
Means for detecting the maximum and minimum values for each block;
The residual signal for each block from the maximum and minimum valuesExistenceDetermine if the range is crossing zero,ExistenceInstruct the first quantization mode when the range crosses 0,ExistenceMode determining means for instructing the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual signal is quantized with a number of bits smaller than the original number of bits. And quantization means adapted to reduce the number of bits by code conversion,
Information identifying first and second quantization modes;
Transmission means for transmitting the output of the quantization means;
An information signal encoding device characterized by comprising:
[0009]
According to a fifth aspect of the present invention, there is provided an information signal encoding method in which at least first and second hierarchical data are formed from an input digital information signal, and the first and second hierarchical data are encoded and transmitted.
Forming second hierarchical data having a lower resolution than the first hierarchical data;
Predicting first hierarchical data from second hierarchical data;
Forming a residual signal between the predicted data and the first tier data;
Blocking the residual signal;
Detecting the maximum and minimum values for each block;
The residual signal for each block from the maximum and minimum valuesExistenceDetermine if the range is crossing zero,ExistenceInstruct the first quantization mode when the range crosses 0,ExistenceA mode determining step for indicating the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual signal is quantized with a number of bits smaller than the original number of bits. And a quantization step to reduce the number of bits by code conversion,
Information identifying first and second quantization modes;
A transmission step for transmitting the quantized output and
An information signal encoding method characterized by comprising:
[0010]
The invention of claim 6A residual signal between the input digital information signal and the predicted value of the input digital information signal is generated, the residual signal is blocked, the maximum and minimum values are detected for each block, and the maximum and minimum values are detected for each block. It is determined whether or not the existence range of the residual signal is crossing 0, the first quantization mode is instructed when the existence range crosses 0, and the second quantum when the existence range does not cross 0 Mode is instructed,In the first quantization mode, the residual signal is quantized with a smaller number of bits than the original number of bits, and in the second quantization mode, the residual signal is quantized with a smaller number of bits than the original number of bits. At the same time, it is encoded to reduce the number of bits by code conversion.And an information signal encoding method in which information for identifying the first and second quantization modes and information on the residual signal are transmitted.In the decryption method,
Based on the identification information, in the first quantization mode, the data is inversely quantized, and in the second quantization mode, the data is transcoded and inversely quantized,
An information signal decoding method comprising the steps of: decomposing a dequantized residual signal into blocks and converting the residual signal into an original order.
[0011]
By increasing the degree of concentration of the level distribution of the residual signal, the residual signal can be requantized with a smaller number of bits than the original number of quantization bits. Also, when the residual signal distribution satisfies a certain condition, the residual signal can be quantized with a smaller number of bits.
[0012]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, an embodiment of the present invention will be described with reference to the drawings. In this embodiment, the present invention is applied to a digital image signal obtained by sampling a video signal at a predetermined sampling frequency and converting each sample into a predetermined number of quantization bits. FIG. 1 shows the overall configuration of a system according to an embodiment of the present invention.
[0013]
In FIG. 1, a digital video signal is supplied to an input terminal indicated by 121. The input signal is supplied to the subtractor 123, the output (residual signal) of the subtractor 123 is supplied to the blocking circuit 124 and the predictor 122, and the prediction signal generated by the predictor 122 is supplied to the subtractor 123. . The subtractor 123 subtracts the prediction signal from the input signal to generate a prediction residual. This residual signal is supplied to the block forming circuit 124 and converted from raster scan order to block order data. The blocked residual signal is supplied to the encoding unit 125.
[0014]
As will be described later, the quantization circuit in the encoding unit 125 is switched between the first quantization mode and the second quantization mode in accordance with the frequency distribution of the blocked residual signal. . For example, when the number of bits of the residual signal is 8 bits, when the first quantization mode is selected, a 3-bit quantization output is generated from the encoding unit 125. When the second quantization mode is selected, a quantized output converted into 2 bits is generated by the code conversion table in the encoding unit 125.
[0015]
The encoded output of the encoding unit 125 is supplied to the error correction code encoder 126, and a redundant code of the error correction code is added. The output of the error correction code encoder 126 is supplied to the modulation unit 127. The modulation unit 127 modulates the digital signal into a form suitable for recording, transmission, and the like. An output signal from the modulation unit 127 is supplied to the recording unit 128, and the recording signal is recorded on the information signal recording medium 129 by the recording unit 128. It is also possible to transmit data via the transmission path 130. In this case, a transmission unit is used instead of the recording unit 128. A mode signal indicating the quantization mode is also recorded or transmitted together with the encoded residual signal. The information signal recording medium 129 is a disk-like or tape-like recording medium using magnetism, magneto-optical, phase change or the like. A semiconductor memory is also a kind of recording medium.
[0016]
Data is reproduced from the recording medium 129 by the reproduction unit 131 or data transmitted via the transmission path 130 is received. The data reproduced by the reproduction unit 131 is demodulated by the demodulator 132, and the demodulated output is supplied to the error correction code decoder 133. The decoder 133 corrects the error by using the redundant code, and corrects the remaining error that cannot be corrected to be inconspicuous.
[0017]
The output of the error correction decoder 133 is supplied to the decoding unit 134. As will be described later, the decoding unit 134 performs inverse quantization processing for converting the quantized output into a representative value (quantized restoration value), contrary to the encoding unit 125. The inverse quantization circuit in the decoding unit 134 performs inverse quantization processing according to the quantization mode indicated by the mode signal. A decoded residual signal is generated from the decoding unit 134. This decoded residual signal is supplied to the block decomposition circuit 135. In the block decomposition circuit 135, the block structure is returned to the order of raster scanning.
[0018]
The decoded residual signal is supplied to the adder circuit 136. A decoding image signal is formed by the adding circuit 136 and taken out to the output terminal 137. Further, the decoded image signal is supplied to the predictor 138, and a prediction signal is generated. The prediction signal is supplied to the addition circuit 136.
[0019]
FIG. 2 shows an example of the encoding unit 125. The blocked residual signal from the blocking circuit 124 is supplied to the input terminal 1. FIG. 3 schematically shows the formation of the residual signal. One rectangular area in FIG. 3 corresponds to one pixel. Each of a to h represents a locally decoded pixel value, and AP represents a pixel value before being encoded. A predicted value A ′ for pixel value A is generated by the predictor 122 using neighboring local decoded pixel values. For example, the predicted value A ′ is formed according to a prediction formula such as A ′ = 4c−3 (b−f), A ′ = f + c−b, or the like. The predicted values for the pixel values B, C,... Are calculated by the same prediction formula. When represented by a general formula, a predicted value is generated by (αa + βb + γf, where α, β, and γ are constants).
[0020]
The subtractor 123 subtracts the predicted value (for example, A ′) from the pixel value (for example, A) to generate a residual signal Δa. Similarly, residual signals Δb, Δc,... Are generated. In the blocking circuit 124, the generated residual signal is converted into a block structure. For example, the block forming circuit 124 forms data of blocks of residual signals Δa to Δp corresponding to the (4 × 4) block, as shown by the thick line frame in FIG. 3A. When a digital audio signal is handled, a prediction value in the time direction is formed, and a one-dimensional residual signal block is formed.
[0021]
By concentrating the level range of the residual signal, the degree of concentration can be increased. When one pixel is 8-bit data, the distribution of the occurrence frequency of the residual signal of one screen is in the range of (−255 to +255) around 0, and the frequency of the residual is 0. Maximum. However, when divided into blocks, the frequency distribution of residuals is more concentrated than the original distribution.
[0022]
This is due to the fact that the residuals in the blocks in the small space are relatively large in comparison with one screen, and the residuals are strongly correlated in the blocks. Therefore, the block residual signal can be requantized with a smaller number of bits than the original number of quantization bits. Further, in the distribution of the residual signals that are blocked, the frequency of the value of 0 is not necessarily the maximum. For example, when the luminance level gradually changes in the diagonal direction in the block, for example, a distribution in which the frequency of the value of 0 does not become maximum occurs. Note that the method of increasing the degree of concentration of the residual signal distribution is an example of blocking, and other methods are also possible.
[0023]
Returning to FIG. 2, the encoding unit 125 will be described. The residual signal from the input terminal 1 is supplied to the maximum value detection circuit 2, the minimum value detection circuit 3, and the delay circuit 4. The maximum value detection circuit 2 detects the maximum value MAX for each block, and the minimum value detection circuit 3 detects the minimum value MIN for each block. The detected maximum value MAX and minimum value MIN are supplied to the mode determination circuit 5.
[0024]
The mode determination circuit 5 is supplied with the block residual signal, the maximum value MAX, and the minimum value MIN, and the quantization mode is determined based on the level range of the residual signal of the block to indicate the mode. A signal MODE (for example, 1 bit) is generated. The mode signal MODE is supplied to the quantization circuit 6, and the residual signal from the input terminal 1 is supplied to the quantization circuit 6 through the delay circuit 4 for phase matching. In the quantization circuit 6, the residual signal is requantized with a smaller number of quantization bits than the original number of quantization bits (for example, 8 bits). In the quantization circuit 6, the first and second quantization modes are selected according to the mode signal MODE.
[0025]
The first quantization mode is a normal one that re-quantizes the 8-bit residual signal to generate a smaller number of quantization bits, eg, 3 bits. The second quantization mode is applied when the level range of the blocked residual signal does not cross zero. In this embodiment, when the minimum value MIN of the residual signal is larger than (Δ / 2), the second quantization mode is applied. Here, Δ is DR / 2 when the number of requantization bits is n.ⁿIs a quantization step calculated by DR is a dynamic range and can be calculated by (MAX-MIN).
[0026]
In the second quantization mode, a code signal (for example, 3 bits) generated by re-quantization is converted according to a predetermined rule in the same manner as in the first quantization mode, and a quantized output having a smaller number of bits, for example, 2 bits Converted. Code conversion rules for converting a 3-bit code into a 2-bit quantized output are shown below.
100-00
101-01
110-10
111-11
[0027]
The quantization process will be described with reference to FIG. As described above, the residual signal generated by the predictive coding has the maximum frequency of 0 when viewed in the whole frame. However, the residual signal varies depending on the block as shown in FIG. 4A. Have a bias. FIG. 4A shows a distribution of residual signals biased to the negative side, a distribution of residual signals that coincided with a value having a maximum frequency of 0, and a distribution of residual signals biased to the positive side.
[0028]
FIG. 4B shows the frequency distribution of the residual signal that crosses the value of 0. In the case of quantizing the residual signal, for example, the interval between MAX and MIN is 2³= 8, and the same code is assigned to the residual signals belonging to the range. When the code is restored, it is converted to a representative value in the middle of the range. In general, if the code has n bits, 2ⁿThe range between MAX and MIN is divided into a number range. Such quantization is the first quantization mode.
[0029]
FIG. 4C shows an example of the frequency distribution of the blocked residual signal. That is, FIG. 4C shows a distribution 30a in which the minimum value MIN is equal to Δ / 2, a distribution 30b (shown by a broken line) that crosses 0 similarly to that in FIG. 4B, and a maximum value MAX is −Δ / 2. A distribution 30c (indicated by a broken line) equal to is shown. In this embodiment, when the condition of (MIN ≧ Δ / 2) is satisfied as in the frequency distribution 30a, the second quantization mode is applied. In the second quantization mode, the code as described above is used. Conversion is done. On the decoding side, code conversion in the reverse direction from 2 bits to 3 bits is performed, and the 3 bits code is inversely quantized.
[0030]
For example, if the residual signal having the frequency distribution 30a in FIG. 4C is quantized in the second quantization mode, 100 to 111 quantized outputs in the range of MIN to MAX are converted to 00 to 11 quantized outputs. On the other hand, on the decoding side, the quantized output of 00 to 11 is converted into the code of 100 to 111. Inverse quantization can be performed by converting the converted 3-bit code into a representative value in the same manner as the 3-bit quantization output generated in the first quantization mode.
[0031]
Unlike the embodiment of the present invention, the second quantization mode may be applied when the maximum value MAX of the residual signal of the block is smaller than −Δ / 2 (for example, the frequency distribution 30c in FIG. 4C). good. Further, the second quantization mode may be applied when the minimum value MIN is (MIN ≧ 0) or when the maximum value MAX is (MAX ≦ 0).
[0032]
An example of the quantization circuit 6 is shown in FIG. A delayed residual signal is supplied from the delay circuit 4 to the input terminal indicated by 15. An input residual signal is supplied to the input terminal of the switching circuit 16 switched by the mode signal MODE. For example, quantizers 17a and 17b that generate a 3-bit output are connected to output terminals a and b of switching circuit 16, respectively. A code conversion circuit 18 is connected to the quantizer 17b. The code conversion circuit 18 is a circuit that converts 3 bits into 2 bits as described above. The output of the quantizer 17a or 17b is taken out to the output terminal 19.
[0033]
The mode signal MODE indicates the second quantization mode when (MIN ≧ Δ / 2) with respect to the residual signal of the block by the mode determination circuit 5, while in other cases, A mode signal MODE indicates the first quantization mode. In the first quantization mode, the output terminal a of the switching circuit 16 is selected, and in the second quantization mode, the output terminal b of the switching circuit 16 is selected. Therefore, in the first quantization mode, a 3-bit quantization output is generated at the output terminal 19, and in the second quantization mode, a 2-bit quantization output is generated at the output terminal 19. The quantizers 17a and 17b may be shared.
[0034]
Returning to FIG. 2, the output of the quantization circuit 6 is supplied to the bit plane encoding circuit 7. The bit plane encoding circuit 7 divides n bits, for example, a 2-bit code q into an MSB (most significant bit) plane and an LSB (least significant bit) plane. The MSB plane is a set of MSBs of supplied 2-bit quantized values, and the LSB plane is a set of LSBs. For simplicity, FIG. 6A shows a case where one screen is composed of (4 × 3 = 12 blocks) and each block includes a code q of (4 × 4) residual signals.
[0035]
In FIG. 6A, 0, 1, 2, 3 of the value of the code q shown as an example means (00), (01), (10), (11) with 2 bits, respectively. In the example of FIG. 6A, as shown in FIG. 6B, the bit plane encoding circuit 7 divides, for example, the code q of one screen into an MSB plane and an LSB plane. When the number of quantization bits is 3 bits, an intermediate bit plane is further formed. In one embodiment of the present invention, a two-bit or three-bit quantized output is generated by two quantization modes. In the case of forming a bit plane, it is necessary to know the number of bits of the quantized output. For this reason, the mode signal MODE is supplied to the bit plane encoding circuit 7.
[0036]
Each bit plane generated by the bit plane encoding circuit 7 is supplied to the variable length encoding circuit 8, and each bit plane is variable length encoded. The variable length coding circuit 8 performs run length coding, for example, MMR (Modified MR), for each bit plane. The output of the variable length coding circuit 8 is supplied to the framing circuit 9. The above-described mode signal MODE is also supplied to the framing circuit 9. The mode signal MODE and the variable-length encoded code are output to the output terminal 10 of the framing circuit 9 as data having a predetermined frame structure.
[0037]
Next, referring to FIG.1An example of the decoding unit 134 in the middle will be described. Reproduced or received data from the error correction decoder 133 is supplied to the input terminal 31 of the decoding unit 134. The frame decomposition circuit 32 separates the variable-length encoded residual signal code from the mode signal MODE. The code of the separated residual signal is decoded by the variable length decoding circuit 33. The variable length decoding circuit 33 corresponds to the variable length encoding circuit 13. The decoded output of the variable length decoding circuit 33 is supplied to the bit plane decoding circuit 34 to synthesize the bit plane. In this case, the bit plane is correctly synthesized by the separated mode signal MODE.
[0038]
The output of the bit plane decoding circuit 34 is supplied to the input terminal of the switching circuit 35. The switching circuit 35 is controlled by a mode signal MODE. When the mode signal MODE indicates the first quantization mode, the output terminal a of the switching circuit 35 is selected, and when this indicates the second quantization mode, the output terminal b of the switching circuit 35 is selected. The The output terminal a is connected to an inverse quantizer 36a that dequantizes a 3-bit quantization code and generates a quantized restoration value. A code conversion circuit 37 that converts a 2-bit quantized output into a 3-bit code is connected to the output terminal b, and an inverse quantizer 36 b is connected to the code conversion circuit 37. The inverse quantizer 36b converts a 3-bit code into a restored value, similarly to the inverse quantizer 36a.
[0039]
The restored values generated by the inverse quantizers 36a and 36b, that is, the decoded decoded residual signals are taken out to the output terminal 38. The decoded residual signal taken out to the output terminal 38 is supplied to the block decomposition circuit 135 (see FIG. 1). The inverse quantizers 36a and 36b may be shared.
[0040]
Next, another embodiment in which the present invention is applied to hierarchical coding will be described. The hierarchical encoding device described here can prevent an increase in the number of encoding target pixels by performing prediction between hierarchies and using a simple arithmetic expression for the inter-layer data.
[0041]
The hierarchical encoding method will be described with reference to FIG. FIG. 8 shows, as an example, a schematic diagram between four hierarchies having the first hierarchy as the lowest hierarchy (original picture) and the fourth hierarchy as the highest hierarchy. For example, when the average value of the lower layer data of four pixels corresponding spatially is adopted as the upper layer data generation method, the number of transmission pixels can be prevented from increasing.
[0042]
That is, the upper layer data is M and the lower layer pixel value is x.₀  , X₁  , X₂  , X₃  Then,
M = 1/4 · (x₀  + X₁  + X₂  + X₃)
As a result, data M is formed. And, for example, x of the data M and the four data₃The other three data are transmitted. On the receiving or playback side,
x₃  = 4 · M- (x₀  + X₁  + X₂  )
Non-transmission pixel x by a simple arithmetic expression₃  Can be easily restored. In FIG. 8, hatched rectangles indicate non-transmission pixels.
[0043]
FIG. 9 shows a configuration of hierarchical encoding of, for example, five layers using the above-described averaging. Assume that the first hierarchy is the resolution level of the input image. The block size of this first layer is (1 × 1). The second layer data is generated by averaging four pixels of the first layer data. In this example, the first hierarchy data X₁  (0) to X₁  Based on the average value of (3), the second hierarchical data X₂  (0) is generated. X₂  Second layer data X adjacent to (0)₂  (1) to X₂  Similarly, (3) is generated by averaging four pixels of the first layer data. The block size of the second hierarchy is (1/2 × 1/2).
[0044]
Further, the third layer data is generated by averaging four pixels of the second layer data corresponding spatially. The block size of the third hierarchy is (1/4 × 1/4). Similarly, the fourth layer data is generated from the third layer data. The block size of the fourth layer is (1/8 × 1/8). Finally, the fifth hierarchy data X which is the highest hierarchy₅  (0) is the fourth layer data X₄  (0) to X₄  It is generated by the average value of (3). The block size of the fifth layer is (1/16 × 1/16).
[0045]
By applying the class classification adaptive prediction to the upper layer data for the hierarchical structure data in which the increase in the number of encoding target pixels is prevented, the lower layer data is predicted, and the difference between the lower layer data and the predicted value ( That is, it is possible to reduce the amount of transmission data by generating a residual signal. FIG. 10 shows such a coding unit. The first hierarchy data d0 is supplied as input image data d0 to the averaging circuit 42 and the subtractor 46 via the input terminal 41. The first layer data is image data of the original resolution.
[0046]
The input pixel data d0 is subjected to ¼ averaging processing in the averaging circuit 42 to generate hierarchical data d1. This hierarchical data d1 corresponds to the second hierarchical data shown in FIG. The generated hierarchical data d1 is supplied to the averaging circuit 43 and the subtractor 47.
[0047]
The averaging circuit 43 performs the same processing as the averaging circuit 42 on the hierarchical data d1, and generates hierarchical data d2. This hierarchical data d2 corresponds to the third hierarchical data. The generated hierarchical data d2 is supplied to the averaging circuit 44 and the subtracter 48. Similarly, the averaging circuit 44 performs ¼ averaging processing on the hierarchical data d2 to generate hierarchical data d3. This hierarchical data d3 corresponds to the fourth hierarchical data. The generated hierarchical data d3 is supplied to the averaging circuit 45 and the subtracter 49. Further, the averaging circuit 45 similarly performs a 1/4 averaging process on the hierarchical data d3 to generate hierarchical data d4. This hierarchical data d4 corresponds to the fifth hierarchical data. The generated hierarchical data d4 is supplied to the quantizer 54.
[0048]
And about these five hierarchical data, prediction is performed between hierarchies. First, the quantization processing for compression performed in the fifth layer is performed by the quantizer 54. The output data d21 of the quantizer 54 is supplied to the variable length code encoder 71 and also to the inverse quantizer 58. The output of the encoder 71 is taken out to the output terminal 76 as fifth-layer data. The output data d16 of the inverse quantizer 58 supplied with the encoded data d21 is supplied to the class classification adaptive prediction circuit 62.
[0049]
In the class classification adaptive prediction circuit 62, prediction processing is performed using the data d16, a predicted value d12 of the fourth layer data is generated, and the predicted value d12 is supplied to the subtractor 49. In the subtractor 49, a difference value between the hierarchical data d3 supplied from the averaging circuit 44 and the predicted value d12 is obtained, and the difference value d8 is supplied to the quantizer 53.
[0050]
In the quantizer 53 to which the difference value d8 is supplied, requantization is performed so as to reduce the number of quantization bits as in the quantizer 54. The output data of the quantizer 53 is supplied to the arithmetic unit 66 and the inverse quantizer 57. In this calculator 66, a process of thinning out one pixel from four pixels is performed. Data d20 output from the arithmetic unit 66 is encoded by the variable-length code encoder 70, and the output of the encoder 70 is taken out as output data of the fourth layer to the output terminal 75.
[0051]
The fourth layer data d12 predicted by the class classification adaptive prediction circuit 62 and the output data (decoded residual signal) d15 of the inverse quantizer 57 are supplied to the class classification adaptive prediction circuit 61. In the class classification adaptive prediction circuit 61, the data d15 is added to the data d12 to form local decoded data of the fourth layer, and prediction processing is performed using the local decoded data. A predicted value d11 is generated, and this predicted value d11 is supplied to the subtractor 48. In the subtracter 48, a difference value between the data d2 supplied from the averaging circuit 43 and the predicted value d11 is obtained, and the difference value d7 is supplied to the quantizer 52.
[0052]
The output data of the quantizer 52 supplied with the difference value d7 is supplied to the arithmetic unit 65 and the inverse quantizer 56. In this calculator 65, a process of thinning out one pixel from four pixels is performed. The third layer data d19 output from the arithmetic unit 65 is supplied to the variable length code encoder 69, and the output of the encoder 69 is taken out to the output terminal 74 as third layer data.
[0053]
The third layer data d11 predicted by the class classification adaptive prediction circuit 61 and the output data d14 of the inverse quantizer 56 supplied with the encoded data from the quantizer 52 are supplied to the class classification adaptive prediction circuit 60. In the class classification adaptive prediction circuit 60, the data d14 is added to the data d11 to form third-layer local decoded data, and the local decoded data is used to perform prediction processing. A predicted value d10 is generated, and the predicted value d10 is supplied to the subtractor 47. In the subtractor 47, a difference value between the data d1 supplied from the averaging circuit 42 and the predicted value d10 is obtained, and the difference value d6 is supplied to the quantizer 51.
[0054]
The output data of the quantizer 51 is supplied to the arithmetic unit 64 and the inverse quantizer 55. The calculator 64 performs a process of thinning out one pixel from four pixels. The second layer data d18 output from the computing unit 64 is supplied to the variable length code encoder 68, and the output of the encoder 68 is taken out to the output terminal 73 as second layer data.
[0055]
The second hierarchy data d10 predicted by the class classification adaptive prediction circuit 60 and the output data d13 of the inverse quantizer 55 supplied with the encoded data from the quantizer 51 are supplied to the class classification adaptive prediction circuit 59. In the class classification adaptive prediction circuit 59, the data d13 is added to the data d10 to form second-level local decoded data, and prediction processing is performed using the local decoded data. A predicted value d9 is generated, and the predicted value d9 is supplied to the subtractor 46. In the subtractor 46, a difference value between the input pixel data d0 supplied from the input terminal 41 and the predicted value d9 is obtained, and the difference value d5 is supplied to the quantizer 50.
[0056]
The output data of the quantizer 50 supplied with the difference value d5 is supplied to the calculator 63. The calculator 63 performs a process of thinning out one pixel from four pixels. The first layer data d17 output from the computing unit 63 is supplied to the variable length code encoder 67, and the output of the encoder 67 is taken out to the output terminal 72 as the first layer data.
[0057]
Each of the class classification adaptive prediction circuits 59, 60, 61, and 62 predicts a lower layer pixel to be predicted based on a level distribution of a plurality of spatially neighboring pixels (included in the upper layer). It is. FIG. 12 shows an example of a class classification adaptive prediction circuit. Local decoded data from the input terminal 141 is supplied to the peripheral code value forming unit 142. The peripheral code value forming unit 142 includes a plurality of pieces of data x located in the vicinity of the lower layer pixel to be predicted.₁, X₂, ..., x_nAre synchronized. The peripheral code value is supplied to the class classification unit 143 and the delay unit 145. The class classification unit 143 calculates the peripheral code value x₁~ X_nThe class code corresponding to the level distribution pattern is output. As the class code, the peripheral code value itself may be used. However, since the number of classes becomes enormous, a code obtained by compressing each bit number of the peripheral code to 1 bit by ADRC or the like is used. The class code generated from the class classification unit 143 is supplied to the prediction coefficient memory 144 as an address signal.
[0058]
The prediction coefficient memory 144 stores the prediction coefficient w acquired in advance by learning.₁~ W_nIs stored for each address. That is, using a teacher signal (for example, fourth layer data) and an input signal (fifth layer data formed by averaging from the fourth layer data), a plurality of input signal data and coefficients A predicted value is obtained by linear linear combination of the above, and a coefficient that minimizes the square sum of errors between the predicted value and the true value of the teacher signal is obtained for each class by the method of least squares. Prediction coefficients w1 to w read from the prediction coefficient memory 144 corresponding to the class code._nAnd the peripheral code value x from the delay unit 145₁~ X_nAre supplied to the prediction calculation unit 146.
[0059]
In the prediction calculation unit 146, the predicted value y is calculated by the following linear linear combination formula.
y = w₁x₁+ W₂x₂+ ... + w_nx_n
The predicted value y obtained by the prediction calculation unit 146 is taken out to the output terminal 147. Note that the peripheral code value used for classification and the peripheral code value used for prediction calculation may be different.
[0060]
A configuration similar to that of the encoding unit 125 (see FIG. 2) in the above-described embodiment is also provided on the encoder side of the hierarchical encoding. That is, each of the quantizers 50, 51, 52, 53, and 54 has the same configuration as the configuration of the preceding stage up to the quantization circuit 6, and the configuration similar to the configuration of the subsequent stage from the bit plane encoding circuit 7 has a variable length. Each of the encoders 67, 68, 69, 70, 71 has.
[0061]
Next, FIG. 11 shows a configuration example on the decoder side of the hierarchical encoding corresponding to the encoder described above. Each hierarchical data generated on the encoder side is supplied to input terminals 81, 82, 83, 84, 85 as d30 to d34, respectively. The variable length code decoders 86, 87, 88, 89, 90 decode the variable length code. Inverse decoders 91, 92, 93, 94, and 95 are connected to these decoders, respectively.
[0062]
First, the fifth hierarchy input data d34 is subjected to decoding processing corresponding to the quantization performed by the encoder in the inverse quantizer 95, and becomes image data d39, which is supplied to the class classification adaptive prediction circuit 107 and the arithmetic unit 103. Is done. Further, the image data d39 is taken out from the output terminal 112 as the image output of the fifth hierarchy.
[0063]
In the class classification adaptive prediction circuit 107, class classification adaptive prediction is performed on the image data of the fourth layer, and the predicted value d47 of the fourth layer data is generated. The adder 99 adds the data d38 (that is, the difference value) from the inverse quantizer 94 and the predicted value d47. The image data d43 is supplied from the adder 99 to the calculator 103, and the calculator 103 performs the above-described calculation to obtain the value of the non-transmission pixel, and the image data d39 supplied from the inverse quantizer 95 and All pixel values in the fourth layer are restored from the image data d43. In this calculator 103, all the restored pixel values are supplied as image data d51 to the class classification adaptive prediction circuit 106 and the calculator 102. The image data d51 is taken out from the output terminal 111 as the output of the fourth hierarchy.
[0064]
In the class classification adaptive prediction circuit 106, class classification adaptive prediction is performed on the image data of the third hierarchy, and the predicted value d46 of the third hierarchy data is generated. The adder 98 adds the data d37 from the inverse quantizer 93 and the predicted value d46. The image data d42 is supplied from the adder 98 to the computing unit 102, the value of the non-transmission pixel is obtained by the computing unit 102, and all the pixel values of the third hierarchy are obtained from the image data d51 and the image data d42 supplied from the computing unit 103. Is restored. In this computing unit 102, all the restored pixel values are supplied as image data d50 to the class classification adaptive prediction circuit 105 and the computing unit 101. The image data d50 is taken out from the output terminal 110 as the output of the third hierarchy.
[0065]
Also, the class classification adaptive prediction circuit 105 performs class classification adaptive prediction on the second layer image data, and generates a predicted value d45 of the second layer data. The adder 97 adds the data d36 from the inverse quantizer 92 and the predicted value d45. The image data d41 is supplied from the adder 97 to the computing unit 101, the value of the non-transmission pixel is obtained by the computing unit 101, and all the pixel values of the second hierarchy are obtained from the image data d50 and the image data d41 supplied from the computing unit 102. Is restored. In this computing unit 101, all the restored pixel values are supplied as image data d49 to the class classification adaptive prediction circuit 104 and the computing unit 100. Further, the image data d49 is taken out from the output terminal 109 as the output of the second hierarchy.
[0066]
Further, the class classification adaptive prediction circuit 104 performs class classification adaptive prediction on the first layer image data, and generates a predicted value d44 of the first layer data. The adder 96 adds the data d35 from the inverse quantizer 91 and the predicted value d44. The image data d40 is supplied from the adder 96 to the computing unit 100, the value of the non-transmission pixel is obtained by the computing unit 100, and all the pixel values of the first layer are obtained from the image data d49 and the image data d40 supplied from the computing unit 101. Is restored. In this computing unit 100, all restored pixel values are taken out from the output terminal 108 as image data d48 as the output of the first layer. Each of the class classification adaptive prediction circuits 104, 105, 106, and 107 has the specific configuration shown in FIG. In this way, it is possible to improve coding efficiency by introducing class classification adaptive prediction in hierarchical coding in which an increase in the number of pixels to be coded is prevented.
[0067]
The variable length code decoders 86, 87, 88, 89, 90 and the inverse quantizers 91, 92, 93, 94, 95 have the same configuration as that of the decoding unit 134 of the above-described embodiment. Therefore, according to another embodiment in which the present invention is applied to the above-described hierarchical coding, as in the case of the above-described one embodiment, when the level range of the blocked residual signal satisfies a predetermined condition, The number of quantization bits can be reduced.
[0068]
Various methods for transmitting the mode signal MODE and the encoded residual signal generated in the present invention are possible. For example, the mode signals for one frame can be transmitted together and the encoded residual signal can be transmitted thereafter. Further, in the present invention, three or more quantization modes can be set. For example, considering not only the level range of the residual signal of the block but also its dynamic range DR, when the dynamic range DR is small, the third quantization mode having a smaller number of bits than the number of bits in the second quantization mode is obtained. It may be set. Furthermore, the quantization mode may be determined based on the level range of the quantization output of the first quantization mode.
[0069]
Furthermore, the present invention can be applied to quantization of a residual signal generated by predictive encoding other than the predictive encoding described above. The present invention can also be applied to a system having a buffering configuration for controlling the amount of generated data by controlling the quantization step width.
[0070]
【The invention's effect】
According to the present invention, it is possible to detect a case where the frequency distribution of the blocked residual signal does not cross 0, in which case the number of quantization bits can be further reduced and the amount of transmission data can be further reduced. it can.
[Brief description of the drawings]
FIG. 1 is a block diagram of an example of a recording / reproducing or transmission system to which the present invention can be applied.
FIG. 2 is a block diagram of an embodiment of the present invention.
FIG. 3 is a schematic diagram used for explaining generation of a residual signal and its blocking in one embodiment of the present invention.
FIG. 4 is a schematic diagram used for explaining quantization processing in one embodiment of the present invention;
FIG. 5 is a block diagram of an example of a quantization circuit in one embodiment of the present invention.
FIG. 6 is a schematic diagram used for explaining a bit plane in one embodiment of the present invention;
FIG. 7 is a block diagram of a decoding unit in one embodiment of the present invention.
FIG. 8 is a schematic diagram used to describe an example of hierarchical encoding.
FIG. 9 is a schematic diagram used to describe an example of hierarchical encoding.
FIG. 10 is a block diagram showing an example of a configuration on the encoding side of another embodiment in which the present invention is applied to hierarchical encoding.
FIG. 11 is a block diagram showing an example of a configuration on the decoding side according to another embodiment of the present invention.
FIG. 12 is a block diagram showing an example of the configuration of a class classification adaptive prediction circuit according to another embodiment of the present invention.
[Explanation of symbols]
2 ... Maximum value detection circuit, 3 ... Minimum value detection circuit, 5 ... Mode decision circuit
6 ... Quantization circuit

Claims

In an information signal encoding apparatus for encoding an input digital information signal so as to reduce the amount of generated data,
Means for generating a residual signal between the input digital information signal and a predicted value of the input digital information signal ;
Means for blocking the residual signal;
Means for detecting the maximum and minimum values for each block;
It is determined from the maximum value and the minimum value whether or not the existence range of the residual signal for each block crosses 0, and when the existence range crosses 0, the first quantization mode is indicated and the presence Mode determining means for instructing the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual is calculated with a number of bits less than the original number of bits. Quantization means adapted to quantize the signal and reduce the number of bits by code conversion;
Information identifying the first and second quantization modes;
An information signal encoding apparatus comprising transmission means for transmitting the output of the quantization means.

In the information signal encoding device according to claim 1,
The mode determining means includes
An information signal encoding apparatus characterized by instructing a second quantization mode when the minimum value is 0 or more, or when the maximum value is 0 or less.

In an information signal encoding method for encoding an input digital information signal so as to reduce the amount of generated data,
Generating a residual signal between the input digital information signal and a predicted value of the input digital information signal ;
Blocking the residual signal;
Detecting a maximum value and a minimum value for each of the blocks;
It is determined from the maximum value and the minimum value whether or not the existence range of the residual signal for each block crosses 0, and when the existence range crosses 0, the first quantization mode is indicated and the presence A mode determining step for indicating the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual is calculated with a number of bits less than the original number of bits. Quantization of the signal, and a quantization step for reducing the number of bits by code conversion,
Information identifying the first and second quantization modes;
An information signal encoding method comprising the step of transmitting the output of the quantization means.

In an information signal encoding apparatus for forming at least first and second layer data from an input digital information signal and encoding and transmitting the first and second layer data,
Means for forming the second hierarchical data having a lower resolution than the first hierarchical data;
Means for predicting the first hierarchical data from the second hierarchical data;
Means for forming a residual signal between the predicted data and the first hierarchical data;
Means for blocking the residual signal;
Means for detecting the maximum and minimum values for each block;
It is determined from the maximum value and the minimum value whether or not the existence range of the residual signal for each block crosses 0, and when the existence range crosses 0, the first quantization mode is indicated and the presence Mode determining means for instructing the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual is calculated with a number of bits less than the original number of bits. Quantization means adapted to quantize the signal and reduce the number of bits by code conversion;
Information identifying the first and second quantization modes;
An information signal encoding apparatus comprising transmission means for transmitting the output of the quantization means.

In an information signal encoding method for forming at least first and second layer data from an input digital information signal, and encoding and transmitting the first and second layer data,
Forming the second hierarchical data having a lower resolution than the first hierarchical data;
Predicting the first hierarchical data from the second hierarchical data;
Forming a residual signal between the predicted data and the first hierarchical data;
Blocking the residual signal;
Detecting a maximum value and a minimum value for each of the blocks;
It is determined from the maximum value and the minimum value whether or not the existence range of the residual signal for each block crosses 0, and when the existence range crosses 0, the first quantization mode is indicated and the presence A mode determining step for indicating the second quantization mode when the range does not cross 0;
In the first quantization mode, the residual signal is quantized with a predetermined number of bits less than the original number of bits, and in the second quantization mode, the residual is calculated with a number of bits less than the original number of bits. Quantization of the signal, and a quantization step for reducing the number of bits by code conversion,
Information identifying the first and second quantization modes;
An information signal encoding method comprising: a transmission step for transmitting the quantized output.

A residual signal between an input digital information signal and a predicted value of the input digital information signal is generated, the residual signal is blocked, and a maximum value and a minimum value are detected for each block, and the maximum value and the minimum value are detected. It is determined from the value whether or not the existence range of the residual signal for each block crosses 0, and when the existence range crosses 0, the first quantization mode is indicated, and the existence range crosses 0. If not, the second quantization mode is instructed . In the first quantization mode, the residual signal is quantized with a smaller number of bits than the original number of bits, and in the second quantization mode, the residual is The signal is quantized with a smaller number of bits than the original number of bits and encoded to reduce the number of bits by code conversion, and information identifying the first and second quantization modes, and a residual Signal In the decoding method for the information signal encoding method in which the broadcast is transmitted,
Based on the identification information, in the first quantization mode, the data is inversely quantized, and in the second quantization mode, the data is transcoded and inversely quantized,
An information signal decoding method comprising the steps of: block-decomposing the inversely quantized residual signal and converting it to the original order.