JP4709189B2

JP4709189B2 - Scalable video encoding method, scalable video encoding device, program thereof, and computer-readable recording medium recording the program

Info

Publication number: JP4709189B2
Application number: JP2007190880A
Authority: JP
Inventors: 和也早瀬; 幸浩坂東; 誠之高村; 一人上倉; 由幸八島
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2007-07-23
Filing date: 2007-07-23
Publication date: 2011-06-22
Anticipated expiration: 2027-07-23
Also published as: JP2009027610A

Description

本発明は，スケーラブル動画像符号化技術に関し，特に主観画質を損なわずに符号量を削減できるようにした効果的なプレフィルタを実現するスケーラブル動画像符号化の技術に関する。 The present invention relates to a scalable video encoding technique, particularly to a scalable video encoding technique to achieve effective prefilter you can reduce the code amount without impairing the subjective image quality.

従来から，動画像の符号化において，人間の視覚特性を利用することによって，符号化効率を向上させる研究がなされている。この研究では，人間の視覚系が持つ時空間周波数に対する視覚感度を考慮して，劣化が目立ちにくい部分の情報量を落とす試みがなされている。 Conventionally, studies have been made to improve coding efficiency by using human visual characteristics in moving picture coding. In this research, an attempt is made to reduce the amount of information that is difficult to degrade, taking into account the visual sensitivity to the spatio-temporal frequency of the human visual system.

その代表的な例として，符号化処理前段に置かれるプレフィルタ処理がある。画像内には，視覚的劣化が分かりにくい領域と分かりやすい領域とが存在する。そこで，視覚的に劣化が分かりにくい領域の空間高周波成分の情報を，低域通過フィルタを用いてあらかじめ欠落させることで，主観画質を下げずに符号量を削減させることができる。ここでカギとなるのは，如何にして各領域の視覚的劣化の分かりやすさを算出し，適用する低域通過フィルタの帯域制限幅を制御するか，という点にある。 As a typical example, there is a pre-filter process that is placed before the encoding process. In the image, there are an area where visual deterioration is difficult to understand and an area where it is easy to understand. Therefore, the amount of code can be reduced without lowering the subjective image quality by deleting information on spatial high-frequency components in a region where deterioration is difficult to visually recognize using a low-pass filter in advance. The key here is how to calculate the visibility of visual degradation in each region and control the bandwidth limit of the low-pass filter to be applied.

非特許文献１に記載された動画像符号化方式では，初期値として与えられた低域通過フィルタを適用し，マクロブロックごとに非適用の場合と比較したＰＳＮＲ（Peak Signal to Noise Ratio）を算出し，経験則によって作成したＰＳＮＲとフィルタの帯域制限幅との対応関係を格納したテーブルを参照することで，低域通過フィルタの強度を制御している。 In the moving picture coding method described in Non-Patent Document 1, a low-pass filter given as an initial value is applied, and a PSNR (Peak Signal to Noise Ratio) compared with the case of non-application is calculated for each macroblock. The strength of the low-pass filter is controlled by referring to a table that stores the correspondence relationship between the PSNR and the band limit width of the filter created based on empirical rules.

特許文献１に記載された高能率動画像符号化方式では，入力動画像信号の動きベクトルのノルムから動き量を算出し，動き量の値に応じて２次元空間フィルタを施すことにより，視覚的な劣化を最小限に抑えながら情報を削減している。 In the high-efficiency video coding method described in Patent Document 1, the amount of motion is calculated from the norm of the motion vector of the input video signal, and a two-dimensional spatial filter is applied according to the value of the amount of motion. Information is reduced while minimizing unnecessary degradation.

一方，現在，Ｈ．２６４／ＡＶＣのスケーラブル拡張方式であるＪＳＶＣ（Joint Scalable Video Coding ）の標準化が，ＩＳＯとＩＴＵ−Ｔの合同団体であるＪＶＴにおいて取り組まれている。ＪＳＶＣでは，空間スケーラビリティをサポートするために，以下に示すレイヤ構造をとる。まず，サポートしたい解像度の画像を原画像からの縮小処理によりあらかじめ作成する。次に，１つの解像度を１つのレイヤとみなし，各レイヤに対して非スケーラブルのＨ．２６４／ＡＶＣを適用する。最後に，得られた各レイヤの符号化ストリームを統合する。ここで，最も低解像度の画像を処理するレイヤを基本レイヤ，それより上のレイヤを拡張レイヤと呼ぶ。 On the other hand, H. Standardization of JSVC (Joint Scalable Video Coding), which is a scalable extension method of H.264 / AVC, is being tackled by JVT, a joint organization of ISO and ITU-T. In JSVC, the following layer structure is adopted in order to support spatial scalability. First, an image with a resolution to be supported is created in advance by reduction processing from the original image. Next, one resolution is regarded as one layer, and a non-scalable H.264 format is used for each layer. H.264 / AVC is applied. Finally, the obtained encoded streams of each layer are integrated. Here, the layer that processes the image with the lowest resolution is called the basic layer, and the layer above it is called the extension layer.

非特許文献２に示されたＪＳＶＣ符号化参照ソフトウェアのＪＳＶＭでは，下位レイヤから順に符号化を行っていく。このとき，拡張レイヤではレイヤ間の冗長性を除去するためにレイヤ間予測を行う。具体的には，動きベクトルや残差信号など動き予測に関する情報を直下レイヤのそれから予測する動き情報予測，および直下レイヤにおける内部復号信号をアップサンプルしたものを予測参照とするテクスチャ予測を行う。 In JSVM of JSVC encoding reference software shown in Non-Patent Document 2, encoding is performed in order from the lower layer. At this time, the enhancement layer performs inter-layer prediction in order to remove redundancy between layers. More specifically, motion information prediction for predicting information related to motion prediction such as motion vectors and residual signals from the immediately lower layer, and texture prediction using an upsampled internal decoded signal in the immediately lower layer as prediction reference are performed.

非特許文献３で提案されているＪＳＶＣプレフィルタ処理では，視覚的劣化の目立ちやすさを視覚的歪み量として算出し，その視覚的歪み量の値に応じて，直下レイヤにより生成されたテクスチャ予測の参照信号を原信号に当てはめることにより，レイヤ間の冗長性除去を行っている。 In the JSVC pre-filter processing proposed in Non-Patent Document 3, the conspicuousness of visual degradation is calculated as a visual distortion amount, and the texture prediction generated by the immediately lower layer is calculated according to the visual distortion amount value. Is applied to the original signal to eliminate redundancy between layers.

なお，非特許文献４には，本発明の実施例で用いることができる時空間の周波数成分とコントラスト感度との関係性を示す空間コントラスト感度関数の例が記載されている。また，非特許文献５には，ＪＭとしてよく知られている動画像の非スケーラブル符号化方式の技術が記載されている。
三反崎暁経，小野尚紀，上倉一人，八島由幸：“フレーム内及びフレーム間における画質変動を考慮した適応型プレフィルタの検討，”信学技報，CS2006-5，IE2006-118，Dec.2006. J.Reichel ，H.Schwarz and M.Wien：“Joint Scalable Video Model JSVM-8.0 ，”ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6 ，JVT-U202，2006. 早瀬和也，坂東幸浩，高村誠之，上倉一人，八島由幸：“レイヤ間予測を利用したＪＳＶＣ空間拡張レイヤプレフィルタ，”信学総大，Mar.2007. N.B.Nill：“A visual model weighted cosine transform for image compression and quality assessment ，”IEEE Trans. Commun. ，Vol. COM-33 ，No. 12，pp.551-557，June 1985. K.P.Lim ，G.Sullivan and T.Wiegand：“Text Description of Joint model Reference Encoding Methods and Decoding Concealment Methods ，”Joint Video Team（JVT ）of ISO/IEC MPEG and ITU-T VCEG，JVT-R95 ，Jan. 2006. 特開平６−６２３９２号公報 Non-Patent Document 4 describes an example of a spatial contrast sensitivity function indicating the relationship between the spatio-temporal frequency component and the contrast sensitivity that can be used in the embodiment of the present invention. Non-Patent Document 5 describes a technique of a non-scalable encoding method for moving images, which is well known as JM.
Sankeizaki, K., Ono, N., Uekura, Y., Yashima, Y .: "Examination of adaptive pre-filter considering image quality variation within and between frames," IEICE Tech. Reports, CS2006-5, IE2006-118, Dec. 2006. J. Reichel, H. Schwarz and M. Wien: “Joint Scalable Video Model JSVM-8.0,” ISO / IEC JTC1 / SC29 / WG11 and ITU-T SG16 Q.6, JVT-U202, 2006. Kazuya Hayase, Yukihiro Bando, Noriyuki Takamura, Hitoshi Uekura, Yoshiyuki Yashima: “JSVC Spatial Extension Layer Prefilter Using Inter-layer Prediction,” IEICE, Mar. 2007. NBNill: “A visual model weighted cosine transform for image compression and quality assessment,” IEEE Trans. Commun., Vol. COM-33, No. 12, pp.551-557, June 1985. KPLim, G. Sullivan and T. Wiegand: “Text Description of Joint model Reference Encoding Methods and Decoding Concealment Methods,” Joint Video Team (JVT) of ISO / IEC MPEG and ITU-T VCEG, JVT-R95, Jan. 2006. JP-A-6-62392

画像信号の単純な二乗誤差の値は，主観画質としばしば乖離する。ブロック内の差分信号を周波数変換した場合，上記のブロック内の信号の単純な二乗誤差値は，各空間周波数の視覚劣化に及ぼす寄与が等価とみなしていることになる。しかし，一般的に画像内の空間高周波成分に対する人間のコントラスト感度は，小さいことが知られている。つまり，差分信号の持つ空間高周波成分は，視覚劣化に対する寄与が低周波に比べて小さいと言い換えられる。仮に，高周波成分を多く持つ画像と持たない２つの画像のＰＳＮＲの値が同じであると仮定した場合，高周波成分を多く持つ画像の方が視覚的劣化が目立ちにくくなる。 The value of the simple square error of the image signal often deviates from the subjective image quality. When the frequency of the difference signal in the block is converted, the simple square error value of the signal in the block is regarded as equivalent to the contribution to visual degradation of each spatial frequency. However, it is generally known that human contrast sensitivity with respect to spatial high-frequency components in an image is small. In other words, the spatial high-frequency component of the differential signal has a small contribution to visual degradation compared to the low frequency. If it is assumed that the PSNR value of an image having a high frequency component is the same as that of two images having no high frequency component, the image having a high frequency component is less noticeable in visual deterioration.

前述した非特許文献１は，ブロック内の信号の二乗誤差値の平均に基づいて劣化の目立ちやすさを算出し，帯域通過幅を導出している。しかし，この二乗誤差の値が主観画質と乖離してしまう領域に対しては，適正な帯域通過幅を適用することができない。また，特許文献１では空間内の劣化の目立ちやすさを帯域制限幅の制御に使用していないため，空間高周波成分の削減の余地を残す。 Non-Patent Document 1 described above calculates the conspicuousness of deterioration based on the average of the square error values of the signals in the block, and derives the band pass width. However, an appropriate bandpass width cannot be applied to a region where the square error value deviates from the subjective image quality. Further, in Patent Document 1, since the ease of conspicuous deterioration in space is not used for controlling the bandwidth limit width, there remains room for reduction of spatial high-frequency components.

また，非特許文献３で提案されるスケーラブル符号化のためのプレフィルタは，下位レイヤからのテクスチャ予測信号で原信号を置換するか否かの２通りの自由度しか持ち合わせていない。置換しないと判定された場合には，原信号がそのまま後段の符号化処理に入力されるが，この原信号内には除去することができる空間高周波成分が残存している可能性が高い。この残存する空間高周波成分の除去のために，非特許文献１を適用することができるが，同様に前述したＰＳＮＲと主観画質との乖離の問題に直面してしまう。 In addition, the pre-filter for scalable coding proposed in Non-Patent Document 3 has only two degrees of freedom whether to replace the original signal with the texture prediction signal from the lower layer. If it is determined not to be replaced, the original signal is directly input to the subsequent encoding process, but there is a high possibility that a spatial high-frequency component that can be removed remains in the original signal. Non-patent document 1 can be applied to remove the remaining spatial high-frequency components, but similarly, the aforementioned problem of the difference between PSNR and subjective image quality is encountered.

本発明は，かかる事情に鑑みてなされたものであって，原画像とフィルタリング画像との間の視覚的劣化の目立ちやすさを示す視覚的歪み量に応じて，直下レイヤにおける内部復号信号のアップサンプル信号で置換するフィルタ処理と通常の低域通過フィルタ処理とを適応的に行い，さらに低域通過フィルタを使用する場合に，視覚的歪み量の値に応じて低域通過フィルタの帯域制限幅を決定してフィルタリングを行う動画像符号化器の設計法を確立することを目的とする。 The present invention has been made in view of such circumstances, and it is possible to increase the internal decoded signal in the immediately lower layer according to the amount of visual distortion indicating the conspicuous visual degradation between the original image and the filtered image. When the filter processing to replace with the sample signal and the normal low-pass filter processing are adaptively performed, and the low-pass filter is used, the band-limiting width of the low-pass filter according to the value of the visual distortion amount The purpose is to establish a design method of a moving image encoder that performs filtering by determining the above.

レイヤ構造をもつスケーラブル符号化における本発明のプレフィルタ処理は，以下に記す２つのステップで構成される。拡張レイヤの原信号に対して，任意のサイズの小領域単位で以下の処理を適用する。 The prefiltering process of the present invention in scalable coding with a layer structure is composed of the following two steps. The following processing is applied to the original signal of the enhancement layer in units of small areas of any size.

〔Ｓｔｅｐ１：直下レイヤテクスチャ予測信号による置換判定〕
拡張レイヤの原信号における視覚的劣化が目立ちにくい領域についてのみ，直下レイヤからのテクスチャ予測信号（直下レイヤの復号画像信号をアップサンプルした画像信号）で置換する。手順は，非特許文献３のプレフィルタ手順と同じである。 [Step 1: Replacement determination using the immediately below layer texture prediction signal]
Only a region where visual deterioration in the original signal of the enhancement layer is not conspicuous is replaced with a texture prediction signal from the immediately lower layer (an image signal obtained by up-sampling the decoded image signal of the immediately lower layer). The procedure is the same as the prefilter procedure in Non-Patent Document 3.

拡張レイヤの原信号と直下レイヤのテクスチャ予測信号との間の視覚的歪み量Ｄ₀を計算する。視覚的歪み量は，視覚系のコントラスト感度を用いて算出する劣化の目立ちやすさを表す尺度であり，値が小さいほど目立ちにくいということを示す。視覚的歪み量の算出手順は後述する。 A visual distortion amount D ₀ between the original signal of the enhancement layer and the texture prediction signal of the immediately lower layer is calculated. The amount of visual distortion is a measure of the degree of conspicuous deterioration calculated using the contrast sensitivity of the visual system, and the smaller the value, the less noticeable it is. The calculation procedure of the visual distortion amount will be described later.

算出された視覚的歪み量Ｄ₀が閾値Ｄ_th以下であれば，原信号を直下レイヤの補間信号であるテクスチャ予測信号で置換する。Ｄ₀＞Ｄ_thであれば，Ｓｔｅｐ２に移る。閾値Ｄ_thは，劣化が知覚されない，もしくは，気にならない視覚的歪み量の許容値であり，外部から与えられる。閾値Ｄ_thの与え方の一例としては，ディスプレイのコントラストや明るさから経験的に決定する方法が考えられる。 If the calculated visual distortion amount D ₀ is equal to or less than the threshold value D _th , the original signal is replaced with a texture prediction signal that is an interpolation signal of the immediately lower layer. If D ₀ > D _th , the process proceeds to Step 2. The threshold value D _th is an allowable value of the visual distortion amount in which deterioration is not perceived or is not worrisome, and is given from the outside. As an example of how to give the threshold value _Dth , a method of empirically determining from the contrast and brightness of the display can be considered.

〔Ｓｔｅｐ２：低域通過フィルタ信号による置換判定〕
Ｓｔｅｐ１において，拡張レイヤの原信号と直下レイヤのテクスチャ予測信号との間の視覚的歪み量Ｄ₀が閾値Ｄ_thを超える場合には，当該ブロックの原信号に対して低域通過フィルタを施す。 [Step 2: Replacement determination by low-pass filter signal]
In Step 1, when the visual distortion amount D ₀ between the enhancement layer original signal and the texture prediction signal of the immediately lower layer exceeds the threshold D _th , a low-pass filter is applied to the original signal of the block.

あらかじめ，帯域通過幅の異なる低域通過フィルタｆ_nを，Ｎパターン用意する。ここでは，帯域制限が強い順に，ｎ＝１，... ，Ｎとする。Ｎパターンの与え方は任意だが，帯域制限によって減少するエネルギーの変動量が均等になるように設定するのが望ましい。 Previously, different low-pass filter f _n of the band pass width and N patterns prepared. Here, it is assumed that n = 1,... Although the method of giving the N pattern is arbitrary, it is desirable to set the fluctuation amount of energy that is reduced by the band limitation to be equal.

次に，帯域制限が強いフィルタから順に，当該ブロックの原信号に対してフィルタリングを施し，そのつど視覚的歪み量Ｄ_nを算出する。Ｄ_n≦Ｄ_thとなった時点で，そのときのフィルタリング信号で原信号を置換する。Ｄ_N＞Ｄ_thの場合には，原信号をそのまま出力する。ここで使用する閾値Ｄ_thは，Ｓｔｅｐ１で使用した閾値と同じ値を用いる。 Next, filtering is performed on the original signal of the block in order from the filter with the strong band limitation, and the visual distortion amount D _n is calculated each time. When D _n ≦ D _th , the original signal is replaced with the filtering signal at that time. When D _N > D _th , the original signal is output as it is. The threshold value D _th used here is the same value as the threshold value used in Step 1.

ちなみに，参考として，レイヤ構造を持たない非スケーラブル符号化の場合には，Ｓｔｅｐ１を行わずにＳｔｅｐ２に記した低域通過フィルタ信号による置換判定のみを行えば，視覚的歪み量の値に応じたプレフィルタ処理となる。本発明のレイヤ構造を持つスケーラブル符号化の場合には，拡張レイヤの原信号に対してはＳｔｅｐ１とＳｔｅｐ２の両方を実行し，基本レイヤの原信号に対してはＳｔｅｐ１を行わずにＳｔｅｐ２のみを実行すればよい。 Incidentally, as a reference , in the case of non-scalable coding that does not have a layer structure, if only the replacement determination by the low-pass filter signal described in Step 2 is performed without performing Step 1, it corresponds to the value of the visual distortion amount. Pre-filter processing. In the case of scalable coding having the layer structure of the present invention, both Step 1 and Step 2 are executed for the enhancement layer original signal, and only Step 2 is not performed for the base layer original signal. Just do it.

また，フレーム画像（フィールド画像を含む）ごとに外部から与えられるプレフィルタ適用フレーム指定情報，またはあらかじめ定めたフィルタ適用の判定条件により，フィルタを適用するか否かを判定し，その判定結果にしたがって，上記フィルタの適用を制御してもよい。フィルタ適用の判定条件の一例としては，当該フレームが動き予測の被参照フレームとなるなら非適用，被参照フレームとならないなら適用，という条件が考えられる。例えば当該フレームが動き予測の被参照フレームとなる場合には，プレフィルタ適用フレーム指定情報として，フィルタを適用しないという指定情報を外部からフィルタ処理部に対して与え，当該フレームが動き予測の被参照フレームとならない場合に，プレフィルタ適用フレーム指定情報として，フィルタを適用するという指定情報を外部から与える。後者の場合にだけ，本発明を用いたプレフィルタ処理を実行する。 In addition, it is determined whether to apply a filter according to prefilter application frame designation information given from the outside for each frame image (including field images) or a predetermined filter application determination condition, and according to the determination result. The application of the filter may be controlled. As an example of the determination condition for applying a filter, there may be a condition of non-application when the frame is a referenced frame for motion prediction, and application when the frame is not a referenced frame. For example, when the frame is a reference frame for motion prediction, as pre-filter application frame designation information, designation information indicating that the filter is not applied is externally given to the filter processing unit, and the frame is referred to for motion prediction. When it is not a frame, designation information for applying a filter is given from the outside as pre-filter application frame designation information. Only in the latter case, pre-filter processing using the present invention is executed.

また，フレーム単位にフィルタを適用するか否かの指定情報を外部から与えたり，適用判定条件を設定するのではなく，スライス等のいくつかの小領域を含む画像部分領域単位で，フィルタを適用するか否かの指定や判定条件の設定を行ってもよい。 Also, the filter is applied in units of image partial areas including several small areas such as slices, rather than externally specifying information on whether to apply the filter in units of frames or setting application judgment conditions. It may be possible to specify whether or not to perform determination and to set determination conditions.

［視覚的歪み量の算出方法］
前述したように，画像内の空間高周波成分に対する人間のコントラスト感度は，小さいことが知られている。非特許文献４では，この時空間の周波数成分とコントラスト感度との関係性を，空間コントラスト感度関数として示している。この関数は，各周波数成分に対するコントラスト感度を与え，最大のコントラスト感度に対する相対値が導き出される。本発明では，このコントラスト感度を各空間周波数が持つ視覚劣化への寄与率と想定する。原信号と比較信号との差分信号を周波数領域に展開し，コントラスト感度を各空間周波数のエネルギーに重み付けし，その重み付けされたエネルギーの総和を視覚的歪み量とみなす。以下に算出手順を詳説する。 [Visual distortion calculation method]
As described above, it is known that human contrast sensitivity with respect to spatial high-frequency components in an image is small. Non-Patent Document 4 shows the relationship between the time-space frequency component and the contrast sensitivity as a spatial contrast sensitivity function. This function gives the contrast sensitivity for each frequency component, and the relative value for the maximum contrast sensitivity is derived. In the present invention, this contrast sensitivity is assumed to be a contribution rate to visual degradation of each spatial frequency. The difference signal between the original signal and the comparison signal is developed in the frequency domain, the contrast sensitivity is weighted to the energy of each spatial frequency, and the sum of the weighted energy is regarded as the visual distortion amount. The calculation procedure will be described in detail below.

拡張レイヤの原信号とテクスチャ予測信号との間の視覚的歪み量Ｄ₀，および拡張レイヤの原信号と低域通過フィルタｆ_nによる低域通過フィルタ信号との間の視覚的歪み量Ｄ_nは，次のように求める。 Visual distortion amount D _n between the visual distortion amount D _0, and the original signal and the low-pass filter signal by low-pass filter f _n of the enhancement layer between the original signal and the texture prediction signal enhancement layers , As follows.

ここで，Ｆ₀（ｕ，ｖ）は拡張レイヤの原信号とテクスチャ予測信号との差分信号に対する空間周波数（ｕ，ｖ）のフーリエ係数であり，Ｆ_n（ｕ，ｖ）は拡張レイヤの原信号と低域通過フィルタｆ_nによる低域通過フィルタ信号との差分信号に対する空間周波数（ｕ，ｖ）のフーリエ係数である。ｗ（ｕ，ｖ）はそのフーリエ係数に対する重み係数であり，空間周波数に対するコントラスト感度を指す。この値は，空間コントラスト感度関数に基づいて，次式のように算出する。一例として，非特許文献４に記されているものを挙げるが，使用可能なコントラスト感度関数はこれに限らない。 Here, F ₀ (u, v) is a Fourier coefficient of the spatial frequency (u, v) for the difference signal between the enhancement layer original signal and the texture prediction signal, and F _n (u, v) is the enhancement layer original. It is a Fourier coefficient of the spatial frequency (u, v) for the difference signal between the signal and the low-pass filter signal by the low-pass filter f _n . w (u, v) is a weighting coefficient for the Fourier coefficient, and indicates the contrast sensitivity with respect to the spatial frequency. This value is calculated as follows based on the spatial contrast sensitivity function. An example is described in Non-Patent Document 4, but the usable contrast sensitivity function is not limited to this.

ｗ（η）＝（０．２＋０．４５η）ｅｘｐ（−０．１８η） … 式（３）
ここで，ηと（ｕ，ｖ）とは，次に示すような関係にある。 w (η) = (0.2 + 0.45η) exp (−0.18η) (3)
Here, η and (u, v) have the following relationship.

θは，縦幅ｈの画像を視距離ａｈで観測する場合のＭ×Ｍブロックの視野角であり，次のように与えられる。 θ is a viewing angle of an M × M block when an image having a vertical width h is observed at a viewing distance ah, and is given as follows.

視距離ａｈは，外部から与えられる。視距離ａｈの与え方の一例としては，主観画質評価の前提条件に応じて決定する方法や，想定される視聴環境に応じて決定する方法などが考えられる。 The viewing distance ah is given from the outside. As an example of how to provide the viewing distance ah, a method of determining according to a precondition for subjective image quality evaluation, a method of determining according to an assumed viewing environment, and the like can be considered.

本発明により，主観画質を劣化させずに，可能な限り空間高周波成分を削減する低域通過フィルタの最適な帯域制限幅を，画像内の局所領域ごとに決定することができる。 The present invention, without degrading the subjective image quality, the optimum band-limited width of the low-pass filter to reduce the high spatial frequency components as possible, can be determined for each local region in an image.

また，レイヤ間の情報の冗長性の削減と空間内の情報の冗長性の削減とを，同時に実現することができる。 Also, a redundancy reduction of information in the reduction and space information redundancy between Layer, can be achieved simultaneously.

［処理の流れ］
本発明のスケーラブル符号化処理の実施形態について，図１を参照して説明する。 [Process flow]
An embodiment of the scalable encoding process of the present invention will be described with reference to FIG.

ステップＳ１１：当該符号化対象レイヤが基本レイヤであるか否かの判定処理を行い，真の場合にはステップＳ１２の処理を行い，偽の場合にはステップＳ１３の処理を行う。 Step S11: A process of determining whether or not the current encoding target layer is a base layer is performed. If true, the process of step S12 is performed. If false, the process of step S13 is performed.

ステップＳ１２：基本レイヤにおける原信号を読み込み，基本レイヤ向けのプレフィルタ処理を行い，生成されたプレフィルタ信号を出力する。本処理については，後述の図３にて詳説する。 Step S12: Read the original signal in the base layer, perform prefilter processing for the base layer, and output the generated prefilter signal. This process will be described in detail later with reference to FIG.

ステップＳ１３：当該拡張レイヤにおける原信号を読み込み，拡張レイヤ向けのプレフィルタ処理を行い，生成されたプレフィルタ信号を出力する。本処理については，後述の図４にて詳説する。 Step S13: Read the original signal in the enhancement layer, perform prefilter processing for the enhancement layer, and output the generated prefilter signal. This process will be described in detail later with reference to FIG.

ステップＳ１４：ステップＳ１２の処理またはステップＳ１３の処理によって生成されたプレフィルタ信号を読み込み，当該符号化対象レイヤにおける符号化処理を実施し，当該符号化対象レイヤの符号化情報を出力する。この符号化処理の一例としては，非特許文献２に示されるＪＳＶＭが挙げられる。 Step S14: The prefilter signal generated by the process of step S12 or the process of step S13 is read, the encoding process in the encoding target layer is performed, and the encoding information of the encoding target layer is output. An example of this encoding process is JSVM shown in Non-Patent Document 2.

ステップＳ１５：すべての符号化対象レイヤの符号化処理が完了しているか否かの判定処理を行い，真の場合にはステップＳ１７の処理を行い，偽の場合にはステップＳ１６の処理を行う。 Step S15: A process for determining whether or not all the encoding target layers have been encoded is performed. If true, the process of step S17 is performed. If false, the process of step S16 is performed.

ステップＳ１６：次の符号化対象レイヤに移り，新しい符号化対象レイヤに対してステップＳ１１の処理を行う。 Step S16: Move to the next encoding target layer, and perform the process of step S11 on the new encoding target layer.

ステップＳ１７：各レイヤの符号化情報を多重化し，符号化ストリームとして出力する。 Step S17: The encoded information of each layer is multiplexed and output as an encoded stream.

本発明の関連技術として，非スケーラブル符号化処理の場合の例について，図２を参照して説明する。 An example of non-scalable encoding processing will be described with reference to FIG. 2 as a related technique of the present invention.

ステップＳ２１：原信号を読み込み，プレフィルタ処理を行い，生成されたプレフィルタ信号を出力する。本処理については，後述の図３にて詳説する。 Step S21: Read the original signal, perform prefilter processing, and output the generated prefilter signal. This process will be described in detail later with reference to FIG.

ステップＳ２２：ステップＳ２１によって生成されたプレフィルタ信号を読み込み，符号化処理を実施し，符号化情報を出力する。この符号化処理の一例としては，非特許文献２に示されるＪＳＶＭや，非特許文献５に示されるＪＭが挙げられる。 Step S22: The prefilter signal generated in step S21 is read, encoding processing is performed, and encoding information is output. Examples of this encoding process include JSVM shown in Non-Patent Document 2 and JM shown in Non-Patent Document 5.

図１に示すステップＳ１２の処理（スケーラブル符号化処理における基本レイヤ向けのプレフィルタ処理），および図２に示すステップＳ２１の処理（非スケーラブル符号化処理におけるプレフィルタ処理）の詳細を，図３を用いて説明する。 Details of the processing in step S12 shown in FIG. 1 (prefilter processing for the base layer in the scalable encoding processing) and the processing in step S21 shown in FIG. 2 (prefiltering processing in the non-scalable encoding processing) are shown in FIG. It explains using.

ステップＳ３１：外部から与えられるプレフィルタ適用フレーム指定情報を読み込み，当該フレームがプレフィルタ適用フレームか否かの判定処理を行い，真の場合にはステップＳ３２の処理に移り，偽の場合にはステップＳ３３の処理に移る。プレフィルタ適用フレーム指定情報とは，上記のフィルタを適用するか否かを指定する情報である。 Step S31: Read prefilter application frame designation information given from the outside, and determine whether or not the frame is a prefilter application frame. If true, the process proceeds to step S32; The process proceeds to S33. The pre-filter application frame designation information is information that designates whether or not to apply the above filter.

ステップＳ３２：当該フレームの原信号を読み込み，低域通過フィルタを用いた小領域ごとの置換処理を行い，当該フレームのプレフィルタ信号を出力する。本処理については，後述の図５にて詳説する。 Step S32: Read the original signal of the frame, perform a replacement process for each small area using a low-pass filter, and output a prefilter signal of the frame. This process will be described in detail later with reference to FIG.

ステップＳ３３：すべてのフレームに対してプレフィルタ処理を行ったか否かの判定処理を行い，真の場合にはプレフィルタ信号を出力し，偽の場合にはステップＳ３４の処理に移る。なお，基本レイヤ向けのプレフィルタ処理では，出力されるプレフィルタ信号は，基本レイヤのプレフィルタ信号である。 Step S33: A determination process is performed to determine whether or not pre-filter processing has been performed on all frames. If true, a pre-filter signal is output. If false, the process proceeds to step S34. In the prefiltering process for the base layer, the output prefilter signal is a base layer prefilter signal.

ステップＳ３４：次のフレームに移り，新しいフレームに対してステップＳ３１の処理を行う。 Step S34: Move to the next frame and perform the process of step S31 on the new frame.

図１に示すステップＳ１３の処理（スケーラブル符号化処理における拡張レイヤ向けのプレフィルタ処理）の詳細を，図４を用いて説明する。 Details of the processing in step S13 shown in FIG. 1 (prefiltering processing for the enhancement layer in scalable coding processing) will be described with reference to FIG.

ステップＳ４１：外部から与えられるプレフィルタ適用フレーム指定情報を読み込み，当該フレームがプレフィルタ適用フレームか否かの判定処理を行い，真の場合にはステップＳ４２の処理に移り，偽の場合にはステップＳ４３の処理に移る。 Step S41: Read prefilter application frame designation information given from the outside, and determine whether or not the frame is a prefilter application frame. If true, the process proceeds to step S42; The process proceeds to S43.

ステップＳ４２：当該フレームの原信号を読み込み，直下レイヤテクスチャ予測信号や低域通過フィルタによるフィルタ信号で，小領域ごとに適応的に置換する処理を行い，当該フレームのプレフィルタ信号を出力する。本処理については，後述の図６にて詳説する。 Step S42: Read the original signal of the frame, perform an adaptive replacement process for each small region with the filter signal from the immediately lower layer texture prediction signal or the low-pass filter, and output the pre-filter signal of the frame. This process will be described in detail with reference to FIG.

ステップＳ４３：当該拡張レイヤにおけるすべてのフレームに対してプレフィルタ処理を行ったか否かの判定処理を行い，真の場合には当該拡張レイヤのプレフィルタ信号を出力し，偽の場合にはステップＳ４４の処理に移る。 Step S43: A determination process is performed to determine whether or not prefiltering has been performed on all frames in the enhancement layer. If true, the prefilter signal of the enhancement layer is output. If false, step S44. Move on to processing.

ステップＳ４４：次のフレームに移り，新しいフレームに対してステップＳ４１の処理を行う。 Step S44: Move to the next frame and perform the process of step S41 on the new frame.

図３に示すステップＳ３２の処理（低域通過フィルタ信号による置換処理）の詳細を，図５を用いて説明する。 Details of the processing in step S32 shown in FIG. 3 (replacement processing using a low-pass filter signal) will be described with reference to FIG.

ステップＳ５１：外部から与えられる視覚的歪み量閾値Ｄ_thを読み込み，その値をレジスタに書き込む。 Step S51: The visual distortion amount threshold value _Dth given from the outside is read and the value is written in the register.

ステップＳ５２：当該プレフィルタ対象フレームを任意の小領域に分割し，各小領域の原信号をバッファに書き込む。 Step S52: The prefilter target frame is divided into arbitrary small areas, and the original signal of each small area is written in the buffer.

ステップＳ５３：当該小領域に対して適用する当該低域通過フィルタｆ_nのフィルタ係数を読み込み，その係数値をレジスタに書き込む。 Step S53: The filter coefficient of the low-pass filter f _n applied to the small area is read, and the coefficient value is written in the register.

ステップＳ５４：当該低域通過フィルタｆ_nのフィルタ係数をレジスタから読み込み，そのフィルタ係数によって構成される低域通過フィルタを当該小領域の原信号に対して施し，得られた当該小領域の低域通過フィルタ信号をバッファに書き込む。 Step S54: The filter coefficient of the low-pass filter f _n is read from the register, the low-pass filter constituted by the filter coefficient is applied to the original signal of the small area, and the low band of the small area obtained is obtained. Write the pass filter signal to the buffer.

ステップＳ５５：当該小領域の原信号および低域通過フィルタｆ_nによる低域通過フィルタ信号をバッファから読み込み，それらの差分信号を生成し，その差分信号に対してフーリエ変換を適用し，そのフーリエ変換の各周波数成分に対してコントラスト感度関数を用いて重み付けを行い，その重み付けされた各周波数成分の総和をとることにより両信号の間の視覚的歪み量Ｄ_nを算出し，その値をレジスタに書き込む。 Step S55: Read the low-pass filter signal by the original signal of the small region and the low-pass filter f _n from the buffer, generate a difference signal thereof, apply Fourier transform to the difference signal, and perform the Fourier transform performs weighting using the contrast sensitivity function on each frequency component of, calculating the visual distortion amount D _n between the two signals by taking the sum of the frequency components thereof weighting, the value in the register Write.

ステップＳ５６：当該小領域における当該低域通過フィルタｆ_nを使用して算出された視覚的歪み量Ｄ_nと，視覚的歪み量閾値Ｄ_thとをレジスタから読み込み，Ｄ_n≦Ｄ_thの場合にはステップＳ５７の処理に移り，Ｄ_n＞Ｄ_thの場合にはステップＳ５８の処理に移る。 Step S56: The visual distortion amount D _n calculated using the low-pass filter f _n in the small region and the visual distortion amount threshold value D _th are read from the register, and when D _n ≦ D _th Moves to the process of step S57, and if D _n > D _th , moves to the process of step S58.

ステップＳ５７：当該小領域の当該低域通過フィルタｆ_nによる低域通過フィルタ信号を読み込み，当該小領域の原信号と置き換え，プレフィルタ信号としてバッファに書き込む。 Step S57: reads the low-pass filter signal by the low-pass filter f _n of the small area, replacing the original signal of the small region, write to the buffer as a pre-filter signal.

ステップＳ５８：当該小領域に対してすでにすべての低域通過フィルタを適用したか否かの判定処理を行い，真の場合にはステップＳ６０の処理に移り，偽の場合にはステップＳ５９の処理に移る。 Step S58: It is determined whether or not all the low-pass filters have already been applied to the small area. If true, the process proceeds to step S60. If false, the process proceeds to step S59. Move.

ステップＳ５９：次の低域通過フィルタに移り，新しい低域通過フィルタを用いて，ステップＳ５３の処理を行う。 Step S59: The process proceeds to the next low-pass filter, and the process of step S53 is performed using the new low-pass filter.

ステップＳ６０：当該フレームにおけるすべての小領域に対して置換判定処理を実行したか否かの判定処理を行い，真の場合にはプレフィルタ信号を出力する。偽の場合には，ステップＳ６１の処理に移る。 Step S60: A determination process is performed as to whether or not the replacement determination process has been executed for all the small regions in the frame, and if true, a prefilter signal is output. If false, the process proceeds to step S61.

ステップＳ６１：次の小領域に移り，新しい小領域に対して，ステップＳ５３の処理を行う。 Step S61: Move to the next small area and perform the process of step S53 on the new small area.

図４に示すステップＳ４２の処理（適応的置換処理）の詳細を，図６を用いて説明する。 Details of the process (adaptive replacement process) in step S42 shown in FIG. 4 will be described with reference to FIG.

ステップＳ７１：外部から与えられる視覚的歪み量閾値Ｄ_thを読み込み，その値をレジスタに書き込む。 Step S71: The visual distortion amount threshold value _Dth given from the outside is read and the value is written in the register.

ステップＳ７２：当該プレフィルタ対象フレームを任意の小領域に分割し，各小領域の原信号をバッファに書き込む。 Step S72: The prefilter target frame is divided into arbitrary small areas, and the original signal of each small area is written in the buffer.

ステップＳ７３：当該小領域の直下レイヤテクスチャ予測信号を，バッファから読み込む。 Step S73: The layer texture prediction signal directly under the small area is read from the buffer.

ステップＳ７４：当該小領域の原信号をバッファから読み込み，当該小領域の原信号と直下レイヤテクスチャ予測信号との差分信号を生成し，その差分信号に対してフーリエ変換を適用し，そのフーリエ変換の各周波数成分に対してコントラスト感度関数を用いて重み付けを行い，その重み付けされた各周波数成分の総和をとることにより両信号の間の視覚的歪み量Ｄ₀を算出し，その値をレジスタに書き込む。 Step S74: Read the original signal of the small region from the buffer, generate a difference signal between the original signal of the small region and the immediately below layer texture prediction signal, apply Fourier transform to the difference signal, Each frequency component is weighted using a contrast sensitivity function, and the visual distortion amount D ₀ between the two signals is calculated by taking the sum of the weighted frequency components, and the value is written to the register. .

ステップＳ７５：当該小領域における直下レイヤテクスチャ予測信号を使用して算出された視覚的歪み量Ｄ₀と，視覚的歪み量閾値Ｄ_thとをレジスタから読み込み，Ｄ₀≦Ｄ_thの場合にはステップＳ８３の処理に移り，Ｄ₀＞Ｄ_thの場合にはステップＳ７６の処理に移る。 Step S75: The visual distortion amount D ₀ and the visual distortion amount threshold value D _th calculated by using the immediately lower layer texture prediction signal in the small region are read from the register. If D ₀ ≦ D _th , step S75 is performed. The process proceeds to S83, and if D ₀ > D _th , the process proceeds to Step S76.

ステップＳ７６：当該小領域に対して適用する当該低域通過フィルタｆ_nのフィルタ係数を読み込み，その係数値をレジスタに書き込む。 Step S76: The filter coefficient of the low-pass filter f _n applied to the small area is read, and the coefficient value is written in the register.

ステップＳ７７：当該低域通過フィルタｆ_nのフィルタ係数をレジスタから読み込み，そのフィルタ係数によって構成される低域通過フィルタを当該小領域の原信号に対して施し，得られた当該小領域の低域通過フィルタ信号をバッファに書き込む。 Step S77: The filter coefficient of the low-pass filter f _n is read from the register, the low-pass filter constituted by the filter coefficient is applied to the original signal of the small area, and the low band of the small area obtained is obtained. Write the pass filter signal to the buffer.

ステップＳ７８：当該小領域の原信号および低域通過フィルタｆ_nによる低域通過フィルタ信号をバッファから読み込み，それらの差分信号を生成し，その差分信号に対してフーリエ変換を適用し，そのフーリエ変換の各周波数成分に対してコントラスト感度関数を用いて重み付けを行い，その重み付けされた各周波数成分の総和をとることにより両信号の間の視覚的歪み量Ｄ_nを算出し，その値をレジスタに書き込む。 Step S78: The original low-frequency signal and the low-pass filter signal by the low-pass filter f _n are read from the buffer, a difference signal between them is generated, a Fourier transform is applied to the difference signal, and the Fourier transform performs weighting using the contrast sensitivity function on each frequency component of, calculating the visual distortion amount D _n between the two signals by taking the sum of the frequency components thereof weighting, the value in the register Write.

ステップＳ７９：当該小領域における当該低域通過フィルタｆ_nを使用して算出された視覚的歪み量Ｄ_nと，視覚的歪み量閾値Ｄ_thとをレジスタから読み込み，Ｄ_n≦Ｄ_thの場合にはステップＳ８０の処理に移り，Ｄ_n＞Ｄ_thの場合にはステップＳ８１の処理に移る。 Step S79: The visual distortion amount D _n and the visual distortion amount threshold value D _th calculated using the low-pass filter f _n in the small region are read from the register, and when D _n ≦ D _th Moves to step S80, and if D _n > D _th , moves to step S81.

ステップＳ８０：当該小領域の当該低域通過フィルタｆ_nによる低域通過フィルタ信号を読み込み，当該小領域の原信号と置き換え，プレフィルタ信号としてバッファに書き込む。 Step S80: The low-pass filter signal by the low-pass filter f _{n in the} small region is read, replaced with the original signal in the small region, and written in the buffer as a pre-filter signal.

ステップＳ８１：当該小領域に対してすでにすべての低域通過フィルタを適用したか否かの判定処理を行い，真の場合にはステップＳ８４の処理に移り，偽の場合にはステップＳ８２の処理に移る。 Step S81: It is determined whether or not all the low-pass filters have already been applied to the small area. If true, the process proceeds to step S84. If false, the process proceeds to step S82. Move.

ステップＳ８２：次の低域通過フィルタに移り，新しい低域通過フィルタを用いて，ステップＳ７６の処理を行う。 Step S82: The process proceeds to the next low-pass filter, and the process of step S76 is performed using a new low-pass filter.

ステップＳ８３：当該小領域の直下レイヤテクスチャ予測信号を読み込み，当該小領域の原信号と置き換え，プレフィルタ信号としてバッファに書き込む。 Step S83: The layer texture prediction signal immediately under the small area is read, replaced with the original signal of the small area, and written into the buffer as a prefilter signal.

ステップＳ８４：当該フレームにおけるすべての小領域に対して置換判定処理を実行したか否かの判定処理を行い，真の場合には適応的に置換されたプレフィルタ信号を出力し，偽の場合にはステップＳ８５の処理に移る。 Step S84: It is determined whether or not replacement determination processing has been executed for all the small regions in the frame, and if true, an adaptively replaced prefilter signal is output, and if false, Moves to the process of step S85.

ステップＳ８５：次の小領域に移り，新しい小領域に対して，ステップＳ７３の処理を行う。 Step S85: Move to the next small area and perform the process of step S73 on the new small area.

［処理装置］
本実施の形態によるスケーラブル符号化装置の構成例を図７に示す。図７は，ｎ階層（但し，ｎ≧１）の空間スケーラビリティを持つ場合の符号化装置図である。 [Processing equipment]
An example of the configuration of the scalable coding apparatus according to this embodiment is shown in FIG. FIG. 7 is a diagram of an encoding apparatus in the case of having spatial scalability of n layers (where n ≧ 1).

原信号縮小処理部１０１：スケーラブル符号化の符号化対象である原信号を読み込み，空間スケーラビリティを持たせる所望の解像度を縮小処理により複数用意し，各レイヤの原信号記憶部（基本レイヤ原信号記憶部１０２，第１拡張レイヤ原信号記憶部１１０，... ，第ｎ拡張レイヤ原信号記憶部１１８）に出力する。任意の縮小方法の適用が可能であり，その一例としては，Lanczos 3-lobed sinc filter が挙げられる。 Original signal reduction processing unit 101: Reads an original signal which is an encoding target of scalable encoding, prepares a plurality of desired resolutions for providing spatial scalability by reduction processing, and generates an original signal storage unit (basic layer original signal storage) for each layer. Unit 102, first enhancement layer original signal storage unit 110,..., Nth enhancement layer original signal storage unit 118). Arbitrary reduction methods can be applied, an example of which is Lanczos 3-lobed sinc filter.

基本レイヤプレフィルタ処理部１０３：基本レイヤ原信号記憶部１０２から基本レイヤの原信号を読み込み，プレフィルタ処理を行い，プレフィルタ信号を基本レイヤプレフィルタ信号記憶部１０４に出力する。本処理装置については，後述の図９にて詳説する。 Base layer prefilter processing unit 103: Reads the base layer original signal from the base layer original signal storage unit 102, performs prefilter processing, and outputs the prefilter signal to the base layer prefilter signal storage unit 104. This processing apparatus will be described in detail later with reference to FIG.

基本レイヤ符号化部１０５：基本レイヤプレフィルタ信号記憶部１０４から基本レイヤのプレフィルタ信号を読み込み，基本レイヤの符号化処理を実施し，符号化情報を基本レイヤ符号化情報記憶部１０６に出力する。この符号化処理の一例としては，非特許文献２に示されるＪＳＶＭや，非特許文献５に示されるＪＭが挙げられる。 Base layer encoding unit 105: reads a base layer prefilter signal from base layer prefilter signal storage unit 104, performs base layer encoding processing, and outputs encoded information to base layer encoded information storage unit 106 . Examples of this encoding process include JSVM shown in Non-Patent Document 2 and JM shown in Non-Patent Document 5.

基本レイヤ内部復号部１０７：基本レイヤ符号化情報記憶部１０６から基本レイヤの符号化情報を読み込み，基本レイヤの符号化処理内において復号処理を行い，生成された復号信号をバッファに書き込む。 Base layer inner decoding unit 107: Reads base layer coding information from the base layer coding information storage unit 106, performs decoding processing within the base layer coding processing, and writes the generated decoded signal to a buffer.

基本レイヤテクスチャ予測信号生成部１０８：バッファから基本レイヤの復号信号を読み込み，補間フィルタを用いて第１拡張レイヤ上で使用するテクスチャ予測信号を生成し，生成されたテクスチャ予測信号を基本レイヤテクスチャ予測信号記憶部１０９に出力する。 Base layer texture prediction signal generation unit 108: reads a base layer decoded signal from the buffer, generates a texture prediction signal to be used on the first enhancement layer using an interpolation filter, and uses the generated texture prediction signal as a base layer texture prediction The signal is output to the signal storage unit 109.

第１拡張レイヤプレフィルタ処理部１１１：第１拡張レイヤ原信号記憶部１１０から第１拡張レイヤの原信号を読み込み，プレフィルタ処理を行い，プレフィルタ信号を第１拡張レイヤプレフィルタ信号記憶部１１２に出力する。本処理装置については，後述の図１０にて詳説する。 First enhancement layer prefilter processing unit 111: reads the first enhancement layer original signal from the first enhancement layer original signal storage unit 110, performs prefilter processing, and converts the prefilter signal into the first enhancement layer prefilter signal storage unit 112. Output to. This processing apparatus will be described in detail later with reference to FIG.

第１拡張レイヤ符号化部１１３：第１拡張レイヤプレフィルタ信号記憶部１１２から第１拡張レイヤのプレフィルタ信号を読み込み，拡張レイヤの符号化処理を実施する。また，このときテクスチャ予測に使用する参照信号を基本レイヤテクスチャ予測信号記憶部１０９から読み込む。得られた符号化情報を，第１拡張レイヤ符号化情報記憶部１１４に出力する。この符号化処理の一例としては，非特許文献２に示されるＪＳＶＭが挙げられる。 First enhancement layer encoding unit 113: Reads the first enhancement layer prefilter signal from the first enhancement layer prefilter signal storage unit 112, and performs enhancement layer encoding processing. At this time, a reference signal used for texture prediction is read from the base layer texture prediction signal storage unit 109. The obtained encoded information is output to first enhancement layer encoded information storage section 114. An example of this encoding process is JSVM shown in Non-Patent Document 2.

第１拡張レイヤ内部復号部１１５：第１拡張レイヤ符号化情報記憶部１１４から第１拡張レイヤの符号化情報を読み込み，第１拡張レイヤの符号化処理内において復号処理を行い，生成された復号信号をバッファに書き込む。 First enhancement layer inner decoding unit 115: reads the first enhancement layer coding information from the first enhancement layer coding information storage unit 114, performs the decoding process in the first enhancement layer coding process, and generates the generated decoding Write signal to buffer.

第１拡張レイヤテクスチャ予測信号生成部１１６：バッファから第１拡張レイヤの復号信号を読み込み，補間フィルタを用いて第２拡張レイヤ上で使用するテクスチャ予測信号を生成し，生成されたテクスチャ予測信号を第１拡張レイヤテクスチャ予測信号記憶部１１７に出力する。 First enhancement layer texture prediction signal generation unit 116: reads a decoded signal of the first enhancement layer from the buffer, generates a texture prediction signal to be used on the second enhancement layer using an interpolation filter, and generates the generated texture prediction signal The first enhancement layer texture prediction signal storage unit 117 outputs the result.

第ｎ拡張レイヤプレフィルタ処理部１１９：第ｎ拡張レイヤ原信号記憶部１１８から第ｎ拡張レイヤの原信号を読み込み，プレフィルタ処理を行い，プレフィルタ信号を第ｎ拡張レイヤプレフィルタ信号記憶部１２０に出力する。本処理装置については，後述の図１０にて詳説する。 N-th enhancement layer pre-filter processing unit 119: reads the n-th enhancement layer original signal from the n-th enhancement layer original signal storage unit 118, performs pre-filter processing, and converts the pre-filter signal into the n-th enhancement layer pre-filter signal storage unit 120. Output to. This processing apparatus will be described in detail later with reference to FIG.

第ｎ拡張レイヤ符号化部１２１：第ｎ拡張レイヤプレフィルタ信号記憶部１２０から第ｎ拡張レイヤのプレフィルタ信号を読み込み，拡張レイヤの符号化処理を実施する。また，このときテクスチャ予測に使用する参照信号を第ｎ−１拡張レイヤテクスチャ予測信号記憶部（図示省略）から読み込む。得られた符号化情報を，第ｎ拡張レイヤ符号化情報記憶部１２２に出力する。この符号化処理の一例としては，非特許文献２に示されるＪＳＶＭが挙げられる。 N-th enhancement layer encoding unit 121: reads an n-th enhancement layer pre-filter signal from the n-th enhancement layer pre-filter signal storage unit 120, and performs enhancement layer encoding processing. At this time, a reference signal used for texture prediction is read from the (n-1) th enhancement layer texture prediction signal storage unit (not shown). The obtained encoded information is output to the nth enhancement layer encoded information storage unit 122. An example of this encoding process is JSVM shown in Non-Patent Document 2.

符号化情報多重化部１２３：各レイヤの符号化情報記憶部（基本レイヤ符号化情報記憶部１０６，第１拡張レイヤ符号化情報記憶部１１４，... ，第ｎ拡張レイヤ符号化情報記憶部１２２）から符号化情報を読み込み，それらを多重化して符号化ストリームとして出力する。 Coding information multiplexing unit 123: Coding information storage unit of each layer (base layer coding information storage unit 106, first enhancement layer coding information storage unit 114,..., Nth enhancement layer coding information storage unit The encoded information is read from 122), multiplexed, and output as an encoded stream.

本発明の関連技術として，非スケーラブル符号化装置の構成例を図８に示す。 As a related technique of the present invention, a configuration example of a non-scalable encoding apparatus is shown in FIG.

原信号記憶部２０１：原信号を読み込み，バッファに書き込む。 Original signal storage unit 201: Reads an original signal and writes it to a buffer.

プレフィルタ処理部２０２：原信号記憶部２０１から原信号を読み込み，プレフィルタ処理を行い，プレフィルタ信号をプレフィルタ信号記憶部２０３に出力する。本処理装置については，後述の図９にて詳説する。 Pre-filter processing unit 202: Reads the original signal from the original signal storage unit 201, performs pre-filter processing, and outputs the pre-filter signal to the pre-filter signal storage unit 203. This processing apparatus will be described in detail later with reference to FIG.

符号化部２０４：プレフィルタ信号記憶部２０３からプレフィルタ信号を読み込み，そのプレフィルタ信号を入力として符号化処理を実施し，得られた符号化情報を符号化情報記憶部２０５に出力する。この符号化処理の一例としては，非特許文献５に示されるＪＭが挙げられる。 Encoding unit 204: reads a prefilter signal from the prefilter signal storage unit 203, performs an encoding process using the prefilter signal as an input, and outputs the obtained encoded information to the encoded information storage unit 205. An example of this encoding process is JM shown in Non-Patent Document 5.

図７に示すスケーラブル符号化装置における基本レイヤプレフィルタ処理部１０３，および図８に示す非スケーラブル符号化装置におけるプレフィルタ処理部２０２の詳細を，図９を用いて説明する。 Details of the base layer prefilter processing unit 103 in the scalable encoding device shown in FIG. 7 and the prefilter processing unit 202 in the non-scalable encoding device shown in FIG. 8 will be described with reference to FIG.

プレフィルタ適用フレーム指定情報記憶部３０１：外部から与えられるプレフィルタ適用フレーム指定情報を読み込み，レジスタに書き込む。プレフィルタ適用フレーム指定情報とは，上記のフィルタを適用するか否かを指定する情報である。 Prefilter application frame designation information storage unit 301: Reads prefilter application frame designation information given from the outside and writes it to a register. The pre-filter application frame designation information is information that designates whether or not to apply the above filter.

プレフィルタ適用フレーム判定部３０２：プレフィルタ適用フレーム指定情報記憶部３０１からプレフィルタ適用フレーム指定情報を読み込み，当該フレームがプレフィルタ適用フレームか否かの判定処理を行い，真の場合には低域通過フィルタ信号置換処理部３０４に移り，偽の場合にはプレフィルタ処理対象フレーム更新部３０６に移る。 Pre-filter application frame determination unit 302: Reads pre-filter application frame designation information from the pre-filter application frame designation information storage unit 301, determines whether or not the frame is a pre-filter application frame, The process proceeds to the pass filter signal replacement processing unit 304. If false, the process proceeds to the prefilter processing target frame update unit 306.

低域通過フィルタセット記憶部３０３：外部から与えられる低域通過フィルタセットを読み込み，レジスタに書き込む。低域通過フィルタセットとは，帯域通過幅が異なる複数の低域通過フィルタのセットである。 Low-pass filter set storage unit 303: Reads a low-pass filter set given from the outside and writes it to a register. The low pass filter set is a set of a plurality of low pass filters having different band pass widths.

低域通過フィルタ信号置換処理部３０４：当該フレームの原信号を読み込み，低域通過フィルタセット記憶部３０３から読み込まれる低域通過フィルタのフィルタ係数で構成される低域通過フィルタを用いて小領域ごとに置換処理を行い，得られた当該フレームのプレフィルタ信号をバッファに書き込み，プレフィルタ処理終了判定部３０５に移る。なお，基本レイヤプレフィルタ処理部１０３では，読み込まれる原信号は，基本レイヤの原信号である。本処理装置については，後述の図１１にて詳説する。 Low-pass filter signal replacement processing unit 304: Reads the original signal of the frame and uses a low-pass filter composed of filter coefficients of the low-pass filter read from the low-pass filter set storage unit 303 for each small region. The pre-filter signal of the obtained frame is written in the buffer, and the pre-filter processing end determination unit 305 moves. Note that in the base layer prefilter processing unit 103, the read original signal is the base layer original signal. This processing apparatus will be described in detail later with reference to FIG.

プレフィルタ処理終了判定部３０５：すべてのフレームに対してプレフィルタ処理を行ったか否かの判定処理を行い，真の場合にはプレフィルタ信号を出力し，偽の場合にはプレフィルタ処理対象フレーム更新部３０６に移る。なお，基本レイヤプレフィルタ処理部１０３では，出力されるプレフィルタ信号は，基本レイヤのプレフィルタ信号である。 Pre-filter processing end determination unit 305: Determines whether or not pre-filter processing has been performed on all frames, outputs a pre-filter signal if true, and pre-filter processing target frame if false Move to update unit 306. In the base layer prefilter processing unit 103, the output prefilter signal is a base layer prefilter signal.

プレフィルタ処理対象フレーム更新部３０６：次のフレームに処理対象を更新し，プレフィルタ適用フレーム判定部３０２に移る。 Prefilter processing target frame update unit 306: Updates the processing target to the next frame, and moves to the prefilter application frame determination unit 302.

図７に示すスケーラブル符号化装置における第１拡張レイヤプレフィルタ処理部１１１，第ｎ拡張レイヤプレフィルタ処理部１１９およびその他の拡張レイヤプレフィルタ処理部の詳細を，図１０を用いて説明する。 Details of the first enhancement layer prefilter processing unit 111, the nth enhancement layer prefilter processing unit 119, and other enhancement layer prefilter processing units in the scalable coding apparatus shown in FIG. 7 will be described with reference to FIG.

プレフィルタ適用フレーム指定情報記憶部４０１：外部から与えられるプレフィルタ適用フレーム指定情報を読み込み，レジスタに書き込む。 Prefilter application frame designation information storage unit 401: Reads prefilter application frame designation information given from the outside and writes it to the register.

プレフィルタ適用フレーム判定部４０２：プレフィルタ適用フレーム指定情報記憶部４０１からプレフィルタ適用フレーム指定情報を読み込み，当該フレームがプレフィルタ適用フレームか否かの判定処理を行い，真の場合には適応的置換処理部４０４に移り，偽の場合にはプレフィルタ処理対象フレーム更新部４０６に移る。 Pre-filter application frame determination unit 402: Reads pre-filter application frame designation information from the pre-filter application frame designation information storage unit 401, determines whether or not the frame is a pre-filter application frame, and is adaptive if true. The process proceeds to the replacement processing unit 404. If false, the process proceeds to the prefilter processing target frame updating unit 406.

低域通過フィルタセット記憶部４０３：外部から与えられる低域通過フィルタセットを読み込み，レジスタに書き込む。 Low-pass filter set storage unit 403: Reads a low-pass filter set given from the outside and writes it to a register.

適応的置換処理部４０４：当該フレームの原信号を読み込み，低域通過フィルタセット記憶部４０３から読み込まれる低域通過フィルタのフィルタ係数で構成される低域通過フィルタによるフィルタ信号や，直下レイヤからのテクスチャ予測信号で，小領域ごとに適応的に置換する処理を行い，得られた当該フレームのプレフィルタ信号をバッファに書き込み，プレフィルタ処理終了判定部４０５に移る。本処理装置については，後述の図１２にて詳説する。 Adaptive replacement processing unit 404: Reads the original signal of the frame, and receives the filter signal from the low pass filter composed of the filter coefficients of the low pass filter read from the low pass filter set storage unit 403, The texture prediction signal is subjected to adaptive replacement processing for each small region, the obtained prefilter signal of the frame is written in the buffer, and the prefilter processing end determination unit 405 moves. This processing apparatus will be described in detail later with reference to FIG.

プレフィルタ処理終了判定部４０５：当該拡張レイヤにおけるすべてのフレームに対してプレフィルタ処理を行ったか否かの判定処理を行い，真の場合には当該拡張レイヤのプレフィルタ信号を出力し，偽の場合にはプレフィルタ処理対象フレーム更新部４０６に移る。 Pre-filter processing end determination unit 405: Performs determination processing on whether or not pre-filter processing has been performed on all frames in the enhancement layer. If true, the pre-filter signal of the enhancement layer is output, and false In this case, the process proceeds to the prefilter processing target frame update unit 406.

プレフィルタ処理対象フレーム更新部４０６：次のフレームに処理対象を更新し，プレフィルタ適用フレーム判定部４０２に移る。 Prefilter processing target frame update unit 406: Updates the processing target to the next frame, and moves to the prefilter application frame determination unit 402.

図９に示す低域通過フィルタ信号置換処理部３０４の詳細を，図１１を用いて説明する。 Details of the low-pass filter signal replacement processing unit 304 shown in FIG. 9 will be described with reference to FIG.

小領域分割部５０１：当該フレームの原信号を読み込み，それを指定されたサイズの小領域に分割し，各小領域の原信号を小領域信号記憶部５０２に出力する。 Small region dividing unit 501: Reads the original signal of the frame, divides it into small regions of a specified size, and outputs the original signal of each small region to the small region signal storage unit 502.

適用低域通過フィルタ係数読み込み部５０３：当該小領域に対して適用する当該低域通過フィルタｆ_nのフィルタ係数を読み込み，その係数値をレジスタに書き込む。 Applied low-pass filter coefficient reading unit 503: Reads the filter coefficient of the low-pass filter f _n to be applied to the small area, and writes the coefficient value to the register.

低域通過フィルタ処理部５０４：小領域信号記憶部５０２から当該小領域の原信号を読み込み，また，レジスタから当該低域通過フィルタｆ_nのフィルタ係数を読み込み，そのフィルタ係数によって構成される低域通過フィルタを当該小領域の原信号に対して施し，得られた当該小領域の低域通過フィルタ信号を低域通過フィルタ信号記憶部５０５に出力する。 Low-pass filter processing unit 504: Reads the original signal of the small region from the small region signal storage unit 502, reads the filter coefficient of the low-pass filter f _n from the register, and configures the low band constituted by the filter coefficient The pass filter is applied to the original signal of the small region, and the obtained low pass filter signal of the small region is output to the low pass filter signal storage unit 505.

視距離パラメータ記憶部５０６：外部から与えられる視距離パラメータを読み込み，レジスタに書き込む。 Viewing distance parameter storage unit 506: Reads a viewing distance parameter given from the outside and writes it to a register.

視覚的歪み量算出部５０７：小領域信号記憶部５０２から当該小領域の原信号を読み込み，また，低域通過フィルタ信号記憶部５０５から低域通過フィルタ信号を読み込み，それらの差分信号を生成し，その差分信号に対してフーリエ変換を適用し，そのフーリエ変換の各周波数成分に対してコントラスト感度関数を用いて重み付けを行い，その重み付けされた各周波数成分の総和をとることにより両信号の間の視覚的歪み量Ｄ_nを算出し，その値を視覚的歪み量記憶部５０８に出力する。 Visual distortion amount calculation unit 507: Reads the original signal of the small region from the small region signal storage unit 502, reads the low-pass filter signal from the low-pass filter signal storage unit 505, and generates a difference signal thereof. The Fourier transform is applied to the difference signal, the frequency components of the Fourier transform are weighted using the contrast sensitivity function, and the sum of the weighted frequency components is taken to obtain a difference between the two signals. The visual distortion amount D _n is calculated and the value is output to the visual distortion amount storage unit 508.

視覚的歪み量閾値記憶部５０９：外部から与えられる視覚的歪み量閾値Ｄ_thを読み込み，レジスタに書き込む。 Visual distortion amount threshold storage unit 509: Reads a visual distortion amount threshold value _Dth given from the outside, and writes it in a register.

視覚的歪み量比較部５１０：視覚的歪み量記憶部５０８から当該小領域における当該低域通過フィルタｆ_nを使用して算出された視覚的歪み量Ｄ_nを読み込み，視覚的歪み量閾値記憶部５０９から視覚的歪み量閾値Ｄ_thを読み込み，Ｄ_n≦Ｄ_thの場合には低域通過フィルタ信号置換処理部５１１に移り，Ｄ_n＞Ｄ_thの場合には適用低域通過フィルタセット終了判定部５１２に移る。 Visual distortion amount comparison unit 510: The visual distortion amount D _n calculated by using the low-pass filter f _n in the small region is read from the visual distortion amount storage unit 508, and the visual distortion amount threshold storage unit The visual distortion amount threshold value D _th is read from 509. If D _n ≦ D _th , the process proceeds to the low-pass filter signal replacement processing unit 511, and if D _n > D _th , the end of the applied low-pass filter set is determined. Move to section 512.

低域通過フィルタ信号置換処理部５１１：当該小領域の当該低域通過フィルタｆ_nによる低域通過フィルタ信号を読み込み，当該小領域の原信号と置き換え，プレフィルタ信号としてバッファに書き込み，置換処理終了判定部５１４に移る。 Low-pass filter signal replacement processing unit 511: Reads a low-pass filter signal by the low-pass filter f _n of the small region, replaces it with the original signal of the small region, writes it to the buffer as a pre-filter signal, and completes the replacement process Move to determination unit 514.

適用低域通過フィルタセット終了判定部５１２：当該小領域に対してすでにすべての低域通過フィルタを適用したか否かの判定処理を行い，真の場合には置換処理終了判定部５１４に移り，偽の場合には適用低域通過フィルタ係数更新部５１３に移る。 Applicable low-pass filter set end determination unit 512: Determines whether all low-pass filters have already been applied to the small region, and if true, moves to a replacement processing end determination unit 514. In the case of false, the process proceeds to the applicable low-pass filter coefficient update unit 513.

適用低域通過フィルタ係数更新部５１３：次の低域通過フィルタに適用対象を移し，適用低域通過フィルタ係数読み込み部５０３に移る。 Applied low-pass filter coefficient updating unit 513: The application target is transferred to the next low-pass filter, and the applied low-pass filter coefficient reading unit 503 is moved.

置換処理終了判定部５１４：当該フレームにおけるすべての小領域に対して置換判定処理を実行したか否かの判定処理を行い，真の場合にはプレフィルタ信号を出力し，偽の場合には処理対象小領域更新部５１５に移る。 Replacement processing end determination unit 514: Performs determination processing on whether or not replacement determination processing has been executed for all the small regions in the frame, outputs a prefilter signal if true, and processing if false The process proceeds to the target small area update unit 515.

処理対象小領域更新部５１５：次の小領域に処理対象を移し，低域通過フィルタ処理部５０４に移る。 Processing target small area update unit 515: Moves the processing target to the next small area and moves to the low-pass filter processing unit 504.

図１０に示す適応的置換処理部４０４の詳細を，図１２を用いて説明する。 Details of the adaptive replacement processing unit 404 shown in FIG. 10 will be described with reference to FIG.

小領域分割部６０１：当該フレームの原信号を読み込み，それを指定されたサイズの小領域に分割し，各小領域の原信号を小領域信号記憶部６０２に出力する。 Small region dividing unit 601: Reads the original signal of the frame, divides it into small regions of a specified size, and outputs the original signal of each small region to the small region signal storage unit 602.

テクスチャ予測信号読み込み部６０３：直下レイヤからのテクスチャ予測信号を読み込み，バッファに書き込む。 Texture prediction signal reading unit 603: Reads a texture prediction signal from the immediately lower layer and writes it to the buffer.

テクスチャ予測信号視覚的歪み量算出部６０４：小領域信号記憶部６０２から当該小領域の原信号を読み込み，また，バッファから直下レイヤのテクスチャ予測信号を読み込み，それらの差分信号を生成し，その差分信号に対してフーリエ変換を適用し，そのフーリエ変換の各周波数成分に対してコントラスト感度関数を用いて重み付けを行い，その重み付けされた各周波数成分の総和をとることにより両信号の間の視覚的歪み量Ｄ₀を算出し，その値をテクスチャ予測信号視覚的歪み量記憶部６０６に出力する。 Texture prediction signal visual distortion amount calculation unit 604: Reads the original signal of the small region from the small region signal storage unit 602, reads the texture prediction signal of the immediately lower layer from the buffer, generates a difference signal thereof, Applying a Fourier transform to the signal, weighting each frequency component of the Fourier transform using the contrast sensitivity function, and taking the sum of the weighted frequency components gives a visual The distortion amount D ₀ is calculated, and the value is output to the texture prediction signal visual distortion amount storage unit 606.

視距離パラメータ記憶部６０５：外部から与えられる視距離パラメータを読み込み，レジスタに書き込む。 Viewing distance parameter storage unit 605: Reads a viewing distance parameter given from the outside and writes it to a register.

視覚的歪み量閾値記憶部６０７：外部から与えられる視覚的歪み量閾値Ｄ_thを読み込み，レジスタに書き込む。 Visual distortion amount threshold storage unit 607: Reads a visual distortion amount threshold value _Dth given from the outside, and writes it into a register.

テクスチャ予測信号視覚的歪み量比較部６０８：テクスチャ予測信号視覚的歪み量記憶部６０６から直下レイヤのテクスチャ予測信号を使用して算出された視覚的歪み量Ｄ₀を読み込み，視覚的歪み量閾値記憶部６０７から視覚的歪み量閾値Ｄ_thを読み込み，Ｄ_n≦Ｄ_thの場合にはテクスチャ予測信号置換処理部６１８に移り，Ｄ_n＞Ｄ_thの場合には低域通過フィルタ処理部６１０に移る。 Texture prediction signal visual distortion quantity comparing unit 608: reads the visual distortion amount D ₀ calculated using a texture prediction signal immediately-lower layer from the texture prediction signal visual distortion amount storage unit 606, visual distortion amount threshold storage The visual distortion amount threshold value D _th is read from the unit 607. If D _n ≦ D _th , the process proceeds to the texture prediction signal replacement processing unit 618. If D _n > D _th , the process proceeds to the low-pass filter processing unit 610. .

適用低域通過フィルタ係数読み込み部６０９：当該小領域に対して適用する当該低域通過フィルタｆ_nのフィルタ係数を読み込み，その係数値をレジスタに書き込む。 Applied low-pass filter coefficient reading unit 609: Reads the filter coefficient of the low-pass filter f _n to be applied to the small area, and writes the coefficient value to the register.

低域通過フィルタ処理部６１０：当該低域通過フィルタｆ_nのフィルタ係数をレジスタから読み込み，そのフィルタ係数によって構成される低域通過フィルタを当該小領域の原信号に対して施し，得られた当該小領域の低域通過フィルタ信号を低域通過フィルタ信号記憶部６１１に出力する。 Low-pass filter processing unit 610: Reads a filter coefficient of the low-pass filter f _n from the register, applies a low-pass filter constituted by the filter coefficient to the original signal of the small area, and obtains the obtained The small-pass low-pass filter signal is output to the low-pass filter signal storage unit 611.

低域通過フィルタ信号視覚的歪み量算出部６１２：小領域信号記憶部６０２から当該小領域の原信号を読み込み，また，低域通過フィルタ信号記憶部６１１から低域通過フィルタ信号を読み込み，それらの差分信号を生成し，その差分信号に対してフーリエ変換を適用し，そのフーリエ変換の各周波数成分に対してコントラスト感度関数を用いて重み付けを行い，その重み付けされた各周波数成分の総和をとることにより両信号の間の視覚的歪み量Ｄ_nを算出し，その値を低域通過フィルタ信号視覚的歪み量記憶部６１３に出力する。 Low-pass filter signal visual distortion amount calculation unit 612: Reads the original signal of the small region from the small-region signal storage unit 602, reads the low-pass filter signal from the low-pass filter signal storage unit 611, Generate a difference signal, apply a Fourier transform to the difference signal, weight each frequency component of the Fourier transform using a contrast sensitivity function, and take the sum of the weighted frequency components To calculate the visual distortion amount D _n between the two signals, and output the value to the low-pass filter signal visual distortion storage unit 613.

低域通過フィルタ信号視覚的歪み量比較部６１４：低域通過フィルタ信号視覚的歪み量記憶部６１３から当該小領域における当該低域通過フィルタｆ_nを使用して算出された視覚的歪み量Ｄ_nを読み込み，視覚的歪み量閾値記憶部６０７から視覚的歪み量閾値Ｄ_thを読み込み，Ｄ_n≦Ｄ_thの場合には低域通過フィルタ信号置換処理部６１５に移り，Ｄ_n＞Ｄ_thの場合には適用低域通過フィルタセット終了判定部６１６に移る。 Low-pass filter signal visual distortion amount comparison unit 614: Visual distortion amount D _n calculated from the low-pass filter signal visual distortion amount storage unit 613 using the low-pass filter f _n in the small region. And the visual distortion amount threshold value D _th is read from the visual distortion amount threshold value storage unit 607. If D _n ≦ D _th , the process proceeds to the low-pass filter signal replacement processing unit 615, and if D _n > D _th Then, the process proceeds to the applied low-pass filter set end determination unit 616.

低域通過フィルタ信号置換処理部６１５：当該小領域の当該低域通過フィルタｆ_nによる低域通過フィルタ信号を読み込み，当該小領域の原信号と置き換え，プレフィルタ信号としてバッファに書き込み，置換処理終了判定部６１９に移る。 Low-pass filter signal replacement processing unit 615: Reads a low-pass filter signal by the low-pass filter f _n of the small region, replaces it with the original signal of the small region, writes it to the buffer as a pre-filter signal, and completes the replacement process Move to determination unit 619.

適用低域通過フィルタセット終了判定部６１６：当該小領域に対してすでにすべての低域通過フィルタを適用したか否かの判定処理を行い，真の場合には置換処理終了判定部６１９に移り，偽の場合には適用低域通過フィルタ係数更新部６１７に移る。 Applicable low-pass filter set end determination unit 616: Determines whether all the low-pass filters have already been applied to the small region, and if true, moves to a replacement processing end determination unit 619, In the case of false, the process proceeds to the applicable low-pass filter coefficient update unit 617.

適用低域通過フィルタ係数更新部６１７：次の低域通過フィルタに適用対象を移し，適用低域通過フィルタ係数読み込み部６０９に移る。 Applied low-pass filter coefficient updating unit 617: The application target is transferred to the next low-pass filter, and the applied low-pass filter coefficient reading unit 609 is moved.

テクスチャ予測信号置換処理部６１８：当該小領域における直下レイヤのテクスチャ予測信号を読み込み，当該小領域の原信号と置き換え，プレフィルタ信号としてバッファに書き込み，置換処理終了判定部６１９に移る。 Texture prediction signal replacement processing unit 618: Reads the texture prediction signal of the immediately lower layer in the small region, replaces it with the original signal of the small region, writes it into the buffer as a prefilter signal, and moves to the replacement processing end determination unit 619.

置換処理終了判定部６１９：当該フレームにおけるすべての小領域に対して置換判定処理を実行したか否かの判定処理を行い，真の場合には適応的に置換されたプレフィルタ信号を出力し，偽の場合には処理対象小領域更新部６２０に移る。 Replacement processing end determination unit 619: Performs determination processing as to whether or not replacement determination processing has been executed for all the small regions in the frame, and outputs an adaptively replaced prefilter signal if true, If false, the process proceeds to the processing target small area update unit 620.

処理対象小領域更新部６２０：次の小領域に処理対象を移し，テクスチャ予測信号視覚的歪み量算出部６０４に移る。 Processing target small region update unit 620: Moves the processing target to the next small region, and moves to the texture prediction signal visual distortion amount calculation unit 604.

［実験］
非特許文献２に記載されているＪＳＶＣ参照エンコーダＪＳＶＭ８．０を用いて，本発明によるプレフィルタの性能を検証するための実験を行った。映像はＪＳＶＣ標準映像のＣｉｔｙ，Ｃｒｅｗ，Ｈａｒｂｏｕｒ，Ｓｏｃｃｅｒを用いた。下位レイヤがＱＣＩＦ，上位レイヤがＣＩＦとし，両レイヤで同じ量子化パラメータＱＰを用いた。ピクチャ構造はＩＰＰＰとし，３００枚符号化した。また，本発明によるプレフィルタに用いる視距離は３ｈ，閾値は１とし，低域通過フィルタを５パターン用意した。 [Experiment]
An experiment for verifying the performance of the prefilter according to the present invention was performed using the JSVC reference encoder JSVM8.0 described in Non-Patent Document 2. For the video, City, Crew, Harbor, and Soccer of JSVC standard video were used. The lower layer is QCIF, the upper layer is CIF, and the same quantization parameter QP is used in both layers. The picture structure is IPPP and 300 pictures are encoded. The viewing distance used for the pre-filter according to the present invention was 3h, the threshold was 1, and five low-pass filters were prepared.

プレフィルタのｏｎ／ｏｆｆの場合の符号量の削減結果を，表１に示す。４．５％〜２４％（平均１１．９％）の高い符号量削減が実現されており，高レートになるほど効果が高くなる傾向が見られる。復号信号の主観画質は，プレフィルタのｏｎ／ｏｆｆ両者において差がないことを確認している。高レートほど，符号量増加を招く非ゼロの量子化レベルが高周波に生じやすいが，プレフィルタによりそのレベルがゼロにされるため，低レートよりも相対的に効果が高くなる。 Table 1 shows the result of code amount reduction when the prefilter is on / off. A high code amount reduction of 4.5% to 24% (average of 11.9%) has been realized, and the effect tends to increase as the rate increases. It has been confirmed that there is no difference in the subjective image quality of the decoded signal between on / off of the prefilter. The higher the rate, the more likely a non-zero quantization level that causes an increase in the amount of code occurs in the high frequency, but since the level is made zero by the prefilter, the effect is relatively higher than the low rate.

また，ＣｒｅｗやＳｏｃｃｅｒといった高周波成分が小さい映像ほど削減効果が大きい。含まれる高周波成分が小さいと，原信号との視覚的歪み量は小さくなる。したがって，帯域制限が強いフィルタによる信号や補間信号で置換されるようになり，削減効果が大きくなると考えられる。各映像に対してプレフィルタによるマクロブロックの置換率についても計測したが，確かに高周波成分が大きい映像ほど帯域制限の強いフィルタが多く使用されることが確認できた。 In addition, the smaller the high frequency components such as Crew and Soccer, the greater the reduction effect. When the high-frequency component included is small, the amount of visual distortion with the original signal is small. Therefore, it is considered that the signal is replaced with a signal by a filter having a strong band limitation or an interpolation signal, and the reduction effect is increased. We also measured the macroblock replacement rate by prefilter for each video, but it was confirmed that a video with a larger high-frequency component used more filters with higher bandwidth restrictions.

以上の動画像符号化の処理は，コンピュータとソフトウェアプログラムとによっても実現することができ，そのプログラムをコンピュータ読み取り可能な記録媒体に記録して提供することも，ネットワークを通して提供することも可能である。 The above moving image encoding processing can be realized by a computer and a software program, and the program can be provided by being recorded on a computer-readable recording medium or provided through a network. .

本実施の形態によるスケーラブル符号化処理の流れを示す図である。It is a figure which shows the flow of the scalable encoding process by this Embodiment. 関連技術による非スケーラブル符号化処理の流れを示す図である。It is a figure which shows the flow of the non-scalable encoding process by related technology . 本実施の形態によるプレフィルタ処理の流れを示す図である。It is a figure which shows the flow of the pre filter process by this Embodiment. 本実施の形態による拡張レイヤ向けのプレフィルタ処理の流れを示す図である。It is a figure which shows the flow of the pre filter process for the enhancement layers by this Embodiment. 本実施の形態による低域通過フィルタ信号による置換処理の流れを示す図である。It is a figure which shows the flow of the replacement process by the low-pass filter signal by this Embodiment. 本実施の形態による適応的置換処理の流れを示す図である。It is a figure which shows the flow of the adaptive replacement process by this Embodiment. 本実施の形態によるスケーラブル符号化装置の構成例を示す図である。It is a figure which shows the structural example of the scalable coding apparatus by this Embodiment. 関連技術による非スケーラブル符号化装置の構成例を示す図である。It is a figure which shows the structural example of the non-scalable encoding apparatus by related technology . 基本レイヤプレフィルタ処理部およびプレフィルタ処理部の装置構成例を示す図である。It is a figure which shows the apparatus structural example of a base layer pre filter process part and a pre filter process part. 拡張レイヤプレフィルタ処理部の装置構成例を示す図である。It is a figure which shows the apparatus structural example of an extended layer prefilter process part. 低域通過フィルタ信号置換処理部の装置構成例を示す図である。It is a figure which shows the apparatus structural example of a low-pass filter signal substitution process part. 適応的置換処理部の装置構成例を示す図である。It is a figure which shows the apparatus structural example of an adaptive replacement process part.

Explanation of symbols

１０１原信号縮小処理部
１０２基本レイヤ原信号記憶部
１０３基本レイヤプレフィルタ処理部
１０４基本レイヤプレフィルタ信号記憶部
１０５基本レイヤ符号化部
１０６基本レイヤ符号化情報記憶部
１０７基本レイヤ内部復号部
１０８基本レイヤテクスチャ予測信号生成部
１０９基本レイヤテクスチャ予測信号記憶部
１１０第１拡張レイヤ原信号記憶部
１１１第１拡張レイヤプレフィルタ処理部
１１２第１拡張レイヤプレフィルタ信号記憶部
１１３第１拡張レイヤ符号化部
１１４第１拡張レイヤ符号化情報記憶部
１１５第１拡張レイヤ内部復号部
１１６第１拡張レイヤテクスチャ予測信号生成部
１１７第１拡張レイヤテクスチャ予測信号記憶部
１１８第ｎ拡張レイヤ原信号記憶部
１１９第ｎ拡張レイヤプレフィルタ処理部
１２０第ｎ拡張レイヤプレフィルタ信号記憶部
１２１第ｎ拡張レイヤ符号化部
１２２第ｎ拡張レイヤ符号化情報記憶部
１２３符号化情報多重化部
２０１原信号記憶部
２０２プレフィルタ処理部
２０３プレフィルタ信号記憶部
２０４符号化部
２０５符号化情報記憶部 DESCRIPTION OF SYMBOLS 101 Original signal reduction process part 102 Base layer original signal storage part 103 Base layer pre-filter process part 104 Base layer pre-filter signal storage part 105 Base layer encoding part 106 Base layer encoding information storage part 107 Base layer internal decoding part 108 Basic Layer texture prediction signal generation unit 109 Base layer texture prediction signal storage unit 110 First enhancement layer original signal storage unit 111 First enhancement layer prefilter processing unit 112 First enhancement layer prefilter signal storage unit 113 First enhancement layer encoding unit 114 1st enhancement layer coding information storage part 115 1st enhancement layer inner decoding part 116 1st enhancement layer texture prediction signal generation part 117 1st enhancement layer texture prediction signal storage part 118 nth enhancement layer original signal storage part 119 nth Enhanced layer prefilter Processing unit 120 nth enhancement layer prefilter signal storage unit 121 nth enhancement layer encoding unit 122 nth enhancement layer encoded information storage unit 123 encoded information multiplexing unit 201 original signal storage unit 202 prefilter processing unit 203 prefilter Signal storage unit 204 Coding unit 205 Coding information storage unit

Claims

In a scalable video coding method that forms a layer structure,
A small area dividing process for dividing the original image signal in the enhancement layer to be encoded into small areas;
A process for calculating the visual distortion amount of the immediately lower layer decoded upsample signal for calculating the visual distortion amount of the original image signal and the image signal obtained by upsampling the decoded image signal of the immediately lower layer for each divided small area;
It is determined whether or not the calculated visual distortion amount is equal to or less than a predetermined threshold value. A sub-region image signal direct layer decoding up-sample signal replacement process for performing a low-pass filter process described later;
A low-pass filter processing process in which the filter processing for the small region is performed from a low-pass filter having a small band pass width among low-pass filter sets having different band pass widths prepared in advance;
A low-pass filter signal visual distortion amount calculation process for calculating a visual distortion amount between the derived filter signal and the original image signal;
It is determined whether or not the calculated visual distortion amount is equal to or less than a predetermined threshold value. If the calculated visual distortion amount is equal to or less than the threshold value, the original image signal is replaced with the filter signal. Determines whether all low-pass filter sets have been applied. If all are applied, the original image signal is output as it is. If not, the band-pass width of the low-pass filter applied to the small region is applied. Is updated to the one with the smallest bandpass width next to the bandpass width, and the low-pass filter process for re-executing the low-pass filter process,
An image signal that is finally calculated by performing processing from the immediately lower layer decoded upsample signal visual distortion amount calculation process to the small area image signal low-pass filter signal replacement process for all the small areas And a process of inputting the signal as an encoded input signal to the enhancement layer to the encoder and performing the encoding.

The scalable video encoding method according to claim 1 ,
Only for a specific frame image or its image partial area designated from the outside, or for a specific frame image or its image partial area defined by a predetermined criterion, for all small areas in the area The processing from the immediately lower layer decoded upsample signal visual distortion amount calculation process to the small area image signal low-pass filter signal replacement process is executed, and for the small area other than the specific frame image or its image partial area, The scalable moving picture coding method, wherein the process from the immediately lower layer decoded upsample signal visual distortion amount calculation process to the small area image signal low-pass filter signal replacement process is not executed.

In a scalable video encoding device that constitutes a layer structure,
A small area division processing means for dividing the original image signal in the enhancement layer to be encoded into small areas;
A direct layer decoded upsample signal visual distortion amount calculation processing means for calculating a visual distortion amount between an original image signal and an image signal obtained by upsampling a decoded image signal of a direct layer for each of the divided small regions;
It is determined whether or not the calculated visual distortion amount is equal to or less than a predetermined threshold value. A small region image signal direct layer decoding up-sample signal replacement processing means for executing low pass filter processing means described later;
Low-pass filter processing means for performing filter processing on the small region from a low-pass filter having a small bandpass width among low-pass filter sets having different bandpass widths prepared in advance;
A low-pass filter signal visual distortion amount calculation processing means for calculating a visual distortion amount between the derived filter signal and the original image signal;
It is determined whether or not the calculated visual distortion amount is equal to or less than a predetermined threshold value. If the calculated visual distortion amount is equal to or less than the threshold value, the original image signal is replaced with the filter signal. Determines whether all low-pass filter sets have been applied. If all are applied, the original image signal is output as it is. If not, the band-pass width of the low-pass filter applied to the small region is applied. A small region image signal low-pass filter signal replacement processing means for renewing the band-pass width next to the band-pass width and re-executing the filter processing by the low-pass filter processing means;
An image signal finally calculated by executing the processing from the immediately lower layer decoded upsample signal visual distortion amount calculation processing means to the small area image signal low-pass filter signal replacement processing means for all the small areas. And a means for performing encoding by inputting the signal as an encoded input signal to the enhancement layer to the encoder.

Scalable video encoding program for executing a scalable video encoding method according to claim 1 or claim 2 wherein the computer.

A computer-readable recording medium on which a scalable video encoding program for causing a computer to execute the scalable video encoding method according to claim 1 or 2 is recorded.