JP2008141436A

JP2008141436A - Moving image encoding method, device and program

Info

Publication number: JP2008141436A
Application number: JP2006324996A
Authority: JP
Inventors: Naoto Date; 直人伊達; Tomoo Yamakage; 朋夫山影
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2006-11-30
Filing date: 2006-11-30
Publication date: 2008-06-19

Abstract

<P>PROBLEM TO BE SOLVED: To avoid the distortion of a vertical boundary of a block seen when adding a weak film grain image in film grain technology. <P>SOLUTION: This moving image encoding device comprises: a noise reducing part 11 for performing noise elimination processing of an input image of an encoding target including a noise image to generate a first main image; an estimating part 12 for estimating the noise image from the input image and the first main image to generate model information; a generating part 13 for generating a block vertical boundary signal being an image component to be damaged by smoothing filter processing to a boundary between respective noise image blocks of a film grain image with reference to the model information; an adding part 14 for adding the first main image and the block vertical boundary signal to generate a second main image; an encoding part 15 for encoding the second main image to generate encoded data; and a multiplexing part 16 for multiplexing the encoded data with the model information and outputting an encoded stream. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、フィルムグレインを良好に再現可能な動画像符号化方法、装置及びプログラムに関する。 The present invention relates to a moving image encoding method, apparatus, and program capable of satisfactorily reproducing film grain.

デジタルバーサタイルディスク（ＤＶＤ）のコンテンツ用途等を目的として映画フィルム映像を符号化する場合、映画本来の雰囲気を損なってしまうという問題がある。これは、符号化に伴い映像中のフィルムグレイン（フィルムの粒子感）が失われるためである。フィルムグレインはハリウッドの魂とも呼ばれており、ＤＶＤオーサリングスタジオにおいて非常に重要視されている。 When a movie film image is encoded for the purpose of digital versatile disc (DVD) content use, there is a problem that the original atmosphere of the movie is impaired. This is because film grain in the video (film grain feeling) is lost with encoding. Film grain is also called the soul of Hollywood and is very important in DVD authoring studios.

フィルムグレインをうまく再現するための方法として、特許文献１に開示されたフィルムグレインシミュレーション方法があげられる。代表例に、動画像符号化の国際標準規格の一つであるＨ．２６４におけるフィルムグレインテクノロジー（Film Grain Technology：ＦＧＴ）がある。 As a method for successfully reproducing the film grain, there is a film grain simulation method disclosed in Patent Document 1. As a representative example, H.I. There is Film Grain Technology (FGT) in H.264.

ＦＧＴにおける符号化処理では、まず動画像である入力画像に対してフィルムグレイン除去処理を施すことにより、フィルムグレインを除去したメイン画像を生成する。生成されたメイン画像に対して、符号化部により動き補償、直交変換及び量子化を用いた圧縮符号化の処理を行う。一方、ノイズ推定部により入力画像とメイン画像との差分からフィルムグレイン画像を生成し、フィルムグレインを再現するための情報（引用するフィルムグレイン画像データベースを識別するための情報、フィルムグレインの強度等）を推定してモデル情報を生成する。 In the encoding process in FGT, first, a main image from which film grain is removed is generated by performing film grain removal processing on an input image that is a moving image. The generated main image is subjected to compression coding processing using motion compensation, orthogonal transformation, and quantization by the coding unit. On the other hand, information for generating a film grain image from the difference between the input image and the main image by the noise estimation unit and reproducing the film grain (information for identifying the cited film grain image database, film grain strength, etc.) Is used to generate model information.

次に、ＦＧＴにおける復号処理では、復号部によって復号画像としてフィルムグレインが除去されているメイン画像を生成し、ノイズ生成部によって符号化部から送られてきたモデル情報を用いてメイン画像からノイズ画像、すなわちフィルムグレイン画像を生成する。生成されたメイン画像とフィルムグレイン画像を足し合わせることで、フィルムグレインが再現された出力画像を得る。 Next, in the decoding process in FGT, a main image from which film grain is removed as a decoded image is generated by a decoding unit, and a noise image is generated from the main image using model information transmitted from the encoding unit by a noise generation unit. That is, a film grain image is generated. By adding the generated main image and the film grain image, an output image in which the film grain is reproduced is obtained.

このようなＦＧＴの利点は、フィルムグレインを分離してモデル化するため、符号化処理によってフィルムグレインが失われることがなく、映画本来の雰囲気を再現できることである。
米国特許出願公開第２００６／００８２６４９号明細書 The advantage of such an FGT is that the film grain is separated and modeled, so that the film grain is not lost by the encoding process and the original atmosphere of the movie can be reproduced.
US Patent Application Publication No. 2006/0082649

発明者らの検討によれば、ＦＧＴにおいて復号画像に対して微弱なフィルムグレイン画像を足し合わせると、すなわちフィルムグレインを再現するための情報のうちフィルムグレインの強度を示す情報の値が小さいときに、出力画像のブロック縦境界に歪みが見えるという現象が生じることが確認された。これは出力画像の品質を大きく損ねるため、問題である。これはＦＧＴのフィルムグレイン画像生成処理において、フィルムグレイン画像の継ぎ目を目立たなくする目的で、ブロックの縦境界（vertical edge）に対して、デブロッキングフィルタと呼ばれる平滑フィルタ処理を施すことが原因と思われる。 According to the studies by the inventors, when a weak film grain image is added to the decoded image in the FGT, that is, when the value of information indicating the strength of the film grain is small among the information for reproducing the film grain. It was confirmed that the phenomenon of distortion appearing at the block vertical boundary of the output image occurred. This is a problem because it greatly deteriorates the quality of the output image. This is considered to be caused by applying a smoothing filter process called deblocking filter to the vertical edges of blocks in order to make the seams of film grain images inconspicuous in FGT film grain image generation processing. It is.

本発明は、微弱なフィルムグレイン画像を加算する際に見られるブロックの縦境界の歪みを回避することを目的とする。 An object of the present invention is to avoid the distortion of the vertical boundary of a block seen when adding a weak film grain image.

本発明の一態様によると、ノイズ画像を含む符号化対象の入力画像に対しノイズ除去処理を施して第１メイン画像を生成するステップと、前記入力画像及び第１メイン画像から前記ノイズ画像を推定して前記ノイズ画像に対応するモデル情報を生成するステップと、前記モデル情報を参照して、フィルムグレイン再現のためのフィルムグレイン画像の各ノイズ画像ブロック間の境界に対する平滑フィルタ処理によって損なわれる画像成分を生成するステップと、前記第１メイン画像と前記画像成分を加算して第２メイン画像を生成するステップと、前記第２メイン画像を符号化して符号化データを生成するステップと、
前記符号化データに前記モデル情報を多重化して符号化ストリームを出力するステップと、を具備する動画像符号化方法が提供される。 According to an aspect of the present invention, a step of generating a first main image by performing noise removal processing on an input image to be encoded including a noise image, and estimating the noise image from the input image and the first main image Generating model information corresponding to the noise image, and referring to the model information, an image component that is impaired by smooth filter processing on a boundary between each noise image block of the film grain image for film grain reproduction Generating a second main image by adding the first main image and the image component, encoding the second main image to generate encoded data,
And a step of multiplexing the model information with the encoded data and outputting an encoded stream.

本発明のさらに具体的な態様によると、第１ノイズ画像を含む符号化対象の入力画像に対しノイズ除去処理を施して第１メイン画像を生成するステップと、前記入力画像及び第１メイン画像から前記第１ノイズ画像を推定して前記第１ノイズ画像に対応するデータベースＩＤ及び強度変調係数を含むモデル情報を生成するステップと、前記モデル情報を参照して前記第１メイン画像中の複数の画素を含む画像ブロック毎に該画像ブロックの画素平均値に関連づけられたデータベースＩＤに従って、ノイズ画像ブロックを格納したノイズデータベースを選択するステップと、前記ノイズデータベースからノイズ画像ブロックを切り出すステップと、前記モデル情報を参照して前記画素平均値に関連づけられた強度変調係数を選択するステップと、前記強度変調係数によって前記ノイズ画像ブロックを変調して変調ノイズ画像を生成するステップと、前記変調ノイズ画像の各ノイズ画像ブロック間の境界に平滑フィルタ処理を施して平滑化ノイズ画像を生成するステップと、前記変調ノイズ画像から前記平滑化ノイズ画像を差し引いて差分画像を生成するステップと、前記第１メイン画像と前記差分画像を加算して第２メイン画像を生成するステップと、前記第２メイン画像を符号化して符号化データを生成するステップと、前記符号化データに前記モデル情報を多重化して符号化ストリームを出力するステップと、を具備する動画像符号化方法が提供される。 According to a more specific aspect of the present invention, a step of generating a first main image by performing a noise removal process on an input image to be encoded including a first noise image, and from the input image and the first main image Estimating the first noise image to generate model information including a database ID and an intensity modulation coefficient corresponding to the first noise image; and a plurality of pixels in the first main image with reference to the model information Selecting a noise database storing a noise image block according to a database ID associated with the pixel average value of the image block for each image block including: cutting out a noise image block from the noise database; and the model information Selecting an intensity modulation coefficient associated with the pixel average value with reference to Modulating the noise image block by the intensity modulation coefficient to generate a modulated noise image; and applying a smoothing filter process to a boundary between the noise image blocks of the modulated noise image to generate a smoothed noise image; Subtracting the smoothed noise image from the modulated noise image to generate a difference image; adding the first main image and the difference image to generate a second main image; and the second main image A moving picture encoding method is provided, which includes the steps of: generating encoded data by encoding the model information; and outputting the encoded stream by multiplexing the model information with the encoded data.

ここで、前記第２メイン画像を生成するステップは、前記差分画像を該差分画像に係数を乗じた後、前記第１メイン画像に加算するようにしてもよい。 Here, the step of generating the second main image may include adding the difference image to the first main image after multiplying the difference image by a coefficient.

また、前記第２メイン画像を生成するステップは、前記差分画像を該差分画像にオフセットを足し合わせた後、前記第１メイン画像に加算するようにしてもよい。 In the step of generating the second main image, the difference image may be added to the first main image after adding the offset to the difference image.

本発明の他の態様によると、ノイズ画像を含む符号化対象の入力画像に対しノイズ除去処理を施して第１メイン画像を生成するステップと、前記入力画像及び第１メイン画像から前記ノイズ画像を推定して前記ノイズ画像に対応する、複数のパラメータを有するモデル情報を生成するステップと、フィルムグレイン再現のためのフィルムグレイン画像の各ノイズ画像ブロック間の境界に対する平滑フィルタ処理によって周期パターンが発生する特定パラメータ範囲を記憶するステップと、前記パラメータの値が前記特定パラメータ範囲内にあるか否かを判別するステップと、前記判別するステップにより前記パラメータの値が前記特定パラメータ範囲内にあると判別された場合に、前記パラメータの値を前記特定パラメータ範囲外の値に修正するステップと、前記メイン画像を符号化して符号化データを生成するステップと、前記符号化データに前記修正するステップを経たモデル情報を多重化して符号化ストリームを出力するステップと、を具備する動画像符号化方法が提供される。 According to another aspect of the present invention, a step of generating a first main image by performing a noise removal process on an input image to be encoded including a noise image, and the noise image from the input image and the first main image. A periodic pattern is generated by generating model information having a plurality of parameters corresponding to the noise image by estimation and smoothing filter processing on a boundary between each noise image block of the film grain image for film grain reproduction. The step of storing the specific parameter range, the step of determining whether or not the value of the parameter is within the specific parameter range, and the step of determining determine that the value of the parameter is within the specific parameter range. The parameter value is corrected to a value outside the specified parameter range. And a step of generating encoded data by encoding the main image, and a step of multiplexing the model information that has undergone the correcting step and outputting an encoded stream to the encoded data. An image encoding method is provided.

さらに、本発明の別の態様によると、上述した動画像符号化処理をハードウェアにより実現する動画像符号化装置あるいは上述した動画像符号化処理をコンピュータに行わせるためのプログラムが提供される。 Furthermore, according to another aspect of the present invention, there is provided a moving image encoding apparatus that implements the above-described moving image encoding processing by hardware, or a program for causing a computer to perform the above-described moving image encoding processing.

本発明によれば、ＦＧＴにおいて微弱なフィルムグレイン画像を加算する際に見られるブロックの縦境界の歪みを回避することが可能となる。 ADVANTAGE OF THE INVENTION According to this invention, it becomes possible to avoid the distortion of the vertical boundary of the block seen when adding a weak film grain image in FGT.

以下、添付図面を参照して本発明の実施の形態を説明する。なお、以下の説明では画像信号を単に画像と呼ぶ。 Embodiments of the present invention will be described below with reference to the accompanying drawings. In the following description, the image signal is simply referred to as an image.

（第１の実施形態）
図１に示されるように、本発明の一実施形態に係る動画像符号化装置は、ノイズリダクション部１１、ノイズ推定部１２、ブロック縦境界信号生成部１３、加算部１４、符号化部（エンコーダ）１５及び多重化部（マルチプレクサ）１６を有する。一方、図１の動画像符号化装置に対応する動画像復号化装置は、図２に示されるように分離部（デマルチプレクサ）２１、復号部（デコーダ）２２、ノイズ生成部２３及び加算部２４を有する。 (First embodiment)
As shown in FIG. 1, a moving image encoding apparatus according to an embodiment of the present invention includes a noise reduction unit 11, a noise estimation unit 12, a block vertical boundary signal generation unit 13, an addition unit 14, an encoding unit (encoder). ) 15 and a multiplexing unit (multiplexer) 16. On the other hand, a moving picture decoding apparatus corresponding to the moving picture encoding apparatus of FIG. 1 includes a separation unit (demultiplexer) 21, a decoding unit (decoder) 22, a noise generation unit 23, and an addition unit 24 as shown in FIG. Have

図１の動画像符号化装置では、入力端子１０に入力される動画像である入力画像１０１に対して、ノイズリダクション部１１によりノイズ除去処理が施され、メイン画像１０２（第１メイン画像）が生成される。ノイズ推定部１２は、入力画像１０１とメイン画像１０２の差分から、入力画像１０１中に含まれるノイズ（フィルムグレイン）をモデル化し、モデル情報１０３を出力する。モデル情報１０３は、具体的には例えば後述するＦＧＴテーブルを構築するための情報であり、film grain characteristics SEIメッセージとも呼ばれる。 In the moving image encoding apparatus in FIG. 1, noise reduction processing is performed by the noise reduction unit 11 on the input image 101 that is a moving image input to the input terminal 10, and a main image 102 (first main image) is obtained. Generated. The noise estimation unit 12 models noise (film grain) included in the input image 101 from the difference between the input image 101 and the main image 102 and outputs model information 103. Specifically, the model information 103 is, for example, information for constructing an FGT table described later, and is also called a film grain characteristics SEI message.

ブロック縦境界信号発生部１３は、ノイズ推定部１２から出力されるモデル情報１０３を参照して、メイン画像１０２からＦＧＴの平滑フィルタ処理によってブロック縦境界において損なわれる画像成分（この画像成分は、加算部１４によってメイン画像１０２のブロック縦境界に加算される信号であり、以後ブロック縦境界信号という）１０４を生成する。ブロック縦境界信号１０４は、加算部１４においてノイズリダクション部１１からのメイン画像１０２に足し合わされ、ブロック縦境界の歪みを補償可能なメイン画像１０５（第２メイン画像）が生成される。 The block vertical boundary signal generation unit 13 refers to the model information 103 output from the noise estimation unit 12, and the image component that is damaged at the block vertical boundary by the FGT smoothing filter processing from the main image 102 (this image component is added) This signal is added to the block vertical boundary of the main image 102 by the unit 14, and is hereinafter referred to as a block vertical boundary signal) 104. The block vertical boundary signal 104 is added to the main image 102 from the noise reduction unit 11 in the adding unit 14 to generate a main image 105 (second main image) that can compensate for distortion of the block vertical boundary.

加算器１４から出力されるメイン画像１０５は、符号化部１５（例えばＨ．２６４エンコーダ）により動き補償、ＤＣＴのような直交変換及び量子化を用いた圧縮符号化が施され、符号化データ１０６が生成される。符号化データ１０６は多重化部１６によりノイズ推定部１３から出力されるモデル情報１０３と共に多重化され、符号化ビットストリーム１０７として出力される。符号化ビットストリーム１０７は、例えば図示しないＤＶＤあるいはＨＤＤＶＤのような蓄積メディアに記録されるか、あるいはインターネットのような伝送媒体に送出される。 The main image 105 output from the adder 14 is subjected to compression coding using motion compensation, orthogonal transformation such as DCT, and quantization by an encoding unit 15 (for example, an H.264 encoder). Is generated. The encoded data 106 is multiplexed together with the model information 103 output from the noise estimation unit 13 by the multiplexing unit 16 and output as an encoded bit stream 107. The encoded bit stream 107 is recorded on a storage medium such as a DVD or HD DVD (not shown), or sent to a transmission medium such as the Internet.

一方、図２の動画像復号化装置においては、入力端子２０に上記の蓄積メディアから再生される、あるいは伝送媒体を介して送られてくる符号化ビットストリーム２０１（基本的に符号化ストリーム１０７と同じ）が入力される。分離部２１では、符号化ビットストリーム２０１から符号化データ２０２とモデル情報２０３が分離される。復号部２２では、符号化データ２０２が復号されることによって、メイン画像（復号画像）２０４が生成される。ノイズ生成部１０８は、モデル情報２０３に従ってメイン画像２０４からノイズ画像（フィルムグレイン画像）２０５が生成される。最後に、加算部２４においてメイン画像２０４とノイズ画像２０５が加算されることによって、フィルムグレインが再現された動画像の出力画像２０６が得られる。 On the other hand, in the moving picture decoding apparatus in FIG. 2, an encoded bit stream 201 (basically, an encoded stream 107 and an image reproduced from the above storage medium or sent via a transmission medium to the input terminal 20 is used. The same) is entered. In the separation unit 21, the encoded data 202 and the model information 203 are separated from the encoded bitstream 201. In the decoding unit 22, a main image (decoded image) 204 is generated by decoding the encoded data 202. The noise generation unit 108 generates a noise image (film grain image) 205 from the main image 204 according to the model information 203. Finally, by adding the main image 204 and the noise image 205 in the addition unit 24, an output image 206 of a moving image in which film grain is reproduced is obtained.

ノイズ生成部２３での処理には、特許文献１と同様にノイズ画像の継ぎ目を目立たなくすることを目的とする、ブロック縦境界に対する平滑フィルタ処理が含まれ、これがブロック縦境界に歪みを発生させる原因となる。図３は、その様子を示す図であり、ノイズ画像（フィルムグレイン画像）をメイン画像（復号画像）に加算して得られる出力画像（結果画像）においては、フィルムグレインが再現されるが、ブロック縦境界の歪みである縦線が入ってしまい、画質を損ねる結果となる。ＦＧＴの規格とは異なるが、このような平滑フィルタ処理を行わずにノイズ生成処理を行ったところ、出力画像のブロック縦境界への歪みは出現せず、正常な画像が得られることが確認された。 The processing in the noise generation unit 23 includes smoothing filter processing for the block vertical boundary for the purpose of making the noise image seam inconspicuous as in Patent Document 1, and this generates distortion in the block vertical boundary. Cause. FIG. 3 is a diagram showing the situation. In the output image (result image) obtained by adding the noise image (film grain image) to the main image (decoded image), the film grain is reproduced. A vertical line, which is a distortion of the vertical boundary, is entered, resulting in a deterioration in image quality. Although different from the FGT standard, when noise generation processing is performed without performing such smoothing filter processing, it is confirmed that distortion to the block vertical boundary of the output image does not appear and a normal image can be obtained. It was.

そこで、図１に示した動画像符号化装置では、図２の動画像復号化装置のノイズ生成部２３における平滑フィルタ処理によって損なわれる画像成分（これをブロック縦境界信号という）１０４をブロック縦境界信号生成部１３により生成し、このブロック縦境界信号１０４をノイズリダクション部１１から出力されるメイン画像１０２に加算する。これによって出力画像２０６は、ノイズ生成部２３における平滑フィルタ処理を省略した場合の画像に近づくため、図３に示したようなブロック縦境界の歪みのない、高品質な画像となる。 Therefore, in the moving picture encoding apparatus shown in FIG. 1, an image component (this is called a block vertical boundary signal) 104 that is damaged by the smoothing filter processing in the noise generating unit 23 of the moving picture decoding apparatus in FIG. Generated by the signal generator 13, the block vertical boundary signal 104 is added to the main image 102 output from the noise reduction unit 11. As a result, the output image 206 is close to the image when the smoothing filter processing in the noise generating unit 23 is omitted, so that the output image 206 is a high quality image without the distortion of the block vertical boundary as shown in FIG.

次に、ブロック縦境界信号生成部１３について詳細に説明する。
図４は、ノイズ推定部１２によって生成されるモデル情報１０３であるＦＧＴテーブル３０１の例であり、ここではレンジＬ、レンジＨ、ＤＢ＿ＩＤ及びＣＯＦＦの４項目を有する。レンジＬ及びレンジＨは、メイン画像の例えば８×８画素ブロックの平均画素値に対して設定される下限及び上限を表す。ＤＢ＿ＩＤは、レンジＬ及びレンジＨで示される範囲に対応する図５のノイズデータベース３０２を指し示すＩＤである。ＣＯＦＦは、レンジＬ及びレンジＨで示される範囲に対応する強度（振幅）変調係数である。ノイズデータベース３０２においては、例えば図５に示されるようにＩＤ１，ＩＤ２，・・・の位置にノイズ画像ブロックのデータが格納されている。 Next, the block vertical boundary signal generation unit 13 will be described in detail.
FIG. 4 is an example of the FGT table 301 which is the model information 103 generated by the noise estimation unit 12, and here has four items of range L, range H, DB_ID, and COFF. Range L and range H represent lower and upper limits set for the average pixel value of, for example, an 8 × 8 pixel block of the main image. DB_ID is an ID indicating the noise database 302 in FIG. 5 corresponding to the range indicated by the range L and the range H. COFF is an intensity (amplitude) modulation coefficient corresponding to the range indicated by the range L and the range H. In the noise database 302, for example, as shown in FIG. 5, noise image block data is stored at positions ID1, ID2,.

ここで、図２に示した動画像復号化装置では、以下のようにしてノイズ生成部２３によりノイズ画像２０５が生成される。まず、復号部２２から出力されるメイン画像２０４の画素ブロックの平均画素値が入るレンジＬ及びレンジＨで示される範囲に対応するＤＢ＿ＩＤに従ってノイズデータベースからノイズ画像ブロックが切り出される。切り出されたノイズ画像ブロックに対して、メイン画像２０４の画素ブロックの平均画素値に対応する強度変調係数ＣＯＦＦが乗じられて変調が施されることにより、変調ノイズ画像が生成される。生成された変調ノイズ画像のノイズ画像ブロック間の縦境界に平滑フィルタ処理が施されることによって、メイン画像２０４に対して加算されるべきノイズ画像２０５が生成される。 Here, in the moving picture decoding apparatus shown in FIG. 2, the noise image 205 is generated by the noise generation unit 23 as follows. First, the noise image block is cut out from the noise database according to DB_ID corresponding to the range indicated by the range L and the range H in which the average pixel value of the pixel block of the main image 204 output from the decoding unit 22 is entered. The extracted noise image block is modulated by being multiplied by the intensity modulation coefficient COFF corresponding to the average pixel value of the pixel block of the main image 204, thereby generating a modulated noise image. A smoothing filter process is performed on the vertical boundary between noise image blocks of the generated modulated noise image, so that a noise image 205 to be added to the main image 204 is generated.

図１中に示したブロック縦境界信号生成部１３は、図２の動画像復号化装置におけるノイズ生成部２３の上述した処理に着目して、例えば図６のフローチャートに示す手順によりブロック縦境界信号１０４を生成する。 The block vertical boundary signal generation unit 13 shown in FIG. 1 pays attention to the above-described processing of the noise generation unit 23 in the moving picture decoding apparatus of FIG. 2, for example, according to the procedure shown in the flowchart of FIG. 104 is generated.

まず、メイン画像１０２（ノイズリダクション部１１から出力されるノイズ除去後の画像）の例えば８×８画素ブロックの平均画素値を算出する（ステップＳ１０１）。この平均画素値をキーとしてＦＧＴテーブル３０１を参照することにより、平均画素値に対応するノイズデータベース３０２のＩＤであるＤＢ＿ＩＤを決定する（ステップＳ１０２）。ステップＳ１０２で決定されたＤＢ＿ＩＤをキーとしてノイズデータベース３０２を参照し、ＤＢ＿ＩＤに対応するノイズ画像ブロックを切り出す（ステップＳ１０３）。 First, for example, an average pixel value of an 8 × 8 pixel block of the main image 102 (image after noise removal output from the noise reduction unit 11) is calculated (step S101). By referring to the FGT table 301 using the average pixel value as a key, DB_ID that is an ID of the noise database 302 corresponding to the average pixel value is determined (step S102). With reference to the noise database 302 using the DB_ID determined in step S102 as a key, a noise image block corresponding to the DB_ID is cut out (step S103).

次に、ステップＳ１０２と同様に平均画素値によってＦＧＴテーブル３０１を参照し、平均画素値に対応する強度変調係数ＣＯＦＦを決定する（ステップＳ１０４）。ステップＳ１０３で切り出されたノイズ画像ブロックに対して、ステップＳ１０４で決定されたＣＯＦＦに従って強度変調を施す（ステップＳ１０５）。 Next, as in step S102, the intensity modulation coefficient COFF corresponding to the average pixel value is determined by referring to the FGT table 301 based on the average pixel value (step S104). The noise image block cut out in step S103 is intensity-modulated according to COFF determined in step S104 (step S105).

例えば、ステップＳ１０１で算出された平均画素値が“３０”であれば、図４より平均画素値“３０”に対応するＤＢ＿ＩＤは“２”である。このため、ステップＳ１０２ではＤＢ＿ＩＤはＩＤ２と決定され、ステップＳ１０３においてＩＤ２に対応するノイズ画像ブロックが切り出される。一方、図４より平均画素値“３０”に対応する強度変調係数ＣＯＦＦは0.5と決定され、ステップＳ１０３で切り出されたノイズ画像ブロックに対して0.5が乗じられる。 For example, if the average pixel value calculated in step S101 is “30”, the DB_ID corresponding to the average pixel value “30” is “2” from FIG. For this reason, DB_ID is determined to be ID2 in step S102, and a noise image block corresponding to ID2 is cut out in step S103. On the other hand, the intensity modulation coefficient COFF corresponding to the average pixel value “30” is determined to be 0.5 from FIG. 4, and 0.5 is multiplied to the noise image block cut out in step S103.

ステップＳ１０１〜Ｓ１０５の処理がメイン画像１０２の全ての画像ブロックについて行われることにより、ステップＳ１０５ではメイン画像１０２の全ての画像ブロックに対応しかつ強度変調が施されたノイズ画像ブロックからなる変調ノイズ画像が生成される。 By performing the processing of steps S101 to S105 for all image blocks of the main image 102, in step S105, a modulated noise image including noise image blocks corresponding to all image blocks of the main image 102 and subjected to intensity modulation. Is generated.

次に、ステップＳ１０５で生成された変調ノイズ画像のノイズ画像ブロックの縦境界に平滑フィルタ処理を施して平滑化ノイズ画像を生成する（ステップＳ１０６）。ステップＳ１０５で生成された変調ノイズ画像から、ステップＳ１０６で生成された平滑化ノイズ画像を差し引くことにより得られる差分画像をブロック縦境界信号１０４として生成する（ステップＳ１０７）。 Next, a smoothing noise image is generated by performing smoothing filter processing on the vertical boundary of the noise image block of the modulated noise image generated in step S105 (step S106). A difference image obtained by subtracting the smoothed noise image generated in step S106 from the modulated noise image generated in step S105 is generated as the block vertical boundary signal 104 (step S107).

以上のように本実施形態では、図１の動画像符号化装置においてノイズリダクション部１１から出力されるノイズ除去後のメイン画像１０２にブロック縦境界信号生成部１３により生成されるブロック縦境界信号１０４を足し合わせた後、符号化部１５により符号化を行う。ブロック縦境界信号１０４は、前述したように動画像復号化装置のノイズ生成部２３における平滑フィルタ処理によって損なわれる画像成分であり、符号化前に予めメイン画像１０２に加算される。 As described above, in the present embodiment, the block vertical boundary signal 104 generated by the block vertical boundary signal generation unit 13 in the main image 102 after noise removal output from the noise reduction unit 11 in the moving image encoding device of FIG. Then, the encoding unit 15 performs encoding. The block vertical boundary signal 104 is an image component that is impaired by the smoothing filter processing in the noise generation unit 23 of the video decoding device as described above, and is added to the main image 102 in advance before encoding.

従って、動画像復号化装置においては、ノイズ生成部２３からのノイズ画像２０５が復号部２２からのメイン画像２０４に加算されることによりフィルムグレインが再現されつつも、ノイズ生成部２３での平滑フィルタ処理によるブロック縦境界の歪みのない高品質の出力画像１０７を得ることができる。 Therefore, in the moving picture decoding apparatus, the noise image 205 from the noise generation unit 23 is added to the main image 204 from the decoding unit 22 to reproduce the film grain, but the smoothing filter in the noise generation unit 23 is reproduced. A high-quality output image 107 without distortion of the block vertical boundary due to the processing can be obtained.

次に、第１の実施形態の変形例について説明する。図７は、変形例に係るブロック縦境界信号生成部１３の処理手順であり、図６のステップＳ１０７の後にステップＳ１０８の処理を追加している。ステップＳ１０８では、ステップＳ１０７で変調ノイズ画像から平滑化ノイズ画像を差し引いて得られる差分画像に対して、符号化歪みによる劣化を考慮した係数を掛け合わせることで強度変調を施すことによってブロック縦境界信号１０４を生成する。 Next, a modification of the first embodiment will be described. FIG. 7 shows a processing procedure of the block vertical boundary signal generation unit 13 according to the modification, and the processing of step S108 is added after step S107 of FIG. In step S108, the block vertical boundary signal is obtained by performing intensity modulation by multiplying the difference image obtained by subtracting the smoothed noise image from the modulated noise image in step S107 by a coefficient that takes into account deterioration due to encoding distortion. 104 is generated.

図８は、他の変形例に係るブロック縦境界信号生成部１３の処理手順であり、図６のステップＳの後にステップＳ１０９の処理を追加している。ステップＳ１０９では、ステップＳ１０７で変調ノイズ画像から平滑化ノイズ画像を差し引いて得られる差分画像に対して、オフセットを加算することによってブロック縦境界信号１０４を生成する。 FIG. 8 shows a processing procedure of the block vertical boundary signal generation unit 13 according to another modification, and the processing of step S109 is added after step S of FIG. In step S109, the block vertical boundary signal 104 is generated by adding an offset to the difference image obtained by subtracting the smoothed noise image from the modulation noise image in step S107.

図２の動画像復号化装置におけるノイズ生成部２３では、平滑フィルタの丸めによりブロック縦境界の歪みは負側に出る確率が高い。そこで、ステップＳ１０９では例えば正方向のオフセットを差分画像に対して加算することによって、ブロック縦境界の歪みをより正確に補償できるようなブロック縦境界信号１０４を生成する。 In the noise generating unit 23 in the moving picture decoding apparatus in FIG. 2, there is a high probability that the distortion of the block vertical boundary appears on the negative side due to rounding of the smoothing filter. Therefore, in step S109, for example, a block vertical boundary signal 104 that can more accurately compensate for block vertical boundary distortion is generated by adding a positive offset to the difference image.

なお、図７のステップＳ１０８と図８のステップＳ１０９の処理を組み合わせてもよく、それによってブロック縦境界の歪みをさらに正確に補償することが可能である。 Note that the processing in step S108 in FIG. 7 and step S109 in FIG. 8 may be combined, thereby making it possible to more accurately compensate for distortion at the block vertical boundary.

（第２の実施形態）
図９は、本発明の第２の実施形態に係る動画像符号化装置を示している。本実施形態では、図１からブロック縦境界信号生成部１３及び加算部１４が除去され、代わりにパラメータ範囲判別部１７と特定パラメータ範囲記憶部１８及びパラメータ修正部１９が追加されている。 (Second Embodiment)
FIG. 9 shows a video encoding apparatus according to the second embodiment of the present invention. In this embodiment, the block vertical boundary signal generation unit 13 and the addition unit 14 are removed from FIG. 1, and a parameter range determination unit 17, a specific parameter range storage unit 18, and a parameter correction unit 19 are added instead.

ノイズ推定部12から出力されるモデル情報１０３は、パラメータ修正部１９を経て多重化部１６に入力される。パラメータ範囲判別部１７は、モデル情報１０３が特定パラメータ範囲記憶部１８に記憶されている予め定められた範囲（特定パラメータ範囲という）内にあるか否かを判別する。 The model information 103 output from the noise estimation unit 12 is input to the multiplexing unit 16 via the parameter correction unit 19. The parameter range determination unit 17 determines whether or not the model information 103 is within a predetermined range (referred to as a specific parameter range) stored in the specific parameter range storage unit 18.

ここで、特定パラメータ範囲とは、ＦＧＴの弊害である周期パターンの歪み（言い換えればブロック縦境界の歪み）が観測されるようなモデル情報１０３のパラメータ範囲をいう。図１０は、このような特定パラメータ範囲の一例を示す図であり、この例では先に例示したモデル情報１０３（ＦＧＴテーブル）のうちＤＢ＿ＩＤを横軸にとり、ＣＯＦＦを縦軸にとっており、図１０の斜線で示す範囲が特定パラメータ範囲である。 Here, the specific parameter range refers to the parameter range of the model information 103 in which the distortion of the periodic pattern (in other words, the distortion of the block vertical boundary), which is an adverse effect of FGT, is observed. FIG. 10 is a diagram illustrating an example of such a specific parameter range. In this example, in the model information 103 (FGT table) exemplified above, DB_ID is taken on the horizontal axis, and COFF is taken on the vertical axis. The range indicated by diagonal lines is the specific parameter range.

パラメータ修正部１９は、入力されたモデル情報１０３が特定パラメータ範囲外にある場合は、モデル情報１０３をそのままモデル情報１０８として出力する。一方、モデル情報１０３が特定パラメータ範囲内にある場合、パラメータ修正部１９はモデル情報１０３を特定パラメータ外となるように修正してモデル情報１０８を出力する。 If the input model information 103 is outside the specific parameter range, the parameter correction unit 19 outputs the model information 103 as model information 108 as it is. On the other hand, when the model information 103 is within the specific parameter range, the parameter correction unit 19 corrects the model information 103 to be out of the specific parameter and outputs the model information 108.

このように本実施形態によると、モデル情報のパラメータのうち動画像復号化装置の出力画像２０６に周期パターンの歪みを発生させるような範囲の値については利用しないようにすることにより、ブロック縦境界の歪みのような周期パターンの歪みが発生しない高品質な出力画像２０６が得られる。 As described above, according to the present embodiment, by not using the values of the range of the model information parameters that cause the distortion of the periodic pattern in the output image 206 of the video decoding device, the block vertical boundary Thus, a high-quality output image 206 in which no periodic pattern distortion such as the above-described distortion occurs is obtained.

以上説明した本発明の実施形態に基づく動画像符号化処理は、ハードウェアでも実現可能であるが、パーソナルコンピュータのようなコンピュータを用いてソフトウェアにより実行することも可能である。従って、本発明によれば上述した動画像符号化処理をコンピュータに行わせるためのプログラム、あるいは当該プログラムを格納したコンピュータ読み取り可能な記憶媒体を提供することができる。 The moving image encoding processing based on the embodiment of the present invention described above can be realized by hardware, but can also be executed by software using a computer such as a personal computer. Therefore, according to the present invention, it is possible to provide a program for causing a computer to perform the above-described moving image encoding process or a computer-readable storage medium storing the program.

なお、本発明は上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、図６〜図８のフローチャートに示した各ステップの処理順序は適宜変更できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより、種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態にわたる構成要素を適宜組み合わせてもよい。 Note that the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. In addition, the processing order of each step shown in the flowcharts of FIGS. 6 to 8 can be changed as appropriate. In addition, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, constituent elements over different embodiments may be appropriately combined.

第１の実施形態に係る動画像符号化装置を示すブロック図1 is a block diagram showing a video encoding device according to a first embodiment. 第１の実施形態に係る動画像復号化装置を示すブロック図1 is a block diagram showing a moving picture decoding apparatus according to a first embodiment. ＦＧＴの副作用による画質劣化を説明するための図Illustration for explaining image quality degradation due to side effects of FGT モデル情報であるＦＧＴテーブルの一例を示す図The figure which shows an example of the FGT table which is model information ノイズデータベースの一例を示す図Figure showing an example of a noise database ブロック縦境界信号生成部の処理手順を示すフローチャートFlow chart showing processing procedure of block vertical boundary signal generation unit ブロック縦境界信号生成部の他の処理手順を示すフローチャートThe flowchart which shows the other processing procedure of a block vertical boundary signal generation part ブロック縦境界信号生成部の更に別の処理手順を示すフローチャートThe flowchart which shows another processing procedure of a block vertical boundary signal generation part. 第２の実施形態に係る動画像符号化装置を示すブロック図The block diagram which shows the moving image encoder which concerns on 2nd Embodiment. 第２の実施形態における特定パラメータ範囲の例を示す図The figure which shows the example of the specific parameter range in 2nd Embodiment

Explanation of symbols

１０・・・動画像の入力端子
１１・・・ノイズリダクション部
１２・・・ノイズ推定部
１３・・・ブロック縦境界信号生成部
１４・・・加算部
１５・・・符号化部
１６・・・多重化部
１７・・・パラメータ範囲判別部
１８・・・特定パラメータ範囲記憶部
１９・・・パラメータ修正部
２０・・・符号化ストリームの入力端子
２１・・・分離部
２２・・・復号部
２３・・・ノイズ生成部
２４・・・加算部
１０１・・・入力画像
１０２・・・第１メイン画像
１０３・・・モデル情報
１０４・・・ブロック縦境界信号
１０５・・・第２メイン画像
１０６・・・符号化データ
１０７・・・符号化ビットストリーム
１０８・・・モデル情報
２０１・・・符号化ストリーム
２０２・・・符号化データ
２０３・・・モデル情報
２０４・・・メイン画像
２０５・・・ノイズ画像
２０６・・・出力画像 DESCRIPTION OF SYMBOLS 10 ... Moving image input terminal 11 ... Noise reduction part 12 ... Noise estimation part 13 ... Block vertical boundary signal generation part 14 ... Adder 15 ... Encoding part 16 ... Multiplexer 17... Parameter range discriminator 18... Specific parameter range storage unit 19... Parameter modifier 20... Encoded stream input terminal 21. ... Noise generator 24 ... Adder 101 ... Input image 102 ... First main image 103 ... Model information 104 ... Block vertical boundary signal 105 ... Second main image 106 .. Encoded data 107 ... Encoded bit stream 108 ... Model information 201 ... Encoded stream 202 ... Encoded data 203 ... Model information 204 The main image 205 ... noise image 206 ... output image

Claims

Performing a noise removal process on an input image to be encoded including a noise image to generate a first main image;
Estimating the noise image from the input image and the first main image to generate model information corresponding to the noise image;
Referring to the model information, generating an image component that is impaired by smoothing filter processing on a boundary between each noise image block of a film grain image for film grain reproduction; and
Adding the first main image and the image component to generate a second main image;
Encoding the second main image to generate encoded data;
And a step of multiplexing the model information with the encoded data and outputting an encoded stream.

Performing a noise removal process on an input image to be encoded including a first noise image to generate a first main image;
Estimating the first noise image from the input image and the first main image to generate model information including a database ID and an intensity modulation coefficient corresponding to the first noise image;
Selecting a noise database storing a noise image block for each image block including a plurality of pixels in the first main image according to a database ID associated with a pixel average value of the image block with reference to the model information When,
Cutting out a noise image block from the noise database;
Selecting an intensity modulation coefficient associated with the pixel average value with reference to the model information;
Modulating the noise image block with the intensity modulation coefficient to generate a modulated noise image;
Applying a smoothing filter process to a boundary between each noise image block of the modulated noise image to generate a smoothed noise image;
Subtracting the smoothed noise image from the modulated noise image to generate a difference image;
Adding the first main image and the difference image to generate a second main image;
Encoding the second main image to generate encoded data;
And a step of multiplexing the model information with the encoded data and outputting an encoded stream.

The moving image encoding method according to claim 2, wherein the step of generating the second main image is configured to add the difference image to the first main image after multiplying the difference image by a coefficient.

The moving image encoding method according to claim 2, wherein the step of generating the second main image is configured to add the difference image to the difference image and add the offset to the first main image. .

Performing a noise removal process on an input image to be encoded including a noise image to generate a first main image;
Estimating the noise image from the input image and the first main image to generate model information having a plurality of parameters corresponding to the noise image;
Storing a specific parameter range in which a periodic pattern is generated by a smoothing filter process for a boundary between each noise image block of a film grain image for film grain reproduction;
Determining whether the value of the parameter is within the specific parameter range; and
Correcting the parameter value to a value outside the specific parameter range when the parameter value is determined to be within the specific parameter range by the determining step;
Encoding the main image to generate encoded data;
A method of multiplexing the model information that has undergone the correcting step and outputting an encoded stream to the encoded data.

The moving picture encoding method according to claim 5, wherein the model information includes a database ID and an intensity modulation coefficient corresponding to the noise image as the parameters.

A noise reduction unit that generates a first main image by performing noise removal processing on an input image to be encoded including a noise image;
An estimation unit that estimates the noise image from the input image and the first main image and generates model information corresponding to the noise image;
A generation unit that refers to the model information and generates an image component that is impaired by smoothing filter processing on a boundary between each noise image block of a film grain image for film grain reproduction;
An adder for adding the first main image and the image component to generate a second main image;
An encoding unit that encodes the second main image to generate encoded data;
And a multiplexing unit that multiplexes the model information with the encoded data and outputs an encoded stream.

A noise reduction unit that generates a first main image by performing a noise removal process on an input image to be encoded including a first noise image;
An estimation unit that estimates the first noise image from the input image and the first main image and generates model information including a database ID and an intensity modulation coefficient corresponding to the first noise image;
Means for selecting a noise database storing a noise image block for each image block including a plurality of pixels in the first main image according to a database ID associated with a pixel average value of the image block with reference to the model information When,
Means for extracting a noise image block from the noise database;
Means for selecting an intensity modulation coefficient associated with the pixel average value with reference to the model information;
Means for modulating the noise image block with the intensity modulation coefficient to generate a modulated noise image;
Means for applying a smoothing filter process to a boundary between each noise image block of the modulated noise image to generate a smoothed noise image;
A generating unit that subtracts the smoothed noise image from the modulated noise image to generate a difference image;
An adder for adding the first main image and the difference image to generate a second main image;
An encoding unit that encodes the second main image to generate encoded data;
And a multiplexing unit that multiplexes the model information with the encoded data and outputs an encoded stream.

The moving image encoding apparatus according to claim 8, wherein the generation unit is configured to add the difference image to the first main image after multiplying the difference image by a coefficient.

The moving image encoding apparatus according to claim 8, wherein the generation unit is configured to add the difference image to the first main image after adding the offset to the difference image.

A generation unit that performs noise removal processing on an input image to be encoded including a noise image to generate a first main image;
A generation unit configured to generate model information having a plurality of parameters corresponding to the noise image by estimating the noise image from the input image and the first main image;
A storage unit for storing a specific parameter range in which a periodic pattern is generated by smoothing filter processing on a boundary between each noise image block of a film grain image for film grain reproduction;
A determination unit for determining whether or not the value of the parameter is within the specific parameter range;
A correction unit that corrects the value of the parameter to a value outside the specific parameter range when the value of the parameter is determined to be within the specific parameter range by the determining step;
An encoding unit that encodes the main image to generate encoded data;
And a multiplexing unit that multiplexes the encoded data with the model information that has passed through the correction unit and outputs an encoded stream.

A process of generating a first main image by performing a noise removal process on an input image to be encoded including a noise image;
Processing for estimating the noise image from the input image and the first main image and generating model information corresponding to the noise image;
With reference to the model information, a process of generating an image component that is damaged by a smoothing filter process on a boundary between each noise image block of a film grain image for film grain reproduction;
A process of adding the first main image and the image component to generate a second main image;
A process of generating the encoded data by encoding the second main image;
A program for causing a computer to perform a moving image encoding process including: a process of multiplexing the model information with the encoded data and outputting an encoded stream.

A process of generating a first main image by performing a noise removal process on an input image to be encoded including a first noise image;
Processing for estimating the first noise image from the input image and the first main image and generating model information including a database ID and an intensity modulation coefficient corresponding to the first noise image;
Processing for selecting a noise database storing a noise image block for each image block including a plurality of pixels in the first main image according to a database ID associated with a pixel average value of the image block with reference to the model information When,
A process of cutting out a noise image block from the noise database;
Selecting an intensity modulation coefficient associated with the pixel average value with reference to the model information;
Processing to generate a modulated noise image by modulating the noise image block with the intensity modulation coefficient;
Processing to generate a smoothed noise image by applying a smoothing filter process to a boundary between each noise image block of the modulated noise image;
A process of subtracting the smoothed noise image from the modulated noise image to generate a difference image;
A process of generating the second main image by adding the first main image and the difference image;
A process of generating the encoded data by encoding the second main image;
A program for causing a computer to perform a moving image encoding process including: a process of multiplexing the model information with the encoded data and outputting an encoded stream.

The program according to claim 13, wherein the process of generating the second main image is a process of adding the difference image to the first main image after multiplying the difference image by a coefficient.

The program according to claim 13, wherein the process of generating the second main image is a process of adding the difference image to the first main image after adding an offset to the difference image.

A process of generating a first main image by performing a noise removal process on an input image to be encoded including a noise image;
Processing to generate model information having a plurality of parameters corresponding to the noise image by estimating the noise image from the input image and the first main image;
A process for storing a specific parameter range in which a periodic pattern is generated by a smoothing filter process on a boundary between each noise image block of a film grain image for film grain reproduction;
Processing for determining whether or not the value of the parameter is within the specific parameter range;
A process for correcting the parameter value to a value outside the specific parameter range when the parameter value is determined to be within the specific parameter range by the determining process;
A process of encoding the main image to generate encoded data;
A program for causing a computer to perform a moving image encoding process including: a process of multiplexing model information that has undergone the correction process to the encoded data and outputting an encoded stream.