JP2019022080A

JP2019022080A - Coding apparatus, coding method, and program

Info

Publication number: JP2019022080A
Application number: JP2017138939A
Authority: JP
Inventors: 成記望月; Shigeki Mochizuki
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2017-07-18
Filing date: 2017-07-18
Publication date: 2019-02-07

Abstract

To provide a coding apparatus, a coding method, and a program that can perform efficient coding according to an amount of noise.SOLUTION: A coding apparatus includes determination means that estimates a noise amount included in image data and determines whether the noise amount is greater than a threshold value, frequency conversion means that generates a conversion coefficient by frequency-converting the image data when the noise amount is larger than a threshold value, and coding means that encodes the conversion coefficient when the noise amount is greater than the threshold value and encode the image data when the noise amount is less than the threshold value.SELECTED DRAWING: Figure 5

Description

本発明は、符号化装置、符号化方法及びプログラムに関する。 The present invention relates to an encoding device, an encoding method, and a program.

デジタルスチルカメラやデジタルビデオカメラなどのデジタル撮像装置は、広く普及し、一般に利用されている。これらのデジタル撮像装置は、ＣＣＤやＣＭＯＳセンサなどの撮像素子で受光した光をデジタル信号に変換することにより、デジタル画像データを生成する。デジタル画像データを生成する過程では、撮像素子や回路の特性により、暗電流ノイズ、熱雑音、及びショットノイズなどが発生し、デジタル画像データにランダムなノイズ（ランダムノイズ）が混入する。 Digital imaging devices such as digital still cameras and digital video cameras are widely used and are generally used. These digital imaging devices generate digital image data by converting light received by an imaging device such as a CCD or CMOS sensor into a digital signal. In the process of generating digital image data, dark current noise, thermal noise, shot noise, and the like are generated due to the characteristics of the image sensor and the circuit, and random noise (random noise) is mixed into the digital image data.

近年の撮像素子の小型化、高画素化に伴い、画素ピッチが極小化しているため、ランダムノイズが目立ちやすくなっており、特に撮像感度を高くした場合などは、ランダムノイズが顕著に発生する。こうした観点から、撮像データを記録する場合、ランダムノイズ成分を考慮した符号化を行い、記録することが望ましい。そこで、撮像条件に応じて、量子化テーブルを選択的に切り替えて符号化する方法が開示されている（例えば、特許文献１参照）。 With recent miniaturization of imaging devices and higher pixel counts, the pixel pitch has been minimized, so that random noise is conspicuous. Especially when imaging sensitivity is increased, random noise is noticeably generated. From this point of view, when recording imaging data, it is desirable to perform encoding in consideration of a random noise component and record it. In view of this, a method is disclosed in which encoding is performed by selectively switching quantization tables in accordance with imaging conditions (see, for example, Patent Document 1).

特許第４７６５２４０号公報Japanese Patent No. 4765240

しかし、ＲＡＷデータを符号化して記録する場合、素材性を重視するため、情報損失のない可逆符号化が重要となり、量子化が利用できない場合がある。そこで、撮像データに混入したノイズ量を考慮して効率的に可逆符号化する方法が望まれている。 However, when RAW data is encoded and recorded, lossless encoding without information loss is important because the materiality is important, and quantization may not be used. Thus, a method for efficiently performing lossless encoding in consideration of the amount of noise mixed in the image data is desired.

本発明の目的は、ノイズ量に応じた効率的な符号化を行うことができる符号化装置、符号化方法及びプログラムを提供することである。 An object of the present invention is to provide an encoding device, an encoding method, and a program that can perform efficient encoding according to the amount of noise.

本発明の符号化装置は、画像データに含まれるノイズ量を推定し、前記ノイズ量が閾値より多いか否かを判定する判定手段と、前記ノイズ量が閾値より多い場合には、前記画像データを周波数変換することにより変換係数を生成する周波数変換手段と、前記ノイズ量が閾値より多い場合には、前記変換係数に対して符号化を行い、前記ノイズ量が閾値より少ない場合には、前記画像データに対して符号化を行う符号化手段とを有する。 The encoding apparatus of the present invention estimates a noise amount included in image data, determines whether or not the noise amount is greater than a threshold value, and if the noise amount is greater than a threshold value, the image data Frequency conversion means for generating a transform coefficient by frequency transforming, and when the amount of noise is greater than a threshold, encoding is performed on the transform coefficient, and when the amount of noise is less than a threshold, Coding means for coding image data.

本発明によれば、ノイズ量に応じた効率的な符号化を行うことができる。 According to the present invention, efficient encoding according to the amount of noise can be performed.

第１の実施形態による撮像装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the imaging device by 1st Embodiment. ＲＡＷデータの構成例を示す図である。It is a figure which shows the structural example of RAW data. サブバンド分割を示す図である。It is a figure which shows subband division | segmentation. ＭＥＤ予測方法を示す図である。It is a figure which shows a MED prediction method. 第１の実施形態による符号化方法を示すフローチャートである。It is a flowchart which shows the encoding method by 1st Embodiment. ＩＳＯ感度と発生符号量の関係を示すグラフである。It is a graph which shows the relationship between ISO sensitivity and the amount of generated codes. ＩＳＯ感度と標準偏差の関係を示すグラフである。It is a graph which shows the relationship between ISO sensitivity and a standard deviation. 第２の実施形態による符号化方法を示すフローチャートである。It is a flowchart which shows the encoding method by 2nd Embodiment. ＲＡＷデータの分割例を示す図である。It is a figure which shows the example of a division | segmentation of RAW data. 第３の実施形態による符号化方法を示すフローチャートである。It is a flowchart which shows the encoding method by 3rd Embodiment.

（第１の実施形態）
図１（ａ）は、本発明の第１の実施形態による撮像装置１００の構成例を示すブロック図である。撮像装置１００は、デジタルカメラ、ビデオカメラの他、スマートフォン、タブレット、工業用カメラ、医療用カメラ等に適用可能である。撮像装置１００は、ＲＡＷデータを可逆符号化可能な符号化装置であり、制御部１０１、撮像部１０２、メモリＩ／Ｆ（インターフェース）部１０３、メモリ１０４、ＲＡＷデータ符号化部１０５、記録処理部１０６、及び記録媒体１０７を有する。 (First embodiment)
FIG. 1A is a block diagram illustrating a configuration example of the imaging apparatus 100 according to the first embodiment of the present invention. The imaging device 100 can be applied to a smart phone, a tablet, an industrial camera, a medical camera, and the like in addition to a digital camera and a video camera. The imaging device 100 is an encoding device capable of lossless encoding of RAW data, and includes a control unit 101, an imaging unit 102, a memory I / F (interface) unit 103, a memory 104, a RAW data encoding unit 105, and a recording processing unit. 106 and a recording medium 107.

制御部１０１は、撮像装置１００を構成する各処理部を制御する。撮像部１０２は、光学レンズ、絞り、フォーカス制御及びレンズ駆動部を含む光学ズームが可能なレンズ光学系と、レンズ光学系からの光情報を電気信号に変換するＣＣＤイメージセンサ又はＣＭＯＳセンサなどの撮像素子を含む。撮像部１０２は、撮像素子により得られた電気信号をデジタル信号へ変換することにより、ＲＡＷデータ（画像データ）を生成する。制御部１０１は、撮像部１０２により生成されたＲＡＷデータを、メモリＩ／Ｆ部１０３を介して、メモリ１０４へ書き込む。 The control unit 101 controls each processing unit that configures the imaging apparatus 100. The imaging unit 102 is a lens optical system capable of optical zoom including an optical lens, an aperture, a focus control and a lens driving unit, and an imaging such as a CCD image sensor or a CMOS sensor that converts optical information from the lens optical system into an electrical signal. Including elements. The imaging unit 102 generates RAW data (image data) by converting an electrical signal obtained by the imaging element into a digital signal. The control unit 101 writes the RAW data generated by the imaging unit 102 to the memory 104 via the memory I / F unit 103.

図２は、ＲＡＷデータの構成例を示す図である。ＲＡＷデータは、有効画素領域２０１と、オプティカルブラック領域２０２とを有する。オプティカルブラック領域２０２は、水平オプティカルブラック領域２０３と、垂直オプティカルブラック領域２０４とを有する。撮像部１０２の撮像素子は、遮光されている複数のオプティカルブラック画素と、遮光されていない複数の有効画素とを有する。有効画素領域２０１は、有効画素により生成されたＲＡＷデータの領域である。オプティカルブラック領域２０２は、オプティカルブラック画素により生成されたＲＡＷデータの領域である。ＲＡＷデータは、ベイヤ配列のＲ（赤）、Ｇ１（緑）、Ｇ２（緑）、Ｂ（青）の４つの色要素の画素のデータで構成される。なお、ＲＡＷデータの配列及び色要素は、この構成に限定されず、他の方式であってもよい。 FIG. 2 is a diagram illustrating a configuration example of RAW data. The RAW data has an effective pixel area 201 and an optical black area 202. The optical black area 202 includes a horizontal optical black area 203 and a vertical optical black area 204. The imaging element of the imaging unit 102 includes a plurality of optical black pixels that are shielded from light and a plurality of effective pixels that are not shielded from light. The effective pixel area 201 is an area of RAW data generated by effective pixels. The optical black area 202 is an area of RAW data generated by optical black pixels. The RAW data is composed of pixel data of four color elements of R (red), G1 (green), G2 (green), and B (blue) in a Bayer array. Note that the arrangement and color elements of the RAW data are not limited to this configuration, and other methods may be used.

図１（ａ）において、メモリＩ／Ｆ部１０３は、各処理部からのメモリアクセス要求を調停し、メモリ１０４に対する読み出し又は書き込みの制御を行う。メモリ１０４は、例えば揮発性メモリであり、撮像装置１００を構成する各処理部から出力される各種データを格納する。 In FIG. 1A, the memory I / F unit 103 arbitrates memory access requests from the respective processing units, and controls reading or writing with respect to the memory 104. The memory 104 is, for example, a volatile memory, and stores various data output from each processing unit included in the imaging device 100.

ＲＡＷデータ符号化部１０５は、周波数変換部１０５ａ、予測差分生成部１０５ｂ、及びエントロピー符号化部１０５ｃを有する。制御部１０１は、メモリ１０４からベイヤ配列のＲＡＷデータを読み出し、ＲＡＷデータ符号化部１０５に出力する。周波数変換部１０５ａは、ベイヤ配列のＲＡＷデータを各色要素プレーンに分離し、分離された色要素プレーン単位で、所望の分解レベルで可逆型の離散ウェーブレット変換による周波数変換処理を行い、予測差分生成部１０５ｂへ変換係数を出力する。 The RAW data encoding unit 105 includes a frequency conversion unit 105a, a prediction difference generation unit 105b, and an entropy encoding unit 105c. The control unit 101 reads the Bayer array RAW data from the memory 104 and outputs the RAW data to the RAW data encoding unit 105. The frequency conversion unit 105a separates the Bayer array RAW data into color element planes, performs frequency conversion processing by reversible discrete wavelet transform at a desired separation level in units of the separated color element planes, and generates a prediction difference generation unit The conversion coefficient is output to 105b.

図３（ａ）は、分解レベル１のサブバンド分割処理に関わる離散ウェーブレット変換を実現するためのフィルタバンク構成の例を示す図である。ステップＳ３０１において、周波数変換部１０５ａは、式（１）のローパスフィルタの伝達関数ｌｐｆ（ｚ）により、入力のＲＡＷデータに対して、水平方向のローパスフィルタ処理を行う。次に、ステップＳ３０２において、周波数変換部１０５ａは、ステップＳ３０１の処理後のデータに対して、２：１の間引きを行う。 FIG. 3A is a diagram illustrating an example of a filter bank configuration for realizing discrete wavelet transform related to the decomposition level 1 subband division processing. In step S301, the frequency conversion unit 105a performs horizontal low-pass filter processing on the input RAW data with the transfer function lpf (z) of the low-pass filter of Expression (1). Next, in step S302, the frequency conversion unit 105a performs 2: 1 decimation on the data after the processing in step S301.

次に、ステップＳ３０３において、周波数変換部１０５ａは、ステップＳ３０２の処理後のデータに対して、式（１）のローパスフィルタの伝達関数ｌｐｆ（ｚ）により、垂直方向のローパスフィルタ処理を行う。次に、ステップＳ３０４において、周波数変換部１０５ａは、ステップＳ３０３の処理後のデータに対して、２：１の間引きを行うことにより、図３（ｂ）のＬＬサブバンドの変換係数を生成する。 Next, in step S303, the frequency conversion unit 105a performs a low-pass filter process in the vertical direction on the data after the process in step S302, using the transfer function lpf (z) of the low-pass filter of Expression (1). Next, in step S304, the frequency conversion unit 105a generates the conversion coefficient of the LL subband in FIG. 3B by performing 2: 1 decimation on the data after the processing in step S303.

ステップＳ３０５において、周波数変換部１０５ａは、ステップＳ３０２の処理後のデータに対して、式（２）のハイパスフィルタの伝達関数ｈｐｆ（ｚ）により、垂直方向のハイパスフィルタ処理を行う。次に、ステップＳ３０６において、周波数変換部１０５ａは、ステップＳ３０５の処理後のデータに対して、２：１の間引きを行うことにより、図３（ｂ）のＨＬサブバンドの変換係数を生成する。 In step S305, the frequency conversion unit 105a performs vertical high-pass filter processing on the data after processing in step S302, using the high-pass filter transfer function hpf (z) of Expression (2). Next, in step S306, the frequency conversion unit 105a generates a conversion coefficient of the HL subband in FIG. 3B by performing 2: 1 decimation on the data after the processing in step S305.

ステップＳ３１１において、周波数変換部１０５ａは、式（２）のハイパスフィルタの伝達関数ｈｐｆ（ｚ）により、入力のＲＡＷデータに対して、水平方向のハイパスフィルタ処理を行う。次に、ステップＳ３１２において、周波数変換部１０５ａは、ステップＳ３１１の処理後のデータに対して、２：１の間引きを行う。 In step S311, the frequency conversion unit 105a performs a high-pass filter process in the horizontal direction on the input RAW data by using the transfer function hpf (z) of the high-pass filter of Expression (2). Next, in step S312, the frequency conversion unit 105a performs 2: 1 decimation on the data after the processing in step S311.

次に、ステップＳ３１３において、周波数変換部１０５ａは、ステップＳ３１２の処理後のデータに対して、式（１）のローパスフィルタの伝達関数ｌｐｆ（ｚ）により、垂直方向のローパスフィルタ処理を行う。次に、ステップＳ３１４において、周波数変換部１０５ａは、ステップＳ３１３の処理後のデータに対して、２：１の間引きを行うことにより、図３（ｂ）のＬＨサブバンドの変換係数を生成する。 Next, in step S313, the frequency conversion unit 105a performs a low-pass filter process in the vertical direction on the data after the process in step S312 by the transfer function lpf (z) of the low-pass filter of Expression (1). Next, in step S314, the frequency conversion unit 105a performs 2: 1 decimation on the data after the processing in step S313, thereby generating the conversion coefficient of the LH subband in FIG.

ステップＳ３１５において、周波数変換部１０５ａは、ステップＳ３１２の処理後のデータに対して、式（２）のハイパスフィルタの伝達関数ｈｐｆ（ｚ）により、垂直方向のハイパスフィルタ処理を行う。次に、ステップＳ３１６において、周波数変換部１０５ａは、ステップＳ３１５の処理後のデータに対して、２：１の間引きを行うことにより、図３（ｂ）のＨＨサブバンドの変換係数を生成する。 In step S315, the frequency conversion unit 105a performs a high-pass filter process in the vertical direction on the data after the process in step S312 using the high-pass filter transfer function hpf (z) of Expression (2). Next, in step S316, the frequency conversion unit 105a generates the conversion coefficient of the HH subband in FIG. 3B by performing 2: 1 decimation on the data after the process in step S315.

以上のように、周波数変換部１０５ａは、図２のＲＡＷデータを離散ウェーブレット変換することにより、図３（ｂ）のＬＬサブバンドＬＬ、ＨＬサブバンド、ＬＨサブバンド及びＨＨサブバンドの変換係数を生成する。 As described above, the frequency conversion unit 105a performs discrete wavelet transform on the RAW data in FIG. 2, thereby converting the conversion coefficients of the LL subband LL, the HL subband, the LH subband, and the HH subband in FIG. Generate.

周波数変換部１０５ａは、分解レベルが１よりも大きい場合には、ＬＬサブバンドに対して、階層的にサブバンド分割を実行する。例えば、分解レベルが３の場合、周波数変換部１０５ａは、図３（ｃ）に示すようなサブバンド分割を行う。分解レベルが０の場合には、周波数変換部１０５ａは、離散ウェーブレット変換処理を行わず、入力されたＲＡＷデータをそのまま予測差分生成部１０５ｂに出力する。ここで、離散ウェーブレット変換は、式（１）及び（２）に示すように、５タップのローパスフィルタと３タップのハイパスフィルタを用いたが、これとは異なるタップ数及び異なる係数のフィルタを用いてもよく、可逆型であれば他の周波数変換方式であってもよい。 When the decomposition level is higher than 1, the frequency conversion unit 105a performs subband division hierarchically on the LL subband. For example, when the decomposition level is 3, the frequency conversion unit 105a performs subband division as illustrated in FIG. When the decomposition level is 0, the frequency conversion unit 105a does not perform the discrete wavelet conversion process and outputs the input RAW data as it is to the prediction difference generation unit 105b. Here, the discrete wavelet transform uses a 5-tap low-pass filter and a 3-tap high-pass filter as shown in equations (1) and (2), but uses a filter with a different number of taps and different coefficients. As long as it is a reversible type, other frequency conversion methods may be used.

図１（ａ）において、予測差分生成部１０５ｂは、周波数変換部１０５ａから入力されたＲＡＷデータの画素値又は変換係数に対して、ＭＥＤ（ＭｅｄｉａｎＥｄｇｅＤｅｔｅｃｔｏｒ）予測により予測値ｐｄを生成する。 In FIG. 1A, the prediction difference generation unit 105b generates a prediction value pd by MED (Media Edge Detector) prediction for the pixel value or conversion coefficient of the RAW data input from the frequency conversion unit 105a.

図４は、ＭＥＤ予測方法を示す図である。予測差分生成部１０５ｂは、画素値又は変換係数の予測差分データを求める。以下、画素値の予測差分データを生成する方法を例に説明するが、変換係数の予測差分データの生成方法も同様である。予測差分生成部１０５ｂは、ＭＥＤ予測により、符号化対象画素ｘの周辺画素ａ〜ｄの値から、符号化対象画素の値に対する予測値ｐｄを生成する。画素ａ及びｂの値のうちの小さい方の値が画素ｃの値以上である場合には、画素ａ及びｂの値のうちの大きい方の値を、符号化対象画素ｘの予測値ｐｄとする。また、画素ａ及びｂの値のうちの大きい方の値が画素ｃの値以下である場合には、画素ａ及びｂの値のうちの小さい方の値を、符号化対象画素ｘの予測値ｐｄとする。それ以外の場合には、画素ａ及びｂの値の加算値から画素ｃの値を減算した値を、符号化対象画素ｘの予測値ｐｄとする。 FIG. 4 is a diagram illustrating a MED prediction method. The prediction difference generation unit 105b obtains pixel difference or conversion coefficient prediction difference data. Hereinafter, a method for generating prediction difference data for pixel values will be described as an example, but a method for generating prediction difference data for transform coefficients is also the same. The prediction difference generation unit 105b generates a prediction value pd for the value of the encoding target pixel from the values of the surrounding pixels a to d of the encoding target pixel x by MED prediction. When the smaller value of the values of the pixels a and b is greater than or equal to the value of the pixel c, the larger value of the values of the pixels a and b is set to the predicted value pd of the encoding target pixel x. To do. In addition, when the larger value of the values of the pixels a and b is less than or equal to the value of the pixel c, the smaller value of the values of the pixels a and b is used as the predicted value of the encoding target pixel x. Let pd. In other cases, a value obtained by subtracting the value of the pixel c from the addition value of the values of the pixels a and b is set as the predicted value pd of the encoding target pixel x.

次に、予測差分生成部１０５ｂは、符号化対象画素ｘの値と予測値ｐｄとの差分データを生成し、その差分データをエントロピー符号化部１０５ｃへ出力する。なお、予測差分生成部１０５ｂによる予測差分が生成されるのは、周波数変換部１０５ａにより生成された低周波数成分サブバンド（ＬＬサブバンド）のみであり、高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）には適用されない。 Next, the prediction difference generation unit 105b generates difference data between the value of the encoding target pixel x and the prediction value pd, and outputs the difference data to the entropy encoding unit 105c. Note that the prediction difference generated by the prediction difference generation unit 105b is generated only in the low frequency component subband (LL subband) generated by the frequency conversion unit 105a, and the high frequency component subband (HL subband, LH). This does not apply to subbands and HH subbands.

エントロピー符号化部１０５ｃは、予測差分生成部１０５ｂから入力されたデータに対して、例えばハフマン符号化又はゴロム符号化等により、エントロピー符号化を行い、符号化データを生成する。なお、予測方式及びエントロピー符号化方式は、他の方式であってもよい。制御部１０１は、エントロピー符号化部１０５ｃにより生成された符号化データを、メモリＩ／Ｆ部１０３を介して、メモリ１０４へ書き込む。 The entropy encoding unit 105c performs entropy encoding on the data input from the prediction difference generation unit 105b by, for example, Huffman encoding or Golomb encoding, and generates encoded data. Note that the prediction method and the entropy coding method may be other methods. The control unit 101 writes the encoded data generated by the entropy encoding unit 105 c into the memory 104 via the memory I / F unit 103.

制御部１０１は、メモリ１０４から符号化データを読み出し、記録処理部１０６に出力する。記録処理部１０６は、符号化データに対して、所定の記録フォーマット化を行い、記録媒体１０７へ記録する。記録媒体１０７は、例えば、不揮発性メモリである。 The control unit 101 reads the encoded data from the memory 104 and outputs it to the recording processing unit 106. The recording processing unit 106 performs a predetermined recording format on the encoded data and records it on the recording medium 107. The recording medium 107 is, for example, a nonvolatile memory.

ここで、一般的なＲＡＷデータは、隣接する画素値の類似性が高く、空間的な相関性が高いため、前述のような予測差分を利用した予測差分符号化が効率的なデータ圧縮に有効である。しかし、高ＩＳＯ感度条件等で撮像されたＲＡＷデータは、ノイズ成分を多く含むので、空間的な相関性が低下し、予測差分符号化における圧縮効率が低下してしまう。そのため、ＲＡＷデータ符号化部１０５は、周波数変換処理により周波数変換を行い、周波数帯域毎に相関性を高めたデータとして符号化を行うことにより、圧縮効率を向上させられると考えられる。 Here, since general RAW data has high similarity between adjacent pixel values and high spatial correlation, predictive differential encoding using predictive differences as described above is effective for efficient data compression. It is. However, since RAW data imaged under high ISO sensitivity conditions contain a lot of noise components, the spatial correlation is reduced and the compression efficiency in predictive differential encoding is reduced. Therefore, it is considered that the RAW data encoding unit 105 can improve compression efficiency by performing frequency conversion by frequency conversion processing and performing encoding as data with increased correlation for each frequency band.

ここで、ＩＳＯ感度等の撮像条件に依存してＲＡＷデータに混入されるノイズ量は、有効画素領域２０１とオプティカルブラック領域２０２で同様であると考えられる。そこで、撮像装置１００は、オプティカルブラック領域２０２を解析することにより、ＲＡＷデータに含まれるノイズ量を推定し、そのノイズ量に応じて周波数変換の有効性を判定し、選択的に符号化する。これにより、撮像装置１００は、効率的に圧縮を行うことが可能となる。 Here, the amount of noise mixed into the RAW data depending on the imaging conditions such as ISO sensitivity is considered to be the same in the effective pixel area 201 and the optical black area 202. Therefore, the imaging apparatus 100 estimates the amount of noise included in the RAW data by analyzing the optical black region 202, determines the effectiveness of frequency conversion in accordance with the amount of noise, and selectively encodes it. Thereby, the imaging device 100 can perform compression efficiently.

図５は、第１の実施形態による撮像装置１００の符号化方法を示すフローチャートである。ステップＳ５０１において、ＲＡＷデータ符号化部１０５は、ＲＡＷデータのオプティカルブラック領域２０２の解析を行う。具体的には、ＲＡＷデータ符号化部１０５は、オプティカルブラック領域２０２の一部又は全体の画素値を符号化し、その符号化により発生した符号量を取得し、発生符号量に基づき、ＲＡＷデータに含まれるノイズ量を推定する。 FIG. 5 is a flowchart illustrating the encoding method of the imaging apparatus 100 according to the first embodiment. In step S501, the RAW data encoding unit 105 analyzes the optical black area 202 of the RAW data. Specifically, the RAW data encoding unit 105 encodes a part or the entire pixel value of the optical black area 202, acquires the code amount generated by the encoding, and converts the generated RAW data into the RAW data based on the generated code amount. Estimate the amount of noise contained.

次に、ステップＳ５０２において、ＲＡＷデータ符号化部１０５は、推定されたノイズ量を基に、周波数変換の有効性を判定する。ＲＡＷデータ符号化部１０５は、判定手段により、ノイズ量が閾値より少ない場合には、周波数変換が有効でないと判定し、ステップＳ５０５に処理を進める。ステップＳ５０５において、周波数変換部１０５ａは、周波数変換を行わず、予測差分生成部１０５ｂは、入力のＲＡＷデータの色要素プレーン単位で、符号化対象画素の周辺の画素から予測差分データを生成する。具体的には、予測差分生成部１０５ｂは、図４と同様に、符号化対象画素ｘの周辺画素ａ〜ｄからＭＥＤ予測により予測値ｐｄを生成し、符号化対象画素ｘの値と予測値ｐｄとの差分データを生成し、その差分データをエントロピー符号化部１０５ｃへ出力する。次に、ステップＳ５０６において、エントロピー符号化部１０５ｃは、予測差分生成部１０５ｂから入力された差分データに対して、例えばハフマン符号化又はゴロム符号化等により、エントロピー符号化を行い、符号化データを生成する。 Next, in step S502, the RAW data encoding unit 105 determines the effectiveness of frequency conversion based on the estimated noise amount. The RAW data encoding unit 105 determines that the frequency conversion is not effective when the noise amount is smaller than the threshold by the determination unit, and advances the process to step S505. In step S505, the frequency conversion unit 105a does not perform frequency conversion, and the prediction difference generation unit 105b generates prediction difference data from pixels around the encoding target pixel in units of color element planes of the input RAW data. Specifically, similarly to FIG. 4, the prediction difference generation unit 105 b generates a prediction value pd by MED prediction from the surrounding pixels a to d of the encoding target pixel x, and the value of the encoding target pixel x and the prediction value Difference data with respect to pd is generated, and the difference data is output to the entropy encoding unit 105c. Next, in step S506, the entropy encoding unit 105c performs entropy encoding on the difference data input from the prediction difference generation unit 105b by, for example, Huffman encoding or Golomb encoding, and the encoded data is obtained. Generate.

ステップＳ５０２において、ＲＡＷデータ符号化部１０５は、判定手段により、ノイズ量が閾値より多い場合には、周波数変換が有効であると判定し、ステップＳ５０３に処理を進める。ステップＳ５０３において、周波数変換部１０５ａは、入力のＲＡＷデータの色要素プレーン単位で、周波数変換処理を行う。具体的には、周波数変換部１０５ａは、図３（ａ）〜（ｃ）と同様に、ベイヤ配列のＲＡＷデータを各色要素プレーンに分離し、分離された色要素プレーン単位で、所望の分解レベルで離散ウェーブレット変換による周波数変換処理を行い、変換係数を生成する。例えば、周波数変換部１０５ａは、低周波数成分サブバンド（ＬＬサブバンド）及び高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数を生成する。次に、ステップＳ５０４において、ＲＡＷデータ符号化部１０５は、低周波数成分サブバンド（ＬＬサブバンド）の変換係数については、ステップＳ５０５に処理を進める。そして、ＲＡＷデータ符号化部１０５は、高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数については、ステップＳ５０６に処理を進める。ステップＳ５０５において、予測差分生成部１０５ｂは、低周波数成分サブバンド（ＬＬサブバンド）の変換係数に対して、符号化対象変換係数の周辺の変換係数から予測差分データを生成する。具体的には、予測差分生成部１０５ｂは、図４と同様に、ＭＥＤ予測により、符号化対象変換係数ｘの周辺変換係数ａ〜ｄから、符号化対象変換係数に対する予測値ｐｄを生成する。そして、予測差分生成部１０５ｂは、符号化対象変換係数ｘと予測値ｐｄとの差分データを生成し、その差分データをエントロピー符号化部１０５ｃへ出力する。次に、ステップＳ５０６において、エントロピー符号化部１０５ｃは、低周波数成分サブバンド（ＬＬサブバンド）の差分データ及び高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数に対して、符号化データを生成する。例えば、エントロピー符号化部１０５ｃは、ハフマン符号化又はゴロム符号化等により、エントロピー符号化を行い、符号化データを生成する。 In step S502, the RAW data encoding unit 105 determines that the frequency conversion is effective when the amount of noise is larger than the threshold by the determination unit, and advances the process to step S503. In step S503, the frequency conversion unit 105a performs frequency conversion processing in units of color element planes of input RAW data. Specifically, the frequency conversion unit 105a separates the RAW data of the Bayer array into each color element plane in the same manner as in FIGS. 3A to 3C, and a desired separation level in units of the separated color element planes. Then, frequency conversion processing by discrete wavelet transform is performed to generate transform coefficients. For example, the frequency conversion unit 105a generates conversion coefficients for a low-frequency component subband (LL subband) and a high-frequency component subband (HL subband, LH subband, and HH subband). Next, in step S504, the RAW data encoding unit 105 advances the process to step S505 for the transform coefficient of the low frequency component subband (LL subband). Then, the RAW data encoding unit 105 proceeds to step S506 for the transform coefficients of the high frequency component subbands (HL subband, LH subband, and HH subband). In step S505, the prediction difference generation unit 105b generates prediction difference data from the transform coefficients around the encoding target transform coefficient for the transform coefficient of the low frequency component subband (LL subband). Specifically, the prediction difference generation unit 105b generates a prediction value pd for the encoding target transform coefficient from the peripheral transform coefficients a to d of the encoding target transform coefficient x by MED prediction, as in FIG. Then, the prediction difference generation unit 105b generates difference data between the encoding target transform coefficient x and the prediction value pd, and outputs the difference data to the entropy encoding unit 105c. Next, in step S506, the entropy encoding unit 105c converts the difference data of the low frequency component subband (LL subband) and the transform coefficient of the high frequency component subband (HL subband, LH subband, HH subband). On the other hand, encoded data is generated. For example, the entropy encoding unit 105c performs entropy encoding by Huffman encoding or Golomb encoding, and generates encoded data.

次に、実際のＲＡＷデータの例を用いて、周波数変換の有効性の判定方法を具体的に説明する。図６は、ＩＳＯ感度とオプティカルブラック領域２０２の発生符号量の関係を示すグラフである。図６は、同じシーンにおいて、ＩＳＯ感度を１００から２０４８００まで１段ずつ変更して撮像したＲＡＷデータに対するＩＳＯ感度とオプティカルブラック領域２０２の発生符号量の関係を示す。「ＯＢ」は、ＲＡＷデータ符号化部１０５がオプティカルブラック領域２０２を符号化した場合に発生する符号量を示す。「ＨＯＢ」は、ＲＡＷデータ符号化部１０５が水平オプティカルブラック領域２０３を符号化した場合に発生する符号量を示す。「ＶＯＢ」は、ＲＡＷデータ符号化部１０５が垂直オプティカルブラック領域２０４を符号化した場合に発生する符号量を示す。横軸は、ＩＳＯ感度の段数を示しており、「１」が「ＩＳＯ感度＝１００」を示し、以降、１段進む毎にＩＳＯ感度の値は２倍となり、「１２」が「ＩＳＯ感度＝２０４８００」である。縦軸は、ＩＳＯ感度が１００の発生符号量（基準値）を１とした場合の発生符号量の比率を示している。 Next, a method for determining the effectiveness of frequency conversion will be specifically described using an example of actual RAW data. FIG. 6 is a graph showing the relationship between the ISO sensitivity and the generated code amount of the optical black area 202. FIG. 6 shows the relationship between the ISO sensitivity and the generated code amount of the optical black area 202 for RAW data picked up by changing the ISO sensitivity from 100 to 204800 one step at a time in the same scene. “OB” indicates the amount of code generated when the RAW data encoding unit 105 encodes the optical black area 202. “HOB” indicates a code amount generated when the RAW data encoding unit 105 encodes the horizontal optical black area 203. “VOB” indicates a code amount generated when the RAW data encoding unit 105 encodes the vertical optical black area 204. The abscissa indicates the number of steps of ISO sensitivity. “1” indicates “ISO sensitivity = 100”. Thereafter, the ISO sensitivity value doubles every time one step proceeds, and “12” indicates “ISO sensitivity = 100”. 204800 ". The vertical axis represents the ratio of the generated code amount when the generated code amount (reference value) with an ISO sensitivity of 100 is 1.

このＲＡＷデータの例では、ＩＳＯ感度＝１００からＩＳＯ感度＝４００までは、ほぼ同等の発生符号量である。ＩＳＯ感度８００以降は、一次関数的に発生符号量が増加している。これは、混入したノイズ量に対応して、発生符号量が増加していることを示している。そこで、図５に示したフローチャートに当てはめた場合、例えばＩＳＯ感度＝１００のオプティカルブラック領域２０２の発生符号量を基準として、比率が２倍以上の場合は、混入されたノイズ量が極めて多いことから、空間的な相関性も極めて低いと考えられる。そのため、ＲＡＷデータ符号化部１０５は、発生符号量の比率が２倍以上の場合には、周波数変換が有効であると判定することができる。例えば、ＲＡＷデータ符号化部１０５は、図６に示したＶＯＢの発生符号量の比率を基に、ＩＳＯ感度が２５６００以上の場合に、周波数変換が有効であると判定することが可能となる。 In this example of RAW data, the ISO sensitivity = 100 to ISO sensitivity = 400 are almost the same amount of generated codes. After ISO sensitivity 800, the amount of generated code increases in a linear function. This indicates that the amount of generated code increases corresponding to the amount of mixed noise. Therefore, when applied to the flowchart shown in FIG. 5, for example, when the ratio is 2 times or more on the basis of the generated code amount of the optical black area 202 with ISO sensitivity = 100, the amount of mixed noise is extremely large. The spatial correlation is also considered to be extremely low. Therefore, the RAW data encoding unit 105 can determine that the frequency conversion is effective when the ratio of the generated code amount is twice or more. For example, the RAW data encoding unit 105 can determine that the frequency conversion is effective when the ISO sensitivity is 25600 or more based on the ratio of the generated code amount of VOB shown in FIG.

以上のように、ＲＡＷデータ符号化部１０５は、オプティカルブラック領域２０２の全体又は一部の発生符号量が閾値より多い場合には、周波数変換が有効であると判定する。また、ＲＡＷデータ符号化部１０５は、オプティカルブラック領域２０２の全体又は一部の発生符号量が閾値より少ない場合には、周波数変換が有効でないと判定する。 As described above, the RAW data encoding unit 105 determines that the frequency conversion is effective when the generated code amount of all or part of the optical black region 202 is larger than the threshold. The RAW data encoding unit 105 determines that the frequency conversion is not effective when the generated code amount of the entire or part of the optical black region 202 is smaller than the threshold value.

また、ＲＡＷデータ符号化部１０５は、オプティカルブラック領域２０２の全体又は一部の発生符号量の基準値に対する比率が閾値より多い場合には、周波数変換が有効であると判定する。また、ＲＡＷデータ符号化部１０５は、オプティカルブラック領域２０２の全体又は一部の発生符号量の基準値に対する比率が閾値より少ない場合には、周波数変換が有効でないと判定する。 The RAW data encoding unit 105 determines that the frequency conversion is effective when the ratio of the generated code amount in the entire optical black region 202 or a part of the generated code amount to the reference value is larger than the threshold value. Also, the RAW data encoding unit 105 determines that the frequency conversion is not effective when the ratio of the generated code amount of the entire optical black region 202 or a part of the generated code amount to the reference value is smaller than the threshold value.

ここでは、説明の単純化のために、周波数変換の有効性を判定する方法を示したが、周波数変換が有効である場合においては、発生符号量に応じて分解レベルを変更するようにしてもよい。 Here, for simplification of explanation, a method for determining the effectiveness of frequency conversion is shown. However, when the frequency conversion is effective, the decomposition level may be changed according to the generated code amount. Good.

図１（ｂ）は、撮像装置１００の他の構成例を示すブロック図である。図１（ｂ）の撮像装置１００は、図１（ａ）の撮像装置１００に対して、オプティカルブラック解析部１０８を追加したものである。オプティカルブラック解析部１０８は、ＲＡＷデータ符号化部１０５の外部に設けられる。図１（ａ）の撮像装置１００では、オプティカルブラック領域２０２の解析として、ＲＡＷデータ符号化部１０５がオプティカルブラック領域２０２の発生符号量を算出する例を示した。これに対し、図１（ｂ）の撮像装置１００では、オプティカルブラック解析部１０８を専用に設け、オプティカルブラック解析部１０８がオプティカルブラック領域２０２の解析を行う。具体的には、オプティカルブラック解析部１０８は、オプティカルブラック領域２０２の画素値に対して、式（３）に示す分散σ²、又は式（４）に示す標準偏差σを算出することにより、オプティカルブラック領域２０２の画素値のバラツキを解析する。ここで、σ²は分散、σは標準偏差、ｎは測定対象画素数、ｘは画素値、μは測定対象画素の平均値である。オプティカルブラック解析部１０８は、分散σ²又は標準偏差σを基に、周波数変換の有効性を判定する。 FIG. 1B is a block diagram illustrating another configuration example of the imaging apparatus 100. The imaging apparatus 100 in FIG. 1B is obtained by adding an optical black analysis unit 108 to the imaging apparatus 100 in FIG. The optical black analysis unit 108 is provided outside the RAW data encoding unit 105. In the imaging apparatus 100 in FIG. 1A, as an analysis of the optical black region 202, an example in which the RAW data encoding unit 105 calculates the generated code amount of the optical black region 202 is shown. On the other hand, in the imaging apparatus 100 of FIG. 1B, the optical black analysis unit 108 is provided exclusively, and the optical black analysis unit 108 analyzes the optical black region 202. Specifically, the optical black analysis unit 108 calculates the variance σ ² shown in Expression (3) or the standard deviation σ shown in Expression (4) with respect to the pixel value of the optical black region 202, thereby obtaining the optical black. Variations in pixel values in the black area 202 are analyzed. Here, σ ² is the variance, σ is the standard deviation, n is the number of pixels to be measured, x is the pixel value, and μ is the average value of the pixels to be measured. The optical black analysis unit 108 determines the effectiveness of frequency conversion based on the variance σ ² or the standard deviation σ.

図７（ａ）は、ＩＳＯ感度とオプティカルブラック領域２０２の標準偏差の関係を示すグラフである。図７（ａ）は、図６の例と同じＲＡＷデータを用いて、同様に、ＩＳＯ感度を１００〜２０４８００まで１段ずつ変更して撮像したＲＡＷデータに対するＩＳＯ感度とオプティカルブラック領域２０２の画素値の標準偏差の関係を示す。「ＯＢ」は、オプティカルブラック領域２０２の画素値の標準偏差を示す。「ＨＯＢ」は、水平オプティカルブラック領域２０３の画素値の標準偏差を示す。「ＶＯＢ」は、垂直オプティカルブラック領域２０４の画素値の標準偏差を示す。横軸は、ＩＳＯ感度の段数を示しており、「１」が「ＩＳＯ感度＝１００」を示し、以降、１段進む毎にＩＳＯ感度の値は２倍となり「１２」が「ＩＳＯ感度＝２０４８００」である。縦軸は、ＩＳＯ感度が１００の標準偏差（基準値）を１とした場合の標準偏差の比率を示している。 FIG. 7A is a graph showing the relationship between the ISO sensitivity and the standard deviation of the optical black area 202. FIG. 7A shows the ISO sensitivity and the pixel value of the optical black area 202 with respect to the RAW data obtained by using the same raw data as in the example of FIG. The standard deviation relationship is shown. “OB” indicates the standard deviation of the pixel value of the optical black area 202. “HOB” indicates the standard deviation of the pixel value of the horizontal optical black area 203. “VOB” indicates the standard deviation of the pixel value of the vertical optical black area 204. The horizontal axis indicates the number of steps of ISO sensitivity. “1” indicates “ISO sensitivity = 100”, and the value of ISO sensitivity is doubled every time one step is advanced, and “12” is “ISO sensitivity = 204800”. It is. The vertical axis indicates the ratio of the standard deviation when the standard deviation (reference value) with an ISO sensitivity of 100 is 1.

このＲＡＷデータの例では、図７（ａ）に示すように、ＩＳＯ感度＝１００からＩＳＯ感度＝４００までは、ほぼ同等の標準偏差を示している。ＩＳＯ感度８００以降は、二次関数的に標準偏差が増加しており、混入したノイズ量に対応し、標準偏差が増大していることを示している。 In the example of the RAW data, as shown in FIG. 7A, from ISO sensitivity = 100 to ISO sensitivity = 400 shows almost the same standard deviation. After ISO sensitivity 800, the standard deviation increases as a quadratic function, indicating that the standard deviation increases corresponding to the amount of mixed noise.

図７（ｂ）は、図７（ａ）の縦軸の標準偏差を対数目盛で置き換えたグラフであり、ＩＳＯ感度とオプティカルブラック領域２０２の標準偏差の関係を示す。図７（ｂ）の標準偏差の比率を示すグラフは、図６の発生符号量の比率を示すグラフと同様に、一次関数的に表現することが可能となる。オプティカルブラック解析部１０８は、図６の発生符号量を用いた場合と同様の方法で、標準偏差を用いて周波数変換の有効性を判定することができる。 FIG. 7B is a graph in which the standard deviation on the vertical axis in FIG. 7A is replaced with a logarithmic scale, and shows the relationship between the ISO sensitivity and the standard deviation of the optical black region 202. The graph showing the ratio of the standard deviation in FIG. 7B can be expressed in a linear function like the graph showing the ratio of the generated code amount in FIG. The optical black analysis unit 108 can determine the effectiveness of frequency conversion using the standard deviation in the same way as when the generated code amount in FIG. 6 is used.

すなわち、オプティカルブラック解析部１０８は、オプティカルブラック領域２０２の全体又は一部の画素値の標準偏差又は分散が閾値より大きい場合には、周波数変換が有効であると判定する。また、オプティカルブラック解析部１０８は、オプティカルブラック領域２０２の全体又は一部の画素値の標準偏差又は分散が閾値より小さい場合には、周波数変換が有効でないと判定する。 In other words, the optical black analysis unit 108 determines that the frequency conversion is effective when the standard deviation or variance of all or some of the pixel values in the optical black region 202 is greater than the threshold value. Further, the optical black analysis unit 108 determines that the frequency conversion is not effective when the standard deviation or variance of all or some of the pixel values of the optical black region 202 is smaller than the threshold value.

また、オプティカルブラック解析部１０８は、オプティカルブラック領域２０２の全体又は一部の画素値の標準偏差又は分散の基準値に対する比率が閾値より大きい場合には、周波数変換が有効であると判定する。また、オプティカルブラック解析部１０８は、オプティカルブラック領域２０２の全体又は一部の画素値の標準偏差又は分散の基準値に対する比率が閾値より小さい場合には、周波数変換が有効でないと判定する。 Further, the optical black analysis unit 108 determines that the frequency conversion is effective when the ratio of the whole or a part of the pixel values of the optical black region 202 to the standard deviation or variance reference value is larger than the threshold value. Further, the optical black analysis unit 108 determines that the frequency conversion is not effective when the ratio of the whole or a part of the pixel values of the optical black region 202 to the standard deviation or dispersion reference value is smaller than the threshold value.

なお、基準となるＩＳＯ感度が１００の場合のオプティカルブラック領域２０２の解析結果は、事前に記憶させておいてもよいし、例えば電源投入時に毎回、解析して解析結果を取得するようにしてもよい。また、図６及び図７（ｂ）のグラフからも分かるように、オプティカルブラック領域２０２の発生符号量又は標準偏差の比率は、同様の傾向を示している。そのため、解析対象とするオプティカルブラック領域は、オプティカルブラック領域２０２、水平オプティカルブラック領域２０３、垂直オプティカルブラック領域２０４、又は同等の結果が得られるより小さい一部の領域であってもよい。 Note that the analysis result of the optical black region 202 when the standard ISO sensitivity is 100 may be stored in advance, or may be analyzed and acquired, for example, every time the power is turned on. Good. Further, as can be seen from the graphs of FIG. 6 and FIG. 7B, the generated code amount or the standard deviation ratio of the optical black area 202 shows the same tendency. Therefore, the optical black area to be analyzed may be the optical black area 202, the horizontal optical black area 203, the vertical optical black area 204, or a part of a smaller area where an equivalent result is obtained.

以上のように、撮像装置１００は、ＲＡＷデータに含まれるオプティカルブラック領域２０２の発生符号量、標準偏差又は分散を算出することにより、ＲＡＷデータに混入されたノイズ量を正確に推定することができる。撮像装置１００は、そのノイズ量に基づき、周波数変換の有効性を判定し、効率的に符号化して記録することができる。 As described above, the imaging apparatus 100 can accurately estimate the amount of noise mixed in the RAW data by calculating the generated code amount, standard deviation, or variance of the optical black area 202 included in the RAW data. . The imaging apparatus 100 can determine the effectiveness of frequency conversion based on the amount of noise, and can efficiently encode and record it.

（第２の実施形態）
図８は本発明の第２の実施形態による撮像装置１００の符号化方法を示すフローチャートであり、図９はＲＡＷデータの分割例を示す図である。本実施形態による撮像装置１００は、第１の実施形態で示した図１の撮像装置１００と同様の構成を有する。本実施形態の撮像装置１００は、第１の実施形態の撮像装置１００に対して、ＲＡＷデータの符号化方法が異なるため、共通となる説明は省略し、ＲＡＷデータの符号化方法について図８のフローチャートに基づき説明を行う。ＲＡＷデータは、図９に示すように、有効画素領域２０１と、水平オプティカルブラック領域２０３と、垂直オプティカルブラック領域２０４に分割されて符号化される。なお、分割の仕方や分割数については、これに限定されるものではなく、例えば垂直オプティカルブラック領域２０４と、有効画素領域２０１を含むそれ以外の領域との２つに分割してもよい。 (Second Embodiment)
FIG. 8 is a flowchart illustrating an encoding method of the imaging apparatus 100 according to the second embodiment of the present invention, and FIG. 9 is a diagram illustrating an example of dividing RAW data. The imaging device 100 according to the present embodiment has the same configuration as the imaging device 100 of FIG. 1 shown in the first embodiment. The imaging apparatus 100 according to the present embodiment is different from the imaging apparatus 100 according to the first embodiment in the RAW data encoding method. Therefore, a common description is omitted, and the RAW data encoding method is illustrated in FIG. An explanation will be given based on the flowchart. As shown in FIG. 9, the RAW data is divided into an effective pixel area 201, a horizontal optical black area 203, and a vertical optical black area 204 and encoded. Note that the way of division and the number of divisions are not limited to this. For example, the division may be divided into two areas, that is, a vertical optical black area 204 and other areas including the effective pixel area 201.

ステップＳ８０１において、ＲＡＷデータ符号化部１０５は、ＲＡＷデータの水平オプティカルブラック領域２０３及び垂直オプティカルブラック領域２０４の画素値を符号化し、符号を発生させる。次に、ステップＳ８０２において、ＲＡＷデータ符号化部１０５は、その符号化により発生した符号量を取得する。次に、ステップＳ８０３において、ＲＡＷデータ符号化部１０５は、その発生した符号量に基づきノイズ量を推定し、ノイズ量を基に、有効画素領域２０１の符号化に対する周波数変換の有効性を判定する。例えば、ＲＡＷデータ符号化部１０５は、その発生した符号量が閾値より多い場合には、ノイズ量が閾値より多いと判定し、有効画素領域２０１の符号化に対する周波数変換が有効であると判定する。また、ＲＡＷデータ符号化部１０５は、その発生した符号量が閾値より少ない場合には、ノイズ量が閾値より少ないと判定し、有効画素領域２０１の符号化に対する周波数変換が有効でないと判定する。 In step S801, the RAW data encoding unit 105 encodes the pixel values of the horizontal optical black region 203 and the vertical optical black region 204 of the RAW data, and generates a code. Next, in step S802, the RAW data encoding unit 105 acquires a code amount generated by the encoding. Next, in step S803, the RAW data encoding unit 105 estimates a noise amount based on the generated code amount, and determines the effectiveness of frequency conversion for encoding the effective pixel region 201 based on the noise amount. . For example, when the generated code amount is larger than the threshold value, the RAW data encoding unit 105 determines that the noise amount is larger than the threshold value, and determines that the frequency conversion for the encoding of the effective pixel region 201 is effective. . Further, when the generated code amount is less than the threshold value, the RAW data encoding unit 105 determines that the noise amount is less than the threshold value, and determines that the frequency conversion for encoding the effective pixel region 201 is not effective.

ステップＳ８０３において、ＲＡＷデータ符号化部１０５は、周波数変換が有効でないと判定した場合には、ステップＳ８０６に処理を進める。ステップＳ８０６において、周波数変換部１０５ａは、周波数変換を行わず、予測差分生成部１０５ｂは、入力のＲＡＷデータの有効画素領域２０１の色要素プレーン単位で、符号化対象画素の周辺の画素から予測差分データを生成する。具体的には、予測差分生成部１０５ｂは、図４と同様に、符号化対象画素ｘの周辺画素ａ〜ｄからＭＥＤ予測により予測値ｐｄを生成し、符号化対象画素ｘの値と予測値ｐｄとの差分データを生成し、その差分データをエントロピー符号化部１０５ｃへ出力する。次に、ステップＳ８０７において、エントロピー符号化部１０５ｃは、予測差分生成部１０５ｂから入力された差分データに対して、例えばハフマン符号化又はゴロム符号化等により、エントロピー符号化を行い、符号化データを生成する。 In step S803, if the RAW data encoding unit 105 determines that the frequency conversion is not effective, the process proceeds to step S806. In step S806, the frequency conversion unit 105a does not perform frequency conversion, and the prediction difference generation unit 105b performs prediction differences from pixels around the encoding target pixel in units of color element planes in the effective pixel region 201 of the input RAW data. Generate data. Specifically, similarly to FIG. 4, the prediction difference generation unit 105 b generates a prediction value pd by MED prediction from the surrounding pixels a to d of the encoding target pixel x, and the value of the encoding target pixel x and the prediction value Difference data with respect to pd is generated, and the difference data is output to the entropy encoding unit 105c. Next, in step S807, the entropy encoding unit 105c performs entropy encoding on the difference data input from the prediction difference generation unit 105b by, for example, Huffman encoding or Golomb encoding, and the encoded data is obtained. Generate.

また、ステップＳ８０３において、ＲＡＷデータ符号化部１０５は、周波数変換が有効であると判定した場合には、ステップＳ８０４に処理を進める。ステップＳ８０４において、周波数変換部１０５ａは、入力のＲＡＷデータの有効画素領域２０１の色要素プレーン単位で、周波数変換処理を行う。周波数変換部１０５ａは、図３（ａ）〜（ｃ）と同様に、ＲＡＷデータの有効画素領域２０１を各色要素プレーンに分離し、分離された色要素プレーン単位で、所望の分解レベルで離散ウェーブレット変換による周波数変換処理を行い、変換係数を生成する。例えば、周波数変換部１０５ａは、低周波数成分サブバンド（ＬＬサブバンド）及び高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数を生成する。次に、ステップＳ８０５において、ＲＡＷデータ符号化部１０５は、低周波数成分サブバンド（ＬＬサブバンド）の変換係数については、ステップＳ８０６に処理を進める。そして、ＲＡＷデータ符号化部１０５は、高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数については、ステップＳ８０７に処理を進める。ステップＳ８０６において、予測差分生成部１０５ｂは、低周波数成分サブバンド（ＬＬサブバンド）の変換係数に対して、符号化対象変換係数の周辺の変換係数から予測差分データを生成する。予測差分生成部１０５ｂは、図４と同様に、符号化対象変換係数ｘの周辺変換係数ａ〜ｄからＭＥＤ予測により予測値ｐｄを生成し、符号化対象変換係数ｘと予測値ｐｄとの差分データを生成し、その差分データをエントロピー符号化部１０５ｃへ出力する。次に、ステップＳ８０７において、エントロピー符号化部１０５ｃは、低周波数成分サブバンド（ＬＬサブバンド）の差分データ及び高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数に対して、符号化データを生成する。例えば、エントロピー符号化部１０５ｃは、ハフマン符号化又はゴロム符号化等により、エントロピー符号化を行い、符号化データを生成する。 In step S803, if the RAW data encoding unit 105 determines that the frequency conversion is valid, the process proceeds to step S804. In step S804, the frequency conversion unit 105a performs frequency conversion processing in units of color element planes in the effective pixel area 201 of the input RAW data. Similarly to FIGS. 3A to 3C, the frequency conversion unit 105a separates the effective pixel area 201 of the RAW data into each color element plane, and performs discrete wavelet at a desired separation level in units of the separated color element planes. A frequency conversion process is performed by conversion to generate a conversion coefficient. For example, the frequency conversion unit 105a generates conversion coefficients for a low-frequency component subband (LL subband) and a high-frequency component subband (HL subband, LH subband, and HH subband). Next, in step S805, the RAW data encoding unit 105 proceeds to step S806 for the transform coefficient of the low frequency component subband (LL subband). Then, the RAW data encoding unit 105 proceeds to step S807 for the transform coefficients of the high frequency component subbands (HL subband, LH subband, and HH subband). In step S806, the prediction difference generation unit 105b generates prediction difference data from the transform coefficients around the encoding target transform coefficient for the transform coefficient of the low frequency component subband (LL subband). As in FIG. 4, the prediction difference generation unit 105b generates a prediction value pd by the MED prediction from the peripheral conversion coefficients a to d of the encoding target conversion coefficient x, and the difference between the encoding target conversion coefficient x and the prediction value pd. Data is generated and the difference data is output to the entropy encoding unit 105c. Next, in step S807, the entropy encoding unit 105c converts the difference data of the low frequency component subband (LL subband) and the transform coefficient of the high frequency component subband (HL subband, LH subband, HH subband). On the other hand, encoded data is generated. For example, the entropy encoding unit 105c performs entropy encoding by Huffman encoding or Golomb encoding, and generates encoded data.

以上のように、撮像装置１００は、ＲＡＷデータをオプティカルブラック領域２０２と有効画素領域２０１に分割し、ステップＳ８０１でオプティカルブラック領域２０２の符号化を行うことで、その発生符号量を基にノイズ量を正確に推定することができる。撮像装置１００は、そのノイズ量に基づき、有効画素領域２０１に対する周波数変換の有効性を判定し、効率的に符号化して記録することができる。 As described above, the imaging apparatus 100 divides the RAW data into the optical black area 202 and the effective pixel area 201, and performs coding of the optical black area 202 in step S801, so that the noise amount is based on the generated code quantity. Can be estimated accurately. The imaging apparatus 100 can determine the effectiveness of frequency conversion for the effective pixel region 201 based on the amount of noise, and can efficiently encode and record it.

（第３の実施形態）
図１０は、本発明の第３の実施形態による撮像装置１００の符号化方法を示すフローチャートである。本実施形態による撮像装置１００は、第１の実施形態で示した図１の撮像装置１００と同様の構成を有する。本実施形態の撮像装置１００は、第１の実施形態の撮像装置１００に対して、ＲＡＷデータの符号化方法が異なるため、共通となる説明は省略し、ＲＡＷデータの符号化方法について図１０のフローチャートに基づき説明を行う。 (Third embodiment)
FIG. 10 is a flowchart showing an encoding method of the imaging apparatus 100 according to the third embodiment of the present invention. The imaging device 100 according to the present embodiment has the same configuration as the imaging device 100 of FIG. 1 shown in the first embodiment. The imaging apparatus 100 according to the present embodiment is different from the imaging apparatus 100 according to the first embodiment in the RAW data encoding method. Therefore, a common description is omitted, and the RAW data encoding method is illustrated in FIG. An explanation will be given based on the flowchart.

ステップＳ１００１において、ＲＡＷデータ符号化部１０５は、撮像情報を取得する。具体的には、ＲＡＷデータ符号化部１０５は、主にＩＳＯ感度（撮像感度）や露光時間に起因するノイズ量を推定するために、撮像情報として、撮像時のＩＳＯ感度情報又は露光時間情報を取得する。次に、ステップＳ１００２において、ＲＡＷデータ符号化部１０５は、撮像情報を基に、ＲＡＷデータに含まれるノイズ量を推定し、ノイズ量に基づき周波数変換の有効性を判定する。この判定条件は、撮像部１０２の特性によって決定される条件であり、予め実験により統計的に得られた値を基に判定する。例えば、ＲＡＷデータ符号化部１０５は、ＩＳＯ感度又は露光時間が閾値より大きい場合には、ノイズ量が閾値より多いと判定し、周波数変換が有効であると判定する。また、ＲＡＷデータ符号化部１０５は、ＩＳＯ感度又は露光時間が閾値より小さい場合には、ノイズ量が閾値より小さいと判定し、周波数変換が有効でないと判定する。 In step S1001, the RAW data encoding unit 105 acquires imaging information. Specifically, the RAW data encoding unit 105 mainly uses ISO sensitivity information or exposure time information at the time of imaging as imaging information in order to estimate noise amount due to ISO sensitivity (imaging sensitivity) and exposure time. get. Next, in step S1002, the RAW data encoding unit 105 estimates the amount of noise included in the RAW data based on the imaging information, and determines the effectiveness of frequency conversion based on the amount of noise. This determination condition is a condition determined by the characteristics of the imaging unit 102, and is determined based on a value statistically obtained in advance through experiments. For example, when the ISO sensitivity or the exposure time is larger than the threshold, the RAW data encoding unit 105 determines that the amount of noise is larger than the threshold, and determines that the frequency conversion is effective. Further, when the ISO sensitivity or the exposure time is smaller than the threshold, the RAW data encoding unit 105 determines that the noise amount is smaller than the threshold, and determines that the frequency conversion is not effective.

ステップＳ１００２において、ＲＡＷデータ符号化部１０５は、周波数変換が有効でないと判定した場合には、ステップＳ１００５に処理を進める。ステップＳ１００５において、周波数変換部１０５ａは、周波数変換を行わず、予測差分生成部１０５ｂは、入力のＲＡＷデータの色要素プレーン単位で、符号化対象画素の周辺の画素から予測差分データを生成する。具体的には、予測差分生成部１０５ｂは、図４と同様に、符号化対象画素ｘの周辺画素ａ〜ｄからＭＥＤ予測により予測値ｐｄを生成し、符号化対象画素ｘの値と予測値ｐｄとの差分データを生成し、その差分データをエントロピー符号化部１０５ｃへ出力する。次に、ステップＳ１００６において、エントロピー符号化部１０５ｃは、予測差分生成部１０５ｂから入力された差分データに対して、例えばハフマン符号化又はゴロム符号化等により、エントロピー符号化を行い、符号化データを生成する。 If the RAW data encoding unit 105 determines in step S1002 that the frequency conversion is not valid, the process proceeds to step S1005. In step S1005, the frequency conversion unit 105a does not perform frequency conversion, and the prediction difference generation unit 105b generates prediction difference data from pixels around the encoding target pixel in units of color element planes of the input RAW data. Specifically, similarly to FIG. 4, the prediction difference generation unit 105 b generates a prediction value pd by MED prediction from the surrounding pixels a to d of the encoding target pixel x, and the value of the encoding target pixel x and the prediction value Difference data with respect to pd is generated, and the difference data is output to the entropy encoding unit 105c. Next, in step S1006, the entropy encoding unit 105c performs entropy encoding on the difference data input from the prediction difference generation unit 105b by, for example, Huffman encoding or Golomb encoding, and the encoded data is obtained. Generate.

また、ステップＳ１００２において、ＲＡＷデータ符号化部１０５は、周波数変換が有効であると判定した場合には、ステップＳ１００３に処理を進める。ステップＳ１００３において、周波数変換部１０５ａは、入力のＲＡＷデータの色要素プレーン単位で、周波数変換処理を行う。具体的には、周波数変換部１０５ａは、図３（ａ）〜（ｃ）と同様に、ベイヤ配列のＲＡＷデータを各色要素プレーンに分離し、分離された色要素プレーン単位で、所望の分解レベルで離散ウェーブレット変換による周波数変換処理を行い、変換係数を生成する。例えば、周波数変換部１０５ａは、低周波数成分サブバンド（ＬＬサブバンド）及び高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数を生成する。次に、ステップＳ１００４において、ＲＡＷデータ符号化部１０５は、低周波数成分サブバンド（ＬＬサブバンド）の変換係数については、ステップＳ１００５に処理を進める。そして、ＲＡＷデータ符号化部１０５は、高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数については、ステップＳ１００６に処理を進める。ステップＳ１００５において、予測差分生成部１０５ｂは、低周波数成分サブバンド（ＬＬサブバンド）の変換係数に対して、符号化対象変換係数の周辺の変換係数から予測差分データを生成する。予測差分生成部１０５ｂは、図４と同様に、符号化対象変換係数ｘの周辺変換係数ａ〜ｄからＭＥＤ予測により予測値ｐｄを生成し、符号化対象変換係数ｘと予測値ｐｄとの差分データを生成し、その差分データをエントロピー符号化部１０５ｃへ出力する。次に、ステップＳ１００６において、エントロピー符号化部１０５ｃは、低周波数成分サブバンド（ＬＬサブバンド）の差分データ及び高周波数成分サブバンド（ＨＬサブバンド、ＬＨサブバンド、ＨＨサブバンド）の変換係数に対して、符号化データを生成する。例えば、エントロピー符号化部１０５ｃは、ハフマン符号化又はゴロム符号化等により、エントロピー符号化を行い、符号化データを生成する。 In step S1002, when the RAW data encoding unit 105 determines that the frequency conversion is effective, the process proceeds to step S1003. In step S1003, the frequency conversion unit 105a performs frequency conversion processing in units of color element planes of input RAW data. Specifically, the frequency conversion unit 105a separates the RAW data of the Bayer array into each color element plane in the same manner as in FIGS. 3A to 3C, and a desired separation level in units of the separated color element planes. Then, frequency conversion processing by discrete wavelet transform is performed to generate transform coefficients. For example, the frequency conversion unit 105a generates conversion coefficients for a low-frequency component subband (LL subband) and a high-frequency component subband (HL subband, LH subband, and HH subband). Next, in step S1004, the RAW data encoding unit 105 advances the process to step S1005 for the transform coefficient of the low frequency component subband (LL subband). Then, the RAW data encoding unit 105 proceeds to step S1006 for the transform coefficients of the high frequency component subbands (HL subband, LH subband, and HH subband). In step S1005, the prediction difference generation unit 105b generates prediction difference data from the transform coefficients around the encoding target transform coefficient for the transform coefficient of the low frequency component subband (LL subband). As in FIG. 4, the prediction difference generation unit 105b generates a prediction value pd by the MED prediction from the peripheral conversion coefficients a to d of the encoding target conversion coefficient x, and the difference between the encoding target conversion coefficient x and the prediction value pd. Data is generated and the difference data is output to the entropy encoding unit 105c. Next, in step S1006, the entropy encoding unit 105c converts the difference data of the low frequency component subband (LL subband) and the transform coefficient of the high frequency component subband (HL subband, LH subband, HH subband). On the other hand, encoded data is generated. For example, the entropy encoding unit 105c performs entropy encoding by Huffman encoding or Golomb encoding, and generates encoded data.

以上のように、撮像装置１００は、ＲＡＷデータに対するＩＳＯ感度情報や露光時間情報等の撮像情報に基づき、ＲＡＷデータに混入されたノイズ量を推定する。撮像装置１００は、そのノイズ量に基づき、周波数変換の有効性を判定し、効率的に符号化して記録することができる。 As described above, the imaging apparatus 100 estimates the amount of noise mixed in the RAW data based on imaging information such as ISO sensitivity information and exposure time information for the RAW data. The imaging apparatus 100 can determine the effectiveness of frequency conversion based on the amount of noise, and can efficiently encode and record it.

（その他の実施形態）
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサがプログラムを読み出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 (Other embodiments)
The present invention supplies a program that realizes one or more functions of the above-described embodiments to a system or apparatus via a network or a storage medium, and one or more processors in the computer of the system or apparatus read and execute the program This process can be realized. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

なお、上記実施形態は、何れも本発明を実施するにあたっての具体化の例を示したものに過ぎず、これらによって本発明の技術的範囲が限定的に解釈されてはならないものである。すなわち、本発明はその技術思想、又はその主要な特徴から逸脱することなく、様々な形で実施することができる。 The above-described embodiments are merely examples of implementation in carrying out the present invention, and the technical scope of the present invention should not be construed in a limited manner. That is, the present invention can be implemented in various forms without departing from the technical idea or the main features thereof.

１０５：ＲＡＷデータ符号化部、１０５ａ：周波数変換部、１０５ｂ：予測差分生成部、１０５ｃ：エントロピー符号化部 105: RAW data encoding unit, 105a: frequency converting unit, 105b: prediction difference generating unit, 105c: entropy encoding unit

Claims

A determination unit that estimates a noise amount included in the image data and determines whether the noise amount is greater than a threshold;
When the amount of noise is greater than a threshold value, a frequency conversion unit that generates a conversion coefficient by frequency converting the image data;
Encoding means for encoding the transform coefficient when the noise amount is greater than a threshold value, and encoding the image data when the noise amount is less than the threshold value. An encoding device characterized by the above.

Further, when the amount of noise is larger than a threshold value, a prediction value for the encoding target transform coefficient among the generated transform coefficients is generated, and difference data between the encoding target transform coefficient and the prediction value is generated. If the amount of noise is greater than a threshold value, a predicted value for the encoding target pixel value in the image data is generated, and difference data between the encoding target pixel value and the predicted value is generated. A prediction difference generation means;
The encoding apparatus according to claim 1, wherein the encoding unit encodes the difference data.

The frequency conversion means generates a low-frequency component conversion coefficient and a high-frequency component conversion coefficient,
The prediction difference generation means generates the difference data for the conversion coefficient of the low frequency component when the amount of noise is greater than a threshold value,
The encoding unit performs encoding on the difference data of the low-frequency component conversion coefficient and the high-frequency component conversion coefficient when the amount of noise is greater than a threshold value. 2. The encoding device according to 2.

The encoding apparatus according to claim 1, wherein the frequency conversion unit performs wavelet transform.

The said determination means estimates the noise amount contained in the said image data based on the pixel value of all or one part of the optical black area | region contained in the said image data, The one of Claims 1-4 characterized by the above-mentioned. The encoding device according to item 1.

The determination unit encodes all or a part of pixel values of an optical black area included in the image data, and determines whether the noise amount is larger than a threshold based on a code amount generated by the encoding. The encoding apparatus according to claim 1, wherein:

The determination unit encodes all or a part of pixel values of an optical black area included in the image data, and when the amount of code generated by the encoding is larger than a threshold, the amount of noise is larger than the threshold. The encoding apparatus according to claim 1, wherein if the amount of code generated by the encoding is less than a threshold, it is determined that the amount of noise is less than the threshold. .

The determination unit encodes all or a part of pixel values of the optical black area included in the image data, and when the ratio of the code amount generated by the encoding to a reference value is larger than a threshold value, the noise amount The noise amount is determined to be less than the threshold value when the ratio of the code amount generated by the encoding to the reference value is less than the threshold value. The encoding device according to any one of claims.

The image data has an optical black area and an effective pixel area,
The determination means performs encoding on the optical black region,
The encoding apparatus according to claim 6, wherein the encoding unit performs encoding on the effective pixel area.

The determination unit calculates a standard deviation or variance of all or some of the pixel values of the optical black area included in the image data, and based on the standard deviation or variance, whether or not the noise amount is greater than a threshold value. The encoding apparatus according to any one of claims 1 to 5, wherein:

The determination unit calculates a standard deviation or variance of all or some of the pixel values of the optical black area included in the image data, and when the standard deviation or variance is greater than a threshold, the amount of noise is greater than the threshold. 6. The encoding apparatus according to claim 1, wherein it is determined that the amount of noise is less than a threshold value when the standard deviation or variance is less than a threshold value.

The determination means calculates the standard deviation or variance of all or part of the pixel values of the optical black area included in the image data, and when the ratio of the standard deviation or variance to the reference value is greater than a threshold value, The noise amount is determined to be greater than a threshold value, and when the ratio of the standard deviation or variance to a reference value is less than the threshold value, the noise amount is determined to be less than the threshold value. The encoding device according to claim 1.

The encoding apparatus according to claim 1, wherein the determination unit estimates a noise amount included in the image data based on imaging information of the image data.

The encoding apparatus according to claim 1, wherein the determination unit estimates an amount of noise included in the image data based on an imaging sensitivity or an exposure time of the image data. .

The determination means determines that the amount of noise is greater than a threshold when the imaging sensitivity or exposure time of the image data is greater than a threshold, and if the imaging sensitivity or exposure time of the image data is less than a threshold, The encoding apparatus according to claim 1, wherein the amount of noise is determined to be less than a threshold value.

A determination step of determining a noise amount included in the image data by a determination unit and determining whether the noise amount is greater than a threshold;
When the amount of noise is greater than a threshold by a frequency conversion means, a frequency conversion step of generating a conversion coefficient by frequency converting the image data;
Encoding means for encoding the transform coefficient when the amount of noise is greater than a threshold, and encoding the image data when the amount of noise is less than the threshold. And an encoding method.

The program for functioning a computer as each means of the encoding apparatus described in any one of Claims 1-15.