JP4730105B2 - Image processing apparatus and method, learning apparatus and method, and program

Info

Publication number: JP4730105B2
Application number: JP2006012386A
Authority: JP
Inventors: 幸司矢野; 健治高橋; 勉市川; 貴志沢尾; 克尚神明; 哲二郎近藤
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2006-01-20
Filing date: 2006-01-20
Publication date: 2011-07-20
Anticipated expiration: 2026-01-20
Also published as: JP2007195023A

Description

The present invention relates to an image processing apparatus and method, a learning apparatus and method, and a program, and more particularly to an image processing apparatus and method, a learning apparatus and method, and a program capable of obtaining a higher-definition image.

In imaging apparatuses that capture images using an imaging element such as a CCD (Charge Coupled Device), a single-plate CCD is often used as the imaging element in order to reduce the size of the apparatus. In a single-plate CCD, each pixel outputs data for only one of the three primary colors R (red), G (green), and B (blue), and which color each pixel outputs is determined by the color filter array arranged on the front surface of the single-plate CCD.

For example, a pixel provided with a G color filter outputs G component data but does not output the R or B component. Likewise, a pixel provided with an R color filter outputs R component data but not the G or B component, and a pixel provided with a B color filter outputs the B component but not the G or R component.
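The one-color-per-pixel output described above can be sketched as follows. This is only an illustration: the function name `mosaic` and the 2x2 G/R/B/G filter layout are assumptions, since the text does not specify the arrangement of the color filter array.

```python
import numpy as np

def mosaic(rgb, pattern=("G", "R", "B", "G")):
    """Keep one color sample per pixel, as a single-plate sensor would.

    rgb:     (H, W, 3) array in R, G, B channel order.
    pattern: hypothetical 2x2 filter layout in row-major order (assumed).
    Returns (samples, colors): the sampled values and a per-pixel color label.
    """
    channel = {"R": 0, "G": 1, "B": 2}
    h, w, _ = rgb.shape
    samples = np.zeros((h, w), dtype=rgb.dtype)
    colors = np.empty((h, w), dtype="<U1")
    for dy in range(2):
        for dx in range(2):
            c = pattern[dy * 2 + dx]
            # Every second pixel in each direction shares the same filter color.
            samples[dy::2, dx::2] = rgb[dy::2, dx::2, channel[c]]
            colors[dy::2, dx::2] = c
    return samples, colors

# Small synthetic image so each sample can be traced back to its channel.
rgb = np.arange(4 * 4 * 3).reshape(4, 4, 3)
samples, colors = mosaic(rgb)
```

Each entry of `samples` carries a single component, so the two missing components at every pixel have to be predicted from neighboring pixels.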

When various kinds of processing are performed on image data obtained by imaging, however, each of the RGB color components is required for every pixel. Accordingly, some conventional imaging apparatuses use class classification adaptive processing to obtain image data equivalent to the output of a three-plate CCD from the image data output by a single-plate CCD (see, for example, Patent Document 1).

JP 2002-64836 A

However, in the technique described above, when pixels of a color different from the color to be predicted are taken from the single-plate CCD image and used in the prediction tap for predicting a given color (pixel value) of the pixel of interest in the image equivalent to three-plate CCD output, the image breaks down if the dynamic range of the different-color pixels is too large relative to the dynamic range of the pixels of the color to be predicted.

For example, as shown in FIG. 1, when G (green) pixels are used in the prediction tap to predict the R (red) pixel value at a given position A, the resulting R pixel value stands out compared with the R pixel values at surrounding positions. In the figure, the horizontal axis indicates position in the captured image, and the vertical axis indicates the pixel value (level) of each color at each position.

Curve 11 shows the G value (waveform) at each position in the real world; the G value increases steeply around position A. Curve 12 shows the R value (waveform) at each position in the real world; compared with the G value shown by curve 11, the R value increases gently around position A.

In the figure, circles and squares indicate the G pixels (pixel values) and the R pixels (pixel values) obtained by imaging, respectively. Consider the case in which the three G pixels indicated by circles and the two R pixels indicated by squares in the left-hand diagram of FIG. 1 are used as the prediction tap, and the R pixel value at position A is predicted by class classification adaptive processing.

In this case, the dynamic range of the G pixels used in the prediction tap is very large compared with that of the R pixels used in the prediction tap, and the G pixel value changes sharply in the vicinity of position A. Here, the dynamic range of pixels means the difference between the maximum and minimum pixel values of the pixels constituting the prediction tap.
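The definition of dynamic range given here is simple enough to show directly. In the sketch below, the sample values for the three G pixels and two R pixels are invented for illustration and do not come from FIG. 1.

```python
def dynamic_range(values):
    """Difference between the largest and smallest pixel value in a tap."""
    return max(values) - min(values)

# Hypothetical prediction tap around position A (values are made up).
g_tap = [40, 120, 200]   # three G pixels: steep change
r_tap = [90, 110]        # two R pixels: gentle change

dr_g = dynamic_range(g_tap)
dr_r = dynamic_range(r_tap)
```

With these assumed values the G dynamic range is eight times the R dynamic range, which is the kind of mismatch the text identifies as the cause of the overshoot at position A.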

The sharp change in the G pixel value near position A also affects the prediction of the R pixel value at position A: the R pixel value at position A is predicted such that the slope of the R waveform at position A has the same magnitude as the slope of the G waveform at position A.

As a result, as indicated by the triangle in the right-hand diagram of FIG. 1, the predicted R pixel value at position A is excessively emphasized under the influence of the surrounding G pixels and protrudes upward in the figure relative to the surrounding R pixel values. Consequently, as shown by curve 13, among the R pixel values of the image obtained by class classification adaptive processing, only the pixel value at position A is larger than the surrounding pixel values.

Conversely, when pixels of a color different from the color to be predicted are taken from the single-plate CCD image and used in the prediction tap for predicting a given color (pixel value) of the pixel of interest in the image equivalent to three-plate CCD output, and the dynamic range of the different-color pixels is too small relative to that of the pixels of the color to be predicted, the different-color pixels have almost no influence on the predicted pixel value. The pixels of the predicted color are therefore not sufficiently emphasized, and the image equivalent to three-plate CCD output obtained by class classification adaptive processing lacks fineness.

The present invention has been made in view of such circumstances and makes it possible to obtain a high-definition image with higher accuracy.

An image processing apparatus according to a first aspect of the present invention converts first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels having both the value of the first component and the value of the second component. The apparatus includes: prediction tap extraction means for extracting from the first image data, as a prediction tap, a plurality of pixels used to predict a pixel of interest, which is the pixel being focused on in the second image data; class tap extraction means for extracting from the first image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes; class classification means for classifying the pixel of interest using the class tap; normalization means for normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and prediction calculation means for predictively calculating the value of the first component or the value of the second component of the pixel of interest using a tap coefficient obtained in advance for the class of the pixel of interest and the normalized prediction tap.

When the value of the first component of the pixel of interest is predictively calculated, the normalization means can normalize the prediction tap by multiplying the second-component values of the pixels constituting the prediction tap by the value obtained by dividing the dynamic range of the first-component values by the dynamic range of the second-component values.
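This normalization rule can be sketched in a few lines. The function name and the sample values are illustrative, and the handling of a zero dynamic range in the second component is an assumption, since the text above does not cover that case.

```python
def normalize_prediction_tap(first, second):
    """Scale second-component values so both dynamic ranges match.

    first:  first-component pixel values in the prediction tap
            (the component being predicted).
    second: second-component pixel values in the prediction tap.
    """
    dr1 = max(first) - min(first)
    dr2 = max(second) - min(second)
    if dr2 == 0:
        # Assumption: a flat second component is left unchanged,
        # because the gain dr1 / dr2 would be undefined.
        return list(first), list(second)
    gain = dr1 / dr2  # DR(first component) / DR(second component)
    return list(first), [v * gain for v in second]

# Hypothetical tap: two R pixels (to be predicted) and three G pixels.
r_norm, g_norm = normalize_prediction_tap([90, 110], [40, 120, 200])
```

Multiplying every second-component value by the same gain scales its maximum-minus-minimum span by that gain, so after scaling the two dynamic ranges are equal.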

The first component or the second component may be a component representing red, blue, or green.

The first component or the second component may be a component representing luminance or color difference.

An image processing method according to the first aspect of the present invention converts first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels having both the value of the first component and the value of the second component. The method includes the steps of: extracting from the first image data, as a prediction tap, a plurality of pixels used to predict a pixel of interest, which is the pixel being focused on in the second image data; extracting from the first image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes; classifying the pixel of interest using the class tap; normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and predictively calculating the value of the first component or the value of the second component of the pixel of interest using a tap coefficient obtained in advance for the class of the pixel of interest and the normalized prediction tap.

A program according to the first aspect of the present invention causes a computer to execute image processing that converts first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels having both the value of the first component and the value of the second component. The program includes the steps of: extracting from the first image data, as a prediction tap, a plurality of pixels used to predict a pixel of interest, which is the pixel being focused on in the second image data; extracting from the first image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes; classifying the pixel of interest using the class tap; normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and predictively calculating the value of the first component or the value of the second component of the pixel of interest using a tap coefficient obtained in advance for the class of the pixel of interest and the normalized prediction tap.

In the first aspect of the present invention, when first image data composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component is converted into second image data composed of pixels having both the value of the first component and the value of the second component, a plurality of pixels used to predict a pixel of interest, which is the pixel being focused on in the second image data, are extracted from the first image data as a prediction tap; a plurality of pixels used for class classification, in which the pixel of interest is classified into one of a plurality of classes, are extracted from the first image data as a class tap; the pixel of interest is classified using the class tap; the prediction tap is normalized so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and the value of the first component or the value of the second component of the pixel of interest is predictively calculated using a tap coefficient obtained in advance for the class of the pixel of interest and the normalized prediction tap.
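The final two steps of this flow, normalization followed by prediction with class-specific tap coefficients, might look like the sketch below. It assumes that the prediction is a linear combination (dot product) of the tap coefficients and the normalized tap values, which this passage does not state explicitly; the function name, the coefficient table layout, and all numbers are hypothetical.

```python
import numpy as np

def predict_first_component(first_vals, second_vals, class_code, coeff_table):
    """Predict the first-component value of the pixel of interest.

    first_vals, second_vals: prediction-tap pixel values per component.
    class_code:              class of the pixel of interest.
    coeff_table:             mapping class_code -> coefficient vector
                             (assumed to come from a prior learning stage).
    """
    first = np.asarray(first_vals, dtype=float)
    second = np.asarray(second_vals, dtype=float)
    dr1 = first.max() - first.min()
    dr2 = second.max() - second.min()
    if dr2 > 0:  # assumption: skip scaling when the second component is flat
        second = second * (dr1 / dr2)
    tap = np.concatenate([first, second])
    # Assumed linear prediction: sum of coefficient * tap value.
    return float(np.dot(coeff_table[class_code], tap))

# Made-up coefficients for a made-up class 3.
table = {3: np.array([0.5, 0.5, 0.0, 0.0, 0.0])}
value = predict_first_component([90, 110], [40, 120, 200], 3, table)
```

Because the second-component values are rescaled before the dot product, a steep change in that component can no longer dominate the predicted value simply through its larger amplitude.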

A learning apparatus according to a second aspect of the present invention includes: pixel-of-interest extraction means for extracting the value of a first component of a pixel of interest, which is the pixel being focused on, from first image data composed of pixels having values of the first component and a second component of a component video signal; prediction tap extraction means for extracting, as a prediction tap, a plurality of pixels used to predict the pixel of interest from second image data composed of pixels having a value of the first component and pixels having a value of the second component; class tap extraction means for extracting from the second image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes; class classification means for classifying the pixel of interest using the class tap; normalization means for normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and calculation means for obtaining, using the value of the first component of the pixel of interest and the normalized prediction tap, a tap coefficient that corresponds to the class of the pixel of interest and is used to predict the value of the first component of the pixel of interest from the normalized prediction tap.

When the tap coefficient used to predict the value of the first component of the pixel of interest is obtained, the normalization means can normalize the prediction tap by multiplying the second-component values of the pixels constituting the prediction tap by the value obtained by dividing the dynamic range of the first-component values by the dynamic range of the second-component values.

A learning method or program according to the second aspect of the present invention includes the steps of: extracting the value of a first component of a pixel of interest, which is the pixel being focused on, from first image data composed of pixels having values of the first component and a second component of a component video signal; extracting, as a prediction tap, a plurality of pixels used to predict the pixel of interest from second image data composed of pixels having a value of the first component and pixels having a value of the second component; extracting from the second image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes; classifying the pixel of interest using the class tap; normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and obtaining, using the value of the first component of the pixel of interest and the normalized prediction tap, a tap coefficient that corresponds to the class of the pixel of interest and is used to predict the value of the first component of the pixel of interest from the normalized prediction tap.

In the second aspect of the present invention, the value of a first component of a pixel of interest, which is the pixel being focused on, is extracted from first image data composed of pixels having values of the first component and a second component of a component video signal; a plurality of pixels used to predict the pixel of interest are extracted as a prediction tap from second image data composed of pixels having a value of the first component and pixels having a value of the second component; a plurality of pixels used for class classification, in which the pixel of interest is classified into one of a plurality of classes, are extracted from the second image data as a class tap; the pixel of interest is classified using the class tap; the prediction tap is normalized so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and, using the value of the first component of the pixel of interest and the normalized prediction tap, a tap coefficient is obtained that corresponds to the class of the pixel of interest and is used to predict the value of the first component of the pixel of interest from the normalized prediction tap.
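One way such per-class tap coefficients could be obtained is by a least-squares fit over many teacher pixels. This is an assumption: the passage says only that the coefficients are obtained from the teacher value and the normalized prediction tap, not how. The function name and the training samples below are hypothetical.

```python
from collections import defaultdict

import numpy as np

def learn_tap_coefficients(samples):
    """Fit one coefficient vector per class by least squares (assumed method).

    samples: iterable of (class_code, normalized_tap, teacher_value), where
             teacher_value is the first-component value of the pixel of
             interest taken from the first (teacher) image data.
    """
    taps = defaultdict(list)
    targets = defaultdict(list)
    for class_code, tap, teacher in samples:
        taps[class_code].append(tap)
        targets[class_code].append(teacher)

    coeffs = {}
    for class_code in taps:
        x = np.asarray(taps[class_code], dtype=float)
        y = np.asarray(targets[class_code], dtype=float)
        # Minimize the squared error between x @ w and the teacher values.
        coeffs[class_code], *_ = np.linalg.lstsq(x, y, rcond=None)
    return coeffs

# Made-up training data for a single class 0 with two-pixel taps.
training = [(0, [1.0, 0.0], 2.0), (0, [0.0, 1.0], 3.0), (0, [1.0, 1.0], 5.0)]
learned = learn_tap_coefficients(training)
```

The taps passed in are expected to be normalized in the same way as at prediction time, so that the learned coefficients match the data they will later be applied to.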

According to the first aspect of the present invention, a high-definition image can be obtained. In particular, according to the first aspect of the present invention, a high-definition image can be obtained with higher accuracy.

According to the second aspect of the present invention, a high-definition image can likewise be obtained. In particular, according to the second aspect of the present invention, a high-definition image can be obtained with higher accuracy.

Embodiments of the present invention are described below. The correspondence between the constituent features of the present invention and the embodiments described in the specification or drawings is exemplified as follows. This description is intended to confirm that embodiments supporting the present invention are described in the specification or drawings. Accordingly, even if an embodiment is described in the specification or drawings but is not listed here as corresponding to a constituent feature of the present invention, that does not mean that the embodiment does not correspond to that constituent feature. Conversely, even if an embodiment is listed here as corresponding to a constituent feature, that does not mean that the embodiment does not correspond to constituent features other than that one.

An image processing apparatus according to the first aspect of the present invention (for example, the imaging apparatus 41 in FIG. 3) converts first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels having both the value of the first component and the value of the second component. The apparatus includes: prediction tap extraction means (for example, the prediction tap blocking circuit 114 in FIG. 5) for extracting from the first image data, as a prediction tap, a plurality of pixels used to predict a pixel of interest, which is the pixel being focused on in the second image data; class tap extraction means (for example, the ADRC blocking circuit 111 in FIG. 5) for extracting from the first image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes; class classification means (for example, the class classification circuit 113 in FIG. 5) for classifying the pixel of interest using the class tap; normalization means (for example, the prediction tap normalization unit 115 in FIG. 5) for normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and prediction calculation means (for example, the adaptive processing circuit 116 in FIG. 5) for predictively calculating the value of the first component or the value of the second component of the pixel of interest using a tap coefficient obtained in advance for the class of the pixel of interest and the normalized prediction tap.
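The class tap extraction means is exemplified by an "ADRC blocking circuit", which suggests that classification relies on ADRC (Adaptive Dynamic Range Coding). The passage does not describe the coding itself, so the sketch below assumes 1-bit ADRC, where each class-tap pixel is requantized to a single bit relative to the midpoint of the tap's range and the bits are concatenated into a class code. The function name and sample values are hypothetical.

```python
def adrc_class_code(class_tap):
    """Return a class code from a class tap using 1-bit ADRC (assumed scheme)."""
    lo, hi = min(class_tap), max(class_tap)
    if hi == lo:
        # Flat tap: no structure to encode, so use class 0 (assumption).
        return 0
    threshold = (lo + hi) / 2
    code = 0
    for value in class_tap:
        # One bit per pixel: 1 if in the upper half of the tap's range.
        code = (code << 1) | (1 if value >= threshold else 0)
    return code

# Hypothetical four-pixel class tap.
code = adrc_class_code([10, 200, 30, 180])
```

A tap of n pixels yields one of 2^n classes under this assumed scheme, and each class would then have its own set of tap coefficients.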

When the value of the first component of the pixel of interest is predictively calculated, the normalization means can normalize the prediction tap by multiplying the second-component values of the pixels constituting the prediction tap by the value obtained by dividing the dynamic range of the first-component values by the dynamic range of the second-component values (for example, the processing of step S49 in FIG. 6).

An image processing method according to the first aspect of the present invention converts first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels having both the value of the first component and the value of the second component. The method includes the steps of: extracting from the first image data, as a prediction tap, a plurality of pixels used to predict a pixel of interest, which is the pixel being focused on in the second image data (for example, step S48 in FIG. 6); extracting from the first image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes (for example, step S45 in FIG. 6); classifying the pixel of interest using the class tap (for example, step S47 in FIG. 6); normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal (for example, step S49 in FIG. 6); and predictively calculating the value of the first component or the value of the second component of the pixel of interest using a tap coefficient obtained in advance for the class of the pixel of interest and the normalized prediction tap (for example, step S50 in FIG. 6).

The program according to the first aspect of the present invention performs basically the same processing as the image processing method according to the first aspect described above, so its description is omitted to avoid repetition.

A learning apparatus according to the second aspect of the present invention (for example, the learning apparatus 201 in FIG. 11) includes: pixel-of-interest extraction means (for example, the teacher image blocking circuit 217 in FIG. 11) for extracting the value of a first component of a pixel of interest, which is the pixel being focused on, from first image data composed of pixels having values of the first component and a second component of a component video signal; prediction tap extraction means (for example, the prediction tap blocking circuit 215 in FIG. 11) for extracting, as a prediction tap, a plurality of pixels used to predict the pixel of interest from second image data composed of pixels having a value of the first component and pixels having a value of the second component; class tap extraction means (for example, the ADRC blocking circuit 212 in FIG. 11) for extracting from the second image data, as a class tap, a plurality of pixels used for class classification in which the pixel of interest is classified into one of a plurality of classes; class classification means (for example, the class classification circuit 214 in FIG. 11) for classifying the pixel of interest using the class tap; normalization means (for example, the prediction tap normalization unit 216 in FIG. 11) for normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap and the dynamic range of the second-component values of the pixels constituting the prediction tap become equal; and calculation means (for example, the arithmetic circuit 220 in FIG. 11) for obtaining, using the value of the first component of the pixel of interest and the normalized prediction tap, a tap coefficient that corresponds to the class of the pixel of interest and is used to predict the value of the first component of the pixel of interest from the normalized prediction tap.

前記正規化手段には、前記注目画素の前記第１の成分の値を予測するために用いられる前記タップ係数を求める場合、前記第１の成分の値のダイナミックレンジの値を前記第２の成分の値のダイナミックレンジの値で除算して得られる値を、前記予測タップを構成する画素が有する前記第２の成分の値に乗算させることにより前記予測タップを正規化させる（例えば、図１２のステップＳ８６の処理）ことができる。 When obtaining the tap coefficient used to predict the value of the first component of the target pixel, the normalization means can normalize the prediction tap by multiplying the second-component values of the pixels constituting the prediction tap by the value obtained by dividing the dynamic range of the first-component values by the dynamic range of the second-component values (for example, the processing of step S86 in FIG. 12).
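The normalization described above can be illustrated with a minimal Python sketch. The function names, the sample values, and the handling of a flat second component (dynamic range of zero) are assumptions made for illustration; the text only specifies multiplying the second-component values by the ratio of the two dynamic ranges.

```python
def dynamic_range(values):
    """Return max - min of a sequence of pixel values."""
    return max(values) - min(values)


def normalize_second_component(first_values, second_values):
    """Scale second-component tap values by DR_first / DR_second."""
    dr_first = dynamic_range(first_values)
    dr_second = dynamic_range(second_values)
    if dr_second == 0:
        # Assumption: a flat second component has no defined gain,
        # so it is returned unchanged.
        return list(second_values)
    gain = dr_first / dr_second
    return [value * gain for value in second_values]


first = [10, 30, 50]      # hypothetical first-component values in the tap
second = [100, 120, 180]  # hypothetical second-component values in the tap
scaled = normalize_second_component(first, second)
```

After scaling, the second-component values span the same range as the first-component values, which is the equality the normalization means is intended to establish.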

本発明の第２の側面の学習方法またはプログラムは、コンポーネント映像信号の第１の成分および第２の成分の値を有する画素から構成される第１の画像データから、注目している画素である注目画素の前記第１の成分の値を抽出し（例えば、図１２のステップＳ８７）、前記注目画素を予測するために用いる複数の画素を、前記第１の成分の値を有する画素と、前記第２の成分の値を有する画素とから構成される第２の画像データから予測タップとして抽出し（例えば、図１２のステップＳ８５）、前記注目画素を複数のクラスのうちのいずれかにクラス分けするクラス分類に用いる複数の画素を、前記第２の画像データからクラスタップとして抽出し（例えば、図１２のステップＳ８２）、前記クラスタップを用いて、前記注目画素のクラス分類を行い（例えば、図１２のステップＳ８４）、前記予測タップを構成する画素が有する前記第１の成分の値のダイナミックレンジと、前記予測タップを構成する画素が有する前記第２の成分の値のダイナミックレンジとが等しくなるように前記予測タップを正規化し（例えば、図１２のステップＳ８６）、前記注目画素の前記第１の成分の値、および正規化された前記予測タップを用いて、正規化された前記予測タップから前記注目画素の前記第１の成分の値を予測するために用いられる、前記注目画素のクラスに対応するタップ係数を求める（例えば、図１２のステップＳ９０）ステップを含む。 The learning method or program according to the second aspect of the present invention includes the steps of: extracting the value of the first component of a target pixel, which is the pixel of interest, from first image data composed of pixels each having values of a first component and a second component of a component video signal (for example, step S87 in FIG. 12); extracting, as a prediction tap, a plurality of pixels used to predict the target pixel from second image data composed of pixels having the value of the first component and pixels having the value of the second component (for example, step S85 in FIG. 12); extracting from the second image data, as a class tap, a plurality of pixels used for class classification that assigns the target pixel to one of a plurality of classes (for example, step S82 in FIG. 12); classifying the target pixel using the class tap (for example, step S84 in FIG. 12); normalizing the prediction tap so that the dynamic range of the first-component values of the pixels constituting the prediction tap equals the dynamic range of the second-component values of the pixels constituting the prediction tap (for example, step S86 in FIG. 12); and obtaining, using the value of the first component of the target pixel and the normalized prediction tap, a tap coefficient that corresponds to the class of the target pixel and is used to predict the value of the first component of the target pixel from the normalized prediction tap (for example, step S90 in FIG. 12).

本発明は、動画像を撮影するデジタルビデオカメラ、静止画像を撮像するデジタルスチルカメラ、カメラ一体型VTR（Video Tape Recorder）等の映像機器、放送業務に用いられる画像処理装置、プリンタ、スキャナなどに適用することができる。 The present invention can be applied to video equipment such as digital video cameras that capture moving images, digital still cameras that capture still images, and camera-integrated VTRs (Video Tape Recorders), as well as to image processing apparatuses used in broadcasting, printers, scanners, and the like.

以下、図面を参照して本発明を適用した実施の形態について説明する。 Embodiments to which the present invention is applied will be described below with reference to the drawings.

図２は、本発明を適用した撮像装置が、単板CCDの出力からより高精細な画像データを生成する処理の原理を表している。 FIG. 2 shows the principle of processing in which the imaging apparatus to which the present invention is applied generates higher-definition image data from the output of a single-plate CCD.

図２に示すように、撮像装置においては、ｎ×ｍ（図２では縦６×横８）の単板CCDが出力する画像データから、ｎ×ｍのＲ（赤）の画像データ、ｎ×ｍのＧ（緑）の画像データ、およびｎ×ｍのＢ（青）の画像データが、それぞれ直接演算により生成される。換言すれば、撮像装置は、単板CCDが出力する画像データを、Ｒの画像データ、Ｇの画像データ、およびＢの画像データからなる３板CCD出力相当の画像データに変換する。 As shown in FIG. 2, the imaging apparatus generates n × m R (red) image data, n × m G (green) image data, and n × m B (blue) image data, each by direct calculation, from the image data output by an n × m single-plate CCD (6 rows × 8 columns in FIG. 2). In other words, the imaging apparatus converts the image data output from the single-plate CCD into image data equivalent to a three-plate CCD output, consisting of R image data, G image data, and B image data.

撮像装置の単板CCDの前面には色フィルタアレイが配置されており、色フィルタアレイの色配列は、例えば図２の左側に示すように、ベイヤー配列と称される色配列とされる。図２では、Ｇの色のフィルタが市松状に配置され、残りの部分にＲの色のフィルタおよびＢの色のフィルタが一行ごとに交互に配置されている。 A color filter array is arranged on the front surface of the single CCD of the imaging apparatus, and the color arrangement of the color filter array is, for example, a color arrangement called a Bayer arrangement as shown on the left side of FIG. In FIG. 2, the G color filters are arranged in a checkered pattern, and the R color filters and the B color filters are alternately arranged for each line in the remaining portion.
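As an illustration of the Bayer arrangement just described, the following Python sketch builds the color layout of a 6 × 8 sensor. The phase of the pattern (G in the top-left corner, B in the first row, R in the second) is an assumption chosen to agree with the pixel names G11, B12, and R21 used for FIG. 9; the function names are hypothetical.

```python
from collections import Counter


def bayer_color(row, col):
    """Return the filter color at a 0-indexed sensor site.

    G sits on the checkerboard sites; the remaining sites alternate
    between B rows and R rows (assumed phase: row 0 holds B).
    """
    if (row + col) % 2 == 0:
        return "G"
    return "B" if row % 2 == 0 else "R"


def bayer_layout(rows, cols):
    """Build the full color filter array as a list of rows."""
    return [[bayer_color(r, c) for c in range(cols)] for r in range(rows)]


layout = bayer_layout(6, 8)  # 6 rows x 8 columns, as in FIG. 2
counts = Counter(color for line in layout for color in line)
```

Half of the sites carry G and a quarter each carry R and B, so every site is missing two of the three color components, which is what the conversion to three-plate-equivalent data has to supply.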

このようにして撮像装置は、単板CCDから出力されたＲの画素値を有する画素、Ｇの画素値を有する画素、およびＢの画素値を有する画素から構成される画像データを、Ｒの画素値、Ｇの画素値、およびＢの画素値を有する画素から構成される３板CCD出力相当の画像データに変換する。 In this way, the imaging apparatus converts the image data output from the single-plate CCD, which is composed of pixels having an R pixel value, pixels having a G pixel value, and pixels having a B pixel value, into image data equivalent to a three-plate CCD output, in which every pixel has an R pixel value, a G pixel value, and a B pixel value.

図３は、以上のような原理に従って、被写体を撮像して高精細な画像データを生成する撮像装置の構成例を示すブロック図である。 FIG. 3 is a block diagram illustrating a configuration example of an imaging apparatus that captures a subject and generates high-definition image data according to the principle described above.

撮像装置４１のレンズ５１は被写体からの光を集光し、集光した光を、アイリス５２および色フィルタアレイ５３を介して単板のCCD５４に入射させる。すなわち、レンズ５１は入射した光をCCD５４上に結像させる。また、色フィルタアレイ５３は、１画素ごとに割り当てられた色フィルタから構成され、CCD５４の前面（図中、左側）に配置されている。 The lens 51 of the imaging device 41 condenses the light from the subject and makes the collected light enter the single-plate CCD 54 via the iris 52 and the color filter array 53. That is, the lens 51 forms an image of the incident light on the CCD 54. The color filter array 53 is composed of color filters assigned to each pixel, and is arranged on the front surface (the left side in the figure) of the CCD 54.

CCD５４は、入射された光を光電変換して、光電変換により得られた電荷を蓄積し、蓄積した電荷をアナログ信号である画像信号としてAGC（Automatic Gain Control）/CDS（Correlated Double Sampling）回路５５に供給する。 The CCD 54 photoelectrically converts the incident light, accumulates the charge obtained by the photoelectric conversion, and supplies the accumulated charge, as an analog image signal, to the AGC (Automatic Gain Control) / CDS (Correlated Double Sampling) circuit 55.

AGC/CDS回路５５は、供給された画像信号の大きさ（レベル）が一定になるようにゲインを調整して出力するとともに、CCD５４において発生する１／ｆノイズを除去する。さらに、AGC/CDS回路５５は、メインCPU（Central Processing Unit）５９の制御に基づいて、CCD５４における電荷の蓄積時間を電気的に変化させる電子シャッタ処理も行う。AGC/CDS回路５５から出力された画像信号は、Ａ／Ｄ（Analog/Digital）変換部５６に入力され、アナログ信号からデジタル信号である画像データに変換された後、画像信号処理回路５７に入力される。画像信号処理回路５７は、入力された画像データに対して、欠陥補正処理、デジタルクランプ処理、ホワイトバランス調整処理、ガンマ補正処理、クラス分類適応処理を用いた補間処理などの所定の処理を行う。 The AGC/CDS circuit 55 adjusts the gain so that the magnitude (level) of the supplied image signal is constant and outputs the result, and also removes the 1/f noise generated in the CCD 54. Under the control of a main CPU (Central Processing Unit) 59, the AGC/CDS circuit 55 also performs electronic shutter processing, which electrically changes the charge accumulation time in the CCD 54. The image signal output from the AGC/CDS circuit 55 is input to an A/D (Analog/Digital) conversion unit 56, converted from an analog signal into image data, which is a digital signal, and then input to an image signal processing circuit 57. The image signal processing circuit 57 performs predetermined processing on the input image data, such as defect correction processing, digital clamp processing, white balance adjustment processing, gamma correction processing, and interpolation processing using class classification adaptive processing.

タイミングジェネレータ５８は、メインCPU５９の制御に基づいて、各種のタイミング信号を発生し、CCD５４、AGC/CDS回路５５、Ａ／Ｄ変換部５６、メインCPU５９などに供給する。 The timing generator 58 generates various timing signals based on the control of the main CPU 59, and supplies them to the CCD 54, the AGC / CDS circuit 55, the A / D converter 56, the main CPU 59, and the like.

モータ６０は、メインCPU５９の制御に基づいて、アイリス５２を駆動し、レンズ５１からCCD５４に入射される光の量を制御する。モータ６１は、メインCPU５９により制御され、レンズ５１を制御して、レンズ５１のCCD５４に対するフォーカス状態を制御する。発光部６２は、メインCPU５９により制御され、撮像時に被写体に対して所定の閃光を照射する。 The motor 60 drives the iris 52 based on the control of the main CPU 59 and controls the amount of light incident on the CCD 54 from the lens 51. The motor 61 is controlled by the main CPU 59 and controls the lens 51 to control the focus state of the lens 51 with respect to the CCD 54. The light emitting unit 62 is controlled by the main CPU 59 and irradiates a predetermined flash to the subject during imaging.

記録媒体インターフェース６３は、画像信号処理回路５７から出力された画像データを、必要に応じてメモリ６４に記憶させ、所定のインターフェース処理を実行した後、記録媒体６５に供給して記録させる。記録媒体６５は、例えば、不揮発性のフラッシュメモリ、磁気ディスク、光ディスク、光磁気ディスクなどから構成され、撮像装置４１に着脱可能とされている。 The recording medium interface 63 stores the image data output from the image signal processing circuit 57 in the memory 64 as necessary, executes a predetermined interface process, and then supplies the image data to the recording medium 65 for recording. The recording medium 65 includes, for example, a non-volatile flash memory, a magnetic disk, an optical disk, a magneto-optical disk, and the like, and is detachable from the imaging device 41.

コントローラ６６は、メインCPU５９の制御のもと、画像信号処理回路５７および記録媒体インターフェース６３を制御する。また、メインCPU５９には、シャッタボタンやスイッチなどから構成される操作部７０から、撮像者の操作に応じて各種の指令が入力される。電源部６７は、バッテリ６８およびDC/DCコンバータ６９を内蔵しており、DC/DCコンバータ６９は、バッテリ６８からの電力を所定の値の直流電圧に変換し、撮像装置４１の各部に供給する。充電可能なバッテリ６８は、撮像装置４１に着脱可能とされている。 The controller 66 controls the image signal processing circuit 57 and the recording medium interface 63 under the control of the main CPU 59. In addition, various commands are input to the main CPU 59 from the operation unit 70 including shutter buttons, switches, and the like in accordance with the operation of the photographer. The power supply unit 67 includes a battery 68 and a DC / DC converter 69, and the DC / DC converter 69 converts power from the battery 68 into a DC voltage having a predetermined value and supplies it to each unit of the imaging device 41. . The rechargeable battery 68 can be attached to and detached from the imaging device 41.

次に、図４のフローチャートを参照して、撮像装置４１が所定の被写体を撮像し、撮像により得られた画像データを記録媒体６５に記録させる処理について説明する。 Next, with reference to the flowchart of FIG. 4, a process in which the imaging device 41 images a predetermined subject and causes the recording medium 65 to record image data obtained by the imaging will be described.

ステップＳ１１において、レンズ５１は被写体からの光を集光し、集光した光をアイリス５２および色フィルタアレイ５３を介してCCD５４に入射させる。また、メインCPU５９は、モータ６０を制御してアイリス５２を駆動し、CCD５４に入射するレンズ５１からの光量を所定の値に調整させるとともに、モータ６１を制御してレンズ５１の位置を調整させ、フォーカス制御を実行する。 In step S 11, the lens 51 condenses the light from the subject, and causes the collected light to enter the CCD 54 via the iris 52 and the color filter array 53. The main CPU 59 controls the motor 60 to drive the iris 52 to adjust the light amount from the lens 51 incident on the CCD 54 to a predetermined value, and also controls the motor 61 to adjust the position of the lens 51. Perform focus control.

ステップＳ１２において、メインCPU５９は、撮像者によりシャッタボタンが操作されたか否かを判定する。例えば、撮像者が撮像装置４１に設けられた、図示せぬモニタやファインダに表示された画像を見ることで画角を確認し、シャッタボタンとしての操作部７０を操作すると、操作部７０からメインCPU５９には、シャッタボタンが操作された旨の信号が供給される。メインCPU５９は、シャッタボタンが操作された旨の信号が供給された場合、シャッタボタンが操作されたと判定する。 In step S12, the main CPU 59 determines whether the photographer has operated the shutter button. For example, when the photographer checks the angle of view by looking at an image displayed on a monitor or viewfinder (not shown) provided in the imaging device 41 and then operates the operation unit 70 serving as the shutter button, the operation unit 70 supplies the main CPU 59 with a signal indicating that the shutter button has been operated. When this signal is supplied, the main CPU 59 determines that the shutter button has been operated.

ステップＳ１２において、シャッタボタンが操作されていないと判定された場合、処理はステップＳ１１に戻り、それ以降の処理が繰り返される。 If it is determined in step S12 that the shutter button has not been operated, the process returns to step S11, and the subsequent processes are repeated.

これに対して、ステップＳ１２において、シャッタボタンが操作されたと判定された場合、処理はステップＳ１３に進む。また、このとき、メインCPU５９は発光部６２を駆動して閃光を発生させ、被写体に照射させるとともに、タイミングジェネレータ５８を制御して各種のタイミング信号を発生させる。タイミングジェネレータ５８は、発生させたタイミング信号をCCD５４、AGC/CDS回路５５、Ａ／Ｄ変換部５６、およびメインCPU５９に供給する。 On the other hand, if it is determined in step S12 that the shutter button has been operated, the process proceeds to step S13. At this time, the main CPU 59 drives the light emitting unit 62 to generate a flash and irradiate the subject, and controls the timing generator 58 to generate various timing signals. The timing generator 58 supplies the generated timing signal to the CCD 54, the AGC / CDS circuit 55, the A / D converter 56, and the main CPU 59.

ステップＳ１３において、CCD５４は、タイミングジェネレータ５８からのタイミング信号に同期して、レンズ５１から入射した光を光電変換し（電荷に変換し）、これにより得られたアナログ信号を画像信号としてAGC/CDS回路５５に供給する。 In step S13, the CCD 54 photoelectrically converts the light incident from the lens 51 (converts it into charge) in synchronization with the timing signal from the timing generator 58, and supplies the resulting analog signal to the AGC/CDS circuit 55 as an image signal.

ステップＳ１４において、AGC/CDS回路５５は、CCD５４から供給された画像信号の１／ｆノイズ成分を除去する処理を行った後、画像信号の大きさが一定になるようにゲインを調整して画像信号をＡ／Ｄ変換部５６に供給する。 In step S14, the AGC/CDS circuit 55 removes the 1/f noise component from the image signal supplied from the CCD 54, adjusts the gain so that the magnitude of the image signal is constant, and supplies the image signal to the A/D conversion unit 56.

ステップＳ１５において、Ａ／Ｄ変換部５６は、AGC/CDS回路５５から供給された画像信号をアナログ信号からデジタル信号である画像データに変換し、変換により得られた画像データを画像信号処理回路５７に供給する。 In step S15, the A/D conversion unit 56 converts the image signal supplied from the AGC/CDS circuit 55 from an analog signal into image data, which is a digital signal, and supplies the resulting image data to the image signal processing circuit 57.

ステップＳ１６において、画像信号処理回路５７はＡ／Ｄ変換部５６から供給された画像データに対して、画像信号処理を施す。その詳細は図６のフローチャートを参照して後述するが、欠陥補正処理、デジタルクランプ処理、ホワイトバランス調整処理、ガンマ補正処理、クラス分類適応処理などの画像信号処理が施される。画像信号処理回路５７は、画像信号処理を施した画像データを必要に応じてメモリ６４に一時的に記憶させ、記録媒体インターフェース６３を介して記録媒体６５に供給する。 In step S 16, the image signal processing circuit 57 performs image signal processing on the image data supplied from the A / D conversion unit 56. Although details will be described later with reference to the flowchart of FIG. 6, image signal processing such as defect correction processing, digital clamp processing, white balance adjustment processing, gamma correction processing, class classification adaptation processing, and the like is performed. The image signal processing circuit 57 temporarily stores the image data subjected to the image signal processing in the memory 64 as necessary, and supplies the image data to the recording medium 65 via the recording medium interface 63.

ステップＳ１７において、記録媒体６５は、記録媒体インターフェース６３から供給された画像データを記録して、撮像処理は終了する。なお、撮像により得られた画像データを記録媒体６５に記録せずに、撮像装置４１に接続された他の装置に所定の信号フォーマットで出力するようにしてもよい。 In step S17, the recording medium 65 records the image data supplied from the recording medium interface 63, and the imaging process ends. Note that the image data obtained by imaging may be output in a predetermined signal format to another device connected to the imaging device 41 without being recorded on the recording medium 65.

このようにして、撮像装置４１は被写体を撮像し、撮像により得られた画像データを記録媒体６５に記録する。 In this manner, the imaging device 41 images the subject and records the image data obtained by the imaging on the recording medium 65.

ところで、画像データに対して画像信号処理を施す画像信号処理回路５７は、例えば、図５に示すように構成される。 By the way, the image signal processing circuit 57 that performs image signal processing on image data is configured as shown in FIG. 5, for example.

画像信号処理回路５７は、欠陥補正回路１０１、クランプ回路１０２、ホワイトバランス回路１０３、ガンマ補正回路１０４、補間処理部１０５、補正回路１０６、およびRGBマトリクス回路１０７から構成される。 The image signal processing circuit 57 includes a defect correction circuit 101, a clamp circuit 102, a white balance circuit 103, a gamma correction circuit 104, an interpolation processing unit 105, a correction circuit 106, and an RGB matrix circuit 107.

単板のCCD５４（図３）から出力された画像信号（画像データ）は、Ａ／Ｄ変換部５６（図３）を介して欠陥補正回路１０１に供給される。ここで、欠陥補正回路１０１に供給される画像データは、例えば、Ｒ（赤）、Ｇ（緑）、およびＢ（青）のそれぞれの色を成分とするコンポーネント映像信号とされる。欠陥補正回路１０１は、CCD５４のうち光に反応しない画素に対応する成分、あるいは、常に電荷を保持している画素に対応する成分などの欠陥成分を検出し、検出の結果に基づいてＡ／Ｄ変換部５６から供給された画像データを補正する欠陥補正処理を行う。 The image signal (image data) output from the single-plate CCD 54 (FIG. 3) is supplied to the defect correction circuit 101 via the A/D conversion unit 56 (FIG. 3). Here, the image data supplied to the defect correction circuit 101 is, for example, a component video signal whose components are the colors R (red), G (green), and B (blue). The defect correction circuit 101 detects defect components, such as components corresponding to pixels of the CCD 54 that do not react to light or to pixels that always hold a charge, and performs defect correction processing that corrects the image data supplied from the A/D conversion unit 56 based on the detection result.

また、画像信号が読み出される場合、画像の各画素の画素値を示す信号が有する値よりも小さい値が検出されると、画像の１ライン分の画像信号の読み出しが終了したと判定される。画素値を示す信号が有する値の最小値は０であるので、負の値が検出されると画像の１ライン分の画像信号の読み出しが終了したと判定されることになる。そこで、Ａ／Ｄ変換部５６は、画像信号の負の値がカットされるのを防ぐため、信号値を若干正の方向へシフトさせた状態で、アナログ信号である画像信号をデジタル信号である画像データに変換する。クランプ回路１０２は、欠陥補正回路１０１から供給された画像データの正の方向へのシフト量を元に戻すクランプ処理を行う。 When an image signal is read out, detection of a value smaller than any value that a signal indicating a pixel value can take is treated as the end of readout of one line of the image. Because the minimum value of a signal indicating a pixel value is 0, detection of a negative value means that readout of one line of the image has ended. Therefore, to prevent negative values of the image signal from being cut off, the A/D conversion unit 56 converts the analog image signal into digital image data with the signal values shifted slightly in the positive direction. The clamp circuit 102 performs clamp processing that removes this positive shift from the image data supplied from the defect correction circuit 101.

ホワイトバランス回路１０３は、クランプ回路１０２から供給された画像データとしてのＲ、Ｇ、およびＢの各色信号のそれぞれのゲインの補正を行う。ガンマ補正回路１０４は、ホワイトバランス回路１０３から供給された各色信号の値をガンマ曲線に従って補正する。 The white balance circuit 103 corrects the respective gains of the R, G, and B color signals as image data supplied from the clamp circuit 102. The gamma correction circuit 104 corrects the value of each color signal supplied from the white balance circuit 103 according to the gamma curve.

補間処理部１０５は、クラス分類適応処理を行うことによって、ガンマ補正回路１０４から供給された画像データを３板CCD出力相当の画像データに変換する。また、補間処理部１０５は、変換により得られた画像データを補正回路１０６に供給する。 The interpolation processing unit 105 converts the image data supplied from the gamma correction circuit 104 into image data corresponding to a three-plate CCD output by performing a class classification adaptive process. Further, the interpolation processing unit 105 supplies the image data obtained by the conversion to the correction circuit 106.

補間処理部１０５は、ADRC（Adaptive Dynamic Range Control）ブロック化回路１１１、ADRC処理回路１１２、クラス分類回路１１３、予測タップブロック化回路１１４、予測タップ正規化部１１５、適応処理回路１１６、および係数メモリ１１７から構成される。 The interpolation processing unit 105 includes an ADRC (Adaptive Dynamic Range Control) blocking circuit 111, an ADRC processing circuit 112, a class classification circuit 113, a prediction tap blocking circuit 114, a prediction tap normalization unit 115, an adaptive processing circuit 116, and a coefficient memory 117.

ADRCブロック化回路１１１は、単板のCCD５４から出力された画像データを変換して得ようとする３板CCD出力相当の画像データ（この３板CCD出力相当の画像データはこれから求めようとする画像データであり、現段階では存在しないため仮想的に想定される）を構成する画素を、順次、注目画素とし、ガンマ補正回路１０４から供給された画像データを構成する画素のいくつかを、注目画素を複数のクラスのうちのいずれかにクラス分けするためのクラスタップとして抽出する。 The ADRC blocking circuit 111 sequentially takes, as the target pixel, each pixel constituting the three-plate-CCD-equivalent image data that is to be obtained by converting the image data output from the single-plate CCD 54 (this image data is what is about to be computed and does not yet exist at this stage, so it is assumed virtually), and extracts some of the pixels constituting the image data supplied from the gamma correction circuit 104 as a class tap used to classify the target pixel into one of a plurality of classes.

また、ADRCブロック化回路１１１は、抽出したクラスタップをADRC処理回路１１２に供給する。 Further, the ADRC blocking circuit 111 supplies the extracted class tap to the ADRC processing circuit 112.

ADRC処理回路１１２は、ADRCブロック化回路１１１から供給されたクラスタップにADRC処理を施し、ADRC処理が施されたクラスタップをクラス分類回路１１３に供給する。クラス分類回路１１３は、ADRC処理回路１１２からのクラスタップに基づいて（クラスタップを用いて）注目画素をクラス分類し、分類されたクラスを示すクラスコード（クラス番号）を適応処理回路１１６に供給する。 The ADRC processing circuit 112 performs ADRC processing on the class tap supplied from the ADRC blocking circuit 111 and supplies the ADRC-processed class tap to the class classification circuit 113. The class classification circuit 113 classifies the target pixel based on (that is, using) the class tap from the ADRC processing circuit 112, and supplies a class code (class number) indicating the resulting class to the adaptive processing circuit 116.

予測タップブロック化回路１１４は、ガンマ補正回路１０４から供給された画像データを構成する画素のいくつかを、注目画素の画素値を予測するために用いられる予測タップとして抽出し、予測タップ正規化部１１５に供給する。 The prediction tap blocking circuit 114 extracts some of the pixels constituting the image data supplied from the gamma correction circuit 104 as a prediction tap used to predict the pixel value of the target pixel, and supplies the prediction tap to the prediction tap normalization unit 115.

予測タップ正規化部１１５は、予測タップブロック化回路１１４から供給された予測タップを正規化して、正規化された予測タップを適応処理回路１１６に供給する。 The prediction tap normalization unit 115 normalizes the prediction tap supplied from the prediction tap blocking circuit 114 and supplies the normalized prediction tap to the adaptive processing circuit 116.

適応処理回路１１６は、クラス分類回路１１３から供給されたクラスコードに対応するタップ係数を係数メモリ１１７から読み出し、タップ係数を予測タップ正規化部１１５から供給された予測タップを構成する画素の画素値に乗算して、必要な画素値を予測演算する。 The adaptive processing circuit 116 reads from the coefficient memory 117 the tap coefficients corresponding to the class code supplied from the class classification circuit 113, multiplies the pixel values of the pixels constituting the prediction tap supplied from the prediction tap normalization unit 115 by those tap coefficients, and thereby predicts the required pixel value.
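The prediction performed by the adaptive processing circuit 116 can be sketched in Python as a per-class lookup followed by a weighted sum. The sketch assumes that the products of tap coefficients and pixel values are summed into a single predicted value (a linear combination); the dictionary used as the coefficient memory, the coefficient values, and the function name are hypothetical.

```python
def predict_pixel(class_code, prediction_tap, coefficient_memory):
    """Predict one pixel value from a (normalized) prediction tap.

    coefficient_memory maps a class code to one tap coefficient per
    pixel of the prediction tap.
    """
    coefficients = coefficient_memory[class_code]
    if len(coefficients) != len(prediction_tap):
        raise ValueError("need exactly one coefficient per tap pixel")
    # Multiply each tap pixel by its coefficient and sum the products.
    return sum(w * x for w, x in zip(coefficients, prediction_tap))


# Hypothetical coefficient memory with two classes and four-pixel taps.
coefficient_memory = {
    0: [0.25, 0.25, 0.25, 0.25],
    1: [0.5, 0.5, 0.0, 0.0],
}
predicted = predict_pixel(1, [10, 20, 30, 40], coefficient_memory)
```

Because the coefficients are selected by class, the same prediction tap can yield different predicted values depending on the local pattern that the class code describes.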

係数メモリ１１７は注目画素の画素値を予測するために用いられるクラスごとのタップ係数（各クラスについて予め求められているタップ係数）を記憶する。例えば、係数メモリ１１７には、各クラスに分類された注目画素のＲの画素値を予測するためのタップ係数、Ｇの画素値を予測するためのタップ係数、およびＢの画素値を予測するためのタップ係数が予め記憶されている。 The coefficient memory 117 stores a tap coefficient for each class used for predicting the pixel value of the target pixel (a tap coefficient determined in advance for each class). For example, the coefficient memory 117 predicts the tap coefficient for predicting the R pixel value of the target pixel classified into each class, the tap coefficient for predicting the G pixel value, and the B pixel value. The tap coefficients are stored in advance.

補正回路１０６は、補間処理部１０５の適応処理回路１１６から供給された画像データに対し、エッジ強調などの画像を視覚的に良く見せるために必要な補正処理を行う。RGBマトリクス回路１０７は、補正回路１０６から供給された画像データとしてのＲ、Ｇ、およびＢの各色信号をそのまま出力するか、または、各色信号に予め用意されている所定の変換マトリクスを乗算し、YUV方式（輝度Ｙ、色差Ｕ、および色差Ｖを成分とする信号方式）などの所定の信号フォーマットの画像データに変換して出力する。 The correction circuit 106 performs correction processing necessary for visually enhancing the image such as edge enhancement on the image data supplied from the adaptive processing circuit 116 of the interpolation processing unit 105. The RGB matrix circuit 107 outputs the R, G, and B color signals as image data supplied from the correction circuit 106 as they are, or multiplies each color signal by a predetermined conversion matrix, The image data is converted into image data of a predetermined signal format such as YUV method (signal method using luminance Y, color difference U, and color difference V as components) and output.
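The matrix conversion performed by the RGB matrix circuit 107 can be sketched as follows. The text does not give the contents of the conversion matrix; the coefficients below are the commonly used ITU-R BT.601 luminance weights with the usual U and V scale factors, and are an assumption for illustration only, as are the names in the code.

```python
# Assumed luminance weights (BT.601) and U/V scale factors.
KR, KG, KB = 0.299, 0.587, 0.114
U_SCALE, V_SCALE = 0.492, 0.877

# Rows produce Y, U = U_SCALE * (B - Y), and V = V_SCALE * (R - Y).
YUV_MATRIX = [
    [KR, KG, KB],
    [-U_SCALE * KR, -U_SCALE * KG, U_SCALE * (1 - KB)],
    [V_SCALE * (1 - KR), -V_SCALE * KG, -V_SCALE * KB],
]


def apply_matrix(matrix, rgb):
    """Multiply a 3x3 conversion matrix by an (R, G, B) triple."""
    return tuple(sum(w * c for w, c in zip(row, rgb)) for row in matrix)


y, u, v = apply_matrix(YUV_MATRIX, (255, 255, 255))  # white input
```

For a white input the luminance equals the input level and both color differences vanish (up to floating-point rounding), which is a quick sanity check on any such matrix.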

次に、図６のフローチャートを参照して、図４のステップＳ１６の処理において、画像信号処理回路５７により行われる画像信号処理について説明する。 Next, image signal processing performed by the image signal processing circuit 57 in the process of step S16 of FIG. 4 will be described with reference to the flowchart of FIG.

ステップＳ４１において、欠陥補正回路１０１は、Ａ／Ｄ変換部５６から供給された画像データに対して、欠陥画素があればこれを補正する欠陥補正処理を施し、欠陥補正処理が施された画像データをクランプ回路１０２に供給する。例えば、欠陥補正回路１０１は、光に反応しない画素、または常に電荷を有している画素に対しては、その画素の画素値を隣接する画素の画素値の平均値で置換することにより画像データを補正する。 In step S41, the defect correction circuit 101 performs defect correction processing on the image data supplied from the A/D conversion unit 56, correcting any defective pixels, and supplies the corrected image data to the clamp circuit 102. For example, for a pixel that does not react to light or a pixel that always holds a charge, the defect correction circuit 101 corrects the image data by replacing the pixel value of that pixel with the average of the pixel values of adjacent pixels.
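The replacement of a defective pixel by an average of its neighbors can be sketched in Python as below. The text does not say which pixels count as adjacent; this sketch assumes the nearest sites of the same color in a Bayer mosaic (two sites away vertically and horizontally), and the function name and sample data are hypothetical.

```python
# Assumed neighborhood: nearest same-color sites in a Bayer mosaic.
SAME_COLOR_OFFSETS = ((-2, 0), (2, 0), (0, -2), (0, 2))


def corrected_value(image, row, col, offsets=SAME_COLOR_OFFSETS):
    """Return the mean of the in-bounds neighbors of a defective pixel."""
    height, width = len(image), len(image[0])
    neighbors = [
        image[row + dr][col + dc]
        for dr, dc in offsets
        if 0 <= row + dr < height and 0 <= col + dc < width
    ]
    if not neighbors:
        # No usable neighbor: keep the original value.
        return image[row][col]
    return sum(neighbors) / len(neighbors)


image = [[10] * 5 for _ in range(5)]
image[2][2] = 0          # hypothetical dead pixel
image[0][2], image[4][2], image[2][4] = 8, 12, 14
center = corrected_value(image, 2, 2)   # mean of 8, 12, 10, 14
corner = corrected_value(image, 0, 0)   # mean of 10 and 8
```

Neighbors that fall outside the image are skipped, so pixels near the border are averaged over fewer values.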

ステップＳ４２において、クランプ回路１０２は、欠陥補正回路１０１から供給された画像データに対して、Ａ／Ｄ変換部５６におけるオフセット値を補正するためのクランプ処理を施し、クランプ処理が施された画像データをホワイトバランス回路１０３に供給する。 In step S42, the clamp circuit 102 performs a clamp process for correcting the offset value in the A / D conversion unit 56 on the image data supplied from the defect correction circuit 101, and the image data subjected to the clamp process. Is supplied to the white balance circuit 103.

ステップＳ４３において、ホワイトバランス回路１０３は、クランプ回路１０２から供給された画像データに対してホワイトバランス調整処理を施し、画像データとしてのＲ、Ｇ、およびＢの各色信号のそれぞれのレベルを適正な白が表現できるレベルに調整する。そして、ホワイトバランス回路１０３は、ホワイトバランス調整処理が施された画像データをガンマ補正回路１０４に供給する。 In step S43, the white balance circuit 103 performs white balance adjustment processing on the image data supplied from the clamp circuit 102, adjusting the level of each of the R, G, and B color signals that make up the image data so that white is rendered correctly. The white balance circuit 103 then supplies the adjusted image data to the gamma correction circuit 104.

ステップＳ４４において、ガンマ補正回路１０４は、ホワイトバランス回路１０３から供給された画像データに対してガンマ補正処理を施し、ガンマ補正処理が施された画像データを補間処理部１０５のADRCブロック化回路１１１および予測タップブロック化回路１１４に供給する。 In step S44, the gamma correction circuit 104 performs gamma correction processing on the image data supplied from the white balance circuit 103, and supplies the gamma-corrected image data to the ADRC blocking circuit 111 and the prediction tap blocking circuit 114 of the interpolation processing unit 105.

ステップＳ４５において、ADRCブロック化回路１１１は、ガンマ補正回路１０４から供給された画像データをｐ×ｑ個（但し、ｐおよびｑは正の整数）のブロックに分割し、分割された各ブロックからクラスタップを抽出する。ADRCブロック化回路１１１は、抽出したクラスタップをADRC処理回路１１２に供給する。 In step S45, the ADRC blocking circuit 111 divides the image data supplied from the gamma correction circuit 104 into p × q blocks (where p and q are positive integers) and extracts a class tap from each block. The ADRC blocking circuit 111 supplies the extracted class tap to the ADRC processing circuit 112.

例えば、図７Ａに示すように、注目画素の位置が矢印Ｋ１１により示されるＧの画素に対応する位置である場合、ADRCブロック化回路１１１は、図中、矢印Ｋ１１により示されるＧの画素の左上、左下、右上、および右下に隣接する４個のＧの画素、矢印Ｋ１１により示されるＧの画素から上下方向にＲの画素を介して隣接する２個のＧの画素、および横方向にＢの画素を介して隣接する２個のＧの画素、並びに矢印Ｋ１１により示されるＧの画素からなる計９個のＧの画素をクラスタップとして抽出する。 For example, as shown in FIG. 7A, when the target pixel is at the position corresponding to the G pixel indicated by arrow K11, the ADRC blocking circuit 111 extracts a total of nine G pixels as the class tap: the four G pixels adjacent to the upper left, lower left, upper right, and lower right of the G pixel indicated by arrow K11; the two G pixels adjacent to it in the vertical direction across an R pixel; the two G pixels adjacent to it in the horizontal direction across a B pixel; and the G pixel indicated by arrow K11 itself.

また、例えば、図７Ｂに示すように、注目画素の位置が矢印Ｋ１２により示されるＧの画素に対応する位置である場合、ADRCブロック化回路１１１は、図中、矢印Ｋ１２により示されるＧの画素の左上、左下、右上、および右下に隣接する４個のＧの画素、矢印Ｋ１２により示されるＧの画素から上下方向にＢの画素を介して隣接する２個のＧの画素、および横方向にＲの画素を介して隣接する２個のＧの画素、並びに矢印Ｋ１２により示されるＧの画素からなる計９個のＧの画素をクラスタップとして抽出する。 Similarly, as shown in FIG. 7B, when the target pixel is at the position corresponding to the G pixel indicated by arrow K12, the ADRC blocking circuit 111 extracts a total of nine G pixels as the class tap: the four G pixels adjacent to the upper left, lower left, upper right, and lower right of the G pixel indicated by arrow K12; the two G pixels adjacent to it in the vertical direction across a B pixel; the two G pixels adjacent to it in the horizontal direction across an R pixel; and the G pixel indicated by arrow K12 itself.
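The nine-pixel class tap described for FIGS. 7A and 7B has the same geometry in both cases and can be written as a list of (row, column) offsets from the target position. The Python sketch below is illustrative: the ordering of the offsets, the function name, and the lack of border handling (the target is assumed to lie at least two pixels inside the image) are assumptions.

```python
# Offsets as (row, column) displacements from the target G site.
CLASS_TAP_OFFSETS = [
    (0, 0),                                # the target G pixel itself
    (-1, -1), (1, -1), (-1, 1), (1, 1),    # upper left, lower left, upper right, lower right
    (-2, 0), (2, 0),                       # above and below, across one pixel
    (0, -2), (0, 2),                       # left and right, across one pixel
]


def extract_class_tap(image, row, col):
    """Collect the nine class-tap pixels around a target G site."""
    return [image[row + dr][col + dc] for dr, dc in CLASS_TAP_OFFSETS]


# Hypothetical 5 x 5 image whose pixel value encodes its position.
image = [[r * 5 + c for c in range(5)] for r in range(5)]
tap = extract_class_tap(image, 2, 2)
```

Every offset has an even sum of row and column displacements, so on a checkerboard of G sites all nine positions are G pixels whenever the center is.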

図６のフローチャートの説明に戻り、クラスタップが抽出されると、ステップＳ４６において、ADRC処理回路１１２はADRCブロック化回路１１１から供給されたクラスタップにADRC処理を施し、その結果得られたADRCコードをADRC処理が施されたクラスタップとしてクラス分類回路１１３に供給する。 Returning to the flowchart of FIG. 6, once the class tap has been extracted, in step S46 the ADRC processing circuit 112 performs ADRC processing on the class tap supplied from the ADRC blocking circuit 111, and supplies the resulting ADRC code to the class classification circuit 113 as the ADRC-processed class tap.

例えば、ＫビットADRC処理においては、ADRC処理回路１１２は、クラスタップを構成する画素の画素値の最大値MAXおよび最小値MINを検出し、検出された画素値の最大値MAXと最小値MINとの差ＤＲ（ＤＲ=MAX-MIN）を、クラスタップを構成する画素の集合の局所的なダイナミックレンジとする。ADRC処理回路１１２はこのダイナミックレンジＤＲに基づいて、クラスタップを構成する画素（の画素値）をＫビットに再量子化する。即ち、ADRC処理回路１１２は、クラスタップを構成する各画素の画素値から最小値MINを減算し、その減算値をDR/2^Kで除算（量子化）する。 For example, in K-bit ADRC processing, the ADRC processing circuit 112 detects the maximum value MAX and the minimum value MIN of the pixel values of the pixels constituting the class tap, and takes their difference DR (DR = MAX - MIN) as the local dynamic range of the set of pixels constituting the class tap. Based on this dynamic range DR, the ADRC processing circuit 112 requantizes the pixel values of the pixels constituting the class tap to K bits. That is, the ADRC processing circuit 112 subtracts the minimum value MIN from the pixel value of each pixel constituting the class tap and divides (quantizes) the result by DR/2^K.

そして、ADRC処理回路１１２は、以上のようにして得られるクラスタップを構成するＫビットの各画素の画素値を所定の順番で並べたビット列を、ADRCコードとして出力する。したがって、クラスタップが、例えば、１ビットADRC処理された場合には、そのクラスタップを構成する各画素の画素値は、最小値MINが減算された後に最大値MAXと最小値MINとの差の１／２で除算され（小数点以下切り捨て）、これにより、各画素の画素値が１ビットとされる（２値化される）。そして、その１ビットの画素値を所定の順番で並べたビット列がADRCコードとして出力される。 The ADRC processing circuit 112 then outputs, as the ADRC code, a bit string in which the K-bit pixel values obtained in this way for the pixels constituting the class tap are arranged in a predetermined order. Accordingly, when a class tap is subjected to, for example, 1-bit ADRC processing, the minimum value MIN is subtracted from the pixel value of each pixel constituting the class tap and the result is divided by 1/2 of the difference between the maximum value MAX and the minimum value MIN (fractions truncated), so that the pixel value of each pixel becomes 1 bit (is binarized). The bit string in which these 1-bit pixel values are arranged in a predetermined order is output as the ADRC code.
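The K-bit ADRC processing described above can be sketched in Python as follows. The sketch assumes integer pixel values and packs the requantized values in the order in which the tap is given. Two details are assumptions not stated in the text: the pixel equal to MAX is clipped to the top level 2^K - 1 (plain truncation would give 2^K), and a flat tap (DR = 0) is assigned code 0.

```python
def adrc_code(class_tap, bits=1):
    """Requantize each tap pixel to `bits` bits and pack them into one code."""
    lo, hi = min(class_tap), max(class_tap)
    dr = hi - lo                  # local dynamic range DR = MAX - MIN
    levels = 1 << bits            # 2^K quantization levels
    code = 0
    for value in class_tap:
        if dr == 0:
            q = 0                 # assumption: flat tap maps to level 0
        else:
            # (value - MIN) / (DR / 2^K), fractions truncated
            q = (value - lo) * levels // dr
            q = min(q, levels - 1)  # assumption: clip MAX into the top level
        code = (code << bits) | q
    return code


one_bit = adrc_code([10, 20, 30, 40], bits=1)  # bit string 0011
two_bit = adrc_code([10, 20, 30, 40], bits=2)  # bit string 00 01 10 11
```

With a nine-pixel class tap and 1-bit ADRC, the code is a 9-bit string, so the waveform of the tap is summarized by at most 512 distinct codes regardless of its absolute brightness.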

ステップＳ４７において、クラス分類回路１１３は、ADRC処理回路１１２から供給されたADRCコードに基づいてクラス分類を行う。すなわち、クラス分類回路１１３は、クラスタップをADRC処理して得られたADRCコードに対応するクラスを決定し、そのクラスを示すクラスコードを生成して適応処理回路１１６に供給する。 In step S47, the class classification circuit 113 performs class classification based on the ADRC code supplied from the ADRC processing circuit 112. That is, the class classification circuit 113 determines a class corresponding to the ADRC code obtained by ADRC processing the class tap, generates a class code indicating the class, and supplies the generated class code to the adaptive processing circuit 116.

ステップＳ４８において、予測タップブロック化回路１１４は、ガンマ補正回路１０４から供給された画像データをｐ×ｑ個（但し、ｐおよびｑは正の整数）のブロックに分割し、分割された各ブロックから予測タップを抽出する。予測タップブロック化回路１１４は、抽出した予測タップを予測タップ正規化部１１５に供給する。 In step S48, the prediction tap blocking circuit 114 divides the image data supplied from the gamma correction circuit 104 into p × q blocks (where p and q are positive integers) and extracts a prediction tap from each block. The prediction tap blocking circuit 114 supplies the extracted prediction tap to the prediction tap normalization unit 115.

例えば、図７Ａに示したように、矢印Ｋ１１により示されるＧの画素に対応する位置が注目画素の位置である場合、図８Ａに示すように、予測タップブロック化回路１１４は、矢印Ｋ１１により示されるＧの画素を含む、そのＧの画素の周囲の５×５個の画素を予測タップとして抽出する。この場合、予測タップの画素には、Ｒの画素、Ｇの画素、およびＢの画素が含まれている。 For example, when the position corresponding to the G pixel indicated by the arrow K11 is the position of the target pixel, as shown in FIG. 7A, the prediction tap blocking circuit 114 extracts, as the prediction tap, the 5 × 5 pixels surrounding and including the G pixel indicated by the arrow K11, as shown in FIG. 8A. In this case, the pixels of the prediction tap include R pixels, G pixels, and B pixels.

また、図７Ｂに示したように、矢印Ｋ１２により示されるＧの画素に対応する位置が注目画素の位置である場合、図８Ｂに示すように、予測タップブロック化回路１１４は、矢印Ｋ１２により示されるＧの画素を含む、そのＧの画素の周囲の５×５個の画素を予測タップとして抽出する。この場合も図８Ａにおける場合と同様に、予測タップの画素には、Ｒの画素、Ｇの画素、およびＢの画素が含まれている。 Similarly, when the position corresponding to the G pixel indicated by the arrow K12 is the position of the target pixel, as shown in FIG. 7B, the prediction tap blocking circuit 114 extracts, as the prediction tap, the 5 × 5 pixels surrounding and including the G pixel indicated by the arrow K12, as shown in FIG. 8B. In this case as well, as in the case of FIG. 8A, the pixels of the prediction tap include R pixels, G pixels, and B pixels.

図６のフローチャートの説明に戻り、予測タップが抽出されると、ステップＳ４９において、予測タップ正規化部１１５は、予測タップブロック化回路１１４から供給された予測タップを正規化して、正規化された予測タップを適応処理回路１１６に供給する。 Returning to the description of the flowchart of FIG. 6, when the prediction tap is extracted, the prediction tap normalization unit 115 normalizes the prediction tap supplied from the prediction tap blocking circuit 114 in step S 49. The prediction tap is supplied to the adaptive processing circuit 116.

例えば、図９に示すように、注目画素の位置が画素Ｇ３３の位置であり、注目画素のＲの画素値を予測する場合を考える。なお、図９の１つの四角形は１つの画素を示しており、画素Ｇ１１乃至画素Ｇ５５のそれぞれは、Ｇの画素を表し、画素Ｂ１２乃至画素Ｂ５４のそれぞれは、Ｂの画素を表し、画素Ｒ２１乃至画素Ｒ４５のそれぞれは、Ｒの画素を表している。 For example, as shown in FIG. 9, consider the case where the position of the target pixel is the position of the pixel G33 and the R pixel value of the target pixel is to be predicted. Note that each square in FIG. 9 represents one pixel; each of the pixels G11 to G55 represents a G pixel, each of the pixels B12 to B54 represents a B pixel, and each of the pixels R21 to R45 represents an R pixel.

ここで、注目画素に対する予測タップを構成する画素を、画素Ｇ３３を中心とする５×５個の画素（すなわち、図９に示す全ての画素）とすると、予測タップには、画素Ｇ１１乃至画素Ｇ５５の計１３個のＧの画素、画素Ｂ１２乃至画素Ｂ５４の計６個のＢの画素、および画素Ｒ２１乃至画素Ｒ４５の計６個のＲの画素が含まれる。 Here, if the pixels constituting the prediction tap for the target pixel are the 5 × 5 pixels centered on the pixel G33 (that is, all the pixels shown in FIG. 9), the prediction tap includes a total of 13 G pixels (pixels G11 to G55), a total of 6 B pixels (pixels B12 to B54), and a total of 6 R pixels (pixels R21 to R45).
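The composition of this 5 × 5 prediction tap can be checked with a short Python sketch. The colour layout used here (G where the row and column indices have an even sum, B on odd rows, R on even rows) is inferred from the pixel names G11, B12, and R21 quoted above and is otherwise an assumption; the function names `cfa_color` and `extract_tap` are hypothetical.

```python
from collections import Counter


def cfa_color(i, j):
    """Colour of the pixel at row i, column j (1-indexed, as in the pixel names G11, B12, R21)."""
    if (i + j) % 2 == 0:
        return "G"
    return "B" if i % 2 == 1 else "R"


def extract_tap(center_i, center_j, size=5):
    """Return (row, column, colour) for every pixel of a size x size tap around the centre."""
    half = size // 2
    return [(i, j, cfa_color(i, j))
            for i in range(center_i - half, center_i + half + 1)
            for j in range(center_j - half, center_j + half + 1)]


if __name__ == "__main__":
    tap = extract_tap(3, 3)                      # tap centred on pixel G33
    print(Counter(color for _, _, color in tap))
```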

このように、Ｒの画素値を予測する場合、予測タップにＲとは異なる色のＧの画素およびＢの画素が含まれているとき、図１を参照して説明したように、Ｒの画素のダイナミックレンジに対して、Ｇの画素またはＢの画素のダイナミックレンジが大きすぎると画像に破綻が生じてしまう恐れがある。 Thus, when the R pixel value is predicted and the prediction tap includes G pixels and B pixels, whose colors differ from R, the image may break down if the dynamic range of the G pixels or the B pixels is too large relative to the dynamic range of the R pixels, as described with reference to FIG. 1.

そこで、予測タップ正規化部１１５は、予測タップを構成するＧの画素のダイナミックレンジ、およびＢの画素のダイナミックレンジが、Ｒの画素のダイナミックレンジと等しくなるように、予測タップを正規化する。 Therefore, the prediction tap normalization unit 115 normalizes the prediction tap so that the dynamic range of the G pixel and the dynamic range of the B pixel that constitute the prediction tap are equal to the dynamic range of the R pixel.

具体的には、まず、予測タップ正規化部１１５は、式（１）を計算することにより、Ｒの画素のダイナミックレンジＤＲ（Ｒ）を求める。 Specifically, first, the prediction tap normalization unit 115 calculates the dynamic range DR (R) of the R pixel by calculating Expression (1).

ＤＲ（Ｒ）＝Rmax−Rmin ・・・（１） DR (R) = Rmax−Rmin (1)

ここで、式（１）におけるRmaxおよびRminは、それぞれ予測タップを構成するＲの画素の画素値のうちの最大値および最小値を表している。したがって、例えば、図９の例では、予測タップを構成する画素Ｒ２１乃至画素Ｒ４５の６個のＲの画素の画素値のうちの最大値と最小値とが、それぞれRmaxおよびRminとされ、ＤＲ（Ｒ）は、その最大値と最小値との差の値とされる。 Here, Rmax and Rmin in Equation (1) represent the maximum value and the minimum value among the pixel values of the R pixels that constitute the prediction tap, respectively. Therefore, for example, in the example of FIG. 9, the maximum value and the minimum value of the pixel values of the six R pixels of the pixels R21 to R45 constituting the prediction tap are Rmax and Rmin, respectively, and DR ( R) is a difference value between the maximum value and the minimum value.

同様に、予測タップ正規化部１１５は、式（２）および式（３）を計算することにより、予測タップを構成するＧの画素のダイナミックレンジＤＲ（Ｇ）、およびＢの画素のダイナミックレンジＤＲ（Ｂ）を求める。 Similarly, the prediction tap normalization unit 115 obtains the dynamic range DR(G) of the G pixels and the dynamic range DR(B) of the B pixels constituting the prediction tap by calculating Expressions (2) and (3).

ＤＲ（Ｇ）＝Gmax−Gmin ・・・（２） DR (G) = Gmax−Gmin (2)

ＤＲ（Ｂ）＝Bmax−Bmin ・・・（３） DR (B) = Bmax−Bmin (3)

ここで、式（２）におけるGmaxおよびGminは、それぞれ予測タップを構成するＧの画素の画素値のうちの最大値および最小値を表しており、式（３）におけるBmaxおよびBminは、それぞれ予測タップを構成するＢの画素の画素値のうちの最大値および最小値を表している。 Here, Gmax and Gmin in Expression (2) represent the maximum value and the minimum value, respectively, among the pixel values of the G pixels constituting the prediction tap, and Bmax and Bmin in Expression (3) represent the maximum value and the minimum value, respectively, among the pixel values of the B pixels constituting the prediction tap.

次に、予測タップ正規化部１１５は、求められたＲの画素およびＧの画素のダイナミックレンジＤＲ（Ｒ）およびＤＲ（Ｇ）を用いて式（４）を計算し、予測タップを構成するＧの画素の画素値を正規化する。 Next, the prediction tap normalization unit 115 calculates Expression (4) using the obtained dynamic ranges DR(R) and DR(G) of the R pixels and the G pixels, thereby normalizing the pixel values of the G pixels constituting the prediction tap.

GNij＝Gij×（ＤＲ（Ｒ）／ＤＲ（Ｇ））・・・（４） GNij = Gij × (DR (R) / DR (G)) (4)

ここで、Gijは、図９におけるＧの画素Ｇｉｊ（但し、１≦ｉ≦５，１≦ｊ≦５）の画素値を示しており、GNijは正規化された画素Ｇｉｊの画素値を示している。したがって、例えば、予測タップ正規化部１１５は、式（４）を計算して画素Ｇ１１乃至画素Ｇ５５のそれぞれの画素値を正規化する。 Here, Gij represents the pixel value of the G pixel Gij in FIG. 9 (where 1 ≦ i ≦ 5 and 1 ≦ j ≦ 5), and GNij represents the normalized pixel value of the pixel Gij. Thus, for example, the prediction tap normalization unit 115 calculates Expression (4) to normalize the pixel value of each of the pixels G11 to G55.

同様に、予測タップ正規化部１１５は、求められたＲの画素およびＢの画素のダイナミックレンジＤＲ（Ｒ）およびＤＲ（Ｂ）を用いて式（５）を計算し、予測タップを構成するＢの画素の画素値を正規化する。 Similarly, the prediction tap normalization unit 115 calculates Expression (5) using the obtained dynamic ranges DR(R) and DR(B) of the R pixels and the B pixels, thereby normalizing the pixel values of the B pixels constituting the prediction tap.

BNij＝Bij×（ＤＲ（Ｒ）／ＤＲ（Ｂ））・・・（５） BNij = Bij × (DR (R) / DR (B)) (5)

ここで、Bijは、図９におけるＢの画素Ｂｉｊ（但し、１≦ｉ≦５，１≦ｊ≦５）の画素値を示しており、BNijは正規化された画素Ｂｉｊの画素値を示している。 Here, Bij represents the pixel value of the B pixel Bij in FIG. 9 (where 1 ≦ i ≦ 5 and 1 ≦ j ≦ 5), and BNij represents the normalized pixel value of the pixel Bij.

予測タップ正規化部１１５は、Ｇの画素の画素値、およびＢの画素の画素値を正規化すると、Ｒの画素の画素値、正規化されたＧの画素の画素値、および正規化されたＢの画素の画素値を、正規化された予測タップを構成する画素の画素値とする。 When the prediction tap normalization unit 115 normalizes the pixel value of the G pixel and the pixel value of the B pixel, the pixel value of the R pixel, the pixel value of the normalized G pixel, and the normalized pixel value Let the pixel value of the pixel of B be the pixel value of the pixel which comprises the normalized prediction tap.

また、予測タップ正規化部１１５は、注目画素のＢの画素値を予測する場合、予測タップを構成するＧの画素の画素値に、Ｂの画素のダイナミックレンジＤＲ（Ｂ）をＧの画素のダイナミックレンジＤＲ（Ｇ）で除算した値を乗算して、Ｇの画素の画素値を正規化する。また、予測タップ正規化部１１５は、Ｒの画素の画素値に、Ｂの画素のダイナミックレンジＤＲ（Ｂ）をＲの画素のダイナミックレンジＤＲ（Ｒ）で除算した値を乗算して、Ｒの画素の画素値を正規化する。そして、予測タップ正規化部１１５は、正規化されたＲの画素の画素値、正規化されたＧの画素の画素値、およびＢの画素の画素値を、正規化された予測タップを構成する画素の画素値とする。 When predicting the B pixel value of the target pixel, the prediction tap normalization unit 115 normalizes the pixel values of the G pixels constituting the prediction tap by multiplying each of them by the value obtained by dividing the dynamic range DR(B) of the B pixels by the dynamic range DR(G) of the G pixels. The prediction tap normalization unit 115 also normalizes the pixel values of the R pixels by multiplying each of them by the value obtained by dividing DR(B) by the dynamic range DR(R) of the R pixels. The prediction tap normalization unit 115 then uses the normalized R pixel values, the normalized G pixel values, and the B pixel values as the pixel values of the pixels constituting the normalized prediction tap.

同様に、予測タップ正規化部１１５は、注目画素のＧの画素値を予測する場合、予測タップを構成するＢの画素の画素値に、Ｇの画素のダイナミックレンジＤＲ（Ｇ）をＢの画素のダイナミックレンジＤＲ（Ｂ）で除算した値を乗算して、Ｂの画素の画素値を正規化する。また、予測タップ正規化部１１５は、Ｒの画素の画素値に、Ｇの画素のダイナミックレンジＤＲ（Ｇ）をＲの画素のダイナミックレンジＤＲ（Ｒ）で除算した値を乗算して、Ｒの画素の画素値を正規化する。そして、予測タップ正規化部１１５は、正規化されたＲの画素の画素値、Ｇの画素の画素値、および正規化されたＢの画素の画素値を、正規化された予測タップを構成する画素の画素値とする。 Similarly, when predicting the G pixel value of the target pixel, the prediction tap normalization unit 115 normalizes the pixel values of the B pixels constituting the prediction tap by multiplying each of them by the value obtained by dividing the dynamic range DR(G) of the G pixels by the dynamic range DR(B) of the B pixels. The prediction tap normalization unit 115 also normalizes the pixel values of the R pixels by multiplying each of them by the value obtained by dividing DR(G) by the dynamic range DR(R) of the R pixels. The prediction tap normalization unit 115 then uses the normalized R pixel values, the G pixel values, and the normalized B pixel values as the pixel values of the pixels constituting the normalized prediction tap.
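The normalization of Expressions (1) to (5), generalized to any target color as described in the three cases above, can be sketched in Python as follows. The function name `normalize_prediction_tap` and the representation of the tap as (color, value) pairs are assumptions for illustration. The specification does not state what happens when a dynamic range is zero; this sketch simply leaves such pixels unscaled.

```python
def normalize_prediction_tap(tap, target):
    """Scale the non-target colours so their dynamic range matches that of the target colour.

    tap:    list of (color, value) pairs, color being 'R', 'G' or 'B'
    target: colour whose pixel value is being predicted
    """
    # Dynamic range per colour, as in Expressions (1) to (3): DR(c) = max - min
    dr = {}
    for color in "RGB":
        values = [v for c, v in tap if c == color]
        dr[color] = max(values) - min(values) if values else 0

    normalized = []
    for color, value in tap:
        if color == target or dr[color] == 0:
            # The target colour is used as is.
            # Assumption: a colour with DR == 0 is left unscaled to avoid division by zero.
            normalized.append((color, float(value)))
        else:
            # As in Expressions (4) and (5): value * DR(target) / DR(color)
            normalized.append((color, value * dr[target] / dr[color]))
    return normalized


if __name__ == "__main__":
    tap = [("R", 100), ("R", 120), ("G", 50), ("G", 150), ("B", 80), ("B", 90)]
    print(normalize_prediction_tap(tap, "R"))
```

In this example DR(R) is 20, DR(G) is 100, and DR(B) is 10, so the G values are scaled by 0.2 and the B values by 2, after which all three colors span a range of 20.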

このように、予測タップを正規化することによって、例えば、図１０に示すように所定の位置ＢにおけるＲの画素値を予測するために、予測タップとしてＧの画素を用いても、Ｇの画素のダイナミックレンジを、Ｒの画素のダイナミックレンジと等しくなるようにすることができる。なお、図中、横軸は撮像された画像における位置を示し、縦軸は各位置における各色の成分の画素値（レベル）を示している。 By normalizing the prediction tap in this way, even when G pixels are used as prediction taps to predict the R pixel value at a predetermined position B, as shown in FIG. 10 for example, the dynamic range of the G pixels can be made equal to the dynamic range of the R pixels. In the figure, the horizontal axis indicates the position in the captured image, and the vertical axis indicates the pixel value (level) of each color component at each position.

曲線１４１は、現実世界の各位置におけるＧの値（波形）を示しており、Ｇの値は位置Ｂの前後において急峻に増加している。また、曲線１４２は、現実世界の各位置におけるＲの値（波形）を示しており、Ｒの値は曲線１４１により示されるＧの値と比べると、位置Ｂの前後においてなだらかに増加している。 A curve 141 indicates the G value (waveform) at each position in the real world, and the G value increases steeply before and after the position B. A curve 142 indicates the R value (waveform) at each position in the real world, and the R value increases gradually before and after the position B as compared to the G value indicated by the curve 141. .

さらに、図中、丸および四角形は、予測タップを構成するＧの画素（の画素値）およびＲの画素（の画素値）を示している。ここで、図１０の左側の図の丸により示される３つのＧの画素、および四角形により示される２つのＲの画素を予測タップとして用いて、クラス分類適応処理により位置ＢにおけるＲの画素値を予測する場合を考える。 Further, in the drawing, circles and squares indicate G pixels (pixel values) and R pixels (pixel values) constituting the prediction tap. Here, using the three G pixels indicated by circles in the left side of FIG. 10 and the two R pixels indicated by squares as prediction taps, the pixel value of R at position B is obtained by class classification adaptive processing. Consider the case of prediction.

この場合、予測タップとして用いられるＧの画素のダイナミックレンジは、予測タップとして用いられるＲの画素のダイナミックレンジと比較して非常に大きく、位置Ｂの付近においてＧの画素値は急激に変化している。 In this case, the dynamic range of the G pixels used as the prediction tap is very large compared with the dynamic range of the R pixels used as the prediction tap, and the G pixel value changes abruptly in the vicinity of the position B.

そこで、上述したように、Ｇの画素の画素値を正規化すると、正規化されたＧの画素値（波形）は、図１０の右側の図の曲線１４３に示すように、位置Ｂの前後において曲線１４２とほぼ同じ傾きを持ち、なだらかに増加する曲線となる。このように、Ｇの画素値を正規化すると、正規化されたＧの画素のダイナミックレンジは、Ｒの画素のダイナミックレンジと等しくなり、画像に破綻が生じてしまうこともなく、予測された注目画素の画素値は適度に強調された値となる。 When the pixel values of the G pixels are normalized as described above, the normalized G pixel values (waveform) form a gently increasing curve that has almost the same slope as the curve 142 around the position B, as shown by the curve 143 in the right-hand diagram of FIG. 10. When the G pixel values are normalized in this way, the dynamic range of the normalized G pixels becomes equal to the dynamic range of the R pixels, so the image does not break down and the predicted pixel value of the target pixel is moderately enhanced.

図６のフローチャートの説明に戻り、予測タップが正規化されると、ステップＳ５０において、適応処理回路１１６は、クラス分類回路１１３から供給されたクラスコードに対応するタップ係数を係数メモリ１１７から読み出し、そのタップ係数を予測タップ正規化部１１５からの正規化された予測タップに乗算することで、注目画素の画素値（例えば、Ｒ、Ｇ、またはＢの画素値）を予測演算する。適応処理回路１１６は、予測演算により注目画素の画素値を求めて３板CCD出力相当の画像データを生成し、補正回路１０６に供給する。 Returning to the description of the flowchart of FIG. 6, when the prediction tap is normalized, in step S 50, the adaptive processing circuit 116 reads the tap coefficient corresponding to the class code supplied from the class classification circuit 113 from the coefficient memory 117, By multiplying the normalized prediction tap from the prediction tap normalization unit 115 by the tap coefficient, the pixel value of the target pixel (for example, R, G, or B pixel value) is predicted and calculated. The adaptive processing circuit 116 obtains the pixel value of the pixel of interest by prediction calculation, generates image data corresponding to the three-plate CCD output, and supplies the image data to the correction circuit 106.

ステップＳ５１において、補間処理部１０５は、全てのブロックについてクラス分類適応処理が終了したか否かを判定する。 In step S51, the interpolation processing unit 105 determines whether or not the class classification adaptation processing has been completed for all blocks.

ステップＳ５１において、全てのブロックについてクラス分類適応処理が終了していないと判定された場合、処理はステップＳ４５に戻り、それ以降の処理が繰り返される。 If it is determined in step S51 that the class classification adaptation process has not been completed for all blocks, the process returns to step S45, and the subsequent processes are repeated.

これに対して、ステップＳ５１において、全てのブロックについてクラス分類適応処理が終了したと判定された場合、処理はステップＳ５２に進み、補正回路１０６は、適応処理回路１１６からの画像データに対して画像を視覚的に良く見せるための補正処理（いわゆる画像作り）を施し、補正処理が施された画像データをRGBマトリクス回路１０７に供給する。 On the other hand, if it is determined in step S51 that the class classification adaptive processing has been completed for all the blocks, the processing proceeds to step S52, and the correction circuit 106 performs image processing on the image data from the adaptive processing circuit 116. Correction processing (so-called image creation) is performed to make the image look good visually, and the image data subjected to the correction processing is supplied to the RGB matrix circuit 107.

ステップＳ５３において、RGBマトリクス回路１０７は、必要に応じて、補正回路１０６から供給された画像データとしてのＲ、Ｇ、およびＢの各色信号をYUV方式の画像データに変換するなどの色空間の変換処理を行い、変換処理により得られた画像データを記録媒体インターフェース６３に供給し、画像信号処理は終了する。 In step S53, the RGB matrix circuit 107 converts the color space such as converting the R, G, and B color signals as image data supplied from the correction circuit 106 into YUV image data as necessary. The image data obtained by the conversion process is supplied to the recording medium interface 63, and the image signal process ends.

このようにして、画像信号処理回路５７は、画像データから抽出した予測タップを正規化し、正規化した予測タップを用いてクラス分類適応処理を行う。 In this way, the image signal processing circuit 57 normalizes the prediction tap extracted from the image data, and performs the class classification adaptation process using the normalized prediction tap.

このように、予測タップを正規化してクラス分類適応処理を行うことで、予測タップの各色の画素のダイナミックレンジを等しくすることができ、画像の破綻を抑制することができる。これにより、単板のCCD５４から出力された画像データから、より高精細な３板CCD出力相当の画像データをより精度よく生成することができる。 In this way, by performing the classification adaptation process by normalizing the prediction tap, the dynamic range of the pixels of each color of the prediction tap can be made equal, and the failure of the image can be suppressed. As a result, higher-definition image data equivalent to a three-plate CCD output can be generated with higher accuracy from the image data output from the single-plate CCD 54.

しかも、クラス分類適応処理により、３板CCD出力相当の画像データとしてのＲの画像データ、Ｇの画像データ、およびＢの画像データを得ることができるので、エッジ部分や細部の鮮鋭度が増して、Ｓ／Ｎ評価値も向上させることができ、より鮮明な画像を得ることができる。なお、図７および図８に注目画素に対するクラスタップおよび予測タップの一例を示したが、クラスタップまたは予測タップを構成する画素の位置は任意であり、それぞれ最も効率のよいように定められる。 Moreover, because the class classification adaptive processing yields R image data, G image data, and B image data as image data equivalent to a three-plate CCD output, the sharpness of edges and fine details increases, the S/N evaluation value can also be improved, and a clearer image can be obtained. Although FIGS. 7 and 8 show examples of the class tap and the prediction tap for the target pixel, the positions of the pixels constituting the class tap or the prediction tap are arbitrary, and each is chosen to be as efficient as possible.

次に、図５の適応処理回路１１６における予測演算と、係数メモリ１１７に記憶されたタップ係数の学習について説明する。 Next, prediction calculation in the adaptive processing circuit 116 in FIG. 5 and learning of tap coefficients stored in the coefficient memory 117 will be described.

例えば、クラス分類適応処理として、単板のCCD５４から出力された画像データから予測タップを抽出し、その予測タップとタップ係数とを用いて、３板CCD出力相当の画像の画素（以下、適宜、高画質画素と称する）の画素値を所定の予測演算によって求める（予測する）ことを考える。 For example, as the class classification adaptive processing, a prediction tap is extracted from image data output from a single-plate CCD 54, and an image pixel corresponding to a three-plate CCD output (hereinafter, appropriately, using the prediction tap and the tap coefficient). Let us consider obtaining (predicting) a pixel value of a high-quality pixel) by a predetermined prediction calculation.

所定の予測演算として、例えば、線形１次予測演算を採用することとすると、高画質画素の画素値ｙは、式（６）に示す線形１次式によって求められることになる。 If, for example, a linear primary prediction calculation is adopted as the predetermined prediction calculation, the pixel value y of the high-quality pixel is obtained by a linear primary expression shown in Expression (6).

$$ y = \sum_{i=1}^{N} w_i x_i \quad \cdots (6) $$

但し、式（６）において、ｘ_iは、高画質画素の画素値ｙについての予測タップを構成する、ｉ番目の単板のCCD５４から出力された画像データの画素（以下、適宜、低画質画素と称する）の画素値を表し、ｗ_iは、ｉ番目の低画質画素（の画素値）と乗算されるｉ番目のタップ係数を表す。なお、式（６）では、予測タップがＮ個の低画質画素ｘ₁，ｘ₂，・・・，ｘ_Nで構成されるものとしてある。 In Expression (6), x_i represents the pixel value of the i-th pixel of the image data output from the single-plate CCD 54 (hereinafter referred to as a low-quality pixel as appropriate) that constitutes the prediction tap for the pixel value y of the high-quality pixel, and w_i represents the i-th tap coefficient, which is multiplied by (the pixel value of) the i-th low-quality pixel. In Expression (6), the prediction tap is assumed to consist of N low-quality pixels x_1, x_2, ..., x_N.
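The linear first-order prediction of Expression (6), combined with the per-class lookup of tap coefficients performed by the adaptive processing circuit 116, can be sketched in Python as follows. The function name `predict_pixel`, the dictionary standing in for the coefficient memory 117, and the coefficient values are all hypothetical and chosen only for illustration.

```python
def predict_pixel(prediction_tap, class_code, coefficient_memory):
    """Weighted sum of the prediction-tap pixels using the tap coefficients of one class."""
    weights = coefficient_memory[class_code]      # tap coefficients w_1 .. w_N for this class
    if len(weights) != len(prediction_tap):
        raise ValueError("tap and coefficient lengths differ")
    # y = w_1 * x_1 + w_2 * x_2 + ... + w_N * x_N
    return sum(w * x for w, x in zip(weights, prediction_tap))


if __name__ == "__main__":
    # Made-up coefficients for two classes and a three-pixel (normalized) prediction tap
    coefficient_memory = {0: [0.25, 0.5, 0.25], 1: [0.0, 1.0, 0.0]}
    print(predict_pixel([100, 120, 160], 0, coefficient_memory))
    print(predict_pixel([100, 120, 160], 1, coefficient_memory))
```

Because the coefficients are selected by class code, the same tap values can yield different predictions depending on the local pattern captured by the class tap.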

ここで、高画質画素の画素値ｙは、式（６）に示した線形１次式ではなく、２次以上の高次の線形関数の式によって求めるようにすることも可能である。また、線形関数に限らず、非線形関数によって高画質画素の画素値を求めるようにしてもよい。 Here, the pixel value y of the high-quality pixel can be obtained not by the linear primary expression shown in Expression (6) but by an expression of a higher-order linear function of the second or higher order. Further, not only the linear function but also the pixel value of the high image quality pixel may be obtained by a non-linear function.

いま、第ｓサンプルの高画質画素の画素値の真値をｙ_sと表すとともに、式（６）によって得られるその真値ｙ_sの予測値をｙ_s'と表すと、その予測誤差ｅ_sは、次式（７）で表される。 Now, if the true value of the pixel value of the high-quality pixel of the s-th sample is denoted by y_s, and the predicted value of the true value y_s obtained by Expression (6) is denoted by y_s', the prediction error e_s is given by the following Expression (7).

ｅ_s＝（ｙ_s−ｙ_s'）・・・（７） e _s = (y _s −y _s ′) (7)

いま、式（７）の予測値ｙ_s'は、式（６）にしたがって求められるため、式（７）のｙ_s'を、式（６）にしたがって置き換えると、次式（８）が得られる。 Since the predicted value y_s' in Expression (7) is obtained according to Expression (6), replacing y_s' in Expression (7) according to Expression (6) yields the following Expression (8).

$$ e_s = y_s - \left( \sum_{i=1}^{N} w_i x_{s,i} \right) \quad \cdots (8) $$

但し、式（８）において、ｘ_s,iは、第ｓサンプルの高画質画素についての予測タップを構成するｉ番目の低画質画素を表す。 In Expression (8), x_{s,i} represents the i-th low-quality pixel constituting the prediction tap for the high-quality pixel of the s-th sample.

式（８）（または式（７））の予測誤差ｅ_sを０とするタップ係数ｗ_iが、高画質画素を予測するのに最適なものとなるが、すべての高画質画素について、そのようなタップ係数ｗ_iを求めることは、一般には困難である。 A tap coefficient w_i that makes the prediction error e_s in Expression (8) (or Expression (7)) zero is optimal for predicting the high-quality pixel, but it is generally difficult to obtain such a tap coefficient w_i for all high-quality pixels.

そこで、タップ係数ｗ_iが最適なものであることを表す規範として、例えば、最小自乗法を採用することとすると、最適なタップ係数ｗ_iは、次式（９）で表される自乗誤差の総和Ｅを最小にすることで求めることができる。 Therefore, as a standard indicating that the tap coefficient w _i is optimum, for example, when the least square method is adopted, the optimum tap coefficient w _i is expressed by the square error represented by the following equation (9). It can be obtained by minimizing the sum E.

$$ E = \sum_{s=1}^{S} e_s^2 \quad \cdots (9) $$

但し、式（９）において、Ｓは、高画質画素ｙ_sと、その高画質画素ｙ_sについての予測タップを構成する低画質画素ｘ_s,1，ｘ_s,2，・・・，ｘ_s,Nとのセットのサンプル数（学習用のサンプルの数）を表す。 In Expression (9), S represents the number of samples (the number of learning samples), each being a set of a high-quality pixel y_s and the low-quality pixels x_{s,1}, x_{s,2}, ..., x_{s,N} constituting the prediction tap for that high-quality pixel y_s.

式（９）の自乗誤差の総和Ｅの最小値（極小値）は、式（１０）に示すように、総和Ｅをタップ係数ｗ_iで偏微分したものを０とするｗ_iによって与えられる。 The minimum (local minimum) of the sum E of squared errors in Expression (9) is given by the w_i for which the partial derivative of E with respect to the tap coefficient w_i is zero, as shown in Expression (10).

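$$ \frac{\partial E}{\partial w_i} = \sum_{s=1}^{S} 2 e_s \frac{\partial e_s}{\partial w_i} = 0 \quad (i = 1, 2, \ldots, N) \quad \cdots (10) $$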

一方、上述の式（８）をタップ係数ｗ_iで偏微分すると、次式（１１）が得られる。 On the other hand, when the above equation (8) is partially differentiated by the tap coefficient w _i , the following equation (11) is obtained.

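$$ \frac{\partial e_s}{\partial w_1} = -x_{s,1},\ \frac{\partial e_s}{\partial w_2} = -x_{s,2},\ \ldots,\ \frac{\partial e_s}{\partial w_N} = -x_{s,N} \quad (s = 1, 2, \ldots, S) \quad \cdots (11) $$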

式（１０）および式（１１）から、次式（１２）が得られる。 From the equations (10) and (11), the following equation (12) is obtained.

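$$ \sum_{s=1}^{S} e_s x_{s,1} = 0,\ \sum_{s=1}^{S} e_s x_{s,2} = 0,\ \ldots,\ \sum_{s=1}^{S} e_s x_{s,N} = 0 \quad \cdots (12) $$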

式（１２）のｅ_sに、式（８）を代入することにより、式（１２）は、式（１３）に示す正規方程式で表すことができる。 Substituting Expression (8) for e_s in Expression (12) allows Expression (12) to be written as the normal equation shown in Expression (13).

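$$
\begin{pmatrix}
\sum_{s=1}^{S} x_{s,1} x_{s,1} & \sum_{s=1}^{S} x_{s,1} x_{s,2} & \cdots & \sum_{s=1}^{S} x_{s,1} x_{s,N} \\
\sum_{s=1}^{S} x_{s,2} x_{s,1} & \sum_{s=1}^{S} x_{s,2} x_{s,2} & \cdots & \sum_{s=1}^{S} x_{s,2} x_{s,N} \\
\vdots & \vdots & \ddots & \vdots \\
\sum_{s=1}^{S} x_{s,N} x_{s,1} & \sum_{s=1}^{S} x_{s,N} x_{s,2} & \cdots & \sum_{s=1}^{S} x_{s,N} x_{s,N}
\end{pmatrix}
\begin{pmatrix} w_1 \\ w_2 \\ \vdots \\ w_N \end{pmatrix}
=
\begin{pmatrix}
\sum_{s=1}^{S} x_{s,1} y_s \\ \sum_{s=1}^{S} x_{s,2} y_s \\ \vdots \\ \sum_{s=1}^{S} x_{s,N} y_s
\end{pmatrix}
\quad \cdots (13)
$$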

また、Ｘ_i,j，Ｙ_iをそれぞれ式（１４）および式（１５）により定義すると（但し、１≦ｉ≦Ｎ，１≦ｊ≦Ｎ）、式（１３）は、式（１６）により表すことができる。 If X_{i,j} and Y_i are defined by Expressions (14) and (15), respectively (where 1 ≦ i ≦ N and 1 ≦ j ≦ N), Expression (13) can be expressed by Expression (16).

$$ X_{i,j} = \sum_{s=1}^{S} x_{s,i} x_{s,j} \quad \cdots (14) $$

$$ Y_i = \sum_{s=1}^{S} x_{s,i} y_s \quad \cdots (15) $$

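$$
\begin{pmatrix}
X_{1,1} & X_{1,2} & \cdots & X_{1,N} \\
X_{2,1} & X_{2,2} & \cdots & X_{2,N} \\
\vdots & \vdots & \ddots & \vdots \\
X_{N,1} & X_{N,2} & \cdots & X_{N,N}
\end{pmatrix}
\begin{pmatrix} w_1 \\ w_2 \\ \vdots \\ w_N \end{pmatrix}
=
\begin{pmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_N \end{pmatrix}
\quad \cdots (16)
$$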

式（１６）の正規方程式は、例えば、掃き出し法（Gauss-Jordanの消去法）などを用いることにより、タップ係数ｗ_iについて解くことができる。 The normal equation of Expression (16) can be solved for the tap coefficient w _i by using, for example, a sweeping method (Gauss-Jordan elimination method) or the like.

式（１６）の正規方程式を、クラスごとに立てて解くことにより、最適なタップ係数ｗ_i（ここでは、自乗誤差の総和Ｅを最小にするタップ係数ｗ_i）を、クラスごとに求めることができる。 By setting up and solving the normal equation of Expression (16) for each class, the optimum tap coefficient w_i (here, the tap coefficient w_i that minimizes the sum E of squared errors) can be obtained for each class.
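As an illustration of solving the normal equation by the sweeping method (Gauss-Jordan elimination), the following Python sketch solves a small linear system for the tap coefficients of one class. The function name `gauss_jordan_solve`, the use of partial pivoting, and the singularity threshold are assumptions added for numerical robustness; the specification names the elimination method only as one example.

```python
def gauss_jordan_solve(a, b):
    """Solve the linear system a * w = b by Gauss-Jordan elimination with partial pivoting.

    a: N x N matrix of the accumulated sums X_{i,j} (list of lists)
    b: length-N vector of the accumulated sums Y_i
    """
    n = len(b)
    # Augmented matrix [a | b]
    aug = [list(map(float, row)) + [float(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        # Partial pivoting: pick the row with the largest entry in this column
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular normal equation (too few samples for this class?)")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Scale the pivot row so that the pivot becomes 1
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        # Eliminate this column from every other row
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [vr - factor * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


if __name__ == "__main__":
    # 2 w1 + w2 = 5 and w1 + 3 w2 = 10
    print(gauss_jordan_solve([[2, 1], [1, 3]], [5, 10]))
```

A class that receives too few learning samples produces a singular matrix, which is why the sketch raises an error in that case; how such classes are handled in practice is not stated here.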

図５の補間処理部１０５では、以上のようなクラスごとのタップ係数ｗ_iを用いて、式（６）の演算を行うことにより、単板のCCD５４から出力された画像データから、３板CCD出力相当の画像データを得ることができる。 In the interpolation processing unit 105 of FIG. 5, by using the tap coefficient w _i for each class as described above, the calculation of Expression (6) is performed, so that the three-plate CCD is obtained from the image data output from the single-plate CCD 54. Image data corresponding to the output can be obtained.

次に、図１１は、式（１６）の正規方程式をクラスごとに立てて解くことによりタップ係数ｗ_iを求める学習を行う学習装置の構成例を示している。 Next, FIG. 11 shows a configuration example of a learning apparatus that performs learning for obtaining the tap coefficient w _i by solving the normal equation of Expression (16) for each class.

学習装置２０１は、間引き回路２１１、ADRCブロック化回路２１２、ADRC処理回路２１３、クラス分類回路２１４、予測タップブロック化回路２１５、予測タップ正規化部２１６、教師画像ブロック化回路２１７、演算回路２１８、学習データメモリ２１９、演算回路２２０、および係数メモリ２２１から構成される。 The learning device 201 includes a thinning circuit 211, an ADRC blocking circuit 212, an ADRC processing circuit 213, a class classification circuit 214, a prediction tap blocking circuit 215, a prediction tap normalizing unit 216, a teacher image blocking circuit 217, an arithmetic circuit 218, A learning data memory 219, an arithmetic circuit 220, and a coefficient memory 221 are included.

この学習装置２０１には、タップ係数ｗ_iの学習に用いられる３板CCD出力相当の画像データ（以下、教師画像データとも称する）が入力されるようになされている。学習装置２０１において、教師画像データは間引き回路２１１および教師画像ブロック化回路２１７に供給される。 The learning device 201 is input with image data equivalent to a 3-plate CCD output (hereinafter also referred to as teacher image data) used for learning the tap coefficient w _i . In the learning device 201, the teacher image data is supplied to the thinning circuit 211 and the teacher image blocking circuit 217.

間引き回路２１１は、教師画像データから、色フィルタアレイの各色の配置にしたがって画素を間引く間引き処理を行う。この間引き処理は、撮像装置４１（図３）のCCD５４の前面に配置される色フィルタアレイ５３（光学ローパスフィルタ）を想定したフィルタをかけることによって行う。 The thinning circuit 211 performs thinning processing for thinning out pixels from the teacher image data according to the arrangement of each color in the color filter array. This thinning process is performed by applying a filter assuming a color filter array 53 (optical low-pass filter) disposed in front of the CCD 54 of the imaging device 41 (FIG. 3).

また、間引き回路２１１は、教師画像データに対して間引き処理を施すことによって得られた、単板CCD出力相当の画像データ（以下、生徒画像データとも称する）をADRCブロック化回路２１２および予測タップブロック化回路２１５に供給する。 Further, the thinning circuit 211 converts the image data equivalent to the single-plate CCD output (hereinafter also referred to as student image data) obtained by performing the thinning process on the teacher image data into the ADRC blocking circuit 212 and the prediction tap block. Is supplied to the circuit 215.
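The generation of student image data from teacher image data can be sketched in Python as follows. This sketch keeps only the color component selected by the color filter at each pixel position; it omits the optical low-pass filtering mentioned above. The color layout (G where the row and column indices have an even sum, B on odd rows, R on even rows) and the function names `cfa_color` and `thin_to_mosaic` are assumptions for illustration.

```python
def cfa_color(i, j):
    """Assumed colour filter layout for the pixel at row i, column j (1-indexed)."""
    if (i + j) % 2 == 0:
        return "G"
    return "B" if i % 2 == 1 else "R"


def thin_to_mosaic(teacher_rgb):
    """Keep one colour component per pixel of a full-colour image.

    teacher_rgb: 2-D list of (R, G, B) tuples (teacher image data)
    returns:     2-D list with a single value per pixel (student image data)
    """
    channel = {"R": 0, "G": 1, "B": 2}
    mosaic = []
    for i, row in enumerate(teacher_rgb, start=1):
        mosaic.append([pixel[channel[cfa_color(i, j)]]
                       for j, pixel in enumerate(row, start=1)])
    return mosaic


if __name__ == "__main__":
    teacher = [[(1, 2, 3), (4, 5, 6)],
               [(7, 8, 9), (10, 11, 12)]]
    print(thin_to_mosaic(teacher))
```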

ADRCブロック化回路２１２は、教師画像データにおける注目画素との対応をとりながら、間引き回路２１１から供給された生徒画像データから、生徒画像データを構成する画素のうちの所定の画素をクラスタップとして抽出する。また、ADRCブロック化回路２１２は、抽出したクラスタップをADRC処理回路２１３に供給する。 The ADRC blocking circuit 212 extracts, as a class tap, predetermined pixels from among the pixels constituting the student image data supplied from the thinning circuit 211, while maintaining correspondence with the target pixel in the teacher image data. The ADRC blocking circuit 212 also supplies the extracted class tap to the ADRC processing circuit 213.

ADRC処理回路２１３は、ADRCブロック化回路２１２から供給されたクラスタップにADRC処理を施し、ADRC処理が施されたクラスタップをクラス分類回路２１４に供給する。クラス分類回路２１４は、ADRC処理回路２１３からのクラスタップに基づいて注目画素をクラス分類し、分類されたクラスを示すクラスコード（クラス番号）を演算回路２１８に供給する。 The ADRC processing circuit 213 performs ADRC processing on the class tap supplied from the ADRC blocking circuit 212 and supplies the class tap subjected to ADRC processing to the class classification circuit 214. The class classification circuit 214 classifies the target pixel based on the class tap from the ADRC processing circuit 213 and supplies a class code (class number) indicating the classified class to the arithmetic circuit 218.

予測タップブロック化回路２１５は、教師画像データにおける注目画素との対応をとりながら、間引き回路２１１から供給された生徒画像データから、生徒画像データを構成する画素のうちの所定の画素を予測タップとして抽出する。また、予測タップブロック化回路２１５は、抽出した予測タップを予測タップ正規化部２１６に供給する。 The prediction tap blocking circuit 215 uses a predetermined pixel among the pixels constituting the student image data as a prediction tap from the student image data supplied from the thinning circuit 211 while taking correspondence with the target pixel in the teacher image data. Extract. Further, the prediction tap blocking circuit 215 supplies the extracted prediction tap to the prediction tap normalization unit 216.

予測タップ正規化部２１６は、予測タップブロック化回路２１５から供給された予測タップを正規化して、正規化された予測タップを演算回路２１８に供給する。 The prediction tap normalization unit 216 normalizes the prediction tap supplied from the prediction tap blocking circuit 215 and supplies the normalized prediction tap to the arithmetic circuit 218.

教師画像ブロック化回路２１７は、生徒画像データにおけるクラスタップとの対応を取りながら、教師画像データを構成する画素を順次、注目画素とし、教師画像データから注目画素（の画素値）を抽出して演算回路２１８に供給する。 The teacher image blocking circuit 217 extracts the target pixel (the pixel value) from the teacher image data by sequentially setting the pixel constituting the teacher image data as the target pixel while taking correspondence with the class tap in the student image data. This is supplied to the arithmetic circuit 218.

演算回路２１８は、教師画像ブロック化回路２１７から供給された注目画素と、予測タップ正規化部２１６からの正規化された予測タップとの対応をとりながら、注目画素と予測タップを構成する画素とを対象とした足し込みを、クラス分類回路２１４からのクラスコードにしたがって行う。 The arithmetic circuit 218 performs addition (accumulation) over the target pixel and the pixels constituting the prediction tap, in accordance with the class code from the class classification circuit 214, while maintaining correspondence between the target pixel supplied from the teacher image blocking circuit 217 and the normalized prediction tap from the prediction tap normalization unit 216.

すなわち、演算回路２１８には、教師画像データの注目画素の画素値ｙ_s、予測タップ（を構成する生徒画像データの画素の画素値）ｘ_s,i、注目画素のクラスを表すクラスコードが供給される。 That is, the arithmetic circuit 218 is supplied with the pixel value y_s of the target pixel of the teacher image data, the prediction tap (the pixel values of the student image data pixels constituting it) x_{s,i}, and the class code representing the class of the target pixel.

そして、演算回路２１８はクラスコードに対応するクラスごとに、正規化された予測タップ（生徒画像データ）ｘ_s,iを用い、式（１３）の左辺の行列における生徒画像データどうしの乗算（ｘ_s,iｘ_s,j）、およびその乗算された値の総和の演算に相当する演算を行う。 Then, for each class corresponding to the class code, the arithmetic circuit 218 uses the normalized prediction tap (student image data) x_{s,i} to perform calculations corresponding to the multiplication of student image data by each other (x_{s,i} x_{s,j}) in the matrix on the left side of Expression (13) and the summation of the products.

さらに、演算回路２１８は、クラスコードに対応するクラスごとに、予測タップ（生徒画像データ）ｘ_s,iおよび教師画像データｙ_sを用い、式（１３）の右辺のベクトルにおける生徒画像データｘ_s,iおよび教師画像データｙ_sの乗算（ｘ_s,iｙ_s）、並びにその乗算された値の総和の演算に相当する演算を行う。 Further, for each class corresponding to the class code, the arithmetic circuit 218 uses the prediction tap (student image data) x_{s,i} and the teacher image data y_s to perform calculations corresponding to the multiplication of the student image data x_{s,i} by the teacher image data y_s (x_{s,i} y_s) in the vector on the right side of Expression (13) and the summation of the products.

すなわち、演算回路２１８は、前回、注目画素とされた教師画像データについて求められた式（１３）における左辺の行列のコンポーネント（Σｘ_s,iｘ_s,j）と、右辺のベクトルのコンポーネント（Σｘ_s,iｙ_s）を記憶しており、その行列のコンポーネント（Σｘ_s,iｘ_s,j）またはベクトルのコンポーネント（Σｘ_s,iｙ_s）に対して、新たに注目画素とされた教師画像データについて、その教師画像データｙ_s+1および生徒データｘ_s+1,iを用いて計算される、対応するコンポーネントｘ_s+1,iｘ_s+1,jまたはｘ_s+1,iｙ_s+1を足し込む（式（１３）のΣで表される加算を行う）。 That is, the arithmetic circuit 218 stores the components (Σ x_{s,i} x_{s,j}) of the matrix on the left side and the components (Σ x_{s,i} y_s) of the vector on the right side of Expression (13) obtained for the teacher image data previously used as the target pixel. For the teacher image data newly set as the target pixel, it adds the corresponding component x_{s+1,i} x_{s+1,j} or x_{s+1,i} y_{s+1}, calculated from the teacher image data y_{s+1} and the student data x_{s+1,i}, to the stored matrix component or vector component (that is, it performs the addition represented by Σ in Expression (13)).

そして、演算回路２１８は、教師画像データの全ての画素を注目画素として、上述の足し込みを行うことにより、各クラスについて、式（１３）に示した正規方程式を立てる（生成する）と、その正規方程式を学習データメモリ２１９に供給する。 Then, the arithmetic circuit 218 establishes (generates) the normal equation shown in the equation (13) for each class by performing the above addition using all the pixels of the teacher image data as the target pixel, The normal equation is supplied to the learning data memory 219.
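The per-class accumulation performed by the arithmetic circuit 218 can be sketched in Python as a running sum of products that is updated once per target pixel. The class name `NormalEquationAccumulator` and its attribute names are hypothetical; the sketch only shows how the matrix and vector of the normal equation might be built up sample by sample for each class code.

```python
from collections import defaultdict


class NormalEquationAccumulator:
    """Running sums of x_i * x_j and x_i * y, kept separately for each class code."""

    def __init__(self, n_taps):
        self.n = n_taps
        # One N x N matrix and one length-N vector per class, created on first use
        self.x = defaultdict(lambda: [[0.0] * n_taps for _ in range(n_taps)])
        self.y = defaultdict(lambda: [0.0] * n_taps)

    def add(self, class_code, tap, teacher_value):
        """Add the contribution of one (normalized prediction tap, teacher pixel) sample."""
        xm, yv = self.x[class_code], self.y[class_code]
        for i in range(self.n):
            for j in range(self.n):
                xm[i][j] += tap[i] * tap[j]        # left-side matrix component
            yv[i] += tap[i] * teacher_value        # right-side vector component


if __name__ == "__main__":
    acc = NormalEquationAccumulator(2)
    acc.add(3, [1.0, 2.0], 5.0)
    acc.add(3, [3.0, 1.0], 4.0)
    print(acc.x[3], acc.y[3])
```

Only the sums need to be stored, not the individual samples, so the memory required per class depends on the tap size N rather than on the number of learning samples.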

学習データメモリ２１９は、演算回路２１８による足し込みによって得られた正規方程式（のマトリクス係数）を逐次読み込んで記憶する。また、学習データメモリ２１９は、記憶している正規方程式を演算回路２２０に供給する。 The learning data memory 219 sequentially reads and stores normal equations (matrix coefficients thereof) obtained by addition by the arithmetic circuit 218. The learning data memory 219 supplies the stored normal equation to the arithmetic circuit 220.

演算回路２２０は、学習データメモリ２１９から供給される各クラスについての正規方程式を解くことにより、各クラスについて、最適なタップ係数ｗ_iを求めて係数メモリ２２１に供給する。係数メモリ２２１は、演算回路２２０から供給された各クラスのタップ係数ｗ_iを記憶する。 The arithmetic circuit 220 solves the normal equation for each class supplied from the learning data memory 219 to obtain the optimum tap coefficient w _i for each class and supplies it to the coefficient memory 221. The coefficient memory 221 stores the tap coefficient w _i of each class supplied from the arithmetic circuit 220.

Next, the learning process performed by the learning device 201 is described with reference to the flowchart of FIG. 12. This learning process starts when image data equivalent to a three-plate CCD output is supplied, as teacher image data, to the thinning circuit 211 and the teacher image blocking circuit 217 of the learning device 201.

In step S81, the thinning circuit 211 thins out pixels from the teacher image data to generate image data equivalent to a single-plate CCD output as student image data, and supplies it to the ADRC blocking circuit 212 and the prediction tap blocking circuit 215.

In step S82, the ADRC blocking circuit 212 divides the student image data supplied from the thinning circuit 211 into p × q blocks (where p and q are positive integers) and extracts a class tap from each block. The ADRC blocking circuit 212 supplies the extracted class taps to the ADRC processing circuit 213.

In step S83, the ADRC processing circuit 213 applies ADRC processing to the class tap supplied from the ADRC blocking circuit 212 and supplies the resulting ADRC code, as the ADRC-processed class tap, to the class classification circuit 214.
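
For orientation, ADRC processing of a class tap can be sketched as below. This follows the commonly used form of ADRC (requantizing each tap value relative to the minimum and the dynamic range of the tap); the function name `adrc_code`, the definition of the dynamic range as max - min + 1, the default of one bit per pixel, and the bit ordering are all assumptions of this sketch rather than details taken from this section.

```python
def adrc_code(class_tap, bits=1):
    """Requantize integer tap values to `bits` bits each and pack them into one code."""
    lo, hi = min(class_tap), max(class_tap)
    dr = hi - lo + 1  # assumed dynamic range; +1 keeps each quantized value below 2**bits
    code = 0
    for value in class_tap:
        q = ((value - lo) << bits) // dr  # requantized value in 0 .. 2**bits - 1
        code = (code << bits) | q         # earlier taps end up in the higher bits
    return code
```

With one bit per pixel, a class tap of nine pixels yields at most 512 distinct codes, which illustrates how ADRC keeps the number of classes manageable.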

In step S84, the class classification circuit 214 classifies the pixel of interest based on the class tap from the ADRC processing circuit 213 and supplies a class code indicating the resulting class to the arithmetic circuit 218.

In step S85, the prediction tap blocking circuit 215 divides the student image data supplied from the thinning circuit 211 into p × q blocks (where p and q are positive integers) and extracts a prediction tap from each block. The prediction tap blocking circuit 215 supplies the extracted prediction taps to the prediction tap normalization unit 216.

In step S86, the prediction tap normalization unit 216 normalizes the prediction tap supplied from the prediction tap blocking circuit 215 and supplies the normalized prediction tap to the arithmetic circuit 218. For example, as described with reference to FIG. 9, the prediction tap normalization unit 216 normalizes the prediction tap so that the dynamic range of the pixels of the predetermined color, for which the tap coefficients of the pixel of interest are to be predicted, equals the dynamic range of the pixels of the other colors.
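
A minimal sketch of this normalization is given below. It follows the rule stated in the claims, in which the values of the other component are multiplied by the ratio of the two dynamic ranges. The function name `normalize_prediction_tap`, the representation of a tap as parallel lists of values and color labels, and the handling of a flat color (left unscaled) are assumptions of this sketch.

```python
def normalize_prediction_tap(values, colors, target_color):
    """Scale non-target colors so their dynamic range matches the target color's.

    values: pixel values of the prediction tap
    colors: color label of each tap pixel, e.g. "R", "G", "B"
    The tap is assumed to contain at least one pixel of target_color.
    """
    def dynamic_range(color):
        vals = [v for v, c in zip(values, colors) if c == color]
        return max(vals) - min(vals)

    dr_target = dynamic_range(target_color)
    out = []
    for v, c in zip(values, colors):
        if c == target_color:
            out.append(float(v))
            continue
        dr_other = dynamic_range(c)
        # Assumption: a color with zero dynamic range is left unscaled.
        gain = dr_target / dr_other if dr_other else 1.0
        out.append(v * gain)
    return out
```

For a tap with G values 10, 30, 50 (dynamic range 40) and R values 100, 120 (dynamic range 20), predicting G scales the R values by 2, so that both colors span a range of 40.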

In step S87, the teacher image blocking circuit 217, while maintaining correspondence with the class taps in the student image data, sequentially takes each pixel constituting the teacher image data as the pixel of interest, extracts the pixel of interest (its R, G, and B pixel values) from the teacher image data, and supplies it to the arithmetic circuit 218.

In step S88, the arithmetic circuit 218, while matching the pixel of interest supplied from the teacher image blocking circuit 217 with the normalized prediction tap from the prediction tap normalization unit 216, performs the addition into the normal equation of equation (13) set up for the class corresponding to the class code from the class classification circuit 214, using the pixel of interest and the pixels constituting the prediction tap.

In step S89, the arithmetic circuit 218 determines whether the addition has been performed for all blocks.

If it is determined in step S89 that the addition has not yet been performed for all blocks, the process returns to step S88, and the addition is performed for a block that has not yet been processed.

If, on the other hand, it is determined in step S89 that the addition has been performed for all blocks, the arithmetic circuit 218 supplies the normal equations set up for each class to the learning data memory 219, which stores them. The learning data memory 219 then supplies the stored normal equations to the arithmetic circuit 220, and the process proceeds to step S90.

In step S90, the arithmetic circuit 220 solves the normal equation for each class supplied from the learning data memory 219, for example by the sweep-out method (Gauss-Jordan elimination), thereby obtaining the tap coefficients w_i of each class, and supplies them to the coefficient memory 221. In this way, the tap coefficients w_i used to predict the R pixel value, the G pixel value, and the B pixel value of each class are obtained.
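
The sweep-out method named in step S90 can be sketched as follows for a single class. The function name `solve_gauss_jordan` is hypothetical, and the partial pivoting and the singularity check are additions made here for numerical robustness; the text itself only names the method.

```python
def solve_gauss_jordan(A, b):
    """Solve A w = b for w by Gauss-Jordan elimination on the augmented matrix."""
    n = len(b)
    # Build [A | b] as floats; copying keeps the caller's data untouched.
    M = [list(map(float, row)) + [float(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        # Partial pivoting: pick the row with the largest entry in this column.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("singular normal equation: too few samples for this class")
        M[col], M[pivot] = M[pivot], M[col]
        p = M[col][col]
        M[col] = [v / p for v in M[col]]
        # Sweep the column out of every other row.
        for r in range(n):
            if r != col:
                f = M[r][col]
                M[r] = [vr - f * vc for vr, vc in zip(M[r], M[col])]
    return [row[n] for row in M]
```

A class that received too few learning samples produces a singular matrix; how the embodiment treats such classes is not stated in this section, so the sketch simply raises an error.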

In step S91, the arithmetic circuit 220 determines whether the tap coefficients w_i have been obtained for all classes.

If it is determined in step S91 that the tap coefficients w_i have not yet been obtained for all classes, the process returns to step S90, and the normal equation of a class whose tap coefficients w_i have not yet been obtained is solved to obtain them.

If, on the other hand, it is determined in step S91 that the tap coefficients w_i have been obtained for all classes, the process proceeds to step S92, the coefficient memory 221 stores the tap coefficients w_i of each class supplied from the arithmetic circuit 220, and the learning process ends.

The coefficient memory 117 of FIG. 5 stores the multiple sets of per-class tap coefficients w_i obtained as described above.

In this way, the learning device 201 normalizes the prediction taps extracted from the image data equivalent to a single-plate CCD output, which serves as the student image data, and obtains the tap coefficients w_i of each class using the normalized prediction taps.

Thus, by normalizing the prediction tap extracted from image data equivalent to a single-plate CCD output so that the dynamic ranges of the pixels of each color are equal, and by obtaining the tap coefficients w_i of each class using the normalized prediction tap, tap coefficients w_i that allow the class classification adaptive processing to be performed more accurately can be obtained. Consequently, when the class classification adaptive processing is performed using these tap coefficients w_i, image breakdown can be suppressed, and higher-definition image data equivalent to a three-plate CCD output can be obtained more accurately from image data output by a single-plate CCD.

In the above, the Bayer array shown in FIG. 13A was used as an example of the color filter array 53, which passes the primary color (R, G, B) components of light. Alternatively, the interline array shown in FIG. 13B, the G-stripe RB checkered array shown in FIG. 13C, the G-stripe RB complete checkered array shown in FIG. 13D, the primary color difference array shown in FIG. 13E, or the like may be used.

In the interline array shown in FIG. 13B, G filters are arranged in a checkered pattern, and R filters and B filters are arranged alternately in the remaining positions of each row.

In the G-stripe RB checkered array shown in FIG. 13C, G filters are lined up vertically in every other column, and R filters and B filters alternate vertically in the remaining columns. Here, each R filter or B filter in a column is horizontally adjacent, across a G filter, to a filter of the same color.

In the G-stripe RB complete checkered array shown in FIG. 13D, G filters are lined up vertically in every other column, and R filters and B filters alternate vertically in the remaining columns. Here, each R filter and each B filter in a column is horizontally adjacent, across a G filter, to a B filter and an R filter, respectively.

In the primary color difference array shown in FIG. 13E, in the first through third rows, G filters are lined up vertically in every other column, and R filters and B filters alternate vertically in the remaining columns. Here, each R filter and each B filter in a column is horizontally adjacent, across a G filter, to a filter of the same color. In the fourth row, R filters and G filters alternate.

The color filter array 53 may also be a complementary color filter instead of a primary color filter. In that case, each pixel of the captured image data has a value for one of yellow (Ye), cyan (Cy), magenta (Mg), and green (G).

Furthermore, the above describes an example of generating image data equivalent to a three-plate CCD output of the same resolution from image data equivalent to a single-plate CCD output, but it is also possible to generate image data equivalent to a three-plate CCD output with n times the resolution.

For example, when obtaining tap coefficients for generating image data equivalent to a three-plate CCD output with four times the resolution from image data equivalent to a single-plate CCD output, image data equivalent to a three-plate CCD output, consisting of R image data, G image data, and B image data as shown in FIG. 14A, is input to the learning device 201 and used as the teacher image data.

The image data equivalent to a three-plate CCD output shown in FIG. 14A is then thinned out by a low-pass filter in the thinning circuit 211 into image data of 1/4 the resolution, as shown in FIG. 14B. Each of the R, G, and B image data shown in FIG. 14B has 1/4 the resolution of the corresponding R, G, or B image data shown in FIG. 14A.

The image data shown in FIG. 14B is further thinned out in the thinning circuit 211 into Bayer-array image data equivalent to a single-plate CCD output, as shown in FIG. 14C, and this image data is used as the student image data.
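
The two thinning stages can be sketched as below. The low-pass filter of the thinning circuit 211 is not specified in this section, so a 2 × 2 box average is assumed, and the Bayer phase (G and R on even rows, B and G on odd rows) is likewise an assumption. The names `box_downsample`, `bayer_color`, and `make_student` are hypothetical.

```python
def box_downsample(plane):
    """Average each 2x2 block: an assumed low-pass filter plus decimation."""
    h, w = len(plane) // 2, len(plane[0]) // 2
    return [[(plane[2 * y][2 * x] + plane[2 * y][2 * x + 1]
              + plane[2 * y + 1][2 * x] + plane[2 * y + 1][2 * x + 1]) / 4.0
             for x in range(w)]
            for y in range(h)]


def bayer_color(y, x):
    """Color sampled at (y, x) for the assumed phase: G R on even rows, B G on odd rows."""
    if (y + x) % 2 == 0:
        return "G"
    return "R" if y % 2 == 0 else "B"


def make_student(r, g, b):
    """Teacher R, G, B planes -> quarter-resolution Bayer mosaic (student image)."""
    planes = {"R": box_downsample(r), "G": box_downsample(g), "B": box_downsample(b)}
    h, w = len(planes["G"]), len(planes["G"][0])
    # Keep only one color per pixel position, as a single-plate sensor would.
    return [[planes[bayer_color(y, x)][y][x] for x in range(w)] for y in range(h)]
```

Each student pixel then corresponds to a 2 × 2 group of teacher pixels, which matches the four pixels of interest per student pixel used in the quadruple-density learning.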

Furthermore, as shown in FIG. 15A, the pixels corresponding to the positions of the four equal-area regions within a pixel of the image data equivalent to a single-plate CCD output are each taken as a pixel of interest, and tap coefficients for obtaining the pixel value of each pixel of interest are obtained.

In FIG. 15A, among the pixels of the image data equivalent to a single-plate CCD output, the pixels corresponding to the positions of the four equal-area regions within the G pixel indicated by arrow K41 are taken as the pixels of interest 251-1 through 251-4. Here, the prediction tap for each of the pixels of interest 251-1 through 251-4 is, for example, the 3 × 3 pixels centered on the G pixel indicated by arrow K41.

Then, as shown in FIG. 15B, based on the R, G, and B pixel values of each of the pixels of interest 251-1 through 251-4 in the image data equivalent to a three-plate CCD output and on the prediction tap, a total of twelve tap coefficients are obtained: the tap coefficients for predicting the R pixel value of each of the pixels of interest 251-1 through 251-4, those for predicting the G pixel value of each, and those for predicting the B pixel value of each.

In this case, the arithmetic circuit 218 sets up, for each of the R, G, and B color components of each class, the normal equations shown in equations (17), (18), and (19), which correspond to equation (16), and performs the addition.

... (17)

... (18)

... (19)

Here, w_{r1,i}, w_{r2,i}, w_{r3,i}, and w_{r4,i} in equation (17) (where 1 ≤ i ≤ N) denote the i-th tap coefficients used to predict the R pixel values of the pixels of interest 251-1 through 251-4, respectively; each is multiplied by (the pixel value of) the i-th pixel constituting the prediction tap.

R_{1,i}, R_{2,i}, R_{3,i}, and R_{4,i} in equation (17) (where 1 ≤ i ≤ N) denote Σx_{s,i}y_s of equation (15) with the pixel value y_s taken to be the R pixel value of the pixels of interest 251-1 through 251-4, respectively.

Similarly, w_{g1,i}, w_{g2,i}, w_{g3,i}, and w_{g4,i} in equation (18) (where 1 ≤ i ≤ N) denote the i-th tap coefficients used to predict the G pixel values of the pixels of interest 251-1 through 251-4, respectively; each is multiplied by (the pixel value of) the i-th pixel constituting the prediction tap.

G_{1,i}, G_{2,i}, G_{3,i}, and G_{4,i} in equation (18) (where 1 ≤ i ≤ N) denote Σx_{s,i}y_s of equation (15) with the pixel value y_s taken to be the G pixel value of the pixels of interest 251-1 through 251-4, respectively.

Likewise, w_{b1,i}, w_{b2,i}, w_{b3,i}, and w_{b4,i} in equation (19) (where 1 ≤ i ≤ N) denote the i-th tap coefficients used to predict the B pixel values of the pixels of interest 251-1 through 251-4, respectively; each is multiplied by (the pixel value of) the i-th pixel constituting the prediction tap.

B_{1,i}, B_{2,i}, B_{3,i}, and B_{4,i} in equation (19) (where 1 ≤ i ≤ N) denote Σx_{s,i}y_s of equation (15) with the pixel value y_s taken to be the B pixel value of the pixels of interest 251-1 through 251-4, respectively.

By solving the normal equations of equations (17) through (19) set up in this way, the arithmetic circuit 220 obtains the twelve tap coefficients for predicting the R, G, and B pixel values of each of the pixels of interest 251-1 through 251-4.

By using these tap coefficients, the adaptive processing circuit 116 can generate, from image data equivalent to a single-plate CCD output, image data equivalent to a three-plate CCD output with four times the resolution (quadruple-density image data).
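
How the twelve tap coefficients would be applied can be sketched as follows. The sketch reads them as twelve coefficient sets of N entries each (one set per color and per pixel of interest, since 1 ≤ i ≤ N) and assumes a first-order linear prediction in which each coefficient multiplies the i-th pixel of the prediction tap. The function name `predict_quad_density` and the dictionary layout of `coeffs` are hypothetical.

```python
def predict_quad_density(pred_tap, coeffs):
    """Predict R, G, B for the four pixels of interest inside one student pixel.

    pred_tap: the N (normalized) prediction tap values, e.g. N = 9 for a 3x3 tap
    coeffs:   coeffs[color][k] is the list of N tap coefficients for pixel of
              interest k (0..3) of that color, for the class of this tap
    """
    out = {}
    for color in ("R", "G", "B"):
        out[color] = [
            sum(w * x for w, x in zip(coeffs[color][k], pred_tap))  # sum of w_i * x_i
            for k in range(4)
        ]
    return out
```

Calling this once per pixel of the single-plate image yields four output pixels with three color values each, which is the quadruple-density result described above.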

To evaluate the effect of the embodiment described above, the inventors performed a simulation. An image equivalent to a single-plate CCD output was created from an image equivalent to a three-plate CCD output, and tap coefficients were generated by an algorithm performing the same processing as the learning device 201 described above. Then, using the generated tap coefficients, the image equivalent to a single-plate CCD output was converted into an image equivalent to a three-plate CCD output by interpolation through class classification adaptive processing that normalizes the prediction taps.

The image equivalent to a three-plate CCD output generated in this simulation was compared with an image equivalent to a three-plate CCD output generated by conventional class classification adaptive processing. In every image generated in the simulation, the image breakdown caused by differences in the dynamic range of the color components was confirmed to be improved. The method of generating an image equivalent to a three-plate CCD output by class classification adaptive processing that normalizes the prediction taps can therefore be said to be superior to the method using conventional class classification adaptive processing.

As described above, normalizing the prediction taps used in the class classification adaptive processing makes the dynamic ranges of the pixels of each color in the prediction tap equal and suppresses image breakdown. Higher-definition image data can thereby be generated more accurately.

In addition, by normalizing the prediction taps so that the dynamic ranges of the pixels of each color are equal and obtaining the tap coefficients of each class using the normalized prediction taps, tap coefficients that allow the class classification adaptive processing to be performed more accurately can be obtained. When the class classification adaptive processing is performed using these tap coefficients, image breakdown can be suppressed, and higher-definition image data can be obtained more accurately.

In the above, the number of classes into which the pixel of interest is classified was reduced by applying ADRC processing to the extracted class tap. Alternatively, the number of classes may be reduced by DCT (Discrete Cosine Transform), VQ (Vector Quantization), DPCM (Differential Pulse Code Modulation), BTC (Block Truncation Coding), nonlinear quantization, or the like.

The image data has been described as a component video signal whose components are R, G, and B, but it may instead be a component video signal whose components are luminance Y, color difference U, and color difference V. The image data may also have, as the pixel value of each pixel, a value for at least one of yellow (Ye), cyan (Cy), magenta (Mg), and green (G).

Furthermore, although images have been described as being captured using a CCD as the imaging element, another imaging element, for example a CMOS (Complementary Metal Oxide Semiconductor) sensor, may also be used.

The series of processes described above can be executed by hardware or by software. When the series of processes is executed by software, the program constituting the software is installed from a program recording medium onto a computer built into dedicated hardware or onto, for example, a general-purpose personal computer that can execute various functions when various programs are installed.

FIG. 16 is a block diagram showing an example of the configuration of a personal computer that executes the series of processes described above by means of a program. The CPU 311 of the personal computer 301 executes various processes according to a program recorded in the ROM (Read Only Memory) 312 or the recording unit 318. The RAM (Random Access Memory) 313 stores, as appropriate, the programs executed by the CPU 311 and associated data. The CPU 311, the ROM 312, and the RAM 313 are connected to one another by a bus 314.

An input/output interface 315 is also connected to the CPU 311 via the bus 314. Connected to the input/output interface 315 are an input unit 316 consisting of a keyboard, a mouse, a microphone, and the like, and an output unit 317 consisting of a display, a speaker, and the like. The CPU 311 executes various processes in response to commands input from the input unit 316 and outputs the results of the processes to the output unit 317.

The recording unit 318 connected to the input/output interface 315 consists of, for example, a hard disk, and records the programs executed by the CPU 311 and various data. The communication unit 319 communicates with external devices via a network such as the Internet or a local area network.

A program may also be acquired via the communication unit 319 and recorded in the recording unit 318.

When a removable medium 331 such as a magnetic disk, an optical disc, a magneto-optical disk, or a semiconductor memory is loaded, the drive 320 connected to the input/output interface 315 drives it and acquires the programs, data, and the like recorded on it. The acquired programs and data are transferred to the recording unit 318 and recorded as necessary.

As shown in FIG. 16, the program recording medium that stores the program to be installed on the computer and made executable by it consists of the removable medium 331, which is a package medium such as a magnetic disk (including a flexible disk), an optical disc (including a CD-ROM (Compact Disc-Read Only Memory) and a DVD (Digital Versatile Disc)), a magneto-optical disk, or a semiconductor memory; the ROM 312, in which the program is stored temporarily or permanently; or the hard disk constituting the recording unit 318. The program is stored on the program recording medium, as necessary, via the communication unit 319, which is an interface such as a router or a modem, using a wired or wireless communication medium such as a local area network, the Internet, or digital satellite broadcasting.

In this specification, the steps describing the program stored on the program recording medium include not only processes performed in time series in the described order but also processes executed in parallel or individually without necessarily being processed in time series.

The embodiments of the present invention are not limited to the embodiment described above, and various modifications are possible without departing from the gist of the present invention.

FIG. 1 is a diagram for explaining the image breakdown that occurs when pixels of different colors are used as a prediction tap.
FIG. 2 is a diagram explaining the principle of obtaining image data equivalent to a three-plate CCD output from the output of a single-plate CCD.
FIG. 3 is a block diagram showing a configuration example of an imaging device.
FIG. 4 is a flowchart explaining the imaging process.
FIG. 5 is a block diagram showing a configuration example of an image signal processing circuit.
FIG. 6 is a flowchart explaining the image signal processing.
FIG. 7 is a diagram showing an example of a class tap.
FIG. 8 is a diagram showing an example of a prediction tap.
FIG. 9 is a diagram for explaining normalization of a prediction tap.
FIG. 10 is a diagram for explaining normalization of a prediction tap.
FIG. 11 is a block diagram showing a configuration example of a learning device.
FIG. 12 is a flowchart explaining the learning process.
FIG. 13 is a diagram showing examples of other arrangements of the color filter array.
FIG. 14 is a diagram explaining the principle of the learning process that obtains tap coefficients for obtaining, from the output of a single-plate CCD, image data equivalent to a three-plate CCD output with four times the resolution.
FIG. 15 is a diagram for explaining the pixels of interest when obtaining tap coefficients that generate image data with four times the resolution.
FIG. 16 is a block diagram showing a configuration example of a personal computer.

Explanation of symbols

41 imaging device, 54 CCD, 57 image signal processing circuit, 65 recording medium, 111 ADRC blocking circuit, 113 class classification circuit, 114 prediction tap blocking circuit, 115 prediction tap normalization unit, 116 adaptive processing circuit, 117 coefficient memory, 201 learning device, 211 thinning circuit, 212 ADRC blocking circuit, 214 class classification circuit, 215 prediction tap blocking circuit, 216 prediction tap normalization unit, 218 arithmetic circuit, 220 arithmetic circuit, 301 personal computer, 311 CPU, 312 ROM, 313 RAM, 318 recording unit, 331 removable medium

Claims

An image processing apparatus for converting first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels each having values of the first component and the second component, the apparatus comprising:
prediction tap extracting means for extracting, from the first image data, as a prediction tap, a plurality of pixels used for predicting a pixel of interest, which is the pixel of the second image data currently being attended to;
class tap extracting means for extracting, as a class tap, a plurality of pixels used for class classification that classifies the pixel of interest into one of a plurality of classes;
class classification means for classifying the pixel of interest using the class tap;
normalization means for normalizing the prediction tap so that the dynamic range of the values of the first component held by the pixels constituting the prediction tap equals the dynamic range of the values of the second component held by the pixels constituting the prediction tap; and
means for predicting the value of the first component or the value of the second component of the pixel of interest, using a tap coefficient determined in advance for the class of the pixel of interest and the normalized prediction tap.
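
The classification and prediction elements of the claim above can be illustrated with a minimal Python sketch. It is not the claimed implementation. The claim does not fix a classification rule or a prediction formula, so the one-bit midpoint thresholding (an ADRC-style rule), the weighted-sum prediction, and the names classify, predict, and coefficient_memory are all assumptions made for illustration, and the coefficient values are invented.

```python
def classify(class_tap):
    """Return a class code by 1-bit quantization of each class-tap pixel.

    Each pixel contributes one bit: 1 if it lies at or above the midpoint
    of the tap's minimum and maximum, else 0. This ADRC-style rule is an
    assumption of this sketch, not something the claim specifies.
    """
    lo, hi = min(class_tap), max(class_tap)
    mid = (lo + hi) / 2
    code = 0
    for value in class_tap:
        code = (code << 1) | (1 if value >= mid else 0)
    return code


def predict(normalized_tap, coefficients):
    """Predict one component value as a weighted sum of the tap values.

    A linear combination is assumed here for illustration.
    """
    if len(normalized_tap) != len(coefficients):
        raise ValueError("tap and coefficient lengths differ")
    return sum(w * x for w, x in zip(coefficients, normalized_tap))


# Hypothetical coefficient table: class code -> tap coefficients.
coefficient_memory = {0b0110: [0.25, 0.25, 0.25, 0.25]}

class_tap = [10, 200, 180, 20]            # pixels used only for classification
tap_values = [100.0, 80.0, 140.0, 120.0]  # an already normalized prediction tap
class_code = classify(class_tap)
value = predict(tap_values, coefficient_memory[class_code])
```

In this sketch the class code selects which coefficient set is applied, so different local patterns in the class tap can use different weights.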

The image processing apparatus according to claim 1, wherein, when the value of the first component of the pixel of interest is predicted, the normalization means normalizes the prediction tap by multiplying the value of the second component held by each pixel constituting the prediction tap by the value obtained by dividing the dynamic range of the values of the first component by the dynamic range of the values of the second component.
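
The normalization recited above amounts to scaling the second-component values by the ratio of the two dynamic ranges. The following Python sketch illustrates that arithmetic under stated assumptions: the dynamic range is taken as maximum minus minimum, the tap is represented as (component, value) pairs, and the function names and the handling of a flat second component are choices made for this sketch only.

```python
def dynamic_range(values):
    """Dynamic range, taken here as max - min (an assumption of this sketch)."""
    return max(values) - min(values)


def normalize_prediction_tap(tap, target="R", other="G"):
    """Scale the `other` component so its dynamic range matches `target`.

    `tap` is a list of (component, value) pairs, one per pixel in the tap.
    Values of `other` are multiplied by DR(target) / DR(other).
    """
    dr_target = dynamic_range([v for c, v in tap if c == target])
    dr_other = dynamic_range([v for c, v in tap if c == other])
    if dr_other == 0:
        # Flat second component: left unchanged (a choice made for this sketch).
        return list(tap)
    gain = dr_target / dr_other
    return [(c, v * gain if c == other else v) for c, v in tap]


# R spans 100..140 (range 40) and G spans 40..60 (range 20), so G is doubled.
tap = [("R", 100), ("G", 40), ("R", 140), ("G", 60), ("R", 120)]
normalized = normalize_prediction_tap(tap)
```

After the call, the G values in this example span the same range of 40 as the R values, which is the equality of dynamic ranges the claim asks for.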

The image processing apparatus according to claim 1, wherein the first component or the second component is a component representing any one of red, blue, and green.

The image processing apparatus according to claim 1, wherein the first component or the second component is a component representing luminance or color difference.

An image processing method for converting first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels each having values of the first component and the second component, the method comprising the steps of:
extracting, from the first image data, as a prediction tap, a plurality of pixels used for predicting a pixel of interest, which is the pixel of the second image data currently being attended to;
extracting, from the first image data, as a class tap, a plurality of pixels used for class classification that classifies the pixel of interest into one of a plurality of classes;
classifying the pixel of interest using the class tap;
normalizing the prediction tap so that the dynamic range of the values of the first component held by the pixels constituting the prediction tap equals the dynamic range of the values of the second component held by the pixels constituting the prediction tap; and
predicting the value of the first component or the value of the second component of the pixel of interest, using a tap coefficient determined in advance for the class of the pixel of interest and the normalized prediction tap.

A program for causing a computer to execute image processing that converts first image data, composed of pixels having a value of a first component of a component video signal and pixels having a value of a second component, into second image data composed of pixels each having values of the first component and the second component, the processing comprising the steps of:
extracting, from the first image data, as a prediction tap, a plurality of pixels used for predicting a pixel of interest, which is the pixel of the second image data currently being attended to;
extracting, from the first image data, as a class tap, a plurality of pixels used for class classification that classifies the pixel of interest into one of a plurality of classes;
classifying the pixel of interest using the class tap;
normalizing the prediction tap so that the dynamic range of the values of the first component held by the pixels constituting the prediction tap equals the dynamic range of the values of the second component held by the pixels constituting the prediction tap; and
predicting the value of the first component or the value of the second component of the pixel of interest, using a tap coefficient determined in advance for the class of the pixel of interest and the normalized prediction tap.

A learning device comprising:
pixel-of-interest extraction means for extracting the value of a first component of a pixel of interest, which is the pixel currently being attended to, from first image data composed of pixels each having values of the first component and a second component of a component video signal;
prediction tap extracting means for extracting, as a prediction tap, a plurality of pixels used for predicting the pixel of interest from second image data composed of pixels having the value of the first component and pixels having the value of the second component;
class tap extracting means for extracting, as a class tap, a plurality of pixels used for class classification that classifies the pixel of interest into one of a plurality of classes;
class classification means for classifying the pixel of interest using the class tap;
normalization means for normalizing the prediction tap so that the dynamic range of the values of the first component held by the pixels constituting the prediction tap equals the dynamic range of the values of the second component held by the pixels constituting the prediction tap; and
calculation means for obtaining, using the value of the first component of the pixel of interest and the normalized prediction tap, a tap coefficient that corresponds to the class of the pixel of interest and is used to predict the value of the first component of the pixel of interest from the normalized prediction tap.
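
One way the calculation element above could obtain tap coefficients is a least-squares fit of the teacher values against the normalized prediction taps, accumulated separately for each class. The claim does not name a fitting method, so least squares, the normal-equation accumulator, and every function name below are assumptions of this Python sketch, and the training pairs are invented.

```python
def new_accumulator(n):
    """Empty normal-equation sums for an n-pixel prediction tap."""
    return {"xtx": [[0.0] * n for _ in range(n)], "xty": [0.0] * n}


def accumulate(acc, tap, teacher_value):
    """Add one (normalized tap, teacher value) training pair to the sums."""
    for i, xi in enumerate(tap):
        acc["xty"][i] += xi * teacher_value
        for j, xj in enumerate(tap):
            acc["xtx"][i][j] += xi * xj


def solve(acc):
    """Solve xtx * w = xty by Gauss-Jordan elimination with partial pivoting."""
    n = len(acc["xty"])
    m = [row[:] + [rhs] for row, rhs in zip(acc["xtx"], acc["xty"])]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("not enough independent training samples")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [a - f * b for a, b in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


# In practice one accumulator would be kept per class; a single
# hypothetical class with a two-pixel tap is shown here.
samples = [([1.0, 0.0], 2.0), ([0.0, 1.0], 3.0), ([1.0, 1.0], 5.0)]
acc = new_accumulator(2)
for tap, teacher in samples:
    accumulate(acc, tap, teacher)
weights = solve(acc)
```

For these invented samples the teacher value is exactly 2 times the first tap value plus 3 times the second, so the solved weights come out close to 2 and 3.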

The learning device according to claim 7, wherein, when the tap coefficient used for predicting the value of the first component of the pixel of interest is obtained, the normalization means normalizes the prediction tap by multiplying the value of the second component held by each pixel constituting the prediction tap by the value obtained by dividing the dynamic range of the values of the first component by the dynamic range of the values of the second component.

The learning device according to claim 7, wherein the first component or the second component is a component representing any one of red, blue, and green.

The learning device according to claim 7, wherein the first component or the second component is a component representing luminance or color difference.

A learning method comprising the steps of:
extracting the value of a first component of a pixel of interest, which is the pixel currently being attended to, from first image data composed of pixels each having values of the first component and a second component of a component video signal;
extracting, as a prediction tap, a plurality of pixels used for predicting the pixel of interest from second image data composed of pixels having the value of the first component and pixels having the value of the second component;
extracting, from the second image data, as a class tap, a plurality of pixels used for class classification that classifies the pixel of interest into one of a plurality of classes;
classifying the pixel of interest using the class tap;
normalizing the prediction tap so that the dynamic range of the values of the first component held by the pixels constituting the prediction tap equals the dynamic range of the values of the second component held by the pixels constituting the prediction tap; and
obtaining, using the value of the first component of the pixel of interest and the normalized prediction tap, a tap coefficient that corresponds to the class of the pixel of interest and is used to predict the value of the first component of the pixel of interest from the normalized prediction tap.

A program for causing a computer to execute processing comprising the steps of:
extracting the value of a first component of a pixel of interest, which is the pixel currently being attended to, from first image data composed of pixels each having values of the first component and a second component of a component video signal;
extracting, as a prediction tap, a plurality of pixels used for predicting the pixel of interest from second image data composed of pixels having the value of the first component and pixels having the value of the second component;
extracting, from the second image data, as a class tap, a plurality of pixels used for class classification that classifies the pixel of interest into one of a plurality of classes;
classifying the pixel of interest using the class tap;
normalizing the prediction tap so that the dynamic range of the values of the first component held by the pixels constituting the prediction tap equals the dynamic range of the values of the second component held by the pixels constituting the prediction tap; and
obtaining, using the value of the first component of the pixel of interest and the normalized prediction tap, a tap coefficient that corresponds to the class of the pixel of interest and is used to predict the value of the first component of the pixel of interest from the normalized prediction tap.