JP4564856B2

JP4564856B2 - Image encoding apparatus and imaging apparatus

Info

Publication number: JP4564856B2
Application number: JP2005026530A
Authority: JP
Inventors: 純牧野
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2005-02-02
Filing date: 2005-02-02
Publication date: 2010-10-20
Anticipated expiration: 2025-02-02
Also published as: JP2006217158A

Description

本発明は、画像符号化装置及び撮像装置に関し、特に、Ｈ．２６４符号化方式を採用した場合において、最適なイントラ予測モードを選択する為の技術に関する。 The present invention relates to an image encoding apparatus and an imaging apparatus, The present invention relates to a technique for selecting an optimal intra prediction mode when the H.264 encoding method is employed.

動画像を高能率符号化するための技術として、ＭｏｔｉｏｎＪＰＥＧやＭＰＥＧ１，２といった符号化方式が確立されている。各メーカーはこれらの符号化方式を利用して動画像を保存可能としたディジタルカメラやディジタルビデオカメラといった撮像装置或いはＤＶＤレコーダーなどを開発し、製品化しており、ユーザーはこれらの装置或いはパーソナルコンピュータやＤＶＤプレーヤーなどを用いて簡単に動画像を視聴することが可能となっている。 Encoding methods such as Motion JPEG and MPEG1, 2 have been established as techniques for highly efficient encoding of moving images. Manufacturers have developed and commercialized imaging devices such as digital cameras and digital video cameras or DVD recorders that can store moving images using these encoding methods, and users can use these devices or personal computers, It is possible to easily view a moving image using a DVD player or the like.

ところで、ディジタル化された動画像は膨大なデータ量となる。そこで、上記したＭＰＥＧ１，２などよりも更なる高圧縮が望める動画像の符号化方式が研究され続けてきて、近年ＩＴＵ−Ｔ（国際電気通信連合電気通信標準化部門）とＩＳＯ（国際標準化機構）によりＨ．２６４／ＭＰＥＧ４ｐａｒｔ１０：ＡＶＣという符号化方式（以下、Ｈ．２６４と称す）が標準化された。 By the way, the digitized moving image has a huge amount of data. In view of this, moving picture coding systems that can achieve higher compression than MPEG1 and MPEG2 have been studied, and recently ITU-T (International Telecommunication Union Telecommunication Standardization Sector) and ISO (International Organization for Standardization). H. H.264 / MPEG4 part 10: AVC encoding method (hereinafter referred to as H.264) has been standardized.

Ｈ．２６４は、ＭＰＥＧ１，２等の従来の符号化方式に比べ、その符号化又は復号化により多くの演算量が要求されるが、より高い符号化効率が実現されることが知られている。Ｈ．２６４の演算処理の構成については、例えば特許文献１に開示されている。 H. H.264 requires a large amount of computation for encoding or decoding compared to conventional encoding methods such as MPEG1 and MPEG2, but is known to realize higher encoding efficiency. H. The configuration of the H.264 arithmetic processing is disclosed in Patent Document 1, for example.

ところで、Ｈ．２６４には、フレーム内画素値を用いて同一フレーム内画素値を予測する“イントラ予測”という予測方式が存在する。このイントラ予測においては、複数のイントラ予測モードが存在し、これらを選択的に用いることが規定されている。このとき入力画像に適したイントラ予測モードが選択されることにより、高レベルな圧縮を施しても劣化が少ない符号化データが生成されることになる。
特開２００４−５６８２７号公報 H. H.264 has a prediction method called “intra prediction” in which pixel values within the same frame are predicted using pixel values within the frame. In this intra prediction, there are a plurality of intra prediction modes, and it is specified that these are selectively used. At this time, by selecting an intra prediction mode suitable for the input image, encoded data with little deterioration is generated even when high-level compression is performed.
JP 2004-56827 A

前記したイントラ予測に関して、予測の精度を向上させるため、Ｈ．２６４では９種類のイントラ予測モードが用意されている。そして、複数のイントラ予測モードの中から最適な１つのイントラ予測モードを選択する方法としては、入力画像に対してすべてのイントラ予測モードを実行して試した結果に基づいて、最適な結果を得られるイントラ予測モードを選択する方法が一般的である。 In order to improve the accuracy of prediction with respect to the intra prediction described above, In H.264, nine types of intra prediction modes are prepared. As a method for selecting one optimal intra prediction mode from among a plurality of intra prediction modes, an optimal result is obtained based on the results obtained by performing all intra prediction modes on the input image. A method of selecting the intra prediction mode to be performed is common.

図１１を用いて、その背景を説明する。図１１はフレームのイメージであり、外側の四角は画面全体を示しており、内側の５つの四角は、イントラ予測を行うブロックを示している。なお、ブロックは画面全体に渡って設定されるものであるが、説明の簡略化のため、代表的な５つのみを示してある。さらに図１１は、各ブロックについて、演算の結果、最適となったイントラ予測モードが示す予測方向を矢印で示した例である。このように、同一の画面内であっても、位置によって被写体が異なるので、各ブロックに最適なイントラ予測モードはまちまちとなり、特定のモードに固定されるものではない。このような背景から、イントラ予測を行う場合には各ブロックに対し、すべてのイントラ予測モードについてその予測精度を求め、最適に予測できるモードを選択して、そのブロックの予測モードとしている。 The background will be described with reference to FIG. FIG. 11 is an image of a frame. The outer squares indicate the entire screen, and the inner five squares indicate blocks for performing intra prediction. The blocks are set over the entire screen, but only five representative ones are shown for the sake of simplicity. Furthermore, FIG. 11 is an example in which the prediction direction indicated by the intra prediction mode that is optimized as a result of the calculation is indicated by an arrow for each block. As described above, even within the same screen, the subject varies depending on the position, so that the optimal intra prediction mode for each block varies and is not fixed to a specific mode. Against this background, when performing intra prediction, the prediction accuracy of all intra prediction modes is obtained for each block, and a mode that can be optimally predicted is selected to be the prediction mode for that block.

しかしながら、複数のイントラ予測モードの中から最適な１つのイントラ予測モードを選択するために、上記した如きすべてのイントラ予測モードを計算してから選ぶ方法を用いた場合では、Ｈ．２６４符号化処理における演算量の増加につながり、ひいては符号化処理時間の超過や、消費電力の浪費につながっていた。 However, in order to select an optimal intra prediction mode from among a plurality of intra prediction modes, when the method of selecting after calculating all intra prediction modes as described above is used, This leads to an increase in the amount of computation in the H.264 encoding process, which in turn leads to excess encoding processing time and waste of power consumption.

本発明は上記問題点を鑑みてなされたものであり、入力画像に対して最適なイントラ予測モードを、前記入力画像の撮影時の情報を用いることで容易に選択できるようにすることを目的とする。 The present invention has been made in view of the above problems, and an object thereof is to make it possible to easily select an optimal intra prediction mode for an input image by using information at the time of shooting the input image. To do.

また、本発明の他の目的は、Ｈ．２６４を用いた画像符号化装置における演算処理を軽減させて、効率的な符号化を実現することにある。 Another object of the present invention is H.264. It is to reduce the arithmetic processing in the image encoding apparatus using H.264 and realize efficient encoding.

上記課題を解決するために、本発明の画像符号化装置は、複数の予測モードのうちいずれかを選び、選ばれた予測モードに従ってフレーム内の画素値を用いて同一フレーム内の符号化対象ブロックの画素値を予測して予測画像データを生成する画像符号化装置において、符号化すべき画像信号を入力する手段と、前記画像信号が生成されたときのカメラのズーム動作状態を示す撮影情報を取得する手段と、入力した前記画像信号の符号化において、フレーム内の画素値を用いて同一フレーム内の前記符号化対象ブロックの画素値を予測するときに、前記撮影情報がズーム動作中であることを示している場合には、前記符号化対象ブロックの位置に応じた予測モードを選択する手段とを備えることを特徴とする。
また、本発明の画像符号化装置は、複数の予測モードのうちいずれかを選び、選ばれた予測モードに従ってフレーム内の画素値を用いて同一フレーム内の符号化対象ブロックの画素値を予測して予測画像データを生成する画像符号化装置において、符号化すべき画像信号を入力する手段と、前記画像信号が生成されたときのカメラのフォーカス動作状態を示す撮影情報を取得する手段と、入力した前記画像信号の符号化において、フレーム内の画素値を用いて同一フレーム内の前記符号化対象ブロックの画素値を予測するときに、前記撮影情報がフォーカス合焦状態を示している場合と、フォーカス非合焦状態を示している場合とで、異なる処理によって予測モードを選択する手段とを備えることを特徴とする。 In order to solve the above-described problem, an image encoding apparatus according to the present invention selects any one of a plurality of prediction modes, and uses a pixel value in the frame according to the selected prediction mode, to encode a block to be encoded in the same frame in the image coding apparatus for predicting the pixel value generating prediction image data, means for inputting an image signal to be coded, shooting information of a camera zoom operation state of when the image signal is generated means for obtaining, in the encoding of the inputted said image signal, when predicting the pixel value of the coding target block in the same frame by using pixel values in a frame, the imaging information in a zooming operation In the case where it indicates that there is, it is provided with means for selecting a prediction mode in accordance with the position of the encoding target block .
In addition, the image coding apparatus of the present invention selects any one of a plurality of prediction modes, and predicts the pixel value of the encoding target block in the same frame using the pixel value in the frame according to the selected prediction mode. In the image encoding device for generating predicted image data, input means for inputting an image signal to be encoded, means for acquiring photographing information indicating a focus operation state of the camera when the image signal is generated, and input In the encoding of the image signal, when the pixel value of the encoding target block in the same frame is predicted using the pixel value in the frame, the shooting information indicates a focus in-focus state, And a unit that selects a prediction mode by different processing depending on whether the in-focus state is indicated.

また本発明の撮像装置は、撮像して得られる画像信号を出力する撮像手段と、前記画像信号を圧縮符号化するための符号化手段であって、複数の予測モードのうちいずれかを選び、選ばれた予測モードに従ってフレーム内の画素値を用いて同一フレーム内の画素値を予測して予測画像データを生成する処理を行う符号化手段とを備えた撮像装置において、さらに前記撮像手段におけるズーム動作を制御する制御手段を備え、前記符号化手段は、フレーム内の画素値を用いて同一フレーム内の画素値を予測するときに、前記制御手段から出力された情報を用いて、ズーム中に撮影された画像に対しては当該画像を複数の領域に分割し、前記領域ごとに所定の予測モードを選択することを特徴とする。
また本発明の撮像装置は、撮像して得られる画像信号を出力する撮像手段と、前記画像信号を圧縮符号化するための符号化手段であって、複数の予測モードのうちいずれかを選び、選ばれた予測モードに従ってフレーム内の画素値を用いて同一フレーム内の画素値を予測して予測画像データを生成する処理を行う符号化手段とを備えた撮像装置において、さらに前記撮像手段におけるフォーカス動作を制御する制御手段を備え、前記符号化手段は、フレーム内の画素値を用いて同一フレーム内の画素値を予測するときに、前記制御手段から出力された情報を用いて、フォーカスが非合焦状態と判断されるときに撮影された画像に対しては所定の予測モードを選択することを特徴とする。 The imaging apparatus of the present invention is an imaging unit that outputs an image signal obtained by imaging, and an encoding unit for compressing and encoding the image signal, and selects any one of a plurality of prediction modes, An image pickup apparatus comprising: encoding means for performing processing for generating predicted image data by predicting pixel values in the same frame using pixel values in the frame according to the selected prediction mode ; and zooming in the image pickup means Control means for controlling the operation, and the encoding means uses the information output from the control means during zooming when predicting the pixel value in the same frame using the pixel value in the frame. The captured image is divided into a plurality of regions, and a predetermined prediction mode is selected for each region .
The imaging apparatus of the present invention is an imaging unit that outputs an image signal obtained by imaging, and an encoding unit for compressing and encoding the image signal, and selects any one of a plurality of prediction modes, An imaging apparatus comprising: encoding means for performing processing for predicting pixel values in the same frame using pixel values in the frame according to the selected prediction mode and generating predicted image data; and focusing in the imaging means Control means for controlling the operation, and the encoding means uses the information output from the control means when predicting the pixel value in the same frame using the pixel value in the frame, and the focus is not A predetermined prediction mode is selected for an image captured when it is determined that the in-focus state is detected.

本発明によれば、すべてのイントラ予測モードを試すことなく、複数のイントラ予測モードの中から最適なイントラ予測モードを、撮影時の情報を利用することで容易に選択でき、さらに、Ｈ．２６４における符号化時の演算処理を軽減させることができる。 According to the present invention, it is possible to easily select an optimal intra prediction mode from among a plurality of intra prediction modes by using information at the time of shooting without trying all intra prediction modes. The calculation processing at the time of encoding in H.264 can be reduced.

以下、図面を参照しながら本発明の好適な実施の形態を説明する。 Hereinafter, preferred embodiments of the present invention will be described with reference to the drawings.

＜第１の実施形態＞
図８は、本実施形態における画像符号化装置６００の構成を表わしたブロック図である。画像符号化装置６００は、減算器６０１、整数変換部６０２、量子化部６０３、エントロピー符号化部６０４、逆量子化部６０５、逆整数変換部６０６、加算器６０７、フレームメモリ６０８及び６１１、ループフィルタ６１０、インター予測部６１２、動き検出部６１３及びスイッチ６１４、イントラ予測部６１５から構成され、入力画像データを符号化処理して符号化データを出力する。 <First Embodiment>
FIG. 8 is a block diagram showing the configuration of the image encoding device 600 according to this embodiment. The image coding apparatus 600 includes a subtractor 601, an integer transform unit 602, a quantization unit 603, an entropy coding unit 604, an inverse quantization unit 605, an inverse integer transform unit 606, an adder 607, frame memories 608 and 611, a loop. It comprises a filter 610, an inter prediction unit 612, a motion detection unit 613, a switch 614, and an intra prediction unit 615, which encodes input image data and outputs encoded data.

続いて、図８の画像符号化装置６００の符号化プロセスについて説明する。 Next, the encoding process of the image encoding device 600 in FIG. 8 will be described.

画像符号化装置６００は、Ｈ．２６４符号化方式に従って符号化処理を行う。また、入力された画像データをブロックに分割し、そのブロック毎に処理を行う。 The image encoding device 600 is an H.264 format. The encoding process is performed according to the H.264 encoding method. Also, the input image data is divided into blocks, and processing is performed for each block.

まず、減算器６０１は、画像符号化装置６００に入力された画像データ（入力画像データ）から予測画像データを減算し、画像残差データを出力する。予測画像データの生成については後述する。 First, the subtractor 601 subtracts the predicted image data from the image data (input image data) input to the image encoding device 600, and outputs image residual data. The generation of predicted image data will be described later.

整数変換部６０２は、減算器６０１から出力された画像残差データを整数変換処理して、変換係数を出力する。そして、量子化部６０３は上記変換係数を所定の量子化パラメータを用いて量子化する。 The integer conversion unit 602 performs integer conversion processing on the image residual data output from the subtractor 601 and outputs a conversion coefficient. Then, the quantization unit 603 quantizes the transform coefficient using a predetermined quantization parameter.

エントロピー符号化部６０４は、量子化部６０３で量子化された変換係数を入力し、これをエントロピー符号化して、符号化データとして出力する。 The entropy encoding unit 604 receives the transform coefficient quantized by the quantization unit 603, entropy encodes it, and outputs it as encoded data.

一方、量子化部６０３で量子化された変換係数は、後続のブロックを符号化するための予測画像データの生成に使われる。逆量子化部６０５は、量子化部６０３で量子化された変換係数を逆量子化する。さらに、逆整数変換部６０６は逆量子化部６０５で逆量子化された変換係数を逆整数変換し、復号残差データとして出力する。加算器６０７は、復号残差データと予測画像データとを加算し、再構成画像データとして出力する。 On the other hand, the transform coefficient quantized by the quantization unit 603 is used to generate predicted image data for encoding the subsequent block. The inverse quantization unit 605 performs inverse quantization on the transform coefficient quantized by the quantization unit 603. Further, the inverse integer transform unit 606 performs inverse integer transform on the transform coefficient inversely quantized by the inverse quantization unit 605 and outputs it as decoded residual data. The adder 607 adds the decoded residual data and the predicted image data, and outputs the result as reconstructed image data.

再構成画像データはフレームメモリ６０８に保存されるとともに、ループフィルタ６１０を介して、フレームメモリ６１１に保存される。再構成画像データの中で、以降の予測で参照される可能性があるデータは、フレームメモリ６０８または６１１に暫くの期間、保存される。また、ループフィルタ６１０はブロックノイズを除去する為に用いられる。 The reconstructed image data is stored in the frame memory 608 and also stored in the frame memory 611 via the loop filter 610. Among the reconstructed image data, data that may be referred to in the subsequent prediction is stored in the frame memory 608 or 611 for a period of time. The loop filter 610 is used to remove block noise.

イントラ予測部６１５は、フレームメモリ６０８に保存された再構成画像データを用いてフレーム内予測処理を行い、予測画像データを生成する。また、インター予測部６１２は、フレームメモリ６１１に保存された再構成画像データを用いて、動き検出部６１３によって検出された動きベクトル情報に基づいてフレーム間予測処理を行い、予測画像データを生成する。動き検出部６１３は入力画像データにおける動きベクトルを検出し、検出した動きベクトル情報をインター予測部６１２とエントロピー符号化部６０４へ出力する。 The intra prediction unit 615 performs intra-frame prediction processing using the reconstructed image data stored in the frame memory 608, and generates predicted image data. In addition, the inter prediction unit 612 performs inter-frame prediction processing based on the motion vector information detected by the motion detection unit 613 using the reconstructed image data stored in the frame memory 611, and generates predicted image data. . The motion detection unit 613 detects a motion vector in the input image data, and outputs the detected motion vector information to the inter prediction unit 612 and the entropy encoding unit 604.

スイッチ６１４は、マクロブロック単位で、イントラ予測、インター予測のどちらを用いるか選択するための選択部である。イントラ予測部６１５からの出力とインター予測部６１２からの出力の一方を選択して、選択された予測画像データを減算器６０１、加算器６０７へ出力する。 The switch 614 is a selection unit for selecting whether to use intra prediction or inter prediction in units of macroblocks. One of the output from the intra prediction unit 615 and the output from the inter prediction unit 612 is selected, and the selected predicted image data is output to the subtracter 601 and the adder 607.

ここで、上記イントラ予測部６１５で行うイントラ予測について、図９及び図１０を用いて補足説明する。 Here, supplemental description will be given of intra prediction performed by the intra prediction unit 615 with reference to FIGS. 9 and 10.

イントラ予測では、同一画面内で、既に符号化された隣接画素の値を予測値として用いる。図９の様に符号化対象ブロックを４×４画素サイズ（ａからｐまでの画素）とした場合、その周辺にあるＡからＭまでの画素値を利用して予測が行われる。このとき、予測の方向を指定して、必要な画素値のみを予測に用いることになるが、予測の方向や内容が異なる９種類のイントラ予測モードが用意されているので、このうち最適な予測モードを選択することで予測を行う。 In intra prediction, the value of the adjacent pixel already encoded within the same screen is used as a predicted value. When the encoding target block has a 4 × 4 pixel size (pixels from a to p) as shown in FIG. 9, prediction is performed using pixel values from A to M around the block. At this time, the prediction direction is specified, and only necessary pixel values are used for the prediction. However, since nine types of intra prediction modes having different prediction directions and contents are prepared, the optimal prediction among them is prepared. Predict by selecting the mode.

用意されている各イントラ予測モードの予測方向を図１０に示す。例えば、Ｍｏｄｅ０のＶｅｒｔｉｃａｌ（垂直方向）予測モードの場合、隣接する垂直方向の画素（図９のＡ，Ｂ，Ｃ，Ｄの各画素に相当）の値を予測値として利用し、この画素値が垂直方向に連続していると予測する。これにより、図９のａ，ｅ，ｉ，ｍの各画素については、Ａの画素値が続くものと予測され、同様に、ｂ，ｆ，ｊ，ｎの各画素については、Ｂの画素値が続くものと予測される。以下、同様である。そして、予測した結果として、各画素の差分値、即ち、ａ−Ａ，ｅ−Ａ，ｉ−Ａ，ｍ−Ａ等を求めた後、整数変換や量子化が行われるのである。 The prediction directions of each prepared intra prediction mode are shown in FIG. For example, in the mode 0 vertical (vertical direction) prediction mode, the value of an adjacent vertical pixel (corresponding to each pixel of A, B, C, and D in FIG. 9) is used as a predicted value, and this pixel value Is predicted to be continuous in the vertical direction. Accordingly, it is predicted that the pixel value of A continues for each of the pixels a, e, i, and m in FIG. 9, and similarly, the pixel value of B for each of the pixels b, f, j, and n. Is expected to continue. The same applies hereinafter. Then, as a predicted result, after obtaining a difference value of each pixel, that is, aA, eA, iA, mA, etc., integer conversion and quantization are performed.

上記したＭｏｄｅ０のＶｅｒｔｉｃａｌ予測モード以外に、図１０の如きＭｏｄｅ１〜８までのイントラ予測モードが用意されている。なお、Ｍｏｄｅ２のＤＣ予測モードとは、左及び上の画素値の平均値を用いた予測モードである。各イントラ予測モードの詳細については、Ｈ．２６４関連の規格書等に開示されているので、ここでの説明は省略する。Ｈ．２６４では、このようにイントラ予測モードが９通り用意されているので、符号化対象画素に最適な予測モードを選択することが可能となり、予測の精度を向上させ、残差データの値が僅かで済むようにしている。 In addition to the above-described Vertical prediction mode of Mode 0, intra prediction modes of Modes 1 to 8 as shown in FIG. 10 are prepared. The DC prediction mode of Mode 2 is a prediction mode using an average value of the left and upper pixel values. Details of each intra prediction mode are described in H.264. Since it is disclosed in H.264-related standards and the like, description thereof is omitted here. H. In H.264, nine types of intra prediction modes are prepared in this way, so that it is possible to select the optimal prediction mode for the encoding target pixel, improve the prediction accuracy, and the residual data value is small. I am trying to do it.

また、イントラ予測部６１５における通常のイントラ予測モードの選択処理は、図１２のフローチャートで示したような手順で行われる。図１２において、対象となるブロックのイントラ予測が開始されると、予測モードを示すカウンタｉを用意し、ｉ＝０をセットする（ステップＳ１２０１）。次にｉの値で示される予測モードによりイントラ予測を行い（ステップＳ１２０２）、そのイントラ予測の精度を評価して、それまでに評価した他の予測モードと比べて最適であるものを選択する（ステップＳ１２０３）。次にｉをインクリメントし（ステップＳ１２０４）、ｉが予測モードの最大値９を超えない範囲でＳ１２０２からの手順を繰り返す（ステップＳ１２０５）。結果的に、符号化対象ブロック毎に、全ての（９通りの）イントラ予測モードを計算することで、最適なイントラ予測モードを選択可能である。 Further, the normal intra prediction mode selection processing in the intra prediction unit 615 is performed according to the procedure shown in the flowchart of FIG. In FIG. 12, when intra prediction of a target block is started, a counter i indicating a prediction mode is prepared, and i = 0 is set (step S1201). Next, intra prediction is performed according to the prediction mode indicated by the value of i (step S1202), the accuracy of the intra prediction is evaluated, and an optimum one is selected as compared with the other prediction modes evaluated so far ( Step S1203). Next, i is incremented (step S1204), and the procedure from S1202 is repeated within a range where i does not exceed the maximum value 9 of the prediction mode (step S1205). As a result, the optimal intra prediction mode can be selected by calculating all (9 types) of intra prediction modes for each encoding target block.

また、イントラ予測部６１５は入力される制御信号に応じて、上記した通常の選択処理とは異なる手順で、イントラ予測モードの選択処理を実行可能であるが、この内容については後述する。以上が、画像符号化装置６００及びイントラ予測部６１５の説明である。 Further, the intra prediction unit 615 can execute the intra prediction mode selection process in a procedure different from the above-described normal selection process in accordance with the input control signal. This content will be described later. The above is the description of the image encoding device 600 and the intra prediction unit 615.

次に、上記した画像符号化装置６００を搭載した撮像装置１００の構成を説明するとともに、撮像装置１００の動作と絡めてイントラ予測部６１５の更なる動作を説明していく。 Next, the configuration of the imaging apparatus 100 equipped with the above-described image encoding apparatus 600 will be described, and further operations of the intra prediction unit 615 will be described in conjunction with the operations of the imaging apparatus 100.

図１は、本実施形態における撮像装置１００の構成を表わしたブロック図である。撮像装置１００は、複数のレンズ群（ズームレンズ１０１ａ及び１０１ｂ、フォーカスレンズ１０１ｃを含む）から構成される撮影レンズ部１０１、ＣＣＤ又はＣＭＯＳ等の撮像素子１０２、カメラ信号処理部１０３、符号化部１０４、カメラ制御部１０５、フォーカス（Ｆｏｃｕｓ）検出部１０６、フォーカスモータ（Ｍｏ）１０７、ズームモータ（Ｍｏ）１０８、動きセンサー１０９、各種機能の操作部１１０から構成され、撮影レンズ部１０１から入射した光学像を、画像信号に変換してからＨ．２６４符号化方式に従って圧縮符号化処理して、符号化データを出力する。なお、符号化部１０４は、上記した画像符号化装置６００に相当する。撮像装置１００は画像符号化装置６００を画像のＨ．２６４エンコーダとして搭載している。 FIG. 1 is a block diagram illustrating the configuration of the imaging apparatus 100 according to the present embodiment. The imaging apparatus 100 includes a photographing lens unit 101 including a plurality of lens groups (including zoom lenses 101a and 101b and a focus lens 101c), an imaging element 102 such as a CCD or a CMOS, a camera signal processing unit 103, and an encoding unit 104. , A camera control unit 105, a focus detection unit 106, a focus motor (Mo) 107, a zoom motor (Mo) 108, a motion sensor 109, and an operation unit 110 having various functions. The image is converted into an image signal and then H.264 is converted. The encoded data is output after compression encoding according to the H.264 encoding method. Note that the encoding unit 104 corresponds to the image encoding device 600 described above. The imaging apparatus 100 causes the image encoding apparatus 600 to perform image H.264. It is installed as a H.264 encoder.

続いて、図１の撮像装置１００の撮影プロセスについて説明する。 Next, a photographing process of the imaging device 100 in FIG. 1 will be described.

撮影動作を開始したならば、撮影レンズ部１０１から入射した光学像を撮像素子１０２で電気信号に変換する。このとき、フォーカスモータ（Ｍｏ）１０７によりフォーカスレンズ１０１ｃを動かして焦点調整を行え、ズームモータ（Ｍｏ）１０８によりズームレンズ１０１ａ及び１０１ｂを動かして、光学ズーム調整を行える。さらに、動きセンサー１０９で検出した動き情報に応じて、不図示の光学防振機構或いは電子防振機能を動作させて、動きに対する補正（手ぶれ補正）も行える。 When the photographing operation is started, the optical image incident from the photographing lens unit 101 is converted into an electric signal by the image sensor 102. At this time, the focus lens 101c is moved by the focus motor (Mo) 107 to perform focus adjustment, and the zoom lenses 101a and 101b are moved by the zoom motor (Mo) 108 to perform optical zoom adjustment. Furthermore, according to the motion information detected by the motion sensor 109, an optical anti-shake mechanism (not shown) or an electronic anti-shake function is operated to correct motion (camera shake correction).

撮像素子１０２から出力された電気信号は、画像信号として、カメラ信号処理部１０３において必要な輝度信号処理や色信号処理が行われる。さらに、カメラ信号処理部１０３から出力された画像データは、符号化部１０４において符号化処理が行われ、符号化データになって出力される。ここでの符号化処理については、図８を用いて既に説明したとおりである。なお、符号化データは不図示の記録手段によりディスク等の記録媒体に記録可能な形態となる。 The electric signal output from the image sensor 102 is subjected to necessary luminance signal processing and color signal processing in the camera signal processing unit 103 as an image signal. Further, the image data output from the camera signal processing unit 103 is subjected to encoding processing in the encoding unit 104, and is output as encoded data. The encoding process here is as already described with reference to FIG. The encoded data can be recorded on a recording medium such as a disk by recording means (not shown).

各部の動作を制御するのはカメラ制御部１０５である。操作部１１０を介してユーザ指示があった場合は、その指示に従って各部の制御を行う。 The camera control unit 105 controls the operation of each unit. When there is a user instruction through the operation unit 110, each unit is controlled according to the instruction.

また、オートフォーカス時は、フォーカス検出部１０６が撮像素子１０２から得られる信号を元にしてフォーカスの状態を検出し、フォーカス情報としてカメラ制御部１０５に伝達する。カメラ制御部１０５は、当該フォーカス情報に従って、フォーカスモータ（Ｍｏ）１０７に制御信号を送り、最適なフォーカス状態となるようなフォーカス制御を行う。また、マニュアルフォーカス時は、カメラ制御部１０５は、操作部１１０から入力される指示に従って、フォーカスモータ（Ｍｏ）１０７に制御信号を送り、所望のフォーカス状態となるようなフォーカス制御を行う。 During autofocus, the focus detection unit 106 detects the focus state based on a signal obtained from the image sensor 102 and transmits the focus state to the camera control unit 105 as focus information. The camera control unit 105 sends a control signal to the focus motor (Mo) 107 according to the focus information, and performs focus control so that an optimal focus state is obtained. During manual focus, the camera control unit 105 sends a control signal to the focus motor (Mo) 107 in accordance with an instruction input from the operation unit 110 to perform focus control so that a desired focus state is achieved.

また、カメラ制御部１０５は、操作部１１０から入力されるズーム指示に従って、ズームモータ（Ｍｏ）１０８に制御信号を送り、所望のズーム位置となるようなズーム制御を行う。 In addition, the camera control unit 105 sends a control signal to the zoom motor (Mo) 108 in accordance with the zoom instruction input from the operation unit 110 to perform zoom control so that a desired zoom position is obtained.

また、動きセンサー１０９は、加速度センサー等を利用したものであって、手ぶれ補正用の動き検出以外に、撮像装置１００の移動検出が行え、具体的には、パンやチルト等の撮影動作を検出し、その情報をカメラ制御部１０５に伝達する。 The motion sensor 109 uses an acceleration sensor or the like, and can detect movement of the imaging apparatus 100 in addition to motion detection for camera shake correction. Specifically, it detects a shooting operation such as panning or tilting. and transmits the information to the camera control unit 105.

ここで、動きセンサー１０９によってパン又はチルトが検出されたときの、カメラ制御部１０５の制御動作について説明する。 Here, a control operation of the camera control unit 105 when panning or tilting is detected by the motion sensor 109 will be described.

図２は撮影中にパンニング動作が行われたときの撮影画像の様子を表わしている。図２で示したような白矢印の方向に画面がパンニングされた場合、撮影画像は全体的に白矢印の方向（水平方向）に流れたものとなる。そのため、符号化部１０４で当該撮影画像に対してイントラ予測を行う場合、水平方向が最適な予測方向となると推測できる。このとき、図１１の画像では画面上の各ブロックでばらばらであったイントラ予測の方向が、図２のように各ブロックがパンニングの方向に合った規則的な方向を示すようになるとも推測できるので、結果として、各ブロックで一様に、水平方向のイントラ予測（ｍｏｄｅ１のＨｏｒｉｚｏｎｔａｌ予測モード）を選択すれば、最適或いは最適に近いイントラ予測モードを効率的に選択できることになる。垂直方向のチルトについては、同様にｍｏｄｅ０のＶｅｒｔｉｃａｌ予測モードを選択すればよい。また、パンやチルトが斜め方向に傾いているときには、その斜め成分を考慮して、いずれかの斜め方向の予測モードを選択すればよい。 FIG. 2 shows a captured image when a panning operation is performed during shooting. When the screen is panned in the direction of the white arrow as shown in FIG. 2, the entire captured image flows in the direction of the white arrow (horizontal direction). Therefore, when the encoding unit 104 performs intra prediction on the captured image, it can be estimated that the horizontal direction is the optimal prediction direction. At this time, it can be assumed that the direction of intra prediction, which is different in each block on the screen in the image of FIG. 11, becomes a regular direction in which each block matches the panning direction as shown in FIG. Therefore, as a result, if the intra prediction in the horizontal direction (Horizontal prediction mode of mode 1) is selected uniformly in each block, the optimal or near-optimal intra prediction mode can be efficiently selected. For vertical tilt, the mode 0 vertical prediction mode may be selected in the same manner. Further, when pan or tilt is inclined in an oblique direction, any one of the oblique prediction modes may be selected in consideration of the oblique component.

このような思想に基づき、カメラ制御部１０５は、符号化部１０４（すなわち、画像符号化装置６００）内のイントラ予測部６１５に対して、パンやチルトといった撮影動作に従ったイントラ予測モードを選択するための制御信号を送ることを特徴としている。また、イントラ予測部６１５は、受信した当該制御信号に応じて、上記した通常のイントラ予測モードの選択処理に変えて、パンやチルトの方向に従ってイントラ予測モードを一意に選択したり、あるいはパンやチルトの方向に従って９種類ある選択肢を２，３種類に絞り込んでから最適なイントラ予測モードを選択したりする。これにより、イントラ予測モードの選択の際、全てのイントラ予測モードを試してみるよりも、はるかに効率的に最適なイントラ予測モードの選択が行える。 Based on such a concept, the camera control unit 105 selects an intra prediction mode according to a shooting operation such as panning or tilting with respect to the intra prediction unit 615 in the encoding unit 104 (that is, the image encoding device 600). It is characterized by sending a control signal for this purpose. Also, the intra prediction unit 615 changes the normal intra prediction mode selection process described above according to the received control signal, and uniquely selects the intra prediction mode according to the pan and tilt directions. Depending on the tilt direction, nine options are narrowed down to a few, and then the optimal intra prediction mode is selected. Thereby, when selecting the intra prediction mode, the optimal intra prediction mode can be selected much more efficiently than trying all the intra prediction modes.

図３は上記した如きパンやチルトの方向に応じたイントラ予測モードの選択処理を行うときの、イントラ予測部６１５の動作フローを図示したフローチャートである。 FIG. 3 is a flowchart illustrating an operation flow of the intra prediction unit 615 when performing the intra prediction mode selection process according to the pan and tilt directions as described above.

イントラ予測を行う画像に対して、動きセンサー１０９によってパン又はチルトが検出されたときは（ステップＳ３０１）、検出されたパン又はチルトの方向や速度に関する情報をカメラ制御部１０５より取得する（ステップＳ３０２）。動きセンサー１０９によってパン又はチルトが検出されないときは（ステップＳ３０１）、図１２で示したような通常のイントラ予測モードの選択処理を行って、最適なイントラ予測モードを決定してからイントラ予測を行う（ステップＳ３０７）。 When pan or tilt is detected by the motion sensor 109 with respect to the image to be subjected to intra prediction (step S301), information on the detected pan or tilt direction and speed is acquired from the camera control unit 105 (step S302). ). When pan or tilt is not detected by the motion sensor 109 (step S301), the normal intra prediction mode selection process as shown in FIG. 12 is performed, and the intra prediction is performed after determining the optimal intra prediction mode. (Step S307).

ステップＳ３０２で取得したパン又はチルトの方向や速度に関する情報に基づいて、パンやチルトの方向に概ね合致する予測方向を自動選択する（ステップＳ３０３）。ステップＳ３０３において、Ｖｅｒｔｉｃａｌ（垂直方向）が選択されたときは、ｍｏｄｅ０のＶｅｒｔｉｃａｌ予測モードでイントラ予測を行い（ステップＳ３０４）、Ｈｏｒｉｚｏｎｔａｌ（水平方向）が選択されたときは、ｍｏｄｅ１のＨｏｒｉｚｏｎｔａｌ予測モードでイントラ予測を行い（ステップＳ３０５）、Ｄｏｗｎ−Ｒｉｇｈｔ（右斜め下方向）が選択されたときは、ｍｏｄｅ４のＤｏｗｎ−Ｒｉｇｈｔ予測モードでイントラ予測を行う（ステップＳ３０６）。 Based on the information about the pan or tilt direction and speed acquired in step S302, a predicted direction that roughly matches the pan or tilt direction is automatically selected (step S303). In step S303, when Vertical (vertical direction) is selected, intra prediction is performed in the vertical prediction mode of mode 0 (step S304). When Horizontal (horizontal direction) is selected, horizontal prediction mode of mode 1 Intra prediction is performed (step S305), and when Down-Right (downward rightward) is selected, intra prediction is performed in Mode 4 Down-Right prediction mode (step S306).

なお、斜め方向の予測については、動きセンサー１０９の検出精度を高めることで、他の斜め方向の予測モード（ｍｏｄｅ３，５，６，７，８）も選択できるようにすれば、より予測精度を高められる。 As for the prediction in the oblique direction, if the detection accuracy of the motion sensor 109 is increased so that other oblique prediction modes (modes 3, 5, 6, 7, and 8) can be selected, the prediction accuracy is further increased. Can be enhanced.

また、図３ではパンやチルトが行われた場合、その方向に従ってイントラ予測モードを一意に選択する構成としたが、パンやチルトの方向及び速度に従って、９種類ある予測モードを２，３種類に絞り込んだり、或いは明らかに予測方向が異なると判断する予測モードを除外したりして、選択肢を減らしてから、残った候補のうちから最適な予測モードを演算で求めるようにしてもよい。 In FIG. 3, when panning or tilting is performed, the intra prediction mode is uniquely selected according to the direction. However, according to the panning and tilting direction and speed, nine types of prediction modes are divided into two or three types. After narrowing down or excluding the prediction modes that are clearly judged to have different prediction directions, the number of options may be reduced, and the optimal prediction mode may be obtained by calculation from the remaining candidates.

このように本実施形態によれば、パンやチルトといった撮影時の情報をイントラ予測モードの選択時に利用することで、最適なイントラ予測モードを容易に選択できるとともに、符号化にかかる処理負荷も軽減できる。 As described above, according to the present embodiment, by using information at the time of shooting such as pan and tilt when selecting the intra prediction mode, it is possible to easily select the optimal intra prediction mode and to reduce the processing load for encoding. it can.

＜第２の実施形態＞
次に、図４及び図５を用いて本発明の第２の実施形態について説明する。 <Second Embodiment>
Next, a second embodiment of the present invention will be described with reference to FIGS.

本実施形態では、第１の実施形態で説明した撮像装置１００がズーム動作するときの、カメラ制御部１０５の制御動作並びにイントラ予測部６１５の動作について説明する。 In the present embodiment, the control operation of the camera control unit 105 and the operation of the intra prediction unit 615 when the imaging apparatus 100 described in the first embodiment performs a zoom operation will be described.

撮影中にズーム動作が行われた場合、ズーム・インもしくはズーム・アウトのいずれについても、ズーム中の撮影画像は画面の中心から放射状に流れた画像となる。そのため、符号化部１０４で当該撮影画像に対してイントラ予測を行う場合、符号化対象ブロックが画面の中心に対してどの方向に位置するかを考慮し、その方向に合った予測をすれば、予測効率が良いことになる。 When a zoom operation is performed during shooting, the shot image being zoomed is an image that flows radially from the center of the screen for both zoom-in and zoom-out. Therefore, in the case where the encoding unit 104 performs intra prediction on the captured image, in consideration of which direction the encoding target block is located with respect to the center of the screen and performing prediction that matches the direction, The prediction efficiency will be good.

このとき、図１１の画像では画面上の各ブロックでばらばらであったイントラ予測の方向が、ズームによって図４の各ブロックのような予測方向を示すようになると推測できる。すなわち、画面の左上部のブロックでは、画面の中心に向かって斜め方向に画像が流れるので、ｍｏｄｅ４のＤｏｗｎ−Ｒｉｇｈｔ予測モードが最適といえる。同様に、画面の右上部のブロックでは、ｍｏｄｅ３のＤｏｗｎ−Ｌｅｆｔ予測モードが最適といえる。ここで、Ｈ．２６４のイントラ予測では、予測方向が隣接する左または上側のブロックの値に限られるため、画面の左下部のブロックでは、ｍｏｄｅ３のＤｏｗｎ−Ｌｅｆｔ予測モードが最適といえる。同様に、画面の右下部のブロックでは、ｍｏｄｅ４のＤｏｗｎ−Ｒｉｇｈｔ予測モードが最適といえる。 At this time, it can be inferred that the intra prediction direction, which is different in each block on the screen in the image of FIG. 11, shows the prediction direction as in each block of FIG. 4 by zooming. That is, in the upper left block of the screen, the image flows in an oblique direction toward the center of the screen, so the mode 4 Down-Right prediction mode can be said to be optimal. Similarly, in the block in the upper right part of the screen, the mode 3 Down-Left prediction mode can be said to be optimal. Here, H. In the H.264 intra prediction, since the prediction direction is limited to the value of the adjacent left or upper block, the mode 3 Down-Left prediction mode can be said to be optimal in the lower left block of the screen. Similarly, it can be said that the mode 4 Down-Right prediction mode is optimal in the lower right block of the screen.

このような思想に基づき、カメラ制御部１０５は、符号化部１０４（すなわち、画像符号化装置６００）内のイントラ予測部６１５に対して、ズーム動作に従ったイントラ予測モードを選択するための制御信号を送ることを特徴としている。また、イントラ予測部６１５は、受信した当該制御信号に応じて、上記した通常のイントラ予測モードの選択処理に変えて、ズーム中の画像に対しては、符号化対象ブロックの位置に応じてイントラ予測モードを一意に選択したり、あるいは９種類ある選択肢を２，３種類に絞り込んでから最適なイントラ予測モードを選択したりする。これにより、イントラ予測モードの選択の際、全てのイントラ予測モードを試してみるよりも、はるかに効率的に最適なイントラ予測モードの選択が行える。 Based on this idea, the camera control unit 105 controls the intra prediction unit 615 in the encoding unit 104 (that is, the image encoding apparatus 600) to select an intra prediction mode according to the zoom operation. It is characterized by sending a signal. In addition, the intra prediction unit 615 changes to the above-described normal intra prediction mode selection process according to the received control signal, and for the image being zoomed, the intra prediction unit 615 corresponds to the position of the encoding target block. The prediction mode is uniquely selected, or the optimum intra prediction mode is selected after narrowing down the nine types of options to a few types. Thereby, when selecting the intra prediction mode, the optimal intra prediction mode can be selected much more efficiently than trying all the intra prediction modes.

図５は上記した如きズーム中の画像に対してイントラ予測モードの選択処理を行うときの、イントラ予測部６１５の動作フローを図示したフローチャートである。 FIG. 5 is a flowchart illustrating an operation flow of the intra prediction unit 615 when the intra prediction mode selection process is performed on the zoomed image as described above.

イントラ予測を行う画像に対して、操作部１１０からのズーム指示に応じてズームレンズ１０１ａ及び１０１ｂを動かすといったズーム動作が行われたときは（ステップＳ５０１）、ズーム中を示す情報及びズーム速度に関する情報をカメラ制御部１０５より取得する（ステップＳ５０２）。ズーム動作が行われないとき（超低速ズーム時を含んでも良い）は（ステップＳ５０１）、図１２で示したような通常のイントラ予測モードの選択処理を行って、最適なイントラ予測モードを決定してからイントラ予測を行う（ステップＳ５０６）。 When a zoom operation such as moving the zoom lenses 101a and 101b in response to a zoom instruction from the operation unit 110 is performed on an image to be subjected to intra prediction (step S501), information indicating that the zoom is being performed and information regarding the zoom speed Is acquired from the camera control unit 105 (step S502). When the zoom operation is not performed (including the time of ultra-low speed zoom) (step S501), the normal intra prediction mode selection process as shown in FIG. 12 is performed to determine the optimum intra prediction mode. Then, intra prediction is performed (step S506).

ステップＳ５０２でズーム中を示す情報及びズーム速度に関する情報を取得したときは、符号化対象となるブロックが位置する画面上の領域を判別し、その領域に応じた予測方向を自動選択する（ステップＳ５０３）。すなわち、画面の左上領域又は右下領域のブロックならば、ｍｏｄｅ４のＤｏｗｎ−Ｒｉｇｈｔ予測モードを、画面の右上領域及び左下領域のブロックならば、ｍｏｄｅ３のＤｏｗｎ−Ｌｅｆｔ予測モードを選択する。また、中心付近領域（Ｃｅｎｔｅｒ）のブロックならば、予測モードを定められないので、通常のイントラ予測モードの選択処理を行って、最適なイントラ予測モードを決定してからイントラ予測を行う（ステップＳ５０６）。ステップＳ５０３において、Ｄｏｗｎ−Ｌｅｆｔ予測モードが選択されたときは、ｍｏｄｅ３のＤｏｗｎ−Ｌｅｆｔ予測モードでイントラ予測を行い（ステップＳ５０４）、Ｄｏｗｎ−Ｒｉｇｈｔ予測モードが選択されたときは、ｍｏｄｅ４のＤｏｗｎ−Ｒｉｇｈｔ予測モードでイントラ予測を行う（ステップＳ５０５）。 When information indicating that the zoom is being performed and information regarding the zoom speed are acquired in step S502, a region on the screen where the block to be encoded is located is determined, and a prediction direction corresponding to the region is automatically selected (step S503). ). That is, if the block is in the upper left area or the lower right area of the screen, the mode 4 Down-Right prediction mode is selected. If the block is in the upper right area and the lower left area of the screen, the mode 3 Down-Left prediction mode is selected. In addition, since the prediction mode cannot be determined if the block is in the center vicinity region (Center), the normal intra prediction mode selection process is performed to determine the optimal intra prediction mode (step S506). ). In Step S503, when the Down-Left prediction mode is selected, intra prediction is performed in the Mode 3 Down-Left prediction mode (Step S504), and when the Down-Right prediction mode is selected, the Mode 4 Down is selected. -Intra prediction is performed in Right prediction mode (step S505).

なお、画面上の領域をより細分化して分類することにより、画面の中心からみて真上又は真下はｍｏｄｅ０のＶｅｒｔｉｃａｌ予測モードを選択し、画面の中心からみて左側又は右側はｍｏｄｅ１のＨｏｒｉｚｏｎｔａｌ予測モードを選択するようにするとより効果的である。さらに領域を細分化して、他の斜め方向の予測モード（ｍｏｄｅ５，６，７，８）も選択できるようにすれば、より予測精度を高められる。 By subdividing and classifying the area on the screen, mode 0 vertical prediction mode is selected directly above or below the center of the screen, and mode 1 horizonal prediction is on the left or right side when viewed from the center of the screen. It is more effective to select the mode. Further, if the region is subdivided so that other oblique prediction modes (modes 5, 6, 7, and 8) can be selected, the prediction accuracy can be further improved.

また、図５ではズームが行われた場合、符号化対象となるブロックの位置する領域に応じて、中心付近以外のときはイントラ予測モードを一意に選択する構成としたが、９種類ある予測モードを２，３種類に絞り込んだり、或いは明らかに予測方向が異なると判断する予測モードを除外したりして、選択肢を減らしてから、残った候補のうちから最適な予測モードを演算で求めるようにしてもよい。 Also, in FIG. 5, when zooming is performed, the intra prediction mode is uniquely selected when it is not near the center according to the area where the block to be encoded is located. By narrowing down to two or three types, or excluding prediction modes that clearly determine that the prediction direction is different, and reducing the number of options, the optimal prediction mode can be calculated from the remaining candidates. May be.

このように本実施形態によれば、ズームといった撮影時の情報をイントラ予測モードの選択時に利用することで、最適なイントラ予測モードを容易に選択できるとともに、符号化にかかる処理負荷も軽減できる。 As described above, according to the present embodiment, by using information at the time of shooting such as zooming at the time of selecting an intra prediction mode, it is possible to easily select an optimal intra prediction mode and to reduce the processing load for encoding.

＜第３の実施形態＞
次に、図６及び図７を用いて本発明の第３の実施形態について説明する。 <Third Embodiment>
Next, a third embodiment of the present invention will be described with reference to FIGS.

本実施形態では、第１の実施形態で説明した撮像装置１００がフォーカス動作するときの、カメラ制御部１０５の制御動作並びにイントラ予測部６１５の動作について説明する。 In the present embodiment, the control operation of the camera control unit 105 and the operation of the intra prediction unit 615 when the imaging apparatus 100 described in the first embodiment performs a focusing operation will be described.

撮影中にフォーカス動作が行われる場合、合焦していないときは、撮影画像の画面全体がボケた状態となる。そして、符号化部１０４でボケた撮影画像に対してイントラ予測を行う場合、特定の方向は無く、画素の平均値を利用するＤＣ予測モードの予測効率が良いことになる。 When a focus operation is performed during shooting, the entire screen of the shot image is out of focus when not in focus. And when performing intra prediction with respect to the photographic image blurred by the encoding part 104, there is no specific direction and the prediction efficiency of DC prediction mode using the average value of a pixel is good.

このとき、図１１の画像は合焦している状態であるので画面上の各ブロックでばらばらの方向を示していたイントラ予測の方向が、画像がボケ状態のときは画面全体にわたり図６のようにＤＣ予測を示すようになると推測できる。 At this time, since the image in FIG. 11 is in a focused state, the direction of intra prediction that indicates the direction of disparity in each block on the screen is as shown in FIG. 6 over the entire screen when the image is in a blurred state. It can be estimated that DC prediction is shown in FIG.

このような思想に基づき、カメラ制御部１０５は、符号化部１０４（すなわち、画像符号化装置６００）内のイントラ予測部６１５に対して、フォーカス状態に従ったイントラ予測モードを選択するための制御信号を送ることを特徴としている。また、イントラ予測部６１５は、受信した当該制御信号に応じて、上記した通常のイントラ予測モードの選択処理に変えて、ボケた画像に対してはＤＣイントラ予測モードを一意に選択する。これにより、イントラ予測モードの選択の際、全てのイントラ予測モードを試してみるよりも、はるかに効率的に最適なイントラ予測モードの選択が行える。 Based on such a concept, the camera control unit 105 controls the intra prediction unit 615 in the encoding unit 104 (that is, the image encoding apparatus 600) to select an intra prediction mode according to the focus state. It is characterized by sending a signal. Further, the intra prediction unit 615 uniquely selects the DC intra prediction mode for a blurred image instead of the above-described normal intra prediction mode selection processing according to the received control signal. Thereby, when selecting the intra prediction mode, the optimal intra prediction mode can be selected much more efficiently than trying all the intra prediction modes.

図７は上記した如きボケた画像、特にフォーカス非合焦状態で撮影された画像に対してイントラ予測モードの選択処理を行うときの、イントラ予測部６１５の動作フローを図示したフローチャートである。 FIG. 7 is a flowchart illustrating an operation flow of the intra prediction unit 615 when the intra prediction mode selection process is performed on the above-described blurred image, particularly an image photographed in an out-of-focus state.

イントラ予測を行う画像に対して、フォーカスが合焦状態で撮影されたか、或いは非合焦状態で撮影されたかを示すフォーカス情報をカメラ制御部１０５より取得する（ステップＳ７０１）。フォーカス情報により、フォーカスが合焦していると判別したときは（ステップＳ７０２）、図１２で示したような通常のイントラ予測モードの選択処理を行って、最適なイントラ予測モードを決定してからイントラ予測を行う（ステップＳ７０４）。 Focus information indicating whether the focus is captured in an in-focus state or an out-of-focus state is acquired from the camera control unit 105 with respect to an image to be subjected to intra prediction (step S701). When it is determined from the focus information that the focus is in focus (step S702), the normal intra prediction mode selection process as shown in FIG. 12 is performed, and the optimum intra prediction mode is determined. Intra prediction is performed (step S704).

一方、フォーカス情報により、フォーカスが非合焦（ボケた画像）であると判別したときは（ステップＳ７０２）、ＤＣ予測モードを自動選択し、ｍｏｄｅ２のＤＣ予測モードでイントラ予測を行う（ステップＳ７０３）。 On the other hand, when it is determined from the focus information that the focus is out of focus (blurred image) (step S702), the DC prediction mode is automatically selected, and intra prediction is performed in the mode 2 DC prediction mode (step S703). ).

このように本実施形態によれば、フォーカスといった撮影時の情報をイントラ予測モードの選択時に利用することで、最適なイントラ予測モードを容易に選択できるとともに、符号化にかかる処理負荷も軽減できる。 As described above, according to the present embodiment, by using information at the time of shooting such as focus at the time of selecting an intra prediction mode, it is possible to easily select an optimal intra prediction mode and to reduce a processing load related to encoding.

なお、第１〜第３の実施形態で説明した本発明の実施例は、それぞれ単独で用いても良いが、各実施例を組み合わせて用いるよう構成しても良い。各実施例を組み合わせて用いることで、様々な撮影状態に対応して最適なイントラ予測モードを効率的に選択できるとともに、符号化にかかる処理負荷も軽減できる。 In addition, although the Example of this invention demonstrated by the 1st-3rd embodiment may each be used independently, you may comprise so that it may be used combining each Example. By using the embodiments in combination, it is possible to efficiently select an optimal intra prediction mode corresponding to various shooting states and to reduce the processing load for encoding.

＜その他の実施形態＞
また、本発明の目的は、前述した実施形態の機能を実現するソフトウェアのプログラムコードを記録した記憶媒体を、システムあるいは装置に供給し、そのシステムあるいは装置のコンピュータ（またはＣＰＵやＭＰＵ）が記憶媒体に格納されたプログラムコードを読み出し実行することによっても、達成されることは言うまでもない。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施形態の機能を実現することになり、そのプログラムコードを記憶した記憶媒体は本発明を構成することになる。また、コンピュータが読み出したプログラムコードを実行することにより、前述した実施形態の機能が実現されるだけでなく、そのプログラムコードの指示に基づき、コンピュータ上で稼働しているオペレーティングシステム（ＯＳ）などが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。ここでプログラムコードを記憶する記憶媒体としては、例えば、フレキシブルディスク、ハードディスク、ＲＯＭ、ＲＡＭ、磁気テープ、不揮発性のメモリカード、ＣＤ−ＲＯＭ、ＣＤ−Ｒ、ＤＶＤ、ブルーレイディスク、光ディスク、光磁気ディスク、ＭＯなどが考えられる。 <Other embodiments>
Another object of the present invention is to supply a storage medium recording a program code of software for realizing the functions of the above-described embodiments to a system or apparatus, and the computer (or CPU or MPU) of the system or apparatus stores the storage medium. Needless to say, this can also be achieved by reading and executing the program code stored in. In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiments, and the storage medium storing the program code constitutes the present invention. Further, by executing the program code read by the computer, not only the functions of the above-described embodiments are realized, but also an operating system (OS) running on the computer based on the instruction of the program code. It goes without saying that a case where the function of the above-described embodiment is realized by performing part or all of the actual processing and the processing is included. Examples of the storage medium for storing the program code include a flexible disk, hard disk, ROM, RAM, magnetic tape, nonvolatile memory card, CD-ROM, CD-R, DVD, Blu-ray disk, optical disk, and magneto-optical disk. , MO, etc. are conceivable.

さらに、記憶媒体から読み出されたプログラムコードが、コンピュータに挿入された機能拡張カードやコンピュータに接続された機能拡張ユニットに備わるメモリに書込まれた後、そのプログラムコードの指示に基づき、その機能拡張カードや機能拡張ユニットに備わるＣＰＵなどが実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 Furthermore, after the program code read from the storage medium is written into a memory provided in a function expansion card inserted into the computer or a function expansion unit connected to the computer, the function is determined based on the instruction of the program code. It goes without saying that the CPU or the like provided in the expansion card or the function expansion unit performs part or all of the actual processing and the functions of the above-described embodiments are realized by the processing.

本発明を上記記憶媒体に適用する場合、その記憶媒体には、先に説明したフローチャート図に対応するプログラムコードが格納されることになる。 When the present invention is applied to the storage medium, the storage medium stores program codes corresponding to the flowcharts described above.

撮像装置１００の構成を表わしたブロック図である。1 is a block diagram illustrating a configuration of an imaging apparatus 100. FIG. パンニング中の画像に対するイントラ予測方向の例を示した図である。It is the figure which showed the example of the intra prediction direction with respect to the image in panning. パンやチルトの方向に応じたイントラ予測モードの選択処理を図示したフローチャートである。It is the flowchart which illustrated the selection process of the intra prediction mode according to the direction of pan or tilt. ズーム中の画像に対するイントラ予測方向の例を示した図である。It is the figure which showed the example of the intra prediction direction with respect to the image in zoom. ズームに応じたイントラ予測モードの選択処理を図示したフローチャートである。It is the flowchart which illustrated the selection process of the intra prediction mode according to zoom. ボケた画像に対するイントラ予測の例を示した図である。It is the figure which showed the example of the intra prediction with respect to a blurred image. フォーカスに応じたイントラ予測モードの選択処理を図示したフローチャートである。It is the flowchart which illustrated the selection process of the intra prediction mode according to a focus. 画像符号化装置６００の構成を表わしたブロック図である。2 is a block diagram illustrating a configuration of an image encoding device 600. FIG. イントラ予測に用いる画素の説明図である。It is explanatory drawing of the pixel used for intra prediction. イントラ予測における各予測モードの図である。It is a figure of each prediction mode in intra prediction. 通常の画像におけるイントラ予測方向の例を示した図である。It is the figure which showed the example of the intra prediction direction in a normal image. 通常のイントラ予測モードの選択処理を図示したフローチャートである。It is the flowchart which illustrated the selection process of normal intra prediction mode.

Explanation of symbols

１０１撮影レンズ部
１０２撮像素子
１０３カメラ信号処理部
１０４符号化部
１０５カメラ制御部
１０６フォーカス検出部
１０７フォーカスモータ
１０８ズームモータ
１０９動きセンサー
１１０操作部
６０１減算器
６０２整数変換部
６０３量子化部
６０４エントロピー符号化部
６０５逆量子化部
６０６逆整数変換部
６０７加算器
６０８フレームメモリ
６１０ループフィルタ
６１１フレームメモリ
６１２インター予測部
６１３動き検出部
６１４スイッチ
６１５イントラ予測部 DESCRIPTION OF SYMBOLS 101 Photographic lens part 102 Image pick-up element 103 Camera signal processing part 104 Encoding part 105 Camera control part 106 Focus detection part 107 Focus motor 108 Zoom motor 109 Motion sensor 110 Operation part 601 Subtractor 602 Integer conversion part 603 Quantization part 604 Entropy code Conversion unit 605 Inverse quantization unit 606 Inverse integer conversion unit 607 Adder 608 Frame memory 610 Loop filter 611 Frame memory 612 Inter prediction unit 613 Motion detection unit 614 Switch 615 Intra prediction unit

Claims

An image encoding device that selects one of a plurality of prediction modes and predicts a pixel value of a block to be encoded in the same frame using a pixel value in the frame according to the selected prediction mode to generate predicted image data In
Means for inputting an image signal to be encoded;
It means for obtaining photographic information of a camera zoom operation state of when the image signal is generated,
In encoding of the input image signal, when the pixel value of the encoding target block in the same frame is predicted using the pixel value in the frame, it indicates that the shooting information is being zoomed. In this case, the image coding apparatus further comprises means for selecting a prediction mode corresponding to the position of the coding target block .

The means for selecting the prediction mode, when the shooting information indicates that a zoom operation is being performed, when the image represented by the image signal is divided into a plurality of regions, The image coding apparatus according to claim 1, wherein a region including a block is determined , and a predetermined prediction mode is selected for each region.

The image encoding apparatus according to claim 2 , wherein the means for selecting the prediction mode selects a prediction mode having the same prediction direction for each region.

An image encoding device that selects one of a plurality of prediction modes and predicts a pixel value of a block to be encoded in the same frame using a pixel value in the frame according to the selected prediction mode to generate predicted image data In
Means for inputting an image signal to be encoded;
Means for acquiring shooting information indicating a focus operation state of the camera when the image signal is generated;
In the encoding of the input image signal, when the pixel value of the encoding target block in the same frame is predicted using the pixel value in the frame, the shooting information indicates a focus in-focus state An image encoding apparatus comprising: means for selecting a prediction mode by different processing depending on whether the focus is out of focus .

5. The image encoding apparatus according to claim 4 , wherein the prediction mode selection unit selects a predetermined prediction mode when the shooting information indicates a focus out-of-focus state .

Imaging means for outputting an image signal obtained by imaging;
An encoding means for compressing and encoding the image signal, selecting any one of a plurality of prediction modes and predicting a pixel value in the same frame using a pixel value in the frame according to the selected prediction mode In an imaging apparatus including an encoding unit that performs processing for generating predicted image data,
The image processing device further includes control means for controlling a zoom operation in the image pickup means, and the encoding means uses information output from the control means when predicting pixel values in the same frame using pixel values in the frame. And an image pickup device that divides the image into a plurality of regions and selects a predetermined prediction mode for each region.

Imaging means for outputting an image signal obtained by imaging;
An encoding means for compressing and encoding the image signal, selecting any one of a plurality of prediction modes and predicting a pixel value in the same frame using a pixel value in the frame according to the selected prediction mode In an imaging apparatus including an encoding unit that performs processing for generating predicted image data,
The image processing device further includes control means for controlling a focus operation in the image pickup means, and the encoding means uses information output from the control means when predicting pixel values in the same frame using pixel values in the frame. An imaging apparatus, wherein a predetermined prediction mode is selected for an image captured when it is determined that the focus is out of focus.