JP5328854B2

JP5328854B2 - Motion vector detection apparatus and motion vector detection method

Info

Publication number: JP5328854B2
Application number: JP2011171130A
Authority: JP
Inventors: 大輔坂本
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2006-08-08
Filing date: 2011-08-04
Publication date: 2013-10-30
Anticipated expiration: 2027-06-26
Also published as: CN100576919C; CN101123728A; JP2011217421A

Abstract

A motion vector detection apparatus is configured to calculate a temporal distance between a frame to be coded and each of a plurality of reference candidate frames referred to by the frame to be coded. The motion vector detection apparatus searches for a candidate motion vector between the frame to be coded and each of the plurality of reference candidate frames and detects a motion vector for the frame to be coded from the candidate motion vectors. In searching for and detecting a candidate motion vector, the motion vector detection apparatus changes the amount of calculation performed during the detection of a candidate motion vector according to the calculated temporal distance between the frame to be coded and the reference candidate frame, and a coding type of the reference candidate frame.

Description

The present invention relates to a motion vector detection apparatus and a motion vector detection method, and is particularly suitable for detecting motion vectors between pictures.

In recent years, the digitization of multimedia information has advanced rapidly, and with it the demand for higher-quality video has grown. For example, broadcast media are shifting from conventional SD (Standard Definition) at 720 × 480 pixels to HD (High Definition) at 1920 × 1080 pixels. This demand for higher image quality, however, also increases the volume of digital data. Compression encoding and decoding techniques that outperform the prior art are therefore required.

In response to these demands, ITU-T SG16 and ISO/IEC JTC1/SC29/WG11 have been standardizing coding schemes that use inter-picture prediction, which exploits the correlation between pictures. Among these schemes, H.264/MPEG-4 Part 10 (AVC) is currently said to achieve the highest coding efficiency. In the following description, this coding scheme is referred to as H.264.

H.264 allows the reference image used for detecting motion vectors between pictures to be selected relatively freely. In addition, H.264 divides the image to be encoded into units of a macroblock or smaller when detecting motion vectors, so that motion vectors can be detected in finer units than before. As a result, the amount of generated code can be reduced further.

As a technique using H.264, Patent Document 1 discloses a configuration that has a plurality of frame memories and selects the image to be referenced when encoding a target image from among a plurality of images stored in those frame memories.

Conventional coding schemes such as MPEG-1, MPEG-2, and MPEG-4 provide, as motion prediction functions, forward prediction, which predicts a future image from a past image, and backward prediction, which predicts a past image from a future image. Here, predicting a past image from a future image means predicting, from the current image, an image whose encoding was skipped. In the following description, conventional coding schemes such as MPEG-1, MPEG-2, and MPEG-4 are collectively referred to as MPEG coding schemes.

In many cases, the closer an image is in time to the image to be encoded, the higher the correlation between the two images is considered to be. Forward and backward prediction in the MPEG coding schemes therefore normally use an I picture or P picture located near the image to be encoded as the reference image.

However, in a video camera equipped with an MPEG codec (encoder/decoder), the change between images can be abrupt, for example when the camera moves quickly during shooting, as in panning or tilting, or in an image immediately after a cut change. In such cases, the correlation between images is low even when they are close in time, so the benefit of motion-compensated prediction cannot be exploited.

The prediction method adopted in H.264 is a technique expected to alleviate this problem. In H.264, predictive coding is performed not only on nearby images but also on images that are distant in time, and a distant image can be used as the reference image if it is expected to give better coding efficiency than a nearby one.

Patent Document 1: Japanese Patent Laid-Open No. 2005-184694 (JP 2005-184694 A)

As described above, in H.264, even when the camera that captured the moving image moves quickly or a cut change occurs, the image that minimizes the error between the input image and an already encoded image can be freely selected as the reference image. This makes it possible to improve the accuracy of motion-compensated prediction.

However, if the calculation for selecting the image with the smallest error relative to the input image is performed on all already encoded images, the amount of calculation increases in proportion to the number of reference candidate images, and encoding takes a long time.

Furthermore, in a mobile device such as a video camera, an increased computational load increases the drain on the battery that powers the device, which shortens the available shooting time.

The present invention has been made in view of these problems, and its object is to suppress the increase in the amount of calculation required to detect a motion vector while improving the detection accuracy of that motion vector.

To achieve the above object, a motion vector detection apparatus according to the present invention is a motion vector detection apparatus that detects a motion vector between pictures, comprising: calculation means for calculating a temporal distance between an encoding target image and each of a plurality of candidate images that are candidates for a reference image referred to by the encoding target image; and motion vector detection means for searching for a motion vector between the encoding target image and each of the plurality of candidate images and detecting a motion vector based on the search results, wherein, when searching for a motion vector between the encoding target image and each of the plurality of candidate images, the motion vector detection means changes the amount of calculation to be performed according to the temporal distance to each candidate image calculated by the calculation means and the value of the peak signal-to-noise ratio (PSNR) of each candidate image.

Likewise, a motion vector detection method according to the present invention is a motion vector detection method for detecting a motion vector between pictures, comprising: a calculation step of calculating a temporal distance between an encoding target image and each of a plurality of candidate images that are candidates for a reference image referred to by the encoding target image; and a motion vector detection step of searching for a motion vector between the encoding target image and each of the plurality of candidate images and detecting a motion vector based on the search results, wherein, when searching for a motion vector between the encoding target image and each of the plurality of candidate images, the amount of calculation performed in the motion vector detection step is changed according to the temporal distance to each candidate image calculated in the calculation step and the value of the peak signal-to-noise ratio (PSNR) of each candidate image.

According to the present invention, an increase in the amount of calculation required to detect a motion vector can be suppressed while the detection accuracy of the motion vector is improved.

A block diagram showing the configuration of a video camera apparatus according to the first embodiment of the present invention.
A block diagram showing the configuration of a signal processing unit according to the first embodiment of the present invention.
A block diagram showing the configuration of a motion vector detector according to the first embodiment of the present invention.
A diagram showing an example of the relationship between an encoding target frame and reference candidate frames according to the first embodiment of the present invention.
A flowchart illustrating an example of the operation of the motion vector detector according to the first embodiment of the present invention.
A block diagram showing the configuration of a motion vector detector according to the second embodiment of the present invention.
A flowchart illustrating an example of the operation of the motion vector detector according to the second embodiment of the present invention.
A diagram showing an example of a reduced image according to the second embodiment of the present invention.
A block diagram showing the configuration of a motion vector detector according to the third embodiment of the present invention.
A flowchart illustrating an example of the operation of the motion vector detector according to the third embodiment of the present invention.
A diagram showing an example of the relationship between an encoding target frame and reference candidate frames according to the third embodiment of the present invention.
A block diagram showing the configuration of a signal processing unit according to the fourth embodiment of the present invention.
A block diagram showing the configuration of a motion vector detector according to the fourth embodiment of the present invention.
A flowchart illustrating an example of the operation of the motion vector detector according to the fourth embodiment of the present invention.

(First embodiment)
A first embodiment of the present invention is described below with reference to the drawings.

FIG. 1 is a diagram illustrating an example of the configuration of a video camera apparatus 10 according to the present embodiment. The present embodiment is described using, as an example, a video camera apparatus 10 that performs encoding in accordance with H.264.

In FIG. 1, an imaging unit 11 generates digital captured image data using, for example, an optical system including a lens, a photoelectric conversion element, and an A/D conversion circuit, and stores the generated captured image data in an image memory (not shown). A signal processing unit 12 performs various kinds of signal processing and conversion on the captured image data read from the image memory so that the data takes the desired format for display, recording, and so on. Details of the signal processing unit 12 are described later with reference to FIG. 2.

A recording unit 13 records image data and the like on a recording medium and reads image data and the like from the recording medium. A semiconductor memory, for example, can be used as the recording medium.

A system control unit 14 controls the entire video camera apparatus 10 and performs various calculations. The system control unit 14 includes, for example, a CPU, a ROM, and a RAM; the CPU executes a program stored in the ROM, using the RAM, to control the entire video camera apparatus 10 and perform the various calculations.

A display unit 15 receives signals processed by the signal processing unit 12 and signals calculated by the system control unit 14, and displays an image based on the received signals on a display device such as an LCD (Liquid Crystal Display).

An operation unit 16 has various switches with which the user gives operating instructions to the video camera apparatus 10, such as a main switch for turning the power of the video camera apparatus 10 on and off and a switch for instructing the start and end of shooting (recording).

Signals are exchanged among the signal processing unit 12, the recording unit 13, the system control unit 14, the display unit 15, and the operation unit 16 via a bus 17.

FIG. 2 is a block diagram showing the configuration of the signal processing unit 12 according to the present embodiment. In the following description, the encoding target image and the reference images (reference candidate images) used for prediction are each described taking a frame image (referred to simply as a frame) as an example, and an example of detecting motion vectors between pictures, that is, between frames, is described.

In FIG. 2, a selector 12a selects the output destination of the captured image data read from the image memory in the imaging unit 11 according to the coding mode, that is, intra-frame coding or inter-frame coding. An intra predictor 12b receives the captured image data from the selector 12a and performs intra prediction processing according to the H.264 coding scheme on that data.

A subtractor 12c subtracts the predicted image data output from a motion compensator 12l from the captured image data output from the selector 12a to calculate motion prediction error data. A transformer 12d applies an orthogonal transform to the data output from the subtractor 12c or the intra predictor 12b and outputs the resulting orthogonal transform coefficients to a quantizer 12e. The quantizer 12e quantizes the orthogonal transform coefficients and outputs all of the quantized coefficients to a scan processor 12f and an inverse quantizer 12h.

The scan processor 12f performs scan processing, such as a zigzag scan, on the quantized orthogonal transform coefficients according to the coding mode. An entropy encoder 12g entropy-encodes the output of the scan processor 12f and outputs the entropy-encoded data (encoded data) to a bus interface (I/F) 12n. The encoded data output to the bus I/F 12n is supplied to the recording unit 13 via the bus 17 and recorded by the recording unit 13. The encoded data recorded by the recording unit 13 may further be moved to and recorded on a recording medium such as a hard disk or an optical disc.
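The zigzag scan performed by the scan processor 12f can be illustrated with a small sketch. The function names `zigzag_order` and `zigzag_scan` and the 4 × 4 block size are assumptions chosen for illustration; the text does not specify the block size or the exact scan table that is used.

```python
def zigzag_order(n):
    """Return the (row, col) positions of an n x n block in zigzag order."""
    order = []
    for s in range(2 * n - 1):          # s = row + col indexes each anti-diagonal
        rows = list(range(max(0, s - n + 1), min(s, n - 1) + 1))
        if s % 2 == 0:                  # even diagonals run bottom-left to top-right
            rows.reverse()
        order.extend((r, s - r) for r in rows)
    return order


def zigzag_scan(block):
    """Flatten a square block of quantized coefficients in zigzag order."""
    return [block[r][c] for r, c in zigzag_order(len(block))]


# Illustrative 4 x 4 block whose entries are their own raster indices.
block = [[4 * r + c for c in range(4)] for r in range(4)]
print(zigzag_scan(block))
```

Scanning in this order places the low-frequency coefficients first, so the zeros that quantization tends to produce at high frequencies are grouped at the end of the sequence passed to the entropy encoder.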

The inverse quantizer 12h receives the quantized orthogonal transform coefficients from the quantizer 12e and inversely quantizes them. An inverse transformer 12i applies an inverse orthogonal transform to the coefficients inversely quantized by the inverse quantizer 12h, thereby decoding the motion prediction error data obtained by the subtractor 12c. An adder 12j adds the prediction error data output from the inverse transformer 12i and the predicted image data output from the motion compensator 12l to generate a restored image (locally decoded image). The restored image data is output to the bus I/F 12n and recorded, frame by frame, as reference image data in a frame memory provided in the recording unit 13. In the following description, the reference image data recorded in this frame memory is referred to as a reference candidate frame where appropriate.
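The local decoding loop formed by the subtractor 12c, the quantizer 12e, the inverse quantizer 12h, and the adder 12j can be sketched roughly as follows. This is a simplified illustration and not the H.264 procedure: the orthogonal transform and its inverse are omitted, a uniform scalar quantizer with a hypothetical step size `qstep` stands in for the quantizer, and the sample values are invented.

```python
def encode_and_reconstruct(current, predicted, qstep):
    """Quantize the prediction error and rebuild the locally decoded samples.

    The transform stage is deliberately left out (assumption for brevity).
    """
    residual = [c - p for c, p in zip(current, predicted)]           # subtractor
    levels = [round(r / qstep) for r in residual]                    # quantizer
    dequantized = [lv * qstep for lv in levels]                      # inverse quantizer
    reconstructed = [p + d for p, d in zip(predicted, dequantized)]  # adder
    return levels, reconstructed


current = [101, 105, 97, 90]    # illustrative input samples
predicted = [98, 98, 98, 98]    # illustrative motion-compensated prediction
levels, reconstructed = encode_and_reconstruct(current, predicted, qstep=4)
print(levels)          # [1, 2, 0, -2]
print(reconstructed)   # [102, 106, 98, 90]
```

The reconstructed samples differ slightly from the input because of quantization. Storing them, rather than the original input, as the reference keeps the encoder's reference frames identical to what a decoder can rebuild.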

A motion vector detector 12k calculates an optimal motion vector based on the encoding target frame and a plurality of reference candidate frames. The motion vector detector 12k of the present embodiment receives, from the system control unit 14 via the bus I/F 12n, the number of the frame to be encoded and the numbers of the reference candidate frames, and uses these numbers to determine the search accuracy. Details of the motion vector detector 12k are described later with reference to FIG. 3 and other figures.

The motion compensator 12l generates predicted image data using the motion vector calculated by the motion vector detector 12k and the reference candidate frame with the smallest prediction error. A motion encoder 12m encodes the motion vector calculated by the motion vector detector 12k and outputs it to the bus I/F 12n. The encoded motion vector output to the bus I/F 12n is recorded in the recording unit 13 in association with the encoded data.

In the present embodiment, any of forward prediction, which refers to past frames, bidirectional prediction, which refers to past and future frames, and backward prediction, which refers to future frames, can be used. Components other than those shown in FIG. 2 may also be provided in the signal processing unit 12.

Next, the motion vector detector 12k is described in detail. FIG. 3 is a block diagram showing the configuration of the motion vector detector 12k according to the present embodiment. The motion vector detector 12k of FIG. 3 is suitable for use in an encoding apparatus for H.264 or a related coding scheme, and such an encoding apparatus is in turn suitable for use in a video camera apparatus or the like.

In FIG. 3, the motion vector detector 12k comprises an encoding target frame storage unit 100, a reference candidate frame storage unit 101, a search accuracy determination unit 102, and a motion vector calculation unit 103.

The encoding target frame storage unit 100 stores an encoding target frame 300, which is the frame used in the motion vector search and about to be encoded. The reference candidate frame storage unit 101 stores a plurality of reference candidate frames 301.

FIG. 4 is a diagram conceptually showing an example of the relationship between the encoding target frame and the reference candidate frames.

In the example of FIG. 4, there are three selectable reference candidate frames 301a to 301c as candidates for the reference frame of the encoding target frame 300. In terms of frame numbers, the example shows the case where the reference candidate frame numbers 303 are "0" to "2" and the encoding target frame number 302 is "3". The values 3t, 2t, and t shown in FIG. 4 represent the temporal distances (time intervals) from the encoding target frame 300 to the reference candidate frames 301a, 301b, and 301c, respectively.

FIG. 5 is a flowchart illustrating an example of the operation of the motion vector detector 12k according to the present embodiment. An example of the operation of the motion vector detector 12k shown in FIG. 3 is described below with reference to this flowchart.

The search accuracy determination unit 102 waits until the encoding target frame number 302 and the reference candidate frame numbers 303 are specified by the system control unit 14 via the bus I/F 12n (step S101).

When the encoding target frame number 302 and the reference candidate frame numbers 303 are specified in step S101, the search accuracy determination unit 102 calculates the temporal distance between the encoding target frame 300 and each of the reference candidate frames 301a to 301c (step S102).
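Step S102 can be sketched as a small calculation on frame numbers. The function name `temporal_distances` and the parameter `frame_interval` (the interval t between consecutive frames) are assumptions made for illustration; the text states only that the distances are derived from the specified frame numbers.

```python
def temporal_distances(target_frame_no, candidate_frame_nos, frame_interval=1):
    """Map each candidate frame number to its temporal distance from the target.

    Distances are expressed in units of frame_interval (t in FIG. 4).
    """
    return {
        n: abs(target_frame_no - n) * frame_interval
        for n in candidate_frame_nos
    }


# Frame numbers from the FIG. 4 example: target "3", candidates "0" to "2".
distances = temporal_distances(3, [0, 1, 2])
print(distances)  # {0: 3, 1: 2, 2: 1}, i.e. 3t, 2t and t
```

Taking the absolute difference lets the same helper cover candidates that lie after the target frame, as in backward prediction.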

The search accuracy determination unit 102 then changes the motion vector search accuracy according to the temporal distance between the encoding target frame 300 and each of the reference candidate frames 301a to 301c (step S103).

Specifically, in the example shown in FIG. 4, the temporal distance between the reference candidate frame 301c, whose reference candidate frame number 303 is "2", and the encoding target frame 300, whose encoding target frame number 302 is "3", is as short as t, so the probability of finding the optimal motion vector is high. The search accuracy determination unit 102 therefore instructs the motion vector calculation unit 103 to perform a fine search with, for example, one-pixel accuracy in both the vertical and horizontal directions.

In contrast, the temporal distance between the reference candidate frame 301a, whose reference candidate frame number 303 is "0", and the encoding target frame 300, whose encoding target frame number 302 is "3", is 3t, which is far. The probability of finding the optimal motion vector in the reference candidate frame 301a is therefore considered lower than in the reference candidate frame 301c. The search accuracy determination unit 102 accordingly instructs the motion vector calculation unit 103 to perform a coarse search that, for example, skips every other pixel in both the vertical and horizontal directions. The search accuracy determination unit 102 also instructs the motion vector calculation unit 103 to use, for example, the same search range as for the reference candidate frame 301c.

Thus, when a motion vector is searched for in the reference candidate frame 301a, the search range is the same as for the reference candidate frame 301c, while the search accuracy is halved in both the vertical and horizontal directions relative to the search in the reference candidate frame 301c. The search in the reference candidate frame 301a therefore requires only 1/4 of the calculation needed for the search in the reference candidate frame 301c.

The temporal distance between the reference candidate frame 301b, whose reference candidate frame number 303 is "1", and the encoding target frame 300, whose encoding target frame number 302 is "3", is 2t. This distance is somewhat large, although not as large as the distance between the reference candidate frame 301a and the encoding target frame 300. The probability of finding the optimal motion vector in the reference candidate frame 301b is therefore considered higher than in the reference candidate frame 301a but lower than in the reference candidate frame 301c.

The search accuracy determination unit 102 therefore instructs the motion vector calculation unit 103 to use, for example, a vertical search accuracy of every pixel and a horizontal search accuracy of every other pixel. The search accuracy determination unit 102 also instructs the motion vector calculation unit 103 to use, for example, the same search range as for the reference candidate frame 301c.

Thus, when the reference candidate frame 301b is searched, the motion vector search range and the vertical search accuracy are the same as for the reference candidate frame 301c, while the horizontal search accuracy is half that used for the reference candidate frame 301c. The search in the reference candidate frame 301b therefore requires only 1/2 of the calculation needed for the search in the reference candidate frame 301c.
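The relationship between temporal distance, search accuracy, and calculation amount in this example can be summarized in a short sketch. The table `STEP_BY_DISTANCE`, the function names, and the fallback for distances beyond 3t are hypothetical; only the three step pairs for t, 2t, and 3t are taken from the example above.

```python
from fractions import Fraction

# Temporal distance (in units of t) -> (horizontal step g, vertical step h).
# A step of 1 means every pixel; a step of 2 means every other pixel.
STEP_BY_DISTANCE = {
    1: (1, 1),  # frame 301c: fine search in both directions
    2: (2, 1),  # frame 301b: horizontal accuracy halved
    3: (2, 2),  # frame 301a: accuracy halved in both directions
}


def search_steps(distance):
    """Return (g, h) for a distance; larger distances reuse the coarsest
    setting (an assumption, not stated in the text)."""
    return STEP_BY_DISTANCE.get(distance, (2, 2))


def relative_cost(distance):
    """Calculation amount relative to a full one-pixel search over the same range."""
    g, h = search_steps(distance)
    return Fraction(1, g * h)


for d in (1, 2, 3):
    print(d, search_steps(d), relative_cost(d))
# 1 (1, 1) 1
# 2 (2, 1) 1/2
# 3 (2, 2) 1/4
```

Under these assumptions, searching all three candidates costs 1 + 1/2 + 1/4 = 7/4 of a single full search instead of 3, because the search range is held fixed and only the sampling step changes.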

When the search accuracy determination unit 102 has determined the motion vector search accuracy as described above, the motion vector calculation unit 103 determines the motion vector 304 (step S104). Specifically, for each macroblock included in the encoding target frame 300 stored in the encoding target frame storage unit 100, the motion vector calculation unit 103 searches within the reference candidate frame 301 stored in the reference candidate frame storage unit 101 and estimates a motion vector. Here, for example, if the motion vector of a macroblock of size N × N (N is a natural number) is searched over a range that extends ±p pixels beyond the macroblock (p is a natural number), the search range is given by expression (1) below.
Search range = (N + 2p) × (N + 2p) ... (1)
The motion vector calculation unit 103 calculates a correlation coefficient at each of the (2p + 1)² positions that can be motion vector candidates, and then determines the position showing the maximum degree of correlation as the motion vector.
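Expression (1) and the candidate count can be checked with a short sketch. The function names `search_window_size` and `candidate_offsets` are hypothetical and are introduced here only for illustration; they do not appear in the embodiment.

```python
def search_window_size(n, p):
    """Size of the search range of expression (1): (N + 2p) x (N + 2p)."""
    return (n + 2 * p, n + 2 * p)


def candidate_offsets(p):
    """All (dx, dy) displacements inside a +/-p pixel window, one per pixel."""
    return [(dx, dy) for dy in range(-p, p + 1) for dx in range(-p, p + 1)]


# Illustrative values only: a 16 x 16 macroblock searched over +/-8 pixels.
window = search_window_size(16, 8)
count = len(candidate_offsets(8))
```

With these illustrative values the window is 32 × 32 pixels and the exhaustive search visits (2 × 8 + 1)² = 289 candidate positions.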

最大相関度を有する動きベクトルを推定するために、動きベクトル演算部１０３は、例えば、ＭＳＥ（ＭｅａｎＳｑｕａｒｅＥｒｒｏｒ）、ＭＡＥ（ＭｅａｎＡｂｓｏｌｕｔｅＥｒｒｏｒ）、又はＭＡＤ（ＭｅａｎＡｂｓｏｌｕｔｅＤｉｆｆｅｒｅｎｃｅ）等の評価関数を用いる。例えば、ＭＳＥは以下の（２）式で表され、ＭＡＥは以下の（３）式で表される。 In order to estimate the motion vector having the maximum degree of correlation, the motion vector calculation unit 103 uses an evaluation function such as MSE (Mean Square Error), MAE (Mean Absolute Error), or MAD (Mean Absolute Difference). For example, MSE is represented by the following formula (2), and MAE is represented by the following formula (3).

Here, Sref denotes the reference frame, and Scur,k denotes the k-th macroblock in the current frame for which the motion vector is being searched. (i, j) denotes the spatial position in the reference frame with respect to the k-th macroblock of the current frame for which the motion vector is being searched.

X and Y are the numbers of horizontal and vertical pixels in the motion vector search range. g and h are coefficients that express the horizontal and vertical search accuracy specified by the search accuracy determination unit 102, that is, the interval in pixels at which the calculation is performed.

x and y are given by expressions (4) and (5) below, respectively.
x = g × u ... (4)
y = h × v ... (5)
Furthermore, x, y, g, and h are natural numbers that satisfy expressions (6) to (9) below.
0 ≤ x ≤ X ... (6)
1 ≤ g ≤ X ... (7)
0 ≤ y ≤ Y ... (8)
1 ≤ h ≤ Y ... (9)
U and V are given by expressions (10) and (11) below, respectively.
U = X − |i| ... (10)
V = Y − |j| ... (11)
The evaluation functions of expressions (2) and (3) are based on pixel differences. Accordingly, the motion vector calculation unit 103 selects the motion vector that yields the smallest MAE or MSE value (the LRV value) as the final motion vector for the current macroblock. Functions other than those shown in FIG. 3 may also be provided in the motion vector detector 12k.
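The selection of the minimum-cost motion vector can be illustrated with a minimal block-matching sketch. It is not the implementation of the motion vector calculation unit 103. It assumes that frames are lists of rows of pixel values, that MAE is the plain mean of absolute pixel differences over the block, and that g and h act as the spacing between tested displacements, which is only one possible reading of the text. The names `mae` and `search_motion_vector` are hypothetical.

```python
def mae(cur, ref, bx, by, dx, dy, n):
    """Mean absolute error between the n x n block at (bx, by) in cur
    and the block displaced by (dx, dy) in ref."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total / (n * n)


def search_motion_vector(cur, ref, bx, by, n, p, g=1, h=1):
    """Test displacements within +/-p pixels, stepping g horizontally and
    h vertically (assumed meaning of g and h), and keep the smallest MAE.
    Returns (best displacement, its cost, number of candidates evaluated)."""
    height, width = len(ref), len(ref[0])
    best, best_cost, evaluated = None, float("inf"), 0
    for dy in range(-p, p + 1, h):
        for dx in range(-p, p + 1, g):
            # Skip candidates whose block would leave the reference frame.
            inside_x = 0 <= bx + dx and bx + dx + n <= width
            inside_y = 0 <= by + dy and by + dy + n <= height
            if not (inside_x and inside_y):
                continue
            cost = mae(cur, ref, bx, by, dx, dy, n)
            evaluated += 1
            if cost < best_cost:
                best, best_cost = (dx, dy), cost
    return best, best_cost, evaluated
```

Under these assumptions, setting g = 2 roughly halves the number of evaluated candidates and setting g = h = 2 roughly quarters it, which matches the reductions in computation described for the reference candidate frames 301b and 301a.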

As described above, in the present embodiment the motion vector search accuracy is changed according to the temporal distance between the encoding target frame 300 and each of the reference candidate frames 301a to 301c. For example, the temporal distance between the encoding target frame 300 and the reference candidate frame 301c is "t", which is short, so the probability of finding an optimal motion vector is high. The search accuracy determination unit 102 therefore specifies, for example, a detailed search with one-pixel accuracy both vertically and horizontally. In contrast, the temporal distance between the encoding target frame 300 and the reference candidate frame 301a is "3t", which is long, so the probability of finding an optimal motion vector is low. The search accuracy determination unit 102 therefore specifies, for example, the same search range as for the reference candidate frame 301c together with a coarse search that skips every other pixel both vertically and horizontally.
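The relationship between temporal distance and search accuracy in this example can be summarized in a small sketch. The function name `search_step` is hypothetical, the distance is expressed as a multiple of t, and treating every distance of 3t or more like the reference candidate frame 301a is an assumption made here for illustration.

```python
def search_step(distance_in_t):
    """Return (horizontal_step, vertical_step) in pixels for a temporal
    distance given as a multiple of t. A step of 2 means that every
    other pixel is skipped."""
    if distance_in_t <= 1:
        return (1, 1)  # frame 301c: every pixel in both directions
    if distance_in_t == 2:
        return (2, 1)  # frame 301b: skip every other pixel horizontally
    return (2, 2)      # frame 301a (assumed also for longer distances)


def relative_cost(step):
    """Computation relative to the full one-pixel search."""
    return 1 / (step[0] * step[1])
```

With this mapping the relative computation is 1 for 301c, 1/2 for 301b, and 1/4 for 301a.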

As described above, when images that are separated in time are encoded, the motion vector search accuracy is changed according to the probability of finding an optimal motion vector. Including temporally distant images in the motion vector search improves the motion vector detection accuracy, while varying the search accuracy reduces the amount of computation needed to detect a motion vector. This makes it possible, for example, to prevent as far as possible a situation in which battery consumption increases and the available shooting time becomes shorter.

In the present embodiment, three search accuracies selected according to the temporal distance were given as examples: every pixel in both the vertical and horizontal directions, every other pixel in both the vertical and horizontal directions, and every pixel vertically with every other pixel horizontally. The search accuracy is not limited to these, however. For example, two pixels may be skipped in both the vertical and horizontal directions. In addition, although the present embodiment was described using three reference candidate frames as an example, the number of reference candidate frames is not limited to three and may, for example, be increased. When the number of reference candidate frames is increased, the search accuracy can be varied in further steps.

(Second Embodiment)
Next, a second embodiment of the present invention will be described. In the first embodiment described above, the motion vector search accuracy is changed according to the temporal distance between the encoding target frame 300 and the reference candidate frames 301a to 301c. In the present embodiment, by contrast, the reduction ratios of the encoding target frame 300 and of the reference candidate frames 301a to 301c are changed according to that temporal distance. The present embodiment thus differs from the first embodiment mainly in how the encoding target frame 300 and the reference candidate frames 301a to 301c are processed. In the following description, parts identical to those of the first embodiment are therefore given the same reference numerals as in FIGS. 1 to 5, and their detailed description is omitted.

FIG. 6 is a block diagram showing an example of the configuration of the motion vector detector 12k according to the present embodiment. FIG. 7 is a flowchart illustrating an example of the operation of the motion vector detector 12k according to the present embodiment. The operation of the motion vector detector 12k shown in FIG. 6 will be described with reference to FIGS. 4 and 7.

The reduction rate determination unit 402 waits until the encoding target frame number 302 and the reference candidate frame number 303 are specified by the system control unit 14 via the bus I/F 12n (step S401). When the encoding target frame number 302 and the reference candidate frame number 303 are specified in step S401, the reduction rate determination unit 402 calculates the temporal distance between the encoding target frame 300 and each of the reference candidate frames 301a to 301c (step S402).

そして、縮小率決定部４０２は、符号化対象フレーム３００と参照候補フレーム３０１ａ〜３０１ｃとの間の時間的な距離に応じて、符号化対象フレーム３００の画像と、参照候補フレーム３０１ａ〜３０１ｃの画像の縮小率を決定する（ステップＳ４０３）。 Then, the reduction rate determination unit 402 determines the image of the encoding target frame 300 and the images of the reference candidate frames 301a to 301c according to the temporal distance between the encoding target frame 300 and the reference candidate frames 301a to 301c. The reduction ratio is determined (step S403).

縮小率決定部４０２は、符号化対象フレーム３００と参照候補フレーム３０１ａ〜３０１ｃとの間の時間的な距離が短いほど縮小率を小さくし、符号化対象フレーム３００と参照候補フレーム３０１ａ〜３０１ｃとの間の時間的な距離が長いほど縮小率を大きくする。 The reduction rate determination unit 402 decreases the reduction rate as the temporal distance between the encoding target frame 300 and the reference candidate frames 301a to 301c is shorter, and the encoding target frame 300 and the reference candidate frames 301a to 301c are reduced. The reduction ratio is increased as the time distance between them is longer.

Specifically, in the example shown in FIG. 4, the temporal distance between the reference candidate frame 301c, whose reference candidate frame number 303 is "2", and the encoding target frame 300, whose encoding target frame number 302 is "3", is "t", which is short, so the probability of finding an optimal motion vector is high. The reduction rate determination unit 402 therefore instructs the motion vector calculation unit 407 to search for a motion vector using the image of the encoding target frame 300 and the images of the reference candidate frames 301a to 301c as they are, without reducing them.

In contrast, the temporal distance between the reference candidate frame 301a, whose reference candidate frame number 303 is "0", and the encoding target frame 300, whose encoding target frame number 302 is "3", is "3t", which is long. The probability of finding an optimal motion vector in the reference candidate frame 301a is therefore considered lower than in the reference candidate frame 301c. Accordingly, the reduction rate determination unit 402 instructs the reduced frame creation unit 404 to reduce, for example, the image of the encoding target frame 300 and the images of the reference candidate frames 301a to 301c to 1/2 in both the vertical and horizontal directions. The reduced frame creation unit 404 is an image reduction means.

また、縮小率決定部４０２は、例えば、参照候補フレーム３０１ｃのときと探索範囲を同じにするように動きベクトル演算部４０７に指示を出す。従って、縦方向及び横方向の画素数が共に１／２倍になる。このため、参照候補フレーム３０１ａで動きベクトルを探索する場合には、参照候補フレーム３０１ｃで動きベクトルを探索する場合に比べ、演算量を１／４に削減することが出来る。 In addition, the reduction rate determination unit 402 instructs the motion vector calculation unit 407 so that the search range is the same as that of the reference candidate frame 301c, for example. Accordingly, the number of pixels in the vertical direction and the horizontal direction are both halved. Therefore, when searching for a motion vector in the reference candidate frame 301a, the amount of calculation can be reduced to ¼ compared to searching for a motion vector in the reference candidate frame 301c.

そこで、縮小率決定部４０２は、例えば、符号化対象フレーム３００の画像と参照候補フレーム３０１ａ〜３０１ｃの画像とを、縦方向では縮小せず、横方向では１／２倍に縮小するように縮小フレーム作成部４０４に指示を出す。また、縮小率決定部４０２は、例えば、参照候補フレーム３０１ｃのときと探索範囲を同じにするように動きベクトル演算部４０７に指示を出す。従って、横方向の画素数が１／２倍になる。このため、参照候補フレーム３０１ｂで動きベクトルを探索した場合には、参照候補フレーム３０１ｃで動きベクトルを探索した場合に比べ、演算量を１／２に削減することが出来る。 Therefore, for example, the reduction rate determination unit 402 reduces the image of the encoding target frame 300 and the images of the reference candidate frames 301a to 301c so that the image is not reduced in the vertical direction but reduced to 1/2 times in the horizontal direction. An instruction is issued to the frame creation unit 404. In addition, the reduction rate determination unit 402 instructs the motion vector calculation unit 407 so that the search range is the same as that of the reference candidate frame 301c, for example. Therefore, the number of pixels in the horizontal direction is halved. For this reason, when a motion vector is searched for in the reference candidate frame 301b, the amount of calculation can be reduced to ½ compared to a case where a motion vector is searched for in the reference candidate frame 301c.

以上のようにして、符号化対象フレーム３００の画像と、参照候補フレーム３０１ａ〜３０１ｃの画像の縮小率が決定されると、ステップＳ４０４に進む。そして、縮小フレーム作成部４０４は、縮小率決定部４０２によって決定された縮小率に従い、符号化対象フレーム３００の画像と、参照候補フレーム３０１ａ〜３０１ｃの各画像を縮小処理して、複数の縮小画像を生成する（ステップＳ４０４）。 As described above, when the reduction ratios of the image of the encoding target frame 300 and the reference candidate frames 301a to 301c are determined, the process proceeds to step S404. Then, the reduced frame creation unit 404 performs a reduction process on the image of the encoding target frame 300 and each of the reference candidate frames 301a to 301c according to the reduction rate determined by the reduction rate determination unit 402, and a plurality of reduced images. Is generated (step S404).

図８は、本実施形態に係る符号化対象フレーム３００の縮小画像と、参照候補フレーム３０１ａ〜３０１ｃの縮小画像の一例を示す図である。 FIG. 8 is a diagram illustrating an example of a reduced image of the encoding target frame 300 and reduced images of the reference candidate frames 301a to 301c according to the present embodiment.

例えば、図８に示す原画像８０１を、縦横共に１／２倍に縮小して縮小画像８０２ａを生成する場合には、隣接する縦・横各々の２画素の画素値を足して４で割ることで縮小画像８０２ａを生成する。例えば、縮小画像８０２ａにおける画素Ａ’は、原画像８０１の画素Ａ、Ｂ、Ｅ、Ｆの画素値を足して４で割る（（Ａ＋Ｂ＋Ｅ＋Ｆ）／４）ことで生成することが出来る。また、縮小画像８０２ａにおける画素Ｂ’は、原画像８０１の画素Ｃ、Ｄ、Ｇ、Ｈの画素値を足して４で割る（（Ｃ＋Ｄ＋Ｇ＋Ｈ）／４）ことで生成することが出来る。 For example, when the reduced image 802a is generated by reducing the original image 801 shown in FIG. 8 by a factor of 1/2 both vertically and horizontally, the pixel values of two adjacent vertical and horizontal pixels are added and divided by 4. A reduced image 802a is generated. For example, the pixel A ′ in the reduced image 802 a can be generated by adding the pixel values of the pixels A, B, E, and F of the original image 801 and dividing the result by 4 ((A + B + E + F) / 4). In addition, the pixel B ′ in the reduced image 802 a can be generated by adding the pixel values of the pixels C, D, G, and H of the original image 801 and dividing the result by 4 ((C + D + G + H) / 4).

一方、図８に示す原画像８０１を、横方向だけ１／２倍に縮小して縮小画像８０２ｂを生成する場合には、横方向に隣接する２画素の画素値を足して２で割ることで縮小画像８０２ｂを生成する。例えば、縮小画像８０２ｂにおける画素Ａ”は、原画像８０１の画素Ａ、Ｂの画素値を足して２で割る（（Ａ＋Ｂ）／２）ことで生成することが出来る。また、縮小画像８０２ｂにおける画素Ｂ”は、原画像８０１の画素Ｃ、Ｄの画素値を足して２で割る（（Ｃ＋Ｄ）／２）ことで生成することが出来る。 On the other hand, when the reduced image 802b is generated by reducing the original image 801 shown in FIG. 8 by ½ times in the horizontal direction, the pixel values of two pixels adjacent in the horizontal direction are added and divided by two. A reduced image 802b is generated. For example, the pixel A ″ in the reduced image 802b can be generated by adding the pixel values of the pixels A and B in the original image 801 and dividing by 2 ((A + B) / 2). Also, the pixel in the reduced image 802b. B ″ can be generated by adding the pixel values of the pixels C and D of the original image 801 and dividing by 2 ((C + D) / 2).
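The two averaging reductions described for FIG. 8 can be sketched as follows. The sketch assumes that images are lists of rows with even width and height, that the original image 801 has the pixels A, B, C, D in its first row and E, F, G, H in its second row, and that no rounding is applied (the text does not specify rounding). The function names are hypothetical.

```python
def shrink_half_both(img):
    """Reduce to 1/2 vertically and horizontally by averaging each
    2 x 2 group, for example A' = (A + B + E + F) / 4."""
    return [
        [
            (img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4
            for x in range(0, len(img[0]) - 1, 2)
        ]
        for y in range(0, len(img) - 1, 2)
    ]


def shrink_half_horizontal(img):
    """Reduce to 1/2 horizontally only by averaging each horizontal
    pair, for example A'' = (A + B) / 2."""
    return [
        [(row[x] + row[x + 1]) / 2 for x in range(0, len(row) - 1, 2)]
        for row in img
    ]


# Illustrative pixel values standing in for A..D (first row), E..H (second row).
original = [[1, 3, 5, 7],
            [9, 11, 13, 15]]
reduced_both = shrink_half_both(original)
reduced_horizontal = shrink_half_horizontal(original)
```

For these illustrative values, `reduced_both` holds one row with A' = 6.0 and B' = 10.0, and `reduced_horizontal` keeps both rows with two averaged pixels each.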

以上のようにして、縮小フレーム作成部４０４によって、符号化対象フレーム３００の縮小画像と、参照候補フレーム３０１ａ〜３０１ｃの縮小画像とが作成されると、符号化対象フレーム３００の縮小画像は、縮小符号化対象フレーム保存部４０５に保存される。更に、参照候補フレーム３０１ａ〜３０１ｃの縮小画像は、縮小参照候補フレーム保存部４０６に保存される。 As described above, when the reduced frame creation unit 404 creates the reduced image of the encoding target frame 300 and the reduced images of the reference candidate frames 301a to 301c, the reduced image of the encoding target frame 300 is reduced. It is stored in the encoding target frame storage unit 405. Further, the reduced images of the reference candidate frames 301a to 301c are stored in the reduced reference candidate frame storage unit 406.

次に、動きベクトル演算部４０７は、動きベクトル４０８を決定する（ステップＳ４０５）。ここで、動きベクトル演算部４０７は、画像の縮小を行わないことが縮小率決定部４０２から指示された場合には、縮小画像を用いずに、第１の実施形態の動きベクトル演算部１０３と同じ動作を行う。 Next, the motion vector calculation unit 407 determines a motion vector 408 (step S405). Here, when the reduction rate determination unit 402 instructs the motion vector calculation unit 407 not to reduce the image, the motion vector calculation unit 407 uses the motion vector calculation unit 103 of the first embodiment without using the reduced image. Perform the same operation.

一方、画像の縮小を行うことが縮小率決定部４０２から指示された場合、動きベクトル演算部４０７は、まず、縮小符号化対象フレーム保存部４０５から、符号化対象フレーム３００の縮小画像のマクロブロックを読み出す。次に、動きベクトル演算部４０７は、読み出したマクロブロックに対し、縮小参照候補フレーム保存部４０６から読み出した参照候補フレーム３０１ａ〜３０１ｃの縮小画像の範囲内で動きベクトルの探索を行い、その結果に基づいて動きベクトルを推定する。なお、最大相関度を有する動きベクトルを推定する方法は第１の実施形態で説明した方法と同じである。 On the other hand, when the reduction ratio determining unit 402 instructs to reduce the image, the motion vector calculation unit 407 first receives a macroblock of the reduced image of the encoding target frame 300 from the reduction encoding target frame storage unit 405. Is read. Next, the motion vector calculation unit 407 searches for the motion vector within the range of the reduced images of the reference candidate frames 301a to 301c read from the reduced reference candidate frame storage unit 406 for the read macroblock. Based on this, a motion vector is estimated. The method for estimating the motion vector having the maximum correlation is the same as the method described in the first embodiment.

なお、図６に示した以外の機能が、動きベクトル検出器に設けられていてもよい。 It should be noted that functions other than those shown in FIG. 6 may be provided in the motion vector detector.

以上のように本実施形態では、符号化対象フレーム３００と、参照候補フレーム３０１ａ〜３０１ｃとの時間的な距離に応じて、符号化対象フレーム３００の縮小率と、参照候補フレーム３０１ａ〜３０１ｃの縮小率とを変化させるようにした。例えば、符号化対象フレーム３００と、参照候補フレーム３０１ｃとの間の時間的な距離は「ｔ」であり短いので、符号化対象フレーム３００の画像と、参照候補フレーム３０１ｃの画像とを縮小しない。一方、符号化対象フレーム３００と、参照候補フレーム３０１ａとの間の時間的な距離は「３ｔ」であり遠く離れているので、最適な動きベクトルが見つかる確率が低い。従って、符号化対象フレーム３００の画像と、参照候補フレーム３０１ｃの画像とを縦横共に１／２倍に縮小した縮小画像を生成し、生成した縮小画像を用いて動きベクトルを決定する。 As described above, in the present embodiment, the reduction rate of the encoding target frame 300 and the reduction of the reference candidate frames 301a to 301c according to the temporal distance between the encoding target frame 300 and the reference candidate frames 301a to 301c. The rate was changed. For example, since the temporal distance between the encoding target frame 300 and the reference candidate frame 301c is “t”, which is short, the image of the encoding target frame 300 and the image of the reference candidate frame 301c are not reduced. On the other hand, since the temporal distance between the encoding target frame 300 and the reference candidate frame 301a is “3t”, which is far away, the probability of finding an optimal motion vector is low. Therefore, a reduced image is generated by reducing the image of the encoding target frame 300 and the image of the reference candidate frame 301c by a factor of 1/2 both vertically and horizontally, and a motion vector is determined using the generated reduced image.

以上のように、時間的に離れた画像に対して符号化を行う際に、最適な動きベクトルが見つかる確率に応じて、符号化対象フレーム３００の縮小率と、参照候補フレーム３０１ａ〜３０１ｃの縮小率とを変化させる。従って、第１の実施形態と同様に、動きベクトルの検出精度を向上させられるとともに、動きベクトルを検出する際の演算量の増大とを可及的に防止することができる。これにより、例えば、駆動するバッテリーの消費量が増大して撮影時間が短くなってしまうことを可及的に防止することができる。 As described above, when encoding an image that is temporally separated, the reduction rate of the encoding target frame 300 and the reduction of the reference candidate frames 301a to 301c are determined according to the probability of finding the optimal motion vector. Change the rate. Therefore, as in the first embodiment, the accuracy of motion vector detection can be improved, and an increase in the amount of computation when detecting a motion vector can be prevented as much as possible. Thereby, for example, it is possible to prevent as much as possible that the consumption of the battery to be driven increases and the photographing time is shortened.

なお、本実施形態では、時間的な距離に応じて、符号化対象フレーム３００の画像と、参照候補フレーム３０１ａ〜３０１ｃの画像とを、縦横共に１／２倍に縮小する場合と、縦を縮小せずに横方向のみを１／２倍に縮小する場合を例に挙げて示した。しかしながら、符号化対象フレーム３００の画像の縮小率と、参照候補フレーム３０１ａ〜３０１ｃの画像の縮小率とは、これに限定されない。例えば、符号化対象フレーム３００の画像と、参照候補フレーム３０１ａ〜３０１ｃの画像とを、縦横共に１／３倍或いは１／４倍に縮小する等してもよい。また、本実施形態でも、参照候補フレームの数が３枚の場合を例に挙げて説明したが、参照候補フレームの数はこれに限定されない。例えば、参照候補フレームの数を増やしてもよい。参照候補フレームの数を増やした場合には、縮小率を更に段階的に変化させることも可能である。 In the present embodiment, the image of the encoding target frame 300 and the images of the reference candidate frames 301a to 301c are reduced to 1/2 times both vertically and horizontally according to the temporal distance, and the image is reduced vertically. An example in which only the horizontal direction is reduced by a factor of 1/2 is shown as an example. However, the image reduction ratio of the encoding target frame 300 and the image reduction ratios of the reference candidate frames 301a to 301c are not limited to this. For example, the image of the encoding target frame 300 and the images of the reference candidate frames 301a to 301c may be reduced to 1/3 times or 1/4 times both vertically and horizontally. In the present embodiment, the case where the number of reference candidate frames is three has been described as an example, but the number of reference candidate frames is not limited to this. For example, the number of reference candidate frames may be increased. When the number of reference candidate frames is increased, the reduction ratio can be further changed in stages.

(Third Embodiment)
Next, a third embodiment of the present invention will be described. In the first embodiment described above, the motion vector search accuracy is changed according only to the temporal distance between the encoding target frame 300 and the reference candidate frames 301a to 301c. The present embodiment has the same configuration as the first embodiment, but changes the motion vector search accuracy according to the encoding type of the reference candidate frame in addition to that temporal distance. In the following description, parts other than the search accuracy determination unit 102, which operates differently from the first embodiment, are therefore given the same reference numerals as in FIGS. 1 to 5, and their detailed description is omitted. In encoding schemes such as H.264, encoding can be performed in units of slices, each made up of one or more macroblocks and smaller than a picture, so the encoding type may be read as a "slice type" rather than a "picture type". The slice type is used as the example below.

図９は、本実施形態に係る動きベクトル検出器１２ｋの構成の一例を示すブロック図である。図１０は、本実施形態に係る動きベクトル検出器１２ｋの動作の一例について説明するフローチャートである。図１０を参照しながら、図９に示す動きベクトル検出器１２ｋの動作について説明する。 FIG. 9 is a block diagram showing an example of the configuration of the motion vector detector 12k according to this embodiment. FIG. 10 is a flowchart for explaining an example of the operation of the motion vector detector 12k according to the present embodiment. The operation of the motion vector detector 12k shown in FIG. 9 will be described with reference to FIG.

探索精度決定部１０２は、システム制御部１４から、バスＩ／Ｆ１２ｎを介して、符号化対象フレーム番号３０２、参照候補フレーム番号３０３、及び参照候補フレームのスライスタイプ９０１が指定されるまで待機する（ステップＳ１００１）。 The search accuracy determination unit 102 waits until the encoding target frame number 302, the reference candidate frame number 303, and the slice type 901 of the reference candidate frame are specified from the system control unit 14 via the bus I / F 12n ( Step S1001).

In step S1001, the encoding target frame number 302, the reference candidate frame number 303, and the slice type 901 of the reference candidate frame are specified. The search accuracy determination unit 102 then calculates the temporal distance td between the encoding target frame 300 and the reference candidate frame 301 (step S1002). Based on the calculated td, the search accuracy determination unit 102 changes the search accuracy stepwise according to the temporal distance between the encoding target frame 300 and the reference candidate frame 301, as in the first embodiment. The search accuracy determination unit 102 further changes the search accuracy according to the slice type 901 of the reference candidate frame.

すなわち、探索精度決定部１０２は、参照候補フレームのスライスタイプ９０１がＩスライスかどうかを判定する（ステップＳ１００３）。参照候補フレームスライスタイプ９０１がＩスライスであった場合（ステップＳ１００３でＹＥＳ）には、探索精度決定部１０２は、ｔｄから２ｔを引く処理を行う（ステップＳ１００４）。 In other words, the search accuracy determination unit 102 determines whether the slice type 901 of the reference candidate frame is an I slice (step S1003). If the reference candidate frame slice type 901 is an I slice (YES in step S1003), the search accuracy determination unit 102 performs a process of subtracting 2t from td (step S1004).

Ｉスライスは一般的に割り当てられる符号量が多く、画質の良い参照候補フレームである可能性が高い。そこで、探索精度決定部１０２は、Ｉスライスならば、時間的な距離が離れている場合であっても、このように２ｔを引くことで探索精度が高くなるようにする。参照候補フレームのスライスタイプ９０１がＩスライスでない場合（ステップＳ１００３でＮＯ）には、ステップＳ１００５へ進む。 An I slice generally has a large amount of code to be assigned, and is likely to be a reference candidate frame with good image quality. Therefore, the search accuracy determination unit 102 increases the search accuracy by subtracting 2t in this way, even if the temporal distance is long for an I slice. If the slice type 901 of the reference candidate frame is not an I slice (NO in step S1003), the process proceeds to step S1005.

Next, the search accuracy determination unit 102 determines whether the slice type 901 of the reference candidate frame is a P slice (step S1005). If it is a P slice (YES in step S1005), the search accuracy determination unit 102 subtracts t from td (step S1006). A P slice has lower image quality than an I slice but often has better image quality than a B slice. For a P slice, therefore, even when the temporal distance is long, the search accuracy determination unit 102 subtracts t so that the search accuracy is raised, not as much as for an I slice but more than for a B slice.

If the slice type of the reference candidate frame is not a P slice (NO in step S1005), the process proceeds to step S1007. That is, when the slice type 901 of the reference candidate frame is a B slice, being neither I nor P, the search accuracy determination unit 102 performs no weighting to raise the search accuracy and determines the search accuracy solely from the temporal distance between the frames.

In conventional MPEG, a B picture, which is roughly equivalent to an H.264 B slice, cannot be used as a reference frame. In H.264, however, a B slice can also be used as a reference frame and can therefore be included among the reference candidate frames.

以上のようにして求めたｔｄに応じて、探索精度決定部１０２は、動きベクトルの探索精度を変化させる（ステップＳ１００７）。 In accordance with td obtained as described above, the search accuracy determination unit 102 changes the motion vector search accuracy (step S1007).

図１１は、本実施形態に係る符号化対象フレームと参照候補フレームとの関係の一例を示した図である。図１１に示した例の場合、符号化対象フレーム番号３０２が「３」の符号化対象フレーム３００に対する、参照候補フレーム番号３０３が「２」の参照候補フレーム３０１ｃとの時間的な距離ｔｄは「ｔ」である。また、参照候補フレーム番号３０３が「１」の参照候補フレーム３０１ｂとの時間的な距離ｔｄは「２ｔ」である。また、参照候補フレーム番号３０３が「０」の参照候補フレーム３０１ａとの時間的な距離ｔｄは「３ｔ」である。 FIG. 11 is a diagram illustrating an example of the relationship between the encoding target frame and the reference candidate frame according to the present embodiment. In the case of the example illustrated in FIG. 11, the temporal distance td between the encoding target frame 300 whose encoding target frame number 302 is “3” and the reference candidate frame 301 c whose reference candidate frame number 303 is “2” is “ t ”. Further, the temporal distance td from the reference candidate frame 301b whose reference candidate frame number 303 is “1” is “2t”. Further, the temporal distance td from the reference candidate frame 301a having the reference candidate frame number 303 of “0” is “3t”.

しかしながら参照候補フレーム３０１ａはＩスライスであるので２ｔを減じてｔｄ＝ｔと変更され、また参照候補フレーム３０１ｂはＰスライスであるのでｔを減じてｔｄ＝ｔと変更される。従って、図１１に示した例の場合、参照候補フレーム３０１ａ、参照候補フレーム３０１ｂ、及び参照候補フレーム３０１ｃのいずれも、ｔｄ＝ｔとなる。 However, since the reference candidate frame 301a is an I slice, 2t is subtracted and changed to td = t, and since the reference candidate frame 301b is a P slice, t is subtracted and changed to td = t. Therefore, in the example illustrated in FIG. 11, all of the reference candidate frame 301a, the reference candidate frame 301b, and the reference candidate frame 301c are td = t.
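Steps S1003 to S1006 amount to a slice-type-dependent correction of td, which can be sketched as follows. The function name `adjusted_distance` is hypothetical, and distances are expressed as multiples of t. The slice type of the reference candidate frame 301c is not stated in this passage; it is assumed here to be a B slice, since its td remains t. The text does not say how a result smaller than t would be handled, so the sketch leaves the value unclamped.

```python
def adjusted_distance(td, slice_type, t=1):
    """Apply the slice-type weighting of steps S1003 to S1006 to td."""
    if slice_type == "I":
        return td - 2 * t  # step S1004: I slices are likely high quality
    if slice_type == "P":
        return td - t      # step S1006: P slices get a smaller correction
    return td              # B slices: temporal distance only


# Example of FIG. 11 with t = 1 (slice type of 301c assumed to be B).
frames = {"301a": (3, "I"), "301b": (2, "P"), "301c": (1, "B")}
adjusted = {name: adjusted_distance(td, kind) for name, (td, kind) in frames.items()}
```

In this example all three reference candidate frames end up with td = t, in agreement with the result stated for FIG. 11.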

すなわち、本実施形態では、第１の実施形態の効果に加えて、参照候補フレームがＩスライスならば、時間的にかなり離れたフレームであっても探索精度を直前のフレームと同等に高くすることができる。また参照候補フレームがＰスライスならば、少々時間的に離れたフレームであっても探索精度を直前のフレームと同等に高くすることができる。なお、その際、探索回数の総数は第１の実施形態と変わらないようにする。 That is, in this embodiment, in addition to the effects of the first embodiment, if the reference candidate frame is an I slice, the search accuracy is made as high as that of the immediately preceding frame even if the frame is considerably separated in time. Can do. If the reference candidate frame is a P slice, the search accuracy can be made as high as that of the immediately preceding frame even if the reference candidate frame is a little apart in time. At that time, the total number of searches is made the same as in the first embodiment.

動きベクトルの探索精度が探索精度決定部１０２により決定されると、動きベクトル演算部１０３は、動きベクトル３０４を決定する（ステップＳ１００８）。なお、動きベクトルを求める具体的な処理は第１の実施形態と同様のため説明を省略する。 When the search accuracy of the motion vector is determined by the search accuracy determination unit 102, the motion vector calculation unit 103 determines the motion vector 304 (step S1008). Note that the specific processing for obtaining the motion vector is the same as that in the first embodiment, and a description thereof will be omitted.

As described above, in the present embodiment, the slice type 901 of the reference candidate frames 301a to 301c is considered in addition to the temporal distance between the encoding target frame 300 and the reference candidate frames 301 (301a to 301c). The motion vector search accuracy is then changed according to the probability of finding the optimal motion vector.

Therefore, including temporally distant images in the motion vector search range improves the motion vector detection accuracy, and varying the motion vector search accuracy reduces the amount of computation required to detect a motion vector. Furthermore, by considering the slice type (picture type) of the reference candidate frame, the motion vector search accuracy can be further improved. As a result, for example, an increase in the consumption of the driving battery, and the resulting shortening of the shooting time, can be prevented as far as possible.

In this embodiment, the value subtracted from td is “2t” for an I slice and “t” for a P slice; however, these values are merely examples, and the numerical values are not limited to them. For example, “t” may be subtracted for an I slice and “0.5t” for a P slice.

In this embodiment, the motion vector search accuracy is changed. However, as in the second embodiment, the reduction rate of the encoding target frame 300 and the reduction rates of the reference candidate frames 301a to 301c may instead be changed according to the slice type (picture type) of the reference candidate frame.

(Fourth embodiment)
Next, a fourth embodiment of the present invention will be described. In the first embodiment described above, the motion vector search accuracy is changed according only to the temporal distance between the encoding target frame 300 and the reference candidate frames 301a to 301c. In contrast, the present embodiment has the same configuration as the first embodiment, but additionally includes a peak signal-to-noise ratio (PSNR) calculation unit 12o in the signal processing unit 12. In addition to the temporal distance between the encoding target frame 300 and the reference candidate frames 301a to 301c, the motion vector search accuracy is changed according to the PSNR of the reference candidate frame. Accordingly, in the following description, parts other than the search accuracy determination unit 102 and the PSNR calculation unit 12o, which operate differently from the first embodiment, are denoted by the same reference numerals as in FIGS. 1 to 5, and their detailed description is omitted.

FIG. 12 is a block diagram illustrating an example of the configuration of the signal processing unit 12 according to the present embodiment. In the present embodiment, only the operation of the PSNR calculation unit 12o, which is the part of the signal processing unit 12 that differs from the first embodiment, is described; description of the other components, which operate in the same way as in the first embodiment, is omitted.

In FIG. 12, the PSNR calculation unit 12o compares the pre-encoding image data coming from the image pickup unit 11 with the restored image data coming from the adder 12j, and calculates the value of the peak signal-to-noise ratio (PSNR), which serves as an index of image degradation, from the following equation (12).

Here, N and M represent the numbers of vertical and horizontal pixels of the image, respectively. p(i, j) represents the pixel value at position (i, j) in the image data before encoding, and p′(i, j) represents the pixel value at position (i, j) in the restored image data. T represents the number of gradations of the image minus 1 (T = 255 for an 8-bit/pixel image).
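Equation (12) itself is not reproduced in this text. The sketch below therefore assumes the conventional PSNR definition, PSNR = 10·log10(T² / MSE), where MSE is the mean of (p(i, j) − p′(i, j))² over all N × M pixels; this matches the symbols defined above but is an assumption, not a transcription of equation (12). The function name `psnr` and the list-of-rows image representation are hypothetical.

```python
import math


def psnr(original, restored, bit_depth=8):
    """PSNR in dB between two images given as lists of pixel rows.

    Assumes the conventional definition 10 * log10(T^2 / MSE);
    equation (12) of the source is not available to check against.
    """
    n = len(original)            # N: number of rows (vertical pixels)
    m = len(original[0])         # M: number of columns (horizontal pixels)
    t_max = (1 << bit_depth) - 1  # T: number of gradations minus 1 (255 for 8 bits)
    sq_err = sum(
        (original[i][j] - restored[i][j]) ** 2
        for i in range(n)
        for j in range(m)
    )
    if sq_err == 0:
        return math.inf          # identical images: no degradation
    mse = sq_err / (n * m)
    return 10.0 * math.log10(t_max ** 2 / mse)
```

A larger return value indicates that the restored image is closer to the image before encoding.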

The PSNR value obtained by the PSNR calculation unit 12o is sent to the motion vector detector 12k in FIG. 12 and is used to change the motion vector search accuracy.

FIG. 13 is a block diagram showing the configuration of the motion vector detector 12k according to this embodiment. FIG. 14 is a flowchart illustrating an example of the operation of the motion vector detector 12k according to this embodiment. The operation of the motion vector detector 12k shown in FIG. 13 will be described with reference to FIG. 14.

The search accuracy determination unit 102 waits until the encoding target frame number 302, the reference candidate frame number 303, and the PSNR value 1301 of the reference candidate frame are designated by the system control unit 14 via the bus I/F 12n (step S1401).

When the encoding target frame number 302, the reference candidate frame number 303, and the PSNR 1301 of the reference candidate frame are designated in step S1401, the search accuracy determination unit 102 calculates the temporal distance td between the encoding target frame 300 and the reference candidate frame 301 (step S1402). Based on the calculated td, the search accuracy determination unit 102 changes the search accuracy stepwise according to the temporal distance between the encoding target frame 300 and the reference candidate frame 301, as in the first embodiment. Further, the search accuracy determination unit 102 changes the search accuracy according to the PSNR 1301 of the reference candidate frame.

That is, the search accuracy determination unit 102 determines whether the PSNR 1301 of the reference candidate frame is larger than a predetermined threshold Th1 (step S1403). When the PSNR 1301 of the reference candidate frame is larger than the predetermined threshold Th1, that is, when PSNR > Th1 is satisfied (YES in step S1403), the search accuracy determination unit 102 subtracts t from td (step S1404).

In general, when the PSNR is high, the reference candidate frame is only slightly degraded and is likely to be suitable as a reference frame. Therefore, if PSNR > Th1 is satisfied, the search accuracy determination unit 102 subtracts t in this way so that the search accuracy becomes higher, even when the temporal distance between the frames is large.

The threshold Th1 can be set to a fixed value such as 30 dB, which is said to be the practical level of image quality for SD (Standard Definition), for example. If the case where the PSNR of most restored images falls below 30 dB is to be taken into account, the average PSNR of the restored images, updated as needed, may be used as a variable threshold. If the PSNR 1301 of the reference candidate frame does not satisfy PSNR > Th1 (NO in step S1403), the process proceeds to step S1405.
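One way to realize the variable threshold mentioned above is a running mean of the PSNR of the restored images. The sketch below is only an illustration under stated assumptions: the class name `RunningPsnrThreshold` is hypothetical, the averaging window is unbounded (the text does not specify one), and the fallback to the fixed 30 dB value before any sample arrives is a choice of this sketch.

```python
class RunningPsnrThreshold:
    """Variable threshold: running mean of the PSNR of restored images."""

    def __init__(self, initial_db=30.0):
        # Until the first PSNR sample arrives, fall back to the fixed
        # 30 dB value (an assumption of this sketch).
        self.value = initial_db
        self.count = 0

    def update(self, psnr_db):
        """Fold one restored image's PSNR into the mean and return it."""
        self.count += 1
        if self.count == 1:
            self.value = psnr_db
        else:
            # Incremental mean: new_mean = old_mean + (x - old_mean) / n
            self.value += (psnr_db - self.value) / self.count
        return self.value


th1 = RunningPsnrThreshold()
for sample in (28.0, 26.0, 27.0):
    th1.update(sample)
```

After the three samples, `th1.value` is their mean, 27.0 dB, so the threshold follows the typical quality of the sequence instead of staying at 30 dB.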

Subsequently, the search accuracy determination unit 102 determines whether the PSNR 1301 of the reference candidate frame is smaller than a predetermined threshold Th2 (where Th2 < Th1) (step S1405). When the PSNR 1301 of the reference candidate frame is smaller than the predetermined threshold Th2, that is, when PSNR < Th2 is satisfied (YES in step S1405), the search accuracy determination unit 102 adds t to td (step S1406). The threshold Th2 is set in the same manner as the threshold Th1.

In general, when the PSNR is low, the reference candidate frame is heavily degraded and is likely to be unsuitable as a reference frame. Therefore, if PSNR < Th2 is satisfied, the search accuracy determination unit 102 adds t in this way so that the search accuracy becomes lower, even when the temporal distance between the frames is small.

If the PSNR 1301 of the reference candidate frame does not satisfy PSNR < Th2 (NO in step S1405), the process proceeds to step S1407. That is, when the PSNR is in an intermediate state, neither high nor low, the search accuracy determination unit 102 determines the search accuracy according only to the temporal distance between the frames.

The search accuracy determination unit 102 changes the motion vector search accuracy according to the td obtained as described above (step S1407).
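The td correction of steps S1403 to S1406 can be sketched as follows. The function name `correct_td_by_psnr` is hypothetical, td is expressed in units of t, Th1 = 30 dB is the example value from the text, and Th2 = 25 dB is an assumed value chosen only to satisfy Th2 < Th1. The mapping from the corrected td to a search accuracy (step S1407) follows the first embodiment and is not reproduced here.

```python
def correct_td_by_psnr(td, psnr_db, th1=30.0, th2=25.0, step=1.0):
    """Return td (in units of t) corrected by the reference frame's PSNR.

    th2 = 25 dB is an assumed example; the text only requires th2 < th1.
    The text does not say how a corrected td below t is handled,
    so this sketch does not clamp the result.
    """
    if psnr_db > th1:    # S1403 YES -> S1404: little degradation
        return td - step
    if psnr_db < th2:    # S1405 YES -> S1406: heavy degradation
        return td + step
    return td            # intermediate PSNR: temporal distance only
```

For example, a frame at td = 3t with a PSNR of 35 dB is treated as if it were at 2t, while a frame at td = t with a PSNR of 20 dB is treated as if it were at 2t.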

Thus, in this embodiment, in addition to the effects of the first embodiment, if the PSNR of the reference candidate frame is large, the search accuracy can be made high even for a frame that is distant in time. If the PSNR of the reference candidate frame is small, the search accuracy can be made low even for a frame that is close in time.

When the motion vector search accuracy has been determined by the search accuracy determination unit 102, the motion vector calculation unit 103 determines the motion vector 304 (step S1408). The specific processing for obtaining the motion vector is the same as in the first embodiment, and its description is omitted.

As described above, in the present embodiment, the PSNR value 1301 of the reference candidate frames 301a to 301c is considered in addition to the temporal distance between the encoding target frame 300 and the reference candidate frames 301 (301a to 301c). The motion vector search accuracy is then changed according to the probability of finding the optimal motion vector.

Therefore, including temporally distant images in the motion vector search range improves the motion vector detection accuracy, and varying the motion vector search accuracy reduces the amount of computation required to detect a motion vector. Furthermore, by considering the PSNR value of the reference candidate frame, the motion vector search accuracy can be further improved and the computation can be reduced more efficiently. As a result, for example, an increase in the consumption of the driving battery, and the resulting shortening of the shooting time, can be prevented as far as possible.

In the present embodiment, the value subtracted from td when the PSNR is high is “t”; however, this value is merely an example, and the numerical value is not limited to it. For example, “0.5t” may be subtracted instead.

Furthermore, in the present embodiment, the search accuracy is determined (changed) by correcting td according to the PSNR value; as another method, the PSNR value itself may be corrected according to td. In this case, the search accuracy may be determined from the corrected PSNR.

That is, when td is large, i.e., the temporal distance to the reference candidate frame is large, the search accuracy determination unit 102 applies a correction that subtracts a large value from the PSNR. When td is small, i.e., the temporal distance to the reference candidate frame is small, the search accuracy determination unit 102 applies, for example, no correction to the PSNR. The search accuracy determination unit 102 then sets the search accuracy high when the corrected PSNR is large and low when it is small.
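The alternative above, in which the PSNR is corrected by td, might look like the following sketch. The text gives no numerical values for this variant, so everything quantitative here is an assumption: the linear penalty of 2 dB per t beyond the adjacent frame, the single 30 dB decision threshold, and the two-level "high"/"low" result. The function names `corrected_psnr` and `search_accuracy` are hypothetical.

```python
def corrected_psnr(psnr_db, td, penalty_db_per_t=2.0):
    """Lower the PSNR as the reference candidate frame gets farther away.

    td is in units of t. No correction is applied for td <= 1 (the
    adjacent frame); the linear 2 dB-per-t penalty is an assumed example.
    """
    extra_distance = max(td - 1.0, 0.0)
    return psnr_db - penalty_db_per_t * extra_distance


def search_accuracy(psnr_db, td, threshold_db=30.0):
    """Two-level decision on the corrected PSNR (assumed threshold)."""
    return "high" if corrected_psnr(psnr_db, td) > threshold_db else "low"
```

Under these assumed values, a 32 dB frame at td = t keeps its PSNR and is searched with high accuracy, whereas the same frame at td = 3t is corrected to 28 dB and searched with low accuracy.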

In the present embodiment, the motion vector search accuracy is changed. However, as in the second embodiment, the image reduction rate of the encoding target frame 300 and the image reduction rates of the reference candidate frames 301a to 301c may instead be changed according to the PSNR of the reference candidate frame.

(Other embodiments of the present invention)
Software program code for realizing the functions of the above-described embodiments may be supplied to a computer in an apparatus or system connected to various devices, so that those devices operate to realize the functions of the embodiments. An implementation in which the devices are operated according to a program stored in the computer (CPU or MPU) of that system or apparatus is also included in the scope of the present invention.

In this case, the program code of the software itself realizes the functions of the above-described embodiments. The program code itself, and means for supplying the program code to a computer, for example a recording medium storing the program code, constitute the present invention. As a recording medium for storing the program code, for example, a flexible disk, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

Moreover, the functions of the above-described embodiments are realized not only when the computer executes the supplied program code. Needless to say, such program code is also included in the embodiments of the present invention when the functions of the above-described embodiments are realized by the program code in cooperation with an operating system, other application software, or the like running on the computer.

Further, the supplied program code may be stored in a memory provided on a function expansion board of the computer, after which a CPU provided on the function expansion board performs part or all of the actual processing based on the instructions of the program code. Needless to say, the case where the functions of the above-described embodiments are realized by that processing is also included in the present invention.

Likewise, the supplied program code may be stored in a memory provided in a function expansion unit connected to the computer, after which a CPU or the like provided in the function expansion unit performs part or all of the actual processing based on the instructions of the program code. Needless to say, the case where the functions of the above-described embodiments are realized by that processing is also included in the present invention.

Each of the above-described embodiments is merely a specific example of carrying out the present invention, and the technical scope of the present invention should not be construed as being limited by them. That is, the present invention can be implemented in various forms without departing from its technical idea or its main features.

DESCRIPTION OF SYMBOLS
11 Image pickup unit
12 Signal processing unit
13 Recording unit
14 System control unit
15 Display unit
16 Operation unit
17 Bus
12k Motion vector detector
100 Encoding target frame storage unit
101 Reference candidate frame storage unit
102 Search accuracy determination unit
103 Motion vector calculation unit
300 Encoding target frame
301 Reference candidate frame
302 Encoding target frame number
303 Reference candidate frame number
304 Motion vector
402 Reduction rate determination unit
404 Reduced frame creation unit
405 Reduced encoding target frame storage unit
406 Reduced reference candidate frame storage unit
407 Motion vector calculation unit
408 Motion vector
901 Reference candidate frame
1301 PSNR of reference candidate frame

Claims

A motion vector detection apparatus for detecting a motion vector between screens, comprising:
calculation means for calculating a temporal distance between an encoding target image and each of a plurality of candidate images that are candidates for a reference image referred to by the encoding target image; and
motion vector detection means for searching for a motion vector between the encoding target image and each of the plurality of candidate images, and detecting a motion vector based on the search result,
wherein, when searching for a motion vector between the encoding target image and each of the plurality of candidate images, the motion vector detection means changes the amount of calculation to be executed according to the temporal distance to each candidate image calculated by the calculation means and a peak signal-to-noise ratio (PSNR) value of each candidate image.

The motion vector detection apparatus according to claim 1, further comprising determination means for determining the motion vector search accuracy between the encoding target image and each of the plurality of candidate images according to the temporal distance to each candidate image calculated by the calculation means and the PSNR value of each candidate image,
wherein the motion vector detection means performs the motion vector search between the encoding target image and each of the plurality of candidate images with the search accuracy determined by the determination means.

The motion vector detection apparatus according to claim 2, wherein the determination means increases the search accuracy of the motion vector as the temporal distance to each candidate image calculated by the calculation means is shorter, and lowers the search accuracy of the motion vector as the temporal distance to each candidate image calculated by the calculation means is longer.

The motion vector detection apparatus according to claim 3, wherein the determination means increases the search accuracy of the motion vector when the PSNR value of the candidate image is larger than a predetermined threshold Th1.

The motion vector detection apparatus according to claim 3, wherein the determination means lowers the search accuracy of the motion vector when the PSNR value of the candidate image is smaller than a predetermined threshold Th2.

The motion vector detection apparatus according to claim 3, wherein the determination means increases the search accuracy of the motion vector when the PSNR value of the candidate image is larger than a predetermined threshold Th1, and lowers the search accuracy of the motion vector when the PSNR value of the candidate image is smaller than a predetermined threshold Th2.

A motion vector detection method for detecting a motion vector between screens, comprising:
a calculation step of calculating a temporal distance between an encoding target image and each of a plurality of candidate images that are candidates for a reference image referred to by the encoding target image; and
a motion vector detection step of searching for a motion vector between the encoding target image and each of the plurality of candidate images, and detecting a motion vector based on the search result,
wherein, when searching for a motion vector between the encoding target image and each of the plurality of candidate images, the amount of calculation executed in the motion vector detection step is changed according to the temporal distance to each candidate image calculated in the calculation step and a peak signal-to-noise ratio (PSNR) value of each candidate image.