JP6055230B2

JP6055230B2 - Transmission device, reception device, and program

Info

Publication number: JP6055230B2
Application number: JP2012177518A
Authority: JP
Inventors: 俊枝三須; 井口　和久; 和久井口; 康孝松尾; 俊輔岩村; 境田　慎一; 慎一境田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2012-08-09
Filing date: 2012-08-09
Publication date: 2016-12-27
Anticipated expiration: 2032-08-09
Also published as: JP2014036385A

Description

本発明は、入力映像の一部のフレームを間引いて伝送する送信装置、該送信装置により間引かれたフレームを補間生成する受信装置、及びそれらのプログラムに関する。 The present invention relates to a transmission device that thins out and transmits a part of a frame of an input video, a reception device that interpolates and generates a frame thinned out by the transmission device, and a program thereof.

従来、映像(動画像)の圧縮においては、あるフレームを符号化する際、近接する時刻の他のフレームとの時間相関を利用することで、情報の削減を行っている。しかし、画像サイズが大きくなるほど、またフレームレートが高くなるほど符号化が困難となり、とくに実時間の符号化処理は処理できる画像サイズやフレームレートに限界がある。そのため、画像サイズやフレームレートの限界を超える映像を処理するためには、映像にサイズやフレームレートを送信側で間引いて符号化を行い、受信側で間引かれた画素やフレームを補間する必要がある。 Conventionally, in video (moving image) compression, when a certain frame is encoded, information is reduced by using a time correlation with another frame at a close time. However, the larger the image size and the higher the frame rate, the more difficult it is to encode, and there is a limit to the image size and frame rate that can be processed, especially in real-time encoding processing. Therefore, in order to process video that exceeds the limits of image size and frame rate, it is necessary to encode the video by decimating the size and frame rate on the transmission side and interpolating the thinned pixels and frames on the reception side There is.

例えば、送信側でフレームを時間的に間引き、間引かれたフレームを間引かれていないフレーム(キーフレーム)から復元するためのサイド情報とともに伝送する映像符号化手法として、誤り訂正技術を応用したDistributed Video Codingが提案されている。 For example, error correction technology was applied as a video encoding method that transmits frames with temporal information on the transmission side and side information to restore the thinned frames from undecimated frames (key frames). Distributed Video Coding has been proposed.

一方、画速（フレーム周波数）をより高いフレーム周波数へと変換するため、同一フレームを複数回提示する手法がある。例えば、フレーム周波数が３０Ｈｚの各フレームを２回ずつ提示することでフレーム周波数を６０Ｈｚに変換することができる。また、同一フレームを提示する回数を可変とした手法もある。例えば、２−３変換と称される手法では、フレーム周波数が２４Ｈｚの奇数番目のフレームを２回、偶数番目のフレームを３回提示することで、フレーム周波数が６０Ｈｚの映像を得ることができる。 On the other hand, there is a method of presenting the same frame a plurality of times in order to convert the image speed (frame frequency) to a higher frame frequency. For example, the frame frequency can be converted to 60 Hz by presenting each frame with a frame frequency of 30 Hz twice. There is also a technique in which the number of times the same frame is presented is variable. For example, in a technique called 2-3 conversion, an odd-numbered frame having a frame frequency of 24 Hz is presented twice and an even-numbered frame is presented three times, whereby an image having a frame frequency of 60 Hz can be obtained.

また、補間フレームの時刻に隣接する２フレームの各画素値の平均又は、重みつき平均によって補間フレームの画素値を決定する手法もある。 There is also a method of determining the pixel value of the interpolation frame by the average of the pixel values of two frames adjacent to the time of the interpolation frame or the weighted average.

さらに、映像の動きベクトルを利用して動き補償を行うことで、よりスムーズな補間フレームを生成する手法が知られている（例えば、特許文献１参照）。また、フレーム間引きを伴う映像符号化において、送信側でフレーム補間の有効性を評価して、適応的に間引きを行う手法が知られている（例えば、特許文献２参照）。 Furthermore, a method of generating a smoother interpolation frame by performing motion compensation using a motion vector of a video is known (see, for example, Patent Document 1). In video coding with frame decimation, a technique is known in which the effectiveness of frame interpolation is evaluated on the transmission side and adaptive decimation is performed (see, for example, Patent Document 2).

特許第３４９５４８５号公報Japanese Patent No. 3495485 特許第４９３０４０９号公報Japanese Patent No. 4930409

しかし、従来の同一フレームを複数回提示する手法では、映像の動きがぎこちなくなるという問題があった。また、平均や重み付き平均により補間フレームを生成する手法によれば、映像の動きのぎこちなさは改善するものの、補間フレームにおいて動物体が二重像となり、解像度が低下するという問題や、補間フレームと補間ではない元来のフレームとの画質の違いがフリッカのように見えてしまうという問題があった。 However, the conventional method of presenting the same frame multiple times has a problem that the motion of the video becomes awkward. Also, according to the method of generating an interpolation frame by averaging or weighted averaging, although the awkwardness of video motion is improved, there is a problem that the moving object becomes a double image in the interpolation frame and the resolution is lowered, and the interpolation frame There is a problem in that the difference in image quality from the original frame which is not interpolated looks like flicker.

また、特許文献１のような手法では、動きのぎこちなさや二重像は改善されるものの、動きが異なる被写体が同一ブロックに含まれている場合の動き補償誤差や、動き推定の精度の粗さによって生じる標本点の小数精度の誤差に起因して、動き補償に用いるブロックの境界が目立つことがある。動き補償を小数画素精度で行えば、標本点の小数精度の誤差は改善できるものの、小数画素精度の画素位置の移動（位相シフト）を行うディジタルフィルタの特性に起因して輪郭のぼやけやリンギングが生じることがある。 In addition, in the method as disclosed in Patent Document 1, although the awkwardness of motion and the double image are improved, the motion compensation error when the subject having different motion is included in the same block, and the coarseness of the motion estimation accuracy The boundary of the block used for motion compensation may be conspicuous due to the decimal accuracy error of the sample points caused by the above. If motion compensation is performed with decimal pixel accuracy, the decimal accuracy error of the sample point can be improved, but the blurring and ringing of the contour may occur due to the characteristics of the digital filter that moves the pixel position (phase shift) with decimal pixel accuracy. May occur.

また、特許文献２のような手法では、フレーム間引きの際に、受信側でのフレーム補間の有効に働くような間引き制御を行うものの、参照フレーム補間の方法を送信側で検証する仕組み及び検証結果を受信側へ伝送する仕組みは備えていない。そのため、受信側の動作が適切な状態となるように設計することが困難であった。 Further, in the technique such as Patent Document 2, the thinning control is performed so that the frame interpolation on the receiving side works effectively at the time of frame thinning, but the mechanism and the verification result for verifying the reference frame interpolation method on the transmitting side Is not provided to the receiver. For this reason, it has been difficult to design the receiving side to be in an appropriate state.

かかる事情に鑑みてなされた本発明の目的は、入力映像から補間画像を生成する際に、解像度の低下やフリッカを防止し、かつ、高周波成分を含む鮮鋭な輪郭像を得ることが可能な送信装置、受信装置、及びそのプログラムを提供することにある。 In view of such circumstances, an object of the present invention is to provide a transmission capable of preventing a decrease in resolution and flicker and obtaining a sharp contour image including a high-frequency component when generating an interpolated image from an input video. An apparatus, a receiving apparatus, and a program thereof are provided.

上記課題を解決するため、本発明に係る送信装置は、入力映像の一部のフレームを間引いて伝送する送信装置であって、入力映像のうち伝送されるフレームである伝送フレームと、伝送されないフレームである間引き画像とを決定する時間分割部と、前記伝送フレームから動き推定元画像、動き推定先画像、及び動き補償元画像の組を複数組決定する参照フレーム決定手段と、前記参照フレーム決定手段で決定された複数組に対してそれぞれ、前記動き推定元画像のブロック領域と画素値パターンの類似する対応領域を前記動き推定先画像から決定し、前記動き推定元画像のブロック領域と前記動き推定先画像の対応領域との対応位置関係を求める動き推定部と、前記参照フレーム決定手段で決定された複数組に対してそれぞれ、前記対応位置関係に基づいて前記動き補償元画像を前記間引き画像のフレーム位置まで動き補償した画像と、前記間引き画像との残差をブロック領域ごとに算出する動き補償残差評価部と、前記残差の値からブロック領域ごとの動き補償の有効性を示す判定情報を生成して受信側に送信する判定部と、を備えることを特徴とする。 In order to solve the above-described problem, a transmitting apparatus according to the present invention is a transmitting apparatus that thins out and transmits a part of a frame of an input video, and includes a transmission frame that is a transmitted frame of the input video and a frame that is not transmitted. A time division unit for determining a thinned-out image, reference frame determining means for determining a plurality of sets of a motion estimation original image, a motion estimation target image, and a motion compensation original image from the transmission frame, and the reference frame determination means For each of the plurality of sets determined in step (b), a corresponding region having a pixel value pattern similar to the block region of the motion estimation source image is determined from the motion estimation destination image, and the block region of the motion estimation source image and the motion estimation are determined. each motion estimation unit for obtaining the corresponding positional relationship between the corresponding region of the previous image, for a plurality of sets determined in the reference frame determining means, said corresponding position A motion compensation residual evaluation unit that calculates a residual between the image obtained by motion compensation of the motion compensated original image to a frame position of the thinned image based on a relationship and the thinned image for each block region; and a value of the residual And a determination unit that generates determination information indicating the effectiveness of motion compensation for each block region and transmits the determination information to the reception side .

さらに、本発明に係る送信装置において、フレームを符号化して伝送する際の符号化による劣化を模擬し、符号化により劣化されたフレームを生成する伝送模擬部を更に備え、前記参照フレーム決定手段は、前記伝送模擬部により生成されたフレームから動き推定元画像、動き推定先画像、及び動き補償元画像の組を複数組決定することを特徴とする。 Furthermore, in the transmitting apparatus according to the present invention, the transmitter further includes a transmission simulation unit that simulates deterioration due to encoding when a frame is encoded and transmitted, and generates a frame deteriorated due to encoding. A plurality of sets of a motion estimation source image, a motion estimation destination image, and a motion compensation source image are determined from the frame generated by the transmission simulation unit.

また上記課題を解決するため、本発明に係る受信装置は、上記送信装置により間引かれた間引き画像を補間生成する受信装置であって、前記送信装置から伝送されるフレームから動き推定元画像、動き推定先画像、及び動き補償元画像の組を複数組決定する参照フレーム決定手段と、前記送信装置から前記判定情報を取得し、該判定情報により動き補償が有効であると通知されたブロックについて、前記動き推定元画像のブロック領域と画素値パターンの類似する対応領域を前記動き推定先画像から決定し、前記動き推定元画像のブロック領域と前記動き推定先画像の対応領域との対応位置関係を求める動き推定部と、前記対応位置関係に基づいて前記動き補償元画像の重畳先座標を求める動き補償部と、前記複数組の動き補償元画像をそれぞれ重畳先座標に重畳して補間画像を生成する重畳部と、前記送信装置から伝送されたフレームと、前記重畳部により生成された補間画像とを時間多重する時間多重部と、を備えることを特徴とする。 In order to solve the above-described problem, a receiving apparatus according to the present invention is a receiving apparatus that interpolates and generates a thinned image thinned out by the transmitting apparatus, and a motion estimation original image from a frame transmitted from the transmitting apparatus, Reference frame determination means for determining a plurality of sets of a motion estimation destination image and a motion compensation source image, and a block for which the determination information is acquired from the transmission apparatus and in which motion compensation is notified by the determination information A corresponding region having a pixel value pattern similar to the block region of the motion estimation source image is determined from the motion estimation destination image, and the corresponding positional relationship between the block region of the motion estimation source image and the corresponding region of the motion estimation destination image A motion estimator for obtaining a motion compensation source image for obtaining a superimposition destination coordinate of the motion compensation source image based on the corresponding positional relationship, and a plurality of sets of motion compensation source images. A superimposing unit that generates an interpolated image by superimposing on the superimposition destination coordinates, and a time multiplexing unit that time-multiplexes the frame transmitted from the transmission device and the interpolated image generated by the superimposing unit. Features.

さらに、本発明に係る受信装置において、前記重畳部は、前記動き推定元画像、前記動き推定先画像、前記動き補償元画像、及び前記補間画像の時刻位置に応じて、前記動き補償画像を重み付けして重畳先座標に重畳し、補間画像を生成することを特徴とする。 Furthermore, in the receiving apparatus according to the present invention, the superimposing unit weights the motion compensation image according to the time position of the motion estimation original image, the motion estimation destination image, the motion compensation original image, and the interpolated image. Then, it is superimposed on the superimposition destination coordinates to generate an interpolated image.

さらに、本発明に係る受信装置において、前記重畳部は、前記動き推定元画像のブロック領域と前記動き推定先画像の対応領域との類似度に応じて、前記動き補償画像をブロック領域ごとに重み付けして重畳先座標に重畳し、補間画像を生成することを特徴とする。 Furthermore, in the receiving apparatus according to the present invention, the superimposing unit weights the motion compensated image for each block region according to the similarity between the block region of the motion estimation source image and the corresponding region of the motion estimation destination image. Then, it is superimposed on the superimposition destination coordinates to generate an interpolated image.

また、上記課題を解決するため、本発明に係るプログラムは、コンピュータを、上記送信装置、又は上記受信装置として機能させることを特徴とする。 In order to solve the above problems, a program according to the present invention causes a computer to function as the transmission device or the reception device.

本発明によれば、送信装置側で入力映像の一部のフレームを間引き、受信装置側で間引かれたフレームを高精度に補間することが可能となる。その結果、全てのフレームを伝送する場合よりも、レート歪み特性（Ｒ−Ｄ特性）の良い伝送が可能となる。 According to the present invention, it is possible to thin out some frames of an input video on the transmission device side and to interpolate frames thinned out on the reception device side with high accuracy. As a result, transmission with better rate distortion characteristics (RD characteristics) is possible than when all frames are transmitted.

本発明の一実施形態に係る送信装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the transmitter which concerns on one Embodiment of this invention. 本発明の一実施形態に係る送信装置における残差評価部の構成例を示すブロック図である。It is a block diagram which shows the structural example of the residual evaluation part in the transmitter which concerns on one Embodiment of this invention. 本発明の一実施形態に係る送信装置における残差評価部の動作を説明する図である。It is a figure explaining operation | movement of the residual evaluation part in the transmitter which concerns on one Embodiment of this invention. 本発明の一実施形態に係る送信装置における判定部の出力するビットパターン列を示す図である。It is a figure which shows the bit pattern sequence which the determination part in the transmitter which concerns on one Embodiment of this invention outputs. 本発明の一実施形態に係る送信装置が伝送模擬部を有する場合の構成例を示すブロック図である。It is a block diagram which shows the structural example in case the transmitter which concerns on one Embodiment of this invention has a transmission simulation part. 本発明の一実施形態に係る受信装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the receiver which concerns on one Embodiment of this invention. 本発明の一実施形態に係る受信装置における補間部の構成例を示すブロック図である。It is a block diagram which shows the structural example of the interpolation part in the receiver which concerns on one Embodiment of this invention. 本発明の一実施形態に係る受信装置における補間部の動作を説明する図である。It is a figure explaining operation | movement of the interpolation part in the receiver which concerns on one Embodiment of this invention.

以下、本発明による送信装置及び受信装置の一実施形態について、図面を参照して詳細に説明する。本明細書では、ｆはフレーム番号を示し、Ｉ（ｆ）はフレーム番号ｆの画像を示す。また、フレーム時刻位置をフレーム番号で表す。 Hereinafter, an embodiment of a transmission device and a reception device according to the present invention will be described in detail with reference to the drawings. In this specification, f indicates a frame number, and I (f) indicates an image of the frame number f. The frame time position is represented by a frame number.

［送信装置］
図１は、本発明の一実施形態に係る送信装置の構成例を示すブロック図である。図１に示すように、送信装置１は、時間分割部１０と、伝送フレームメモリ１１と、非伝送フレームメモリ１２と、走査部１３と、Ｋ個の残差評価部１４（１４−１，１４−２，・・・１４−Ｋ）と、判定部１５と、を備える。説明の便宜上、伝送フレームメモリ１１及び非伝送フレームメモリ１２を分けて図示するが、１つのフレームメモリとして構成してもよい。また、伝送フレームメモリ１１及び非伝送フレームメモリ１２を送信装置の外部に備えるように構成してもよい。送信装置１は、入力映像の一部のフレームを間引いて伝送する。 [Transmitter]
FIG. 1 is a block diagram illustrating a configuration example of a transmission apparatus according to an embodiment of the present invention. As shown in FIG. 1, the transmission apparatus 1 includes a time division unit 10, a transmission frame memory 11, a non-transmission frame memory 12, a scanning unit 13, and K residual evaluation units 14 (14-1, 14). -2, ... 14-K) and a determination unit 15. For convenience of explanation, the transmission frame memory 11 and the non-transmission frame memory 12 are illustrated separately, but may be configured as one frame memory. Further, the transmission frame memory 11 and the non-transmission frame memory 12 may be provided outside the transmission device. The transmission apparatus 1 transmits a part of the frame of the input video by thinning it out.

時間分割部１０は、入力映像のうち間引かれないで送信装置１から伝送されるフレームである伝送フレームと、入力映像から間引かれて送信装置１から伝送されないフレームである非伝送フレーム（間引き画像）とを決定する。そして、伝送フレームを伝送フレームメモリ１１に記憶するとともに外部に出力し、非伝送フレームを非伝送フレームメモリ１２に記憶する。 The time division unit 10 transmits a transmission frame that is a frame transmitted from the transmission device 1 without being thinned out of the input video, and a non-transmission frame (a thinning frame) that is a frame that is thinned out from the input video and is not transmitted from the transmission device 1. Image). Then, the transmission frame is stored in the transmission frame memory 11 and output to the outside, and the non-transmission frame is stored in the non-transmission frame memory 12.

伝送フレームメモリ１１は、送信装置１から伝送される（又は伝送された）フレームを複数フレーム記憶する。図１では、伝送フレームメモリ１１は、記憶された３時点のフレームを残差評価部１４に出力可能なように構成されている。 The transmission frame memory 11 stores a plurality of frames transmitted (or transmitted) from the transmission device 1. In FIG. 1, the transmission frame memory 11 is configured to be able to output the stored frames at three time points to the residual evaluation unit 14.

走査部１３は、入力映像上に配置するブロック領域（部分領域）の位置を示すブロック位置情報を残差評価部１４に出力する。 The scanning unit 13 outputs block position information indicating the position of a block region (partial region) to be arranged on the input video to the residual evaluation unit 14.

図２は、残差評価部１４の構成例を示すブロック図である。図２に示すように、残差評価部１４は、ブロック切出部１４１と、動き推定部１４２と、動き補償残差評価部１４３と、を備える。 FIG. 2 is a block diagram illustrating a configuration example of the residual evaluation unit 14. As illustrated in FIG. 2, the residual evaluation unit 14 includes a block cutout unit 141, a motion estimation unit 142, and a motion compensation residual evaluation unit 143.

図３は、残差評価部１４の動作の概要を説明するための図である。動き推定元画像とは、動き推定部１４２にて動き推定の対象元となる画像フレームである。動き推定先画像とは、動き推定部１４２にて動き推定の対象先となる画像フレームである。動き補償元画像とは、動き補償残差評価部１４３にて動き補償の対象となる画像フレームである。 FIG. 3 is a diagram for explaining an outline of the operation of the residual evaluation unit 14. The motion estimation original image is an image frame that is a motion estimation target source in the motion estimation unit 142. The motion estimation destination image is an image frame that is a target of motion estimation in the motion estimation unit 142. The motion compensation original image is an image frame that is subject to motion compensation in the motion compensation residual evaluation unit 143.

動き推定元画像のフレーム番号ｆ_ａと動き推定先画像のフレーム番号ｆ_ｂとは一致してはならないが、動き推定元画像のフレーム番号ｆ_ａと動き補償元画像のフレーム番号ｆ_ｐとは一致してもよく、また、動き推定先画像のフレーム番号ｆ_ｂと動き補償元画像のフレーム番号ｆ_ｐとが一致してもよい。なお、図３において、Ｉ（ｆ_ａ ^（1））などの上付きの（１）は、残差評価部１４−１による処理を意味し、同様に上付きの（２）は、残差評価部１４−２による処理を意味する。 Although the frame number f _b of the frame number of the motion estimation original image f _a and the motion estimation target image must match one the frame number f _p of the motion estimation original image frame number f _a and the motion compensated original image may match, also it has a frame number f _p of the frame number f _b and the motion compensation based on image motion estimation target image may coincide. In FIG. 3, superscript ⁽¹⁾ such as I (f _a ⁽¹⁾ ) means processing by the residual evaluation unit 14-1, and superscript (2) is residual evaluation. This means processing by the unit 14-2.

残差評価部１４は、伝送フレームメモリ１１に記憶された伝送フレームから、動き推定元画像Ｉ（ｆ_ａ）、動き推定先画像Ｉ（ｆ_ｂ）、及び動き補償元画像Ｉ（ｆ_ｐ）の組を決定する参照フレーム決定手段を有する。これらの参照フレームの組は、残差評価部１４ごとに異なるものとする。残差評価部１４は、動き推定元画像Ｉ（ｆ_ａ）及び動き推定先画像Ｉ（ｆ_ｂ）に基づき、画像内のブロック領域（部分領域）ごとの動きを算出し、該動きに基づいて動き補償元画像Ｉ（ｆ_ｐ）と間引き画像Ｉ（ｆ_ｑ）の位置合わせを行う。そして、位置合わせを行った動き補償元画像Ｉ（ｆ_ｐ）と間引き画像Ｉ（ｆ_ｑ）の残差ｅを算出し、判定部１５に出力する。 The residual evaluation unit 14 extracts the motion estimation original image I (f _a ), the motion estimation destination image I (f _b ), and the motion compensation original image I (f _p ) from the transmission frame stored in the transmission frame memory 11. Reference frame determining means for determining a set is provided. These sets of reference frames are different for each residual evaluation unit 14. The residual evaluation unit 14 calculates a motion for each block region (partial region) in the image based on the motion estimation original image I (f _a ) and the motion estimation destination image I (f _b ), and based on the motion The motion compensation original image I (f _p ) and the thinned image I (f _q ) are aligned. Then, a residual e between the motion compensated original image I (f _p ) and the thinned image I (f _q ) subjected to the alignment is calculated and output to the determination unit 15.

ブロック切出部１４１は、伝送フレームメモリ１１に記憶された動き推定元画像Ｉ（ｆ_ａ）から、走査部１３から入力されるブロック位置情報により指定される位置のブロック領域Ｂを切り出し、動き推定部１４２に出力する。画像座標（ｘ，ｙ）における画素値をＩ（ｆ，ｘ，ｙ）と記し、水平Ｗ画素、垂直Ｈ画素のブロック領域Ｂの画像座標（ｉ，ｊ）における画素値をＢ（ｉ，ｊ）とすると、ブロック切出部１４１は、走査部１３から入力されたブロック領域Ｂの左上座標（ｓ，ｔ）に基づき、例えば次式（１）のようにブロック領域Ｂを切り出す。 The block cutout unit 141 cuts out the block region B at the position specified by the block position information input from the scanning unit 13 from the motion estimation original image I (f _a ) stored in the transmission frame memory 11 to perform motion estimation. Output to the unit 142. The pixel value at the image coordinate (x, y) is denoted as I (f, x, y), and the pixel value at the image coordinate (i, j) of the block area B of horizontal W pixel and vertical H pixel is defined as B (i, j ), The block cutout unit 141 cuts out the block region B, for example, as in the following equation (1) based on the upper left coordinates (s, t) of the block region B input from the scanning unit 13.

動き推定部１４２は、ブロック切出部１４１により切り出された動き推定元画像Ｉ（ｆ_ａ）のブロック領域Ｂと画素値パターンの類似する対応領域を、伝送フレームメモリ１１に記憶された動き推定先画像Ｉ（ｆ_ｂ）内で類似度に基づき探索する。そして、動き推定元画像Ｉ（ｆ_ａ）のブロック領域Ｂと動き推定先画像Ｉ（ｆ_ｂ）の対応領域との対応位置関係を求め、対応位置情報を動き補償残差評価部１４３に出力する。対応位置情報は、例えば、走査部１３から入力されたブロックＢの左上座標（ｓ，ｔ）と対応領域の左上座標（ｕ、ｖ）を組として、（ｓ，ｔ，ｕ，ｖ）と表現することができる。また、対応位置情報は、ブロックＢの左上座標（ｓ，ｔ）と動きベクトル（ｕ−ｓ，ｖ−ｔ）を組として、（ｓ，ｔ，ｕ−ｓ，ｖ−ｔ）と表現することができる。対応領域の左上座標（ｕ、ｖ）は、例えば次式（２）により算出される。 The motion estimation unit 142 uses a motion estimation destination stored in the transmission frame memory 11 as a corresponding region having a pixel value pattern similar to the block region B of the motion estimation original image I (f _a ) cut out by the block cutout unit 141. Search based on the similarity in the image I (f _b ). Then, the corresponding position relationship between the block area B of the motion estimation original image I (f _a ) and the corresponding area of the motion estimation destination image I (f _b ) is obtained, and the corresponding position information is output to the motion compensation residual evaluation unit 143. . The corresponding position information is expressed as (s, t, u, v), for example, with the upper left coordinates (s, t) of the block B input from the scanning unit 13 and the upper left coordinates (u, v) of the corresponding area as a set. can do. The corresponding position information is expressed as (s, t, u-s, vt) with the upper left coordinates (s, t) of the block B and the motion vector (u-s, v-t) as a set. Can do. The upper left coordinates (u, v) of the corresponding area are calculated by the following equation (2), for example.

なお、式（２）は絶対値誤差和（ＳＡＤ；Sum of Absolute Difference）の最小化により対応領域を求めているが、二乗誤差和（ＳＳＤ；Sum of Squared Difference)の最小化や正規化相互相関(ＮＣＣ;Normalized Cross Correlation）の最大化により対応領域を求めてもよい。 Note that Equation (2) finds the corresponding region by minimizing the sum of absolute error (SAD), but minimizing the sum of squared error (SSD) and normalized cross-correlation. The corresponding region may be obtained by maximizing (NCC: Normalized Cross Correlation).

また、ブロック切出部１４１と動き推定部１４２との処理をまとめ、式（１）（２）の演算を次式（３）に示すように一つの演算で実現してもよい。 Further, the processing of the block cutout unit 141 and the motion estimation unit 142 may be integrated, and the calculations of the equations (1) and (2) may be realized by a single calculation as shown in the following equation (3).

動き補償残差評価部１４３は、動き推定部１４２により求められた対応位置情報に基づき動き補償元画像Ｉ（ｆ_ｐ）を間引き画像のフレーム時刻位置まで動き補償した画像と、非伝送フレームメモリに記憶された間引き画像Ｉ（ｆ_ｑ）との残差ｅをブロック領域ごとに算出し、判定部１５に出力する。残差ｅは、例えば次式（４）により見積もることができる。このとき、動き補償元画像Ｉ（ｆ_ｐ）上の画素位置と、間引き画像Ｉ（ｆ_ｑ）上の画素位置は、次式（５）に示すように対応位置関係と各フレームの時刻とに基づく按分により求める。 The motion compensation residual evaluation unit 143 performs motion compensation on the motion compensated original image I (f _p ) based on the corresponding position information obtained by the motion estimation unit 142 up to the frame time position of the thinned image, and the non-transmission frame memory. The residual e with the stored thinned image I (f _q ) is calculated for each block area and output to the determination unit 15. The residual e can be estimated by the following equation (4), for example. At this time, the pixel position on the motion compensation original image I (f _p ) and the pixel position on the thinned image I (f _q ) are in accordance with the corresponding positional relationship and the time of each frame as shown in the following equation (5). Obtained by apportionment based on.

なお、映像フレームＩは画像座標に関して、その各成分が通常は整数値に離散化されている一方で、α，β，γ及びδは整数とは限らない。このため、例えば、映像フレームＩに点拡がり関数（例えば、２次元のSinc関数やガウシアン関数）を畳み込んで小数画素位置の画素値を定義してもよいし、α，β，γ及びδに整数化(例えば、丸め)を施してもよい。このようにして、各残差評価部１４は、それぞれ異なるフレームを参照してフレーム補間したときの残差をブロック領域Ｂごとに見積もる。 Note that the image frame I is generally discretized into integer values with respect to image coordinates, whereas α, β, γ, and δ are not necessarily integers. Therefore, for example, the pixel value at the decimal pixel position may be defined by convolving a point spread function (for example, a two-dimensional Sinc function or a Gaussian function) with the video frame I, or α, β, γ, and δ may be defined. Integer (for example, rounding) may be performed. In this manner, each residual evaluation unit 14 estimates a residual for each block region B when frame interpolation is performed with reference to different frames.

判定部１５は、複数の残差評価部１４により算出された残差ｅに応じて、ブロック領域Ｂごとに、動き補償の有効性、（残差評価部１４に入力される参照フレームの組の有効性)を判定し、有効か否かを示すビットパターン列を出力する。例えば、走査部１３の動き補償元画像Ｉ（ｆ_ｐ）のブロック領域Ｂごとに、各残差評価部１４の残差ｅに閾値処理を行い、残差ｅが閥値θ以下（又は未満）である場合には有効と判定し、それ例外の場合には無効と判定する。例えば、走査部１３が走査のｎサンプル目（ｎは１以上Ｎ以下の整数、Ｎは自然数）に設定したブロック領域をＢ_ｎとおく。ブロック領域Ｂ_ｎに対して残差評価部１４−ｋ（ｋは１以上Ｋ以下の自然数、Ｋは自然数）が出力した残差値をｅ_ｎ ^（ｋ）とおく。判定部１５は、闘値θと残差値をｅ_ｎ ^（ｋ）との大小関係を比較し、比較結果のフラグＦ_ｎ ^（ｋ）を出力する。フラグＦ_ｎ ^（ｋ）は、例えば、次式（６）又は次式（７）により表される。 The determination unit 15 determines the effectiveness of motion compensation for each block region B in accordance with the residual e calculated by the plurality of residual evaluation units 14 (the set of reference frames input to the residual evaluation unit 14). (Validity) is determined, and a bit pattern string indicating whether or not it is valid is output. For example, threshold processing is performed on the residual e of each residual evaluation unit 14 for each block region B of the motion compensation original image I (f _p ) of the scanning unit 13, and the residual e is equal to or less than the threshold value θ (or less). If it is, it is determined to be valid, and if it is an exception, it is determined to be invalid. For example, a block region set by the scanning unit 13 at the nth sample (n is an integer between 1 and N and N is a natural number) is set as _Bn . The residual value output by the residual evaluation unit 14-k (k is a natural number between 1 and K and K is a natural number) for the block region _Bn is set as e _n ^(k) . The determination unit 15 compares the threshold value θ and the residual value with e _n ^(k) and outputs a comparison result flag F _n ^(k) . The flag F _n ^(k) is expressed by, for example, the following formula (6) or the following formula (7).

判定部１５の出力するビットパターン列（Ｐ_ｎ）_{ｎ＝１，２，…，Ｎ}は、例えば次式（８）により定義することができる。Ｐ_ｎは、ブロック領域Ｂ_ｎについて、残差評価部１４−ｋの評価する動き補償が有効である場合にはＬＳＢ側から数えて第ｋビットを１に、有効でない場合には第ｋビットを０に設定したパターンである。 The bit pattern string (P _n ) _{n = 1, 2,..., N} output from the determination unit 15 can be defined by the following equation (8), for example. P _n is the block area B _n , when the motion compensation evaluated by the residual evaluation unit 14-k is valid, the k-th bit is counted from the LSB side, and when it is not valid, the k-th bit is counted. The pattern is set to 0.

図４は、判定部１５の出力するビットパターン列Ｐ_ｎを示す図である。図４では、Ｋ＝８の場合にＰ_ｎを１６進数で示しており、この例ではＰ_０＝ＦＦ，Ｐ_１＝ＦＡ，Ｐ_２＝ＦＣである。 FIG. 4 is a diagram illustrating the bit pattern sequence P _n output from the determination unit 15. In FIG. 4, when K = 8, P _n is represented by a hexadecimal number, and in this example, P ₀ = FF, P ₁ = FA, and P ₂ = FC.

また、判定部１５は、例えば、間引き画像Ｉ（ｆ_ｑ）の画素位置ごとに各動き補償（各残差評価部１４の評価する参照フレームの組による補間法）が有効であるか否かを記録したビットパターン列を出力してもよい。この場合、各画素位置の判定の方法には、例えば、（１）残差の少なかったＬ個（Ｌは自然数）の動き補償を有効とし他の動き補償を無効と判定する方法、（２）残差が閾値θ以下（又は未満）であった動き補償を有効とし他の動き補償を無効と判定する方法、あるいは（３）残差の少なかったＬ個（Ｌは自然数）の動き補償のうち、残差が闘値θ以下（又は未満）であった動き補償を有効とし他の動き補償を無効と判定する方法が考えられる。 Also, the determination unit 15 determines whether each motion compensation (interpolation method using a set of reference frames evaluated by each residual evaluation unit 14) is effective for each pixel position of the thinned image I (f _q ), for example. The recorded bit pattern sequence may be output. In this case, as a method of determining each pixel position, for example, (1) a method of determining that L (L is a natural number) motion compensation with few residuals is valid and determining that other motion compensation is invalid, (2) A method of determining that motion compensation whose residual is less than (or less than) the threshold value θ is valid and invalidating other motion compensation, or (3) Of L motion compensation where L is small (L is a natural number) A method can be considered in which motion compensation whose residual is equal to or less than (or less than) the threshold value θ is valid and other motion compensation is judged invalid.

画素位置（ｘ，ｙ）における判定結果のビットパターン配列をＰ（ｘ，ｙ）とおく。また、ブロック領域Ｂ_ｎに対する対応位置の４つ組を（ｓ_ｎ，ｔ_ｎ，ｕ_ｎ，ｖ_ｎ）とおく。上記（２）の方法による場合、例えば次式（９）で表されるビット演算を漸化的に適用することでビットパターンの配列Ｐ(ｘ，ｙ)を得ることができる。なお、演算子「｜＝」は、Ｃ＋＋言語における演算子「｜＝」と同じであり、左辺値と右辺値のビットごとの論理和をとった結果を、左辺値に戻すビット操作を表す。 The bit pattern array of the determination result at the pixel position (x, y) is set as P (x, y). Also, placing the four pairs of corresponding positions relative to the block area _{B n} and _{_{_{(s n, t n, u}}} n, v n). In the case of the above method (2), the bit pattern array P (x, y) can be obtained by, for example, gradually applying the bit operation represented by the following equation (9). Note that the operator “| =” is the same as the operator “| =” in the C ++ language, and represents a bit operation for returning the result of the logical sum of the left side value and the right side value for each bit to the left side value.

送信装置１は、判定部１５による判定結果を示す判定情報（例えば、ビットパターン列（Ｐ_ｎ）_{ｎ＝１，２，…，Ｎ}やビットパターン配列Ｐ（ｘ，ｙ））を、受信装置に対して伝送する。伝送時に可逆圧縮を行ってもよい。 The transmission apparatus 1 sends determination information (for example, bit pattern sequence (P _n ) _{n = 1, 2,..., N} or bit pattern array P (x, y)) indicating the determination result by the determination unit 15 to the reception apparatus. Transmit. Lossless compression may be performed during transmission.

送信装置１は、図５に示すように、更に伝送模擬部１６を備える構成としてもよい。伝送模擬部１６は、フレームを符号化して伝送する際の符号化による劣化を模擬し、符号化により劣化されたフレームを生成し、伝送フレームメモリ１１に記憶する。例えば、伝送フレームがＭＰＥＧ−４ＡＶＣ／Ｈ．２６４方式により符号化される場合には、該方式と同様に伝送フレームを局所復号（ローカルデコード）し、局所復号したフレーム画像を伝送フレームメモリ１１に記憶する。残差評価部１４はそれぞれ、伝送模擬部１６により生成されたフレームから動き推定元画像Ｉ（ｆ_ａ）、動き推定先画像Ｉ（ｆ_ｂ）、及び動き補償元画像Ｉ（ｆ_ｐ）の組を決定する As shown in FIG. 5, the transmission device 1 may further include a transmission simulation unit 16. The transmission simulation unit 16 simulates deterioration due to encoding when encoding and transmitting a frame, generates a frame deteriorated by encoding, and stores the frame in the transmission frame memory 11. For example, the transmission frame is MPEG-4 AVC / H. In the case of encoding by the H.264 system, the transmission frame is locally decoded (local decoding) similarly to the system, and the locally decoded frame image is stored in the transmission frame memory 11. Each of the residual evaluation units 14 includes a set of a motion estimation original image I (f _a ), a motion estimation destination image I (f _b ), and a motion compensation original image I (f _p ) from the frame generated by the transmission simulation unit 16. Decide

以上説明したように、送信装置１は、時間分割部１０により、入力映像のうち伝送されるフレームである伝送フレームと、伝送されないフレームである間引き画像とを決定し、残差評価部１４により、伝送フレームから動き推定元画像Ｉ（ｆ_ａ）、動き推定先画像Ｉ（ｆ_ｂ）、及び動き補償元画像Ｉ（ｆ_ｐ）の組を複数組決定し、動き推定元画像Ｉ（ｆ_ａ）のブロック領域と画素値パターンの類似する対応領域を動き推定先画像Ｉ（ｆ_ｂ）から決定し、該ブロック領域と該対応領域との対応位置関係に基づいて動き補償元画像Ｉ（ｆ_ｐ）を間引き画像のフレーム時刻位置まで動き補償した画像と、間引き画像Ｉ（ｆ_ｑ）との残差ｅをブロック領域ごとに算出し、判定部１５により、残差値からブロック領域ごとの動き補償の有効性を示す判定情報を生成する。このため、送信装置１によれば、間引かれた映像フレームの原画を参照しつつ、受信装置側で実施すべきフレーム補間の手法を決定することができるため、送信装置１で入力映像の一部のフレームを間引き、受信装置側で間引かれたフレームを高精度に補間することが可能となる。その結果、全てのフレームを伝送する場合よりも、レート歪み特性（Ｒ−Ｄ特性）の良い伝送が可能となる。 As described above, in the transmission device 1, the time division unit 10 determines a transmission frame that is a frame transmitted from the input video and a thinned image that is a frame that is not transmitted, and the residual evaluation unit 14 A plurality of sets of a motion estimation original image I (f _a ), a motion estimation destination image I (f _b ), and a motion compensation original image I (f _p ) are determined from the transmission frame, and the motion estimation original image I (f _a ) Corresponding region having a pixel value pattern similar to that of the block region is determined from the motion estimation target image I (f _b ), and the motion compensation original image I (f _p ) is based on the corresponding positional relationship between the block region and the corresponding region. and image motion compensation to the frame time and position of the thinned image, the residual e in the thinned image I (f _q) is calculated for each block area, the determination unit 15, from the residual of the motion compensation for each block area Effectiveness It generates determination information indicating. Therefore, according to the transmission apparatus 1, it is possible to determine a frame interpolation method to be performed on the reception apparatus side while referring to the original image of the thinned video frame. It is possible to thin out some frames and to interpolate frames thinned out on the receiving apparatus side with high accuracy. As a result, transmission with better rate distortion characteristics (RD characteristics) is possible than when all frames are transmitted.

また、送信装置１は伝送模擬部１６を更に備えることにより、受信装置で実施すべきフレーム補間の手法を決定する際に、符号化による劣化をも考慮した決定が可能となるため、レート歪み特性を更に改善することが可能となる。 In addition, since the transmission apparatus 1 further includes the transmission simulation unit 16, when determining a frame interpolation method to be performed by the reception apparatus, it is possible to make a determination in consideration of deterioration due to encoding. Can be further improved.

なお、上述した送信装置１として機能させるためにコンピュータを好適に用いることができ、そのようなコンピュータは、送信装置１の各機能を実現する処理内容を記述したプログラムを、当該コンピュータの記憶部に格納しておき、当該コンピュータのＣＰＵによってこのプログラムを読み出して実行させることで実現することができる。 Note that a computer can be suitably used to cause the transmission device 1 to function as described above, and such a computer stores a program describing processing contents for realizing each function of the transmission device 1 in a storage unit of the computer. This can be realized by storing the program and executing it by the CPU of the computer.

［受信装置］
図６は、本発明の一実施形態に係る受信装置の構成例を示すブロック図である。図６に示すように、受信装置２は、フレームメモリ２１と、Ｋ個の補間部２２（２２−１，２２−２，・・・，２２−Ｋ）と、重畳部２３と、時間多重部２４と、を備える。補間部２２の個数Ｋは、送信装置１の残差評価部１４個数Ｋと同一である。受信装置２は、上述した送信装置１により間引かれた間引き画像を補間生成する。 [Receiver]
FIG. 6 is a block diagram illustrating a configuration example of a receiving apparatus according to an embodiment of the present invention. As illustrated in FIG. 6, the reception device 2 includes a frame memory 21, K interpolation units 22 (22-1, 22-2,..., 22-K), a superimposition unit 23, and a time multiplexing unit. 24. The number K of interpolation units 22 is the same as the number K of residual evaluation units 14 of the transmission apparatus 1. The receiving device 2 interpolates and generates the thinned image thinned out by the transmitting device 1 described above.

フレームメモリ２１は、送信装置１から伝送された映像フレームＩ_Ｒを複数時点記憶する。なお、映像フレームＩ_Ｒ（ｆ）は、伝送フレームの伝送手段が無劣化の場合には入力映像のフレームＩ（ｆ）と同一であるが、伝送フレームの伝送手段が非可逆圧縮を行う場合には、符号化による劣化を受けている。 Frame memory 21, a plurality time points stores the video frame I _R transmitted from the transmitter 1. Note that the video frame I _R (f) is the same as the frame I (f) of the input video when the transmission means of the transmission frame is not deteriorated, but when the transmission means of the transmission frame performs irreversible compression. Has undergone degradation due to encoding.

図７は、補間部２２の構成例を示すブロック図である。図７に示すように、補間部２２は、ブロック切出部２２１と、動き推定部２２２と、動き補償部２２３と、を備える。補間部２２は、ｋ番目の補間部２２−ｋは、ｋ番目の残差評価部１４−ｋと同じフレーム番号の画像（動き推定元画像、動き推定先画像、及び動き補償元画像）を参照するものとする。また、補間部２２−ｋは、残差評価部１４−ｋについてのブロック位置情報及び判定情報を取得する。 FIG. 7 is a block diagram illustrating a configuration example of the interpolation unit 22. As illustrated in FIG. 7, the interpolation unit 22 includes a block cutout unit 221, a motion estimation unit 222, and a motion compensation unit 223. The interpolation unit 22 refers to an image (motion estimation source image, motion estimation destination image, and motion compensation source image) having the same frame number as that of the kth residual evaluation unit 14-k. It shall be. In addition, the interpolation unit 22-k acquires block position information and determination information for the residual evaluation unit 14-k.

図８は、補間部２２の動作の概要を説明するための図である。動き推定元画像とは、動き推定部２２２にて、動き推定の対象元となる画像フレームである。動き推定先画像とは、動き推定部２２２にて、動き推定の対象先となる画像フレームである。動き補償元画像とは、動き補償の対象となる画像フレームである。補間画像とは、送信装置１から伝送されず、受信装置２で補間生成する画像フレームである。 FIG. 8 is a diagram for explaining an outline of the operation of the interpolation unit 22. The motion estimation original image is an image frame that is a motion estimation target source in the motion estimation unit 222. The motion estimation destination image is an image frame that is a target of motion estimation in the motion estimation unit 222. The motion compensation original image is an image frame that is a target of motion compensation. An interpolated image is an image frame that is not transmitted from the transmission device 1 and is generated by interpolation by the reception device 2.

動き推定元画像のフレーム番号ｆ_ａと動き推定先画像のフレーム番号ｆ_ｂとは一致してはならないが、動き推定元画像のフレーム番号ｆ_ａと動き補償元画像のフレーム番号ｆ_ｐとは一致してもよく、また、動き推定先画像のフレーム番号ｆ_ｂと動き補償元画像のフレーム番号ｆ_ｐとが一致してもよい。なお、図８において、Ｉ_Ｒ（ｆ_ａ ^（1））などの上付きの（１）は、補間部２２−１による処理を意味し、同様に上付きの（２）は、補間部２２−２による処理を意味する。 Although the frame number f _b of the frame number of the motion estimation original image f _a and the motion estimation target image must match one the frame number f _p of the motion estimation original image frame number f _a and the motion compensated original image may match, also it has a frame number f _p of the frame number f _b and the motion compensation based on image motion estimation target image may coincide. In FIG. 8, superscript ⁽¹⁾ such as I _R (f _a ⁽¹⁾ ) means processing by the interpolation unit 22-1, and superscript (2) is the interpolation unit 22-. 2 means processing.

補間部２２は、フレームメモリ２１に記憶された伝送フレームから、動き推定元画像Ｉ_Ｒ（ｆ_ａ）、動き推定先画像Ｉ_Ｒ（ｆ_ｂ）、及び動き補償元画像Ｉ_Ｒ（ｆ_ｐ）の組を決定する参照フレーム決定手段を有する。これらの参照フレームの組は、補間部２２ごとに異なり、補間部２２−ｋの参照フレームのフレーム番号は、残差評価部１４ｋの参照フレームのフレーム番号と同一である。補間部２２は、送信装置１から判定情報を取得し、該判定情報により動き補償が有効であると通知されたブロックについてのみ動き補償を行う。 The interpolation unit 22 extracts the motion estimation original image I _R (f _a ), the motion estimation destination image I _R (f _b ), and the motion compensation original image I _R (f _p ) from the transmission frame stored in the frame memory 21. Reference frame determining means for determining a set is provided. These sets of reference frames differ for each interpolation unit 22, and the frame number of the reference frame of the interpolation unit 22-k is the same as the frame number of the reference frame of the residual evaluation unit 14k. The interpolation unit 22 acquires the determination information from the transmission device 1 and performs motion compensation only for the block that is notified by the determination information that motion compensation is effective.

ブロック切出部２２１は、フレームメモリ２１に記憶された動き推定元画像Ｉ_Ｒ（ｆ_ａ）から、送信装置１から伝送されたブロック位置情報により指定される位置のブロック領域Ｂ_Ｒを切り出し、動き推定部２２２に出力する。ただし、ブロック切出部２２１は、送信装置１から伝送された判定情報において動き補償が有効であると判定されたブロックについてのみブロック切り出しを行う。水平Ｗ画素、垂直Ｈ画素のブロック領域Ｂ_Ｒの画像座標（ｉ，ｊ）における画素値をＢ_Ｒ（ｉ，ｊ）とすると、ブロック切出部２２１は、送信装置１から入力されたブロック領域Ｂの左上座標（ｓ，ｔ）に基づき、例えば次式（１０）のようにブロック領域Ｂ_Ｒを切り出す。 Block cutting unit 221, the motion estimation original image stored in the frame memory 21 I _{R (f} _a), cut block area B _R in the position specified by the block position information transmitted from the transmitting apparatus 1, the motion It outputs to the estimation part 222. However, the block cutout unit 221 performs block cutout only for a block for which motion compensation is determined to be effective in the determination information transmitted from the transmission device 1. Horizontal W pixels, the image coordinates (i, j) of the block area _{B R} of a vertical H pixels _B pixel value in R (i, j) When the block cutout section 221, block area that is input from the transmitter 1 based on the upper left coordinates (s, t) B, for example, cutting out a block area _{B R} by the following equation (10).

動き推定部２２２は、ブロック切出部２２１により切り出された動き推定元画像Ｉ_Ｒ（ｆ_ａ）のブロック領域Ｂ_Ｒと画素値パターンの類似する対応領域を動き推定先画像Ｉ_Ｒ（ｆ_ｂ）から探索し、動き推定元画像Ｉ_Ｒ（ｆ_ａ）のブロック領域Ｂ_Ｒと動き推定先画像Ｉ_Ｒ（ｆ_ｂ）の対応領域との対応位置関係を求め、対応位置情報を動き補償部２２３に出力する。対応位置情報は、例えば、送信装置１から入力されたブロック領域Ｂの左上座標（ｓ，ｔ）と対応領域の左上座標（ｕ_Ｒ、ｖ_Ｒ）を組として、（ｓ，ｔ，ｕ_Ｒ，ｖ_Ｒ）と表現することができる。また、対応位置情報は、ブロック領域Ｂの左上座標（ｓ，ｔ）と動きベクトル（ｕ_Ｒ−ｓ，ｖ_Ｒ−ｔ）を組として、（ｓ，ｔ，ｕ_Ｒ−ｓ，ｖ_Ｒ−ｔ）と表現することができる。対応領域の左上座標（ｕ_Ｒ、ｖ_Ｒ）は、例えば次式（１１）で求められる。 Motion estimation unit 222, the motion estimation based on image extracted by the block cutout unit 221 _I R _{(f a)} of the block area _{B R} and the pixel value pattern similar corresponding region motion estimation target image _I R of _(f b) search from, and we obtain the corresponding positional relationship between the corresponding region of the motion estimation original image I _{R (f} _a) of the block area B _R and the motion estimation target image I _{R (f} _b), the corresponding position information to the motion compensation unit 223 Output. Corresponding position information includes, for example, (s, t, u _R , t), a set of the upper left coordinates (s, t) of the block area B input from the transmitter 1 and the upper left coordinates (u _R , v _R ) of the corresponding area. v _R ). Moreover, the corresponding position information is the upper-left coordinates (s, t) block area B and the motion vector _{_{(u R -s, v R -t}} ) as _{set, (s, t, u R} -s, v R -t ). The upper left coordinates (u _R , v _R ) of the corresponding region are obtained by the following equation (11), for example.

なお、式（１１）は絶対値誤差和（ＳＡＤ；Sum of Absolute Difference）の最小化により対応領域を求めているが、二乗誤差和（ＳＳＤ；Sum of Squared Difference)の最小化や正規化相互相関(ＮＣＣ;Normalized Cross Correlation）の最大化により対応領域を求めてもよい。 The equation (11) obtains the corresponding region by minimizing the absolute value error sum (SAD), but the minimization of the square error sum (SSD) and the normalized cross-correlation. The corresponding region may be obtained by maximizing (NCC: Normalized Cross Correlation).

また、ブロック切出部２２１と動き推定部２２２との処理をまとめ、式（１０）（１１）の演算を次式（１２）に示すように一つの演算で実現してもよい。 Further, the processes of the block cutout unit 221 and the motion estimation unit 222 may be combined, and the calculations of the equations (10) and (11) may be realized by a single calculation as shown in the following equation (12).

動き補償部２２３は、動き推定部２２２により求められた対応位置情報に基づき、動き推定元画像Ｉ_Ｒ（ｆ_ａ）に対応する動き補償元画像Ｉ_Ｒ（ｆ_ｐ）のブロック領域である部分画像Ｄを、例えば次式（１３）により生成する。 The motion compensation unit 223 is a partial image that is a block region of the motion compensation original image I _R (f _p ) corresponding to the motion estimation original image I _R (f _a ) based on the corresponding position information obtained by the motion estimation unit 222. D is generated by the following equation (13), for example.

なお、映像フレームＩ_Ｒは、画像座標に関して、その各成分が通常は整数値に離散化されている一方で、α_Ｒ及びβ_Ｒは整数とは限らない。このため、例えば、映像フレームＩ_Ｒに点拡がり関数（例えば、２次元のＳｉｎｃ関数やガウシアン関数）を畳み込んで小数画素位置の画素値を定義してもよいし、α_Ｒ及びβ_Ｒに整数化（例えば、丸め）を施してもよい。 Note that the image frame I _R is normally discretized into integer values with respect to image coordinates, whereas α _R and β _R are not necessarily integers. Thus, for example, a point in the image frame I _R spread function (e.g., Sinc function or Gaussian function of two-dimensional) may be defined to pixel values of the sub-pel position by convoluting the integer to alpha _R and beta _R (For example, rounding) may be applied.

また、動き補償部２２３は、動き推定部２２２により求められた対応位置情報に基づき、補償元画像Ｉ_Ｒ（ｆ_ｐ）の部分画像Ｄの重畳先座標（γ_Ｒ，δ_Ｒ）を、例えば次式（１４）により算出する。 In addition, the motion compensation unit 223 calculates the superimposition destination coordinates (γ _R , δ _R ) of the partial image D of the compensation source image I _R (f _p ) based on the corresponding position information obtained by the motion estimation unit 222, for example: It calculates by Formula (14).

また、動き補償部２２３は、重畳処理をする際の部分画像Ｄの重みＷも算出するのが好適である。重みＷは、動き推定元画像のフレーム時刻位置ｆ_ａ、動き推定先画像のフレーム時刻位置ｆ_ｂ、動き補償元画像のフレーム時刻位置ｆ_ｐ、及び補間画像のフレーム時刻位置ｆ_ｑの時間の間隔に応じて算出することができる。例えば、動き推定のフレーム間隔や動き補償のフレーム間隔が広がるほど動き推定や動き補償の信頼度が低下するという仮定の下、次式（１５）により重みＷを算出することができる。 In addition, it is preferable that the motion compensation unit 223 also calculates the weight W of the partial image D when performing the superimposition process. Weight W is a frame time and position f _a motion estimation original _image, the frame time and position f _b of the motion estimation target _image, the motion frame time position f _p of the compensating an original _image, and the time interval of the frame time and position f _q of the interpolated image It can be calculated according to For example, the weight W can be calculated by the following equation (15) under the assumption that the reliability of motion estimation or motion compensation decreases as the frame interval for motion estimation or the frame interval for motion compensation increases.

また、部分領域Ｄごとに重み値Ｗを変化させる方法として、動き推定部２２２で算出される動き推定元画像Ｉ_Ｒ（ｆ_ａ）のブロック領域Ｂ_Ｒと動き推定先画像Ｉ_Ｒ（ｆ_ｂ）の対応領域との類似度の最適値（最小値又は最大値）に応じ、類似度が高い（誤差が小さい、又は相関が高い）ほど重みＷを大きくするようにしてもよい。 Further, as a method for changing the weight values W for each partial area D, motion estimation original image _I R to be calculated by the motion estimation unit 222 _{(f a)} of the block area _{B R} and the motion estimation target image _I R _(f b) Depending on the optimum value (minimum value or maximum value) of the degree of similarity with the corresponding region, the weight W may be increased as the degree of similarity is higher (the error is smaller or the correlation is higher).

重畳部２３は、補間部２２により生成された部分画像Ｄを重みＷを用いて重畳先座標（γ_Ｒ，δ_Ｒ）に重畳することにより補間画像Ｊ（ｆ_ｑ）を生成する。ｎ番目（ｎはＮ以下の自然数、Ｎは自然数）のブロック領域Ｂ_Ｒｎに対し、ｋ番目（ｋはＫ以下の自然数、Ｋは自然数）の補間部２２によって式（１３）〜（１５）で求められた部分画像をＤ_ｎ ^（ｋ）、重みをＷ_ｎ ^（ｋ）、重畳先座標を（γ_Ｒｎ ^（ｋ），δ_Ｒｎ ^（ｋ））とおくと、重畳部２３は、次式（１６）により補間画像Ｊ（ｆ_ｑ）を生成する。 The superimposing unit 23 generates the interpolated image J (f _q ) by superimposing the partial image D generated by the interpolating unit 22 on the superimposition destination coordinates (γ _R , δ _R ) using the weight W. For the n-th (n is a natural number equal to or less than N, N is a natural number) block region _BRn , the k-th (k is a natural number equal to or less than K, K is a natural number) interpolation unit 22 performs Expressions (13) to (15). Assuming that the obtained partial image is D _n ^(k) , the weight is W _n ^(k) , and the superimposition destination coordinates are (γ _Rn ^(k) , δ _Rn ^(k) ), the superimposing unit 23 calculates the following equation (16 ) To generate an interpolated image J (f _q ).

様々なｋ，ｎに対して部分画像Ｄを重畳先座標（γ_Ｒｎ ^（ｋ），δ_Ｒｎ ^（ｋ））へ重ね合わせるが、重畳先座標の集合によっては重ね合わせが生じず、その場合には式（１６）の分母がゼロとなる。そこで、そのような場合も想定して、次式（１７）のように、分母がゼロとなる画像座標（ｘ，ｙ）に関しては、既知の手法で生成された補間画像Ｍ（ｘ，ｙ）の画素値としてもよい。例えば、Ｍ（ｘ，ｙ）はフレームメモリ２１に記憶されたいずれかのフレームの画素値であってもよいし、いずれか複数のフレーム画素値から導出される値（例えば平均値や、重み付き平均値）であってもよい。 Although the partial image D is superimposed on the superimposition destination coordinates (γ _Rn ^(k) , δ _Rn ^(k) ) for various k and n, superposition does not occur depending on the set of superimposition destination coordinates. The denominator of equation (16) becomes zero. Therefore, assuming such a case, the interpolated image M (x, y) generated by a known method is used for the image coordinates (x, y) where the denominator is zero as in the following equation (17). It is good also as a pixel value. For example, M (x, y) may be a pixel value of any frame stored in the frame memory 21, or a value derived from any of a plurality of frame pixel values (for example, an average value or a weighted value) Average value).

時間多重部２４は、送信装置１から伝送されフレームメモリ２１に蓄積されている映像フレームと、重畳部２３で生成された補間画像Ｊ（ｆ_ｑ）とを時間多重することで、出力用の映像を構成する。例えば、入力映像のうち偶数フレームのみ送信装置１から伝送され、奇数フレームは重畳部２３により補間生成された場合には、時間多重部２４において、これらのフレームがフレーム番号順に交互に時間多重し、入力映像と等しいフレームレートの映像を出力する。 The time multiplexing unit 24 time-multiplexes the video frame transmitted from the transmission device 1 and stored in the frame memory 21 and the interpolated image J (f _q ) generated by the superimposing unit 23, thereby outputting an output video. Configure. For example, in the case where only even frames of the input video are transmitted from the transmission apparatus 1 and the odd frames are generated by interpolation by the superimposing unit 23, these frames are alternately time-multiplexed in the frame number order in the time multiplexing unit 24, Outputs video with the same frame rate as the input video.

以上説明したように、受信装置２は、補間部２２により、送信装置１から伝送されるフレームから動き推定元画像Ｉ_Ｒ（ｆ_ａ）、動き推定先画像Ｉ_Ｒ（ｆ_ｂ）、及び動き補償元画像Ｉ_Ｒ（ｆ_ｐ）の組を複数組決定し、送信装置１から取得した判定情報により動き補償が有効であると通知されたブロックについて、動き推定元画像Ｉ_Ｒ（ｆ_ａ）のブロック領域と画素値パターンの類似する対応領域を動き推定先画像Ｉ_Ｒ（ｆ_ｂ）から決定し、該ブロック領域と該対応領域との対応位置関係に基づいて動き補償元画像Ｉ_Ｒ（ｆ_ｐ）の重畳先座標を求める。そして、重畳部２３により、複数組の動き補償元画像をそれぞれ重畳先座標に重畳して補間画像Ｊ（ｆ_ｑ）を生成し、時間多重部２４により、送信装置１から伝送されたフレームと補間画像Ｊ（ｆ_ｑ）とを時間多重する。このため、受信装置２によれば、送信装置１から伝送されないフレームを高精度に補間することが可能となる。その結果、全てのフレームを伝送する場合よりも、レート歪み特性（Ｒ−Ｄ特性）の良い伝送が可能となる。 As described above, the reception device 2 uses the interpolation unit 22 to estimate the motion estimation source image I _R (f _a ), the motion estimation destination image I _R (f _b ), and the motion compensation from the frame transmitted from the transmission device 1. A block of the motion estimation original image I _R (f _a ) is determined for a block in which a plurality of sets of the original image I _R (f _p ) are determined and motion compensation is notified by the determination information acquired from the transmission device 1. A corresponding region having a similar pixel value pattern to the region is determined from the motion estimation destination image I _R (f _b ), and the motion compensation source image I _R (f _p ) is based on the corresponding positional relationship between the block region and the corresponding region. Find the superimposition coordinates of. Then, the superimposing unit 23 superimposes a plurality of sets of motion compensation original images on the superimposing destination coordinates to generate an interpolated image J (f _q ), and the time multiplexing unit 24 interpolates the frame transmitted from the transmission device 1 and the interpolation. The image J (f _q ) is time-multiplexed. For this reason, according to the receiving device 2, it is possible to interpolate a frame that is not transmitted from the transmitting device 1 with high accuracy. As a result, transmission with better rate distortion characteristics (RD characteristics) is possible than when all frames are transmitted.

ここで、重畳部２３は、動き推定元画像Ｉ_Ｒ（ｆ_ａ）、動き推定先画像Ｉ_Ｒ（ｆ_ｂ）、動き補償元画像Ｉ_Ｒ（ｆ_ｐ）、及び補間画像Ｊ（ｆ_ｑ）の時刻位置や、動き推定元画像Ｉ_Ｒ（ｆ_ａ）のブロック領域と動き推定先画像Ｉ_Ｒ（ｆ_ｂ）の対応領域との類似度に応じて、動き補償画像を重み付けして重畳するのが好適である。これにより、補間画像の算出に用いる画像と補間画像との時間間隔や動き推定の類似度に応じた信頼度の高低を重み付けにより反映することができるため、領域ごとに信頼性の高い画像重畳が可能となる。 Here, the superimposing unit 23 performs the motion estimation original image I _R (f _a ), the motion estimation destination image I _R (f _b ), the motion compensation original image I _R (f _p ), and the interpolated image J (f _q ). The motion compensated image is weighted and superimposed according to the time position and the similarity between the block region of the motion estimation original image I _R (f _a ) and the corresponding region of the motion estimation destination image I _R (f _b ). Is preferred. As a result, the level of reliability according to the time interval between the image used for calculation of the interpolated image and the interpolated image and the similarity of motion estimation can be reflected by weighting. It becomes possible.

なお、上述した受信装置２として機能させるためにコンピュータを好適に用いることができ、そのようなコンピュータは、受信装置２の各機能を実現する処理内容を記述したプログラムを、当該コンピュータの記憶部に格納しておき、当該コンピュータのＣＰＵによってこのプログラムを読み出して実行させることで実現することができる。 In addition, a computer can be suitably used to cause the above-described function to function as the receiving device 2, and such a computer stores a program describing processing contents for realizing each function of the receiving device 2 in a storage unit of the computer. This can be realized by storing the program and executing it by the CPU of the computer.

上述の実施形態は、代表的な例として説明したが、本発明の趣旨及び範囲内で、多くの変更及び置換ができることは当業者に明らかである。したがって、本発明は、上述の実施形態によって制限するものと解するべきではなく、特許請求の範囲から逸脱することなく、種々の変形や変更が可能である。例えば、実施形態に記載の構成ブロックは複数を１つに組み合わせたり、あるいは分割したりすることが可能である。 Although the above embodiments have been described as representative examples, it will be apparent to those skilled in the art that many changes and substitutions can be made within the spirit and scope of the invention. Therefore, the present invention should not be construed as being limited by the above-described embodiments, and various modifications and changes can be made without departing from the scope of the claims. For example, a plurality of constituent blocks described in the embodiments can be combined into one or divided.

このように、本発明によれば、送信装置側で入力映像の一部のフレームを間引き、受信装置側で間引かれたフレームを高精度に補間することができるので、映像を送受信する任意の用途に有用である。 As described above, according to the present invention, it is possible to thin out some frames of the input video on the transmitting device side and to interpolate the thinned frames on the receiving device side with high accuracy. Useful for applications.

１送信装置
２受信装置
１０時間分割部
１１伝送フレームメモリ
１２非伝送フレームメモリ
１３走査部
１４残差評価部
１５判定部
１６伝送模擬部
２１フレームメモリ
２２補間部
２３重畳部
２４時間多重部
１４１ブロック切出部
１４２動き推定部
１４３動き補償残差評価部
２２１ブロック切出部
２２２動き推定部
２２３動き補償部 DESCRIPTION OF SYMBOLS 1 Transmission apparatus 2 Reception apparatus 10 Time division part 11 Transmission frame memory 12 Non-transmission frame memory 13 Scan part 14 Residual evaluation part 15 Judgment part 16 Transmission simulation part 21 Frame memory 22 Interpolation part 23 Superposition part 24 Time multiplexing part 141 Block cutting Output unit 142 Motion estimation unit 143 Motion compensation residual evaluation unit 221 Block cutout unit 222 Motion estimation unit 223 Motion compensation unit

Claims

A transmission device that thins out and transmits a part of a frame of an input video,
A time division unit for determining a transmission frame that is a frame to be transmitted among input video and a thinned image that is a frame that is not transmitted;
Reference frame determining means for determining a plurality of sets of a motion estimation original image, a motion estimation destination image, and a motion compensation original image from the transmission frame;
For each of the plurality of sets determined by the reference frame determining means, a corresponding region having a pixel value pattern similar to the block region of the motion estimation original image is determined from the motion estimation destination image, and the block of the motion estimation original image A motion estimation unit for obtaining a corresponding positional relationship between a region and a corresponding region of the motion estimation destination image;
For each of the plurality of sets determined by the reference frame determination unit, an image obtained by performing motion compensation on the motion compensation source image to the frame time position of the thinned image based on the corresponding positional relationship, and a residual between the thinned image A motion compensation residual evaluation unit for calculating for each block region;
A determination unit that generates determination information indicating the effectiveness of motion compensation for each block region from the value of the residual and transmits the determination information to the reception side ;
A transmission device comprising:

Simulating deterioration due to encoding when encoding and transmitting a frame, further comprising a transmission simulation unit that generates a frame deteriorated by encoding,
The reference frame determination unit determines a plurality of sets of a motion estimation source image, a motion estimation destination image, and a motion compensation source image from the frame generated by the transmission simulation unit. Transmitter.

A receiving device for interpolating and generating a thinned image thinned out by the transmitting device according to claim 1,
Reference frame determining means for determining a plurality of sets of a motion estimation original image, a motion estimation destination image, and a motion compensation original image from a frame transmitted from the transmission device;
For the block for which the determination information is acquired from the transmission apparatus and motion compensation is notified by the determination information, a corresponding region having a pixel value pattern similar to the block region of the motion estimation source image is determined as the motion estimation destination. A motion estimation unit that determines a corresponding positional relationship between a block region of the motion estimation original image and a corresponding region of the motion estimation destination image, determined from an image;
A motion compensation unit for obtaining a superimposition destination coordinate of the motion compensation original image based on the corresponding positional relationship;
A superimposing unit that generates an interpolated image by superimposing the plurality of sets of motion compensation original images on the superimposition destination coordinates, respectively;
A time multiplexing unit that time-multiplexes the frame transmitted from the transmission device and the interpolation image generated by the superimposition unit;
A receiving apparatus comprising:

The superimposing unit weights and superimposes the motion compensation image on the superimposition destination coordinates according to the time position of the motion estimation original image, the motion estimation target image, the motion compensation original image, and the interpolation image, and performs interpolation. The receiving apparatus according to claim 3, wherein an image is generated.

The superimposing unit weights the motion compensation image for each block region according to the similarity between the block region of the motion estimation original image and the corresponding region of the motion estimation destination image, and superimposes the motion compensation image on the superimposition destination coordinates. The receiving apparatus according to claim 3, wherein an image is generated.

A program for causing a computer to function as the transmission device according to claim 1.

The program for functioning a computer as a receiver as described in any one of Claims 3-5.