JP4931884B2

JP4931884B2 - Frame rate conversion apparatus, frame rate conversion method, and frame rate conversion program

Info

Publication number: JP4931884B2
Application number: JP2008227626A
Authority: JP
Inventors: 和男寅市; 徳安武; ガンバジョナ; 康宏大宮
Original assignee: Japan Science and Technology Agency; National Institute of Japan Science and Technology Agency
Current assignee: Japan Science and Technology Agency; National Institute of Japan Science and Technology Agency
Priority date: 2008-09-04
Filing date: 2008-09-04
Publication date: 2012-05-16
Anticipated expiration: 2028-09-04
Also published as: JP2010062953A

Description

The present invention relates to a frame rate conversion apparatus, a frame rate conversion method, and a frame rate conversion program for converting the frame rate of video to an arbitrary frame rate.

Conventionally, one of the techniques used in digital video production converts video shot on film, or video signals recorded at an equivalent number of frames, to various frame rates. This technique is known from Patent Document 1 and elsewhere. In particular, when progressive video at 24 frames/second is converted to progressive video at 60 frames/second and recorded, a conversion scheme called 2:3 pull-down is generally used (see, for example, Patent Document 1).
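As an illustration of the 2:3 pull-down mentioned above, the following minimal sketch repeats source frames in an alternating 2, 3, 2, 3, ... pattern so that 24 progressive frames become 60. The function name `pulldown_2_3` and the integer frame labels are assumptions made for this example and do not come from Patent Document 1.

```python
def pulldown_2_3(frames):
    """Repeat source frames in an alternating 2, 3, 2, 3, ... pattern."""
    out = []
    for i, frame in enumerate(frames):
        repeats = 2 if i % 2 == 0 else 3   # even-indexed frames twice, odd-indexed three times
        out.extend([frame] * repeats)
    return out


one_second = list(range(24))       # 24 source frames, labelled 0..23
converted = pulldown_2_3(one_second)
print(len(converted))              # 60
print(converted[:10])              # [0, 0, 1, 1, 1, 2, 2, 3, 3, 3]
```

Because frames are only repeated, never synthesized, each source frame is displayed for an uneven duration, which is one reason motion can look uneven after this kind of conversion.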

In recent years, to improve moving-image performance, frame rate conversion has also been performed in which a plurality of frames contained in a video signal are combined with interpolated frames generated inside the apparatus using motion vectors of the input video signal, producing a signal with a new frame sequence (see, for example, Patent Document 2).

In recent years, with advances in digital signal technology, fields that handle video (moving images), still images, or audio, such as communications, broadcasting, recording media [CD (Compact Disc), DVD (Digital Versatile Disc)], medical imaging, and printing, have developed remarkably as the multimedia industry or IT (Information Technology). Compression coding, which reduces the amount of information, is one pillar of digital signal technology for video, images, and audio. Its underlying signal theory is typically Shannon's sampling theorem and, more recently, wavelet transform theory. Music CDs, for example, use linear PCM (Pulse Code Modulation) without compression, but the underlying signal theory is likewise Shannon's sampling theorem.

MPEG is known as a compression technique for moving images such as video and animation. With the adoption of MPEG-2 in digital broadcasting and DVD, and of MPEG-4 in fields such as Internet streaming for third-generation mobile phones and mobile communications, digital compression of video signals has become very familiar in recent years. Behind this lie larger-capacity storage media, faster networks, higher-performance processors, and larger-scale, lower-cost system LSIs. The environment supporting video application systems that require digital compression is thus steadily falling into place.

MPEG-2 (ISO (International Organization for Standardization) / IEC (International Electrotechnical Commission) 13818-2) is defined as a general-purpose image coding scheme. It is defined to support both interlaced and progressive scanning, and both standard-resolution and high-definition images. MPEG-2 is currently used in a wide range of professional and consumer applications. With MPEG-2, for example, interlaced standard-resolution image data of 720 × 480 pixels can be compressed to a bit rate of 4 to 8 Mbps, and interlaced high-resolution image data of 1920 × 1088 pixels can be compressed to a bit rate of 18 to 22 Mbps, so that a high compression ratio is achieved with high image quality.

In general, moving-image coding compresses the amount of information by reducing redundancy in the temporal and spatial directions. In inter-picture predictive coding, which aims to reduce temporal redundancy, motion is detected and a predicted image is created block by block with reference to a preceding or following picture, and the difference between the resulting predicted image and the picture to be coded is encoded. Here, a picture is a term for a single screen: it means a frame in a progressive image and a frame or a field in an interlaced image. An interlaced image is an image in which one frame consists of two fields captured at different times. In coding and decoding an interlaced image, one frame can be processed as a frame, as two fields, or with a frame structure or field structure selected for each block within the frame.

Patent Document 1: JP 2003-284007 A
Patent Document 2: JP 2008-167103 A

A conventional A-D/D-A conversion system based on Shannon's sampling theorem handles signals band-limited by the Nyquist frequency. In D-A conversion, a function that reproduces the signal within the limited band (a regular function) has been used to reconstruct a continuous waveform from the signal made discrete by sampling.

One of the inventors of the present application has found that the various properties of signals such as video (moving images), images such as characters, figures, and natural pictures, and audio can be classified using fluency functions. According to this theory, the regular function based on Shannon's sampling theorem is one of the fluency functions and fits only one of the various properties a signal may have. Therefore, handling signals with various properties using only the regular function based on Shannon's sampling theorem may limit the quality of the signal reproduced after D-A conversion.

Wavelet transform theory, which is one of the above fluency function spaces, represents a signal using a mother wavelet that decomposes the target by resolution. However, the mother wavelet that is optimal for a given signal is not necessarily available, so this too may limit the quality of the signal reproduced after D-A conversion.

Here, a fluency function is a function classified by a parameter m (m is a positive integer from 1 to ∞). The parameter m indicates that the function is continuously differentiable only (m−2) times. Since the regular function above is differentiable any number of times, its m is ∞. Furthermore, a fluency function is composed of functions of degree (m−1). In particular, the fluency DA function, one of the fluency functions, takes a given value at the k-th sample point kτ of interest, where τ is the sampling interval, and is 0 at all other sample points.
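The sample-point property of a DA function described above (a given value at kτ, 0 at every other sample point) can be checked numerically. The sketch below is only illustrative: it assumes that the m = ∞ case is the familiar sinc function and that the m = 2 case is the piecewise-linear "hat" function (degree m−1 = 1, continuous but not differentiable at the knots). The names `sinc_sampling`, `hat_sampling`, and `reconstruct` are hypothetical, and this section of the document does not give the explicit form of the fluency DA functions.

```python
import math


def sinc_sampling(t, tau=1.0):
    """Assumed m = infinity case: the band-limited (Shannon) sampling function."""
    x = t / tau
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)


def hat_sampling(t, tau=1.0):
    """Assumed m = 2 case: piecewise linear, 1 at t = 0 and 0 at every other sample point."""
    return max(0.0, 1.0 - abs(t) / tau)


def reconstruct(samples, t, kernel, tau=1.0):
    """Continuous-time value at t as a sum of kernels weighted by the sample values."""
    return sum(s * kernel(t - k * tau, tau) for k, s in enumerate(samples))


samples = [3.0, 5.0, 2.0]
print(reconstruct(samples, 1.0, hat_sampling))   # 5.0: the sample itself is reproduced
print(reconstruct(samples, 0.5, hat_sampling))   # 4.0: midway between 3.0 and 5.0
```

Both kernels reproduce the samples at the sample points; they differ only in how they fill in the values between them, which is where the choice of m matters.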

All signal properties can be classified by fluency functions having the parameter m, and are divided into classes by that parameter. Fluency information theory, which uses fluency functions, therefore subsumes Shannon's sampling theorem, wavelet transform theory, and similar theories, each of which represents only part of the properties of a signal, and is positioned as a theoretical framework that represents the signal as a whole. By using such functions, it is expected that a high-quality reproduced signal, free of the band limitation imposed by Shannon's sampling theorem, can be obtained over the entire signal after D-A conversion.

Meanwhile, there has been demand for converting the 24 frames/second of film to the 30 frames/second of video, for raising the frame rate of TV video to obtain higher definition, and for converting to the frame rates used by mobile phones. The mainstream methods generate new frames by frame decimation or by internal-division interpolation between the preceding and following frames.

However, the conventional methods of generating new frames by frame decimation or by internal-division interpolation between the preceding and following frames have problems such as motion that is not smooth and video that is not linear.
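The internal-division interpolation referred to above can be sketched as a pixel-wise weighted average of two adjacent frames. The helper `blend_frames` and the one-row test frames are assumptions for illustration only. The example shows why such blending does not reproduce motion: a bright point that moves from the left to the right appears as two half-bright points rather than as one point at an intermediate position.

```python
def blend_frames(prev, nxt, alpha):
    """Pixel-wise internal division: (1 - alpha) * prev + alpha * nxt."""
    return [
        [(1 - alpha) * p + alpha * n for p, n in zip(row_prev, row_next)]
        for row_prev, row_next in zip(prev, nxt)
    ]


frame_a = [[100, 0, 0]]    # bright point at the left
frame_b = [[0, 0, 100]]    # the same point has moved to the right
print(blend_frames(frame_a, frame_b, 0.5))   # [[50.0, 0.0, 50.0]]
```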

Accordingly, in view of the conventional problems described above, an object of the present invention is to provide a frame rate conversion apparatus, a frame rate conversion method, and a frame rate conversion program that allow video to be reproduced with clear, smooth motion even when the number of frames is increased or decreased.

Further objects of the present invention and the specific advantages obtained by it will become clearer from the description of the embodiments below.

In video, similar scenes generally continue across successive frames. The invention exploits this characteristic, using information from a plurality of frames to raise the frame rate and improve image quality. Local corresponding points between a plurality of frames are estimated, and the corresponding image points are interpolated to construct high-quality interpolated frames.

In the present invention, corresponding points in the video are tracked between frames, their transition over time is expressed as a function, and function-interpolated frames are generated according to the ratio between the number of original frames and the number of frames after conversion. A video signal with clear, smooth motion is thereby obtained even when the number of frames is increased or decreased.

That is, the present invention is a frame rate conversion apparatus comprising: a corresponding point estimation processing unit that, for a plurality of pixels in a reference frame, expresses the gray value at each pixel point as a continuous function of position and estimates, as a corresponding point, the position that gives the maximum correlation of the function-approximated image gray levels in each of a plurality of image frames at different times; a first gradation value generation processing unit that, for the gray level at each estimated corresponding position in each image frame, obtains the gradation value of the corresponding position from the gradation values indicating the gray levels of neighboring pixel points; a second gradation value generation processing unit that, for an interpolated frame generated between the image frames at the frame rate ratio of the conversion, and for the plurality of pixels in the reference frame, approximates the gray level along the corresponding point trajectory with a fluency function from the gradation values of the estimated corresponding positions in the image frames, and obtains from that function each gradation value of the corresponding positions in the interpolated frame; and a third gradation value generation processing unit that generates, from the gradation values of the corresponding positions in the interpolated frame, the gradation value of each pixel near the corresponding positions in the interpolated frame.

The present invention is also a frame rate conversion method comprising: a corresponding point estimation processing step of, for a plurality of pixels in a reference frame, expressing the gray value at each pixel point as a continuous function of position and estimating, as a corresponding point, the position that gives the maximum correlation of the function-approximated image gray levels in each of a plurality of image frames at different times; a first gradation value generation processing step of, for the gray level at each estimated corresponding position in each image frame, obtaining the gradation value of the corresponding position from the gradation values indicating the gray levels of neighboring pixel points; a second gradation value generation processing step of, for an interpolated frame generated between the image frames at the frame rate ratio of the conversion, and for the plurality of pixels in the reference frame, approximating the gray level along the corresponding point trajectory with a fluency function from the gradation values of the estimated corresponding positions in the image frames, and obtaining from that function each gradation value of the corresponding positions in the interpolated frame; and a third gradation value generation processing step of generating, from the gradation values of the corresponding positions in the interpolated frame, the gradation value of each pixel near the corresponding positions in the interpolated frame.

Furthermore, the present invention is a frame rate conversion program executed by a computer provided in a frame rate conversion apparatus, the program causing the computer to function as: a corresponding point estimation processing unit that, for a plurality of pixels in a reference frame, expresses the gray value at each pixel point as a continuous function of position and estimates, as a corresponding point, the position that gives the maximum correlation of the function-approximated image gray levels in each of a plurality of image frames at different times; a first gradation value generation processing unit that, for the gray level at each estimated corresponding position in each image frame, obtains the gradation value of the corresponding position from the gradation values indicating the gray levels of neighboring pixel points; a second gradation value generation processing unit that, for an interpolated frame generated between the image frames at the frame rate ratio of the conversion, and for the plurality of pixels in the reference frame, approximates the gray level along the corresponding point trajectory with a fluency function from the gradation values of the estimated corresponding positions in the image frames, and obtains from that function each gradation value of the corresponding positions in the interpolated frame; and a third gradation value generation processing unit that generates, from the gradation values of the corresponding positions in the interpolated frame, the gradation value of each pixel near the corresponding positions in the interpolated frame.

The present invention is a frame rate conversion apparatus comprising: first function approximating means that, for a plurality of pixels in a reference frame, approximates their gray-level distribution by a continuous function of pixel position; corresponding point estimating means that performs a correlation operation between the function approximated by the first function approximating means and the functions of the gray-level distribution in a plurality of image frames at different times, and takes the positions that give the maximum value as the corresponding point positions in the plurality of image frames; second function approximating means that expresses the corresponding point position in each image frame estimated by the corresponding point estimating means as coordinates given by the horizontal and vertical distances from the origin of each image frame, converts the changes in the horizontal position and in the vertical position of the coordinate point across the plurality of image frames at different times into time-series signals, and approximates the time-series signals by functions; and third function approximating means that, using the functions approximated by the second function approximating means, for an interpolated frame at an arbitrary time between the plurality of image frames, takes as the corresponding point position the position in the interpolated frame that corresponds to the corresponding point positions of the image frames, obtains the gray value at the corresponding point position of the interpolated frame by interpolation from the gray values at the corresponding points of the image frames, applies the first function approximation fitted to the gray value of the corresponding point of the interpolated frame to obtain the gray-level distribution near the corresponding point, and converts the gray values near the corresponding point into the gray values of pixel points in the interpolated frame.

The present invention is also a frame rate conversion method comprising: a first function approximating step of, for a plurality of pixels in a reference frame, approximating their gray-level distribution by a continuous function of pixel position; a corresponding point estimating step of performing a correlation operation between the function approximated in the first function approximating step and the functions of the gray-level distribution in a plurality of image frames at different times, and taking the positions that give the maximum value as the corresponding point positions in the plurality of image frames; a second function approximating step of expressing the corresponding point position in each image frame estimated in the corresponding point estimating step as coordinates given by the horizontal and vertical distances from the origin of each image frame, converting the changes in the horizontal position and in the vertical position of the coordinate point across the plurality of image frames at different times into time-series signals, and approximating the time-series signals by functions; and a third function approximating step of, using the functions approximated in the second function approximating step, for an interpolated frame at an arbitrary time between the plurality of image frames, taking as the corresponding point position the position in the interpolated frame that corresponds to the corresponding point positions of the image frames, obtaining the gray value at the corresponding point position of the interpolated frame by interpolation from the gray values at the corresponding points of the image frames, applying the first function approximation fitted to the gray value of the corresponding point of the interpolated frame to obtain the gray-level distribution near the corresponding point, and converting the gray values near the corresponding point into the gray values of pixel points in the interpolated frame.

Furthermore, the present invention is a frame rate conversion program executed by a computer provided in a frame rate conversion apparatus, the program causing the computer to function as: first function approximating means that, for a plurality of pixels in a reference frame, approximates their gray-level distribution by a continuous function of pixel position; corresponding point estimating means that performs a correlation operation between the function approximated by the first function approximating means and the functions of the gray-level distribution in a plurality of image frames at different times, and takes the positions that give the maximum value as the corresponding point positions in the plurality of image frames; second function approximating means that expresses the corresponding point position in each image frame estimated by the corresponding point estimating means as coordinates given by the horizontal and vertical distances from the origin of each image frame, converts the changes in the horizontal position and in the vertical position of the coordinate point across the plurality of image frames at different times into time-series signals, and approximates the time-series signals by functions; and third function approximating means that, using the functions approximated by the second function approximating means, for an interpolated frame at an arbitrary time between the plurality of image frames, takes as the corresponding point position the position in the interpolated frame that corresponds to the corresponding point positions of the image frames, obtains the gray value at the corresponding point position of the interpolated frame by interpolation from the gray values at the corresponding points of the image frames, applies the first function approximation fitted to the gray value of the corresponding point of the interpolated frame to obtain the gray-level distribution near the corresponding point, and converts the gray values near the corresponding point into the gray values of pixel points in the interpolated frame.

In the present invention, corresponding points in the video are tracked between frames, their transition over time is expressed as a function, and function-interpolated frames are generated according to the ratio between the number of original frames and the number of frames after conversion. A video signal with clear, smooth motion can thereby be obtained even when the number of frames is increased or decreased.

Therefore, according to the present invention, video with clear, smooth motion can be displayed at a frame rate suited to the display device.

Embodiments of the present invention are described in detail below with reference to the drawings. Needless to say, the present invention is not limited to the following examples and can be modified as desired without departing from the gist of the invention.

A frame rate conversion apparatus 1 according to the present invention is configured, for example, as shown in FIG. 1.

This frame rate conversion apparatus 1 performs frame rate up-conversion: by inserting interpolated frames between the original frames, as shown for example in FIGS. 2(A) and 2(B), it converts the low-frame-rate moving image of FIG. 2(A) (30 frames/second in this example) into the high-frame-rate moving image of FIG. 2(B) (60 frames/second in this example). The apparatus consists of a computer that functions as a corresponding point estimation processing unit 2, a first gradation value generation processing unit 3, a second gradation value generation processing unit 4, and a third gradation value generation processing unit 5.
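To make the 30 to 60 frames/second example concrete, the following sketch lists the time positions of the output frames in units of the source frame interval. Non-integer positions are the ones that need an interpolated frame. The function name `output_frame_times` and the use of exact fractions are choices made for this illustration; the document itself only states that interpolated frames are generated at the frame rate ratio of the conversion.

```python
from fractions import Fraction


def output_frame_times(src_fps, dst_fps, n_src_frames):
    """Times of the output frames, in units of the source frame interval."""
    step = Fraction(src_fps, dst_fps)      # e.g. 30 -> 60 gives a step of 1/2
    times = []
    t = Fraction(0)
    while t <= n_src_frames - 1:           # stay within the span of the source frames
        times.append(t)
        t += step
    return times


times = output_frame_times(30, 60, 3)
print([str(t) for t in times])                                 # ['0', '1/2', '1', '3/2', '2']
print([str(t) for t in times if t.denominator != 1])           # interpolated frames: ['1/2', '3/2']
```

The same function also covers non-integer ratios such as 24 to 30 frames/second, where the step is 4/5 and most output frames fall between source frames.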

In this frame rate conversion apparatus 1, the corresponding point estimation processing unit 2 estimates, for a plurality of pixels in a reference frame, the corresponding points in a plurality of image frames at different times.

The first gradation value generation processing unit 3 obtains, for each corresponding point in each image frame estimated by the corresponding point estimation processing unit 2, its gradation value from the gradation values indicating the gray levels of the neighboring pixels.

The second gradation value generation processing unit 4, for the plurality of pixels in the reference frame, approximates the gray level along the corresponding point trajectory with a fluency function from the gradation values of the estimated corresponding points in the image frames, and obtains from that function each gradation value of the corresponding points in the interpolated frame.

The third gradation value generation processing unit 5 generates the gradation value of each pixel in the interpolated frame from the gradation values of the corresponding points in the interpolated frame.

This frame rate conversion apparatus 1 executes, on the computer, a video signal conversion program read from a storage unit (not shown), and thereby carries out frame rate up-conversion according to steps S1 to S4 shown in the flowchart of FIG. 3: it performs corresponding point estimation, uses the gradation values of the estimated corresponding points to generate the gradation values of the corresponding points of the interpolated frame by uniform interpolation, and then generates the gradation values of the corresponding points of the interpolated frame by non-uniform interpolation.

That is, in this frame rate conversion apparatus 1, first, as shown in FIG. 4(A), the image frame at time t = k is taken as the reference frame F(k). For a plurality of pixels Pn(k) in the reference frame F(k), motion vectors are obtained for the image frame F(k+1) at time t = k+1, the image frame F(k+2) at time t = k+2, ..., and the image frame F(k+m) at time t = k+m, and corresponding point estimation processing is performed to estimate the corresponding points Pn(k+1), Pn(k+2), ..., Pn(k+m) in the image frames F(k+1), F(k+2), ..., F(k+m) (step S1).
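Step S1 can be pictured with a simplified stand-in. The sketch below searches a small window in a later frame for the patch that has the highest normalized cross-correlation with the patch around a reference pixel. This is an assumption-laden illustration: it works at integer pixel positions on plain nested lists, whereas the apparatus described here correlates function-approximated gray levels, which allows positions between pixels. The names `patch`, `ncc`, and `estimate_corresponding_point` are hypothetical.

```python
def patch(img, cy, cx, r):
    """Flattened (2r+1) x (2r+1) neighborhood centred on (cy, cx)."""
    return [img[y][x] for y in range(cy - r, cy + r + 1) for x in range(cx - r, cx + r + 1)]


def ncc(a, b):
    """Normalized cross-correlation of two equally sized patches (0.0 if either is flat)."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    num = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    den = (sum((p - ma) ** 2 for p in a) * sum((q - mb) ** 2 for q in b)) ** 0.5
    return num / den if den else 0.0


def estimate_corresponding_point(ref, tgt, y, x, r=1, search=2):
    """Position in tgt, within +/- search pixels of (y, x), that best matches ref around (y, x)."""
    template = patch(ref, y, x, r)
    h, w = len(tgt), len(tgt[0])
    best_score, best_pos = -2.0, (y, x)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            cy, cx = y + dy, x + dx
            if r <= cy < h - r and r <= cx < w - r:      # keep the patch inside the frame
                score = ncc(template, patch(tgt, cy, cx, r))
                if score > best_score:
                    best_score, best_pos = score, (cy, cx)
    return best_pos


ref = [[0] * 8 for _ in range(8)]
tgt = [[0] * 8 for _ in range(8)]
ref[3][3], ref[3][4], ref[4][3] = 9, 5, 3          # small asymmetric pattern in F(k)
tgt[4][4], tgt[4][5], tgt[5][4] = 9, 5, 3          # same pattern moved by (+1, +1) in F(k+1)
print(estimate_corresponding_point(ref, tgt, 3, 3))   # (4, 4)
```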

Next, for each of the corresponding points Pn(k+1), Pn(k+2), ..., Pn(k+m) in the image frames F(k+1), F(k+2), ..., F(k+m) estimated in step S1, first gradation value generation processing is performed to obtain its gradation value from the gradation values indicating the gray levels of the neighboring pixels, as shown in FIG. 4(B) (step S2).
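A corresponding point generally falls between pixels, so its gradation value in step S2 has to be computed from the surrounding pixels. The sketch below uses bilinear interpolation of the four nearest pixels as one simple way to do this. The choice of bilinear weights and the name `tone_at` are assumptions; this section does not state which uniform interpolation the apparatus actually uses.

```python
def tone_at(img, y, x):
    """Gradation value at a non-integer position (y, x), with 0 <= y, x inside the frame."""
    y0, x0 = int(y), int(x)
    y1 = min(y0 + 1, len(img) - 1)          # clamp at the bottom edge
    x1 = min(x0 + 1, len(img[0]) - 1)       # clamp at the right edge
    fy, fx = y - y0, x - x0
    top = img[y0][x0] * (1 - fx) + img[y0][x1] * fx
    bottom = img[y1][x0] * (1 - fx) + img[y1][x1] * fx
    return top * (1 - fy) + bottom * fy


img = [[0, 10],
       [20, 30]]
print(tone_at(img, 0.5, 0.5))   # 15.0, the mean of the four neighbors
```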

Next, as shown in FIG. 4(C), second gradation value generation processing is performed for the plurality of pixels Pn(k) in the reference frame F(k): the gradation values at the corresponding points Pn(k+1), Pn(k+2), ..., Pn(k+m) generated in step S2, that is, the shading along the corresponding-point trajectory through the image frames F(k+1), F(k+2), ..., F(k+m), are approximated by a fluency function, and the gradation values of the corresponding points in the interpolation frames between those image frames are obtained from that function (step S3).

In the following step S4, as shown in FIG. 4(D), third gradation value generation processing is performed: from the gradation values of the corresponding points in the interpolation frame F(k+1/2) generated by the second gradation value generation processing of step S3, the gradation value of each pixel in the interpolation frame F(k+1/2) at time t = k+1/2 is generated by non-uniform interpolation (step S4).

Here, in a moving image consisting of multiple frames, the on-frame position of a moving partial image differs from frame to frame. Moreover, a pixel point on one frame does not necessarily move onto a pixel point of another frame; it usually falls between pixels. In other words, when a natural image is regarded as continuous information, the two frames carry pixel information sampled at different positions. In particular, when a new frame image is generated by interpolation between frames, the pixel positions of the original frames and those of the new frame differ almost everywhere. For example, when the two frame images shown in FIGS. 5(A) and 5(B) are superimposed at the same point, the pixel points of the two frames (drawn coarsely here for explanation) are related as shown in FIG. 5(C); that is, they are offset by the amount the image has moved. Obtaining the gray values at the lattice points of the first frame (the unmarked pixel points) from these two frame images therefore requires non-uniform interpolation.

For example, as shown in FIG. 6, the image interpolation that determines the value at the position of a pixel u(τ_x, τ_y) newly generated when the image resolution is converted is performed by convolving the original pixels u(x_i, y_j) with an interpolation function h(x).

Then, the same partial image is matched across a plurality of frame images. Using a uniform interpolation function such as that shown in FIG. 7(A), interpolation information is obtained for each frame by uniform interpolation from the pixel information in the horizontal (vertical) direction near the desired corresponding point. Taking this information, for example the interpolated pixel values × of frame 1 and frame 2 shown in FIG. 8, as pixel information in the vertical (horizontal) direction, non-uniform interpolation is performed according to the amount of shift between the frames using a non-uniform interpolation function such as that shown in FIG. 7(B), and the pixel information at the desired position ○ in frame 1 is determined as shown in FIG. 8.

In this way, the corresponding points of the video are tracked between frames, their transition over time is expressed as a function, and function-interpolated frames are generated at the ratio of the original frame count to the converted frame count. A sharp, smoothly moving video signal is thus obtained whether the number of frames is increased or decreased, and sharp, smoothly moving video can be displayed at a frame rate that suits the display.
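As a rough illustration of generating interpolation frames at the ratio of the original to the converted frame count, the sketch below lists where each output frame falls on the source time axis. The function name `output_frame_times`, the use of exact fractions, and the 24-to-60 example are illustrative assumptions, not part of the specification.

```python
from fractions import Fraction

def output_frame_times(src_fps, dst_fps, n_out):
    """Return the time of each output frame, in units of one source-frame interval."""
    step = Fraction(src_fps, dst_fps)  # source frames advanced per output frame
    return [i * step for i in range(n_out)]

# Hypothetical example: 24 frames/s converted to 60 frames/s.
times = output_frame_times(24, 60, 6)
for t in times:
    k = t.numerator // t.denominator   # index of the preceding source frame
    offset = t - k                     # fractional position between F(k) and F(k+1)
    if offset == 0:
        print(f"output at t = {t}: reuse source frame F({k})")
    else:
        print(f"output at t = {t}: interpolate between F({k}) and F({k + 1}) at offset {offset}")
```

Each non-zero offset is a time at which the function fitted along a corresponding-point trajectory would be evaluated to build an interpolation frame.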

In conventional high-frame-rate processing, an interpolation frame F(k+1/2) is generated by uniform interpolation, and the gradation values of the corresponding points are generated at 1/2 precision by block matching using motion information obtained by 1/2-precision motion estimation; as a result, moving parts of the inserted interpolation frame are degraded. In contrast, with the high-frame-rate processing of the frame rate conversion apparatus 1, in which corresponding points are estimated, the gradation values of the corresponding points of the interpolation frame are generated by uniform interpolation from the gradation values of the estimated corresponding points, and gradation values are further generated by non-uniform interpolation, the frame rate could be raised without degrading the moving parts.

Here, in addition to performing the high-frame-rate processing described above, the frame rate conversion apparatus 1 may have a function of performing enlargement interpolation using two frame images. This function is realized, for example, by an enlargement interpolation processing device 50 that, as shown in FIG. 9, comprises an input data control circuit 51, an output synchronization signal generation circuit 52, an SRAM 53, an SRAM selection unit 54, and an image processing module 55.

In the enlargement interpolation processing device 50, the input data control circuit 51 controls the sequential input to the SRAM selection unit 54 of the input image, that is, the image information of each pixel, which is supplied together with a horizontal synchronization signal and a vertical synchronization signal.

The output synchronization signal generation circuit 52 generates an output-side synchronization signal from the supplied horizontal and vertical synchronization signals, outputs it, and also supplies it to the SRAM selection unit 54.

The SRAM selection unit 54 is configured, for example, as shown in FIG. 10. A write data selection unit 54B and a read data selection unit 54C operate according to a memory selection signal that a control signal switching circuit 54A supplies on the basis of the write control signal and read control signal generated from the supplied synchronization signal. With these, the input image received via the input data control circuit 51 is stored in the SRAM 53 one frame at a time, and at the same time two frames of images are read out in synchronization with the output-side synchronization signal generated by the output synchronization signal generation circuit 52.

The image processing module 55, which performs image interpolation based on inter-frame information, is configured, for example, as shown in FIG. 11.

That is, the image processing module 55 comprises: a window setting unit 55A, a first uniform interpolation processing unit 55B, and a second uniform interpolation processing unit 55C, each of which receives the two frames of image information read simultaneously from the SRAM 53 via the SRAM selection unit 54; a shift amount estimation unit 55D, which receives the pixel information extracted from the two frames of image information by the window setting unit 55A; a shift correction unit 55E, which receives the shift amount vector estimated by the shift amount estimation unit 55D and the pixel information interpolated by the second uniform interpolation processing unit 55C; and a non-uniform interpolation processing unit 55F, which receives the pixel information corrected by the shift correction unit 55E and the pixel information interpolated by the first uniform interpolation processing unit 55B.

In the image processing module 55, as shown in FIGS. 12(A) and 12(B), the window setting unit 55A sets a window at a predetermined point (p, q) in each of the two frame images f and g input via the SRAM selection unit 54. The shift amount estimation unit 55D shifts the window of one frame image, g, by a shift amount (τx, τy), computes the inner product of the pixel values at each relative position (x, y) within the windows, and takes the result as the cross-correlation value Rpq(τx, τy).

The shift amount (τx, τy) is then varied, and the shift amount (τx, τy) that maximizes the cross-correlation value Rpq(τx, τy) around the point (p, q) is extracted.

The cross-correlation value Rpq(τx, τy) can also be obtained by Fourier-transforming the in-window pixel data of each of the two frame images f and g.
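The window-based search described above can be sketched as follows. This is a minimal illustration assuming NumPy, an exhaustive integer search range, and a zero-mean synthetic test image; the function name `estimate_shift` and its parameters `half` and `max_shift` are hypothetical. It uses the plain inner product named in the text, which on real images is often normalized so that bright regions do not dominate.

```python
import numpy as np

def estimate_shift(f, g, p, q, half, max_shift):
    """Return the integer shift (tx, ty) that maximizes the window inner product R_pq(tx, ty)."""
    # Window of frame f centered on the point (p, q); rows index y, columns index x.
    win_f = f[q - half:q + half + 1, p - half:p + half + 1]
    best_value, best_shift = -np.inf, (0, 0)
    for ty in range(-max_shift, max_shift + 1):
        for tx in range(-max_shift, max_shift + 1):
            # Window of frame g displaced by the candidate shift.
            win_g = g[q + ty - half:q + ty + half + 1,
                      p + tx - half:p + tx + half + 1]
            r = float(np.sum(win_f * win_g))  # inner product over relative positions (x, y)
            if r > best_value:
                best_value, best_shift = r, (tx, ty)
    return best_shift

# Synthetic check: g is f moved 3 pixels right and 2 pixels down.
rng = np.random.default_rng(0)
f = rng.standard_normal((64, 64))
g = np.roll(f, shift=(2, 3), axis=(0, 1))  # g[y, x] = f[y - 2, x - 3]
shift = estimate_shift(f, g, p=32, q=32, half=8, max_shift=4)
print(shift)
```

The Fourier-transform route mentioned in the text computes the same correlation surface for all shifts at once and would replace the double loop.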

The enlargement interpolation processing device 50 performs enlargement interpolation according to the procedure shown in the flowchart of FIG. 13.

That is, in the image processing module 55, when the two frame images f and g have been read from the SRAM 53 via the SRAM selection unit 54 (step A), the shift amount estimation unit 55D computes the shift amount (τx, τy) between the two frame images f and g by correlation calculation (step B).

Next, the first uniform interpolation processing unit 55B computes interpolated pixel values for the image f of frame 1 by uniform interpolation, enlarging it in the horizontal or vertical direction (step C).

Likewise, the second uniform interpolation processing unit 55C computes interpolated pixel values for the image g of frame 2 by uniform interpolation, enlarging it in the horizontal or vertical direction (step D).

Further, the shift correction unit 55E computes the pixel values at the pixel positions obtained by moving the enlarged image of frame 2 by its shift amount relative to frame 1 (step E).

Then, the non-uniform interpolation processing unit 55F performs the enlargement calculation in the vertical or horizontal direction by non-uniform interpolation, obtaining the pixel value at the desired position in frame 1 from four pixel values in total: two interpolated pixel values of frame 1 and two pixel values at the shifted positions of frame 2 (step F). The interpolation result for frame 1 is output as the enlarged image (step G).
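The four-sample step can be pictured with the sketch below. The non-uniform interpolation function of FIG. 7(B) is not reproduced in this text, so the sketch swaps in ordinary Lagrange polynomial interpolation over unequally spaced samples purely as a stand-in; the function name `lagrange_nonuniform`, the shift of 0.3, and the sample values are hypothetical.

```python
def lagrange_nonuniform(xs, ys, x):
    """Evaluate at x the polynomial that passes through the points (xs[i], ys[i])."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        weight = 1.0
        for j, xj in enumerate(xs):
            if j != i:
                weight *= (x - xj) / (xi - xj)  # Lagrange basis factor
        total += yi * weight
    return total

# Two samples from frame 1 sit on the grid at 0 and 1; the two samples from
# frame 2 are displaced by the estimated shift, so the spacing is unequal.
shift = 0.3
xs = [0.0, 1.0, 0.0 + shift, 1.0 + shift]
ys = [2.0 * x + 1.0 for x in xs]            # samples of a linear ramp
value = lagrange_nonuniform(xs, ys, 0.5)    # desired position in frame 1
print(value)
```

Because the test signal is a linear ramp, any consistent interpolator should return a value close to 2.0 at position 0.5, which gives a quick sanity check of the sample layout.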

A frame rate conversion apparatus 110 having such an enlargement interpolation function is configured, for example, as shown in FIG. 14.

The frame rate conversion apparatus 110 consists of a computer that functions as a first function approximation processing unit 111, a corresponding point estimation processing unit 112, a second function approximation processing unit 113, and a third function approximation processing unit 114.

The first function approximation processing unit 111 performs first function approximation processing, which approximates the gray-level distribution of a plurality of pixels in a reference frame by a function.

The corresponding point estimation processing unit 112 performs corresponding point estimation processing: it computes the correlation between the gray-level distribution functions, approximated by the first function approximation processing unit 111, of a plurality of reference frames at different times, and takes the positions that give the maximum correlation as the mutually corresponding point positions in those reference frames.

The second function approximation processing unit 113 performs second function approximation processing: it expresses the corresponding point position in each reference frame, estimated by the corresponding point estimation processing unit 112, as coordinates given by the horizontal and vertical distances from the origin of the reference frame, converts the changes in the horizontal and vertical positions of these coordinate points across the reference frames at different times into time-series signals, and approximates the time-series signals by functions.

The third function approximation processing unit 114 performs third function approximation processing. Using the functions approximated by the second function approximation processing unit 113, for an interpolation frame at an arbitrary time between the reference frames, it takes the position in the interpolation frame that corresponds to the corresponding point positions of the reference frames as the corresponding point position, obtains the gray value there by interpolating the gray values at the corresponding points of the reference frames, applies the first function approximation to match the gray value of the corresponding point of the interpolation frame so as to obtain the gray-level distribution near the corresponding point, and converts the gray values near the corresponding point into the gray values of the pixel points of the interpolation frame.
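The second and third approximation steps can be sketched as below: the horizontal and vertical positions of one corresponding point are treated as time series, a function is fitted to each, and the fit is evaluated at the time of an interpolation frame. The fluency function used in the specification is not given in this passage, so a low-degree polynomial fitted with NumPy's `polyfit` stands in for it; the trajectory data, the helper name `fit_and_eval`, and the linear interpolation of the gray value are all illustrative assumptions.

```python
import numpy as np

def fit_and_eval(t, v, t_new, degree=2):
    """Fit a polynomial of the given degree to the series (t, v) and evaluate it at t_new."""
    coeffs = np.polyfit(t, v, degree)
    return float(np.polyval(coeffs, t_new))

# Hypothetical track of one corresponding point over frames F(k) ... F(k+3).
times = np.array([0.0, 1.0, 2.0, 3.0])
xs = np.array([10.0, 12.0, 16.0, 22.0])          # horizontal distance from the frame origin
ys = np.array([5.0, 5.5, 6.0, 6.5])              # vertical distance from the frame origin
grays = np.array([100.0, 104.0, 108.0, 112.0])   # gray value at the corresponding point

t_mid = 0.5                                      # interpolation frame F(k + 1/2)
x_mid = fit_and_eval(times, xs, t_mid)
y_mid = fit_and_eval(times, ys, t_mid)
gray_mid = float(np.interp(t_mid, times, grays)) # gray value interpolated between reference frames
print(x_mid, y_mid, gray_mid)
```

The resulting (x_mid, y_mid) generally falls between pixel positions of the interpolation frame, which is why the gray values near the corresponding point still have to be converted to values at the pixel points.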

In the frame rate conversion apparatus 110, the first function approximation processing unit 111 approximates the gray-level distribution of a plurality of pixels in each reference frame by a function. The corresponding point estimation processing unit 112 computes the correlation between the gray-level distribution functions, approximated by the first function approximation processing unit 111, of a plurality of reference frames at different times, and takes the positions that give the maximum correlation as the mutually corresponding point positions in those reference frames. The second function approximation processing unit 113 expresses the corresponding point position in each reference frame, estimated by the corresponding point estimation processing unit 112, as coordinates given by the horizontal and vertical distances from the origin of the reference frame, converts the changes in the horizontal and vertical positions of these coordinate points across the reference frames at different times into time-series signals, and approximates the time-series signals by functions.
Then the third function approximation processing unit 114, using the functions approximated by the second function approximation processing unit 113, takes, for an interpolation frame at an arbitrary time between the reference frames, the position in the interpolation frame that corresponds to the corresponding point positions of the reference frames as the corresponding point position, obtains the gray value there by interpolating the gray values at the corresponding points of the reference frames, applies the first function approximation to match the gray value of the corresponding point of the interpolation frame so as to obtain the gray-level distribution near the corresponding point, and converts the gray values near the corresponding point into the gray values of the pixel points of the interpolation frame. High-frame-rate processing is thereby performed together with enlargement interpolation.

The present invention is applied, for example, to a video signal conversion system 100 configured as shown in FIG. 15, in which the frame rate conversion apparatus 1 is mounted as a high frame rate processing unit 40.

The video signal conversion system 100 comprises, among other components, a preprocessing unit 20 that applies noise removal to image information input from an image input unit 10 such as an imaging device, a compression encoding processing unit 30 that receives the image information denoised by the preprocessing unit 20 and compression-encodes it, and a high frame rate processing unit 40 that raises the frame rate of the image information compression-encoded by the compression encoding processing unit 30.

The preprocessing unit 20 of the video signal conversion system 100 performs filtering that removes noise such as blur and camera shake contained in the input image information by means of an image tensor operation technique and an adaptive blur function correction technique. In the system model shown in FIG. 16, the true input image f(x, y) is input to a degradation model 21 with blur function H(x, y), and noise n(x, y) is added to the output of the degradation model 21 to give the observed image g(x, y). In the restoration system model shown in FIG. 17, the preprocessing unit 20 consists of an inverse filter 22 that takes the observed image g(x, y) as input and yields an estimated image.

The preprocessing unit 20 performs this filtering using the image tensor operation technique and the adaptive blur function correction technique, and evaluates the original image by exploiting the properties of the Kronecker product.

The Kronecker product is defined as follows.

When A = [a_ij] is an m × n matrix and B = [b_ij] is an s × t matrix, the Kronecker product A ⊗ B is the following ms × nt matrix:

$$A \otimes B = \begin{bmatrix} a_{11}B & \cdots & a_{1n}B \\ \vdots & \ddots & \vdots \\ a_{m1}B & \cdots & a_{mn}B \end{bmatrix}$$

Here, ⊗ denotes the Kronecker product operator.

The basic properties of the Kronecker product are as follows.

Here, vec(·) is the operator that stacks the columns of a matrix, one below another, to form a single column vector.
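To make the definition concrete, the sketch below builds a Kronecker product with NumPy's `kron` and checks its size and block structure, and shows the column-stacking operation as a column-major reshape. The matrices and the helper name `vec` are illustrative assumptions.

```python
import numpy as np

A = np.array([[1, 2],
              [3, 4],
              [5, 6]])            # m x n = 3 x 2
B = np.array([[0, 1, 2, 3],
              [4, 5, 6, 7]])      # s x t = 2 x 4

K = np.kron(A, B)                 # Kronecker product, size ms x nt
m, n = A.shape
s, t = B.shape

# Block (i, j) of K is the scalar a_ij times the whole matrix B.
i, j = 1, 1
block = K[i * s:(i + 1) * s, j * t:(j + 1) * t]

def vec(M):
    """Stack the columns of M into one vector (column-major order)."""
    return M.reshape(-1, order="F")

print(K.shape)
print(vec(A))
```

Here `K.shape` is (6, 8), matching ms × nt for these sizes, and `vec(A)` lists the first column of A followed by the second.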

In the image model of the preprocessing unit 20, an unknown true input image f(x, y) is assumed to exist, and the observed image g(x, y), obtained by adding noise n(x, y) to the output of the degradation model 21, is expressed by equation (1).

Here, the output of the degradation model 21 represents the degraded image obtained by this imaging system, and n(x, y) is the added noise. The degraded image is given by equation (2).

In equation (2), h(x, y; x', y') represents the impulse response of the degradation system.

Since the images used are discrete quantities, the image model of the input image f(x, y) can be rewritten as in equation (3).

Here, H_k^(x) H_l^(y), when expressed in matrix form as in equation (4), becomes the point spread function (PSF) H of the degradation model.
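One way to picture a discrete degradation model built from a horizontal factor and a vertical factor is the separable-blur sketch below. It assumes, purely for illustration, that the blur acts independently along rows and columns, so that the full operator is a Kronecker product of two small matrices; the kernel weights, image size, noise level, and the helper names `blur_matrix` and `vec` are hypothetical and are not taken from equations (3) and (4).

```python
import numpy as np

def blur_matrix(n, kernel):
    """n x n banded matrix that applies a 1-D kernel with zero padding at the borders."""
    H = np.zeros((n, n))
    r = len(kernel) // 2
    for i in range(n):
        for k, w in enumerate(kernel):
            j = i + k - r
            if 0 <= j < n:
                H[i, j] = w
    return H

def vec(M):
    """Stack the columns of M into one vector."""
    return M.reshape(-1, order="F")

rng = np.random.default_rng(0)
f = rng.random((5, 4))                          # stand-in for the true image (5 rows, 4 columns)
noise = 0.01 * rng.standard_normal(f.shape)     # additive noise n

Hy = blur_matrix(5, [0.25, 0.5, 0.25])          # vertical blur, acts on columns of f
Hx = blur_matrix(4, [0.25, 0.5, 0.25])          # horizontal blur, acts on rows of f

g = Hy @ f @ Hx.T + noise                       # observed image: blurred image plus noise

# The same model in vector form, with the blur written as one Kronecker product.
H = np.kron(Hx, Hy)                             # 20 x 20 operator
g_vec = H @ vec(f) + vec(noise)
print(np.allclose(vec(g), g_vec))
```

Working with the two small factors instead of the full 20 × 20 operator is what makes decompositions such as the SVD inexpensive in this kind of model.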

The characteristics of the inverse filter 22 are determined by a learning process that follows the procedure shown in the flowchart of FIG. 18.

That is, in the learning process, the observed image g(x, y) is first read as the input image g (step S11a), an image g_E is constructed (step S12a), and a singular value decomposition (SVD) is performed (step S13a).

In addition, the point spread function (PSF) H of the degradation model is read (step S11b), a degradation model expressed as a Kronecker product is constructed (step S12b), and a singular value decomposition (SVD) of the function H of the degradation model is performed (step S13b).

The system equation for g is then rewritten, and a new image g_KPA is calculated (step S14).

A minimization process is then performed on the calculated new image g_KPA (step S15), and it is determined whether the resulting f_k satisfies the test condition (step S16).

Here, k is the iteration number, and ε and c are the thresholds used in the determination.

If the determination in step S16 is False, that is, if f_k obtained in step S15 does not satisfy the test condition, a minimization process is performed on the function H of the degradation model (step S17), the process returns to step S13b, a singular value decomposition (SVD) is performed on the function H_{k+1} obtained in step S17, and steps S13b through S17 are repeated. If the determination in step S16 is True, that is, if f_k obtained in step S15 satisfies the test condition, f_k is adopted as the estimated image (step S18), and the learning process for one input image g ends.

The characteristics of the inverse filter 22 are determined by carrying out this learning process on a large number of input images g.

That is, h(x, y) * f(x, y) is written here as Hf, the system equation is set up in that form, f is approximated, and the target new image g_E is derived.

Here, E denotes prediction. The new image g_E is constructed so as to preserve and enhance the edge details of the original image.

The new image g_E is obtained using the operators C_EP and C_EN, which are the edge-preserving and edge-enhancing operators, respectively.

A simple Laplacian kernel C_EN = ∇²f and a Gaussian kernel C_EP with control parameters β and γ are selected.

The minimization problem is then reformulated, and the function H of the degradation model is estimated from the singular value decomposition (SVD) and used.

By performing filtering that removes noise such as blur and camera shake contained in the input image information by means of the image tensor operation technique and the adaptive blur function correction technique, as the preprocessing unit 20 of the video signal conversion system 100 does, noise can be removed and the image can also be sharpened and its edges enhanced.

In the video signal conversion system 100, the image information denoised by the preprocessing unit 20 is compression-encoded by the compression encoding processing unit 30, and the frame rate of the compression-encoded image information is raised by the high frame rate processing unit 40.

The compression encoding processing unit 30 of the video signal conversion system 100 performs compression encoding based on fluency theory. As shown in FIG. 19, it includes a first functionalization processing unit 31, a second functionalization processing unit 32, and an encoding processing unit 33 that describes, in a predetermined format, and encodes the image information functionalized by the first functionalization processing unit 31 and the second functionalization processing unit 32.

The first functionalization processing unit 31 consists of a corresponding point estimation unit 31A, which estimates corresponding points between a plurality of frame images for the image information denoised by the preprocessing unit 20, and a motion functionalization processing unit 31B, which expresses the image information of moving parts as functions using the image information of the corresponding points of each frame image estimated by the corresponding point estimation unit 31A.

The corresponding point estimation unit 31A is configured, for example, as shown in FIG. 20.

That is, the corresponding point estimation unit 31A consists of: a first partial region extraction unit 311, which extracts a partial region of a frame image; a second partial region extraction unit 312, which extracts, from another consecutive frame image, a partial region similar to the partial region extracted by the first partial region extraction unit 311; a function approximation unit 313, which converts the partial regions extracted by the first partial region extraction unit 311 and the second partial region extraction unit 312 to the same scale and outputs the shading of each converted image expressed as a piecewise polynomial function in accordance with fluency theory; a correlation value calculation unit 314, which calculates the correlation value of the outputs of the function approximation unit 313; and a shift amount calculation unit 315, which calculates the image displacement that gives the maximum of the correlation value calculated by the correlation value calculation unit 314 and outputs it as the shift amount of the corresponding point.

In the corresponding point estimation unit 31A, the first partial region extraction unit 311 extracts a partial region of a frame image as a template, and the second partial region extraction unit 312 extracts, from another consecutive frame image, a partial region similar to that template. The function approximation unit 313 converts the partial regions extracted by the first partial region extraction unit 311 and the second partial region extraction unit 312 to the same scale and expresses the shading of each converted image as a piecewise polynomial function.

この対応点推定部３１Ａは、画像の濃淡を連続的な変化状態として捉え、フルーエンシ情報理論により、画像の対応点を推定するものであって、第１の部分領域抽出部３１１と、第２の部分領域抽出部３１２と、函数近似部３１３と、相関値演算部３１４と、ずれ量演算部３１５からなる。 The corresponding point estimation unit 31A captures the lightness and darkness of the image as a continuous change state, and estimates the corresponding point of the image by fluency information theory. The corresponding point estimation unit 31A and the second partial region extraction unit 311 It consists of a partial area extraction unit 312, a function approximation unit 313, a correlation value calculation unit 314, and a deviation amount calculation unit 315.

この対応点推定部３１Ａにおいて、第１の部分領域抽出部３１１は、入力画像についてフレーム画像の部分領域を抽出する。 In the corresponding point estimation unit 31A, the first partial region extraction unit 311 extracts a partial region of the frame image from the input image.

また、第２の部分領域抽出部３１２は、上記第１の部分領域抽出部３１１により抽出した部分領域に相似な連続する他のフレーム画像の部分領域を抽出する。 The second partial region extraction unit 312 extracts partial regions of other continuous frame images similar to the partial region extracted by the first partial region extraction unit 311.

また、函数近似部３１３は、上記第１の部分領域抽出部３１１及び上記第２の部分領域抽出部３１２により抽出された各部分領域を同一比に変換し、変換した各画像の濃淡をフルーエンシ理論に従って区分多項式で函数表現して出力する。 The function approximating unit 313 converts the partial regions extracted by the first partial region extracting unit 311 and the second partial region extracting unit 312 into the same ratio, and the density of each converted image is converted to a fluency theory. According to, function representation with piecewise polynomial is output.

また、相関値演算部３１４は、上記函数近似部３１３の出力の相関値を演算する。 The correlation value calculation unit 314 calculates the correlation value of the output of the function approximation unit 313.

さらに、ずれ量演算部３１５は、上記相関値演算部３１４により算出される相関値の最大値を与える画像の位置ずれを演算し、該演算値を対応点のずれ量として出力する。 Further, the deviation amount calculation unit 315 calculates the positional deviation of the image that gives the maximum correlation value calculated by the correlation value calculation unit 314, and outputs the calculated value as the deviation amount of the corresponding point.

そして、この対応点推定部３１では、第１の部分領域抽出部３１１によりフレーム画像の部分領域をテンプレートとして抽出するとともに、上記第１の部分領域抽出部３１１により抽出した部分領域に相似な連続する他のフレーム画像の部分領域を第２の部分領域抽出部３１２により抽出し、函数近似部３１３により上記第１の部分領域抽出部３１１及び上記第２の部分領域抽出部３１２により抽出された各部分領域を同一比に変換し、変換した各画像の濃淡を区分多項式で函数表現する。 In the corresponding point estimation unit 31, the first partial region extraction unit 311 extracts a partial region of the frame image as a template, and is continuous similar to the partial region extracted by the first partial region extraction unit 311. The partial areas of the other frame images are extracted by the second partial area extraction unit 312, and the respective parts extracted by the function approximation unit 313 by the first partial region extraction unit 311 and the second partial region extraction unit 312. The area is converted to the same ratio, and the density of each converted image is expressed as a function with a piecewise polynomial.

Here, assume that the images f_1(x, y) and f_2(x, y) belong to the space S^(m)(R^2), and let φ_m(t) be a piecewise polynomial of degree (m−2) expressed by the following equation (5).

If the space S^(m)(R^2) is expressed by the following equation (6),

the inter-frame correlation function c(τ_1, τ_2) can be expressed by the following equation (7).

From the above assumption, that is,

equation (7) for the frame correlation function can be rewritten as the following equation (8).

That is, the inter-frame correlation function c(τ_1, τ_2) belongs to the space S^(2m)(R^2) in which 2m-order interpolation is performed, as shown in FIG. 21. The sampling frequency ψ_2m(τ_1, τ_2) of this space S^(2m)(R^2) exists uniquely, and the inter-frame correlation function c(τ_1, τ_2) is expressed by the following equation (9).

From equation (8), a piecewise polynomial function of degree (2m−1) can be constructed to interpolate the correlation surface.

That is, a block-based motion vector estimation approach is first used to obtain suitable initial estimates of the individual block motion vectors in equation (7), and equation (8) is then applied to obtain the true motion with arbitrary accuracy.

The general form of the separable correlation surface interpolation function is given by equation (10).

Here, C_k and d_l are interpolation coefficients, and M_2m(x) = φ_2m(x + 2)·φ_m(x) is a B-spline of degree (m−1).

With an appropriate truncation limit in equation (10), the correlation function c(τ_1, τ_2) can be approximated by the following equation (11).

Here, K_1 = [τ_1] − s + 1, K_2 = [τ_2] + s, L_1 = [τ_2] − s + 1, and L_2 = [τ_2] + s, where s determines φ_m(x).

Then, for example, when m = 2, the desired interpolation formula is obtained by substituting the following equation (12) into equation (11).

The motion vector v is derived using the following equation (13).

The correlation function c(τ_1, τ_2) can be reconstructed using only the information at integer points, and the correlation value calculation unit 314 uses this correlation function to calculate the correlation value of the outputs of the function approximation unit 313.

The shift amount calculation unit 315 then calculates the motion vector V by equation (13), which indicates the positional shift of the image giving the maximum correlation value calculated by the correlation value calculation unit 314, and outputs the resulting motion vector V as the shift amount of the corresponding point.
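As an illustration of locating the correlation maximum with sub-sample accuracy from integer-point values, the following sketch refines the integer peak of a sampled correlation surface. It fits a parabola through the peak and its two neighbours along each axis, which is a common stand-in and not the piecewise-polynomial interpolation of equations (11) to (13); the function name `refine_peak` and the array layout (rows by columns) are assumptions made for this example.

```python
import numpy as np


def refine_peak(corr):
    """Return (row, col) of the correlation maximum with sub-sample precision.

    Hypothetical helper: uses a separable three-point parabola fit, not the
    patent's fluency interpolation.
    """
    corr = np.asarray(corr, dtype=float)
    r, c = np.unravel_index(np.argmax(corr), corr.shape)

    def parabola_offset(left, centre, right):
        # Vertex of the parabola through (-1, left), (0, centre), (1, right).
        denom = left - 2.0 * centre + right
        if denom >= 0.0:  # flat or not a strict maximum: keep the integer position
            return 0.0
        return 0.5 * (left - right) / denom

    dr = dc = 0.0
    if 0 < r < corr.shape[0] - 1:
        dr = parabola_offset(corr[r - 1, c], corr[r, c], corr[r + 1, c])
    if 0 < c < corr.shape[1] - 1:
        dc = parabola_offset(corr[r, c - 1], corr[r, c], corr[r, c + 1])
    return r + dr, c + dc
```

For a surface that is exactly quadratic around its peak the fit recovers the true peak position; for other surfaces it is only an approximation.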

FIG. 22 schematically shows how the motion vector V is determined by corresponding point estimation in the corresponding point estimation unit 31A.

That is, as shown in FIG. 22(A), the corresponding point estimation unit 31A takes out a partial region of the frame image (k) and extracts a similar partial region from another consecutive frame image. As shown in FIG. 22(B), it calculates the correlation between the frames using the correlation function c(τ_1, τ_2) given by

and, as shown in FIG. 22(C), detects the motion at the peak point of the correlation surface and obtains the motion vector v by equation (13), thereby determining the motion of the pixels in the frame image (k) as shown in FIG. 22(D).

The motion vectors of the blocks of the frame image (k) determined in this way vary more smoothly from block to block than motion vectors determined for the same frame image (k) by conventional block matching.

For example, as shown in FIG. 23(A), frame 1 and frame 2, in which the subject rotates, were enlarged fourfold by two-frame corresponding point estimation and non-uniform interpolation. As shown in FIGS. 23(B1) and 23(C1), the motion vectors estimated at corresponding points obtained by conventional block matching contain portions where the variation is not smooth, whereas the motion vectors estimated at corresponding points obtained by the corresponding point estimation unit 31A configured as described above vary smoothly overall, as shown in FIGS. 23(B2) and 23(C2). Moreover, the amount of computation needed for 1/N accuracy is N^2 in the conventional method but only N in the present method.

The motion function processing unit 31B then expresses the image information of the moving portion as a function, using the motion vector V obtained by corresponding point estimation in the corresponding point estimation unit 31A.

That is, once the corresponding points of the partial moving image have been estimated for each reference frame, their movement amount, i.e. the shift amount of the corresponding points, corresponds to changes in the coordinate positions x and y within the frame. With the origin of the frame taken at the upper left corner as shown in FIG. 24, the motion function processing unit 31B represents the motion of the image in each frame, such as that shown in FIG. 25(A), as the motion of the X coordinate and the Y coordinate of each frame as shown in FIGS. 25(B) and 25(C), and approximates the change in each of the X and Y coordinates by a function. Then, as shown in FIG. 26, motion compensation is performed by interpolating with that function to estimate positions between frames.
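The idea of turning the X and Y coordinates of a tracked corresponding point into functions of time, and then reading off a position between frames, can be sketched as follows. This minimal example uses an ordinary least-squares polynomial fit from NumPy as a stand-in for the fluency function approximation; the function names, the polynomial degree, and the sample data are assumptions for illustration only.

```python
import numpy as np


def fit_trajectory(frame_times, xs, ys, degree=2):
    """Fit x(t) and y(t) separately; returns two coefficient arrays.

    Hypothetical helper: a plain polynomial fit, not a fluency function.
    """
    px = np.polyfit(frame_times, xs, degree)
    py = np.polyfit(frame_times, ys, degree)
    return px, py


def position_at(px, py, t):
    """Evaluate the fitted trajectory at an arbitrary (possibly fractional) time t."""
    return float(np.polyval(px, t)), float(np.polyval(py, t))


# Corresponding point observed in frames 0..3 (origin at the upper left corner).
times = np.array([0.0, 1.0, 2.0, 3.0])
px, py = fit_trajectory(times, 10.0 + 2.0 * times, 5.0 + times**2)
x_mid, y_mid = position_at(px, py, 1.5)  # position halfway between frames 1 and 2
```

Fitting each coordinate independently mirrors the separate X and Y curves of FIGS. 25(B) and 25(C).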

The second function processing unit 32 encodes the input image by fluency function processing, which approximates contour, gray-level, and inter-frame information on the basis of fluency information theory. It comprises an automatic region classification processing unit 32A, a contour function approximation processing unit 32B, a gray-level function processing unit 32C, a frequency function approximation processing unit 32D, and so on.

The automatic region classification processing unit 32A classifies the input image, on the basis of fluency information theory, into piecewise planar regions (m ≤ 2), piecewise curved-surface regions (m = 3), piecewise spherical regions (m = ∞), and irregular regions (m ≥ 4).

Fluency information theory uses the concept of a signal space and classifies signals into classes specified by the order m.

The signal space ^mS is represented by piecewise polynomials of degree (m−1) that are (m−2) times continuously differentiable.

It has been proved that the signal space ^mS coincides with the space of step functions when m = 1 and with that of Fourier power functions when m = ∞. The fluency model defines fluency sampling functions and thereby clarifies the relationship between signals belonging to the signal space ^mS and discrete-time signals.
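To make the role of the order m concrete, the sketch below evaluates a centred B-spline of order m, the standard example of a piecewise polynomial of degree (m−1) that is (m−2) times continuously differentiable. Using the B-spline as a representative of the class m signal space is an assumption made for illustration; the function name `bspline` and the truncated-power formula are not taken from the patent.

```python
from math import comb, factorial


def bspline(m, x):
    """Centred B-spline of order m (degree m - 1) with support [-m/2, m/2].

    m = 1 is the box (step) function, m = 2 the linear hat function,
    m = 3 the quadratic B-spline.
    """
    if m == 1:
        return 1.0 if -0.5 <= x < 0.5 else 0.0
    total = 0.0
    for k in range(m + 1):
        u = x + m / 2 - k
        if u > 0:  # truncated power (u)_+^(m-1)
            total += (-1) ** k * comb(m, k) * u ** (m - 1)
    return total / factorial(m - 1)
```

With m = 1 the function is a step, matching the statement that the m = 1 space consists of step functions; higher m gives progressively smoother pieces.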

The contour function approximation processing unit 32B comprises an automatic contour classification processing unit 321 and a function approximation processing unit 322. The automatic contour classification processing unit 321 extracts the straight lines, arcs, and quadratic curves contained in the piecewise planar regions (m ≤ 2), piecewise curved-surface regions (m = 3), and piecewise spherical regions (m = ∞) classified by the automatic region classification processing unit 32A, and the function approximation processing unit 322 approximates them by functions.

The gray-level function processing unit 32C performs gray-level function processing, using fluency functions, on the piecewise planar regions (m ≤ 2), piecewise curved-surface regions (m = 3), and piecewise spherical regions (m = ∞) classified by the automatic region classification processing unit 32A.

The frequency function approximation processing unit 32D performs frequency function approximation, by DCT or the like, on the irregular regions (m ≥ 4) classified by the automatic region classification processing unit 32A, that is, on regions that cannot be expressed by polynomials.
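As a rough illustration of frequency-domain approximation of a block that is not well described by polynomials, the sketch below applies an orthonormal two-dimensional DCT-II to a square block and reconstructs it from the lowest `keep` × `keep` coefficients. The block size, the coefficient selection rule, and the function names are assumptions; the text only states that DCT or a similar transform is used.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II matrix of size n x n."""
    k = np.arange(n).reshape(-1, 1)
    i = np.arange(n).reshape(1, -1)
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    mat[0, :] = np.sqrt(1.0 / n)
    return mat


def dct_approximate(block, keep):
    """Approximate a square block using only its keep x keep lowest DCT coefficients."""
    block = np.asarray(block, dtype=float)
    d = dct_matrix(block.shape[0])
    coeffs = d @ block @ d.T          # forward 2-D DCT
    mask = np.zeros_like(coeffs)
    mask[:keep, :keep] = 1.0          # discard higher-frequency terms
    return d.T @ (coeffs * mask) @ d  # inverse 2-D DCT
```

Keeping all coefficients reproduces the block exactly; reducing `keep` trades accuracy for a more compact description.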

The second function processing unit 32 can express the gray levels and contours of an image using a large number of multivariable fluency functions for each frame of the video.

The encoding processing unit 33 then describes, in a predetermined format, and encodes the image information expressed as functions by the first function processing unit 31 and the second function processing unit 32.

In this video signal conversion system 100, as described above, the preprocessing unit 20 applies noise removal to the image information input from the image input unit 10, such as an imaging device; the compression encoding processing unit 30 compression-encodes the noise-removed image information; and the high frame rate processing unit 40, which uses the frame rate conversion apparatus 1, tracks corresponding video points between frames, expresses their temporal transition as a function, and generates function-interpolated frames according to the ratio between the number of original frames and the number of frames after conversion.

That is, the video signal conversion system 100 expresses contours and the like using a large number of fluency functions for each frame of the video, and represents the discrete frame sequence in the time direction by a continuous function based on piecewise polynomials, so that high-quality video can be reproduced at an arbitrary frame rate.
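Because the frame sequence is treated as a continuous function of time, an output frame can be requested at any instant. The sketch below computes, for a given source and target frame rate, where each output frame falls relative to the source frames: the index of the preceding source frame and the fractional offset toward the next one. The function name and the use of exact fractions are assumptions made for this example.

```python
from fractions import Fraction


def output_frame_positions(src_fps, dst_fps, n_out):
    """Return (source_frame_index, fractional_offset) for the first n_out output frames."""
    step = Fraction(src_fps, dst_fps)  # source frames advanced per output frame
    positions = []
    for j in range(n_out):
        t = j * step
        k = t.numerator // t.denominator  # preceding source frame
        positions.append((k, t - k))      # offset in [0, 1) toward frame k + 1
    return positions


# 24 frames/s converted to 60 frames/s: each output frame advances 2/5 of a source frame.
positions_24_to_60 = output_frame_positions(24, 60, 5)
```

An offset of zero means the output frame coincides with a source frame; any other offset identifies the instant at which an interpolated frame has to be generated.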

Fluency information theory classifies signal spaces into classes specified by the order m, based on the degree to which a signal is continuously differentiable.

For any m > 2, the spanned subspace is represented by piecewise polynomials of degree (m−1) that are continuously differentiable only (m−2) times.

The sampling function ψ(x) of the (m = 3) class is expressed by the following equation (14) as a linear combination of piecewise polynomials of degree 2 that are continuously differentiable only once.

Here, φ(x) is given by the following equation (15).

Since ψ(x) is a sampling function, the function over an interval can be obtained by convolving it with the sample sequence.

Here, when τ = 1, equation (14) can be expressed as the piecewise polynomial given by the following equation (16).
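The convolution of a sampling function with a sample sequence can be sketched as follows. The piecewise-quadratic function `psi` below is an assumed form for unit sample spacing: it has support [−2, 2], is once continuously differentiable, equals 1 at 0, and vanishes at all other integers. Equations (14) to (16) are not reproduced in this text, so agreement with them is not claimed, and the function names are hypothetical.

```python
import numpy as np


def psi(t):
    """Assumed piecewise-quadratic sampling function with support [-2, 2]."""
    a = np.abs(np.asarray(t, dtype=float))
    return np.select(
        [a < 0.5, a < 1.0, a < 1.5, a <= 2.0],
        [
            (-7 * a**2 + 4) / 4,
            (5 * a**2 - 12 * a + 7) / 4,
            (3 * a**2 - 8 * a + 5) / 4,
            (-(a**2) + 4 * a - 4) / 4,
        ],
        default=0.0,
    )


def interpolate(samples, t):
    """Value at real-valued position t of the function through uniformly spaced samples."""
    samples = np.asarray(samples, dtype=float)
    k = np.arange(len(samples))
    return float(np.sum(samples * psi(t - k)))
```

Because the function has finite support, each interpolated value depends on at most four neighbouring samples, and the sample values themselves are reproduced exactly at integer positions.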

For example, the non-uniform fluency interpolation function of the (m = 3) class

is a function such as that shown in FIG. 27.

The non-uniform interpolation fluency function

consists of eight piecewise polynomials of degree 2. The non-uniform interpolation fluency function of the (m = 3) class is defined on the non-uniform intervals designated s_1(x) to s_8(x) as shown in FIG. 27, and its components are given by the following equation (17).

Here, the following holds:


FIG. 28 shows a practical example of high-resolution interpolation.

FIG. 29 shows a specific example of the pixel structure used for interpolation.

In FIG. 29, the pixels of Frame_1 have different motion vectors

that change the pixels in Frame_2.

FIG. 28 illustrates the concept of one-dimensional image interpolation from two consecutive frames.

Motion estimation is performed by a full-search block matching algorithm with a known block size and search window size.

A high-resolution frame pixel is denoted by f(τ_x, τ_y), and the pixel structure is as shown in the example of the high-resolution interpolation approach in FIG. 29.

In the first step, two consecutive frames are obtained from the video sequence and denoted by f_1(x, y) and f_2(x, y).

In the second step, an initial estimate of the motion vector is obtained by

where

In equation (18),

denotes the average of the search window, and

denotes the average of the current block in the matching.
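A full-search block matching step that subtracts the mean of the current block and the mean of each candidate window can be sketched as below. The score used here is the zero-mean normalized cross-correlation, which is an assumption about the general shape of equation (18) rather than a reproduction of it; the function names, the block size, and the search radius are likewise hypothetical.

```python
import numpy as np


def zncc(block, candidate):
    """Zero-mean normalized cross-correlation of two equally sized arrays."""
    b = block - block.mean()
    c = candidate - candidate.mean()
    denom = np.sqrt((b * b).sum() * (c * c).sum())
    return 0.0 if denom == 0.0 else float((b * c).sum() / denom)


def full_search(frame1, frame2, top, left, size, radius):
    """Return ((dy, dx), score) maximizing the correlation over all integer shifts."""
    block = frame1[top:top + size, left:left + size].astype(float)
    best_score, best_v = -np.inf, (0, 0)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            # Skip candidates that fall outside the second frame.
            if y < 0 or x < 0 or y + size > frame2.shape[0] or x + size > frame2.shape[1]:
                continue
            score = zncc(block, frame2[y:y + size, x:x + size].astype(float))
            if score > best_score:
                best_score, best_v = score, (dy, dx)
    return best_v, best_score
```

The integer vector returned here plays the role of the initial estimate; sub-pixel refinement is a separate step.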

In the third step, using equations (12) and (17), for every pixel

a motion vector is obtained, starting from the second-step motion vector

and searching within one pixel around it.

In the fourth step, uniform horizontal interpolation is performed as follows.

In the fifth step, non-uniform vertical interpolation using the pixels obtained in the fourth step is performed according to equation (20).

The fourth and fifth steps are repeated for every pixel of the high-resolution image.
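The two-pass structure of the fourth and fifth steps, a horizontal pass on uniformly spaced columns followed by a vertical pass on non-uniformly spaced rows, can be sketched as follows. Linear interpolation via `numpy.interp` is used purely as a stand-in for the fluency interpolation functions of equations (17) and (20); the function name, the argument layout, and the idea of passing explicit row positions are assumptions for this example.

```python
import numpy as np


def two_pass_interpolate(image, out_cols, row_positions, out_rows):
    """Separable interpolation: uniform horizontal pass, then non-uniform vertical pass.

    image         : 2-D array whose columns sit at integer positions 0, 1, 2, ...
    out_cols      : horizontal positions at which output values are wanted
    row_positions : increasing, possibly non-uniform vertical positions of the input rows
    out_rows      : vertical positions at which output values are wanted
    """
    image = np.asarray(image, dtype=float)
    in_cols = np.arange(image.shape[1])
    # Pass 1: uniform horizontal interpolation along every input row.
    horiz = np.stack([np.interp(out_cols, in_cols, row) for row in image])
    # Pass 2: non-uniform vertical interpolation along every output column.
    return np.stack(
        [np.interp(out_rows, row_positions, horiz[:, j]) for j in range(horiz.shape[1])],
        axis=1,
    )
```

Non-uniform row positions arise naturally when rows from a second, motion-shifted frame are merged with rows of the first frame, which is why the vertical pass cannot assume equal spacing.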

In video coding based on fluency theory, selecting a signal space suited to the original signal and expressing the signal as functions makes it possible to achieve high compression while preserving sharpness.

By accurately determining, on the basis of fluency theory, the function space to which the inter-frame correlation function belongs, motion vectors can be obtained with arbitrary accuracy.

For example, as shown in FIG. 30(A), consider generating a frame at an arbitrary time between frame k and frame k+1. In conventional high frame rate processing, the interpolated frame F(k+1/2) is generated by uniform interpolation, and the gradation values of the corresponding points are generated with 1/2 accuracy by block matching using motion information obtained by 1/2-accuracy motion estimation; as shown in FIGS. 30(B1) and 30(C1), the moving portions of the inserted interpolated frame are degraded. In contrast, the high frame rate processing unit 40 performs corresponding point estimation, generates the gradation values of the corresponding points of the interpolated frame by uniform interpolation using the gradation values of the estimated corresponding points, and further generates the gradation values of the corresponding points of the interpolated frame by non-uniform interpolation. With this processing, the frame rate could be increased without degrading the moving portions, as shown in FIGS. 30(B2) and 30(C2).

In this video signal conversion system 100, the preprocessing unit 20 applies noise removal to the image information input from the image input unit 10, such as an imaging device, and the compression encoding processing unit 30 compression-encodes the noise-removed image information. The high frame rate processing unit 40 tracks corresponding video points between frames, expresses their temporal transition as a function, and generates function-interpolated frames according to the ratio between the number of original frames and the number of frames after conversion, thereby raising the frame rate of the image information compression-encoded by the compression encoding processing unit 30. A video signal with clear and smooth motion can thus be obtained.

FIG. 1 is a block diagram showing a configuration example of the frame rate conversion apparatus.
FIG. 2 is a diagram schematically showing the high frame rate processing performed by the frame rate conversion apparatus.
FIG. 3 is a flowchart showing the procedure of the high frame rate processing performed by the frame rate conversion apparatus.
FIG. 4 is a diagram schematically showing the content of the high frame rate processing performed by the frame rate conversion apparatus.
FIG. 5 is a diagram for explaining the non-uniform interpolation processing in the frame rate conversion apparatus.
FIG. 6 is a diagram for explaining the image interpolation processing that determines the value at a pixel position newly generated when the resolution of an image is converted.
FIG. 7 is a diagram showing examples of a uniform interpolation function and a non-uniform interpolation function.
FIG. 8 is a diagram schematically showing the content of the image interpolation processing.
FIG. 9 is a block diagram showing a configuration example of an enlargement interpolation processing apparatus.
FIG. 10 is a block diagram showing a configuration example of the SRAM selection unit in the enlargement interpolation processing apparatus.
FIG. 11 is a block diagram showing a configuration example of the image processing block in the enlargement interpolation processing apparatus.
FIG. 12 is a diagram schematically showing two frame images input to the image processing module in the enlargement interpolation processing apparatus.
FIG. 13 is a flowchart showing the procedure of the enlargement interpolation processing in the enlargement interpolation processing apparatus.
FIG. 14 is a block diagram showing a configuration example of a frame rate conversion apparatus having an enlargement interpolation processing function.
FIG. 15 is a block diagram showing the configuration of a video signal conversion system to which the present invention is applied.
FIG. 16 is a block diagram showing a system model used to construct the preprocessing unit in the video signal conversion system.
FIG. 17 is a block diagram showing a restoration system model used to construct the preprocessing unit in the video signal conversion system.
FIG. 18 is a flowchart showing the procedure of each process for the characteristics of the inverse filter used in the preprocessing unit.
FIG. 19 is a block diagram showing the configuration of the compression encoding processing unit in the video signal conversion system.
FIG. 20 is a block diagram showing the configuration of the corresponding point estimation unit provided in the compression encoding processing unit.
FIG. 21 is a diagram for explaining the space in which 2m-order interpolation is performed and to which the inter-frame correlation function belongs.
FIG. 22 is a diagram schematically showing how a motion vector is determined by corresponding point estimation in the corresponding point estimation unit.
FIG. 23 is a diagram comparing motion vectors determined by corresponding point estimation in the corresponding point estimation unit with motion vectors determined by conventional block matching.
FIG. 24 is a diagram for explaining the origin of the frame image handled by the motion function processing unit provided in the compression encoding processing unit.
FIG. 25 is a diagram schematically showing the motion of the image in each frame as the motion of the X coordinate and the Y coordinate of each frame.
FIG. 26 is a diagram schematically showing the content of the processing for estimating positions between frames.
FIG. 27 is a diagram showing the non-uniform fluency interpolation function of the (m = 3) class.
FIG. 28 is a diagram showing a practical example of the high-resolution interpolation approach.
FIG. 29 is a diagram showing a specific example of the pixel structure used for interpolation.
FIG. 30 is a diagram comparing an intermediate frame generated by the high frame rate processing unit with an intermediate frame generated by a conventional method.

Explanation of symbols

1: frame rate conversion apparatus; 2: corresponding point estimation processing unit; 3: first gradation value generation processing unit; 4: second gradation value generation processing unit; 5: third gradation value generation processing unit; 10: image input unit; 20: preprocessing unit; 21: degradation model; 22: inverse filter; 30: compression encoding processing unit; 40: high frame rate processing unit; 100: video signal conversion system; 110: frame rate conversion apparatus; 111: first function approximation processing unit; 112: corresponding point estimation processing unit; 113: second function approximation processing unit; 114: third function approximation processing unit

Claims

A frame rate conversion apparatus comprising:
a corresponding point estimation processing unit that, for a plurality of pixels in a reference frame, expresses the gray value at each pixel point as a continuous function of position, and estimates, as each corresponding point position, the position that gives the maximum correlation between the function-approximated image gray levels in a plurality of image frames at different times;
a first gradation value generation processing unit that, for the gray level at each estimated corresponding point position in each image frame, obtains the gradation value at that corresponding point position from gradation values indicating the gray levels of neighboring pixel points;
a second gradation value generation processing unit that, for an interpolation frame generated between the image frames at the frame rate ratio to be converted, and for the plurality of pixels in the reference frame, approximates the gray levels along the corresponding point trajectory by a fluency function from the gradation values at the corresponding point positions estimated in the image frames, and obtains each gradation value at the corresponding point positions in the interpolation frame from that function; and
a third gradation value generation processing unit that generates the gradation value of each pixel in the vicinity of each corresponding point position in the interpolation frame from the gradation values at the corresponding point positions in the interpolation frame.

A frame rate conversion method comprising:
a corresponding point estimation processing step of, for a plurality of pixels in a reference frame, expressing the gray value at each pixel point as a continuous function of position, and estimating, as each corresponding point position, the position that gives the maximum correlation between the function-approximated image gray levels in a plurality of image frames at different times;
a first gradation value generation processing step of, for the gray level at each estimated corresponding point position in each image frame, obtaining the gradation value at that corresponding point position from gradation values indicating the gray levels of neighboring pixel points;
a second gradation value generation processing step of, for an interpolation frame generated between the image frames at the frame rate ratio to be converted, and for the plurality of pixels in the reference frame, approximating the gray levels along the corresponding point trajectory by a fluency function from the gradation values at the corresponding point positions estimated in the image frames, and obtaining each gradation value at the corresponding point positions in the interpolation frame from that function; and
a third gradation value generation processing step of generating the gradation value of each pixel in the vicinity of each corresponding point position in the interpolation frame from the gradation values at the corresponding point positions in the interpolation frame.

A frame rate conversion program executed by a computer provided in a frame rate conversion apparatus, the program causing the computer to function as:
a corresponding point estimation processing unit that, for a plurality of pixels in a reference frame, expresses the gray value at each pixel point as a continuous function of position, and estimates, as each corresponding point position, the position that gives the maximum correlation between the function-approximated image gray levels in a plurality of image frames at different times;
a first gradation value generation processing unit that, for the gray level at each estimated corresponding point position in each image frame, obtains the gradation value at that corresponding point position from gradation values indicating the gray levels of neighboring pixel points;
a second gradation value generation processing unit that, for an interpolation frame generated between the image frames at the frame rate ratio to be converted, and for the plurality of pixels in the reference frame, approximates the gray levels along the corresponding point trajectory by a fluency function from the gradation values at the corresponding point positions estimated in the image frames, and obtains each gradation value at the corresponding point positions in the interpolation frame from that function; and
a third gradation value generation processing unit that generates the gradation value of each pixel in the vicinity of each corresponding point position in the interpolation frame from the gradation values at the corresponding point positions in the interpolation frame.

A frame rate conversion apparatus comprising:
first function approximating means for approximating the gray-level distribution of a plurality of pixels in a reference frame with a continuous function of pixel position;
corresponding point estimation means for performing a correlation operation between the function approximated by the first function approximating means and the functions of the gray-level distribution in a plurality of image frames at different times, and taking each position that gives the maximum value as the corresponding point position in the respective image frame;
second function approximating means for expressing the corresponding point position in each image frame estimated by the corresponding point estimation means as coordinates given by the horizontal and vertical distances from the origin of that image frame, converting the changes in the horizontal position and the vertical position of the coordinate point across the plurality of image frames at different times into time-series signals, and approximating each time-series signal with a function; and
third function approximating means for, with respect to an interpolation frame at an arbitrary time between the plurality of image frames, taking as the corresponding point position the position in the interpolation frame that corresponds to the corresponding point positions in the image frames, as given by the function approximated by the second function approximating means, obtaining the gray value at that corresponding point position by interpolation from the gray values at the corresponding points of the image frames, fitting the first function approximation to the gray value at the corresponding point of the interpolation frame to obtain the gray-level distribution in the vicinity of the corresponding point, and converting the gray values in the vicinity of the corresponding point into gray values of the pixel points in the interpolation frame.
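
The correlation-based corresponding point estimation can be sketched in one dimension as follows. This is an illustrative assumption, not the claimed means: the gray-level signals are plain sample lists, the correlation is a zero-mean normalized correlation evaluated at integer shifts, and a parabola fitted through the peak and its two neighbors stands in for locating the maximum of the continuous-function correlation. The names `correlation` and `corresponding_position` are hypothetical.

```python
def correlation(ref, target, shift):
    """Zero-mean normalized correlation between ref and the window of
    target that starts at the given integer shift."""
    n = len(ref)
    win = target[shift:shift + n]
    mean_ref = sum(ref) / n
    mean_win = sum(win) / n
    num = sum((r - mean_ref) * (w - mean_win) for r, w in zip(ref, win))
    den = (sum((r - mean_ref) ** 2 for r in ref)
           * sum((w - mean_win) ** 2 for w in win)) ** 0.5
    return num / den if den else 0.0


def corresponding_position(ref, target):
    """Position in target that gives the maximum correlation with ref,
    refined to sub-sample precision with a three-point parabola fit."""
    scores = [correlation(ref, target, s)
              for s in range(len(target) - len(ref) + 1)]
    k = max(range(len(scores)), key=scores.__getitem__)
    if 0 < k < len(scores) - 1:
        left, mid, right = scores[k - 1], scores[k], scores[k + 1]
        curvature = left - 2 * mid + right
        if curvature < 0:  # a proper maximum, so the vertex is well defined
            return k + 0.5 * (left - right) / curvature
    return float(k)


# Gray-level pattern from the reference frame and a later frame in which
# the same pattern appears two samples to the right.
ref = [0, 1, 4, 1, 0]
target = [0, 0, 0, 1, 4, 1, 0, 0]
position = corresponding_position(ref, target)
```

In two dimensions the same idea applies with a search over horizontal and vertical shifts; the sub-sample refinement is what allows corresponding points to fall between pixel positions.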

A frame rate conversion method comprising:
a first function approximating step of approximating the gray-level distribution of a plurality of pixels in a reference frame with a continuous function of pixel position;
a corresponding point estimation step of performing a correlation operation between the function approximated in the first function approximating step and the functions of the gray-level distribution in a plurality of image frames at different times, and taking each position that gives the maximum value as the corresponding point position in the respective image frame;
a second function approximating step of expressing the corresponding point position in each image frame estimated in the corresponding point estimation step as coordinates given by the horizontal and vertical distances from the origin of that image frame, converting the changes in the horizontal position and the vertical position of the coordinate point across the plurality of image frames at different times into time-series signals, and approximating each time-series signal with a function; and
a third function approximating step of, with respect to an interpolation frame at an arbitrary time between the plurality of image frames, taking as the corresponding point position the position in the interpolation frame that corresponds to the corresponding point positions in the image frames, as given by the function approximated in the second function approximating step, obtaining the gray value at that corresponding point position by interpolation from the gray values at the corresponding points of the image frames, fitting the first function approximation to the gray value at the corresponding point of the interpolation frame to obtain the gray-level distribution in the vicinity of the corresponding point, and converting the gray values in the vicinity of the corresponding point into gray values of the pixel points in the interpolation frame.
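
The second function approximating step, and the use of its result to place a corresponding point in an interpolation frame at an arbitrary time, can be sketched as below. The sketch rests on assumptions that are not in the claim: time is measured in units of the input frame interval, three image frames are used per locus, and a Lagrange quadratic stands in for the approximating function. The names `interpolation_times`, `quadratic_through` and `corresponding_position_at` are hypothetical.

```python
from fractions import Fraction


def interpolation_times(n_in, fps_in, fps_out):
    """Times of the output frames, in units of the input frame interval,
    covering n_in input frames located at times 0, 1, ..., n_in - 1."""
    step = Fraction(fps_in, fps_out)
    times = []
    k = 0
    while k * step <= n_in - 1:
        times.append(k * step)
        k += 1
    return times


def quadratic_through(ts, vs, t):
    """Lagrange quadratic through three samples (ts[i], vs[i]), evaluated
    at time t (a stand-in for the claimed function approximation)."""
    (t0, t1, t2), (v0, v1, v2) = ts, vs
    return (v0 * (t - t1) * (t - t2) / ((t0 - t1) * (t0 - t2))
            + v1 * (t - t0) * (t - t2) / ((t1 - t0) * (t1 - t2))
            + v2 * (t - t0) * (t - t1) / ((t2 - t0) * (t2 - t1)))


def corresponding_position_at(ts, xs, ys, t):
    """Horizontal and vertical position of the corresponding point at time
    t, with each coordinate treated as a separate time-series signal."""
    return quadratic_through(ts, xs, t), quadratic_through(ts, ys, t)


# Converting 24 frames/second to 60 frames/second over three input frames.
times = interpolation_times(3, 24, 60)

# Corresponding point tracked at (0, 0), (1, 2), (4, 4) in frames 0, 1, 2.
x, y = corresponding_position_at((0, 1, 2), (0, 1, 4), (0, 2, 4), 1.5)
```

Using exact fractions for the output times avoids accumulated rounding when the frame rate ratio is not an integer, as with 24 to 60 frames per second.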

A frame rate conversion program executed by a computer provided in a frame rate conversion apparatus, the program causing the computer to function as:
first function approximating means for approximating the gray-level distribution of a plurality of pixels in a reference frame with a continuous function of pixel position;
corresponding point estimation means for performing a correlation operation between the function approximated by the first function approximating means and the functions of the gray-level distribution in a plurality of image frames at different times, and taking each position that gives the maximum value as the corresponding point position in the respective image frame;
second function approximating means for expressing the corresponding point position in each image frame estimated by the corresponding point estimation means as coordinates given by the horizontal and vertical distances from the origin of that image frame, converting the changes in the horizontal position and the vertical position of the coordinate point across the plurality of image frames at different times into time-series signals, and approximating each time-series signal with a function; and
third function approximating means for, with respect to an interpolation frame at an arbitrary time between the plurality of image frames, taking as the corresponding point position the position in the interpolation frame that corresponds to the corresponding point positions in the image frames, as given by the function approximated by the second function approximating means, obtaining the gray value at that corresponding point position by interpolation from the gray values at the corresponding points of the image frames, fitting the first function approximation to the gray value at the corresponding point of the interpolation frame to obtain the gray-level distribution in the vicinity of the corresponding point, and converting the gray values in the vicinity of the corresponding point into gray values of the pixel points in the interpolation frame.