JP6832252B2

JP6832252B2 - Super-resolution device and program

Info

Publication number: JP6832252B2
Application number: JP2017142681A
Authority: JP
Inventors: 俊枝三須; 境田　慎一; 慎一境田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2017-07-24
Filing date: 2017-07-24
Publication date: 2021-02-24
Anticipated expiration: 2037-07-24
Also published as: JP2019023798A

Description

本発明は、超解像装置およびプログラムに関する。 The present invention relates to super-resolution devices and programs.

画像の解像度を向上する手法として、画素間を補間する手法がある。また、ナイキスト周波数を超える信号成分を補い、または復元しつつ解像度を向上する超解像技術がある。超解像技術として、例えばベイズ推定に基づく手法や機械学習による手法がある。 As a method of improving the resolution of an image, there is a method of interpolating between pixels. There is also a super-resolution technology that improves resolution while supplementing or restoring signal components that exceed the Nyquist frequency. As super-resolution technology, for example, there are a method based on Bayesian estimation and a method based on machine learning.

ベイズ推定に基づく手法では、尤度と事前確率の積が事後確率に比例することに基づき、事後確率を最大化する高解像画像を求めることで出力すべき高解像画像を得る。具体的には、例えば、仮定した高解像画像を低解像化した結果が入力低解像画像と矛盾しないかを定量化したものを尤度として定義する。また、例えば、仮定した高解像画像が自然な画像であるか（例えば、画素値が空間的に滑らかか）を定量化したものを事前確率として定義する。ベイズ推定に基づく手法において、その解法に逐次モンテカルロ法を用いる手法が開発されている。特許文献１には、逐次モンテカルロ法を用いてベイズ推定を行う技術が記載されている。 In the method based on Bayesian estimation, a high-resolution image to be output is obtained by obtaining a high-resolution image that maximizes the posterior probability based on the product of the likelihood and the prior probability being proportional to the posterior probability. Specifically, for example, the likelihood is defined as quantifying whether or not the result of lowering the resolution of the assumed high-resolution image does not contradict the input low-resolution image. Further, for example, a prior probability is defined as quantifying whether the assumed high-resolution image is a natural image (for example, whether the pixel value is spatially smooth). In the method based on Bayesian inference, a method using the successive Monte Carlo method for the solution has been developed. Patent Document 1 describes a technique for performing Bayesian estimation using the sequential Monte Carlo method.

機械学習による手法としては、畳み込みニューラルネットワーク（ＣＮＮ）を用いるものが提案されている。ＣＮＮによる手法では、低解像画像を入力側、高解像画像を出力側にとった深層ＣＮＮを構成し、予め低解像画像と高解像画像の対を用いてネットワークの学習（ニューラルネットワークのシナプス結合の重みの調整）を行うものが主流である。
非特許文献１には、機械学習とベイズ推定とを組み合わせた手法として、高解像画像からの事前確率演算にＣＮＮを応用したものが記載されている。 As a method by machine learning, a method using a convolutional neural network (CNN) has been proposed. In the CNN method, a deep CNN with a low-resolution image on the input side and a high-resolution image on the output side is constructed, and network learning (neural network) is performed using a pair of a low-resolution image and a high-resolution image in advance. (Adjusting the weight of synaptic connections) is the mainstream.
Non-Patent Document 1 describes a method in which CNN is applied to prior probability calculation from a high-resolution image as a method combining machine learning and Bayesian estimation.

特許第５４０５３８９号公報Japanese Patent No. 5405389

Ryan Dahl, Mohammad Norouzi, and Jonathon Shlens: “Pixel Recursive Super Resolution,” arXiv:1702.00783v1 [cs.CV], 2 Feb. 2017.Ryan Dahl, Mohammad Norouzi, and Jonathon Shlens: “Pixel Recursive Super Resolution,” arXiv: 1702.00783v1 [cs.CV], 2 Feb. 2017.

補間による高解像化の手法では、入力低解像画像のナイキスト周波数を超える周波数成分が欠如するため、精細感のないぼやけた出力高解像画像しか得られない。 In the high resolution method by interpolation, since the frequency component exceeding the Nyquist frequency of the input low resolution image is lacking, only a blurred output high resolution image without a sense of fineness can be obtained.

超解像技術のうち、ベイズ推定に基づく手法では、画像を高次元で表現した際の高次元空間の効率的なモデル化に課題が残る。非特許文献１の手法では、対象画素の画素値の生起確率を、すでに復元した隣接画素値列から条件付き確率のチェインルールにより逐次的に再構成する方法が用いられている。しかし、チェインルールによる方法では、モデル化や学習の誤差や画素のスキャン順によって画像歪が発生するおそれがある。 Among the super-resolution techniques, the method based on Bayesian estimation still has a problem in efficient modeling of a high-dimensional space when an image is expressed in a high dimension. In the method of Non-Patent Document 1, a method is used in which the occurrence probability of the pixel value of the target pixel is sequentially reconstructed from the already restored adjacent pixel value sequence by the chain rule of the conditional probability. However, in the chain rule method, image distortion may occur due to modeling and learning errors and pixel scanning order.

本発明は、上記の課題認識に基づいて行なわれたものであり、入力された低解像画像を基に、視覚的に精細感のある高解像画像を得ることのできる超解像装置およびプログラムを提供しようとするものである。 The present invention has been made based on the above-mentioned problem recognition, and is a super-resolution device capable of obtaining a visually fine high-resolution image based on an input low-resolution image. It is intended to provide a program.

［１］上記の課題を解決するため、本発明の一態様による超解像装置は、反復演算により低解像画像から高解像画像を得る超解像装置であって、前記反復演算における時点に対応させた高解像画像候補を記憶する高解像画像候補記憶部と、前記反復演算における時点に対応させた事後確率マップを記憶する事後確率マップ記憶部と、前記反復演算における前時点の高解像画像候補から、所定の確率モデルにしたがって、仮高解像画像を確率的に生成する標本生成部と、前記仮高解像画像と前記低解像画像との間の局所領域ごとの整合度を評価して整合度の空間分布を尤度マップとして出力する尤度マップ演算部と、前記仮高解像画像が自然な画像である度合いを評価して自然な画像である度合いの空間分布を事前確率マップとして出力する事前確率マップ演算部と、前記高解像画像候補を更新して前記高解像画像候補記憶部に記憶させるとともに、前記事後確率マップを更新して前記事後確率マップ記憶部に記憶させる標本更新部と、前記高解像画像候補の時点の系列から、前記高解像画像の各画素位置における値を決定して出力する代表値演算部と、を具備し、前記標本更新部は、前記尤度マップ演算部が出力した尤度マップの各画素位置における値と前記事前確率マップ演算部が出力した前記事前確率マップの対応する画素位置における値の積と、前時点の前記事後確率マップの対応する画素位置における値と、所定の乱数値とに基づいて、前記仮高解像画像の当該画素位置における画素値または前時点の前記高解像画像候補の当該画素位置における画素値のいずれかを現時点の前記高解像画像候補の当該画素位置の画素値として決定するとともに、前記尤度マップ演算部が出力した尤度マップの各画素位置における値と前記事前確率マップ演算部が出力した前記事前確率マップの対応する画素位置における値の積と、前時点の前記事後確率マップの対応する画素位置における確率マップの同画素位置の値と、現時点の前記高解像画像候補の当該画素位置の画素値として決定するために用いた当該乱数値とに基づいて、現時点の対応する画素位置における前記積、または前時点の前記事後確率マップの対応する画素位置における値のいずれかを、現時点の前記事後確率マップの対応する画素位置における値として決定する、ことを特徴とする。 [1] In order to solve the above problems, the super-resolution device according to one aspect of the present invention is a super-resolution device that obtains a high-resolution image from a low-resolution image by iterative calculation, and is a time point in the iterative calculation. A high-resolution image candidate storage unit that stores high-resolution image candidates corresponding to the above, a posterior probability map storage unit that stores a posterior probability map corresponding to a time point in the iterative calculation, and a prior time point in the iterative calculation. A sample generation unit that probabilistically generates a temporary high resolution image from a high resolution image candidate according to a predetermined probability model, and each local region between the temporary high resolution image and the low resolution image. A likelihood map calculation unit that evaluates the consistency and outputs the spatial distribution of the consistency as a likelihood map, and a space that evaluates the degree to which the prior high resolution image is a natural image and is a natural image. The prior probability map calculation unit that outputs the distribution as a prior probability map and the high resolution image candidate are updated and stored in the high resolution image candidate storage unit, and the posterior probability map is updated and the posterior posterior It includes a sample update unit to be stored in the probability map storage unit, and a representative value calculation unit to determine and output a value at each pixel position of the high-resolution image from the sequence at the time of the high-resolution image candidate. , The sample update unit is the product of the value at each pixel position of the likelihood map output by the likelihood map calculation unit and the value at the corresponding pixel position of the prior probability map output by the prior probability map calculation unit. And, based on the value at the corresponding pixel position of the posterior probability map at the previous time point and the predetermined random number value, the pixel value at the pixel position of the prior high resolution image or the high resolution image at the previous time point. Any of the pixel values of the candidate at the pixel position is determined as the pixel value of the pixel position of the high resolution image candidate at the present time, and the value at each pixel position of the likelihood map output by the likelihood map calculation unit. And the product of the values at the corresponding pixel positions of the prior probability map output by the prior probability map calculation unit, and the value of the same pixel position of the probability map at the corresponding pixel positions of the posterior probability map at the previous time. , The product at the current corresponding pixel position, or the posterior probability map at the previous time point, based on the random number value used to determine the current pixel value of the high resolution image candidate. It is characterized in that any one of the values at the corresponding pixel positions of is determined as the value at the corresponding pixel position of the posterior probability map at the present time.

［２］また、本発明の一態様は、上記の超解像装置において、前記事前確率マップ演算部は、ニューラルネットワークによって構成され、高解像画像としてあるべき画像の例である正例と高解像画像としてあるべきではない画像の例である負例とを用いて前記ニューラルネットワークが予め学習済みである、ことを特徴とする。 [2] Further, in one aspect of the present invention, in the above-mentioned super-resolution device, the advance probability map calculation unit is composed of a neural network, and is an example of an image that should be a high-resolution image. It is characterized in that the neural network has been trained in advance using a negative example which is an example of an image which should not be a high-resolution image.

［３］また、本発明の一態様は、上記の超解像装置において、前記事前確率マップ演算部は、前記仮高解像画像の画素位置における空間方向の画素値の変化量が小さいほど前記仮高解像画像が自然な画像である度合いを高く評価して、前記事前確率マップを出力する、ことを特徴とする。 [3] Further, in one aspect of the present invention, in the above-mentioned super-resolution device, the advance probability map calculation unit increases the amount of change in the pixel value in the spatial direction at the pixel position of the provisional high-resolution image as small as possible. It is characterized in that the degree to which the provisional high-resolution image is a natural image is highly evaluated, and the advance probability map is output.

［４］また、本発明の一態様は、上記の超解像装置において、前記事前確率マップ演算部は、前記仮高解像画像の画素位置の所定範囲内における画素値の総変化量が小さいほど前記仮高解像画像が自然な画像である度合いを高く評価して、前記事前確率マップを出力する、ことを特徴とする。 [4] Further, in one aspect of the present invention, in the super-resolution device, the prior probability map calculation unit has a total change amount of pixel values within a predetermined range of pixel positions of the provisional high-resolution image. The smaller the value, the higher the degree to which the provisional high-resolution image is a natural image is evaluated, and the pre-probability map is output.

［５］また、本発明の一態様は、コンピューターを、反復演算により低解像画像から高解像画像を得る超解像装置であって、前記反復演算における時点に対応させた高解像画像候補を記憶する高解像画像候補記憶部と、前記反復演算における時点に対応させた事後確率マップを記憶する事後確率マップ記憶部と、前記反復演算における前時点の高解像画像候補から、所定の確率モデルにしたがって、仮高解像画像を確率的に生成する標本生成部と、前記仮高解像画像と前記低解像画像との間の局所領域ごとの整合度を評価して整合度の空間分布を尤度マップとして出力する尤度マップ演算部と、前記仮高解像画像が自然な画像である度合いを評価して自然な画像である度合いの空間分布を事前確率マップとして出力する事前確率マップ演算部と、前記高解像画像候補を更新して前記高解像画像候補記憶部に記憶させるとともに、前記事後確率マップを更新して前記事後確率マップ記憶部に記憶させる標本更新部と、前記高解像画像候補の時点の系列から、前記高解像画像の各画素位置における値を決定して出力する代表値演算部と、を具備し、前記標本更新部は、前記尤度マップ演算部が出力した尤度マップの各画素位置における値と前記事前確率マップ演算部が出力した前記事前確率マップの対応する画素位置における値の積と、前時点の前記事後確率マップの対応する画素位置における値と、所定の乱数値とに基づいて、前記仮高解像画像の当該画素位置における画素値または前時点の前記高解像画像候補の当該画素位置における画素値のいずれかを現時点の前記高解像画像候補の当該画素位置の画素値として決定するとともに、前記尤度マップ演算部が出力した尤度マップの各画素位置における値と前記事前確率マップ演算部が出力した前記事前確率マップの対応する画素位置における値の積と、前時点の前記事後確率マップの対応する画素位置における確率マップの同画素位置の値と、現時点の前記高解像画像候補の当該画素位置の画素値として決定するために用いた当該乱数値とに基づいて、現時点の対応する画素位置における前記積、または前時点の前記事後確率マップの対応する画素位置における値のいずれかを、現時点の前記事後確率マップの対応する画素位置における値として決定する、超解像装置として機能させるためのプログラムである。 [5] Further, one aspect of the present invention is a super-resolution device in which a computer obtains a high-resolution image from a low-resolution image by an iterative calculation, and a high-resolution image corresponding to a time point in the iterative calculation. Predetermined from the high-resolution image candidate storage unit that stores the candidates, the posterior probability map storage unit that stores the posterior probability map corresponding to the time point in the iterative calculation, and the high-resolution image candidate at the previous time point in the iterative calculation. According to the probabilistic model of, the sample generator that probabilistically generates a pseudo-high resolution image and the consistency of each local region between the temporary high-resolution image and the low-resolution image are evaluated and the consistency is achieved. A likelihood map calculation unit that outputs the spatial distribution of the above as a likelihood map, and an evaluation of the degree to which the provisional high-resolution image is a natural image, and outputs the spatial distribution of the degree of the natural image as a prior probability map. A sample in which the prior probability map calculation unit and the high-resolution image candidate are updated and stored in the high-resolution image candidate storage unit, and the posterior probability map is updated and stored in the posterior probability map storage unit. The sample update unit includes an update unit and a representative value calculation unit that determines and outputs a value at each pixel position of the high-resolution image from a sequence at the time of the high-resolution image candidate. The product of the value at each pixel position of the likelihood map output by the likelihood map calculation unit and the value at the corresponding pixel position of the prior probability map output by the prior probability map calculation unit, and the post-event of the previous time. Based on the value at the corresponding pixel position of the probability map and the predetermined random number value, the pixel value at the pixel position of the temporary high resolution image or the pixel value at the pixel position of the high resolution image candidate at the previous time point. Is determined as the pixel value of the pixel position of the high-resolution image candidate at the present time, and the value at each pixel position of the likelihood map output by the likelihood map calculation unit and the prior probability map calculation unit. The product of the values at the corresponding pixel positions of the prior probability map output by, the value of the same pixel position of the probability map at the corresponding pixel position of the posterior probability map at the previous time, and the high resolution image at the present time. Based on the random number value used to determine the pixel value of the candidate pixel position, the product at the current corresponding pixel position or the value at the corresponding pixel position of the prior probability map at the previous time. This is a program for functioning as a super-resolution device, which determines one of them as a value at a corresponding pixel position of the prior probability map at the present time.

本発明によれば、入力される低解像画像を基に、高精細で且つ確からしい高解像画像を効率よく得ることができる。 According to the present invention, a high-definition and reliable high-resolution image can be efficiently obtained based on an input low-resolution image.

本発明の第１実施形態による超解像装置の概略機能構成を示すブロック図である。It is a block diagram which shows the schematic functional structure of the super-resolution apparatus according to 1st Embodiment of this invention. 同実施形態による記憶部３００の概略構成を示す機能ブロック図である。It is a functional block diagram which shows the schematic structure of the storage part 300 by the same embodiment. 同実施形態による事前確率マップ演算部１５が利用するニューラルネットワークの概略構成を示す概略図である。It is a schematic diagram which shows the schematic structure of the neural network used by the prior probability map calculation unit 15 by the same embodiment. 同実施形態におけるニューラルネットワークの学習処理に用いるのに適した正例および負例の例を示す概略図である。It is the schematic which shows the example of the positive example and the negative example suitable for use in the learning process of the neural network in the same embodiment.

以下、本発明の実施形態について、図面を参照しながら説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

［第１実施形態］
図１は、本実施形態による超解像装置の概略機能構成を示すブロック図である。図示するように、超解像装置１は、遅延部１０と、遅延部１１と、標本生成部１２と、切替部１３と、尤度マップ演算部１４と、事前確率マップ演算部１５と、乗算部１６と、標本更新部１８と、代表値演算部１９と、初期化部２０と、記憶部３００と、を含んで構成される。これらの各機能部は、例えば、電子回路を用いて実現される。また、記憶部３００は、そして必要に応じて他の機能部は、半導体メモリーや磁気ハードディスク装置などといった記憶手段を内部に備える。また、各機能を、コンピューターおよびソフトウェアによって実現するようにしてもよい。 [First Embodiment]
FIG. 1 is a block diagram showing a schematic functional configuration of the super-resolution device according to the present embodiment. As shown in the figure, the super-resolution device 1 multiplies the delay unit 10, the delay unit 11, the sample generation unit 12, the switching unit 13, the likelihood map calculation unit 14, and the prior probability map calculation unit 15. It is composed of a unit 16, a sample update unit 18, a representative value calculation unit 19, an initialization unit 20, and a storage unit 300. Each of these functional parts is realized by using, for example, an electronic circuit. In addition, the storage unit 300, and if necessary, other functional units include storage means such as a semiconductor memory and a magnetic hard disk device. In addition, each function may be realized by a computer and software.

この超解像装置１は、マルコフ連鎖モンテカルロ法のメトロポリス・ヘイスティングス法による解法を応用した手法により、低解像画像から高解像画像へのベイズ推定を行う装置である。超解像装置１は、反復演算により低解像画像から高解像画像を得る。なお、「メトロポリス・ヘイスティングス法による解法を応用」の「応用」とは、特に、画像を構成する個々の画素の画素値が各々独立に確率的に標本化されるよう構成した手法であることを意味する。また、ベイズ推定を行う超解像装置１は、下の式（１）により高解像画像を求める。 This super-resolution device 1 is a device that performs Bayesian estimation from a low-resolution image to a high-resolution image by a method applying the solution method by the Metropolis-Hastings method of the Markov chain Monte Carlo method. The super-resolution device 1 obtains a high-resolution image from a low-resolution image by iterative calculation. The "application" of "applying the solution method by the Metropolis-Hastings method" is, in particular, a method configured so that the pixel values of the individual pixels constituting the image are sampled independently and probabilistically. Means. Further, the super-resolution device 1 that performs Bayesian estimation obtains a high-resolution image by the following equation (1).

式（１）において、Ｙは入力される低解像画像であり、Ｘは低解像画像に対応する高解像画像である。また、Ｐ（Ｘ）は、高解像画像Ｘが高解像画像としてふさわしいか否かを表す事前確率である。また、Ｐ（Ｙ｜Ｘ）は、高解像画像Ｘを低解像化したときに低解像画像Ｙになる度合いを表す条件付き確率（尤度）である。また、Ｘ＾（Ｘの上にハット）は、入力されるＹに対して超解像装置１が求めるべき高解像画像である。入力されるＹに対して高解像画像Ｘ＾（Ｘの上にハット）を求めるにあたり、Ｐ（Ｙ）はＸに依存せず一定であるため、定数として扱うことができる。
つまり、超解像装置１は、尤度と事前確率の積が事後確率に比例することに基づき、事後確率Ｐ（Ｘ｜Ｙ）を最大化する高解像画像を求める。 In the formula (1), Y is an input low-resolution image, and X is a high-resolution image corresponding to the low-resolution image. Further, P (X) is a prior probability indicating whether or not the high-resolution image X is suitable as a high-resolution image. Further, P (Y | X) is a conditional probability (likelihood) indicating the degree to which the high-resolution image X becomes a low-resolution image Y when the high-resolution image X is reduced in resolution. Further, X ^ (hat on X) is a high-resolution image that the super-resolution device 1 should obtain for the input Y. In obtaining the high-resolution image X ^ (hat on top of X) for the input Y, P (Y) is constant without depending on X, and can be treated as a constant.
That is, the super-resolution device 1 obtains a high-resolution image that maximizes the posterior probability P (X | Y) based on the product of the likelihood and the prior probability being proportional to the posterior probability.

超解像装置１への入力は、入力低解像画像Ｙである。画像における画素位置をｒと表す。なお、画素位置rは画像内の座標を特定するための数値である。例えば、rは座標値そのものを表す2次元のベクトル値であってもよいし、画像の各標本位置に一対一対応させた1次元のインデックス値であってもよい。画素位置rにおける画像Ｙの画素値をＹ（ｒ）と書く。以下において、Ｙ以外の画像（後述する確率マップも同様）に関しても、同様に「（ｒ）」を後置することにより、画素位置ｒにおける画素値への参照を表すものとする。また、各種画像の画素値はスカラーであってもよいし、色別や波長別の輝度等（輝度、色差、色相、彩度など）を表すベクトルであってもよい。 The input to the super-resolution device 1 is the input low-resolution image Y. The pixel position in the image is represented by r. The pixel position r is a numerical value for specifying the coordinates in the image. For example, r may be a two-dimensional vector value representing the coordinate value itself, or a one-dimensional index value having a one-to-one correspondence with each sample position of the image. The pixel value of the image Y at the pixel position r is written as Y (r). In the following, for images other than Y (the same applies to the probability map described later), the reference to the pixel value at the pixel position r is represented by adding "(r)" in the same manner. Further, the pixel values of various images may be scalars, or may be vectors representing luminance and the like (luminance, color difference, hue, saturation, etc.) for each color and wavelength.

超解像装置１からの出力を出力高解像画像Ｘ＾（Ｘの上にハット）とおく。超解像装置１は反復演算により出力高解像画像Ｘ＾（Ｘの上にハット）を導く。超解像装置１は、後述する記憶部３００内の高解像画像候補記憶部３０２に、高解像画像候補を記憶する。上記の反復演算における第t回（tは１以上の自然数）の反復後における高解像画像候補をＸ_ｔと表す。 The output from the super-resolution device 1 is set as the output high-resolution image X ^ (hat on top of X). The super-resolution device 1 derives an output high-resolution image X ^ (a hat on X) by iterative calculation. The super-resolution device 1 stores high-resolution image candidates in the high-resolution image candidate storage unit 302 in the storage unit 300, which will be described later. The t time in the iterative calculation (t is a natural number of 1 or more) represents a high-resolution image candidate after repeating the X _t.

また、超解像装置１は、後述する記憶部３００内の事後確率マップ記憶部３０４に、高解像画像候補Ｘ_ｔの画素ごとの画素値の確からしさを表す画像（これを、事後確率マップと呼ぶ）Ｐ_ｔを記憶する。なお、事後確率マップＰ_ｔは、事後確率そのものが記録されたものでなくとも、事後確率Ｐ（Ｘ_ｔ（ｒ）｜Ｙ（ｒ））に比例する値を記録したものであればよい。例えば、画素位置ごとの、Ｘ_ｔとＹの画素値対に対する同時確率Ｐ（Ｘ_ｔ（ｒ），Ｙ（ｒ））、即ち、Ｐ（Ｘ_ｔ（ｒ）｜Ｙ（ｒ））Ｐ（Ｙ（ｒ））を事後確率マップＰ_ｔ
として記憶するようにしてもよい。 Further, super-resolution device 1, the posterior probability map storage unit 304 in the storage unit 300 described later, an image (this representing the likelihood of the pixel value of each pixel of the high resolution image candidate X _t, posterior probability map ) P _t is stored. The posterior probability map P _t may be a map that records a value proportional to the posterior probability P (X _t (r) | Y (r)) even if the posterior probability itself is not recorded. For example, the simultaneous probabilities P (X _t (r), Y (r)) _{for each pixel value pair of X t} _{and Y for each pixel position, that is, P (X t} (r) | Y (r)) P (Y). (R)) is the posterior probability map P _t
It may be memorized as.

次に、超解像装置１を構成する各部について説明する。 Next, each part constituting the super-resolution device 1 will be described.

遅延部１０は、高解像画像候補Ｘ_ｔを高解像画像候補記憶部３０２に記憶させるとともに、１時点遅延した（即ち、ｔに対してｔ−１の）高解像画像候補Ｘ_ｔ−１を高解像画像候補記憶部３０２から読み出して出力する。なお、高解像画像候補の初期値Ｘ_０は任意の画像であってよい。
遅延部１１は、事後確率マップＰ_ｔを事後確率マップ記憶部３０４に記憶させるとともに、１時点遅延した（即ち、ｔに対してｔ−１の）事後確率マップＰ_ｔ−１を事後確率マップ記憶部３０４から読み出して出力する。なお、事後確率マップの初期値Ｐ_０は、全画素位置について０以下の任意の値を持つマップである。 Delay unit 10, together with the stores the high resolution image candidates X _t in the high-resolution image candidate storage unit 302, 1 and the time delay (i.e., the t-1 against t) high resolution image candidate X _{t- 1} is read from the high-resolution image candidate storage unit 302 and output. _{The initial value X 0} of the high-resolution image candidate may be any image.
The delay unit 11 stores the posterior probability map P _t _{in the posterior probability map storage unit 304, and stores the posterior probability map P t-1} delayed by one time point (that is, t-1 with respect to t) in the posterior probability map storage unit 304. Read from unit 304 and output. The initial value P ₀ of the posterior probability map is a map having an arbitrary value of 0 or less for all pixel positions.

標本生成部１２は、高解像画像候補Ｘ_ｔ−１から仮の高解像画像Ｘ’（「仮高解像画像」とも呼ぶ）を生成し、出力する。つまり、反復演算における前時点の高解像画像候補から、所定の確率モデルにしたがって、仮高解像画像を確率的に生成する。
具体的には、標本生成部１２は、例えば、高解像画像候補Ｘ_ｔ−１に対して雑音を付加することで仮の高解像画像Ｘ’を生成する。例えば、Ｎ（μ,Σ）を、平均値μ、分散共分散行列Σ（1次元の場合にあっては分散値）の正規分布としたとき、標本生成部１２は、下の式（２）により仮の高解像画像Ｘ’を生成する。 The sample generation unit 12 generates and outputs a temporary high-resolution image X'(also referred to as a "temporary high-resolution image") from the high-resolution _{image candidate X t-1.} That is, a tentative high-resolution image is stochastically generated from the high-resolution image candidates at the previous time point in the iterative operation according to a predetermined probability model.
Specifically, the sample generation unit 12 generates a temporary high-resolution image X'by adding noise to, for example, the high-resolution image candidate X _t-1. For example, when N (μ, Σ) is a normal distribution of the mean value μ and the variance-covariance matrix Σ (dispersion value in the case of one dimension), the sample generation unit 12 uses the following equation (2). Generates a temporary high-resolution image X'.

つまり、標本生成部１２は、各時点および各画素について、平均がゼロで、分散共分散行列がΣの正規分布に従う雑音を高解像画像候補に付加することにより、仮の高解像画像Ｘ’を生成する。ここで、Σは、半正定値対称行列（１次元の場合にあっては非負の実定数）である。 That is, the sample generation unit 12 adds noise to the high-resolution image candidate, which has an average of zero and the variance-covariance matrix follows a normal distribution of Σ, at each time point and each pixel, thereby tentatively high-resolution image X. 'Generate. Here, Σ is a semi-positive definite matrix (a non-negative real constant in the case of one dimension).

切替部１３は、標本生成部１２からの入力または初期化部２０からの入力のいずれか一方を選択し、選択された信号を出力する。切替部１３は、例えば、超解像装置１の反復演算処理開始時（第１回）についてのみ初期化部２０からの入力を選択し、それ以降の回では標本生成部１２からの入力を選択する。つまり、切替部１３は、仮の高解像画像Ｘ’を出力するか、その初期値を出力するかを切り替える。切替部１３からの出力は、尤度マップ演算部１４と事前確率マップ演算部１５とに供給される。 The switching unit 13 selects either the input from the sample generation unit 12 or the input from the initialization unit 20, and outputs the selected signal. For example, the switching unit 13 selects the input from the initialization unit 20 only at the start of the iterative calculation process (first time) of the super-resolution device 1, and selects the input from the sample generation unit 12 at the subsequent times. To do. That is, the switching unit 13 switches between outputting the temporary high-resolution image X'and outputting the initial value thereof. The output from the switching unit 13 is supplied to the likelihood map calculation unit 14 and the prior probability map calculation unit 15.

尤度マップ演算部１４は、仮の高解像画像Ｘ’（但し、初回の処理においてはその初期値）が、入力低解像画像Ｙに整合するか否かを画素位置ごとに評価し、その評価値を画像（尤度マップＬ）として出力する。尤度マップ演算部１４は、例えば、仮の高解像画像Ｘ’を基に、入力低解像画像Ｙの解像度まで解像度の削減を行い、解像度削減した仮の高解像画像Ｘ’と入力低解像画像Ｙとの誤差画像に基づいて尤度マップＬを求める。つまり、尤度マップ演算部１４は、仮高解像画像と低解像画像との間の局所領域ごとの整合度を評価して整合度の空間分布を尤度マップとして出力する。
例えば、仮の高解像画像Ｘ’が入力低解像画像Ｙに対して水平方向にｍ_ｘ倍、且つ垂直方向にｍ_ｙ倍の解像度であるとする。そして、仮の高解像画像Ｘ’を入力低解像画像Ｙの大きさまで解像度削減した画像をＹ’とおく。
具体的には、例えば、尤度マップ演算部１４は、まず、仮の高解像画像Ｘ’を基に、下の式（３）によるサブサンプリングを行うことによって画像Ｙ’を得る。 The likelihood map calculation unit 14 evaluates for each pixel position whether or not the provisional high-resolution image X'(however, its initial value in the initial processing) matches the input low-resolution image Y. The evaluation value is output as an image (likelihood map L). For example, the likelihood map calculation unit 14 reduces the resolution to the resolution of the input low-resolution image Y based on the temporary high-resolution image X', and inputs the temporary high-resolution image X'with the reduced resolution. The likelihood map L is obtained based on the error image with the low resolution image Y. That is, the likelihood map calculation unit 14 evaluates the consistency of each local region between the temporary high resolution image and the low resolution image, and outputs the spatial distribution of the consistency as a likelihood map.
For example, m _x magnification in the horizontal direction with respect to the provisional high-resolution image X 'input low-resolution image Y, and assumed to be m _y times the resolution in the vertical direction. Then, the image whose resolution is reduced to the size of the input low-resolution image Y by inputting the temporary high-resolution image X'is referred to as Y'.
Specifically, for example, the likelihood map calculation unit 14 first obtains an image Y'by performing subsampling according to the following equation (3) based on the provisional high-resolution image X'.

あるいは、尤度マップ演算部１４は、式（３）によるサブサンプリングを行う代わりに、仮の高解像画像Ｘ’に、間引きフィルターＤを畳み込んだ結果をサブサンプリングすることで画像Ｙ’を得るようにしてもよい。例えば、間引きフィルターＤとして２次元Ｌａｎｃｚｏｓ−３フィルターを用いてよい。この場合、尤度マップ演算部１４は、下の式（４）および式（５）により、サブサンプリングを行う。 Alternatively, the likelihood map calculation unit 14 subsamples the result of convolving the thinning filter D into the temporary high-resolution image X'instead of subsampling by the equation (3) to obtain the image Y'. You may try to get it. For example, a two-dimensional Lanczos-3 filter may be used as the thinning filter D. In this case, the likelihood map calculation unit 14 performs subsampling according to the following equations (4) and (5).

なお、式（４）において、δはディラックのデルタ関数であり、Ｒ^２は実2次元ベクトル空間である。また、画素位置ｒが画像外や格子点以外である場合には、Ｘ’（ｒ）＝０と定義する。式（５）は、式（４）を離散表現したものである。 In equation (4), δ is a Dirac delta function, and R ² is a real two-dimensional vector space. Further, when the pixel position r is outside the image or other than the grid points, it is defined as X'(r) = 0. Equation (5) is a discrete representation of Equation (4).

また、式（４）および式（５）においては、床関数（floor function）および天井関数（ceiling function）を用いている。下の式（６）は、床関数を表すものであり、実数ｚを超えない最大の整数を返す。また、下の式（７）は、天井関数を表すものであり、実数ｚより小さくない最小の整数を返す。 Further, in the equations (4) and (5), the floor function and the ceiling function are used. Equation (6) below expresses the floor function and returns the maximum integer that does not exceed the real number z. Further, the following equation (7) represents the ceiling function and returns the smallest integer not smaller than the real number z.

以上のように画像Ｙ’（仮の高解像画像に基づく低解像画像）が得られると、続いて尤度マップ演算部１４は、画像Ｙ’と入力低解像画像Ｙとに基づいて尤度マップＬを導出する。Ｌの解像度は、仮の高解像画像Ｘ’の解像度と同一である。 When the image Y'(low resolution image based on the tentative high resolution image) is obtained as described above, the likelihood map calculation unit 14 subsequently bases the image Y'and the input low resolution image Y on. The likelihood map L is derived. The resolution of L is the same as the resolution of the provisional high-resolution image X'.

具体的には、例えば、尤度マップ演算部１４は、入力低解像画像Ｙと画像Ｙ’との誤差に基づいて求めた誤差マップＥをアップコンバートすることにより、尤度マップＬを得る。尤度マップ演算部１４は、例えば、下の式（８）により誤差マップＥを求める。 Specifically, for example, the likelihood map calculation unit 14 obtains the likelihood map L by up-converting the error map E obtained based on the error between the input low-resolution image Y and the image Y'. The likelihood map calculation unit 14 obtains the error map E by, for example, the following equation (8).

式（８）において、σは正の実定数であり、例えばσ＝１とする。誤差マップＥを定義するにあたっては、Ｙ（ｒ）とＹ’（ｒ）の差の絶対値が小さいほど、Ｅ（ｒ）の値が大きくなるようにする。式（８）もそのように誤差マップＥを定義している。 In equation (8), σ is a positive real constant, for example, σ = 1. In defining the error map E, the smaller the absolute value of the difference between Y (r) and Y'(r), the larger the value of E (r). Equation (8) also defines the error map E in this way.

尤度マップ演算部１４は、誤差マップＥをアップコンバートすることにより尤度マップＬを求める。
一例として、尤度マップ演算部１４は、下の式（９）により尤度マップＬを求める。 The likelihood map calculation unit 14 obtains the likelihood map L by up-converting the error map E.
As an example, the likelihood map calculation unit 14 obtains the likelihood map L by the following equation (9).

なお、式（９）においても、前述の床関数を用いている。式（９）は、誤差マップＥを０次補間してアップコンバートすることにより、尤度マップＬを求めるものである。
あるいは、別の例として、補間フィルターを畳み込んで小数画素位置の画素値を求めることによって、誤差マップＥをアップコンバートして、尤度マップＬを求めてもよい。例えば、尤度マップ演算部１４は、補間フィルター（Ｆとする）にＬａｎｃｚｏｓ−３フィルターを用いて、下の式（１０）によりアップコンバートを行い、尤度マップＬを求める。 The floor function described above is also used in equation (9). Equation (9) obtains the likelihood map L by up-converting the error map E by zero-order interpolation.
Alternatively, as another example, the error map E may be up-converted to obtain the likelihood map L by convolving the interpolation filter to obtain the pixel value at the decimal pixel position. For example, the likelihood map calculation unit 14 uses the Lanczos-3 filter as the interpolation filter (referred to as F) and up-converts according to the following equation (10) to obtain the likelihood map L.

事前確率マップ演算部１５は、仮の高解像画像Ｘ’に基づいて、事前確率マップＲを求めて出力する。事前確率マップＲは、仮の高解像画像Ｘ’がどの程度自然な画像であるか（自然な画像である度合い）を画素ごとに定量化した数値のマップである。自然な画像である度合いを、「画像らしさ」とも呼ぶ。つまり、事前確率マップ演算部１５は、仮高解像画像が自然な画像である度合いを評価して自然な画像である度合いの空間分布を事前確率マップとして出力する。 The prior probability map calculation unit 15 obtains and outputs the prior probability map R based on the tentative high-resolution image X'. The prior probability map R is a numerical map that quantifies how natural the tentative high-resolution image X'is (the degree of being a natural image) for each pixel. The degree to which an image is natural is also called "image-likeness". That is, the prior probability map calculation unit 15 evaluates the degree to which the temporary high resolution image is a natural image, and outputs the spatial distribution of the degree to which the temporary high resolution image is a natural image as a prior probability map.

本実施形態において、自然な画像とは、例えば空間方向における画素値の変化の度合いの大小などのみによって特徴づけられるものではない。一例として、都市風景の画像においては、建物等が有する直線的な成分が多く含まれる場合がある。このとき、その直線の近傍の局所的領域において、空間方向の画素値変化の度合い（勾配等）が大きいこともあるが、そのような画像は現実の風景を表すものであり、自然な画像である度合いは高いと言える。特定の領域における画素間の画素値の変化を単純な算術のみによって解析するだけでは、それが「自然な画像」であるか否かを適切に評価することは困難である。本実施形態は、自然な画像である度合いを適切に評価するために次に述べるような手法を用いる。
即ち、本実施形態による事前確率マップ演算部１５は、仮の高解像画像Ｘ’に含まれる局所的な画素値列が、自然な画像として生起しやすいパターンである度合いを定量化して、事前確率マップＲの画素ごとの数値を求める。 In the present embodiment, the natural image is not characterized only by, for example, the degree of change in the pixel value in the spatial direction. As an example, an image of an urban landscape may contain many linear components of a building or the like. At this time, the degree of change in pixel value in the spatial direction (gradient, etc.) may be large in the local region near the straight line, but such an image represents a real landscape and is a natural image. It can be said that the degree is high. It is difficult to properly evaluate whether or not a change in pixel value between pixels in a specific region is a "natural image" simply by analyzing it by simple arithmetic. In this embodiment, the following method is used in order to appropriately evaluate the degree of being a natural image.
That is, the prior probability map calculation unit 15 according to the present embodiment quantifies the degree to which the local pixel value sequence included in the temporary high-resolution image X'is a pattern that easily occurs as a natural image, and preliminarily. The numerical value for each pixel of the probability map R is obtained.

具体的には、事前確率マップ演算部１５は、例えば、上記の局所的な画素値列が自然な画像として生起しやすいパターンであるか否かを、機械学習により構築した判別器を用いて、定量化する。例えば、その機械学習に用いて構築する判別器として、事前確率マップ演算部１５は、ニューラルネットワークを用いるようにしてよい。ニューラルネットワークとしては、典型的には、畳み込みニューラルネットワーク（ＣＮＮ）を用いることができる。畳み込みニューラルネットワークを用いた判別器の構成については、後で、別の図を参照しながら説明する。 Specifically, the prior probability map calculation unit 15 uses, for example, a discriminator constructed by machine learning to determine whether or not the above-mentioned local pixel value sequence is a pattern that easily occurs as a natural image. Quantify. For example, the prior probability map calculation unit 15 may use a neural network as a discriminator constructed by using the machine learning. As the neural network, a convolutional neural network (CNN) can be typically used. The configuration of the discriminator using the convolutional neural network will be described later with reference to another figure.

乗算部１６は、尤度マップ演算部１４によって求められた尤度マップＬと、事前確率マップ演算部１５によって求められた事前確率マップＲとを、画素ごとに乗算する。乗算部１６は、画素ごとに乗算した結果からなる画像を、仮の事後確率マップＰ’（「仮事後確率マップ」とも呼ぶ）として出力する。具体的には、乗算部１６は、下の式（１１）によって、仮の事後確率マップＰ’を求める。 The multiplication unit 16 multiplies the likelihood map L obtained by the likelihood map calculation unit 14 and the prior probability map R obtained by the prior probability map calculation unit 15 for each pixel. The multiplication unit 16 outputs an image composed of the result of multiplication for each pixel as a temporary posterior probability map P'(also referred to as a "temporary posterior probability map"). Specifically, the multiplication unit 16 obtains a tentative posterior probability map P'by the following equation (11).

標本更新部１８は、高解像画像候補Ｘ_ｔ−１を更新して高解像画像候補Ｘ_ｔを出力するとともに、事後確率マップＰ_ｔ−１を更新して事後確率マップＰ_ｔを出力する。 The sample update unit 18 updates the high-resolution image candidate X _t-1 to output the high-resolution image candidate X _t , updates the posterior probability map P _t-1 , and outputs the posterior probability map P _t. ..

標本更新部１８は、高解像画像候補に関しては、Ｐ’の画素値Ｐ’（ｒ）とＰ_ｔ−１の画素値Ｐ_ｔ−１（ｒ）との比に基づき、仮の高解像画像の画素値Ｘ’（ｒ）を、時刻ｔにおける高解像画像候補の画素値Ｘ_ｔ（ｒ）として採択するか否かを確率的に決定する。そして、標本更新部１８は、高解像画像候補Ｘ_ｔ−１を更新して高解像画像候補Ｘ_ｔを生成し、出力する。そのため、標本更新部１８は、遅延部１０から出力された高解像画像候補Ｘ_ｔ−１と、遅延部１１から出力された事後確率マップＰ_ｔ−１と、切替部１３から出力された仮の高解像画像Ｘ’と、乗算部１６から出力された仮の事後確率マップＰ’とを取得する。 Sample updating unit 18, with respect to the high resolution image candidates, based on the ratio between the pixel value P _t-1 of the 'pixel value P' (r) and _{_{P t-1 P (r)}} , the provisional high-resolution It is stochastically determined whether or not the pixel value X'(r) of the image _{is adopted as the pixel value X t (r) of the high-resolution image candidate at time t.} Then, the sample update unit 18 updates the high-resolution image candidate X _t-1 to generate the high-resolution image candidate X _t and outputs it. Therefore, the sample update unit 18 includes the high-resolution image candidate X _t-1 _{output from the delay unit 10, the posterior probability map P t-1} output from the delay unit 11, and the provisional output from the switching unit 13. The high-resolution image X'of the above and the temporary posterior probability map P'output from the multiplication unit 16 are acquired.

なお、標本更新部１８は、マルコフ連鎖モンテカルロ法のメトロポリス・ヘイスティングス法による解法における標本の採択・棄却と同様の規則によって、上記の採択を行う。即ち、標本更新部１８は、Ｐ’、Ｐ_ｔ−１、および一様分布Ｕ（０，１）に従う標本値ｕ_ｔ（ｒ）に基づき、高解像画像候補Ｘ_ｔの各画素位置rにおける値Ｘ_ｔ（ｒ）として、Ｘ’（ｒ）を採択するか、Ｘ’（ｒ）を棄却してＸ_ｔ−１（ｒ）の値を維持するかを決定する。 The sample update unit 18 adopts the above according to the same rules as the selection / rejection of the sample in the solution method by the Metropolis-Hastings method of the Markov chain Monte Carlo method. That is, the sample update unit 18, P _', the _{P t-1,} and on the basis of the sample values _u t (r) according to the uniform distribution U (0, 1), each pixel position r of the high-resolution image candidate _{X t} It is determined _{whether X'(r) is adopted as the value X t} (r) or X'(r) is rejected and _{the value of X t-1} (r) is maintained.

例えば、標本更新部１８は、下の式（１２）により、画素位置ｒごとに、高解像画像候補の画素値Ｘ_ｔ（ｒ）を決定する。 _{For example, the sample update unit 18 determines the pixel value X t} (r) of the high-resolution image candidate for each pixel position r by the following equation (12).

なお、式（１２）において、ｕ_ｔ（ｒ）は、毎時点（ｔごと）、且つ毎画素位置（ｒごと）においてその都度標本する（乱数を発生する）ことにより得られる値である。そのため、標本更新部１８は、乱数発生器または疑似乱数発生器を用いる。また、一様分布Ｕ（０，１）にしたがう標本値ｕ_ｔ（ｒ）を、例えば、メルセンヌ・ツイスタ等の疑似乱数発生器によって近似的に生成してもよい。つまり、疑似乱数発生器等によって、一様分布Ｕ（０，１）を近似するものであってもよい。 In the equation (12), u t _(r) is (each t) every time, each time (for generating a random number) to the specimen at each pixel position (each r) and a value obtained by. Therefore, the sample update unit 18 uses a random number generator or a pseudo random number generator. Moreover, the uniform distribution U according to (0,1) sampled value u _{t (r),} for example, may be approximated generated by a pseudo-random number generator such as a Mersenne Twister. That is, the uniform distribution U (0,1) may be approximated by a pseudo random number generator or the like.

また、標本更新部１８は、事後確率マップに関しては、Ｐ’の画素値Ｐ’（ｒ）とＰ_ｔ−１の画素値Ｐ_ｔ−１（ｒ）との比に基づき、仮の事後確率マップの画素値Ｐ’（ｒ）を、時刻ｔにおける事後確率マップの画素値Ｐ_ｔ（ｒ）として採択するか否かを確率的に決定する。具体的には、標本更新部１８は、Ｐ’（ｒ）、Ｐ_ｔ−１（ｒ）、および一様分布Ｕ（０，１）にしたがう標本値ｕ_ｔ（ｒ）に基づいて、下の式（１３）により、出力すべき事後確率マップＰ_ｔの各画素位置ｒにおける値Ｐ_ｔ（ｒ）を決定する。 Further, the sample update unit 18, for posterior probability map, based on the ratio between the pixel value P _t-1 of the 'pixel value P' (r) and _{_{P t-1 P (r)}} , the temporary posterior probability map It is probabilistically determined whether or not the pixel value P'(r) of _{is adopted as the pixel value P t (r) of the posterior probability map at time t.} Specifically, the sample update unit 18 based on the _{P '(r), P t} -1 (r), and uniform distribution according to U (0, 1) sample value _u t (r), below _{The value P t} (r) at each pixel position r of the _{posterior probability map P t} to be output is determined by the equation (13).

なお、式（１３）におけるｕ_ｔ（ｒ）としては、式（１２）による計算を行う際に生成したｕ_ｔ（ｒ）の数値をそのまま用いるものとする。つまり、ｔとｒが決まれば、式（１２）におけるｕ_ｔ（ｒ）と式（１２）におけるｕ_ｔ（ｒ）とは、同じ値である。 _{As the ut} (r) in the equation (13), the numerical value of the _ut (r) generated when the calculation by the equation (12) is performed is used as it is. In other words, once the t and r, and _u t (r) in the _u t (r) and formula (12) in equation (12), the same value.

以上、まとめると、標本更新部１８は、高解像画像候補を更新して、更新後の高解像画像候補を高解像画像候補記憶部に記憶させるとともに、事後確率マップを更新して、更新後の事後確率マップを事後確率マップ記憶部に記憶させる。
その処理において、標本更新部１８は、尤度マップ演算部１４が出力した尤度マップの各画素位置における値と事前確率マップ演算部１５が出力した事前確率マップの対応する画素位置における値の積（乗算部１６が出力する積）と、前時点の事後確率マップの対応する画素位置における値と、所定の乱数値とに基づいて、仮高解像画像の当該画素位置における画素値または前時点の高解像画像候補の当該画素位置における画素値のいずれかを現時点の高解像画像候補の当該画素位置の画素値として決定する。
また、標本更新部１８は、尤度マップ演算部１４が出力した尤度マップの各画素位置における値と事前確率マップ演算部１５が出力した事前確率マップの対応する画素位置における値の積（乗算部１６が出力する積）と、前時点の事後確率マップの対応する画素位置における確率マップの同画素位置の値と、現時点の前記高解像画像候補の当該画素位置の画素値として決定するために用いた当該乱数値（上記の「所定の乱数値」を、当該時点における当該画素位置の処理において再利用する）とに基づいて、現時点の対応する画素位置における積、または前時点の事後確率マップの対応する画素位置における値のいずれかを、現時点の事後確率マップの対応する画素位置における値として決定する。 In summary, the sample update unit 18 updates the high-resolution image candidates, stores the updated high-resolution image candidates in the high-resolution image candidate storage unit, and updates the posterior probability map. The posterior probability map after the update is stored in the posterior probability map storage unit.
In that process, the sample update unit 18 is the product of the value at each pixel position of the likelihood map output by the likelihood map calculation unit 14 and the value at the corresponding pixel position of the prior probability map output by the prior probability map calculation unit 15. Based on (the product output by the multiplication unit 16), the value at the corresponding pixel position of the posterior probability map at the previous time point, and the predetermined random number value, the pixel value or the previous time point at the pixel position of the prior high resolution image. Any one of the pixel values at the pixel position of the high-resolution image candidate is determined as the pixel value at the pixel position of the current high-resolution image candidate.
Further, the sample update unit 18 is the product (multiplication) of the value at each pixel position of the likelihood map output by the likelihood map calculation unit 14 and the value at the corresponding pixel position of the priorior probability map output by the priorior probability map calculation unit 15. (Product output by unit 16), the value of the same pixel position of the probability map at the corresponding pixel position of the posterior probability map at the previous time, and the pixel value of the pixel position of the high resolution image candidate at the present time. Based on the random value used in (the above "predetermined random value" is reused in the processing of the pixel position at the current time), the product at the corresponding pixel position at the present time, or the posterior probability at the previous time. Any of the values at the corresponding pixel positions of the map is determined as the value at the corresponding pixel positions of the current posterior probability map.

代表値演算部１９は、標本更新部１８から時系列的に出力される高解像画像候補Ｘ_ｔに対し、画素位置rごとに時間方向に見た代表値を演算し、その結果に基づいて、出力高解像画像Ｘ＾（Ｘの上にハット）を生成する。
具体的には、例えば、代表値演算部１９は、時刻Ｓから時刻Ｔ（ＳおよびＴはそれぞれ１以上の自然数、且つＳ≦Ｔ）までの間の高解像画像の画素値列（Ｘ_ｔ（ｒ））_{ｔ＝Ｓ，Ｓ＋１，Ｓ＋２，・・・，Ｔ−２，Ｔ−１，１}に基づき、下の式（１４）の計算を行う。 The representative value calculation unit 19 calculates a representative value viewed in the time direction for each pixel position r with respect to the _{high-resolution image candidate X t} output in time series from the sample update unit 18, and based on the result. , Generates output high resolution image X ^ (hat on top of X).
_{Specifically, for example, the representative value calculation unit 19 is a pixel value sequence (X t} ) of a high-resolution image between time S and time T (S and T are natural numbers of 1 or more, respectively, and S ≦ T). (R)) The following equation (14) is calculated based on _{t = S, S + 1, S + 2, ..., T-2, T-1,1.}

つまり、式（１４）を用いる場合、代表値演算部１９は、画素位置rごとに、時刻Ｓから時刻Ｔまでの時間区間における画素値列（Ｘ_ｔ（ｒ））_{ｔ＝Ｓ，Ｓ＋１，Ｓ＋２，・・・，Ｔ−２，Ｔ−１，１}の期待値（平均値）を求める。これにより、代表値演算部１９は、出力高解像画像Ｘ＾（Ｘの上にハット）を決定する。 That is, when the equation (14) is used, the representative value calculation unit 19 performs the pixel value sequence (X _t (r)) _{t = S, S + 1, S + 2 in the time interval from the time S to the time T for each pixel position r. , ..., T-2, T-1,1} Expected values (mean values) are obtained. As a result, the representative value calculation unit 19 determines the output high-resolution image X ^ (hat on top of X).

超解像装置１による反復演算において、高解像画像候補Ｘ_ｔは、反復の比較的初期の段階においてはその初期値の影響を強く受ける。したがって、初期値の影響が十分になくなる頃合いを以て上記の時刻Ｓを定めることが好ましい。一例として、Ｓ＝１００とする。また、高解像画像候補Ｘ_ｔは、高解像画像としての確からしさを表す確率密度関数からの標本である。よって、式（１４）において相加平均をとる標本数（Ｔ−Ｓ＋１）が十分に大きくないと信頼性のある出力高解像画像Ｘ＾（Ｘの上にハット）が得られない。例えば、１００個の標本からの期待値を得るためには、Ｓ＝１００の場合にはＴ＝１９９とすればよい。 In the iterative calculation by the super-resolution device 1, the high-resolution image candidate _Xt is strongly influenced by the initial value in the relatively early stage of the iteration. Therefore, it is preferable to set the above time S at a time when the influence of the initial value is sufficiently eliminated. As an example, let S = 100. Further, the high-resolution image candidate _Xt is a sample from the probability density function representing the certainty as a high-resolution image. Therefore, a reliable output high-resolution image X ^ (hat on X) cannot be obtained unless the number of samples (TS + 1) for which the arithmetic mean is taken in the equation (14) is sufficiently large. For example, in order to obtain the expected value from 100 samples, T = 199 may be set when S = 100.

あるいは、例えば、代表値演算部１９は、画素位置rごとの高解像画像の画素値列（Ｘ_ｔ（ｒ））_{ｔ＝Ｓ，Ｓ＋１，Ｓ＋２，・・・，Ｔ−２，Ｔ−１，１}から域外値を検出し、それらの域外値を排除した上で相加平均をとるようにしてもよい。これにより、代表値演算部１９は、出力高解像画像の画素値Ｘ＾（ｒ）（ただし「Ｘ＾」は、Ｘの上にハット）を得るよう動作する。この場合、代表値演算部１９は、下の式（１５）によって算出されるλ_ｔに基づいて、Ｘ_ｔ（ｒ）が域外値であるか否かを判定する。 Alternatively, for example, the representative value calculation unit 19 may use the pixel value sequence (X _t (r)) _{t = S, S + 1, S + 2, ..., T-2, T-1 of the high-resolution image for each pixel position r. , 1} may be detected, and the arithmetic mean may be taken after excluding those out-of-range values. As a result, the representative value calculation unit 19 operates to obtain the pixel value X ^ (r) (where "X ^" is a hat on X) of the output high-resolution image. In this case, the representative value calculation unit 19 _{determines whether or not X t} (r) is an out-of-range value based on _{λ t calculated by the following equation (15).}

なお、式（１５）において、μ_Ｘ（ｒ）およびσ_Ｘ（ｒ）は、それぞれ、画素値列（Ｘ_ｔ（ｒ））_{ｔ＝Ｓ，Ｓ＋１，Ｓ＋２，・・・，Ｔ−２，Ｔ−１，１}の期待値および標準偏差である。そして、代表値演算部１９は、｜λ_t｜が所定の閾値θ（例えばθ＝３）を超えた場合に、対応するＸ_ｔ（ｒ）を域外値として扱う。なお、算出過程からも明らかなように、λ_tは画素位置ｒごとに固有の値である。
つまり、代表値演算部１９は、下の式（１６）による計算を行うことにより、域外値を排除した期待値に基づいて、出力高解像画像Ｘ＾（Ｘの上にハット）を求める。 In the equation (15), μ _{X (r)} and σ _{X (r)} are pixel value strings (X _t (r)) _{t = S, S + 1, S + 2, ..., T-2, T, respectively.} Expected value and standard deviation of _-1,1. _{Then, when | λ t} | exceeds a predetermined threshold value θ (for example, θ = 3), the representative value calculation unit 19 _{treats the corresponding X t} (r) as an out-of-region value. As is clear from the calculation process, λ _t is a unique value for each pixel position r.
That is, the representative value calculation unit 19 obtains the output high-resolution image X ^ (hat on X) based on the expected value excluding the out-of-region value by performing the calculation according to the following equation (16).

さらに、例えば、代表値演算部１９は、画素位置rごとの高解像画像の画素値列（Ｘ_ｔ（ｒ））_{ｔ＝Ｓ，Ｓ＋１，Ｓ＋２，・・・，Ｔ−２，Ｔ−１，１}の中央値（ｍｅｄ）や最頻値（ｍｏｄ）に基づいて、出力高解像画像Ｘ＾（Ｘの上にハット）を求めてもよい。例えば、中央値（ｍｅｄ）により出力高解像画像Ｘ＾（Ｘの上にハット）を得る場合には、代表値演算部１９は、下の式（１７）による計算を行う。 Further, for example, the representative value calculation unit 19 has a pixel value sequence (X _t (r)) _{t = S, S + 1, S + 2, ..., T-2, T-1 of a high-resolution image for each pixel position r.} The output high-resolution image X ^ (hat on top of X) may be obtained based on the median value (med) and the mode value (mod) of _{1, 1.} For example, when the output high resolution image X ^ (hat on X) is obtained by the median value (med), the representative value calculation unit 19 performs the calculation by the following equation (17).

また、最頻値（ｍｏｄ）により出力高解像画像Ｘ＾（Ｘの上にハット）を得る場合には、代表値演算部１９は、式（１７）における「ｍｅｄ」を「ｍｏｄ」（統計的な最頻値を返す関数）による計算を行う。 Further, when the output high resolution image X ^ (hat on top of X) is obtained by the mode, the representative value calculation unit 19 changes the “med” in the equation (17) to “mod” (statistics). (A function that returns the most frequent value).

さらに、例えば、代表値演算部１９は、高解像画像候補の画素値列（Ｘ_ｔ（ｒ））_{ｔ＝Ｓ，Ｓ＋１，Ｓ＋２，・・・，Ｔ−２，Ｔ−１，１}に関するクラスタリング処理を行うようにしてもよい。この場合、得られたクラスターのうちの最大クラスターに属する画素値列の平均値を求めることによって、代表値演算部１９は、出力高解像画像Ｘ＾（Ｘの上にハット）の画素値を得る。なお、ここでのクラスタリングの処理には、例えばｋ−平均法を用いることができる。 Further, for example, the representative value calculation unit 19 clusters _{the pixel value sequence (X t} (r)) _{t = S, S + 1, S + 2, ..., T-2, T-1,1 of the high-resolution image candidate.} The process may be performed. In this case, by obtaining the average value of the pixel value strings belonging to the largest cluster among the obtained clusters, the representative value calculation unit 19 calculates the pixel value of the output high-resolution image X ^ (hat on X). obtain. For the clustering process here, for example, the k-means method can be used.

つまり、代表値演算部１９は、高解像画像候補の時点の系列から、高解像画像の各画素位置における値を決定して出力する。 That is, the representative value calculation unit 19 determines and outputs the value at each pixel position of the high-resolution image from the sequence at the time of the high-resolution image candidate.

初期化部２０は、仮の高解像画像Ｘ’の初期値を生成し、供給する。具体的には例えば、初期化部２０は、入力低解像画像Ｙを、仮の高解像画像Ｘ’の解像度までアップコンバートしたものを以て初期値Ｘ’とする。ここで初期化部２０が実行するアップコンバートは、例えば、最近傍補間によるものであってもよいし、補間フィルターを用いる手法であってもよい。補間フィルターを用いる手法としては、例えば、双一次補間や、双三次補間、Lanczos補間（例えば、Lanczos-3補間やLanczos-4補間）、sinc関数を有限タップ長で打ち切った補間フィルターに基づく手法などを用いることができる。 The initialization unit 20 generates and supplies an initial value of a temporary high-resolution image X'. Specifically, for example, the initialization unit 20 sets the input low-resolution image Y up-converted to the resolution of the temporary high-resolution image X'as the initial value X'. Here, the up-conversion executed by the initialization unit 20 may be, for example, by nearest neighbor interpolation or by a method using an interpolation filter. Examples of methods that use interpolation filters include bilinear interpolation, bicubic interpolation, Lanczos interpolation (for example, Lanczos-3 interpolation and Lanczos-4 interpolation), and a method based on an interpolation filter in which the sinc function is cut off with a finite tap length. Can be used.

記憶部３００は、超解像装置１に含まれる各部が上述した処理を行う過程で、データを、少なくとも一時的に記憶する機能を有する。記憶部３００の詳細については、図２を参照しながら次に説明する。 The storage unit 300 has a function of storing data at least temporarily in the process in which each unit included in the super-resolution device 1 performs the above-described processing. The details of the storage unit 300 will be described below with reference to FIG.

図２は、本実施形態による記憶部３００の構成の一例を概略として示す機能ブロック図である。図示するように、記憶部３００は、入力低解像画像記憶部３０１と、高解像画像候補記憶部３０２と、仮高解像画像記憶部３０３と、事後確率マップ記憶部３０４と、仮事後確率マップ記憶部３０５と、事前確率マップ記憶部３０６と、尤度マップ記憶部３０７と、出力高解像画像記憶部３２０と、を含んで構成される。 FIG. 2 is a functional block diagram schematically showing an example of the configuration of the storage unit 300 according to the present embodiment. As shown in the figure, the storage unit 300 includes an input low-resolution image storage unit 301, a high-resolution image candidate storage unit 302, a provisional high-resolution image storage unit 303, an ex post facto probability map storage unit 304, and a provisional post-mortem. It includes a probability map storage unit 305, a prior probability map storage unit 306, a likelihood map storage unit 307, and an output high-resolution image storage unit 320.

入力低解像画像記憶部３０１は、入力低解像画像Ｙを記憶する。入力低解像画像記憶部３０１が記憶する入力低解像画像Ｙは、初期化部２０や尤度マップ演算部１４によって読み出される。
高解像画像候補記憶部３０２は、高解像画像候補Ｘ_ｔを、時系列に記憶する。前述の通り高解像画像候補の初期値Ｘ_０は任意の画像であってよく、この画像Ｘ_０は、適宜、高解像画像候補記憶部３０２に書き込まれる。また、ｔ≧１におけるＸ_ｔは、標本更新部１８によって書き込まれる。なお、超解像装置１を構成する各部は、任意のｔに関するＸ_ｔを高解像画像候補記憶部３０２から読み出すことができる。つまり、高解像画像候補記憶部３０２は、反復演算における時点に対応させた高解像画像候補を記憶する。
仮高解像画像記憶部３０３は、仮の高解像画像Ｘ’を少なくとも一時的に記憶する。仮の高解像画像Ｘ’は、標本生成部１２によって書き込まれる。ただし、仮の高解像画像Ｘ’の初期値については、初期化部２０によって書き込まれる。そして、仮の高解像画像Ｘ’は、尤度マップ演算部１４や事前確率マップ演算部１５や標本更新部１８によって読み出される。
事後確率マップ記憶部３０４は、事後確率マップＰ_ｔを、少なくとも遅延部１１が遅延させる期間において記憶する。前述の通り事後確率マップの初期値Ｐ_０は、全画素位置について０以下の任意の値を持つマップであり、このマップＰ_０は、適宜、事後確率マップ記憶部３０４に書き込まれる。また、ｔ≧１におけるＰ_ｔは、標本更新部１８によって書き込まれる。つまり、事後確率マップ記憶部３０４は、反復演算における時点に対応させた事後確率マップを記憶する。 The input low resolution image storage unit 301 stores the input low resolution image Y. The input low-resolution image Y stored in the input low-resolution image storage unit 301 is read out by the initialization unit 20 and the likelihood map calculation unit 14.
The high-resolution image candidate storage unit 302 stores the high-resolution image candidate _Xt in chronological order. As described above, the initial value X ₀ of the high-resolution image candidate may be an arbitrary image, and this image X ₀ is appropriately written in the high-resolution image candidate storage unit 302. _{Further, X t} in t ≧ 1 is written by the sample update unit 18. It should be noted that each unit constituting the super-resolution device 1 _{can read X t} with respect to an arbitrary t from the high-resolution image candidate storage unit 302. That is, the high-resolution image candidate storage unit 302 stores the high-resolution image candidate corresponding to the time point in the iterative operation.
The temporary high-resolution image storage unit 303 stores the temporary high-resolution image X'at least temporarily. The tentative high-resolution image X'is written by the sample generation unit 12. However, the initial value of the temporary high-resolution image X'is written by the initialization unit 20. Then, the temporary high-resolution image X'is read out by the likelihood map calculation unit 14, the prior probability map calculation unit 15, and the sample update unit 18.
Posterior probability map storage unit 304, the posterior probability map P _t, least delay unit 11 stores in the period to delay. As described above, the initial value P ₀ of the posterior probability map is a map having an arbitrary value of 0 or less for all pixel positions, and this map P ₀ is appropriately written in the posterior probability map storage unit 304. _{Further, P t} in t ≧ 1 is written by the sample update unit 18. That is, the posterior probability map storage unit 304 stores the posterior probability map corresponding to the time point in the iterative operation.

仮事後確率マップ記憶部３０５は、仮の事後確率マップＰ’を少なくとも一時的に記憶する。仮の事後確率マップＰ’は、乗算部１６によって仮事後確率マップ記憶部３０５に書き込まれる。また、仮の事後確率マップＰ’は、標本更新部１８によって仮事後確率マップ記憶部３０５から読み出される。
事前確率マップ記憶部３０６は、事前確率マップＲを少なくとも一時的に記憶する。事前確率マップＲは、事前確率マップ演算部１５によって事前確率マップ記憶部３０６に書き込まれる。また、事前確率マップＲは、乗算部１６によって事前確率マップ記憶部３０６から読み出される。
尤度マップ記憶部３０７は、尤度マップＬを少なくとも一時的に記憶する。尤度マップＬは、尤度マップ演算部１４によって尤度マップ記憶部３０７に書き込まれる。また、尤度マップＬは、乗算部１６によって尤度マップ記憶部３０７から読み出される。
出力高解像画像記憶部３２０は、代表値演算部１９によって求められた出力高解像画像Ｘ＾（Ｘの上にハット）を、少なくとも一時的に記憶する。
なお、データが生成された後、直ちに下流の処理過程によって処理するように超解像装置１を構成する場合には、そのデータを記憶する記憶部を持たない構成としてもよい。また、出力高解像画像Ｘ＾（Ｘの上にハット）が生成されて直ちに超解像装置１の外部に出力される場合には、出力高解像画像記憶部３２０を持たない構成としてもよい。 The temporary posterior probability map storage unit 305 stores the temporary posterior probability map P'at least temporarily. The tentative posterior probability map P'is written in the tentative posterior probability map storage unit 305 by the multiplication unit 16. Further, the temporary posterior probability map P'is read from the temporary posterior probability map storage unit 305 by the sample update unit 18.
The prior probability map storage unit 306 stores the prior probability map R at least temporarily. The prior probability map R is written in the prior probability map storage unit 306 by the prior probability map calculation unit 15. Further, the prior probability map R is read from the prior probability map storage unit 306 by the multiplication unit 16.
The likelihood map storage unit 307 stores the likelihood map L at least temporarily. The likelihood map L is written in the likelihood map storage unit 307 by the likelihood map calculation unit 14. Further, the likelihood map L is read from the likelihood map storage unit 307 by the multiplication unit 16.
The output high-resolution image storage unit 320 stores the output high-resolution image X ^ (hat on top of X) obtained by the representative value calculation unit 19 at least temporarily.
When the super-resolution device 1 is configured to be processed by a downstream processing process immediately after the data is generated, the configuration may not have a storage unit for storing the data. Further, when the output high-resolution image X ^ (hat on X) is generated and immediately output to the outside of the super-resolution device 1, the configuration does not have the output high-resolution image storage unit 320. Good.

次に、事前確率マップ演算部１５を構成するニューラルネットワークの詳細について説明する。
図３は、本実施形態が利用するニューラルネットワークの概略構成を示す概略図である。図示するように、事前確率マップ演算部１５の一部であるニューラルネットワークは、層構造を有し、入力層３０と、中間層３４（第一の成分）と、中間層３５（第二の成分）と、出力層４０とを含んで構成される。なお、より一般的な構成として、ニューラルネットワークは、入力層と、０層以上の中間層と、出力層とを含む構成としてよい。各層は画像の画素配列に対応したマトリックスの構造を有している。マトリクス内のセル（１区画）には、値を設定できる。ニューロンは、複数のセルの値について重み付け和（weighted sum）を算出し、その値に対応する信号を出力する。ニューロンにおける重みの値は、後述する学習により調整可能である。 Next, the details of the neural network constituting the prior probability map calculation unit 15 will be described.
FIG. 3 is a schematic diagram showing a schematic configuration of a neural network used in this embodiment. As shown in the figure, the neural network that is a part of the prior probability map calculation unit 15 has a layer structure, and has an input layer 30, an intermediate layer 34 (first component), and an intermediate layer 35 (second component). ) And the output layer 40. As a more general configuration, the neural network may include an input layer, an intermediate layer of 0 or more layers, and an output layer. Each layer has a matrix structure corresponding to the pixel arrangement of the image. Values can be set for cells (1 partition) in the matrix. The neuron calculates a weighted sum for the values of a plurality of cells and outputs a signal corresponding to the values. The weight value in the neuron can be adjusted by learning described later.

各層間の接続は、畳み込みニューラルネットワークの形態である。図示する例では入力層３０の局所領域３１の画素値列（この例では、３画素×３画素のブロック））に対して、ニューロン３２およびニューロン３３は、それぞれ重み付け和を計算する。そして、ニューロン３２およびニューロン３３は、それぞれの重み付け和に対して活性化関数（例えば、シグモイド関数）を適用した結果を出力する。各ニューロン出力値は中間層における対応画素の各成分値として設定される。この例では、中間層の各画素は、２次元の値（第一の成分、および第二の成分）を保持する。より具体的には、局所領域３１の画素値列に基づいてニューロン３２が算出した出力値３６（第一の成分）は、中間層３４（第一の成分）における対応画素の成分値として設定される。また、局所領域３１の画素値列に基づいてニューロン３３が算出した出力値３７（第二の成分）は、中間層３５（第二の成分）における対応画素の成分値として設定される。 The connection between each layer is in the form of a convolutional neural network. In the illustrated example, the neuron 32 and the neuron 33 calculate the weighted sum with respect to the pixel value sequence of the local region 31 of the input layer 30 (in this example, a block of 3 pixels × 3 pixels). Then, the neuron 32 and the neuron 33 output the result of applying the activation function (for example, the sigmoid function) to the respective weighted sums. Each neuron output value is set as each component value of the corresponding pixel in the intermediate layer. In this example, each pixel in the intermediate layer holds a two-dimensional value (first and second components). More specifically, the output value 36 (first component) calculated by the neuron 32 based on the pixel value sequence of the local region 31 is set as the component value of the corresponding pixel in the intermediate layer 34 (first component). To. Further, the output value 37 (second component) calculated by the neuron 33 based on the pixel value sequence of the local region 31 is set as the component value of the corresponding pixel in the intermediate layer 35 (second component).

画素ごとの順次処理を行う場合、ニューラルネットワークは、入力層における前記局所領域と中間層において対応する画素位置とを画像内で移動させつつ、入力層から中間層への信号値の伝搬を行う。同様に（中間層が複数存在すれば）上流の中間層から下流の中間層への信号値の伝搬を行う。中間層から出力層への信号値の伝搬も同様である。 When performing sequential processing for each pixel, the neural network propagates the signal value from the input layer to the intermediate layer while moving the local region in the input layer and the corresponding pixel position in the intermediate layer in the image. Similarly, the signal value is propagated from the upstream intermediate layer to the downstream intermediate layer (if there are a plurality of intermediate layers). The same applies to the propagation of signal values from the intermediate layer to the output layer.

畳み込みニューラルネットワークを構成する各ニューロンにおけるパラメーターは可変である。このパラメーターは、学習時に更新され得る。このパラメーターは、前記の重み付け和を計算する際の重みの値として使用される。 The parameters in each neuron that make up the convolutional neural network are variable. This parameter can be updated during training. This parameter is used as the weight value when calculating the weight sum.

畳み込みニューラルネットワークは、例えば、正例および負例からなる学習データセットを用い、誤差逆伝播法（backpropagation）により学習を行う。
正例としては、例えば、実際の事物等を撮影して（あるいは描いて）得られた精細な画像である学習用高解像画像Ｘ_{ｔｒａｉｎ}を用いる。正例としての学習用高解像画像Ｘ_{ｔｒａｉｎ}には、無意味な雑音や、ぼやけた画像や、所定基準以上の符号化劣化を伴う画像等を含まないようにする。正例としての学習用高解像画像Ｘ_{ｔｒａｉｎ}に写真や、ビデオ映像フレームや、イラストや、ＣＧ（コンピューターグラフィクス）や、線画を含めてもよい。
また、正例としての学習用高解像画像Ｘ_{ｔｒａｉｎ}に対応させて、画面全体が所定値ρ（例えば、ρ＝１）である学習用事前確率マップＲ_{ｔｒａｉｎ}を用いる。
正例として、上記のＸ_{ｔｒａｉｎ}とＲ_{ｔｒａｉｎ}との対の１対以上を用いる。
一方、負例としては、例えば、精細な画像ではない画像である学習用高解像画像Ｘ_{ｔｒａｉｎ’}を用いる。負例としての学習用高解像画像Ｘ_{ｔｒａｉｎ’}には、無意味な雑音や、ぼやけた画像や、所定基準以上の符号化劣化を伴う画像などを含めるようにする。
また、負例としての学習用高解像画像Ｘ_{ｔｒａｉｎ’}に対応させて、画面全体が所定値ρ’（例えば、ρ’＝０）である学習用事前確率マップＲ_{ｔｒａｉｎ’}を用いる。
負例として、上記のＸ_{ｔｒａｉｎ’}とＲ_{ｔｒａｉｎ’}との対の１対以上を用いる。
このような学習データセットを用いた学習処理を事前に行うことにより、ニューラルネットワーク内のパラメーターが適切に調整される。そして、学習済みのニューラルネットワークを用いることにより、事前確率マップ演算部１５は、入力される仮の高解像画像Ｘ’が画像としてどの程度自然であるかを画素位置ごとに数値として評価した結果である事前確率マップＲを出力することができる。 The convolutional neural network uses, for example, a training data set consisting of positive and negative examples, and performs training by an error backpropagation method.
_{As a positive example, for example, a high-resolution learning image X train} , which is a fine image obtained by photographing (or drawing) an actual thing or the like, is used. As a positive example, the learning high-resolution image X _train does not include meaningless noise, a blurry image, an image with coding deterioration exceeding a predetermined reference, and the like. A high-resolution image for learning X _train as a positive example may include a photograph, a video image frame, an illustration, CG (computer graphics), or a line art.
_{Further, a learning prior probability map R train in} which the entire screen has a predetermined value ρ (for example, ρ = 1) is used in correspondence with the learning high-resolution image X _{train as a positive example.}
As a positive example, one or more pairs of _{the above X train} and R _{train are used.}
On the other hand, as a negative example, for example, a learning high-resolution image X _train'which is an image that is not a fine image is used. As a negative example, the learning high-resolution image X _{train'includes} meaningless noise, a blurred image, an image with coding deterioration of a predetermined reference or more, and the like.
Further, in correspondence with the learning high-resolution image X _train'as _{a negative example, the learning prior probability map R train' in} which the entire screen has a predetermined value ρ'(for example, ρ'= 0) is used.
As a negative example, one or more pairs of _{the above X train'and} R _{train' are used.}
By performing the training process using such a training data set in advance, the parameters in the neural network are appropriately adjusted. Then, by using the trained neural network, the prior probability map calculation unit 15 evaluates how natural the input temporary high-resolution image X'is as an image as a numerical value for each pixel position. It is possible to output the prior probability map R.

なお、ニューラルネットワーク自体、および誤差逆伝播法によるニューラルネットワークの学習処理自体は、既存技術により実現できる。 The neural network itself and the learning process itself of the neural network by the error back propagation method can be realized by the existing technology.

図４は、本実施形態によるニューラルネットワークの学習処理に用いるのに適した正例および負例の例を示す概略図である。
図示する例において、正例は、（Ｘ_{ｔｒａｉｎ１}，Ｒ_{ｔｒａｉｎ１}）の対、および（Ｘ_{ｔｒａｉｎ２}，Ｒ_{ｔｒａｉｎ２}）の対である。Ｘ_{ｔｒａｉｎ１}およびＸ_{ｔｒａｉｎ２}は、それぞれ、精細で自然な画像である。また、Ｒ_{ｔｒａｉｎ１}およびＲ_{ｔｒａｉｎ２}は、全画素に対応させて１を保持する画像（全白）である。
また、負例は、（Ｘ_{ｔｒａｉｎ３}，Ｒ_{ｔｒａｉｎ３}）の対、および（Ｘ_{ｔｒａｉｎ４}，Ｒ_{ｔｒａｉｎ４}）の対である。Ｘ_{ｔｒａｉｎ３}は、精細ではない、非常にぼやけた自動車の画像である。また、Ｘ_{ｔｒａｉｎ４}は、自然な画像とは言えないパターンを有する画像である。また、Ｒ_{ｔｒａｉｎ３}およびＲ_{ｔｒａｉｎ４}は、全画素に対応させて０を保持する画像（全黒）である。 FIG. 4 is a schematic diagram showing examples of positive and negative examples suitable for use in the learning process of the neural network according to the present embodiment.
In the illustrated example, a positive sample _is a pair _{(X train1, R} train1) pairs, and _{_{(X train2,} R} _train2). X _{train 1} and X _{train 2} are fine and natural images, respectively. Further, R _{train 1} and R _{train 2} are images (all white) in which 1 is retained corresponding to all pixels.
Also, negative cases _are pairs of _{(X train3, R} train3) pairs, and _{_{(X train4,} R} _train4). X _{train 3} is a non-fine, very blurry image of a car. Further, X _{train 4} is an image having a pattern that cannot be said to be a natural image. Further, R _{train 3} and R _{train 4} are images (all black) in which 0 is held corresponding to all pixels.

以上説明したように、本実施形態によれば、尤度マップ演算部１４は、仮の高解像画像と入力画像である低解像画像との整合性を評価する。また事前確率マップ演算部１５は、仮の高解像画像が、高解像画像としてあるべき画像的特徴を有するか否かを評価する。これら２つの評価値（マップ）を用いて、標本生成部１２および標本更新部１８は、高評価な高解像画像を確率的に生成する。この確率的な生成はマルコフ連鎖モンテカルロ法に準ずる高効率な標本生成過程であり、確率的に生成された高解像画像群の代表値を代表値演算部１９が求めることにより、最も確からしい超解像結果を速やかに得ることができる。 As described above, according to the present embodiment, the likelihood map calculation unit 14 evaluates the consistency between the provisional high-resolution image and the low-resolution image which is the input image. Further, the prior probability map calculation unit 15 evaluates whether or not the provisional high-resolution image has image features that should be as a high-resolution image. Using these two evaluation values (maps), the sample generation unit 12 and the sample update unit 18 probabilistically generate a highly evaluated high-resolution image. This probabilistic generation is a highly efficient sample generation process similar to the Markov chain Monte Carlo method, and the representative value calculation unit 19 obtains the representative value of the stochastically generated high-resolution image group, which is the most probable super. The resolution result can be obtained quickly.

また、本実施形態によれば、上記の事前確率マップ演算部１５が、高解像画像としてあるべき画像的特徴を有するか否かを定量化するために、生物の神経回路網を模した機械学習の手法（ニューラルネットワーク）を導入している。このため、本実施形態による超解像装置は、人間の視覚系（眼や脳の視覚神経系）を近似した応答特性を実現することが期待される。即ち、本実施形態では、視覚的に精細感のある超解像を実現することが可能となる。 Further, according to the present embodiment, in order to quantify whether or not the prior probability map calculation unit 15 has image features that should be as a high-resolution image, a machine that imitates a neural network of an organism. We have introduced a learning method (neural network). Therefore, the super-resolution device according to the present embodiment is expected to realize response characteristics that approximate the human visual system (visual nervous system of the eye or brain). That is, in the present embodiment, it is possible to realize super-resolution with a visually fine feeling.

また、本実施形態によれば、従来技術の方法（非特許文献１に記載された方法など）よりも良い処理効率で、高解像画像を取得することができる。 Further, according to the present embodiment, a high-resolution image can be obtained with better processing efficiency than the method of the prior art (such as the method described in Non-Patent Document 1).

［第２実施形態］
次に、第２実施形態について説明する。なお、前実施形態において既に説明した事項については以下において説明を省略する場合がある。ここでは、本実施形態に特有の事項を中心に説明する。
本実施形態による超解像装置１０１の構成は、図１に示した機能構成における事前確率マップ演算部１５を、事前確率マップ演算部１１５によって置換した構成である。 [Second Embodiment]
Next, the second embodiment will be described. The matters already described in the previous embodiment may be omitted below. Here, the matters peculiar to the present embodiment will be mainly described.
The configuration of the super-resolution device 101 according to the present embodiment is such that the prior probability map calculation unit 15 in the functional configuration shown in FIG. 1 is replaced by the prior probability map calculation unit 115.

事前確率マップ演算部１１５は、仮の高解像画像Ｘ’に基づいて、事前確率マップＲを求めて出力する。事前確率マップＲは、仮の高解像画像Ｘ’がどの程度自然な画像であるか（自然な画像である度合い）を画素ごとに定量化した数値のマップである。
具体的には、事前確率マップ演算部１１５は、空間方向の画素値の変化量（例えば、輝度勾配）が相対的に小さい場合に仮の高解像画像Ｘ’が自然な画像である度合いを高く評価して、事前確率マップＲの画素値を決定する。 The prior probability map calculation unit 115 obtains and outputs the prior probability map R based on the tentative high-resolution image X'. The prior probability map R is a numerical map that quantifies how natural the tentative high-resolution image X'is (the degree of being a natural image) for each pixel.
Specifically, the prior probability map calculation unit 115 determines the degree to which the temporary high-resolution image X'is a natural image when the amount of change in the pixel value in the spatial direction (for example, the brightness gradient) is relatively small. Highly evaluated, the pixel value of the prior probability map R is determined.

一例として、事前確率マップ演算部１１５は、下の式（１８）により、事前確率マップＲを算出する。 As an example, the prior probability map calculation unit 115 calculates the prior probability map R by the following equation (18).

式（１８）の右辺の指数関数ｅｘｐの中に含まれる演算子ナブラ（nabla）は、仮の高解像画像Ｘ’の画素位置ｒにおける勾配を表す。また、τは正の実定数であり、例えばτ＝１とする。式（１８）によって事前確率マップＲを算出する場合、画素位置ｒにおける勾配の絶対値が大きいほど、画素位置ｒにおける事前確率マップＲの値は小さくなる。 The operator nabla included in the exponential function exp on the right side of equation (18) represents the gradient at the pixel position r of the tentative high-resolution image X'. Further, τ is a positive real constant, for example, τ = 1. When the prior probability map R is calculated by the equation (18), the larger the absolute value of the gradient at the pixel position r, the smaller the value of the prior probability map R at the pixel position r.

本実施形態では、事前確率マップ演算部１１５が出力した事前確率マップＲ（ｒ）を用いて、乗算部１６が乗算を行う。事前確率マップ演算部１１５以外の各部の処理は、第１実施形態において対応する各部の処理と同様である。 In the present embodiment, the multiplication unit 16 performs multiplication using the prior probability map R (r) output by the prior probability map calculation unit 115. The processing of each part other than the prior probability map calculation unit 115 is the same as the processing of the corresponding parts in the first embodiment.

［第３実施形態］
次に、第３実施形態について説明する。なお、前実施形態までにおいて既に説明した事項については以下において説明を省略する場合がある。ここでは、本実施形態に特有の事項を中心に説明する。
本実施形態による超解像装置２０１の構成は、図１に示した機能構成における事前確率マップ演算部１５を、事前確率マップ演算部２１５によって置換した構成である。 [Third Embodiment]
Next, the third embodiment will be described. The matters already explained up to the previous embodiment may be omitted below. Here, the matters peculiar to the present embodiment will be mainly described.
The configuration of the super-resolution device 201 according to the present embodiment is such that the prior probability map calculation unit 15 in the functional configuration shown in FIG. 1 is replaced by the prior probability map calculation unit 215.

事前確率マップ演算部２１５は、仮の高解像画像Ｘ’に基づいて、事前確率マップＲを求めて出力する。事前確率マップＲは、仮の高解像画像Ｘ’がどの程度自然な画像であるか（自然な画像である度合い）を画素ごとに定量化した数値のマップである。
具体的には、事前確率マップ演算部２１５は、局所的な画素値の総変化量（例えば、トータルバリエーションノルム，total variation norm）が相対的に小さい場合に仮の高解像画像Ｘ’が自然な画像である度合いを高く評価して、事前確率マップＲの画素値を決定する。 The prior probability map calculation unit 215 obtains and outputs the prior probability map R based on the tentative high-resolution image X'. The prior probability map R is a numerical map that quantifies how natural the tentative high-resolution image X'is (the degree of being a natural image) for each pixel.
Specifically, in the prior probability map calculation unit 215, when the total amount of change in the local pixel value (for example, total variation norm) is relatively small, the temporary high-resolution image X'is natural. The pixel value of the prior probability map R is determined by highly evaluating the degree of the image.

一例として、事前確率マップ演算部２１５は、下の式（１９）により事前確率マップＲを算出する。 As an example, the prior probability map calculation unit 215 calculates the prior probability map R by the following equation (19).

式（１９）において、演算子ナブラ（nabla）は、式（１８）における場合と同様に、仮の高解像画像Ｘ’の画素位置（ｒ＋ｘ）における勾配を表す。また、ｘは、画素平面に沿ったベクトルである。つまり、式（１９）の右辺に含まれる定積分は、画素位置ｒの周辺の半径ρ以下の画素領域のトータルバリエーションを算出するためのものである。なお、半径ρは正の実定数であり、例えばρ＝２とする。また、φは正の実定数であり、例えばφ＝１とする。式（１９）に事前確率マップＲを算出する場合、画素位置ｒおよびその周辺における勾配の絶対値が大きいほど、画素位置ｒにおける事前確率マップＲの値は小さくなる。 In equation (19), the operator nabla represents the gradient of the tentative high-resolution image X'at the pixel position (r + x), as in equation (18). Further, x is a vector along the pixel plane. That is, the definite integral included in the right side of the equation (19) is for calculating the total variation of the pixel region having a radius ρ or less around the pixel position r. The radius ρ is a positive real constant, and for example, ρ = 2. Further, φ is a positive real constant, and for example, φ = 1. When the prior probability map R is calculated by the equation (19), the larger the absolute value of the gradient at the pixel position r and its surroundings, the smaller the value of the prior probability map R at the pixel position r.

本実施形態では、事前確率マップ演算部２１５が出力した事前確率マップＲ（ｒ）を用いて、乗算部１６が乗算を行う。事前確率マップ演算部２１５以外の各部の処理は、第１実施形態において対応する各部の処理と同様である。 In the present embodiment, the multiplication unit 16 performs multiplication using the prior probability map R (r) output by the prior probability map calculation unit 215. The processing of each part other than the prior probability map calculation unit 215 is the same as the processing of the corresponding parts in the first embodiment.

［変形例］
上記の第１実施形態、第２実施形態、および第３実施形態では、それぞれ、事前確率マップ演算部１５、１１５、および２１５が、事前確率マップを算出していた。変形例として、事前確率マップ演算部１５、１１５、および２１５のそれぞれによる方法を複数組み合わせることにより、「どの程度自然であるか」を表す指標を定量化してもよい。
例えば、事前確率マップ演算部１５による方法で求められる事前確率マップと、事前確率マップ演算部１１５による方法で求められる事前確率マップとの、重み付け平均によるマップを用いてもよい。
また、例えば、事前確率マップ演算部１１５による方法で求められる事前確率マップと、事前確率マップ演算部２１５による方法で求められる事前確率マップとの、重み付け平均によるマップを用いてもよい。
また、例えば、事前確率マップ演算部２１５による方法で求められる事前確率マップと、事前確率マップ演算部１５による方法で求められる事前確率マップとの、重み付け平均によるマップを用いてもよい。
また、その他の方法により、事前確率マップを算出するようにしてもよい。 [Modification example]
In the first embodiment, the second embodiment, and the third embodiment described above, the prior probability map calculation units 15, 115, and 215 have calculated the prior probability map, respectively. As a modification, the index indicating "how natural" may be quantified by combining a plurality of methods by the prior probability map calculation units 15, 115, and 215.
For example, a map based on a weighted average of the prior probability map obtained by the method by the prior probability map calculation unit 15 and the prior probability map obtained by the method by the prior probability map calculation unit 115 may be used.
Further, for example, a map based on a weighted average of the prior probability map obtained by the method by the prior probability map calculation unit 115 and the prior probability map obtained by the method by the prior probability map calculation unit 215 may be used.
Further, for example, a map based on a weighted average of the prior probability map obtained by the method by the prior probability map calculation unit 215 and the prior probability map obtained by the method by the prior probability map calculation unit 15 may be used.
Further, the prior probability map may be calculated by another method.

なお、上述した実施形態における超解像装置の機能の少なくとも一部をコンピューターで実現するようにしても良い。その場合、この機能を実現するためのプログラムをコンピューター読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピューターシステムに読み込ませ、実行することによって実現しても良い。なお、ここでいう「コンピューターシステム」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピューター読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、ＵＳＢメモリー等の可搬媒体、コンピューターシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピューター読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバーやクライアントとなるコンピューターシステム内部の揮発性メモリーのように、一定時間プログラムを保持しているものも含んでも良い。また上記プログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピューターシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 It should be noted that at least a part of the functions of the super-resolution device in the above-described embodiment may be realized by a computer. In that case, a program for realizing this function may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read by a computer system and executed. The term "computer system" as used herein includes hardware such as an OS and peripheral devices. The "computer-readable recording medium" is a portable medium such as a flexible disk, a magneto-optical disk, a ROM, a CD-ROM, a DVD-ROM, or a USB memory, or a storage device such as a hard disk built in a computer system. Say that. Further, a "computer-readable recording medium" is a communication line for transmitting a program via a network such as the Internet or a communication line such as a telephone line, and dynamically holds the program for a short period of time. It may also include a program that holds a program for a certain period of time, such as a volatile memory inside a computer system that serves as a server or a client in that case. Further, the above-mentioned program may be for realizing a part of the above-mentioned functions, and may be further realized by combining the above-mentioned functions with a program already recorded in the computer system.

以上説明した少なくとも一つの実施形態あるいはその変形例によれば、超解像装置は、低解像画像を基に、高精細で確からしい高解像画像を、速やかに得ることができるようになる。 According to at least one embodiment or a modification thereof described above, the super-resolution device can quickly obtain a high-definition and reliable high-resolution image based on the low-resolution image. ..

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 Although the embodiments of the present invention have been described in detail with reference to the drawings, the specific configuration is not limited to this embodiment, and the design and the like within a range not deviating from the gist of the present invention are also included.

本発明は、画像あるいは映像を処理するあらゆる産業において利用可能である。 The present invention is available in any industry that processes images or video.

１超解像装置
１０遅延部
１１遅延部
１２標本生成部
１３切替部
１４尤度マップ演算部
１５事前確率マップ演算部
１６乗算部
１８標本更新部
１９代表値演算部
２０初期化部
３０入力層
３１入力層における注目する局所領域
３２ニューロン
３３ニューロン
３４中間層（第一の成分）
３５中間層（第二の成分）
３６ニューロン出力値（第一の成分）
３７ニューロン出力値（第二の成分）
３８中間層における注目する局所領域
３９ニューロン
４０出力層
４１ニューロン出力値
１０１超解像装置
１１５事前確率マップ演算部
２０１超解像装置
２１５事前確率マップ演算部
３００記憶部
３０１入力低解像画像記憶部
３０２高解像画像候補記憶部
３０３仮高解像画像記憶部
３０４事後確率マップ記憶部
３０５仮事後確率マップ記憶部
３０６事前確率マップ記憶部
３０７尤度マップ記憶部
３２０出力高解像画像記憶部 1 Super-resolution device 10 Delay unit 11 Delay unit 12 Sample generation unit 13 Switching unit 14 Likelihood map calculation unit 15 Prior probability map calculation unit 16 Multiplication unit 18 Sample update unit 19 Representative value calculation unit 20 Initialization unit 30 Input layer 31 Local region of interest in the input layer 32 Neuron 33 Neuron 34 Intermediate layer (first component)
35 Intermediate layer (second component)
36 Neuron output value (first component)
37 Neuron output value (second component)
38 Local area of interest in the middle layer 39 Neuron 40 Output layer 41 Neuron output value 101 Super-resolution device 115 Priorior probability map calculation unit 201 Priorior probability map calculation unit 215 Priorior probability map calculation unit 300 Storage unit 301 Input low-resolution image storage unit 302 High-resolution image candidate storage unit 303 Temporary high-resolution image storage unit 304 Posterior probability map storage unit 305 Priorior probability map storage unit 306 Prior probability map storage unit 307 Probability map storage unit 320 Output high-resolution image storage unit

Claims

A super-resolution device that obtains a high-resolution image from a low-resolution image by iterative calculation.
A high-resolution image candidate storage unit that stores high-resolution image candidates corresponding to time points in the iterative calculation,
A posterior probability map storage unit that stores a posterior probability map corresponding to a time point in the iterative operation,
A sample generation unit that probabilistically generates a provisional high-resolution image from a high-resolution image candidate at a previous time in the iterative operation according to a predetermined probability model.
A likelihood map calculation unit that evaluates the consistency of each local region between the provisional high-resolution image and the low-resolution image and outputs the spatial distribution of the consistency as a likelihood map.
A prior probability map calculation unit that evaluates the degree to which the temporary high resolution image is a natural image and outputs a spatial distribution of the degree of being a natural image as a prior probability map.
A sample update unit that updates the high-resolution image candidate and stores it in the high-resolution image candidate storage unit, and updates the posterior probability map and stores it in the posterior probability map storage unit.
A representative value calculation unit that determines and outputs a value at each pixel position of the high-resolution image from the sequence at the time of the high-resolution image candidate, and
Equipped with
The sample update unit
The product of the value at each pixel position of the likelihood map output by the likelihood map calculation unit and the value at the corresponding pixel position of the prior probability map output by the advance probability map calculation unit, and the above-mentioned thing at the previous time. Based on the value at the corresponding pixel position of the post-probability map and the predetermined random number value, the pixel value at the pixel position of the provisional high-resolution image or the pixel at the pixel position of the high-resolution image candidate at the previous time point. One of the values is determined as the pixel value of the pixel position of the high-resolution image candidate at the present time, and is also determined.
The product of the value at each pixel position of the likelihood map output by the likelihood map calculation unit and the value at the corresponding pixel position of the advance probability map output by the advance probability map calculation unit, and the above-mentioned thing at the previous time. Based on the value of the same pixel position of the probability map at the corresponding pixel position of the post-probability map and the random value used to determine the pixel value of the pixel position of the high-resolution image candidate at the present time. Either the product at the corresponding pixel position of, or the value at the corresponding pixel position of the posterior probability map at the previous time point is determined as the value at the corresponding pixel position of the posterior probability map at the present time.
A super-resolution device characterized by this.

The advance probability map calculation unit is composed of a neural network, and uses a positive example which is an example of an image which should be a high-resolution image and a negative example which is an example of an image which should not be a high-resolution image. The neural network has been trained in advance.
The super-resolution device according to claim 1.

The prior probability map calculation unit highly evaluates the degree to which the temporary high resolution image is a natural image as the amount of change in the pixel value in the spatial direction at the pixel position of the temporary high resolution image is small. Output pre-article probability map,
The super-resolution device according to claim 1 or 2.

The pre-probability map calculation unit highly evaluates the degree to which the temporary high-resolution image is a natural image as the total amount of change in the pixel values within a predetermined range of the pixel positions of the temporary high-resolution image is smaller. , Output the pre-probability map,
The super-resolution device according to any one of claims 1 to 3, wherein the super-resolution device is characterized.

Computer,
A super-resolution device that obtains a high-resolution image from a low-resolution image by iterative calculation.
A high-resolution image candidate storage unit that stores high-resolution image candidates corresponding to time points in the iterative calculation,
A posterior probability map storage unit that stores a posterior probability map corresponding to a time point in the iterative operation,
A sample generation unit that probabilistically generates a provisional high-resolution image from a high-resolution image candidate at a previous time in the iterative operation according to a predetermined probability model.
A likelihood map calculation unit that evaluates the consistency of each local region between the provisional high-resolution image and the low-resolution image and outputs the spatial distribution of the consistency as a likelihood map.
A prior probability map calculation unit that evaluates the degree to which the temporary high resolution image is a natural image and outputs a spatial distribution of the degree of being a natural image as a prior probability map.
A sample update unit that updates the high-resolution image candidate and stores it in the high-resolution image candidate storage unit, and updates the posterior probability map and stores it in the posterior probability map storage unit.
A representative value calculation unit that determines and outputs a value at each pixel position of the high-resolution image from the sequence at the time of the high-resolution image candidate, and
Equipped with
The sample update unit
The product of the value at each pixel position of the likelihood map output by the likelihood map calculation unit and the value at the corresponding pixel position of the prior probability map output by the advance probability map calculation unit, and the above-mentioned thing at the previous time. Based on the value at the corresponding pixel position of the post-probability map and the predetermined random number value, the pixel value at the pixel position of the provisional high-resolution image or the pixel at the pixel position of the high-resolution image candidate at the previous time point. One of the values is determined as the pixel value of the pixel position of the high-resolution image candidate at the present time, and is also determined.
The product of the value at each pixel position of the likelihood map output by the likelihood map calculation unit and the value at the corresponding pixel position of the advance probability map output by the advance probability map calculation unit, and the above-mentioned thing at the previous time. Based on the value of the same pixel position of the probability map at the corresponding pixel position of the post-probability map and the random value used to determine the pixel value of the pixel position of the high-resolution image candidate at the present time. Either the product at the corresponding pixel position of, or the value at the corresponding pixel position of the posterior probability map at the previous time point is determined as the value at the corresponding pixel position of the posterior probability map at the present time.
A program to function as a super-resolution device.