JP6347028B1

JP6347028B1 - UPSAMPLE DEVICE, COMPUTER PROGRAM, COMPUTER SYSTEM, DEVICE, AND UPSAMPLE METHOD

Info

Publication number: JP6347028B1
Application number: JP2017199915A
Authority: JP
Inventors: 修二奥野
Original assignee: TSUBASA FACTORY CO., LTD.
Current assignee: TSUBASA FACTORY CO., LTD.
Priority date: 2017-10-13
Filing date: 2017-10-13
Publication date: 2018-06-27
Anticipated expiration: 2037-10-13
Also published as: JP2019074892A

Abstract

【課題】高精度なアップサンプルデータを高速に作成する。【解決手段】アップサンプル処理装置１は、第２サンプルレートより高レートの第３サンプルレートのデータを含む第１階層の複数の学習用サブセットについて、第２サンプルレートから第３サンプルレートにアップサンプルする第１フィルタ及び第２サンプルレートよりも低レートの第１サンプルレートから第２サンプルレートにアップサンプルする第２フィルタを取得する第１階層フィルタ取得部と、第２サンプルレートの入力データから第１サンプルレートの縮小データを作成する縮小データ作成部と、各第２フィルタを用いて、縮小データをアップサンプルして第２サンプルレートの候補データを作成する候補データ作成部と、候補データと入力データとの差分が最小となる第２フィルタに対応する第１フィルタを用いて、第２サンプルレートの入力データをアップサンプルするアップサンプル処理部とを備える。【選択図】図１High-precision upsampled data is created at high speed. An up-sampling apparatus (1) up-samples a plurality of learning subsets in a first hierarchy including data of a third sample rate higher than a second sample rate from a second sample rate to a third sample rate. A first layer filter acquisition unit that acquires a second filter that up-samples from a first sample rate lower than the first sample rate and the second sample rate to a second sample rate, and second input from the second sample rate input data A reduced data creation unit that creates reduced data at one sample rate, a candidate data creation unit that upsamples the reduced data to create candidate data at the second sample rate using each second filter, and input candidate data Using the first filter corresponding to the second filter that minimizes the difference from the data, And a up sample processing unit for upsampling input data Rureto. [Selection] Figure 1

Description

本発明は、入力データにアップサンプル処理を施すアップサンプル装置、該アップサンプル装置を実現するコンピュータプログラム、該アップサンプル装置を活用するコンピュータシステム、該アップサンプル装置を活用する機器、及び該アップサンプル装置に用いられるアップサンプル方法に関する。 The present invention relates to an upsampling apparatus that performs upsampling processing on input data, a computer program that realizes the upsampling apparatus, a computer system that uses the upsampling apparatus, a device that uses the upsampling apparatus, and the upsampling apparatus The present invention relates to an upsampling method used in the above.

画像、音響、その他のデータを高品質にするために、単位尺度当たりのサンプル数を増やすアップサンプル処理が行われる。特に、近年、入力画像に超解像度処理を施して、高解像度化する超解像度処理の技術が実用化されている（例えば、特許文献１参照）。 In order to improve the quality of images, sound, and other data, an upsampling process that increases the number of samples per unit scale is performed. In particular, in recent years, a super-resolution processing technique for performing high-resolution processing by applying super-resolution processing to an input image has been put into practical use (see, for example, Patent Document 1).

アップサンプル処理の手法としては、入力データを補間し必要に応じて変形するが、いかなる補間、変形を行うべきかを入力データと出力データとから学習したフィルタを用いることが考えられる。フィルタの例として、超解像度処理においては、機械学習済みの畳み込みニューラルネットワーク（Convolutional Neural Network：以下、「ＣＮＮ」という）に入力画像を入力し、当該入力画像に対して超解像度処理を施して、高解像度化された出力画像を出力する方法が知られている。このＣＮＮは、オリジナル画像と、当該オリジナル画像の縮小画像とに基づいて、縮小画像を入力とし、オリジナル画像を出力として各層におけるニューラルネットの重みおよびバイアスを機械学習することにより構成される。 As a method of up-sampling processing, input data is interpolated and deformed as necessary, but it is conceivable to use a filter that learns from the input data and output data what interpolation and deformation should be performed. As an example of a filter, in super-resolution processing, an input image is input to a machine-learned convolutional neural network (hereinafter referred to as “CNN”), and the input image is subjected to super-resolution processing. A method for outputting an output image with a high resolution is known. The CNN is configured by performing machine learning on the weight and bias of the neural network in each layer, using the reduced image as an input and the original image as an output based on the original image and the reduced image of the original image.

特表２０１３−５３２８７８号公報Special table 2013-532878 gazette

データが画像であるとすると、画像には人物写真、風景写真、イラストなど様々な種類が含まれる。例えば、イラストはエッジ成分を多く含むため強い高周波成分を含むのに対し、風景画像はすべての周波数成分を満遍なく含むなど、画像の種類によって特性が異なる。 If the data is an image, the image includes various types such as a portrait photograph, a landscape photograph, and an illustration. For example, illustrations include many high frequency components because they include many edge components, while landscape images include all frequency components evenly.

また、データが音響であるとすると、音響には人間の声、楽器（例えばバイオリン）を演奏した音楽、スポーツ（例えば野球）における録音など様々な種類が含まれる。例えば、人間の声は発声の制約から強い高周波成分を含まないのに対し、バイオリンの演奏では弦の倍音としての高周波成分を含むなど、音響の種類によって特性が異なる。 If the data is sound, the sound includes various types such as a human voice, music played with a musical instrument (for example, a violin), and recording in sports (for example, baseball). For example, human voices do not contain strong high-frequency components due to utterance restrictions, whereas violin performances contain high-frequency components as harmonics of strings.

しかしながら、これらのデータの特性を無視して、１つのフィルタ（以下、フィルタを「ＣＮＮ」とする。ＣＮＮ以外のもの、例えば補間に用いるパラメータの最小二乗推定結果に基づく処理、であるとしても同様である。）を用いてアップサンプルしたのでは、満足の行く結果が得られない場合がある。 However, ignoring the characteristics of these data, one filter (hereinafter referred to as “CNN”. Even if it is processing other than CNN, for example, processing based on the least square estimation result of parameters used for interpolation) In some cases, satisfactory results may not be obtained.

例えば、画像について、低周波成分を多く含む画像を多く用いて機械学習したＣＮＮにより高周波成分を多く含む画像を超解像度処理した場合には、高周波成分の再現性が損なわれることが考えられる。また、高周波成分を多く含む画像を多く用いて機械学習したＣＮＮにより低周波成分を多く含む画像を超解像度処理した場合には、高周波ノイズが発生したりすることが考えられる。 For example, when an image containing many high-frequency components is subjected to super-resolution processing by CNN that has been machine-learned using many images containing many low-frequency components, the reproducibility of high-frequency components may be impaired. Further, when an image containing a large amount of low-frequency components is subjected to super-resolution processing by machine learning using a large number of images containing a large amount of high-frequency components, high-frequency noise may be generated.

また、音響については、アップサンプルは、低音域に影響せず、必然的に高音域を強調又は非強調することとなるが、高音域をあまり強調すべきでない音響（例えばバス又はバリトンの歌手の歌唱）を多く用いて機械学習したＣＮＮにより高音域を強調すべき音響（例えばバイオリンの演奏）をアップサンプル処理した場合には、高音が伸びずに音色が豊かにならないことが考えられる。また、高音域を強調すべき音響を多く用いて機械学習したＣＮＮにより、高音域をあまり強調すべきでない音響をアップサンプル処理した場合には、強調された高音により不自然な音色となってしまうことが考えられる。 Also, for acoustics, upsampling does not affect the low range and will inevitably emphasize or de-emphasize the high range, but it should not emphasize the high range very much (eg bass or baritone singer). When a sound (for example, a performance of a violin) that should emphasize the high frequency range is upsampled by CNN that is machine-learned using a lot of singing), it is conceivable that the high tone does not grow and the tone does not become rich. In addition, when CNN that has been machine-learned using a large amount of sound that should emphasize the high range is subjected to upsampling for sounds that should not emphasize the high range so much, an unnatural timbre will result due to the emphasized high tone. It is possible.

そのうえ、複数のＣＮＮのプリセットの中からユーザが最適なものを選択するのは一般的に困難である。このため、従来、画像の超解像度処理においては、上述のように多くの種類の学習用画像を利用してＣＮＮを作成せざるを得なかった。多くの種類の学習用画像を利用する場合には、学習用画像の枚数が膨大となるため、機械学習に時間を要し、ＣＮＮの構成も複雑なものとなる。画像以外のデータについても同様と考えられる。 In addition, it is generally difficult for the user to select an optimal one from a plurality of CNN presets. For this reason, conventionally, in the super-resolution processing of an image, it has been necessary to create a CNN using many types of learning images as described above. When many types of learning images are used, the number of learning images is enormous, so that time is required for machine learning and the configuration of the CNN is complicated. The same applies to data other than images.

本発明はこのような事情に鑑みてなされたものであり、高精度なアップサンプルデータを高速に作成することのできるアップサンプル処理装置、機器、コンピュータプログラム及びアップサンプル方法を提供することを目的とする。 The present invention has been made in view of such circumstances, and an object thereof is to provide an upsample processing apparatus, device, computer program, and upsample method that can create high-precision upsample data at high speed. To do.

上記課題を解決するため、本発明の一形態のアップサンプル装置は、１次元又は２次元以上の空間に対する値を前記空間のサンプル点における離散値として表現したデータについて、
第２サンプルレートの入力データをアップサンプルして、第３サンプルレートの出力データを得るためのアップサンプル装置であって、
第３サンプルレートのデータを含む学習用セットのサブセットである第１階層の複数の学習用サブセットの各々について、該学習用サブセットを用いて機械学習された、第２サンプルレートのデータを第３サンプルレートのデータにアップサンプルするための第１フィルタと、
前記第２サンプルレートのデータをダウンサンプルした第１サンプルレートのデータを前記第２サンプルレートにアップサンプルするための第２フィルタを取得する第１階層フィルタ取得部と、
前記第１階層フィルタ取得部が取得した各前記第２フィルタを用いて、前記第１サンプルレートのデータをアップサンプルして第２サンプルレートの候補データを作成する候補データ作成部と、
前記候補データ作成部が作成した前記候補データと前記入力データとの差分が最小となる第２フィルタに対応する第１フィルタを用いて、前記入力データをアップサンプルすることにより、第３サンプルレートの出力データを作成する出力用アップサンプル部と
を備える。 In order to solve the above-described problem, an upsampling apparatus according to an aspect of the present invention relates to data representing a value for a one-dimensional or two-dimensional space as a discrete value at a sample point in the space.
An upsampling device for upsampling input data of a second sample rate to obtain output data of a third sample rate,
For each of the plurality of learning subsets in the first hierarchy, which is a subset of the learning set including the data of the third sample rate, the data of the second sample rate that has been machine-learned using the learning subset is the third sample. A first filter for upsampling to rate data;
A first layer filter acquisition unit for acquiring a second filter for up-sampling data of a first sample rate obtained by down-sampling data of the second sample rate to the second sample rate;
A candidate data creation unit that upsamples the data of the first sample rate and creates candidate data of the second sample rate by using the second filters obtained by the first hierarchy filter obtaining unit;
By up-sampling the input data using a first filter corresponding to a second filter that minimizes the difference between the candidate data created by the candidate data creation unit and the input data, a third sample rate And an output upsampling unit for creating output data.

この構成によると、第２サンプルレートの候補データと第２サンプルレートの入力データとの差分が最小となる第２フィルタを複数の第２フィルタの中から選択することができる。また、選択した第２フィルタに対応する第１フィルタを用いて、入力データをアップサンプルすることができる。第２フィルタは、第１フィルタと同じ学習用サブセットを用いて機械学習されているため、第１フィルタと同様の性質を有し、かつ、第２フィルタが対象とする入力データのサンプルレートは第１フィルタが対象とする入力データのサンプルレートよりも小さい。このため、入力データに最適な性質を有する第２フィルタを高速に選択することができ、選択した第２フィルタに対応した第１フィルタを用いて入力データにアップサンプル処理を実行することで、高精度なアップサンプルデータ（出力データ）を高速に作成することができる。なお、「アップサンプル装置」には、当然に「超解像度処理装置」が含まれる。 According to this configuration, the second filter that minimizes the difference between the second sample rate candidate data and the second sample rate input data can be selected from the plurality of second filters. Also, the input data can be upsampled using the first filter corresponding to the selected second filter. Since the second filter is machine-learned using the same learning subset as the first filter, it has the same properties as the first filter, and the sample rate of the input data targeted by the second filter is the first. One filter is smaller than the sample rate of the target input data. For this reason, the second filter having the optimal property for the input data can be selected at high speed, and by performing the upsampling process on the input data using the first filter corresponding to the selected second filter, Accurate upsampled data (output data) can be created at high speed. The “upsampling apparatus” naturally includes “super-resolution processing apparatus”.

本発明のある形態のアップサンプル装置は、３次元の空間に対する値をアップサンプルする。 One form of upsampling apparatus of the present invention upsamples values for a three-dimensional space.

この構成によると、動画データ、ＣＴ画像データ、その他の３次元の空間に係るデータに対してアップサンプルを行うことができる。 According to this configuration, upsampling can be performed on moving image data, CT image data, and other data relating to a three-dimensional space.

本発明の好ましいある形態のアップサンプル装置は、
前記差分が最小となる前記第２フィルタを機械学習するのに用いた前記第１階層の学習用サブセットのサブセットである第２階層の複数の学習用サブセットの各々について、該学習用サブセットを用いて機械学習された第１フィルタおよび第２フィルタを取得する第２階層フィルタ取得部を備え、
前記候補データ作成部は、前記第２階層フィルタ取得部が取得した前記第２階層の各前記第２フィルタを用いて、前記第１サンプルレートのデータをアップサンプルして前記候補データを作成し、
前記出力用アップサンプル部は、前記候補データ作成部が前記第２階層の前記第２フィルタを用いて作成した前記候補データと前記入力データとの差分が最小となる第２フィルタに対応する第１フィルタを用いて、前記入力データをアップサンプルすることにより、第３サンプルレートの出力データを作成する。 A preferred form of the upsampling device of the present invention is as follows:
For each of a plurality of learning subsets in the second hierarchy, which is a subset of the learning subset in the first hierarchy used for machine learning of the second filter with the smallest difference, using the learning subset A second hierarchical filter acquisition unit that acquires the first and second machine-learned filters;
The candidate data creation unit uses the second filters of the second hierarchy acquired by the second hierarchy filter acquisition unit to upsample the data of the first sample rate to create the candidate data,
The output upsampling unit corresponds to a first filter corresponding to a second filter that minimizes a difference between the candidate data created by the candidate data creation unit using the second filter of the second hierarchy and the input data. Output data at the third sample rate is created by up-sampling the input data using a filter.

この構成によると、第１階層において第２フィルタを選択し、選択した第２フィルタに基づいて、さらに、第２階層における第２フィルタを選択することができる。また、選択した第２フィルタに対応する第１フィルタを用いて、入力データをアップサンプル処理することができる。このように、第１フィルタおよび第２フィルタを階層構造化することができるため、効率的に第２フィルタおよび該第２フィルタに対応する第１フィルタを選択することができる。これにより、高精度な出力データを高速に作成することができる。 According to this configuration, it is possible to select the second filter in the first hierarchy, and further select the second filter in the second hierarchy based on the selected second filter. Further, the input data can be upsampled using the first filter corresponding to the selected second filter. Thus, since the first filter and the second filter can be hierarchically structured, the second filter and the first filter corresponding to the second filter can be efficiently selected. Thereby, highly accurate output data can be created at high speed.

本発明のコンピュータプログラムは、
コンピュータを本発明のアップサンプル装置として動作させる。 The computer program of the present invention is:
A computer is operated as the upsampling apparatus of the present invention.

この構成によると、本発明のアップサンプル装置がコンピュータプログラムとして実装される。 According to this configuration, the upsampling apparatus of the present invention is implemented as a computer program.

本発明のある形態のコンピュータシステムは、
本発明のアップサンプル装置又はコンピュータプログラムをを備え、
前記アップサンプル装置又は前記コンピュータプログラムに向けて前記入力データを送信し、前記出力データを受信して利用することを特徴とする。 A computer system according to an aspect of the present invention includes:
Comprising the upsampling apparatus or computer program of the present invention ,
The input data is transmitted to the upsampling apparatus or the computer program , and the output data is received and used.

この構成によると、コンピュータプログラムによって処理されたデータに対するアップサンプル処理が当然に行えるとともに、コンピュータシステムがアップサンプルを必要とするときに本発明のアップサンプル装置又はコンピュータプログラムを活用することができる。本発明のアップサンプル装置又はコンピュータプログラムを活用したコンピュータシステムを構築することができる。 According to this configuration, the upsampling process can be naturally performed on the data processed by the computer program, and the upsampling apparatus or the computer program of the present invention can be utilized when the computer system needs the upsampling. A computer system using the upsampling apparatus or computer program of the present invention can be constructed.

本発明の機器は、
本発明のアップサンプル装置又はコンピュータプログラムをを備え、
前記アップサンプル装置又は前記コンピュータプログラムに向けて前記入力データを送信する機器であって、テレビジョン機器、カメラ、計測装置、スマートフォン、ゲーム機器、オーディオ機器、その他の機器である。 The device of the present invention
Comprising the upsampling apparatus or computer program of the present invention ,
A device that transmits the input data to the upsampling device or the computer program , such as a television device, a camera, a measurement device, a smartphone, a game device, an audio device, and other devices.

この構成によると、種種の機器が、本発明のアップサンプル装置を活用して動作する。 According to this configuration, various devices operate using the upsampling apparatus of the present invention.

本発明のアップサンプル方法は、
１次元又は２次元以上の空間に対する値を前記空間のサンプル点における離散値として表現したデータについて、
第２サンプルレートの入力データをアップサンプルして、第３サンプルレートの出力データを得るためのアップサンプル装置又はコンピュータプログラムを機能させるための方法であって、
第３サンプルレートのデータを含む学習用セットのサブセットである第１階層の複数の学習用サブセットの各々について、該学習用サブセットを用いて機械学習された、第２サンプルレートのデータを第３サンプルレートのデータにアップサンプルするための第１フィルタを取得するステップと、
前記第２サンプルレートのデータをダウンサンプルした第１サンプルレートのデータを前記第２サンプルレートにアップサンプルするための第２フィルタを取得するステップと、
前記第１階層フィルタ取得部が取得した各前記第２フィルタを用いて、前記第１サンプルレートのデータをアップサンプルして第２サンプルレートの候補データを作成するステップと、
前記候補データと前記入力データとの差分が最小となる第２フィルタに対応する第１フィルタを用いて、前記入力データをアップサンプルすることにより、第３サンプルレートの出力データを作成するステップと
を備える。 The upsampling method of the present invention comprises:
For data expressing a value for a space of one dimension or two or more dimensions as a discrete value at a sampling point of the space,
A method for causing an upsampling device or computer program to upsample input data at a second sample rate to obtain output data at a third sample rate, comprising:
For each of the plurality of learning subsets in the first hierarchy, which is a subset of the learning set including the data of the third sample rate, the data of the second sample rate that has been machine-learned using the learning subset is the third sample. Obtaining a first filter for upsampling to rate data;
Obtaining a second filter for up-sampling the data of the first sample rate down-sampled to the second sample rate to the second sample rate;
Using the second filters acquired by the first hierarchical filter acquisition unit to upsample the data of the first sample rate to create candidate data of the second sample rate;
Creating output data at a third sample rate by up-sampling the input data using a first filter corresponding to a second filter that minimizes the difference between the candidate data and the input data; Prepare.

この構成によると、上述のアップサンプル装置の処理手順を方法として活用し、必ずしも装置でない（例えば半導体集積回路）ものとして実装することができる。すなわち、第２サンプルレートの候補データと第２サンプルレートの入力データとの差分が最小となる第２フィルタを複数の第２フィルタの中から選択することができる。また、選択した第２フィルタに対応する第１フィルタを用いて、入力データをアップサンプルすることができる。第２フィルタは、第１フィルタと同じ学習用サブセットを用いて機械学習されているため、第１フィルタと同様の性質を有し、かつ、第２フィルタが対象とする入力データサンプルレートは第１フィルタが対象とする入力データサンプルレートよりも小さい。このため、入力データに最適な性質を有する第２フィルタを高速に選択することができ、選択した第２フィルタに対応した第１フィルタを用いて入力データにアップサンプル処理を実行することで、高精度なアップサンプルデータ（出力データ）を高速に作成することができる。なお、「アップサンプル方法」には、当然に「超解像度処理方法」が含まれる。 According to this configuration, the processing procedure of the upsampling apparatus described above can be used as a method, and can be mounted as a device that is not necessarily a device (for example, a semiconductor integrated circuit). That is, the second filter that minimizes the difference between the candidate data of the second sample rate and the input data of the second sample rate can be selected from the plurality of second filters. Also, the input data can be upsampled using the first filter corresponding to the selected second filter. Since the second filter is machine-learned using the same learning subset as the first filter, it has the same properties as the first filter, and the input data sample rate targeted by the second filter is the first. Less than the input data sample rate targeted by the filter. For this reason, the second filter having the optimal property for the input data can be selected at high speed, and by performing the upsampling process on the input data using the first filter corresponding to the selected second filter, Accurate upsampled data (output data) can be created at high speed. The “upsampling method” naturally includes a “super-resolution processing method”.

本発明のアップサンプル方法は、
前記差分が最小となる前記第２フィルタを機械学習するのに用いた前記第１階層の学習用サブセットのサブセットである第２階層の複数の学習用サブセットの各々について、該学習用サブセットを用いて機械学習された第１フィルタおよび第２フィルタを取得するステップをさらに備え、
前記候補データ作成するステップは、前記第２フィルタを取得するステップにおいて取得された前記第２階層の各前記第２フィルタを用いて、前記第１サンプルレートのデータをアップサンプルして前記候補データを作成し、
前記第３サンプルレートの出力データを作成するステップは、前記候補データを作成するステップにおいて前記第２階層の前記第２フィルタを用いて作成された前記候補データと前記入力データとの差分が最小となる第２フィルタに対応する第１フィルタを用いて、前記入力データをアップサンプルすることにより、第３サンプルレートの出力データを作成する。 The upsampling method of the present invention comprises:
For each of a plurality of learning subsets in the second hierarchy, which is a subset of the learning subset in the first hierarchy used for machine learning of the second filter with the smallest difference, using the learning subset Obtaining a machine-learned first filter and a second filter;
The step of creating candidate data up-samples the data of the first sample rate by using the second filters of the second hierarchy acquired in the step of acquiring the second filter, make,
Step includes: a difference between the candidates data and the input data that was created using the second filter of the second hierarchy in the step of preparing the candidate data is minimized to create an output data of said third sample rate Using the first filter corresponding to the second filter, the input data is upsampled to generate output data at the third sample rate.

この構成によると、第１階層において第２フィルタを選択し、選択した第２フィルタに基づいて、さらに、第２階層における第２フィルタを選択することができる。また、選択した第２フィルタに対応する第１フィルタを用いて、入力データをアップサンプル処理することができる。このように、第１フィルタおよび第２フィルタを階層構造化することができるため、効率的に第２フィルタおよび該第２フィルタに対応する第１フィルタを選択することができる。これにより、より高精度な出力データをより高速に作成することができる。 According to this configuration, it is possible to select the second filter in the first hierarchy, and further select the second filter in the second hierarchy based on the selected second filter. Further, the input data can be upsampled using the first filter corresponding to the selected second filter. Thus, since the first filter and the second filter can be hierarchically structured, the second filter and the first filter corresponding to the second filter can be efficiently selected. As a result, more accurate output data can be created at a higher speed.

本発明のアップサンプル装置、コンピュータプログラム及びアップサンプル方法によれば、高精度な出力データを高速に作成することができる。 According to the upsampling apparatus, computer program, and upsampling method of the present invention, highly accurate output data can be created at high speed.

本発明のコンピュータシステム及び機器によれば、アップサンプルデータ（出力データ）を活用することができる。 According to the computer system and apparatus of the present invention, upsampled data (output data) can be utilized.

図１は、アップサンプル処理装置の構成を示すブロック図である。FIG. 1 is a block diagram showing the configuration of the upsample processing apparatus. 図２は、データのサイズおよびフィルタを説明するための図である。FIG. 2 is a diagram for explaining the data size and the filter. 図３は、階層構造化された学習用セットおよびフィルタの一例を示す図である。FIG. 3 is a diagram illustrating an example of a learning set and filter having a hierarchical structure. 図４は、アップサンプル処理装置が実行するフィルタ作成処理の処理手順を示すフローチャートである。FIG. 4 is a flowchart showing a processing procedure of filter creation processing executed by the upsample processing apparatus. 図５は、アップサンプル処理装置が実行する入力データのアップサンプル処理の処理手順を示すフローチャートである。FIG. 5 is a flowchart showing a processing procedure of input data upsampling processing executed by the upsampling processing apparatus. 図６は、アップサンプル処理装置を活用するコンピュータシステム及び機器示す図である。FIG. 6 is a diagram illustrating a computer system and devices that utilize the upsample processing apparatus.

以下、本発明の実施の形態について、図面を用いて詳細に説明する。なお、以下で説明する実施の形態は、いずれも本発明の好ましい一具体例を示すものである。以下の実施の形態で示される数値、形状、構成要素、構成要素の配置位置および接続形態、ステップ、ステップの順序などは、一例であり、本発明を限定する主旨ではない。本発明は、特許請求の範囲によって特定される。よって、以下の実施の形態における構成要素のうち、本発明の最上位概念を示す独立請求項に記載されていない構成要素については、本発明の課題を達成するのに必ずしも必要ではないが、より好ましい形態を構成するものとして説明される。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. Each of the embodiments described below shows a preferred specific example of the present invention. Numerical values, shapes, components, arrangement positions and connection forms of components, steps, order of steps, and the like shown in the following embodiments are merely examples, and are not intended to limit the present invention. The invention is specified by the claims. Therefore, among the constituent elements in the following embodiments, constituent elements that are not described in the independent claims indicating the highest concept of the present invention are not necessarily required to achieve the object of the present invention. It will be described as constituting a preferred form.

図１は、本発明の実施の形態に係るアップサンプル処理装置の構成を示すブロック図である。アップサンプル処理装置１は、入力データに対してアップサンプル処理を施し、入力データよりもサンプルレートの高いデータを作成する装置であり、入力データ取得部１０と、縮小データ作成部１１と、機械学習部１２と、第１階層フィルタ取得部１３と、候補データ作成部１４と、第２階層フィルタ取得部１５と、アップサンプル処理部１６と、記憶装置１７とを備える。以下、アップサンプル処理装置１の行う処理について説明する。 FIG. 1 is a block diagram showing the configuration of the upsample processing apparatus according to the embodiment of the present invention. The upsampling processing device 1 is a device that performs upsampling processing on input data and creates data having a higher sample rate than the input data. The upsampling processing device 1 includes an input data acquisition unit 10, a reduced data creation unit 11, and machine learning. Unit 12, first layer filter acquisition unit 13, candidate data creation unit 14, second layer filter acquisition unit 15, upsample processing unit 16, and storage device 17. Hereinafter, processing performed by the upsample processing apparatus 1 will be described.

入力データ取得部１０は、アップサンプル処理の対象となる入力データを取得する。入力データ取得部１０は、例えば、記憶装置１７に記憶されているデータの中からユーザがキーボード等を用いて選択したデータを、記憶装置１７から読み出すことにより、当該データを入力データとして取得してもよい。また、入力データ取得部１０は、ネットワーク等を介してデータをダウンロードすることにより、当該データを入力データとして取得してもよい。 The input data acquisition unit 10 acquires input data to be subjected to upsampling processing. For example, the input data acquisition unit 10 reads out data selected by the user from the data stored in the storage device 17 using a keyboard or the like from the storage device 17 to acquire the data as input data. Also good. The input data acquisition unit 10 may acquire the data as input data by downloading the data via a network or the like.

縮小データ作成部１１は、入力データ取得部１０が取得した入力データをダウンサンプルすることにより縮小データを作成する。ここで、ダウンサンプルは任意の方法で、データの性質に応じたものとすることができる。整数比のダウンサンプリングであれば単純に間引く（データの一部を削除する）こととしてもよく、非整数比のダウンサンプリングであればデータ間で補間（例えば線形補間）を行ってもよく、これら以外の方法を用いてもよい（例えば音響データであればダウンサンプリング後のサンプリング周波数以下の周波数特性をダウンサンプル前のデータと同一にしてもよい）。 The reduced data creation unit 11 creates reduced data by down-sampling the input data acquired by the input data acquisition unit 10. Here, down-sampling can be performed in an arbitrary manner and according to the nature of the data. If it is an integer ratio downsampling, it may be simply thinned out (a part of the data is deleted), and if it is a non-integer ratio downsampling, interpolation (eg linear interpolation) may be performed between the data. (For example, in the case of acoustic data, the frequency characteristics below the sampling frequency after down-sampling may be the same as the data before down-sampling).

なお、縮小データ作成部１１は、入力データ以外にも、アップサンプル処理のためのフィルタを機械学習するために用意された学習用データもダウンサンプルする。 In addition to the input data, the reduced data creation unit 11 downsamples learning data prepared for machine learning of a filter for upsampling processing.

記憶装置１７には、予め第３サンプルレートの複数の学習用データを含む学習用セットが記憶されているものとする。また、学習用セットには、様々な種類の学習用データが含まれるものとする。縮小データ作成部１１は、記憶装置１７から学習用セットを読み出し、学習用セットに含まれる各学習用データを、第２サンプルレートにダウンサンプルした縮小データと、第１サンプルレートにダウンサンプルした縮小データとを作成する。 It is assumed that a learning set including a plurality of pieces of learning data at the third sample rate is stored in the storage device 17 in advance. The learning set includes various types of learning data. The reduced data creation unit 11 reads the learning set from the storage device 17, reduces the learning data included in the learning set to downsampled to the second sample rate, and reduced downsampled to the first sample rate. Create data and.

以下、学習用データから作成した第２サンプルレートの縮小データおよび第２サンプルレートの縮小データのことも「学習用データ」と呼ぶ。また、学習用セットには、複数の第３サンプルレートの学習用画像のみならず、これらから作成した第２サンプルレートの学習用画像および第１サンプルレートの学習用画像が含まれるものとする。 Hereinafter, the reduced data of the second sample rate and the reduced data of the second sample rate created from the learning data are also referred to as “learning data”. In addition, the learning set includes not only a plurality of learning images at the third sample rate but also a learning image at the second sample rate and a learning image at the first sample rate created from them.

機械学習部１２は、学習用セットに含まれるデータを用いて、アップサンプル処理を行うためのフィルタを機械学習により作成する。なお、機械学習部１２は、学習用セットに含まれるデータを階層構造化して、各階層において、フィルタを作成する。 The machine learning unit 12 creates a filter for performing the upsampling process by machine learning using data included in the learning set. Note that the machine learning unit 12 hierarchically structures the data included in the learning set, and creates a filter in each hierarchy.

図３は、階層構造化された学習用セットおよびフィルタの一例を示す図である。つまり、機械学習部１２は、以下のような手順により、階層構造化された学習用セットおよびフィルタを作成する。 FIG. 3 is a diagram illustrating an example of a learning set and filter having a hierarchical structure. In other words, the machine learning unit 12 creates learning sets and filters that are hierarchically structured in the following procedure.

まず、機械学習部１２は、学習用セットに含まれる第２サンプルレートの学習用データと、第３サンプルレートの学習用データとを用いて、第２サンプルレートのデータを第３サンプルレートのデータにアップサンプル処理するためのＣＮＮ（Convolutional Neural Network）を作成する。図２に示すように、第２サンプルレートのデータを第３サンプルレートのデータにアップサンプル処理するＣＮＮを、第１ＣＮＮまたは第１フィルタと言う。機械学習部１２は、第２サンプルレートの学習用データをＣＮＮの入力とし、第３サンプルレートの学習用データをＣＮＮの出力として、ＣＮＮの各層における重みおよびバイアスを機械学習することで、第１ＣＮＮを作成する。学習用セットを用いて作成した第１ＣＮＮを、図３に示すように第１ＣＮＮ０と言う。 First, the machine learning unit 12 uses the second sample rate learning data and the third sample rate learning data included in the learning set to convert the second sample rate data into the third sample rate data. A CNN (Convolutional Neural Network) for upsampling processing is created. As shown in FIG. 2, the CNN that upsamples the data of the second sample rate into the data of the third sample rate is referred to as a first CNN or a first filter. The machine learning unit 12 uses the learning data of the second sample rate as the input of the CNN, the learning data of the third sample rate as the output of the CNN, and performs machine learning of the weights and biases in each layer of the CNN, thereby performing the first CNN. Create The first CNN created using the learning set is referred to as the first CNN0 as shown in FIG.

なお、本実施の形態では、機械学習部１２は、アップサンプル処理するためのフィルタとしてＣＮＮを例に説明するが、フィルタはＣＮＮに限定されるものではなく、複数の学習用データから機械学習されるその他のフィルタに対しても本発明を適用することができる。 In the present embodiment, the machine learning unit 12 will be described using CNN as an example of a filter for upsampling processing, but the filter is not limited to CNN, and machine learning is performed from a plurality of learning data. The present invention can also be applied to other filters.

ＣＮＮ以外のフィルタとしては、例えば、移動平均、多項式近似等のパラメータを有する近似曲線を用い、そのパラメータの値を学習するものが考えられる。 As a filter other than CNN, for example, an approximate curve having parameters such as moving average and polynomial approximation may be used, and the value of the parameter may be learned.

次に、機械学習部１２は、第１ＣＮＮ０を用いて、学習用セットを３つのサブセット（学習用サブセット１〜３）に分類する。例えば、機械学習部１２は、第１ＣＮＮ０によりアップサンプル処理されたデータと、学習用データとの差分に基づいて、上記分類を行う。 Next, the machine learning unit 12 classifies the learning set into three subsets (learning subsets 1 to 3) using the first CNN0. For example, the machine learning unit 12 performs the classification based on the difference between the data up-sampled by the first CNN0 and the learning data.

具体的には、機械学習部１２は、学習用セットに含まれる第２サンプルレートの各学習用データを第１ＣＮＮ０に入力することにより、第３サンプルレートのデータを作成する。 Specifically, the machine learning unit 12 creates data of the third sample rate by inputting each learning data of the second sample rate included in the learning set to the first CNN0.

次に、機械学習部１２は、第１ＣＮＮ０が作成した第３サンプルレートのデータと、上記第２サンプルレートの学習用データを作成するのに用いた第３サンプルレートの学習用データとの差分を算出する。データ間の差分は、例えば、個々のデータ値の差分の二乗和または絶対値和により求めることができる。なお、三乗和の絶対値、その他の基準を用いてもよい。 Next, the machine learning unit 12 calculates a difference between the third sample rate data created by the first CNN0 and the third sample rate learning data used to create the second sample rate learning data. calculate. The difference between the data can be obtained by, for example, a sum of squares or a sum of absolute values of differences between individual data values. In addition, you may use the absolute value of a cube sum, and another reference | standard.

ここで、データ間の差分は、必ずしも個々のデータ値の差分に基づいて求めなくともよい。例えば、音響データであれば、適宜に窓掛け（例えばハニング窓）しフーリエ変換をした結果の周波数特性の差分に基づいて求めることが考えられる。要するに、データの使用目的に合わせてデータ間の差分を任意に定義してよい。 Here, the difference between the data does not necessarily have to be obtained based on the difference between the individual data values. For example, in the case of acoustic data, it is conceivable to obtain it based on a difference in frequency characteristics as a result of performing a Fourier transform by appropriately applying a window (for example, Hanning window). In short, a difference between data may be arbitrarily defined according to the purpose of use of the data.

次に、機械学習部１２は、算出した差分に基づいて、第３サンプルレートの学習用データを３つのサブセットのいずれかに分類する。 Next, the machine learning unit 12 classifies the learning data at the third sample rate into one of the three subsets based on the calculated difference.

例えば、機械学習部１２は、差分が第１閾値よりも大きい第３サンプルレートの学習用データを学習用サブセット１に分類する。また、機械学習部１２は、差分が第２閾値（ただし、第２閾値は第１閾値よりも小さい値）よりも大きいく、かつ第１閾値以下の第３サンプルレートの学習用データを学習用サブセット２に分類する。また、機械学習部１２は、差分が第２閾値以下の第３サンプルレートの学習用データを学習用サブセット３に分類する。なお、機械学習部１２は、第３サンプルレートの学習用データから縮小データ作成部１１が作成した第２サンプルレートの学習用データおよび第１サンプルレートの学習用データも、第３サンプルレートの学習用データと同じ学習用サブセットに分類する。これにより、図３に示すように、学習用セットが学習用サブセット１〜３に分類される。 For example, the machine learning unit 12 classifies the learning data having the third sample rate whose difference is larger than the first threshold into the learning subset 1. In addition, the machine learning unit 12 uses learning data for learning at a third sample rate whose difference is greater than a second threshold (where the second threshold is smaller than the first threshold) and equal to or less than the first threshold. Classify into subset 2. Further, the machine learning unit 12 classifies the learning data having the third sample rate whose difference is equal to or smaller than the second threshold into the learning subset 3. The machine learning unit 12 also learns the second sample rate learning data and the first sample rate learning data created by the reduced data creation unit 11 from the third sample rate learning data. Classify into the same learning subset as the training data. Thereby, as shown in FIG. 3, the learning set is classified into learning subsets 1 to 3.

上記の差分は、アップサンプル処理されたデータの誤差を示しているため、学習用セットが誤差に基づいて、複数の学習用サブセットに分類されることになる。 Since the above difference indicates an error of the upsampled data, the learning set is classified into a plurality of learning subsets based on the error.

機械学習部１２は、学習用サブセット１〜３のそれぞれを用いて、第１ＣＮＮと、第１サンプルレートのデータを第２サンプルレートのデータにアップサンプル処理するＣＮＮ（第２ＣＮＮまたは第２フィルタと言う）を作成する。 The machine learning unit 12 uses each of the learning subsets 1 to 3 to perform a first CNN and a CNN (second CNN or second filter) that upsamples the first sample rate data to the second sample rate data. ).

例えば、機械学習部１２は、学習用サブセット１に含まれる第２サンプルレートの学習用データをＣＮＮの入力とし、学習用サブセット１に含まれる第３サンプルレートの学習用データをＣＮＮの出力として、ＣＮＮの各層における重みおよびバイアスを機械学習することで、第１ＣＮＮを作成する。 For example, the machine learning unit 12 uses the learning data of the second sample rate included in the learning subset 1 as the input of the CNN, and the learning data of the third sample rate included in the learning subset 1 as the output of the CNN. A first CNN is created by machine learning of weights and biases in each layer of the CNN.

また、機械学習部１２は、学習用サブセット１に含まれる第１サンプルレートの学習用データをＣＮＮの入力とし、学習用サブセット１に含まれる第２サンプルレートの学習用データをＣＮＮの出力として、ＣＮＮの各層における重みおよびバイアスを機械学習することで、第２ＣＮＮを作成する。 Further, the machine learning unit 12 uses the learning data of the first sample rate included in the learning subset 1 as the input of the CNN, and the learning data of the second sample rate included in the learning subset 1 as the output of the CNN. A second CNN is created by machine learning of weights and biases in each layer of the CNN.

機械学習部１２は、学習用サブセット２および３についても、同様に、第１ＣＮＮおよび第２ＣＮＮを作成する。 The machine learning unit 12 similarly creates the first CNN and the second CNN for the learning subsets 2 and 3 as well.

学習用サブセットｉから作成された第１ＣＮＮをおよび第２ＣＮＮを、それぞれ、第１ＣＮＮｉおよび第２ＣＮＮｉと記載する（ｉ＝１〜３）。 The first CNN and the second CNN created from the learning subset i are described as the first CNNi and the second CNNi, respectively (i = 1 to 3).

機械学習部１２は、学習用サブセット１〜３のそれぞれを、さらに、３つのサブセットに分類する。 The machine learning unit 12 further classifies each of the learning subsets 1 to 3 into three subsets.

例えば、機械学習部１２は、第１ＣＮＮ１を用いて、学習用サブセット１を３つのサブセット（学習用サブセット１−１〜１−３）に分類する。分類方法は、学習用セットの分類方法と分類の対象が異なる以外同様である。このため、その詳細な説明はここでは繰り返さない。 For example, the machine learning unit 12 classifies the learning subset 1 into three subsets (learning subsets 1-1 to 1-3) using the first CNN1. The classification method is the same as that of the learning set except that the classification target is different. Therefore, detailed description thereof will not be repeated here.

また、機械学習部１２は、学習用サブセット１−１〜１−３のそれぞれについて、第１ＣＮＮおよび第２ＣＮＮを作成する。ＣＮＮの作成方法は、学習用サブセット１を用いた第１ＣＮＮおよび第２ＣＮＮの作成方法と、利用する学習用サブセットが異なる以外同様である。このため、その詳細な説明はここでは繰り返さない。 In addition, the machine learning unit 12 creates a first CNN and a second CNN for each of the learning subsets 1-1 to 1-3. The creation method of the CNN is the same as the creation method of the first CNN and the second CNN using the learning subset 1, except that the learning subset to be used is different. Therefore, detailed description thereof will not be repeated here.

機械学習部１２は、学習用サブセット２および３についても、学習用サブセットの分類処理および分類された学習用サブセットを用いたＣＮＮの作成処理を行う。 The machine learning unit 12 also performs learning subset classification processing and CNN creation processing using the classified learning subsets for the learning subsets 2 and 3.

学習用サブセットｉを分類することにより得られる学習用サブセットを、学習用サブセットｉ−ｊと記載する（ｉ＝１〜３，ｊ＝１〜３）。また、学習用サブセットｉ−ｊから作成された第１ＣＮＮおよび第２ＣＮＮを、それぞれ、第１ＣＮＮｉ−ｊおよび第２ＣＮＮｉ−ｊと記載する。 A learning subset obtained by classifying the learning subset i is referred to as a learning subset i-j (i = 1 to 3, j = 1 to 3). Further, the first CNN and the second CNN created from the learning subset i-j are referred to as a first CNNi-j and a second CNNi-j, respectively.

機械学習部１２が実行する処理により、図３に示すような解像構造化された学習用セットおよびＣＮＮの分類木が作成される。学習用セットおよび第１ＣＮＮ０を第０階層とすると、第１階層には、学習用セットｉ、第１ＣＮＮｉおよび第２ＣＮＮｉが含まれる。また、第２階層には、学習用セットｉ−ｊ、第１ＣＮＮｉ−ｊおよび第２ＣＮＮｉ−ｊが含まれる。 By the processing executed by the machine learning unit 12, a learning set and a CNN classification tree having a resolution structure as shown in FIG. 3 are created. Assuming that the learning set and the first CNN0 are the zeroth hierarchy, the first hierarchy includes the learning set i, the first CNNi, and the second CNNi. The second hierarchy includes a learning set i-j, a first CNNi-j, and a second CNNi-j.

以下に説明する第１階層フィルタ取得部１３〜アップサンプル処理部１６は、入力データをアップサンプル処理するための処理部である。 The first hierarchical filter acquisition unit 13 to the upsampling processing unit 16 described below are processing units for upsampling input data.

第１階層フィルタ取得部１３は、第１階層の学習用サブセット１〜３のそれぞれについて、第１ＣＮＮおよび第２ＣＮＮを取得する。 The first hierarchy filter acquisition unit 13 acquires the first CNN and the second CNN for each of the learning subsets 1 to 3 in the first hierarchy.

候補データ作成部１４は、第１階層フィルタ取得部１３が取得した第２ＣＮＮｉを用いて、入力データをダウンサンプルした第１サンプルレートの縮小データをアップサンプル処理し、第２サンプルレートのデータを作成する。作成した第２サンプルレートのデータを候補データと呼ぶ。つまり、候補データ作成部１４は、第１サンプルレートの縮小データを、第２ＣＮＮｉに入力することにより、３枚の第２サンプルレートの候補データを作成する。 The candidate data creation unit 14 uses the second CNNi obtained by the first hierarchical filter obtaining unit 13 to upsample the reduced data of the first sample rate obtained by down-sampling the input data, and creates data of the second sample rate. To do. The created second sample rate data is referred to as candidate data. In other words, the candidate data creation unit 14 creates three pieces of second sample rate candidate data by inputting the reduced data of the first sample rate to the second CNNi.

第２階層フィルタ取得部１５は、第２階層の複数の学習用サブセットｉ−ｊの各々について、学習用サブセットｉ−ｊを用いて機械学習された第１ＣＮＮｉ−ｊおよび第２ＣＮＮｉ−ｊを取得する（ｉ＝１〜３，ｊ＝１〜３）。 The second hierarchy filter acquisition unit 15 acquires the first CNNi-j and the second CNNi-j machine-learned using the learning subset i-j for each of the plurality of learning subsets i-j in the second hierarchy. (I = 1-3, j = 1-3).

候補データ作成部１４は、さらに、第２階層フィルタ取得部１５が取得した第２ＣＮＮｉ−ｊを用いて、入力データをダウンサンプルした第１サンプルレートの縮小データをアップサンプル処理し、第２サンプルレートのデータ（以下、「候補データ」という）を作成する。つまり、候補データ作成部１４は、第１サンプルレートの縮小データを、第２ＣＮＮｉ−ｊに入力することにより、第２サンプルレートの候補データを作成する。 The candidate data creation unit 14 further uses the second CNNi-j acquired by the second hierarchical filter acquisition unit 15 to upsample the reduced data of the first sample rate obtained by down-sampling the input data, thereby obtaining the second sample rate. Data (hereinafter referred to as “candidate data”). That is, the candidate data creation unit 14 creates candidate data for the second sample rate by inputting the reduced data for the first sample rate to the second CNNi-j.

アップサンプル処理部１６は、候補データ作成部１４が第２ＣＮＮｉから作成した３枚の第２サンプルレートの候補データのそれぞれと、第２サンプルレートの入力データとの差分を算出する。データ間の差分の算出方法については、上述した通りである。 The upsample processing unit 16 calculates a difference between each of the three pieces of second sample rate candidate data created from the second CNNi by the candidate data creation unit 14 and the input data of the second sample rate. The method for calculating the difference between data is as described above.

アップサンプル処理部１６は、算出した差分が最小となる候補データを作成するに用いた第２ＣＮＮｍを特定し、特定した第２ＣＮＮｍを機械学習するに用いた学習用サブセットｍを特定する。 The upsampling processing unit 16 specifies the second CNNm used to create candidate data with the smallest calculated difference, and specifies the learning subset m used for machine learning of the specified second CNNm.

アップサンプル処理部１６は、特定した学習用サブセットｍのサブセットである学習用サブセットｍ−ｊ（ｊ＝１〜３）を特定する。例えば、ｍ＝１の場合には、学習用サブセット１−１〜１−３が特定される。 The upsample processing unit 16 specifies a learning subset mj (j = 1 to 3) that is a subset of the specified learning subset m. For example, when m = 1, learning subsets 1-1 to 1-3 are specified.

アップサンプル処理部１６は、さらに、学習用サブセットｍ−ｊ（ｊ＝１〜３）の第２ＣＮＮｍ−ｊを用いて作成された３枚の候補データの中から、第２サンプルレートの入力データとの差分が最小となる候補データを特定する。 The up-sampling processing unit 16 further selects input data of the second sample rate from among the three candidate data created using the second CNNm-j of the learning subset m-j (j = 1 to 3). The candidate data with the smallest difference is identified.

アップサンプル処理部１６は、特定した候補データを作成するのに用いた第２ＣＮＮｍ−ｎに対応する第１ＣＮＮｍ−ｎを用いて、第２サンプルレートの入力データをアップサンプル処理することにより、第３サンプルレートのアップサンプルデータを作成する。 The up-sampling processing unit 16 uses the first CNNm-n corresponding to the second CNNm-n used to create the identified candidate data to upsample the second sample rate input data, thereby Create sample rate upsampled data.

記憶装置１７は、データや各種データを記憶するための記憶装置であり、ＨＤＤ（Hard Disk Drive）、不揮発性メモリまたは揮発性メモリなどにより構成される。
次に、アップサンプル処理装置１が実行する処理の手順について説明する。 The storage device 17 is a storage device for storing data and various types of data, and includes a hard disk drive (HDD), a nonvolatile memory, a volatile memory, or the like.
Next, a procedure of processing executed by the upsample processing apparatus 1 will be described.

図４は、本発明の実施の形態に係るアップサンプル処理装置１が実行するフィルタ作成処理の処理手順を示すフローチャートである。 FIG. 4 is a flowchart showing the processing procedure of the filter creation process executed by the upsample processing apparatus 1 according to the embodiment of the present invention.

図４に示すように、縮小データ作成部１１は、記憶装置１７に記憶されている学習用セットに含まれる第３サンプルレートの各学習用データから、当該データを、縦および横それぞれ１／２にダウンサンプルした第２サンプルレートの学習用データと、縦および横それぞれ１／４にダウンサンプルした第１サンプルレートの学習用データとを作成する（Ｓ１）。 As shown in FIG. 4, the reduced data creation unit 11 divides the data from the learning data of the third sample rate included in the learning set stored in the storage device 17 by 1/2 each of the vertical and horizontal data. The learning data of the second sample rate down-sampled to 1 and the learning data of the first sample rate down-sampled to 1/4 each of the vertical and horizontal are created (S1).

機械学習部１２は、学習用セットに含まれる第３サンプルレートの学習用データと第２サンプルレートの縮小データとに基づいて、機械学習により、第２サンプルレートのデータを第３サンプルレートのデータに変換するための第１フィルタ（第１ＣＮＮ０）を作成する（Ｓ２）。 The machine learning unit 12 converts the second sample rate data to the third sample rate data by machine learning based on the third sample rate learning data and the second sample rate reduced data included in the learning set. A first filter (first CNN0) for converting to (S2) is created.

機械学習部１２は、学習用セットに含まれる第２サンプルレートの各学習用データを第１ＣＮＮ０に入力することにより、第３サンプルレートのデータを作成する。次に、機械学習部１２は、第１ＣＮＮ０が作成した第３サンプルレートのデータと、上記第２サンプルレートの学習用データを作成するのに用いた第３サンプルレートの学習用データとの差分を算出する。機械学習部１２は、算出した差分に基づいて、第３サンプルレートの学習用データを３つの学習用サブセット１〜３のいずれかに分類する（Ｓ３）。 The machine learning unit 12 creates data of the third sample rate by inputting each learning data of the second sample rate included in the learning set to the first CNN0. Next, the machine learning unit 12 calculates a difference between the third sample rate data created by the first CNN0 and the third sample rate learning data used to create the second sample rate learning data. calculate. Based on the calculated difference, the machine learning unit 12 classifies the learning data at the third sample rate into one of the three learning subsets 1 to 3 (S3).

機械学習部１２は、学習用サブセットｉ（ｉ＝１〜３）を用いて、第１ＣＮＮｉと、第２ＣＮＮｉを作成する（Ｓ４）。機械学習部１２は、作成したＣＮＮを、記憶装置１７に記憶させる。なお、機械学習部１２は、学習用サブセットｉ（ｉ＝１〜３）を適宜交換しながら、機械学習を繰り返し実行することにより、各学習用サブセットとの誤差が小さくなるように第１ＣＮＮｉおよび第２ＣＮＮｉを作成してもよい。 The machine learning unit 12 creates the first CNNi and the second CNNi using the learning subset i (i = 1 to 3) (S4). The machine learning unit 12 stores the created CNN in the storage device 17. The machine learning unit 12 repeatedly executes machine learning while appropriately replacing the learning subset i (i = 1 to 3), so that the error with each learning subset is reduced. 2CNNi may be created.

機械学習部１２は、学習用サブセット１〜３のそれぞれについて、学習用サブセットｉ（ｉ＝１〜３）に含まれる第２サンプルレートの各学習用データを第１ＣＮＮｉに入力することにより、第３サンプルレートのデータを作成する。次に、機械学習部１２は、第１ＣＮＮｉが作成した第３サンプルレートのデータと、上記第２サンプルレートの学習用データを作成するのに用いた第３サンプルレートの学習用データとの差分を算出する。機械学習部１２は、算出した差分に基づいて、第３サンプルレートの学習用データを３つの学習用サブセットｉ−ｊ（ｊ＝１〜３）のいずれかに分類する（Ｓ５）。これにより、学習用サブセット１〜３の各々が、さらに、３つの学習用サブセットに分類される。 For each of the learning subsets 1 to 3, the machine learning unit 12 inputs each learning data of the second sample rate included in the learning subset i (i = 1 to 3) into the first CNNi, thereby Create sample rate data. Next, the machine learning unit 12 calculates a difference between the third sample rate data created by the first CNNi and the third sample rate learning data used to create the second sample rate learning data. calculate. The machine learning unit 12 classifies the learning data at the third sample rate into one of the three learning subsets ij (j = 1 to 3) based on the calculated difference (S5). Thereby, each of the learning subsets 1 to 3 is further classified into three learning subsets.

機械学習部１２は、学習用サブセットｉ−ｊ（ｉ＝１〜３，ｊ＝１〜３）を用いて、機械学習により、第１ＣＮＮｉ−ｊと、第２ＣＮＮｉ−ｊを作成する（Ｓ６）。機械学習部１２は、作成したＣＮＮを、記憶装置１７に記憶させる。なお、機械学習部１２は、学習用サブセットｉ−ｊ（ｉ＝１〜３，ｊ＝１〜３）を適宜交換しながら、機械学習を繰り返し実行することにより、各学習用サブセットとの誤差が小さくなるように第１ＣＮＮｉ−ｊおよび第２ＣＮＮｉ−ｊを作成してもよい。 The machine learning unit 12 creates the first CNNi-j and the second CNNi-j by machine learning using the learning subset ij (i = 1 to 3, j = 1 to 3) (S6). The machine learning unit 12 stores the created CNN in the storage device 17. Note that the machine learning unit 12 repeatedly performs machine learning while appropriately replacing the learning subsets ij (i = 1 to 3, j = 1 to 3), thereby causing an error from each learning subset. The first CNNi-j and the second CNNi-j may be created to be smaller.

図４に示したフィルタ作成処理により、図３に示したような階層構図化されたＣＮＮが作成される。 By the filter creation process shown in FIG. 4, a hierarchically structured CNN as shown in FIG. 3 is created.

図５は、本発明の実施の形態に係るアップサンプル処理装置１が実行する入力データのアップサンプル処理の処理手順を示すフローチャートである。 FIG. 5 is a flowchart showing the processing procedure of the input data upsampling process executed by the upsampling apparatus 1 according to the embodiment of the present invention.

図５に示すように、入力データ取得部１０は、アップサンプル処理の対象となる第２サンプルレートの入力データを取得する（Ｓ１１）。 As shown in FIG. 5, the input data acquisition unit 10 acquires input data of the second sample rate that is the target of the upsampling process (S11).

縮小データ作成部１１は、入力データ取得部１０が取得した第２サンプルレートの入力データを縮小することにより第１サンプルレートの縮小データを作成する（Ｓ１２）。 The reduced data creation unit 11 creates reduced data of the first sample rate by reducing the input data of the second sample rate acquired by the input data acquisition unit 10 (S12).

第１階層フィルタ取得部１３は、図３に示した分類木の第１階層のフィルタを取得する（Ｓ１３）。つまり、第１階層フィルタ取得部１３は、第１ＣＮＮ１〜３と、第２ＣＮＮ１〜３とを、記憶装置１７から取得する。 The first hierarchy filter acquisition unit 13 acquires the first hierarchy filter of the classification tree shown in FIG. 3 (S13). That is, the first hierarchy filter acquisition unit 13 acquires the first CNN 1 to 3 and the second CNN 1 to 3 from the storage device 17.

候補データ作成部１４は、第２ＣＮＮ１〜３に、入力データをダウンサンプルした第１サンプルレートの縮小データを入力することにより、３枚の第２サンプルレートの候補データを作成する（Ｓ１４）。 The candidate data creation unit 14 creates three second sample rate candidate data by inputting the reduced data of the first sample rate obtained by down-sampling the input data to the second CNN 1 to 3 (S14).

アップサンプル処理部１６は、ステップＳ１４で作成された３枚の第２サンプルレートの候補データのそれぞれと、第２サンプルレートの入力データとの差分を算出する（Ｓ１５）。 The upsample processing unit 16 calculates a difference between each of the three pieces of candidate data of the second sample rate created in step S14 and the input data of the second sample rate (S15).

アップサンプル処理部１６は、ステップＳ１５で算出した差分が最小となる候補データを作成するに用いた第２ＣＮＮｍを特定する（Ｓ１６）。例えば、差分が最小となる候補データを作成するのに用いが第２ＣＮＮが第２ＣＮＮ１であると特定される。 The upsample processing unit 16 specifies the second CNNm used to create candidate data that minimizes the difference calculated in step S15 (S16). For example, the second CNN used to create candidate data with the smallest difference is specified as the second CNN1.

なお、第２ＣＮＮ１〜３への入力は、第１サンプルレートの縮小データに限定されるものではない。例えば、入力データから切り出した一部のデータを第２ＣＮＮ１〜３に入力してもよい。例えば、入力データが画像の場合には、等間隔に小領域を切り出すことにより該一部のデータを作成してもよいし、入力データの画素を等間隔に間引くことで該一部のデータを作成してもよい。また、入力データと同様の性質を有するであろうと想定される複数のデータの中から、一部のデータの一部分を切り出すことにより、該一部のデータを作成しても良い。例えば、入力データが動画データの一部である場合には、入力データと同じシーンの動画データの中から、一部のデータを取り出し、取り出した一部のデータから一部分を切り出しても良い。 The input to the second CNN 1 to 3 is not limited to the reduced data at the first sample rate. For example, a part of data cut out from the input data may be input to the second CNN 1 to 3. For example, when the input data is an image, the partial data may be created by cutting out small regions at equal intervals, or the partial data may be obtained by thinning out pixels of the input data at equal intervals. You may create it. Further, the partial data may be created by cutting out a part of the partial data from a plurality of data assumed to have the same properties as the input data. For example, when the input data is a part of the moving image data, a part of the data may be extracted from the moving image data of the same scene as the input data, and a part of the extracted data may be cut out.

また、該一部のデータに基づいて第２ＣＮＮ１〜３が作成する候補データと、該一部のデータとの差分から、第２ＣＮＮｍを上記と同様に特定してもよい。このような処理により、第２ＣＮＮｍを特定するために必要な時間を短縮することができる。 Further, the second CNNm may be specified in the same manner as described above from the difference between the candidate data created by the second CNN 1 to 3 based on the partial data and the partial data. By such processing, the time required to specify the second CNNm can be shortened.

第２階層フィルタ取得部１５は、ステップＳ１６で特定された第１階層の第２ＣＮＮｍの下位の階層である第２階層のフィルタを取得する（Ｓ１７）。つまり、第２階層フィルタ取得部１５は、図３に示した分類木における第１ＣＮＮｍ−ｊと、第２ＣＮＮｍ−ｊとを、記憶装置１７から読み出す（ｊ＝１〜３）。例えば、第２階層フィルタ取得部１５は、第１ＣＮＮ１−１〜１−３と、第２ＣＮＮ１−１〜１−３とを、記憶装置１７から読み出す。 The second hierarchy filter acquisition unit 15 acquires a filter of the second hierarchy, which is a hierarchy lower than the second CNNm of the first hierarchy specified in step S16 (S17). That is, the second hierarchy filter acquisition unit 15 reads the first CNNm-j and the second CNNm-j in the classification tree shown in FIG. 3 from the storage device 17 (j = 1 to 3). For example, the second hierarchy filter acquisition unit 15 reads the first CNN 1-1 to 1-3 and the second CNN 1-1 to 1-3 from the storage device 17.

候補データ作成部１４は、第２ＣＮＮｍ−ｊ（ｊ＝１〜３）に、入力データをダウンサンプルした第１サンプルレートの縮小データを入力することにより、３枚の第２サンプルレートの候補データを作成する（Ｓ１８）。 The candidate data creation unit 14 inputs the reduced data of the first sample rate obtained by down-sampling the input data to the second CNNm-j (j = 1 to 3), thereby obtaining the three second sample rate candidate data. Create (S18).

アップサンプル処理部１６は、ステップＳ１８で作成された３枚の第２サンプルレートの候補データのそれぞれと、第２サンプルレートの入力データとの差分を算出する（Ｓ１９）。 The upsample processing unit 16 calculates the difference between each of the three pieces of candidate data of the second sample rate created in step S18 and the input data of the second sample rate (S19).

アップサンプル処理部１６は、ステップＳ１９で算出した差分が最小となる候補データを作成するに用いた第２ＣＮＮｍ−ｎを特定する（Ｓ２０）。例えば、差分が最小となる候補データを作成するのに用いた第２ＣＮＮが第２ＣＮＮ１−２であると特定される。 The upsample processing unit 16 identifies the second CNNm-n used to create candidate data that minimizes the difference calculated in step S19 (S20). For example, the second CNN used to create the candidate data with the smallest difference is specified as the second CNN 1-2.

なお、第２ＣＮＮｍ−ｊ（ｊ＝１〜３）への入力は、第１サンプルレートの縮小データに限定されるものではない。例えば、入力データから切り出した一部のデータを第２ＣＮＮｍ−ｊ（ｊ＝１〜３）に入力してもよい。例えば、入力データが画像の場合には、等間隔に小領域を切り出すことにより該一部のデータを作成してもよいし、入力データの画素を等間隔に間引くことで該一部のデータを作成してもよい。また、入力データと同様の性質を有するであろうと想定される複数のデータの中から、一部のデータの一部分を切り出すことにより、該一部のデータを作成しても良い。例えば、入力データが動画データの一部である場合には、入力データと同じシーンの動画データの中から、一部のデータを取り出し、取り出した一部のデータから一部分を切り出しても良い。 Note that the input to the second CNNm-j (j = 1 to 3) is not limited to the reduced data of the first sample rate. For example, a part of data cut out from the input data may be input to the second CNNm-j (j = 1 to 3). For example, when the input data is an image, the partial data may be created by cutting out small regions at equal intervals, or the partial data may be obtained by thinning out pixels of the input data at equal intervals. You may create it. Further, the partial data may be created by cutting out a part of the partial data from a plurality of data assumed to have the same properties as the input data. For example, when the input data is a part of the moving image data, a part of the data may be extracted from the moving image data of the same scene as the input data, and a part of the extracted data may be cut out.

また、該一部のデータに基づいて第２ＣＮＮｍ−ｊ（ｊ＝１〜３）が作成する候補データと、該一部のデータとの差分から、第２ＣＮＮｍ−ｎを上記と同様に特定してもよい。このような処理により、第２ＣＮＮｍ−ｎを特定するために必要な時間を短縮することができる。 Further, the second CNNm-n is identified in the same manner as described above from the difference between the candidate data created by the second CNNm-j (j = 1 to 3) based on the partial data and the partial data. Also good. By such processing, the time required for specifying the second CNNm-n can be shortened.

アップサンプル処理部１６は、ステップＳ２０で特定された第２階層の第２ＣＮＮｍ−ｎに対応する第１ＣＮＮｍ−ｎを用いて、第２サンプルレートの入力データをアップサンプル処理することにより、第３サンプルレートの超解像度データを作成する（Ｓ２１）。上記の例では第１ＣＮＮ１−２を用いて、アップサンプル処理が行われる。 The up-sampling processing unit 16 performs the up-sampling process on the input data of the second sample rate using the first CNNm-n corresponding to the second CNNm-n of the second hierarchy specified in step S20, so that the third sample The rate super-resolution data is created (S21). In the above example, the upsampling process is performed using the first CNN 1-2.

以上説明したように、アップサンプル処理装置１の行う処理によると、第２サンプルレートの入力データをダウンサンプルした第１サンプルレートの縮小データをアップサンプル処理した時の誤差が最小となる第２ＣＮＮを、分類木を用いて探索することができる。第２ＣＮＮは、第１ＣＮＮと同様の性質を有し、かつ第２ＣＮＮが対象とする入力データのサンプルレートは第１ＣＮＮが対象とする入力データのサンプルレートよりも小さい。このため、第２ＣＮＮを効率的に探索することができる。また、探索された第２ＣＮＮに対応する第１ＣＮＮを用いて、入力データをアップサンプル処理することにより、誤差の少ない高精度なアップサンプルデータを作成することができる。 As described above, according to the processing performed by the upsample processing device 1, the second CNN that minimizes the error when the downsampled data of the first sample rate obtained by downsampling the input data of the second sample rate is minimized. The search can be made using the classification tree. The second CNN has the same properties as the first CNN, and the sample rate of the input data targeted by the second CNN is smaller than the sample rate of the input data targeted by the first CNN. For this reason, the second CNN can be searched efficiently. Further, by performing upsampling processing on input data using the first CNN corresponding to the searched second CNN, it is possible to create highly accurate upsampled data with few errors.

つまり、第１ＣＮＮと、該第１ＣＮＮとペアをなす第２ＣＮＮとは、サイズの異なる同一の学習用データを用いて機械学習が行われる。このため、第２ＣＮＮが精度よくアップサンプル処理することのできるデータは、該第２ＣＮＮとペアをなす第１ＣＮＮによっても、精度よくアップサンプル処理されることが期待できる。これにより、誤差が最小となる第２ＮＮとペアをなす第１ＣＮＮを選択し、選択した第１ＣＮＮを用いて入力データをアップサンプル処理することにより、入力データをアップサンプル処理した時の誤差が小さくなると期待できる。 That is, the first CNN and the second CNN paired with the first CNN are subjected to machine learning using the same learning data having different sizes. For this reason, it can be expected that data that can be accurately upsampled by the second CNN is also accurately upsampled by the first CNN paired with the second CNN. As a result, the first CNN paired with the second NN that minimizes the error is selected, and the input data is upsampled using the selected first CNN, thereby reducing the error when the input data is upsampled. I can expect.

また、第２ＣＮＮの構成を第１ＣＮＮの構成よりも小さくし、第２ＣＮＮを選択した後に、選択した第２ＣＮＮに対応する第１ＣＮＮを選択することとしているため、第１ＣＮＮの選択の処理速度を向上させることができる。ただし、第２ＣＮＮの構成は必ずしも第１ＣＮＮの構成よりも小さくなくてもよい。 In addition, since the configuration of the second CNN is made smaller than that of the first CNN and the second CNN is selected and then the first CNN corresponding to the selected second CNN is selected, the processing speed of the selection of the first CNN is improved. be able to. However, the configuration of the second CNN is not necessarily smaller than the configuration of the first CNN.

なお、学習用セットおよび学習用サブセットを、それぞれ、３つの学習用サブセットに分類したが、分類する学習用サブセットの個数は３つに限定されるものではなく、２つ以上であればよい。入力データの性質に応じて設計する場よい。 The learning set and the learning subset are each classified into three learning subsets, but the number of learning subsets to be classified is not limited to three, and may be two or more. Design according to the nature of the input data.

また、図３に示したように、学習用サブセットの階層は２階層としたが、１階層であってもよいし、３階層以上あってもよい。 Further, as shown in FIG. 3, the learning subset has two hierarchies, but may have one hierarchy or three or more hierarchies.

また、上述の実施の形態では、アップサンプル処理されたデータと学習用データとの差分に基づいて、学習用セットおよび学習用サブセットを分類したが、分類の仕方はこれに限定されるものではない。例えば、データの種類（画像の場合にはイラスト、風景写真、人物写真の別等）が既知であれば、それをさらに考慮して分類を行ってもよい。 In the above-described embodiment, the learning set and the learning subset are classified based on the difference between the upsampled data and the learning data. However, the classification method is not limited to this. . For example, if the type of data (in the case of an image, illustration, landscape photograph, portrait photograph, etc.) is known, the classification may be performed with further consideration thereof.

また、第１ＣＮＮおよび第２ＣＮＮは、データの全体領域を入力として受け、アップサンプル処理を行うものとして説明したが、各ＣＮＮは、データを分割した複数のブロック領域をそれぞれ入力として受け、各ブロック領域をアップサンプル処理した後にマージする構成であってもよい。また、各ブロック領域は、隣接するブロック領域とオーバーラップしていてもよいし、ブロック領域間でサイズが異なっていてもよい。このように、ＣＮＮを構成することにより、学習用データの全体領域を入力とせずとも、一部の領域だけを利用してＣＮＮの学習を行うことが可能となる。 Further, the first CNN and the second CNN have been described as receiving the entire data area as an input and performing the upsampling process. However, each CNN receives a plurality of block areas into which the data is divided as an input, and receives each block area. It may be configured to merge after upsampling. In addition, each block area may overlap with an adjacent block area, or the size may be different between the block areas. In this way, by configuring the CNN, it is possible to learn the CNN using only a part of the area without inputting the entire area of the learning data.

アップサンプル処理装置１の行う処理は、装置の機能でなく、装置を機能させるための方法あるいは装置を離れた方法であると考えることもできる。 The processing performed by the upsample processing apparatus 1 can be considered not a function of the apparatus but a method for causing the apparatus to function or a method for leaving the apparatus.

これらの方法をコンピュータにより実現するコンピュータプログラムとして実装することも可能である。ここでコンピュータプログラムは、所定の機能を達成するために、コンピュータに対する指令を示す命令コードが複数個組み合わされて構成されたものである。 It is also possible to implement these methods as a computer program realized by a computer. Here, the computer program is configured by combining a plurality of instruction codes indicating instructions for the computer in order to achieve a predetermined function.

マイクロプロセッサ、ＲＯＭ、ＲＡＭ、ハードディスクドライブ、ディスプレイユニット、キーボード、マウスなどから構成されるコンピュータシステムとして構成されてもよい。ＲＡＭまたはハードディスクドライブには、コンピュータプログラムが記憶されている。マイクロプロセッサが、コンピュータプログラムに従って動作することにより、アップサンプル処理装置１は、その機能を達成する。 You may comprise as a computer system comprised from a microprocessor, ROM, RAM, a hard-disk drive, a display unit, a keyboard, a mouse | mouth, etc. A computer program is stored in the RAM or hard disk drive. The upsampling processing apparatus 1 achieves its functions by the microprocessor operating according to the computer program.

さらに、上記の超解像度処理装置１を構成する構成要素の一部または全部は、アップサンプル処理装置１に脱着可能なＩＣカードまたは単体のモジュールから構成されているとしてもよい。ＩＣカードまたはモジュールは、マイクロプロセッサ、ＲＯＭ、ＲＡＭなどから構成されるコンピュータシステムである。ＩＣカードまたはモジュールは、ＬＳＩ（Large Scale Integration：大規模集積回路：複数の構成部を１個のチップ上に集積して製造された超多機能ＬＳＩを含む）を含むとしてもよい。マイクロプロセッサが、コンピュータプログラムに従って動作することにより、ＩＣカードまたはモジュールは、その機能を達成する。このＩＣカードまたはこのモジュールは、耐タンパ性を有するとしてもよい。 Furthermore, some or all of the constituent elements constituting the super-resolution processing apparatus 1 may be configured as an IC card that can be attached to and detached from the up-sampling processing apparatus 1 or a single module. The IC card or module is a computer system that includes a microprocessor, ROM, RAM, and the like. The IC card or the module may include an LSI (Large Scale Integration: a large-scale integrated circuit: an ultra-multifunctional LSI manufactured by integrating a plurality of components on one chip). The IC card or the module achieves its function by the microprocessor operating according to the computer program. This IC card or this module may have tamper resistance.

さらに、上記のアップサンプル処理装置１を構成する構成要素の一部または全部は、１個のシステムＬＳＩから構成されているとしてもよい。システムＬＳＩは、複数の構成部を１個のチップ上に集積して製造された超多機能ＬＳＩであり、具体的には、マイクロプロセッサ、ＲＯＭ、ＲＡＭなどを含んで構成されるコンピュータシステムである。ＲＡＭには、コンピュータプログラムが記憶されている。マイクロプロセッサが、コンピュータプログラムに従って動作することにより、システムＬＳＩは、その機能を達成する。 Further, some or all of the constituent elements constituting the upsampling processing apparatus 1 may be configured by one system LSI. The system LSI is an ultra-multifunctional LSI manufactured by integrating a plurality of components on a single chip, and specifically, a computer system including a microprocessor, ROM, RAM, and the like. . A computer program is stored in the RAM. The system LSI achieves its functions by the microprocessor operating according to the computer program.

（画像データの処理）
アップサンプル処理装置１が処理対象とするデータは、イラストや写真等の画像データとすることができる。データは縦方向及び横方向の２次元の空間に対するピクセル値（例えばＲＧＢの３つの値）である。このように、「データ」は１の数値（スカラー値）に限定されず、２以上の数値（ベクトル値）であってよい。 (Image data processing)
Data to be processed by the upsample processing apparatus 1 can be image data such as illustrations and photographs. The data is pixel values (for example, three values of RGB) for a two-dimensional space in the vertical and horizontal directions. Thus, “data” is not limited to a numerical value of 1 (scalar value), but may be a numerical value of 2 or more (vector value).

例えば、入力データが２Ｋサイズ（１９２０×１０８０ピクセル：第２サンプルレート）の画像であり、これにアップサンプル処理を施して４Ｋサイズ（３８４０×２１６０ピクセル：第３サンプルレート）の画像を作成することができる。縮小データは、ＱＨＤ（Quarter High Definition）サイズ（９６０×５４０ピクセル：第１サンプルレート）とすることができる。ただし、アップサンプル処理装置１が対象とする画像のサイズはこれらに限定されるものではない。 For example, input data is an image of 2K size (1920 × 1080 pixels: second sample rate), and an upsampling process is performed on the input data to create an image of 4K size (3840 × 2160 pixels: third sample rate). Can do. The reduced data may be QHD (Quarter High Definition) size (960 × 540 pixels: first sample rate). However, the size of the image targeted by the upsample processing apparatus 1 is not limited to these.

図２において、データのサイズおよびフィルタを説明したが、縮小データ、入力データ及び出力データを示す長方形を、上述の画像のサイズに合わせて示している。 Although the data size and filter have been described with reference to FIG. 2, rectangles representing reduced data, input data, and output data are shown in accordance with the above-described image size.

なお、入力画像のサイズは学習用セットに含まれる学習用画像と同じものとしているが、必ずしも両者のサイズは同じである必要はなく、学習用画像のサイズが入力画像のサイズと異なっていてもよい。 The size of the input image is the same as the learning image included in the learning set, but the size of both is not necessarily the same, and the size of the learning image may be different from the size of the input image. Good.

（画像以外のデータ）
本発明の適用対象は、画像データに限定されるものではない。例えば、音響データは、時間の１次元の空間に対する音圧値である。また、動画データは、縦方向、横方向、及び時間の３次元に対するピクセル値である。これらのデータは、いずれも、アップサンプル／ダウンサンプルの処理ができ、アップサンプル処理装置１の入力データとなり、本発明のアップサンプル方法を適用することができる。 (Data other than images)
The application target of the present invention is not limited to image data. For example, the acoustic data is a sound pressure value for a one-dimensional space of time. The moving image data is pixel values for three dimensions of the vertical direction, the horizontal direction, and time. Any of these data can be upsampled / downsampled and becomes input data of the upsample processing apparatus 1, and the upsampling method of the present invention can be applied.

同様に、ＭＲＩ（Magnetic Resonance Imaging）データやＣＴ（Computed Tomography）データなどの医療用のスキャンデータのような、測定データの分布図を処理対象とすることもできる。画像の場合と同様の処理が可能なためである。また、ベクトルの構造体や、ボクセル画像のような３次元構造体なども、処理対象とすることもできる。すなわち、アップサンプル／ダウンサンプルの処理ができるデータであれば、いずれのデータも、アップサンプル処理装置１の入力データとなり、本発明のアップサンプル方法を適用することができる。 Similarly, a distribution map of measurement data such as medical scan data such as MRI (Magnetic Resonance Imaging) data and CT (Computed Tomography) data can be processed. This is because the same processing as in the case of an image is possible. Also, a vector structure, a three-dimensional structure such as a voxel image, and the like can be processed. That is, any data that can be processed by upsampling / downsampling becomes input data of the upsampling processing apparatus 1, and the upsampling method of the present invention can be applied.

なお、２以上の次元の空間に対するデータについては、第１サンプルレートと第２サンプルレートとの比及び第２サンプルレートと第３サンプルレートとの比が、空間を定義する各々の次元に依存して相違してもよい。例えば、動画では縦方向及び横方向の２次元の空間についてのサンプルレート（画像の解像度）と他の１次元である時間についてのサンプルレート（いわゆる「フレームレート」）とで別個に処理することも考えられる。 For data in a space of two or more dimensions, the ratio between the first sample rate and the second sample rate and the ratio between the second sample rate and the third sample rate depend on each dimension that defines the space. May be different. For example, in a moving image, processing may be performed separately for a sample rate (resolution of an image) for a two-dimensional space in the vertical direction and the horizontal direction, and a sample rate for another one-dimensional time (a so-called “frame rate”). Conceivable.

（アップサンプル処理装置の活用）
図６は、アップサンプル処理装置を活用するコンピュータシステム及び機器示す図である。 (Utilization of upsample processing equipment)
FIG. 6 is a diagram illustrating a computer system and devices that utilize the upsample processing apparatus.

図６（Ａ）及び（Ｂ）はコンピュータシステムを示す。アップサンプル処理装置１は、コンピュータプログラムとして実装されているものとする。業務用コンピュータプログラム２１は、アップサンプル処理装置１に入力データを与え、アップサンプル処理装置１から出力データを受け取る。アップサンプル処理装置１及び業務用コンピュータプログラム２１を合わせて、コンピュータシステム２が構成される。 6A and 6B show a computer system. It is assumed that the upsample processing apparatus 1 is implemented as a computer program. The business computer program 21 provides input data to the upsample processing apparatus 1 and receives output data from the upsample processing apparatus 1. A computer system 2 is configured by combining the upsample processing apparatus 1 and the business computer program 21.

ここで、アップサンプル処理装置１及び業務用コンピュータプログラム２１は、（Ａ）のように、１のコンピュータに実装されていてもよいし、（Ｂ）のように、アップサンプル処理装置１をサーバに備え、業務用コンピュータプログラム２１との入力データ及び出力データの送受信をネットワークを介して行ってもよい。 Here, the up-sample processing apparatus 1 and the business computer program 21 may be implemented in one computer as shown in (A), or the up-sample processing apparatus 1 is used as a server as shown in (B). In addition, transmission / reception of input data and output data to / from the business computer program 21 may be performed via a network.

業務用コンピュータプログラム２１としては、静止画像又は動画像を処理するプログラム（例えばレンダリング、ＣＯＤＥＣ処理を行うプログラム）、音響を処理するプログラム（例えば録音を行うプログラム）、その他が考えられる。なお、「業務用」といえども、個人の趣味等の経済的活動も含むものとする（「業務」は「何らかの処理を行う行為全般」と解する。）。 The business computer program 21 may be a program for processing a still image or a moving image (for example, a program for performing rendering or CODEC processing), a program for processing sound (for example, a program for recording), or the like. It should be noted that “business” includes economic activities such as personal hobbies (“business” is understood as “a general act of performing some kind of processing”).

図６（Ｃ）及び（Ｄ）は機器を示す。アップサンプル処理装置１は、コンピュータプログラムとして実装されていても、ＩＣカード、ＬＳＩ、装置として別途実装されていてもよい。 6C and 6D show the device. The upsample processing apparatus 1 may be implemented as a computer program, or may be separately implemented as an IC card, LSI, or apparatus.

図６（Ｃ）では、機器３の内部にアップサンプル処理装置１が実装されている。例えば、コンピュータプログラムとしてのアップサンプル処理装置１を機器３が備えるものである。機器３の機能要素３１がアップサンプル処理装置１に入力データを与え、アップサンプル処理装置１から出力データを受け取る。 In FIG. 6C, the upsample processing device 1 is mounted inside the device 3. For example, the device 3 includes the upsampling processing apparatus 1 as a computer program. The functional element 31 of the device 3 gives input data to the upsample processing apparatus 1 and receives output data from the upsample processing apparatus 1.

図６（Ｄ）では、アップサンプル処理装置１をサーバに備え、機器３（その中の機能要素３１）との入力データ及び出力データの送受信をネットワークを介して行う。 In FIG. 6D, the upsample processing apparatus 1 is provided in a server, and input data and output data are transmitted to and received from the device 3 (the functional element 31 therein) via a network.

ここで、アップサンプル処理装置１を備えるサーバは、図６（Ｂ）と（Ｄ）とで共通であり得る。すなわち、１のサーバによって図６（Ｂ）と（Ｄ）との双方に対応することができる。 Here, the server provided with the upsampling processing apparatus 1 may be common in FIGS. 6B and 6D. That is, one server can cope with both of FIGS. 6B and 6D.

機器３（機能要素３１）としては、テレビジョン(モニタへの画像出力)、カメラ（撮影データの保存）、ＣＴ、ＭＲＩ、レントゲン、エコー、レーダーなどの測定装置（測定結果の表示）、スマートフォン（画面への画像出力、スピーカへの音響出力）、ゲーム機（画面への画像出力、スピーカへの音響出力）、オーディオ機器（アンプ又はスピーカへの音響出力）、その他が考えられる。 The device 3 (functional element 31) includes a television (image output to a monitor), a camera (storage of imaging data), a CT, an MRI, an X-ray, an echo, a radar and other measuring devices (display of measurement results), a smartphone ( Image output to a screen, sound output to a speaker), game machine (image output to a screen, sound output to a speaker), audio equipment (acoustic output to an amplifier or a speaker), and others can be considered.

本明細書において開示された実施の形態はすべての点で例示であって制限的なものではないと考えられるべきである。本発明の範囲は、上記した意味ではなく、特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The embodiments disclosed in this specification should be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the meanings described above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims.

本発明は、入力データのサンプルレートを高解像度化したデータを作成するアップサンプル処理装置等に用いると有益である。 The present invention is useful when used in an up-sample processing apparatus or the like that creates data in which the sampling rate of input data is increased in resolution.

１アップサンプル処理装置
１０入力データ取得部
１１縮小データ作成部
１２機械学習部
１３第１階層フィルタ取得部
１４候補データ作成部
１５第２階層フィルタ取得部
１６アップサンプル処理部
１７記憶装置
２コンピュータシステム
２１業務用コンピュータプログラム
３機器
３１機能要素 1 Upsample Processing Device 10 Input Data Acquisition Unit 11 Reduced Data Creation Unit 12 Machine Learning Unit 13 First Hierarchy Filter Acquisition Unit 14 Candidate Data Creation Unit 15 Second Hierarchy Filter Acquisition Unit 16 Upsample Processing Unit 17 Storage Device 2 Computer System 21 Commercial computer program 3 Equipment 31 Functional elements

Claims

For data expressing a value for a space of one dimension or two or more dimensions as a discrete value at a sampling point of the space,
An upsampling device for upsampling input data of a second sample rate to obtain output data of a third sample rate,
For each of the plurality of learning subsets in the first hierarchy, which is a subset of the learning set including the data of the third sample rate, the data of the second sample rate that has been machine-learned using the learning subset is the third sample. A first filter for upsampling to rate data;
A first layer filter acquisition unit for acquiring a second filter for up-sampling data of a first sample rate obtained by down-sampling data of the second sample rate to the second sample rate;
A candidate data creation unit that upsamples the data of the first sample rate and creates candidate data of the second sample rate by using the second filters obtained by the first hierarchy filter obtaining unit;
By up-sampling the input data using a first filter corresponding to a second filter that minimizes the difference between the candidate data created by the candidate data creation unit and the input data, a third sample rate An upsampling device comprising: an output upsampling unit that creates output data.

The upsampling apparatus according to claim 1, wherein the space is a three-dimensional space.

For each of a plurality of learning subsets in the second hierarchy, which is a subset of the learning subset in the first hierarchy used for machine learning of the second filter with the smallest difference, using the learning subset A second hierarchical filter acquisition unit that acquires the first and second machine-learned filters;
The candidate data creation unit uses the second filters of the second hierarchy acquired by the second hierarchy filter acquisition unit to upsample the data of the first sample rate to create the candidate data,
The output upsampling unit corresponds to a first filter corresponding to a second filter that minimizes a difference between the candidate data created by the candidate data creation unit using the second filter of the second hierarchy and the input data. The upsampling apparatus according to claim 1 or 2, wherein output data at a third sample rate is created by upsampling the input data using a filter.

A computer program for causing a computer to operate as the upsampling apparatus according to any one of claims 1 to 3.

It is provided with the upsampling device according to any one of claims 1 to 3 or the computer program according to claim 4 ,
A computer system , wherein the input data is transmitted to the upsampling apparatus or the computer program , and the output data is received and used.

It is provided with the upsampling device according to any one of claims 1 to 3 or the computer program according to claim 4 ,
A device for transmitting the input data to the upsampling device or the computer program ,
A device characterized by being a television device, a camera, a measuring device, a smartphone, a game device, an audio device, and other devices.

For data expressing a value for a space of one dimension or two or more dimensions as a discrete value at a sampling point of the space,
A method for causing an upsampling device or computer program to upsample input data at a second sample rate to obtain output data at a third sample rate, comprising:
For each of the plurality of learning subsets in the first hierarchy, which is a subset of the learning set including the data of the third sample rate, the data of the second sample rate that has been machine-learned using the learning subset is the third sample. Obtaining a first filter for upsampling to rate data;
Obtaining a second filter for up-sampling the data of the first sample rate down-sampled to the second sample rate to the second sample rate;
Using the second filters acquired by the first hierarchical filter acquisition unit to upsample the data of the first sample rate to create candidate data of the second sample rate;
Creating output data at a third sample rate by up-sampling the input data using a first filter corresponding to a second filter that minimizes the difference between the candidate data and the input data; An up-sampling method comprising:

For each of a plurality of learning subsets in the second hierarchy, which is a subset of the learning subset in the first hierarchy used for machine learning of the second filter with the smallest difference, using the learning subset Obtaining a machine-learned first filter and a second filter;
The step of creating the candidate data up-samples the data of the first sample rate using the second filters of the second hierarchy acquired in the step of acquiring the second filter, and the candidate data Create
Step includes: a difference between the candidates data and the input data that was created using the second filter of the second hierarchy in the step of preparing the candidate data is minimized to create an output data of said third sample rate The upsampling method according to claim 7, wherein output data at a third sample rate is created by upsampling the input data using a first filter corresponding to the second filter.