CN105791980B

CN105791980B - Films and television programs renovation method based on increase resolution

Info

Publication number: CN105791980B
Application number: CN201610109909.XA
Authority: CN
Inventors: 张宏志; 赵秋实; 左旺孟; 石坚; 张垒磊
Original assignee: Harbin Super-Resolution Fx Technology Co Ltd
Current assignee: Harbin Super-Resolution Fx Technology Co Ltd
Priority date: 2016-02-29
Filing date: 2016-02-29
Publication date: 2018-09-14
Anticipated expiration: 2036-02-29
Also published as: CN105791980A

Abstract

Films and television programs of the present invention for lower resolution ratio, lower clarity, it is proposed that a kind of films and television programs renovation method based on increase resolution, concrete scheme are：First, the resolution ratio and target resolution for obtaining original video, calculate scaling；Secondly, input video is divided into set of frames by certain partitioning scheme；Then, it is converted according to pre-stored mapping relations, obtains high-resolution video frame；Finally, high-resolution video frame is combined into high-resolution video wherein, pre-stored mapping relations are obtained based on Mixture of expert model learning, the process in a computer offline complete the method for the invention have many advantages, such as adaptively it is good, speed is fast, effect is good, expansible.

Description

Films and television programs renovation method based on increase resolution

Technical field

The invention belongs to computer visions and image processing field, are related to films and television programs renovation method, and in particular to a kind of Films and television programs renovation method based on increase resolution and system.

Background technology

With video acquisition, transmission, storage, the development of display technology, films and television programs constantly develop towards high-resolution.People Appreciate video taste it is also higher and higher, constantly pursue high-resolution, high-definition films and television programs.Meanwhile high-resolution It shows the appearance (such as 4K, 5K TV and display) of equipment, and makes it possible the universal of high-resolution films and television programs.

However on the other hand, many classical films and television programs of the remote past still have relatively low since technological means limits Resolution ratio, lower clarity and poor visual effect.Simultaneously because of the remote past, the film holding time is longer, then In addition various quality degradations, occurs in the destruction of the extraneous factors such as natural calamity, war, such as breakage, noise, is trembled at flicker It moves.On the one hand people want to review classical films and television programs on the one hand have new requirement again to the quality of film.In order to full simultaneously Sufficient people review classical films and television programs and pursue the demand of high-quality video, and films and television programs reconditioning technology comes into being.Video display are made It is application image/video processing technique that product, which renovate its essence, is handled original video, and various quality degradations are eliminated, to carry The visual effect of high original video.

Films and television programs are the important culture carriers of human society as books, some classical films and television programs, although the age It is remote, but its cultureal value can not replace.Therefore, the films and television programs of the remote past to these renovated, replay just Very important meaning is shown.Specifically, the meaning of films and television programs renovation includes the following aspects：

1. some classical films and television programs, such as documentary film, are precious historical summaries.These films and television programs are renovated, It can preferably preserve, propagate these historical summaries.

2. a pair classical films and television programs renovate, more moderns is allowed to appreciate, is the important of progress culture and arts succession Form.

3. a pair classical films and television programs renovate, the taking on a new look again of classical artistic work is allowed, be to artistical maximum Respect and souvenir.

The existing method for improving films and television programs visual effect is concentrated mainly in video source modeling means, such as removal noise, Removal is fuzzy, removes and interlocks, enhances contrast, enhancing color etc..These methods can play original video visual effect enhancing Effect, but increase resolution is not carried out to video, therefore do not meet people inherently and classical films and television programs are renovated Demand.

So-called increase resolution refers to by the video (or video frame) of low resolution, by certain method, quickly and effectively One high-resolution video of generation.Its difficult point is how to break through original low-resolution video pixel quantity limitation, filling Originally the pixel being not present should keep structure, the texture of former low-resolution video, more be closed naturally in human eye again Reason.

Traditional increase resolution method includes mainly the method based on interpolation, based on rebuilding and based on study.Based on slotting The method of value is by the way that existing pixel is carried out linear combination, the pixel as missing.Interpolation algorithm is simply rapid, but It is easy appearance " mosaic " effect or excessively smooth phenomenon；Based on the algorithm of reconstruction registration weight is carried out using the similitude of multiple image It builds, but this kind of algorithm is often simple combination multiple image, the effect is unsatisfactory；Algorithm based on study mainly utilizes one The training data of fixed number amount, according to special algorithm train to obtain low-resolution video to high-resolution video mapping relations, this Requirement of the class algorithm to model is relatively high, easy over-fitting or poor fitting, and operand is big, speed is slow, and practicability is not high.It can be with It says, above-mentioned video resolution Upgrade Problem annoyings always users.

Invention content

The technology of the present invention solves the problems, such as：The present invention provides a kind of films and television programs renovation promoted based on video resolution The films and television programs (usually less than 720P) of low resolution are passed through resolution enhancement technology, are converted to compared with high score by method and system Resolution video (such as 1080P, 4K) realizes films and television programs renovation.

Technical solution of the invention is：The present invention concrete scheme be：First, obtain original video resolution ratio and Target resolution calculates scaling；Secondly, input video is divided into picture frame by certain partitioning scheme；Then, according to Pre-stored mapping relations are converted, and high-resolution video frame is obtained；Finally, high-resolution video frame is combined into high score Resolution video.Wherein, pre-stored mapping relations model is obtained based on Mixture of expert model learning, which exists It is completed offline in computer.Specific steps packet is as follows：

Learn mapping relations model：

(1) training video is pre-processed

(1.1) high-resolution video is chosen as training sample, and is split as high-resolution video frame；

(1.2) the high-resolution video frame obtained by step (1.1) is used into Gauss nuclear convolution

(1.3) amplification factor is calculated according to original low-resolution video and target high-resolution video, according to times of gained Number carries out partiting row sampling, obtains corresponding low resolution video frame；

(1.4) by high-resolution video frame and sampling gained low resolution video frame divide respectively it is blocking, as training Data.

(2) it is based on Mixture of expert model and obtains mapping relations model

(2.1) a Mixture of expert model is initialized.Mixture of expert model includes expert and gate function two parts, structure To be tree-like, as shown in Fig. 2.Leafy node in figure in tree structure is known as expert, is responsible for carrying out mapping transformation to data；Root Node is known as gate function, is responsible for data and selects expert appropriate.The present invention uses linear function as expert's function：

Y=Wx

Wherein W is expert's function parameter, and x and y indicate low resolution video frame block and corresponding high-resolution video respectively Frame block.

Gate function is responsible for determining which expert of selection converts data, and in the present invention, i-th of gate function is expressed as：

Wherein, x and y indicates low resolution video frame block and corresponding high-resolution video frame block, v respectively_iIt indicates i-th Gate function parameter, v_jIndicate that j-th of gate function parameter, K are the number of expert in Mixture of expert model, i.e. leaf in tree structure The number of node.The initialization of Mixture of expert model specifically includes following steps：

(2.1.1) specifies the quantity K of expert；

(2.1.2) assumes the probability distribution Gaussian distributed of each expert：P (y | x, W_i)=N (y (x, W_i), σ), wherein W_iIndicate that the parameter of i-th of expert, σ are the standard deviation of Gaussian Profile.It is assumed that parameter W_iDistribution also Gaussian distributed：p(W_i) =N (0, μ), wherein μ indicate the mean value of Gaussian Profile.

(2.1.3) is clustered training data according to the quantity K of expert using k- mean algorithms, at the beginning of the parameter of each expert Initial value W_i ⁽⁰⁾It is appointed as slope in class, the initial value v of each gate function parameter_i ⁽⁰⁾It is appointed as cluster centre；

(2.1.4) calculates the initial value of each gate function：

Wherein x indicates low resolution video frame block, v_i ⁽⁰⁾Indicate that the initial value of i-th of gate function parameter, K are Mixture of expert The number of expert in model, i.e., the number of leaf node in tree structure.

(2.2) training data for using step (1.4) to obtain, is iterated optimization, until iteration to Mixture of expert model Process restrains, and finally obtained model parameter is mapping relations model.Mapping relations model includes gate function parameter and expert Parameter.

(2.2.1) specifies allowable error ε when iteration ends；

(2.2.2) calculates the posterior probability of each gate function in epicycle iteration：

Wherein k is iterative steps, p_i(y | x, W_i ^(k)) and p_j(y | x, W_j ^(k)) indicate expert probability distribution, g_i ^(k)(x, v_i ^(k)) indicate that the kth of i-th of gate function walks iterative value.

(2.2.3) updates each expert parameter：

Wherein k is iterative steps, and X is the vector of all low resolution video frame block x compositions in training data, and Y is instruction Practice the vector of all high-resolution video frame block y compositions in data, X^TIndicate that the transposition of X, I indicate unit matrix, H_i ^(k+1)It indicates The vector of the posterior probability composition of all low resolution video frame block x in+1 step of kth corresponding to i-th of expert.

(2.2.4) updates each gate function parameter：

WhereinIndicate i-th of gate function parameter in kth step iteration,I-th gate function in iteration is walked for kth Posterior probability, x^(t)Indicate t-th of low resolution video frame block.

(2.2.5) calculates the output of each gate function in epicycle iteration：

(2.2.6) calculates the likelihood probability in epicycle iteration：

Wherein, p_i(y | x, W_i ^(k+1)) indicate expert probability distribution, p (W_i ^(k+1)) indicate expert parameter probability distribution.

(2.2.7) judges whether iteration restrains.When epicycle iteration likelihood probability and last round of iteration likelihood probability it Absolute value of the difference be less than iteration ends when allowable error ε when, terminate iteration.Otherwise repeat step (2.2.2)~ (2.2.7)。

The gate function parameter v obtained at the end of iteration_i, together with expert's quantity K, expert parameter W_i, expert probability distribution The mean of a probability distribution μ of standard deviation sigma and expert parameter is stored in together as final mapping relations model in disk.

After having learnt mapping relations model and having stored, resolution ratio is carried out to video using the mapping relations model of storage and is carried It rises：

(3) pending low-resolution video is pre-processed

(3.1) low-resolution video is split as low resolution video frame；

(3.2) low resolution video frame obtained by step (3.1) is divided into low resolution video frame block；

(4) low-resolution video is promoted to high-resolution video according to the mapping relations model that step (2) obtains, wrapped It includes：

(4.1) the low resolution video frame block for obtaining step (3) is obtained as the input of gate function using step (2) Mapping relations model in gate function parameter calculate the output of each gate function：

Wherein, x is the low resolution video frame block of input.K is the number of expert in Mixture of expert model, v_iIndicate i-th A gate function parameter is obtained by step (2.2).

(4.2) it uses the parameter of expert's function corresponding to the maximum gate function of output valve to calculate corresponding high-resolution to regard Frequency frame block, the wherein parameter of expert's function are obtained by step (2)；

(4.2.1) calculating obtains the serial number of maximum output value gate function：I=arg max (g_i)

Wherein, g_iFor the output of i-th of gate function, obtained by step (4.1).

(4.2.2) calculates high-resolution video frame block using i-th of expert's function：Y=W_ix

Wherein, W_iFor the parameter of i-th of expert's function, y is the high-resolution corresponding to the low resolution video frame block x of input Rate video frame block.

(4.3) increase resolution is carried out according to the step of (4.1) and (4.2) to each low resolution video frame block, obtained To corresponding high-resolution video frame block, by all high-resolution video frame blocks according to its corresponding low resolution video frame block Position in low resolution video frame is spliced into corresponding high-resolution video frame；

(4.4) after obtaining the corresponding high-resolution video frame of all low resolution video frames, it is combined into high-resolution Video.

In the step (4), between video frame block, between video frame without dependence, therefore GPU processing can be used Device, it is parallel to accelerate to handle the step.

The films and television programs renovation method based on increase resolution, can be made in the form of computer software player With can also be integrated into hardware platform (such as set-top box, smart television) and use.

The films and television programs renovation method based on increase resolution can coordinate other screen Enhancement Methods as pre- Processing or post-processing means, can further promote visual effect.

The advantages of the present invention over the prior art are that：From visual effect, obtained by implementing the present invention program The high-resolution video details that arrives is complete, edge clear, Acacia crassicarpaA are good, and fast and stable.Specifically, the features of the present invention Including：

1. adaptive.The present invention program adaptive polo placement scaling multiple, is suitable for different increase resolution demands.

2. speed is fast.Since dependence being not present between sequence of frames of video, processing speed can be improved by parallel processing. In addition, the processing procedure of algorithm is that sequence of frames of video is carried out Linear Mapping transformation, mapping parameters used can pre-save In memory, processing speed can be further increased.

3. effect is good.Used mapping parameters are to be based on Mixture of expert model during the present invention carries out increase resolution What study obtained, break and has divided the drawbacks of being detached with submodel study in traditional increase resolution algorithm based on study.Together When, statistical robust sexual clorminance and the accurate sexual clorminance based on learning algorithm are combined, avoids and is based on learning algorithm in the past The drawbacks of mass data information cannot be utilized, while precision more higher than simple statistical method is also achieved, even for this The higher video of status resolution can also obtain good effect and faster processing speed.

4. expansible.Since dependence being not present between sequence of frames of video, the technological means such as application GPU acceleration can be passed through Realize that parallel processing improves processing speed.In addition, algorithm proposed by the invention may be directly applied to image resolution ratio promotion Field.

Description of the drawings

Fig. 1 is the flow chart of the films and television programs renovation method of the present invention based on increase resolution.

Fig. 2 is Mixture of expert model structure schematic diagram of the present invention.

Video frame is divided into video frame block schematic diagram by Fig. 3 to be of the present invention.

Specific implementation mode

The method of the invention is illustrated with an example in detailed description below.

One original resolution is by the films and television programs renovation method according to the present invention based on increase resolution The process that the video of 768*432 is promoted to 3072*1728 includes the following steps：

(1) training video is pre-processed

(1.1) a high-resolution films and television programs are selected, the video flowing of films and television programs is read in using Video processing software, it will Each frame in video flowing saves as video frame, and in the present embodiment, films and television programs length is 1200 seconds, and frame rate is 25 frames/second, Gained video frame sum is：1200*25=15000；

(1.2) the use of mean value is 0 to the video frame obtained by step (1.1), the Gauss nuclear convolution that standard deviation is 1；

(1.3) original low-resolution video resolution and target resolution are obtained, amplification factor is calculated according to the two.It is original Resolution ratio is 768*432, target resolution 3072*1728, and amplification factor is：3072/768=4.It accordingly, will be after convolution Video frame is down sampled to the 1/4 of original size, obtains corresponding low resolution video frame.

(1.4) each width low resolution video frame of gained by existing segmentation standard, being divided into size is 10 × 10 pixels are not overlapped fritter, as shown in figure 3, simultaneously therefrom choosing 1,000,000 piece is used as training data.

(2) it is based on Mixture of expert model and obtains mapping relations model

(2.1) Mixture of expert model is initialized

(2.1.1) specifies the quantity K of expert.In the present embodiment, K=100 is taken；

(2.1.2) specifies the parameter σ and μ of the probability distribution of expert and the probability distribution of expert parameter, in the present embodiment, take σ=0.32, μ=0.58；

(2.1.3) is using k- mean algorithms by training data according to the quantity K clusters of expert, the W of each expert_i ⁽⁰⁾Parameter It is initialized as slope in class, gate function parameter v_i ⁽⁰⁾It is initialized as cluster centre；

(2.1.4) calculates the initial value of each gate function according to the following formula：

(2.2) training data for using step (1.4) to obtain, the Mixture of expert model obtained to step (2.1) change Generation optimization：

(2.2.1) specifies allowable error ε when iteration ends.In the present embodiment, the permitted mistake of modulus type iteration ends Poor ε=0.005.

(2.2.3) updates each expert parameter：

(2.2.4) updates each gate function parameter：

(2.2.5) calculates the output of each gate function in epicycle iteration：

(2.2.6) calculates the likelihood probability in epicycle iteration：

The gate function parameter v obtained at the end of iteration_i, together with expert's quantity K, expert parameter W_i, expert probability distribution The mean of a probability distribution μ of standard deviation sigma and expert parameter is stored in together as final mapping relations model in disk.Its In, v_i, K, σ, μ be known as the gate function parameter of mapping relations model, W_iThe referred to as expert parameter of mapping relations model.

(3) pending low-resolution video is pre-processed

(3.1) pending low-resolution video is split as low resolution video frame, in the present embodiment, films and television programs are long Degree is 2000 seconds, and frame rate is 25 frames/second, and gained video frame sum is：2000*25=50000；

(3.2) low resolution video frame obtained by step (3.1) is divided into 10 × 10 video frame block, as shown in Figure 3；

(4) low-resolution video is mapped as high-resolution video, including：

(4.2) expert's function parameter corresponding to the maximum gate function of output valve is used to calculate corresponding high-resolution video Frame block；

(4.2.1) calculating obtains the serial number of maximum output value gate function：I=arg max (g_i).Wherein, g_iIt is i-th The output of function is obtained by step (4.1).

(4.2.2) calculates high-resolution video frame block using i-th of expert's function：

Y=W_ix

Wherein, W_iFor i-th of expert's function parameter, obtained by step (2).X is the low resolution video frame block of input, Its size is that 10 × 10, y is high-resolution video frame block after increase resolution, and size is 40 × 40.

Claims

1. a kind of films and television programs renovation method based on increase resolution, it is characterised in that：Including study mapping relations model and Increase resolution two parts are carried out according to mapping relations model；

Wherein, study mapping relations model includes following two steps：

(1) training video is pre-processed, including：

(1.2) the high-resolution video frame obtained by step (1.1) is used into Gauss nuclear convolution；

(1.3) amplification factor is calculated according to original low-resolution video and target high-resolution video, according to the multiple of gained into Row partiting row sampling obtains corresponding low resolution video frame；

(1.4) by high-resolution video frame and sampling gained low resolution video frame divide respectively it is blocking, as training data；

(2) it is based on Mixture of expert model and obtains mapping relations model, including：

(2.1) a Mixture of expert model is initialized, is included the following steps：

1. the quantity K of specified expert；

2. assuming the probability distribution Gaussian distributed of each expert：P (y | x, W_i)=N (y (x, W_i), σ), wherein x indicates low point Resolution video frame block, y indicate high-resolution video frame block, W_iIndicate that the parameter of i-th of expert, σ are the standard deviation of Gaussian Profile； It is assumed that parameter W_iDistribution also Gaussian distributed：p(W_i)=N (0, μ), wherein μ indicate the mean value of Gaussian Profile；

3. using k- mean algorithms by training data according to the quantity K clusters of expert, the initial value W of the parameter of each expert_i ⁽⁰⁾Refer to It is set to slope in class, the initial value v of each gate function parameter_i ⁽⁰⁾It is appointed as cluster centre；

4. calculating the initial value of each gate function：

Wherein v_i ⁽⁰⁾Indicate that the initial value of i-th of gate function parameter, K are the number of expert in Mixture of expert model, i.e. tree structure The number of middle leaf node；

(2.2) training data for using step (1.4) to obtain, is iterated optimization, until iterative process to Mixture of expert model Convergence, finally obtained model parameter is mapping relations model；Mapping relations model includes gate function parameter and expert parameter； It is described optimization is iterated to model to include the following steps：

1. allowable error ε when specified iteration ends；

2. calculating the posterior probability of each gate function in epicycle iteration：

Wherein k is iterative steps, p_i(y | x, W_i ^(k)) and p_j(y | x, W_j ^(k)) indicate expert probability distribution, g_i ^(k)(x, v_i ^(k)) table Show the kth step iterative value of i-th of gate function；

3. updating each expert parameter：

Wherein k is iterative steps, and X is the vector of all low resolution video frame block x compositions in training data, and Y is training number The vector of all high-resolution video frame block y compositions, X in^TIndicate that the transposition of X, I indicate unit matrix, H_i ^(k+1)Expression kth+ The vector of the posterior probability composition of all low resolution video frame block x in 1 step corresponding to i-th of expert；

4. updating each gate function parameter：

WhereinIndicate i-th of gate function parameter in kth step iteration,The posteriority that i-th of gate function in iteration is walked for kth is general Rate, x^(t)Indicate t-th of low resolution video frame block；

5. calculating the output of each gate function in epicycle iteration：

6. calculating the likelihood probability in epicycle iteration：

Wherein, p_i(y | x, W_i ^(k+1)) indicate expert probability distribution, p (W_i ^(k+1)) indicate expert parameter probability distribution；

7. judging whether iteration restrains；When the absolute value of the difference of the likelihood probability of the likelihood probability and last round of iteration of epicycle iteration When allowable error ε when less than iteration ends, terminate iteration；Otherwise repeat step 2.~7.；

The gate function parameter v obtained at the end of iteration_i, together with expert's quantity K, expert parameter W_i, expert probability distribution standard The mean of a probability distribution μ of poor σ and expert parameter are stored in together as final mapping relations model in disk；

It includes following two steps to carry out increase resolution according to mapping relations model：

(3) pending low-resolution video is pre-processed, including：

(3.1) pending low-resolution video is split as low resolution video frame；

(3.2) low resolution video frame is divided blocking；

(4) low-resolution video is promoted to high-resolution video according to the mapping relations model that step (2) obtains, including：

(4.1) the low resolution video frame block for obtaining step (3) uses step as the input of Mixture of expert model gate function (2) the gate function parameter in the mapping relations model obtained calculates the output of each gate function；

(4.2) parameter of expert's function corresponding to the maximum gate function of output valve is used to calculate corresponding high-resolution video frame Block, the wherein parameter of expert's function are obtained by step (2)；

(4.3) increase resolution is carried out according to the step of (4.1) and (4.2) to each low resolution video frame block, obtained pair The high-resolution video frame block answered, by all high-resolution video frame blocks according to its corresponding low resolution video frame block low Position in resolution video frame is spliced into corresponding high-resolution video frame；

2. the films and television programs renovation method according to claim 1 based on increase resolution, it is characterised in that step (2.1) The Mixture of expert model includes expert and gate function two parts；

Expert is responsible for carrying out mapping transformation to data, and the mapping transformation in the present invention uses linear function as expert's function：

Y=Wx

Wherein W is expert parameter, and x and y indicate low resolution video frame block and corresponding high-resolution video frame block respectively；

Gate function is responsible for determining which expert of selection converts data, and i-th of gate function is expressed as in the present invention：

Wherein, v_iIndicate i-th of gate function parameter, v_jIndicate that j-th of gate function parameter, K are of expert in Mixture of expert model Number.

3. the films and television programs renovation method according to claim 1 based on increase resolution, it is characterised in that：Step (4.2) parameter using expert's function corresponding to the maximum gate function of output valve calculates corresponding high-resolution video frame Block includes the following steps：

1. calculating obtains the serial number of maximum output value gate function：I=arg max (g_i)

Wherein, g_iFor the output of i-th of gate function, obtained by step (4.1)；

2. calculating high-resolution video frame block using i-th of expert's function：Y=W_ix

Wherein, W_iFor the parameter of i-th of expert's function, y is that the high-resolution corresponding to the low resolution video frame block x of input regards Frequency frame block.