CN105791980B - Films and television programs renovation method based on increase resolution - Google Patents

Films and television programs renovation method based on increase resolution Download PDF

Info

Publication number
CN105791980B
CN105791980B CN201610109909.XA CN201610109909A CN105791980B CN 105791980 B CN105791980 B CN 105791980B CN 201610109909 A CN201610109909 A CN 201610109909A CN 105791980 B CN105791980 B CN 105791980B
Authority
CN
China
Prior art keywords
expert
resolution video
video frame
parameter
gate function
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610109909.XA
Other languages
Chinese (zh)
Other versions
CN105791980A (en
Inventor
张宏志
赵秋实
左旺孟
石坚
张垒磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Super-Resolution Fx Technology Co Ltd
Original Assignee
Harbin Super-Resolution Fx Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Super-Resolution Fx Technology Co Ltd filed Critical Harbin Super-Resolution Fx Technology Co Ltd
Priority to CN201610109909.XA priority Critical patent/CN105791980B/en
Publication of CN105791980A publication Critical patent/CN105791980A/en
Application granted granted Critical
Publication of CN105791980B publication Critical patent/CN105791980B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440263Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440236Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Image Processing (AREA)
  • Television Systems (AREA)

Abstract

Films and television programs of the present invention for lower resolution ratio, lower clarity, it is proposed that a kind of films and television programs renovation method based on increase resolution, concrete scheme are:First, the resolution ratio and target resolution for obtaining original video, calculate scaling;Secondly, input video is divided into set of frames by certain partitioning scheme;Then, it is converted according to pre-stored mapping relations, obtains high-resolution video frame;Finally, high-resolution video frame is combined into high-resolution video wherein, pre-stored mapping relations are obtained based on Mixture of expert model learning, the process in a computer offline complete the method for the invention have many advantages, such as adaptively it is good, speed is fast, effect is good, expansible.

Description

Films and television programs renovation method based on increase resolution
Technical field
The invention belongs to computer visions and image processing field, are related to films and television programs renovation method, and in particular to a kind of Films and television programs renovation method based on increase resolution and system.
Background technology
With video acquisition, transmission, storage, the development of display technology, films and television programs constantly develop towards high-resolution.People Appreciate video taste it is also higher and higher, constantly pursue high-resolution, high-definition films and television programs.Meanwhile high-resolution It shows the appearance (such as 4K, 5K TV and display) of equipment, and makes it possible the universal of high-resolution films and television programs.
However on the other hand, many classical films and television programs of the remote past still have relatively low since technological means limits Resolution ratio, lower clarity and poor visual effect.Simultaneously because of the remote past, the film holding time is longer, then In addition various quality degradations, occurs in the destruction of the extraneous factors such as natural calamity, war, such as breakage, noise, is trembled at flicker It moves.On the one hand people want to review classical films and television programs on the one hand have new requirement again to the quality of film.In order to full simultaneously Sufficient people review classical films and television programs and pursue the demand of high-quality video, and films and television programs reconditioning technology comes into being.Video display are made It is application image/video processing technique that product, which renovate its essence, is handled original video, and various quality degradations are eliminated, to carry The visual effect of high original video.
Films and television programs are the important culture carriers of human society as books, some classical films and television programs, although the age It is remote, but its cultureal value can not replace.Therefore, the films and television programs of the remote past to these renovated, replay just Very important meaning is shown.Specifically, the meaning of films and television programs renovation includes the following aspects:
1. some classical films and television programs, such as documentary film, are precious historical summaries.These films and television programs are renovated, It can preferably preserve, propagate these historical summaries.
2. a pair classical films and television programs renovate, more moderns is allowed to appreciate, is the important of progress culture and arts succession Form.
3. a pair classical films and television programs renovate, the taking on a new look again of classical artistic work is allowed, be to artistical maximum Respect and souvenir.
The existing method for improving films and television programs visual effect is concentrated mainly in video source modeling means, such as removal noise, Removal is fuzzy, removes and interlocks, enhances contrast, enhancing color etc..These methods can play original video visual effect enhancing Effect, but increase resolution is not carried out to video, therefore do not meet people inherently and classical films and television programs are renovated Demand.
So-called increase resolution refers to by the video (or video frame) of low resolution, by certain method, quickly and effectively One high-resolution video of generation.Its difficult point is how to break through original low-resolution video pixel quantity limitation, filling Originally the pixel being not present should keep structure, the texture of former low-resolution video, more be closed naturally in human eye again Reason.
Traditional increase resolution method includes mainly the method based on interpolation, based on rebuilding and based on study.Based on slotting The method of value is by the way that existing pixel is carried out linear combination, the pixel as missing.Interpolation algorithm is simply rapid, but It is easy appearance " mosaic " effect or excessively smooth phenomenon;Based on the algorithm of reconstruction registration weight is carried out using the similitude of multiple image It builds, but this kind of algorithm is often simple combination multiple image, the effect is unsatisfactory;Algorithm based on study mainly utilizes one The training data of fixed number amount, according to special algorithm train to obtain low-resolution video to high-resolution video mapping relations, this Requirement of the class algorithm to model is relatively high, easy over-fitting or poor fitting, and operand is big, speed is slow, and practicability is not high.It can be with It says, above-mentioned video resolution Upgrade Problem annoyings always users.
Invention content
The technology of the present invention solves the problems, such as:The present invention provides a kind of films and television programs renovation promoted based on video resolution The films and television programs (usually less than 720P) of low resolution are passed through resolution enhancement technology, are converted to compared with high score by method and system Resolution video (such as 1080P, 4K) realizes films and television programs renovation.
Technical solution of the invention is:The present invention concrete scheme be:First, obtain original video resolution ratio and Target resolution calculates scaling;Secondly, input video is divided into picture frame by certain partitioning scheme;Then, according to Pre-stored mapping relations are converted, and high-resolution video frame is obtained;Finally, high-resolution video frame is combined into high score Resolution video.Wherein, pre-stored mapping relations model is obtained based on Mixture of expert model learning, which exists It is completed offline in computer.Specific steps packet is as follows:
Learn mapping relations model:
(1) training video is pre-processed
(1.1) high-resolution video is chosen as training sample, and is split as high-resolution video frame;
(1.2) the high-resolution video frame obtained by step (1.1) is used into Gauss nuclear convolution
(1.3) amplification factor is calculated according to original low-resolution video and target high-resolution video, according to times of gained Number carries out partiting row sampling, obtains corresponding low resolution video frame;
(1.4) by high-resolution video frame and sampling gained low resolution video frame divide respectively it is blocking, as training Data.
(2) it is based on Mixture of expert model and obtains mapping relations model
(2.1) a Mixture of expert model is initialized.Mixture of expert model includes expert and gate function two parts, structure To be tree-like, as shown in Fig. 2.Leafy node in figure in tree structure is known as expert, is responsible for carrying out mapping transformation to data;Root Node is known as gate function, is responsible for data and selects expert appropriate.The present invention uses linear function as expert's function:
Y=Wx
Wherein W is expert's function parameter, and x and y indicate low resolution video frame block and corresponding high-resolution video respectively Frame block.
Gate function is responsible for determining which expert of selection converts data, and in the present invention, i-th of gate function is expressed as:
Wherein, x and y indicates low resolution video frame block and corresponding high-resolution video frame block, v respectivelyiIt indicates i-th Gate function parameter, vjIndicate that j-th of gate function parameter, K are the number of expert in Mixture of expert model, i.e. leaf in tree structure The number of node.The initialization of Mixture of expert model specifically includes following steps:
(2.1.1) specifies the quantity K of expert;
(2.1.2) assumes the probability distribution Gaussian distributed of each expert:P (y | x, Wi)=N (y (x, Wi), σ), wherein WiIndicate that the parameter of i-th of expert, σ are the standard deviation of Gaussian Profile.It is assumed that parameter WiDistribution also Gaussian distributed:p(Wi) =N (0, μ), wherein μ indicate the mean value of Gaussian Profile.
(2.1.3) is clustered training data according to the quantity K of expert using k- mean algorithms, at the beginning of the parameter of each expert Initial value Wi (0)It is appointed as slope in class, the initial value v of each gate function parameteri (0)It is appointed as cluster centre;
(2.1.4) calculates the initial value of each gate function:
Wherein x indicates low resolution video frame block, vi (0)Indicate that the initial value of i-th of gate function parameter, K are Mixture of expert The number of expert in model, i.e., the number of leaf node in tree structure.
(2.2) training data for using step (1.4) to obtain, is iterated optimization, until iteration to Mixture of expert model Process restrains, and finally obtained model parameter is mapping relations model.Mapping relations model includes gate function parameter and expert Parameter.
(2.2.1) specifies allowable error ε when iteration ends;
(2.2.2) calculates the posterior probability of each gate function in epicycle iteration:
Wherein k is iterative steps, pi(y | x, Wi (k)) and pj(y | x, Wj (k)) indicate expert probability distribution, gi (k)(x, vi (k)) indicate that the kth of i-th of gate function walks iterative value.
(2.2.3) updates each expert parameter:
Wherein k is iterative steps, and X is the vector of all low resolution video frame block x compositions in training data, and Y is instruction Practice the vector of all high-resolution video frame block y compositions in data, XTIndicate that the transposition of X, I indicate unit matrix, Hi (k+1)It indicates The vector of the posterior probability composition of all low resolution video frame block x in+1 step of kth corresponding to i-th of expert.
(2.2.4) updates each gate function parameter:
WhereinIndicate i-th of gate function parameter in kth step iteration,I-th gate function in iteration is walked for kth Posterior probability, x(t)Indicate t-th of low resolution video frame block.
(2.2.5) calculates the output of each gate function in epicycle iteration:
(2.2.6) calculates the likelihood probability in epicycle iteration:
Wherein, pi(y | x, Wi (k+1)) indicate expert probability distribution, p (Wi (k+1)) indicate expert parameter probability distribution.
(2.2.7) judges whether iteration restrains.When epicycle iteration likelihood probability and last round of iteration likelihood probability it Absolute value of the difference be less than iteration ends when allowable error ε when, terminate iteration.Otherwise repeat step (2.2.2)~ (2.2.7)。
The gate function parameter v obtained at the end of iterationi, together with expert's quantity K, expert parameter Wi, expert probability distribution The mean of a probability distribution μ of standard deviation sigma and expert parameter is stored in together as final mapping relations model in disk.
After having learnt mapping relations model and having stored, resolution ratio is carried out to video using the mapping relations model of storage and is carried It rises:
(3) pending low-resolution video is pre-processed
(3.1) low-resolution video is split as low resolution video frame;
(3.2) low resolution video frame obtained by step (3.1) is divided into low resolution video frame block;
(4) low-resolution video is promoted to high-resolution video according to the mapping relations model that step (2) obtains, wrapped It includes:
(4.1) the low resolution video frame block for obtaining step (3) is obtained as the input of gate function using step (2) Mapping relations model in gate function parameter calculate the output of each gate function:
Wherein, x is the low resolution video frame block of input.K is the number of expert in Mixture of expert model, viIndicate i-th A gate function parameter is obtained by step (2.2).
(4.2) it uses the parameter of expert's function corresponding to the maximum gate function of output valve to calculate corresponding high-resolution to regard Frequency frame block, the wherein parameter of expert's function are obtained by step (2);
(4.2.1) calculating obtains the serial number of maximum output value gate function:I=arg max (gi)
Wherein, giFor the output of i-th of gate function, obtained by step (4.1).
(4.2.2) calculates high-resolution video frame block using i-th of expert's function:Y=Wix
Wherein, WiFor the parameter of i-th of expert's function, y is the high-resolution corresponding to the low resolution video frame block x of input Rate video frame block.
(4.3) increase resolution is carried out according to the step of (4.1) and (4.2) to each low resolution video frame block, obtained To corresponding high-resolution video frame block, by all high-resolution video frame blocks according to its corresponding low resolution video frame block Position in low resolution video frame is spliced into corresponding high-resolution video frame;
(4.4) after obtaining the corresponding high-resolution video frame of all low resolution video frames, it is combined into high-resolution Video.
In the step (4), between video frame block, between video frame without dependence, therefore GPU processing can be used Device, it is parallel to accelerate to handle the step.
The films and television programs renovation method based on increase resolution, can be made in the form of computer software player With can also be integrated into hardware platform (such as set-top box, smart television) and use.
The films and television programs renovation method based on increase resolution can coordinate other screen Enhancement Methods as pre- Processing or post-processing means, can further promote visual effect.
The advantages of the present invention over the prior art are that:From visual effect, obtained by implementing the present invention program The high-resolution video details that arrives is complete, edge clear, Acacia crassicarpaA are good, and fast and stable.Specifically, the features of the present invention Including:
1. adaptive.The present invention program adaptive polo placement scaling multiple, is suitable for different increase resolution demands.
2. speed is fast.Since dependence being not present between sequence of frames of video, processing speed can be improved by parallel processing. In addition, the processing procedure of algorithm is that sequence of frames of video is carried out Linear Mapping transformation, mapping parameters used can pre-save In memory, processing speed can be further increased.
3. effect is good.Used mapping parameters are to be based on Mixture of expert model during the present invention carries out increase resolution What study obtained, break and has divided the drawbacks of being detached with submodel study in traditional increase resolution algorithm based on study.Together When, statistical robust sexual clorminance and the accurate sexual clorminance based on learning algorithm are combined, avoids and is based on learning algorithm in the past The drawbacks of mass data information cannot be utilized, while precision more higher than simple statistical method is also achieved, even for this The higher video of status resolution can also obtain good effect and faster processing speed.
4. expansible.Since dependence being not present between sequence of frames of video, the technological means such as application GPU acceleration can be passed through Realize that parallel processing improves processing speed.In addition, algorithm proposed by the invention may be directly applied to image resolution ratio promotion Field.
Description of the drawings
Fig. 1 is the flow chart of the films and television programs renovation method of the present invention based on increase resolution.
Fig. 2 is Mixture of expert model structure schematic diagram of the present invention.
Video frame is divided into video frame block schematic diagram by Fig. 3 to be of the present invention.
Specific implementation mode
The method of the invention is illustrated with an example in detailed description below.
One original resolution is by the films and television programs renovation method according to the present invention based on increase resolution The process that the video of 768*432 is promoted to 3072*1728 includes the following steps:
(1) training video is pre-processed
(1.1) a high-resolution films and television programs are selected, the video flowing of films and television programs is read in using Video processing software, it will Each frame in video flowing saves as video frame, and in the present embodiment, films and television programs length is 1200 seconds, and frame rate is 25 frames/second, Gained video frame sum is:1200*25=15000;
(1.2) the use of mean value is 0 to the video frame obtained by step (1.1), the Gauss nuclear convolution that standard deviation is 1;
(1.3) original low-resolution video resolution and target resolution are obtained, amplification factor is calculated according to the two.It is original Resolution ratio is 768*432, target resolution 3072*1728, and amplification factor is:3072/768=4.It accordingly, will be after convolution Video frame is down sampled to the 1/4 of original size, obtains corresponding low resolution video frame.
(1.4) each width low resolution video frame of gained by existing segmentation standard, being divided into size is 10 × 10 pixels are not overlapped fritter, as shown in figure 3, simultaneously therefrom choosing 1,000,000 piece is used as training data.
(2) it is based on Mixture of expert model and obtains mapping relations model
(2.1) Mixture of expert model is initialized
(2.1.1) specifies the quantity K of expert.In the present embodiment, K=100 is taken;
(2.1.2) specifies the parameter σ and μ of the probability distribution of expert and the probability distribution of expert parameter, in the present embodiment, take σ=0.32, μ=0.58;
(2.1.3) is using k- mean algorithms by training data according to the quantity K clusters of expert, the W of each experti (0)Parameter It is initialized as slope in class, gate function parameter vi (0)It is initialized as cluster centre;
(2.1.4) calculates the initial value of each gate function according to the following formula:
Wherein x indicates low resolution video frame block, vi (0)Indicate that the initial value of i-th of gate function parameter, K are Mixture of expert The number of expert in model, i.e., the number of leaf node in tree structure.
(2.2) training data for using step (1.4) to obtain, the Mixture of expert model obtained to step (2.1) change Generation optimization:
(2.2.1) specifies allowable error ε when iteration ends.In the present embodiment, the permitted mistake of modulus type iteration ends Poor ε=0.005.
(2.2.2) calculates the posterior probability of each gate function in epicycle iteration:
Wherein k is iterative steps, pi(y | x, Wi (k)) and pj(y | x, Wj (k)) indicate expert probability distribution, gi (k)(x, vi (k)) indicate that the kth of i-th of gate function walks iterative value.
(2.2.3) updates each expert parameter:
Wherein k is iterative steps, and X is the vector of all low resolution video frame block x compositions in training data, and Y is instruction Practice the vector of all high-resolution video frame block y compositions in data, XTIndicate that the transposition of X, I indicate unit matrix, Hi (k+1)It indicates The vector of the posterior probability composition of all low resolution video frame block x in+1 step of kth corresponding to i-th of expert.
(2.2.4) updates each gate function parameter:
WhereinIndicate i-th of gate function parameter in kth step iteration,I-th gate function in iteration is walked for kth Posterior probability, x(t)Indicate t-th of low resolution video frame block.
(2.2.5) calculates the output of each gate function in epicycle iteration:
(2.2.6) calculates the likelihood probability in epicycle iteration:
Wherein, pi(y | x, Wi (k+1)) indicate expert probability distribution, p (Wi (k+1)) indicate expert parameter probability distribution.
(2.2.7) judges whether iteration restrains.When epicycle iteration likelihood probability and last round of iteration likelihood probability it Absolute value of the difference be less than iteration ends when allowable error ε when, terminate iteration.Otherwise repeat step (2.2.2)~ (2.2.7)。
The gate function parameter v obtained at the end of iterationi, together with expert's quantity K, expert parameter Wi, expert probability distribution The mean of a probability distribution μ of standard deviation sigma and expert parameter is stored in together as final mapping relations model in disk.Its In, vi, K, σ, μ be known as the gate function parameter of mapping relations model, WiThe referred to as expert parameter of mapping relations model.
(3) pending low-resolution video is pre-processed
(3.1) pending low-resolution video is split as low resolution video frame, in the present embodiment, films and television programs are long Degree is 2000 seconds, and frame rate is 25 frames/second, and gained video frame sum is:2000*25=50000;
(3.2) low resolution video frame obtained by step (3.1) is divided into 10 × 10 video frame block, as shown in Figure 3;
(4) low-resolution video is mapped as high-resolution video, including:
(4.1) the low resolution video frame block for obtaining step (3) is obtained as the input of gate function using step (2) Mapping relations model in gate function parameter calculate the output of each gate function:
Wherein, x is the low resolution video frame block of input.K is the number of expert in Mixture of expert model, viIndicate i-th A gate function parameter is obtained by step (2.2).
(4.2) expert's function parameter corresponding to the maximum gate function of output valve is used to calculate corresponding high-resolution video Frame block;
(4.2.1) calculating obtains the serial number of maximum output value gate function:I=arg max (gi).Wherein, giIt is i-th The output of function is obtained by step (4.1).
(4.2.2) calculates high-resolution video frame block using i-th of expert's function:
Y=Wix
Wherein, WiFor i-th of expert's function parameter, obtained by step (2).X is the low resolution video frame block of input, Its size is that 10 × 10, y is high-resolution video frame block after increase resolution, and size is 40 × 40.
(4.3) increase resolution is carried out according to the step of (4.1) and (4.2) to each low resolution video frame block, obtained To corresponding high-resolution video frame block, by all high-resolution video frame blocks according to its corresponding low resolution video frame block Position in low resolution video frame is spliced into corresponding high-resolution video frame;
(4.4) after obtaining the corresponding high-resolution video frame of all low resolution video frames, it is combined into high-resolution Video.

Claims (3)

1. a kind of films and television programs renovation method based on increase resolution, it is characterised in that:Including study mapping relations model and Increase resolution two parts are carried out according to mapping relations model;
Wherein, study mapping relations model includes following two steps:
(1) training video is pre-processed, including:
(1.1) high-resolution video is chosen as training sample, and is split as high-resolution video frame;
(1.2) the high-resolution video frame obtained by step (1.1) is used into Gauss nuclear convolution;
(1.3) amplification factor is calculated according to original low-resolution video and target high-resolution video, according to the multiple of gained into Row partiting row sampling obtains corresponding low resolution video frame;
(1.4) by high-resolution video frame and sampling gained low resolution video frame divide respectively it is blocking, as training data;
(2) it is based on Mixture of expert model and obtains mapping relations model, including:
(2.1) a Mixture of expert model is initialized, is included the following steps:
1. the quantity K of specified expert;
2. assuming the probability distribution Gaussian distributed of each expert:P (y | x, Wi)=N (y (x, Wi), σ), wherein x indicates low point Resolution video frame block, y indicate high-resolution video frame block, WiIndicate that the parameter of i-th of expert, σ are the standard deviation of Gaussian Profile; It is assumed that parameter WiDistribution also Gaussian distributed:p(Wi)=N (0, μ), wherein μ indicate the mean value of Gaussian Profile;
3. using k- mean algorithms by training data according to the quantity K clusters of expert, the initial value W of the parameter of each experti (0)Refer to It is set to slope in class, the initial value v of each gate function parameteri (0)It is appointed as cluster centre;
4. calculating the initial value of each gate function:
Wherein vi (0)Indicate that the initial value of i-th of gate function parameter, K are the number of expert in Mixture of expert model, i.e. tree structure The number of middle leaf node;
(2.2) training data for using step (1.4) to obtain, is iterated optimization, until iterative process to Mixture of expert model Convergence, finally obtained model parameter is mapping relations model;Mapping relations model includes gate function parameter and expert parameter; It is described optimization is iterated to model to include the following steps:
1. allowable error ε when specified iteration ends;
2. calculating the posterior probability of each gate function in epicycle iteration:
Wherein k is iterative steps, pi(y | x, Wi (k)) and pj(y | x, Wj (k)) indicate expert probability distribution, gi (k)(x, vi (k)) table Show the kth step iterative value of i-th of gate function;
3. updating each expert parameter:
Wherein k is iterative steps, and X is the vector of all low resolution video frame block x compositions in training data, and Y is training number The vector of all high-resolution video frame block y compositions, X inTIndicate that the transposition of X, I indicate unit matrix, Hi (k+1)Expression kth+ The vector of the posterior probability composition of all low resolution video frame block x in 1 step corresponding to i-th of expert;
4. updating each gate function parameter:
WhereinIndicate i-th of gate function parameter in kth step iteration,The posteriority that i-th of gate function in iteration is walked for kth is general Rate, x(t)Indicate t-th of low resolution video frame block;
5. calculating the output of each gate function in epicycle iteration:
6. calculating the likelihood probability in epicycle iteration:
Wherein, pi(y | x, Wi (k+1)) indicate expert probability distribution, p (Wi (k+1)) indicate expert parameter probability distribution;
7. judging whether iteration restrains;When the absolute value of the difference of the likelihood probability of the likelihood probability and last round of iteration of epicycle iteration When allowable error ε when less than iteration ends, terminate iteration;Otherwise repeat step 2.~7.;
The gate function parameter v obtained at the end of iterationi, together with expert's quantity K, expert parameter Wi, expert probability distribution standard The mean of a probability distribution μ of poor σ and expert parameter are stored in together as final mapping relations model in disk;
It includes following two steps to carry out increase resolution according to mapping relations model:
(3) pending low-resolution video is pre-processed, including:
(3.1) pending low-resolution video is split as low resolution video frame;
(3.2) low resolution video frame is divided blocking;
(4) low-resolution video is promoted to high-resolution video according to the mapping relations model that step (2) obtains, including:
(4.1) the low resolution video frame block for obtaining step (3) uses step as the input of Mixture of expert model gate function (2) the gate function parameter in the mapping relations model obtained calculates the output of each gate function;
(4.2) parameter of expert's function corresponding to the maximum gate function of output valve is used to calculate corresponding high-resolution video frame Block, the wherein parameter of expert's function are obtained by step (2);
(4.3) increase resolution is carried out according to the step of (4.1) and (4.2) to each low resolution video frame block, obtained pair The high-resolution video frame block answered, by all high-resolution video frame blocks according to its corresponding low resolution video frame block low Position in resolution video frame is spliced into corresponding high-resolution video frame;
(4.4) after obtaining the corresponding high-resolution video frame of all low resolution video frames, it is combined into high-resolution video.
2. the films and television programs renovation method according to claim 1 based on increase resolution, it is characterised in that step (2.1) The Mixture of expert model includes expert and gate function two parts;
Expert is responsible for carrying out mapping transformation to data, and the mapping transformation in the present invention uses linear function as expert's function:
Y=Wx
Wherein W is expert parameter, and x and y indicate low resolution video frame block and corresponding high-resolution video frame block respectively;
Gate function is responsible for determining which expert of selection converts data, and i-th of gate function is expressed as in the present invention:
Wherein, viIndicate i-th of gate function parameter, vjIndicate that j-th of gate function parameter, K are of expert in Mixture of expert model Number.
3. the films and television programs renovation method according to claim 1 based on increase resolution, it is characterised in that:Step (4.2) parameter using expert's function corresponding to the maximum gate function of output valve calculates corresponding high-resolution video frame Block includes the following steps:
1. calculating obtains the serial number of maximum output value gate function:I=arg max (gi)
Wherein, giFor the output of i-th of gate function, obtained by step (4.1);
2. calculating high-resolution video frame block using i-th of expert's function:Y=Wix
Wherein, WiFor the parameter of i-th of expert's function, y is that the high-resolution corresponding to the low resolution video frame block x of input regards Frequency frame block.
CN201610109909.XA 2016-02-29 2016-02-29 Films and television programs renovation method based on increase resolution Active CN105791980B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610109909.XA CN105791980B (en) 2016-02-29 2016-02-29 Films and television programs renovation method based on increase resolution

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610109909.XA CN105791980B (en) 2016-02-29 2016-02-29 Films and television programs renovation method based on increase resolution

Publications (2)

Publication Number Publication Date
CN105791980A CN105791980A (en) 2016-07-20
CN105791980B true CN105791980B (en) 2018-09-14

Family

ID=56403789

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610109909.XA Active CN105791980B (en) 2016-02-29 2016-02-29 Films and television programs renovation method based on increase resolution

Country Status (1)

Country Link
CN (1) CN105791980B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109151339B (en) * 2018-08-27 2021-08-13 武汉有文心文化传媒有限公司 Method for synthesizing characters in recommendation video and related products
CN109981991A (en) * 2019-04-17 2019-07-05 北京旷视科技有限公司 Model training method, image processing method, device, medium and electronic equipment

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8520736B2 (en) * 2009-04-14 2013-08-27 Fastvdo, Llc Real-time superresolution and video transmission
CN101639937B (en) * 2009-09-03 2011-12-14 复旦大学 Super-resolution method based on artificial neural network
CN104778671B (en) * 2015-04-21 2017-09-22 重庆大学 A kind of image super-resolution method based on SAE and rarefaction representation

Also Published As

Publication number Publication date
CN105791980A (en) 2016-07-20

Similar Documents

Publication Publication Date Title
CN105744357B (en) A kind of reduction network video bandwidth occupancy method based on online increase resolution
US11055828B2 (en) Video inpainting with deep internal learning
US10685282B2 (en) Machine-learning based video compression
CN109087273B (en) Image restoration method, storage medium and system based on enhanced neural network
US10902571B2 (en) Automated image synthesis using a comb neural network architecture
CN108734661B (en) High-resolution image prediction method for constructing loss function based on image texture information
CN112543317B (en) Method for converting high-resolution monocular 2D video into binocular 3D video
CN110570377A (en) group normalization-based rapid image style migration method
CN110717868B (en) Video high dynamic range inverse tone mapping model construction and mapping method and device
CN106127689B (en) Image/video super-resolution method and device
CN110533579B (en) Video style conversion method based on self-coding structure and gradient order preservation
US20170109873A1 (en) Image enhancement using self-examples and external examples
CN109191460B (en) Quality evaluation method for tone mapping image
CN106204472A (en) Video image deblurring method based on sparse characteristic
JP2021525401A (en) Image generation network training and image processing methods, equipment, electronics, and media
CN107945255A (en) A kind of virtual actor's facial expression driving method and system
CN113129212B (en) Image super-resolution reconstruction method and device, terminal device and storage medium
WO2020228347A1 (en) Superpixel-based three-dimensional object model generation method, system, and storage medium
CN105791980B (en) Films and television programs renovation method based on increase resolution
CN110428382A (en) A kind of efficient video Enhancement Method, device and storage medium for mobile terminal
Zhang et al. Deep tone mapping network in HSV color space
CN114339030A (en) Network live broadcast video image stabilization method based on self-adaptive separable convolution
Xu et al. Deep video inverse tone mapping
CN102542536A (en) Image quality strengthening method based on generalized equilibrium model
Ye et al. Real3d-portrait: One-shot realistic 3d talking portrait synthesis

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant