CN102413378A

CN102413378A - Adaptive neural network-based lost packet recovery method in video transmission

Info

Publication number: CN102413378A
Application number: CN2011103413360A
Authority: CN
Inventors: 柳毅; 王晓耘; 周涛; 刘大为
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Dianzi University
Priority date: 2011-11-02
Filing date: 2011-11-02
Publication date: 2012-04-11

Abstract

The invention relates to an adaptive neural network-based lost packet recovery method in video transmission. The lower layer of a protocol which is used in the existing network transmission has an error correcting code, however, only the problem of error code in the packet rather than the problem of lost packet can be solved. The method comprises the following steps of: taking video information as the state of the current environment to be input; outputting a video encoding rate which is to be used under the input states, wherein a main network lost packet model uses a back propagation adaptive neutral network-based AHC (Actor-Critic) model; and leading the Critic to receive reward feedback of an environment in the process of studying, updating Value (x), and handing a reward forecast error to an Actor module in the manner of external feedback for guiding the Actor to correct a strategy of a selecting action. The influence of the network lost packet to internet protocol (IP) video call is effectively reduced, and the stability of video interaction is improved.

Description

A kind of video transmission loss recovery method based on adaptive neural network

Technical field

The invention belongs to the video data technology field, relate to a kind of video transmission loss recovery technology based on adaptive neural network.

Background technology

Along with the development of 3G technology and broadband network, and constantly the weeding out the old and bring forth the new of terminal, video communication service will obtain significant progress, thus Video transmission system to real-time and accuracy require increasingly high.In the video transmission process of network, the principal element that influences audio frequency tonequality has time delay, electrostatic interference, packet loss and shake etc.Wherein, the packet loss problem is a most crucial factor that influences video interactive service quality always.Although the lower protocol layers of using in the Network Transmission has error correcting code, can only solve error code in the bag, can't solve the packet loss problem.The wrong retransmission mechanism of Transmission Control Protocol can guarantee the correctness of network data flow; But it can't satisfy the real-time requirement of video signal transmission; Particularly under the situation of many group broadcasting, different recipients has different packet loss retransmission requesting, and this not only can increase the consumption of the network bandwidth; And cause Network Transmission to be stopped up conversely, cause bigger data-bag lost.Therefore, need study, to improve the quality of video transmission to the loss recovery treatment technology of application layer.

Backpropagation adaptive neural network (Back Propagation Neural Network is called for short the BP network) is to use the most extensive and successful a kind of artificial neural net at present.Mainly form by input layer, hidden layer and output layer.Feedforward network is meant, only exists the weights between the different layers neuron to connect, and do not exist with the connection between one deck neuron, and also the connection between the neuron, the just connection between adjacent layer, and the situation that interlayer connects does not appear.It is existing that oneself proves: the single hidden layer feedforward network with S type activation primitive; As long as hidden layer has fully many neurons; It just can approach any known function with precision arbitrarily, thereby shows that the BP neural net can be used as a general function approximation device.The BP Application of Neural Network among the video transmission packet loss, can be overcome the defective of traditional packet loss detection video recovery technology, thereby has good application and promotion prospect.

Summary of the invention

The present invention proposes a kind of video transmission loss recovery technology based on adaptive neural network; Comprise the Network Packet Loss model and based on backpropagation adaptive neural network (Back Propagation Neural Network; The BP network) loss recovery technology, the real-time and the accuracy of raising Video transmission system.

The present invention adopts following technical solution: at first video information is as the state input of current environment, the video frequency coding rate that output should be used under these input states.Its main Network Packet Loss model has used the AHC model (or being called the Actor-Critic model) based on the backpropagation adaptive neural network; Wherein Actor is responsible for producing an action under the current state condition; Critic then is responsible for can getablely rewarding under the study prediction current state condition, and Value (x) is Critic and environment is rewarded the function of predicting.In the learning process, Critic accepts the award feedback of environment, upgrades Value (x), and will reward predicated error and give Actor module with the form of external world's feedback, is used to instruct Actor to correct the strategy of choosing action.Wherein, Value (x) learns according to the Q-learning update rule, and Actor then adopts Gaussian ASLA function of movement to realize, Actor and Critic use BP (Back-Propagation) neural net to carry out extensive work.

Concrete grammar of the present invention may further comprise the steps:

Step 1. is selected the input variable and the output variable of video loss recovery model, confirms training sample; Concrete grammar is:

Input variable comprises three types of network parameter information and video information, and described three types of network parameter information are respectively 4 time delay state values, 4 dither state values, 1 packet loss state value;

Described video information is 6 frame complexity state values;

The video frequency coding rate of output variable for using.

Step 2. data normalization is handled, and the data in the input and output sequence is carried out normalization handle, and specifically is to the processing of standardizing of the state variable parameter of time delay, shake and frame complexity, is converted into the value of [0,1] scope.

Step 3. video delivery network packet loss model has used the AHC model (Actor-Critic model) based on the backpropagation adaptive neural network, and concrete grammar is:

Through using three neural nets to be used for the Value (x) among the match Critic, μ (x) and the α (x) among the Actor respectively, wherein μ (x) and α (x) represent the average and the standard deviation of output valve respectively;

Described Actor is responsible for producing an action under the current state condition,

Described Critic then is responsible for can getablely rewarding under the study prediction current state condition;

Described Value (x) rewards the function of predicting for Critic to environment;

The input of three neural nets is identical, all is normalization variable state value afterwards, and it is Inputl-Inputl5 that input layer has 15 input units; Each neural net has a latent layer, and latent layer has three neurons to consist of Hiden1-Hiden3; It is Out that output layer has an output neuron; The neuronic threshold function of in the network each uses be can be little the Sigmoid function.

In the learning process of step 4. training BP neural net; Concrete grammar is:

Actor selects a code check when moment t-1, then can receive the feedback incentive message of environment, promptly is the video quality information after proofreading and correct here; This moment, Critic need use the Q-learning update rule, upgraded the award anticipation function of oneself.

The Q-learning update rule is the deviation that anticipation function is predicted when moment t-1, shown in 1.

Figure 2011103413360100002DEST_PATH_IMAGE002

formula 1

R wherein _T-1Be the award that the action of t-1 is constantly received, Value _tWith Value _T-1The output valve of function Value (x) after Critic assesses environment of living in when being respectively moment t and t-1, γ is a learning parameter;

Formula 1 is used the actual value of the award that the current predicted value of anticipation function Value (x) replaces can obtaining in the future, adds by moment t-1 to the award of gained the t constantly the award that should obtain during moment t-1 exactly then.

Step 5. test b P neural net;

BP neural net to training is accomplished is tested; Historical data is formed input information according to network parameter information in the step (1) and video information; Carrying out normalization according to step (2) again handles; Through obtaining 14 dateouts altogether after the normalization, add packet loss in addition like this, it is corresponding with outl-outl5 respectively to have 15 Input; Directly call the sim function in the MatLab Neural Network Toolbox according to step (3), test matrix carried out emulation, wherein in the network each neuronic threshold function all be can be little the Sigmoid function, correspond to the packet loss predicted value of video delivery network.

The anti-normalization of step 6. data is handled;

The video code rate value that output is under the current state is carried out anti-normalization processing, uses the obedience average to produce output valve as μ (x), standard deviation as the Gaussian distribution randomizer of α (x).

The beneficial effect that the present invention has is: through the video loss recovery technology of research based on the backpropagation adaptive neural network; Permission provides significantly improved normal video quality in having the network of packet loss; With of the influence of effective reduction Network Packet Loss, improve the stability of video interactive to the IP video call.Make when receiving more relatively multimedia data information based on the video call system receiving terminal; Also can recover accurate data information more; Thereby can improve the video displaying quality; For providing, video user guarantees service quality (Quality of Service, video transmission technologies QoS).

Description of drawings

Fig. 1 is based on the sketch map of the loss recovery method of backpropagation adaptive neural network.

Embodiment

Below in conjunction with accompanying drawing the present invention is described further.

As shown in Figure 1, the present invention includes following steps:

Input variable comprises three types of network parameter information and video information, and described three types of network parameter information are respectively 4 time delay state values, 4 dither state values, 1 packet loss state value; Described video information is 6 frame complexity state values, the video frequency coding rate of output variable for using;

Step 2. data normalization is handled:

Because network delay has very big distribution, arrive the hundreds of millisecond greatly for a short time to being lower than 1 millisecond, delay variation also is like this, the scope of packet loss is then very little, can get the value between little 1 in theory, in fact is generally less than 0.01.In addition, most of Frame complexity is positioned at 10 ⁵Below, there is few frames can reach 4*10 ⁵More than.Directly the state value that these spans are widely different is very inappropriate as input, needs to use normalized method, the data in the input and output sequence is carried out normalization handle.

Therefore, to the processing of standardizing of the state variable parameter of time delay (getting 4 values), shake (getting 4 values), three parts of frame frame complexity (getting 6 values), the real number value x of each input is converted into the output that is positioned at value between [0,1].Considering above three state variables mentioning, because the span of packet loss between 0-1, does not need normalization.Like this through the normalization after obtain 14 dateouts altogether, add packet loss in addition, have 15 respectively with Fig. 1 in outl-outl5 corresponding.These 15 data are used for the state of descriptive system place environment, as Actor and Critic.The input of employed neural net in the fit procedure, respectively with Fig. 1 in Inputl-Inputl5 corresponding one by one.

The input of three neural nets is identical, all is normalization variable state value afterwards, and it is Inputl-Inputl5 that input layer as shown in fig. 1 has 15 input units; Each neural net has a latent layer (Hidden Layer), and latent layer has three neurons to consist of Hiden1-Hiden3; It is Out that output layer has an output neuron; The neuronic threshold function of in the network each uses be can be little the Sigmoid function;

In the learning process of step 4. training BP neural net; Concrete grammar is:

Actor selects a code check when moment t-1, then can receive the feedback incentive message of environment, promptly is the video quality information after proofreading and correct here; This moment, Critic need use the Q-learning update rule, upgraded the award anticipation function of oneself;

formula 1

Formula 1 is used the actual value of the award that the current predicted value of anticipation function Value (x) replaces can obtaining in the future, adds by moment t-1 to the award of gained the t constantly the award that should obtain during moment t-1 exactly then;

Step 5. test b P neural net;

BP neural net to training is accomplished is tested; Historical data is formed input information according to network parameter information in the step (1) and video information; Carrying out normalization according to step (2) again handles; Through obtaining 14 dateouts altogether after the normalization, add packet loss in addition like this, it is corresponding with outl-outl5 respectively to have 15 Input; Directly call the sim function in the MatLab Neural Network Toolbox according to step (3), test matrix carried out emulation, wherein in the network each neuronic threshold function all be can be little the Sigmoid function, correspond to the packet loss predicted value of video delivery network;

The anti-normalization of step 6. data is handled;

Claims

1. video transmission loss recovery method based on adaptive neural network is characterized in that this method may further comprise the steps:

Described video information is 6 frame complexity state values,

The video frequency coding rate of output variable for using;

Step 2. data normalization is handled, and the data in the input and output sequence is carried out normalization handle, and specifically is to the processing of standardizing of the state variable parameter of time delay, shake and frame complexity, is converted into the value of [0,1] scope;

Described Actor is responsible for producing an action under the current state condition;

The input of three neural nets is identical, all is normalization variable state value afterwards, and it is Inputl-Inputl5 that input layer has 15 input units; Each neural net has a latent layer, and latent layer has three neurons to consist of Hiden1-Hiden3; It is Out that output layer has an output neuron; The neuronic threshold function of in the network each uses be can be little the sigmoid function;

In the learning process of step 4. training BP neural net; Concrete grammar is:

The Q-learning update rule is the deviation that anticipation function is predicted when moment t-1, shown in the following formula

Following formula uses the actual value of the award that the current predicted value of anticipation function Value (x) replaces can obtaining in the future, adds by moment t-1 to the award of gained the t constantly the award that should obtain during moment t-1 exactly then;

Step 5. test b P neural net;

The anti-normalization of step 6. data is handled;