CN102413378A - Adaptive neural network-based lost packet recovery method in video transmission - Google Patents

Adaptive neural network-based lost packet recovery method in video transmission Download PDF

Info

Publication number
CN102413378A
CN102413378A CN2011103413360A CN201110341336A CN102413378A CN 102413378 A CN102413378 A CN 102413378A CN 2011103413360 A CN2011103413360 A CN 2011103413360A CN 201110341336 A CN201110341336 A CN 201110341336A CN 102413378 A CN102413378 A CN 102413378A
Authority
CN
China
Prior art keywords
video
network
value
critic
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011103413360A
Other languages
Chinese (zh)
Inventor
柳毅
王晓耘
周涛
刘大为
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Dianzi University
Original Assignee
Hangzhou Dianzi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Dianzi University filed Critical Hangzhou Dianzi University
Priority to CN2011103413360A priority Critical patent/CN102413378A/en
Publication of CN102413378A publication Critical patent/CN102413378A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to an adaptive neural network-based lost packet recovery method in video transmission. The lower layer of a protocol which is used in the existing network transmission has an error correcting code, however, only the problem of error code in the packet rather than the problem of lost packet can be solved. The method comprises the following steps of: taking video information as the state of the current environment to be input; outputting a video encoding rate which is to be used under the input states, wherein a main network lost packet model uses a back propagation adaptive neutral network-based AHC (Actor-Critic) model; and leading the Critic to receive reward feedback of an environment in the process of studying, updating Value (x), and handing a reward forecast error to an Actor module in the manner of external feedback for guiding the Actor to correct a strategy of a selecting action. The influence of the network lost packet to internet protocol (IP) video call is effectively reduced, and the stability of video interaction is improved.

Description

A kind of video transmission loss recovery method based on adaptive neural network
Technical field
The invention belongs to the video data technology field, relate to a kind of video transmission loss recovery technology based on adaptive neural network.
Background technology
Along with the development of 3G technology and broadband network, and constantly the weeding out the old and bring forth the new of terminal, video communication service will obtain significant progress, thus Video transmission system to real-time and accuracy require increasingly high.In the video transmission process of network, the principal element that influences audio frequency tonequality has time delay, electrostatic interference, packet loss and shake etc.Wherein, the packet loss problem is a most crucial factor that influences video interactive service quality always.Although the lower protocol layers of using in the Network Transmission has error correcting code, can only solve error code in the bag, can't solve the packet loss problem.The wrong retransmission mechanism of Transmission Control Protocol can guarantee the correctness of network data flow; But it can't satisfy the real-time requirement of video signal transmission; Particularly under the situation of many group broadcasting, different recipients has different packet loss retransmission requesting, and this not only can increase the consumption of the network bandwidth; And cause Network Transmission to be stopped up conversely, cause bigger data-bag lost.Therefore, need study, to improve the quality of video transmission to the loss recovery treatment technology of application layer.
Backpropagation adaptive neural network (Back Propagation Neural Network is called for short the BP network) is to use the most extensive and successful a kind of artificial neural net at present.Mainly form by input layer, hidden layer and output layer.Feedforward network is meant, only exists the weights between the different layers neuron to connect, and do not exist with the connection between one deck neuron, and also the connection between the neuron, the just connection between adjacent layer, and the situation that interlayer connects does not appear.It is existing that oneself proves: the single hidden layer feedforward network with S type activation primitive; As long as hidden layer has fully many neurons; It just can approach any known function with precision arbitrarily, thereby shows that the BP neural net can be used as a general function approximation device.The BP Application of Neural Network among the video transmission packet loss, can be overcome the defective of traditional packet loss detection video recovery technology, thereby has good application and promotion prospect.
Summary of the invention
The present invention proposes a kind of video transmission loss recovery technology based on adaptive neural network; Comprise the Network Packet Loss model and based on backpropagation adaptive neural network (Back Propagation Neural Network; The BP network) loss recovery technology, the real-time and the accuracy of raising Video transmission system.
The present invention adopts following technical solution: at first video information is as the state input of current environment, the video frequency coding rate that output should be used under these input states.Its main Network Packet Loss model has used the AHC model (or being called the Actor-Critic model) based on the backpropagation adaptive neural network; Wherein Actor is responsible for producing an action under the current state condition; Critic then is responsible for can getablely rewarding under the study prediction current state condition, and Value (x) is Critic and environment is rewarded the function of predicting.In the learning process, Critic accepts the award feedback of environment, upgrades Value (x), and will reward predicated error and give Actor module with the form of external world's feedback, is used to instruct Actor to correct the strategy of choosing action.Wherein, Value (x) learns according to the Q-learning update rule, and Actor then adopts Gaussian ASLA function of movement to realize, Actor and Critic use BP (Back-Propagation) neural net to carry out extensive work.
Concrete grammar of the present invention may further comprise the steps:
Step 1. is selected the input variable and the output variable of video loss recovery model, confirms training sample; Concrete grammar is:
Input variable comprises three types of network parameter information and video information, and described three types of network parameter information are respectively 4 time delay state values, 4 dither state values, 1 packet loss state value;
Described video information is 6 frame complexity state values;
The video frequency coding rate of output variable for using.
Step 2. data normalization is handled, and the data in the input and output sequence is carried out normalization handle, and specifically is to the processing of standardizing of the state variable parameter of time delay, shake and frame complexity, is converted into the value of [0,1] scope.
Step 3. video delivery network packet loss model has used the AHC model (Actor-Critic model) based on the backpropagation adaptive neural network, and concrete grammar is:
Through using three neural nets to be used for the Value (x) among the match Critic, μ (x) and the α (x) among the Actor respectively, wherein μ (x) and α (x) represent the average and the standard deviation of output valve respectively;
Described Actor is responsible for producing an action under the current state condition,
Described Critic then is responsible for can getablely rewarding under the study prediction current state condition;
Described Value (x) rewards the function of predicting for Critic to environment;
The input of three neural nets is identical, all is normalization variable state value afterwards, and it is Inputl-Inputl5 that input layer has 15 input units; Each neural net has a latent layer, and latent layer has three neurons to consist of Hiden1-Hiden3; It is Out that output layer has an output neuron; The neuronic threshold function of in the network each uses be can be little the Sigmoid function.
In the learning process of step 4. training BP neural net; Concrete grammar is:
Actor selects a code check when moment t-1, then can receive the feedback incentive message of environment, promptly is the video quality information after proofreading and correct here; This moment, Critic need use the Q-learning update rule, upgraded the award anticipation function of oneself.
The Q-learning update rule is the deviation that anticipation function is predicted when moment t-1, shown in 1.
Figure 2011103413360100002DEST_PATH_IMAGE002
formula 1
R wherein T-1Be the award that the action of t-1 is constantly received, Value tWith Value T-1The output valve of function Value (x) after Critic assesses environment of living in when being respectively moment t and t-1, γ is a learning parameter;
Formula 1 is used the actual value of the award that the current predicted value of anticipation function Value (x) replaces can obtaining in the future, adds by moment t-1 to the award of gained the t constantly the award that should obtain during moment t-1 exactly then.
Step 5. test b P neural net;
BP neural net to training is accomplished is tested; Historical data is formed input information according to network parameter information in the step (1) and video information; Carrying out normalization according to step (2) again handles; Through obtaining 14 dateouts altogether after the normalization, add packet loss in addition like this, it is corresponding with outl-outl5 respectively to have 15 Input; Directly call the sim function in the MatLab Neural Network Toolbox according to step (3), test matrix carried out emulation, wherein in the network each neuronic threshold function all be can be little the Sigmoid function, correspond to the packet loss predicted value of video delivery network.
The anti-normalization of step 6. data is handled;
The video code rate value that output is under the current state is carried out anti-normalization processing, uses the obedience average to produce output valve as μ (x), standard deviation as the Gaussian distribution randomizer of α (x).
The beneficial effect that the present invention has is: through the video loss recovery technology of research based on the backpropagation adaptive neural network; Permission provides significantly improved normal video quality in having the network of packet loss; With of the influence of effective reduction Network Packet Loss, improve the stability of video interactive to the IP video call.Make when receiving more relatively multimedia data information based on the video call system receiving terminal; Also can recover accurate data information more; Thereby can improve the video displaying quality; For providing, video user guarantees service quality (Quality of Service, video transmission technologies QoS).
Description of drawings
Fig. 1 is based on the sketch map of the loss recovery method of backpropagation adaptive neural network.
Embodiment
Below in conjunction with accompanying drawing the present invention is described further.
As shown in Figure 1, the present invention includes following steps:
Step 1. is selected the input variable and the output variable of video loss recovery model, confirms training sample; Concrete grammar is:
Input variable comprises three types of network parameter information and video information, and described three types of network parameter information are respectively 4 time delay state values, 4 dither state values, 1 packet loss state value; Described video information is 6 frame complexity state values, the video frequency coding rate of output variable for using;
Step 2. data normalization is handled:
Because network delay has very big distribution, arrive the hundreds of millisecond greatly for a short time to being lower than 1 millisecond, delay variation also is like this, the scope of packet loss is then very little, can get the value between little 1 in theory, in fact is generally less than 0.01.In addition, most of Frame complexity is positioned at 10 5Below, there is few frames can reach 4*10 5More than.Directly the state value that these spans are widely different is very inappropriate as input, needs to use normalized method, the data in the input and output sequence is carried out normalization handle.
Therefore, to the processing of standardizing of the state variable parameter of time delay (getting 4 values), shake (getting 4 values), three parts of frame frame complexity (getting 6 values), the real number value x of each input is converted into the output that is positioned at value between [0,1].Considering above three state variables mentioning, because the span of packet loss between 0-1, does not need normalization.Like this through the normalization after obtain 14 dateouts altogether, add packet loss in addition, have 15 respectively with Fig. 1 in outl-outl5 corresponding.These 15 data are used for the state of descriptive system place environment, as Actor and Critic.The input of employed neural net in the fit procedure, respectively with Fig. 1 in Inputl-Inputl5 corresponding one by one.
Step 3. video delivery network packet loss model has used the AHC model (Actor-Critic model) based on the backpropagation adaptive neural network, and concrete grammar is:
Through using three neural nets to be used for the Value (x) among the match Critic, μ (x) and the α (x) among the Actor respectively, wherein μ (x) and α (x) represent the average and the standard deviation of output valve respectively;
Described Actor is responsible for producing an action under the current state condition,
Described Critic then is responsible for can getablely rewarding under the study prediction current state condition;
Described Value (x) rewards the function of predicting for Critic to environment;
The input of three neural nets is identical, all is normalization variable state value afterwards, and it is Inputl-Inputl5 that input layer as shown in fig. 1 has 15 input units; Each neural net has a latent layer (Hidden Layer), and latent layer has three neurons to consist of Hiden1-Hiden3; It is Out that output layer has an output neuron; The neuronic threshold function of in the network each uses be can be little the Sigmoid function;
In the learning process of step 4. training BP neural net; Concrete grammar is:
Actor selects a code check when moment t-1, then can receive the feedback incentive message of environment, promptly is the video quality information after proofreading and correct here; This moment, Critic need use the Q-learning update rule, upgraded the award anticipation function of oneself;
The Q-learning update rule is the deviation that anticipation function is predicted when moment t-1, shown in 1.
Figure 711468DEST_PATH_IMAGE002
formula 1
R wherein T-1Be the award that the action of t-1 is constantly received, Value tWith Value T-1The output valve of function Value (x) after Critic assesses environment of living in when being respectively moment t and t-1, γ is a learning parameter;
Formula 1 is used the actual value of the award that the current predicted value of anticipation function Value (x) replaces can obtaining in the future, adds by moment t-1 to the award of gained the t constantly the award that should obtain during moment t-1 exactly then;
Step 5. test b P neural net;
BP neural net to training is accomplished is tested; Historical data is formed input information according to network parameter information in the step (1) and video information; Carrying out normalization according to step (2) again handles; Through obtaining 14 dateouts altogether after the normalization, add packet loss in addition like this, it is corresponding with outl-outl5 respectively to have 15 Input; Directly call the sim function in the MatLab Neural Network Toolbox according to step (3), test matrix carried out emulation, wherein in the network each neuronic threshold function all be can be little the Sigmoid function, correspond to the packet loss predicted value of video delivery network;
The anti-normalization of step 6. data is handled;
The video code rate value that output is under the current state is carried out anti-normalization processing, uses the obedience average to produce output valve as μ (x), standard deviation as the Gaussian distribution randomizer of α (x).

Claims (1)

1. video transmission loss recovery method based on adaptive neural network is characterized in that this method may further comprise the steps:
Step 1. is selected the input variable and the output variable of video loss recovery model, confirms training sample; Concrete grammar is:
Input variable comprises three types of network parameter information and video information, and described three types of network parameter information are respectively 4 time delay state values, 4 dither state values, 1 packet loss state value;
Described video information is 6 frame complexity state values,
The video frequency coding rate of output variable for using;
Step 2. data normalization is handled, and the data in the input and output sequence is carried out normalization handle, and specifically is to the processing of standardizing of the state variable parameter of time delay, shake and frame complexity, is converted into the value of [0,1] scope;
Step 3. video delivery network packet loss model has used the AHC model (Actor-Critic model) based on the backpropagation adaptive neural network, and concrete grammar is:
Through using three neural nets to be used for the Value (x) among the match Critic, μ (x) and the α (x) among the Actor respectively, wherein μ (x) and α (x) represent the average and the standard deviation of output valve respectively;
Described Actor is responsible for producing an action under the current state condition;
Described Critic then is responsible for can getablely rewarding under the study prediction current state condition;
Described Value (x) rewards the function of predicting for Critic to environment;
The input of three neural nets is identical, all is normalization variable state value afterwards, and it is Inputl-Inputl5 that input layer has 15 input units; Each neural net has a latent layer, and latent layer has three neurons to consist of Hiden1-Hiden3; It is Out that output layer has an output neuron; The neuronic threshold function of in the network each uses be can be little the sigmoid function;
In the learning process of step 4. training BP neural net; Concrete grammar is:
Actor selects a code check when moment t-1, then can receive the feedback incentive message of environment, promptly is the video quality information after proofreading and correct here; This moment, Critic need use the Q-learning update rule, upgraded the award anticipation function of oneself;
The Q-learning update rule is the deviation that anticipation function is predicted when moment t-1, shown in the following formula
R wherein T-1Be the award that the action of t-1 is constantly received, Value tWith Value T-1The output valve of function Value (x) after Critic assesses environment of living in when being respectively moment t and t-1, γ is a learning parameter;
Following formula uses the actual value of the award that the current predicted value of anticipation function Value (x) replaces can obtaining in the future, adds by moment t-1 to the award of gained the t constantly the award that should obtain during moment t-1 exactly then;
Step 5. test b P neural net;
BP neural net to training is accomplished is tested; Historical data is formed input information according to network parameter information in the step (1) and video information; Carrying out normalization according to step (2) again handles; Through obtaining 14 dateouts altogether after the normalization, add packet loss in addition like this, it is corresponding with outl-outl5 respectively to have 15 Input; Directly call the sim function in the MatLab Neural Network Toolbox according to step (3), test matrix carried out emulation, wherein in the network each neuronic threshold function all be can be little the Sigmoid function, correspond to the packet loss predicted value of video delivery network;
The anti-normalization of step 6. data is handled;
The video code rate value that output is under the current state is carried out anti-normalization processing, uses the obedience average to produce output valve as μ (x), standard deviation as the Gaussian distribution randomizer of α (x).
CN2011103413360A 2011-11-02 2011-11-02 Adaptive neural network-based lost packet recovery method in video transmission Pending CN102413378A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011103413360A CN102413378A (en) 2011-11-02 2011-11-02 Adaptive neural network-based lost packet recovery method in video transmission

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011103413360A CN102413378A (en) 2011-11-02 2011-11-02 Adaptive neural network-based lost packet recovery method in video transmission

Publications (1)

Publication Number Publication Date
CN102413378A true CN102413378A (en) 2012-04-11

Family

ID=45915175

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011103413360A Pending CN102413378A (en) 2011-11-02 2011-11-02 Adaptive neural network-based lost packet recovery method in video transmission

Country Status (1)

Country Link
CN (1) CN102413378A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107547914A (en) * 2017-08-15 2018-01-05 浙江工业大学 The substandard video segments based on KNN Q study of DASH obtain optimization method
CN107808004A (en) * 2017-11-15 2018-03-16 北京百度网讯科技有限公司 Model training method and system, server, storage medium
CN107992939A (en) * 2017-12-06 2018-05-04 湖北工业大学 Cutting force gear working method is waited based on depth enhancing study
CN108010533A (en) * 2016-10-27 2018-05-08 北京酷我科技有限公司 The automatic identifying method and device of voice data code check
CN108063961A (en) * 2017-12-22 2018-05-22 北京联合网视文化传播有限公司 A kind of self-adaption code rate video transmission method and system based on intensified learning
CN109218083A (en) * 2018-08-27 2019-01-15 广州爱拍网络科技有限公司 A kind of voice data transmission method and device
CN109726811A (en) * 2017-10-27 2019-05-07 谷歌有限责任公司 Use priority formation neural network
CN110006486A (en) * 2019-04-01 2019-07-12 中清控(武汉)科技有限公司 A kind of intelligence flow temperature acquisition control module and intelligence flow temperature measurement method
CN112511482A (en) * 2019-09-16 2021-03-16 华为技术有限公司 Media data transmission method, device and system
CN112532349A (en) * 2020-11-24 2021-03-19 广州技象科技有限公司 Data processing method and device based on decoding abnormity

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101808244A (en) * 2010-03-24 2010-08-18 北京邮电大学 Video transmission control method and system
CN102075312A (en) * 2011-01-10 2011-05-25 西安电子科技大学 Video service quality-based hybrid selective repeat method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101808244A (en) * 2010-03-24 2010-08-18 北京邮电大学 Video transmission control method and system
CN102075312A (en) * 2011-01-10 2011-05-25 西安电子科技大学 Video service quality-based hybrid selective repeat method

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108010533A (en) * 2016-10-27 2018-05-08 北京酷我科技有限公司 The automatic identifying method and device of voice data code check
CN107547914A (en) * 2017-08-15 2018-01-05 浙江工业大学 The substandard video segments based on KNN Q study of DASH obtain optimization method
CN109726811A (en) * 2017-10-27 2019-05-07 谷歌有限责任公司 Use priority formation neural network
US11797839B2 (en) 2017-10-27 2023-10-24 Google Llc Training neural networks using priority queues
CN107808004A (en) * 2017-11-15 2018-03-16 北京百度网讯科技有限公司 Model training method and system, server, storage medium
CN107992939A (en) * 2017-12-06 2018-05-04 湖北工业大学 Cutting force gear working method is waited based on depth enhancing study
CN108063961A (en) * 2017-12-22 2018-05-22 北京联合网视文化传播有限公司 A kind of self-adaption code rate video transmission method and system based on intensified learning
CN108063961B (en) * 2017-12-22 2020-07-31 深圳市云网拜特科技有限公司 Self-adaptive code rate video transmission method and system based on reinforcement learning
CN109218083A (en) * 2018-08-27 2019-01-15 广州爱拍网络科技有限公司 A kind of voice data transmission method and device
CN109218083B (en) * 2018-08-27 2021-08-13 广州猎游信息科技有限公司 Voice data transmission method and device
CN110006486A (en) * 2019-04-01 2019-07-12 中清控(武汉)科技有限公司 A kind of intelligence flow temperature acquisition control module and intelligence flow temperature measurement method
CN112511482A (en) * 2019-09-16 2021-03-16 华为技术有限公司 Media data transmission method, device and system
CN112532349A (en) * 2020-11-24 2021-03-19 广州技象科技有限公司 Data processing method and device based on decoding abnormity

Similar Documents

Publication Publication Date Title
CN102413378A (en) Adaptive neural network-based lost packet recovery method in video transmission
CN101808244B (en) Video transmission control method and system
WO2021208771A1 (en) Reinforced learning method and device
US20180307979A1 (en) Distributed deep learning using a distributed deep neural network
CN112491714B (en) Intelligent QoS route optimization method and system based on deep reinforcement learning in SDN environment
CN110977967A (en) Robot path planning method based on deep reinforcement learning
Wei et al. TRUST: A TCP throughput prediction method in mobile networks
Mei et al. Realtime mobile bandwidth prediction using lstm neural network
CN107179700A (en) A kind of intelligent home control system and method based on Alljoyn and machine learning
Huang et al. Learning tailored adaptive bitrate algorithms to heterogeneous network conditions: A domain-specific priors and meta-reinforcement learning approach
Kamyod End-to-end reliability analysis of an IoT based smart agriculture
Muccini et al. A machine learning-driven approach for proactive decision making in adaptive architectures
CN113422751B (en) Streaming media processing method and device based on online reinforcement learning and electronic equipment
CN113748460A (en) Bandwidth extension of incoming data using neural networks
Hu et al. Intelligent decision making framework for 6G network
CN112269729A (en) Intelligent load analysis method for large-scale server cluster of online shopping platform
CN113761148A (en) Conversation information acquisition method, device, equipment and storage medium
CN113259657A (en) DPPO code rate self-adaptive control system and method based on video quality fraction
CN114024906B (en) Flow control method, device and system for optimizing video transmission user experience quality
CN115022684B (en) Video stream self-adaptive transmission method based on deep reinforcement learning under QUIC protocol
US20230336739A1 (en) Rate control machine learning models with feedback control for video encoding
CN116414951A (en) Intelligent dialogue method, model training method, device, storage medium and equipment
RU2447494C1 (en) Intelligent controller with self-modification rules of educational and control neural networks
Danner et al. Decentralized machine learning using compressed push-pull averaging
WO2023150943A1 (en) Method for updating wireless channel model, and apparatus, device and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120411