CN113329266B - Panoramic video self-adaptive transmission method based on limited user visual angle feedback - Google Patents

Panoramic video self-adaptive transmission method based on limited user visual angle feedback

Info

Publication number
CN113329266B
CN113329266B (application CN202110634899.2A)
Authority
CN
China
Prior art keywords
video
panoramic video
branch
gof
code rate
Prior art date
Legal status: Active
Application number
CN202110634899.2A
Other languages
Chinese (zh)
Other versions
CN113329266A (en)
Inventor
黎洁
韩玲
李奇越
张聪
王枭
王慧宇
陈勇
彭涛
Current Assignee
Hefei University of Technology
Original Assignee
Hefei University of Technology
Priority date: 2021-06-08
Filing date: 2021-06-08
Publication date: 2022-07-05
Application filed by Hefei University of Technology
Priority to CN202110634899.2A
Publication of CN113329266A: 2021-08-31
Application granted; publication of CN113329266B: 2022-07-05
Status: Active

Classifications

    • H04N 21/816 — Monomedia components involving special video data, e.g. 3D video
    • H04N 19/146 — Adaptive coding controlled by the data rate or code amount at the encoder output
    • H04N 19/176 — Adaptive coding in which the coding unit is an image region that is a block, e.g. a macroblock
    • H04N 19/42 — Implementation details or hardware specially adapted for video compression or decompression
    • H04N 21/234309 — Reformatting of video signals by transcoding between formats or standards
    • H04N 21/234363 — Reformatting of video signals by altering the spatial resolution
    • H04N 21/234381 — Reformatting of video signals by altering the temporal resolution, e.g. frame skipping
    • H04N 21/2662 — Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate based on the client capabilities

Abstract

The invention discloses a panoramic video adaptive transmission method based on limited user view-angle feedback, which comprises the following steps: 1. the server partitions the original panoramic video into blocks and encodes each video block at several quality levels; 2. the server extracts the salient features of the panoramic video and, combined with the view-angle feedback of a limited number of users, predicts the view angles of all users; 3. the server selects a quality level for each video block according to the predicted view angles of all users and the bandwidth of the downlink channel, and then transmits the video blocks to the clients. Under bandwidth constraints, the invention improves resource utilization and the quality of experience (QoE) of multiple users.

Description

Panoramic video self-adaptive transmission method based on limited user visual angle feedback
Technical Field
The invention relates to the field of multimedia video transmission, and in particular to an adaptive streaming method for panoramic video.
Background
Researchers at Chongqing University of Posts and Telecommunications invented a DASH-based panoramic video adaptive transmission method (publication No. CN108235131A), which comprises the following steps: establishing a mapping-relation model between the three-dimensional panoramic video and the two-dimensional panoramic video; dividing the VR panoramic video into priority regions based on human vision and motion characteristics; slicing the panoramic video at the server; predicting the available bandwidth at the client with a Kalman-filtering bandwidth-estimation module; smoothing the available bandwidth in the client video-buffer module according to the buffer state; predicting the user viewport in the client viewport-perception module based on motion inertia; and making adaptive transmission decisions in the client decision module by jointly considering the user viewport, the network environment, and the buffer state. However, the method does not consider the role of QoE in panoramic video transmission, lacks a user quality-of-experience (QoE) metric during transmission, and cannot reflect how the user experience changes during transmission.
Pan Yuxuan et al. of Beijing University of Posts and Telecommunications invented a reinforcement-learning-based adaptive panoramic video transmission method (publication No. CN112584119A), characterized by the following steps: the remote video server analyzes the video content, obtains the motion speed and depth of field of the content by an optical-flow method, and derives a numerical measure of video quality; the remote video server performs tile segmentation according to the video quality, spatially divides the video into a specified number of tiles of different sizes with a two-dimensional clustering algorithm, and encodes the tiles at different quality levels to obtain multiple quality versions; the remote video server trains a deep learning model on the stored bandwidth data and the multi-quality encoding results, and uses it as a tile-wise adaptive quality selector; the client downloads and runs the deep learning model locally, collects panoramic-video viewing information from the user equipment, obtains through viewpoint prediction the tile range covered by the user's future field of view, and requests the corresponding video content from the remote server according to the quality selected by the model; after receiving the video content, the client decodes it, splices the tiles, renders them, and presents the picture to the user. However, it ignores the guiding role of QoE in panoramic video transmission and cannot improve the user experience during transmission; moreover, prediction with reinforcement learning is computationally complex and time-consuming.
Researchers at Fuzhou University invented a GAN-based panoramic video adaptive streaming method (publication No. CN112616014A), which comprises the following steps: step S1, constructing a temporal similarity graph; step S2, constructing an overall network including an encoding network E, a generating network G, and a discriminating network D; step S3, constructing a joint cost function of the code rate and the reconstruction quality of the generating network G; step S4, feeding the obtained temporal similarity graph into the network and training the model to obtain the trained overall network; step S5, at the encoder side, compressing the odd frames, extracting the latent codes of the even frames as auxiliary information, combining them with the compressed odd-frame video using the MPEG-DASH protocol, and performing dynamic adaptive transmission; step S6, at the decoder side, using the GAN generator to reconstruct the even frames by combining the latent codes of the odd and even frames. However, this method considers only the video itself and does not consider the role of QoE in panoramic video transmission at all.
Wang Yumei et al. of Beijing University of Posts and Telecommunications invented a viewpoint-based panoramic video adaptive streaming method and system (publication No. CN112822564A). In the method, the server spatially divides the panoramic video into different tiles, encodes the tile videos into a panoramic video stream file, and packages and slices it; the client predicts the user's future field-of-view region by linear regression from the historical viewpoint information of the user watching the panoramic video and plays the predicted panoramic video. The invention appropriately enlarges the field-of-view region according to the deviation of historical predictions, selects a high code rate for tiles inside the field of view and a low code rate for tiles outside it, and dynamically selects different code rates for different tiles as network conditions change, thereby improving video clarity inside the field of view, reducing stalling, and effectively improving the user's viewing quality. However, this method is not suitable for multi-user view prediction, has a short prediction horizon, and does not consider the role of QoE in panoramic video transmission.
Disclosure of Invention
In order to avoid the defects of the prior art, the invention provides a VR (panoramic) video adaptive transmission method based on limited user view-angle feedback, so as to improve resource utilization and the quality of experience (QoE) of multiple users under bandwidth constraints.
The invention adopts the following technical scheme to solve the technical problem:
The invention relates to a panoramic video adaptive transmission method based on limited user view-angle feedback, which is applied to a network environment consisting of a panoramic video server and N clients. The panoramic video server predicts the view angles of all users; the panoramic video is transmitted between the panoramic video server and the clients over a downlink, and the downlink contains a feedback channel from the clients to the panoramic video server. The panoramic video adaptive transmission method comprises the following steps:
Step one, convert the panoramic video from the ERP format into the cube-map projection format, perform saliency detection on the converted panoramic video to obtain saliency heat maps, convert the saliency heat maps into the rectangular projection format, and record them as {S_1, S_2, …, S_t, …, S_tmax}; S_t denotes the saliency heat map at time t; tmax denotes the duration of the panoramic video; t ∈ [1, tmax];
Step two, according to the historical view-angle information fed back by the N users over the feedback channel, generate a small number r of user view maps by Gaussian distribution, recorded as {V_t^1, V_t^2, …, V_t^r}; V_t^r denotes the view map fed back by the r-th user, 1 ≤ r ≤ R ≤ N;
Step three, use the region-covariance method to process the video saliency heat map S_{t+t0} at time t+t0 and the r user heat maps obtained in step two, so as to obtain the predicted total user view at time t+t0, recorded as V̂_{t+t0}; t0 denotes the prediction interval, t+t0 ∈ [1, tmax];
Step four, according to the predicted total user view V̂_{t+t0}, establish an objective function in which the maximum of the sum of the quality of experience (QoE) of the N clients is the total utility value, and set the corresponding constraint conditions, thereby establishing a panoramic video adaptive transmission model;
Step five, solve the panoramic video adaptive transmission model using the KKT conditions and a mixed branch-and-bound method, so as to obtain the downlink transmission decision variables in the network environment;
Step six, the panoramic video server partitions the complete panoramic video in time into K groups of frames (GoF), and partitions each GoF in space into M video blocks, recorded as {M_{1,1}, M_{1,2}, …, M_{k,m}, …, M_{K,M}}; M_{k,m} denotes the m-th video block of any k-th GoF, 1 ≤ k ≤ K, 1 ≤ m ≤ M;
The panoramic video server provides D code-rate choices for encoding the m-th video block M_{k,m} of the k-th GoF, so as to obtain encoded video blocks at D different code-rate levels, recorded as {M_{k,m}^1, …, M_{k,m}^d, …, M_{k,m}^D}; M_{k,m}^d denotes the compressed video block at the d-th code-rate level obtained by encoding the m-th video block M_{k,m} of the k-th GoF, 1 ≤ d ≤ D;
Step seven, according to the predicted total user view V̂_{t+t0} and the decision variable χ_{k,m,d} for downlink transmission of the m-th video block M_{k,m} of the k-th GoF at the d-th code-rate level, the panoramic video server selects, for any n-th client, the d-th code-rate-level version M_{k,m}^d of the m-th video block M_{k,m} of the k-th GoF and transmits it to the n-th client over the downlink, so that the n-th client receives the compressed video blocks of the M video blocks at their selected code-rate levels;
Step eight, the n-th client decodes, maps, and renders the received compressed video blocks of the M video blocks at their corresponding code-rate levels, thereby synthesizing the QoE-optimized panoramic video.
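To make the block structure of steps six and seven concrete, the following is a minimal sketch; it assumes the panorama is a sequence of equirectangular frames held in a NumPy array, and the GoF length, tile grid, and rate ladder are illustrative values only, not parameters taken from the patent.

```python
import numpy as np

def split_into_gofs_and_tiles(frames, gof_len=30, tile_rows=4, tile_cols=6):
    """Partition a panoramic frame sequence into K GoFs and M spatial tiles per GoF.

    frames: array of shape (T, H, W, 3) holding equirectangular frames.
    Returns (tiles, K, M), where tiles maps (k, m) -> array of shape (gof_len, h, w, 3).
    """
    T, H, W, _ = frames.shape
    K = T // gof_len
    h, w = H // tile_rows, W // tile_cols
    tiles = {}
    for k in range(K):
        gof = frames[k * gof_len:(k + 1) * gof_len]
        for m, (i, j) in enumerate((i, j) for i in range(tile_rows)
                                   for j in range(tile_cols)):
            tiles[(k, m)] = gof[:, i * h:(i + 1) * h, j * w:(j + 1) * w]
    return tiles, K, tile_rows * tile_cols

theta = np.array([0.5, 1.0, 2.0, 4.0])              # hypothetical rate ladder (Mbit/s), D = 4
frames = np.zeros((60, 192, 384, 3), np.uint8)      # toy stand-in for decoded panoramic frames
tiles, K, M = split_into_gofs_and_tiles(frames)
chi = np.zeros((K, M, len(theta)), dtype=int)       # decision variable chi_{k,m,d} in {0, 1}
chi[:, :, 0] = 1                                    # placeholder: lowest level for every tile
```

In the notation above, tiles[(k, m)] plays the role of M_{k,m} and chi[k, m, d] the role of χ_{k,m,d}; the actual values of chi come from the optimization of steps four and five.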
The panoramic video adaptive transmission method is also characterized in that the third step is carried out as follows:
Step 3.1, compute a plurality of feature values for each pixel point I(x, y) of the video saliency heat map S_{t+t0} at time t+t0;
Randomly select some of the feature values and combine them, so as to construct the feature vector φ(I(x, y), x, y) using formula (1);
Step 3.2, divide the video saliency heat map S_{t+t0} into pixel blocks of Γ × Ψ pixels, recorded as {P_1^S, …, P_i^S, …}; P_i^S denotes the i-th pixel block of the saliency heat map S_{t+t0}, each pixel block containing the same number of pixel points, 1 ≤ i ≤ Γ × Ψ;
Construct, using formula (2), the covariance matrix C_i^S of the i-th pixel block P_i^S;
In formula (2), μ_i^S denotes the mean of the i-th pixel block P_i^S of the video saliency heat map S_{t+t0}, and is given by formula (3);
Step 3.3, compute all pixel blocks of the r user heat maps in the same way as step 3.2, so as to obtain the covariance matrix C_i^r and the mean μ_i^r of the i-th pixel block P_i^r in each of the r user heat maps, as given by formulas (4) and (5); C_i^r denotes the covariance matrix of the i-th pixel block P_i^r in the r-th view, and μ_i^r denotes the mean of the i-th pixel block P_i^r in the r-th view;
Construct, using formula (6), the similarity ρ_i between the i-th pixel block P_i^S of the video saliency heat map S_{t+t0} and the i-th pixel block P_i^r of the r user heat maps;
Step 3.4, normalize the similarity ρ_i to ρ̄_i, and construct the predicted total user view V̂_{t+t0} at time t+t0 using formula (7).
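Formulas (1)-(7) appear only as images in this text, so their exact form is not reproduced. As a hedged illustration of the region-covariance idea in steps 3.1-3.4, the sketch below builds per-pixel feature vectors, computes a covariance descriptor per block, and fuses the saliency map with the user heat maps through a covariance-distance weight; the feature set, the Foerstner-style distance, and the exponential weighting are common stand-ins and are assumptions, not the patent's formulas.

```python
import numpy as np
from scipy.linalg import eigh

def block_features(block):
    """Per-pixel feature vectors for one block: (intensity, x, y, |dI/dx|, |dI/dy|)."""
    h, w = block.shape
    ys, xs = np.mgrid[0:h, 0:w]
    dy, dx = np.gradient(block.astype(np.float64))
    return np.stack([block.ravel(), xs.ravel(), ys.ravel(),
                     np.abs(dx).ravel(), np.abs(dy).ravel()], axis=1)

def region_covariance(block, eps=1e-6):
    """Covariance descriptor of a block (stand-in for formulas (2)-(5))."""
    c = np.cov(block_features(block), rowvar=False)
    return c + eps * np.eye(c.shape[0])          # regularize for numerical stability

def covariance_distance(c1, c2):
    """Foerstner-style distance from generalized eigenvalues; smaller means more similar."""
    lam = eigh(c1, c2, eigvals_only=True)
    return np.sqrt(np.sum(np.log(lam) ** 2))

def fuse_prediction(saliency_map, user_maps, block=32):
    """Weight saliency blocks by their similarity to the user heat-map blocks (stand-in for (6)-(7))."""
    H, W = saliency_map.shape
    pred = np.zeros((H, W))
    for i in range(0, H, block):
        for j in range(0, W, block):
            s_blk = saliency_map[i:i + block, j:j + block].astype(np.float64)
            c_s = region_covariance(s_blk)
            d = np.mean([covariance_distance(c_s,
                         region_covariance(u[i:i + block, j:j + block].astype(np.float64)))
                         for u in user_maps])
            pred[i:i + block, j:j + block] = s_blk * np.exp(-d)   # similarity weight in (0, 1]
    return pred / (pred.max() + 1e-12)                            # normalized predicted view map

# Toy usage: saliency_map plays the role of S_{t+t0}, user_maps the r user heat maps,
# and the returned map plays the role of the predicted total user view.
pred = fuse_prediction(np.random.rand(192, 384), [np.random.rand(192, 384) for _ in range(3)])
```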
The fourth step is carried out as follows:
Step 4.1, obtain the quality of experience QoE_n of the n-th client using formula (8);
In formula (8), θ_{k,m,d} denotes the code rate of video block m of the k-th GoF at quality level d; θ_{k,m,D} denotes the code rate of video block m of the k-th GoF when it is transmitted at the highest quality level D; FoV_n denotes the set of video blocks covered by the field of view (FoV) of the n-th client; when χ_{k,m,d} = 1, the m-th video block of the k-th GoF is transmitted to the client over the downlink at the d-th code-rate level; when χ_{k,m,d} = 0, the m-th video block of the k-th GoF is not transmitted to the client over the downlink at the d-th code-rate level;
The objective function is constructed using formula (9); formula (9) represents the total utility value;
Step 4.2, construct the constraint conditions using formulas (10) and (11);
Formula (10) indicates that when the m-th video block of any k-th GoF is transmitted to the client over the downlink, the transmitted video block can select only one code-rate level;
Formula (11) indicates that the total code rate of all transmitted video blocks does not exceed the bit rate that the bandwidth of the whole downlink channel can provide; B_k denotes the bandwidth of the downlink channel for the k-th GoF.
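Formulas (8)-(11) appear only as images in this text. The LaTeX restatement below is an assumption-based reconstruction from the surrounding prose (a normalized-rate QoE summed over the blocks in each client's predicted FoV); it is offered as a reading aid, not as a verbatim copy of the patent's formulas.

```latex
% Hedged reconstruction of the transmission model of step four.
% QoE of client n (formula (8), assumed normalized-rate form):
\mathrm{QoE}_n = \sum_{k=1}^{K} \sum_{m \in \mathrm{FoV}_n} \sum_{d=1}^{D}
    \chi_{k,m,d}\,\frac{\theta_{k,m,d}}{\theta_{k,m,D}}

% Objective (formula (9)): maximize the total utility value.
\max_{\chi_{k,m,d}\in\{0,1\}} \; \sum_{n=1}^{N} \mathrm{QoE}_n

% Constraint (10): at most one code-rate level per transmitted block.
\sum_{d=1}^{D} \chi_{k,m,d} \le 1, \qquad \forall\, k, m

% Constraint (11): total transmitted rate within the downlink bandwidth of each GoF.
\sum_{m=1}^{M} \sum_{d=1}^{D} \chi_{k,m,d}\,\theta_{k,m,d} \le B_k, \qquad \forall\, k
```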
The fifth step is carried out as follows:
Step 5.1, relax the transmission decision variable χ_{k,m,d} in the panoramic video adaptive transmission model to obtain continuous transmission decision variables in the range [0, 1];
Step 5.2, according to the constraints of formulas (10)-(11), denote the constraint of formula (10) as the function h(χ_{k,m,d}) and the constraint of formula (11) as the function g(χ_{k,m,d}); then compute, by formula (12), the Lagrangian function L(χ_{k,m,d}, λ, ω) of the relaxed panoramic video adaptive transmission model;
In formula (12), λ and ω are the Lagrange coefficients introduced for the constraints of formulas (10) and (11): λ is the coefficient of the function h(χ_{k,m,d}), and ω is the coefficient of the function g(χ_{k,m,d}).
Step 5.3, from the Lagrangian function L(χ_{k,m,d}, λ, ω) of formula (12), obtain the KKT conditions of the relaxed panoramic video adaptive transmission model as shown in formulas (13)-(18), where formula (13) is the stationarity condition on L(χ_{k,m,d}, λ, ω):
g(χ_{k,m,d}) ≤ 0 (14)
h(χ_{k,m,d}) = 0 (15)
λ ≠ 0 (16)
ω ≥ 0 (17)
ω·g(χ_{k,m,d}) = 0 (18)
Solve formulas (13)-(18) to obtain the optimal solution χ_relax of the relaxed panoramic video adaptive transmission model and the optimal total utility value G_relax; the optimal solution χ_relax is the relaxed optimal solution of the transmission decision variable χ_{k,m,d};
Step 5.4, take the optimal solution χ_relax and the optimal total utility value G_relax as the initial input parameters of the mixed branch-and-bound method;
Step 5.5, define the branching count in the mixed branch-and-bound method as z, the lower bound of the optimal total utility value as L, and the upper bound of the optimal total utility value as U;
Step 5.6, initialize z = 0;
Step 5.7, initialize L = 0;
Step 5.8, initialize U = G_relax;
Step 5.9, let χ_z denote the optimal solution of the z-th branch and record the corresponding optimal total utility value as G_z; then assign χ_relax to χ_z and take the optimal solution χ_z of the z-th branch as the root node;
Step 5.10, judge whether χ_z contains solutions that do not satisfy the 0-1 constraint; if so, divide the relaxed optimal solution in χ_z into the solutions that satisfy the 0-1 constraint and a solution χ_z(0,1) that does not satisfy the 0-1 constraint, and execute step 5.12; otherwise, χ_z is the optimal solution of the non-relaxed panoramic video adaptive transmission model;
Step 5.11, randomly generate a random number ε_z of the z-th branch in the range (0, 1) and judge whether 0 < χ_z(0,1) < ε_z holds; if so, add the constraint "χ_z(0,1) = 0" to the non-relaxed panoramic video adaptive transmission model to form sub-branch I of the z-th branch; otherwise, add the constraint "χ_z(0,1) = 1" to the non-relaxed panoramic video adaptive transmission model to form sub-branch II of the z-th branch;
Step 5.12, solve the relaxed solutions of sub-branch I and sub-branch II of the z-th branch using the KKT conditions, and take them as the optimal solution χ_{z+1} and the optimal total utility value G_{z+1} of the (z+1)-th branch, where χ_{z+1} comprises the relaxed solutions of sub-branch I and sub-branch II;
Step 5.13, judge whether the optimal solution χ_{z+1} of the (z+1)-th branch satisfies the 0-1 constraint; if so, find the maximum among the optimal total utility values G_{z+1}, assign it to L, and χ_{z+1} ∈ {0, 1}; otherwise, find the maximum among the optimal total utility values G_{z+1}, assign it to U, and χ_{z+1} ∈ (0, 1);
Step 5.14, judge whether G_{z+1} < L holds; if so, prune the branch in which the optimal solution χ_{z+1} of the (z+1)-th branch lies, assign z+1 to z, and return to step 5.10; otherwise, execute step 5.15;
Step 5.15, judge whether G_{z+1} > L holds; if so, assign z+1 to z and return to step 5.10; otherwise, execute step 5.16;
Step 5.16, judge whether G_{z+1} = L holds; if so, the optimal solution χ_{z+1} of the (z+1)-th branch is the optimal solution of the non-relaxed panoramic video adaptive transmission model: assign χ_{z+1} to the optimal solution χ_{0-1} of the non-relaxed panoramic video adaptive transmission model, and assign the corresponding G_{z+1} to the optimal total utility value G_{0-1} of the non-relaxed panoramic video adaptive transmission model; otherwise, assign z+1 to z and return to step 5.10.
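For intuition only, the following is a minimal sketch of solving the relaxed model and branching on fractional variables. It replaces the patent's KKT-based relaxation with an off-the-shelf LP relaxation (scipy.optimize.linprog) and uses plain recursive branching without the L/U bound bookkeeping of steps 5.5-5.16; the tile count, rate ladder, FoV sets, and bandwidth budget are made-up toy values.

```python
import numpy as np
from scipy.optimize import linprog

# Toy instance for one GoF: M tiles, D rate levels, two clients' FoV tile sets.
M, D = 6, 3
theta = np.tile([0.5, 1.0, 2.0], (M, 1))          # theta[m, d]: rate of tile m at level d
fov = [{0, 1, 2}, {2, 3, 4}]                      # tiles inside each client's predicted FoV
B = 6.0                                           # downlink budget for this GoF (same units)

# Objective coefficients: each client covering tile m contributes theta[m, d] / theta[m, D-1].
w = np.zeros((M, D))
for n_fov in fov:
    for m in n_fov:
        w[m] += theta[m] / theta[m, D - 1]
c = -w.ravel()                                    # linprog minimizes, so negate

# Constraints: at most one level per tile, and total rate <= B.
A_ub = np.zeros((M + 1, M * D))
for m in range(M):
    A_ub[m, m * D:(m + 1) * D] = 1.0
A_ub[M] = theta.ravel()
b_ub = np.append(np.ones(M), B)

def branch_and_bound(fixed):
    """LP relaxation plus branching on one fractional chi; 'fixed' maps var index -> 0/1."""
    bounds = [(fixed.get(i, 0.0), fixed.get(i, 1.0)) for i in range(M * D)]
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    if not res.success:
        return None, -np.inf
    x, val = res.x, -res.fun
    frac = [i for i in range(M * D) if 1e-6 < x[i] < 1 - 1e-6]
    if not frac:
        return np.round(x), val                   # already integral: candidate solution
    best_x, best_val = None, -np.inf
    for v in (0, 1):                              # two sub-branches: chi_i = 0 and chi_i = 1
        bx, bv = branch_and_bound({**fixed, frac[0]: float(v)})
        if bv > best_val:
            best_x, best_val = bx, bv
    return best_x, best_val

chi, utility = branch_and_bound({})
print("total utility:", round(utility, 3))
print("chosen (tile, level):", [(i // D, i % D) for i in np.flatnonzero(chi)])
```

Each decision-variable index i encodes a (tile m, level d) pair; fixing a fractional variable to 0 and then to 1 mirrors the two sub-branches of step 5.11.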
Compared with the prior art, the invention has the beneficial effects that:
1. Through its panoramic-video FoV prediction method, the invention predicts the FoV of all users from extracted video saliency features and a small amount of user view-angle feedback, and optimizes the QoE of multiple users during panoramic video transmission, thereby improving the total user QoE in the transmission process.
2. The invention combines adaptive streaming of panoramic video with the QoE of multiple users, and proposes a method that uses the total system QoE as a guiding factor for optimizing panoramic video streaming, thereby better guiding and optimizing the panoramic video streaming process.
3. The method solves the proposed panoramic video adaptive transmission model with the KKT conditions and a mixed branch-and-bound method, which improves the efficiency and accuracy of the solution and thus the overall efficiency of the panoramic video view-angle prediction and transmission method.
Drawings
Fig. 1 is an application scene diagram of a streaming media adaptive transmission method of panoramic video proposed in the present invention;
fig. 2 is a system structure diagram of the adaptive transmission method proposed in the present invention.
Detailed Description
In this embodiment, an adaptive transmission method based on limited user view-angle feedback is applied to a multi-user network scenario. As shown in fig. 1, the network contains a panoramic video server and N clients. The panoramic video is transmitted from the panoramic video server to the clients over a downlink; the downlink contains a feedback channel from the clients to the panoramic video server. The feedback channel feeds back the users' real-time view-angle information and the downlink bandwidth information to the server to assist the server in the transmission work. As shown in fig. 2, the method specifically includes the following steps:
Step 1, convert the panoramic video from the ERP format into the cube-map projection format, perform saliency detection on the converted panoramic video to obtain saliency heat maps, convert the saliency heat maps into the rectangular projection format, and record them as {S_1, S_2, …, S_t, …, S_tmax}; S_t denotes the saliency heat map at time t; tmax denotes the duration of the panoramic video; t ∈ [1, tmax];
Step 2, according to the view-angle information fed back by the N users over the feedback channel, generate a small number r of user view maps by Gaussian distribution, recorded as {V_t^1, V_t^2, …, V_t^r}; V_t^r denotes the view map fed back by the r-th user, 1 ≤ r ≤ R ≤ N;
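The text does not specify which saliency detector is used in step 1 or how the Gaussian generation of views in step 2 is parameterized. The sketch below is a hedged stand-in: a spectral-residual saliency map computed in NumPy for one grayscale frame, plus Gaussian sampling of representative (yaw, pitch) view centers fitted to the feedback history; the detector choice, the (yaw, pitch) representation, and the wrapping/clamping rules are assumptions, not the patent's method.

```python
import numpy as np

def spectral_residual_saliency(gray, blur=3):
    """Spectral-residual saliency map for one grayscale frame (values scaled to [0, 1])."""
    f = np.fft.fft2(gray)
    log_amp = np.log(np.abs(f) + 1e-8)
    phase = np.angle(f)
    pad = blur // 2                                   # box-filter the log-amplitude spectrum
    padded = np.pad(log_amp, pad, mode="edge")
    avg = np.zeros_like(log_amp)
    for i in range(blur):
        for j in range(blur):
            avg += padded[i:i + log_amp.shape[0], j:j + log_amp.shape[1]]
    avg /= blur * blur
    sal = np.abs(np.fft.ifft2(np.exp((log_amp - avg) + 1j * phase))) ** 2
    return (sal - sal.min()) / (sal.max() - sal.min() + 1e-12)

def sample_user_views(feedback_deg, r=4, seed=0):
    """Draw r representative (yaw, pitch) view centers from a Gaussian fitted to the feedback."""
    rng = np.random.default_rng(seed)
    feedback = np.asarray(feedback_deg, dtype=float)         # shape (num_feedback, 2)
    mean = feedback.mean(axis=0)
    cov = np.cov(feedback, rowvar=False) + 1e-3 * np.eye(2)  # regularized covariance
    views = rng.multivariate_normal(mean, cov, size=r)
    views[:, 0] = (views[:, 0] + 180.0) % 360.0 - 180.0      # wrap yaw into [-180, 180)
    views[:, 1] = np.clip(views[:, 1], -90.0, 90.0)          # clamp pitch
    return views

frame = np.random.rand(192, 384)                  # toy stand-in for one panoramic frame
heatmap = spectral_residual_saliency(frame)       # plays the role of S_t
history = [(10.0, 5.0), (14.0, 2.0), (8.0, 4.0), (12.0, 6.0)]
print(sample_user_views(history, r=3))            # three generated view centers
```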
Step 3, use the region-covariance method to process the video saliency heat map S_{t+t0} at time t+t0 and the r user heat maps obtained in step 2, so as to obtain the predicted total user view at time t+t0, recorded as V̂_{t+t0}; t0 denotes the prediction interval, t+t0 ∈ [1, tmax];
Step 3.1, compute a plurality of feature values for each pixel point I(x, y) of the video saliency heat map S_{t+t0} at time t+t0;
Randomly select some of the feature values and combine them, so as to construct the feature vector φ(I(x, y), x, y) using formula (1);
Step 3.2, divide the video saliency heat map S_{t+t0} into pixel blocks of Γ × Ψ pixels, recorded as {P_1^S, …, P_i^S, …}; P_i^S denotes the i-th pixel block of the saliency heat map S_{t+t0}, each pixel block containing the same number of pixel points, 1 ≤ i ≤ Γ × Ψ;
Construct, using formula (2), the covariance matrix C_i^S of the i-th pixel block P_i^S;
In formula (2), μ_i^S denotes the mean of the i-th pixel block P_i^S of the video saliency heat map S_{t+t0}, and is given by formula (3);
Step 3.3, compute all pixel blocks of the r user heat maps in the same way as step 3.2, so as to obtain the covariance matrix C_i^r and the mean μ_i^r of the i-th pixel block P_i^r in each of the r user heat maps; C_i^r denotes the covariance matrix of the i-th pixel block P_i^r in the r-th view, and μ_i^r denotes the mean of the i-th pixel block P_i^r in the r-th view;
Construct, using formula (4), the similarity ρ_i between the i-th pixel block P_i^S of the video saliency heat map S_{t+t0} and the i-th pixel block P_i^r of the r user heat maps;
Step 3.4, normalize the similarity ρ_i to ρ̄_i, and construct the predicted total user view V̂_{t+t0} at time t+t0 using formula (5);
Step 4, according to the predicted view angle of the whole user
V̂_{t+t0}, establish an objective function in which the maximum of the sum of the quality of experience (QoE) of the N clients is the total utility value, and set the corresponding constraint conditions, thereby establishing a panoramic video adaptive transmission model;
Step 4.1, obtain the quality of experience QoE_n of the n-th client using formula (5);
In formula (5), θ_{k,m,d} denotes the code rate of video block m of the k-th GoF at quality level d; θ_{k,m,D} denotes the code rate of video block m of the k-th GoF when it is transmitted at the highest quality level D; FoV_n denotes the set of video blocks covered by the field of view (FoV) of the n-th client; when χ_{k,m,d} = 1, the m-th video block of the k-th GoF is transmitted to the client over the downlink at the d-th code-rate level; when χ_{k,m,d} = 0, the m-th video block of the k-th GoF is not transmitted to the client over the downlink at the d-th code-rate level;
The objective function is constructed using formula (6); formula (6) represents the total utility value;
Step 4.2, construct the constraint conditions using formulas (7) and (8);
Formula (7) indicates that when the m-th video block of any k-th GoF is transmitted to the client over the downlink, the transmitted video block can select only one code-rate level;
Formula (8) indicates that the total code rate of all transmitted video blocks does not exceed the bit rate that the bandwidth of the whole downlink channel can provide; B_k denotes the bandwidth of the downlink channel for the k-th GoF.
Step 5, solve the panoramic video adaptive transmission model using the KKT conditions and a mixed branch-and-bound method, so as to obtain the downlink transmission decision variables in the network environment;
Step 5.1, relax the transmission decision variable χ_{k,m,d} in the panoramic video adaptive transmission model to obtain continuous transmission decision variables in the range [0, 1];
Step 5.2, according to the constraints of formulas (7)-(8), denote the constraint of formula (7) as the function h(χ_{k,m,d}) and the constraint of formula (8) as the function g(χ_{k,m,d}); then compute, by formula (10), the Lagrangian function L(χ_{k,m,d}, λ, ω) of the relaxed panoramic video adaptive transmission model;
In formula (10), λ and ω are the Lagrange coefficients introduced for the constraints of formulas (7) and (8): λ is the coefficient of the function h(χ_{k,m,d}), and ω is the coefficient of the function g(χ_{k,m,d}).
Step 5.3, from the Lagrangian function L(χ_{k,m,d}, λ, ω) of formula (10), obtain the KKT conditions of the relaxed panoramic video adaptive transmission model as shown in formulas (11)-(16), where formula (11) is the stationarity condition on L(χ_{k,m,d}, λ, ω):
g(χ_{k,m,d}) ≤ 0 (12)
h(χ_{k,m,d}) = 0 (13)
λ ≠ 0 (14)
ω ≥ 0 (15)
ω·g(χ_{k,m,d}) = 0 (16)
Formula (11) is the necessary condition for the Lagrangian function L(χ_{k,m,d}, λ, ω) to attain an extremum; formulas (12) and (13) represent the constraints expressed by the functions g(χ_{k,m,d}) and h(χ_{k,m,d}); formulas (14) and (15) represent the constraints on the Lagrange coefficients λ and ω; formula (16) represents the complementary slackness condition.
Solve formulas (11)-(16) to obtain the optimal solution χ_relax of the relaxed panoramic video adaptive transmission model and the optimal total utility value G_relax; the optimal solution χ_relax is the relaxed optimal solution of the transmission decision variable χ_{k,m,d};
Step 5.4, take the optimal solution χ_relax and the optimal total utility value G_relax as the initial input parameters of the mixed branch-and-bound method;
Step 5.5, define the branching count in the mixed branch-and-bound method as z, the lower bound of the optimal total utility value as L, and the upper bound of the optimal total utility value as U;
Step 5.6, initialize z = 0;
Step 5.7, initialize L = 0;
Step 5.8, initialize U = G_relax;
Step 5.9, let χ_z denote the optimal solution of the z-th branch and record the corresponding optimal total utility value as G_z; then assign χ_relax to χ_z and take the optimal solution χ_z of the z-th branch as the root node;
Step 5.10, judge whether χ_z contains solutions that do not satisfy the 0-1 constraint; if so, divide the relaxed optimal solution in χ_z into the solutions that satisfy the 0-1 constraint and a solution χ_z(0,1) that does not satisfy the 0-1 constraint, and execute step 5.12; otherwise, χ_z is the optimal solution of the non-relaxed panoramic video adaptive transmission model;
Step 5.11, randomly generate a random number ε_z of the z-th branch in the range (0, 1) and judge whether 0 < χ_z(0,1) < ε_z holds; if so, add the constraint "χ_z(0,1) = 0" to the non-relaxed panoramic video adaptive transmission model to form sub-branch I of the z-th branch; otherwise, add the constraint "χ_z(0,1) = 1" to the non-relaxed panoramic video adaptive transmission model to form sub-branch II of the z-th branch;
Step 5.12, solve the relaxed solutions of sub-branch I and sub-branch II of the z-th branch using the KKT conditions, and take them as the optimal solution χ_{z+1} and the optimal total utility value G_{z+1} of the (z+1)-th branch, where χ_{z+1} comprises the relaxed solutions of sub-branch I and sub-branch II;
Step 5.13, judge whether the optimal solution χ_{z+1} of the (z+1)-th branch satisfies the 0-1 constraint; if so, find the maximum among the optimal total utility values G_{z+1}, assign it to L, and χ_{z+1} ∈ {0, 1}; otherwise, find the maximum among the optimal total utility values G_{z+1}, assign it to U, and χ_{z+1} ∈ (0, 1);
Step 5.14, judge whether G_{z+1} < L holds; if so, prune the branch in which the optimal solution χ_{z+1} of the (z+1)-th branch lies, assign z+1 to z, and return to step 5.10; otherwise, execute step 5.15;
Step 5.15, judge whether G_{z+1} > L holds; if so, assign z+1 to z and return to step 5.10; otherwise, execute step 5.16;
Step 5.16, judge whether G_{z+1} = L holds; if so, the optimal solution χ_{z+1} of the (z+1)-th branch is the optimal solution of the non-relaxed panoramic video adaptive transmission model: assign χ_{z+1} to the optimal solution χ_{0-1} of the non-relaxed panoramic video adaptive transmission model, and assign the corresponding G_{z+1} to the optimal total utility value G_{0-1} of the non-relaxed panoramic video adaptive transmission model; otherwise, assign z+1 to z and return to step 5.10.
And 6, the panoramic video server performs block processing on the complete panoramic video in time to obtain K frame groups GoF, and performs block processing on each frame group GoF in space to obtain M video blocks which are marked as { M1,1,M1,2,...,Mk,m,...,MK,M},Mk,mRepresenting the mth video block of any kth GoF, wherein K is more than or equal to 1 and less than or equal to K; m is more than or equal to 1 and less than or equal to M;
the panoramic video server is the t video block M of the k frame group GoFk,mD code rate selections are provided for coding processing, so that coded video blocks with D different code rate levels are obtained and recorded as
Figure BDA0003105222030000131
Figure BDA0003105222030000132
T-th video block M representing k-th frame group GoFk,mD is more than or equal to 1 and less than or equal to D of the compressed video block with the D-th code rate grade obtained after the coding processing;
step 7, the panoramic video server predicts the view angles of all users according to the predicted view angles
Figure BDA0003105222030000133
And the t video block M of the k frame group GoF of the downlink transmissionk,mD type code rate grade decision variable xk,t,dIs selected for any nth client, the tth video block M of the kth frame group GoF is selectedk,mD code rate level of
Figure BDA0003105222030000134
And transmitting to the nth client through a downlink; so that the nth client receives the compressed video block with the d code rate grade of the M video blocks;
and 8, the nth client decodes, maps and renders the compressed video blocks of the M received video blocks at the corresponding code rate levels, so as to synthesize the panoramic video with the optimized QoE.
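A minimal sketch of the client-side assembly in step 8, under the assumption that each received tile arrives as an already-decoded pixel array tagged with its tile index; actual decoding, projection mapping, and rendering are outside the scope of the sketch.

```python
import numpy as np

def assemble_panorama(received, tile_rows=4, tile_cols=6):
    """Stitch the decoded tiles of one frame back into a full equirectangular frame.

    received: dict mapping tile index m -> decoded array of shape (h, w, 3);
    tiles that were not transmitted to this client are filled with a gray placeholder.
    """
    h, w, _ = next(iter(received.values())).shape
    frame = np.full((tile_rows * h, tile_cols * w, 3), 128, dtype=np.uint8)
    for m, tile in received.items():
        i, j = divmod(m, tile_cols)
        frame[i * h:(i + 1) * h, j * w:(j + 1) * w] = tile
    return frame

tiles = {m: np.zeros((48, 64, 3), np.uint8) for m in range(6)}  # toy: only 6 tiles received
pano = assemble_panorama(tiles)
print(pano.shape)  # (192, 384, 3)
```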

Claims (3)

1. A panoramic video adaptive transmission method based on limited user view-angle feedback, characterized in that it is applied to a network environment consisting of a panoramic video server and N clients; the panoramic video server is used for predicting the view angles of all users; the panoramic video is transmitted between the panoramic video server and the clients over a downlink; the downlink contains a feedback channel from the clients to the panoramic video server; the panoramic video adaptive transmission method comprises the following steps:
Step one, convert the panoramic video from the ERP format into the cube-map projection format, perform saliency detection on the converted panoramic video to obtain saliency heat maps, convert the saliency heat maps into the rectangular projection format, and record them as {S_1, S_2, …, S_t, …, S_tmax}; S_t denotes the saliency heat map at time t; tmax denotes the duration of the panoramic video; t ∈ [1, tmax];
Step two, according to the historical view-angle information fed back by the N users over the feedback channel, generate a small number r of user view maps by Gaussian distribution, recorded as {V_t^1, V_t^2, …, V_t^r}; V_t^r denotes the view map fed back by the r-th user, 1 ≤ r ≤ R ≤ N;
Step three, use the region-covariance method to process the video saliency heat map S_{t+t0} at time t+t0 and the r user view maps, so as to obtain the predicted total user view at time t+t0, recorded as V̂_{t+t0}; t0 denotes the prediction interval, t+t0 ∈ [1, tmax];
Step 3.1, compute a plurality of feature values for each pixel point I(x, y) of the video saliency heat map S_{t+t0} at time t+t0;
Randomly select some of the feature values and combine them, so as to construct the feature vector φ(I(x, y), x, y) using formula (1);
Step 3.2, divide the video saliency heat map S_{t+t0} into pixel blocks of Γ × Ψ pixels, recorded as {P_1^S, …, P_i^S, …}; P_i^S denotes the i-th pixel block of the saliency heat map S_{t+t0}, each pixel block containing the same number of pixel points, 1 ≤ i ≤ Γ × Ψ;
Construct, using formula (2), the covariance matrix C_i^S of the i-th pixel block P_i^S;
In formula (2), μ_i^S denotes the mean of the i-th pixel block P_i^S of the video saliency heat map S_{t+t0}, and is given by formula (3);
Step 3.3, compute all pixel blocks of the r user heat maps in the same way as step 3.2, so as to obtain the covariance matrix C_i^r and the mean μ_i^r of the i-th pixel block P_i^r in each of the r user heat maps, as given by formulas (4) and (5); C_i^r denotes the covariance matrix of the i-th pixel block P_i^r in the r-th view, and μ_i^r denotes the mean of the i-th pixel block P_i^r in the r-th view;
Construct, using formula (6), the similarity ρ_i between the i-th pixel block P_i^S of the video saliency heat map S_{t+t0} and the i-th pixel block P_i^r of the r user heat maps;
Step 3.4, normalize the similarity ρ_i to ρ̄_i, and construct the predicted total user view V̂_{t+t0} at time t+t0 using formula (7);
Step four, according to the predicted total user view V̂_{t+t0}, establish an objective function in which the maximum of the sum of the quality of experience (QoE) of the N clients is the total utility value, and set the corresponding constraint conditions, thereby establishing a panoramic video adaptive transmission model;
Step five, solve the panoramic video adaptive transmission model using the KKT conditions and a mixed branch-and-bound method, so as to obtain the downlink transmission decision variables in the network environment;
Step six, the panoramic video server partitions the complete panoramic video in time into K groups of frames (GoF), and partitions each GoF in space into M video blocks, recorded as {M_{1,1}, M_{1,2}, …, M_{k,m}, …, M_{K,M}}; M_{k,m} denotes the m-th video block of any k-th GoF, 1 ≤ k ≤ K, 1 ≤ m ≤ M;
The panoramic video server provides D code-rate choices for encoding the m-th video block M_{k,m} of the k-th GoF, so as to obtain encoded video blocks at D different code-rate levels, recorded as {M_{k,m}^1, …, M_{k,m}^d, …, M_{k,m}^D}; M_{k,m}^d denotes the compressed video block at the d-th code-rate level obtained by encoding the m-th video block M_{k,m} of the k-th GoF, 1 ≤ d ≤ D;
Step seven, according to the predicted total user view V̂_{t+t0} and the decision variable χ_{k,m,d} for downlink transmission of the m-th video block M_{k,m} of the k-th GoF at the d-th code-rate level, the panoramic video server selects, for any n-th client, the d-th code-rate-level version M_{k,m}^d of the m-th video block M_{k,m} of the k-th GoF and transmits it to the n-th client over the downlink, so that the n-th client receives the compressed video blocks of the M video blocks at their selected code-rate levels;
Step eight, the n-th client decodes, maps, and renders the received compressed video blocks of the M video blocks at their corresponding code-rate levels, thereby synthesizing the QoE-optimized panoramic video.
2. The panoramic video adaptive transmission method according to claim 1, wherein step four is carried out as follows:
Step 4.1, obtain the quality of experience QoE_n of the n-th client using formula (8);
In formula (8), θ_{k,m,d} denotes the code rate of video block m of the k-th GoF at quality level d; θ_{k,m,D} denotes the code rate of video block m of the k-th GoF when it is transmitted at the highest quality level D; FoV_n denotes the set of video blocks covered by the field of view (FoV) of the n-th client; when χ_{k,m,d} = 1, the m-th video block of the k-th GoF is transmitted to the client over the downlink at the d-th code-rate level; when χ_{k,m,d} = 0, the m-th video block of the k-th GoF is not transmitted to the client over the downlink at the d-th code-rate level;
The objective function is constructed using formula (9); formula (9) represents the total utility value;
Step 4.2, construct the constraint conditions using formulas (10) and (11);
Formula (10) indicates that when the m-th video block of any k-th GoF is transmitted to the client over the downlink, the transmitted video block can select only one code-rate level;
Formula (11) indicates that the total code rate of all transmitted video blocks does not exceed the bit rate that the bandwidth of the whole downlink channel can provide; B_k denotes the bandwidth of the downlink channel for the k-th GoF.
3. The adaptive panoramic video transmission method according to claim 1, wherein the fifth step is performed as follows:
step 5.1, carrying out transmission decision variable χ in the panoramic video self-adaptive transmission modelk,m,dPerforming a relaxation operation to obtain [0,1 ]]Continuously transmitting decision variables within the range;
step 5.2, according to the constraint conditions of the formula (10) to the formula (11), the
Figure FDA0003493397270000043
Is recorded as a function h (χ)k,t,d) (ii) a Will be provided with
Figure FDA0003493397270000044
Is expressed as a function g (χ)k,m,d) (ii) a Thereby calculating the relaxation after the relaxation by the equation (12)Lagrange function L (χ) of panoramic video adaptive transmission modelk,m,d,λ,ω):
Figure FDA0003493397270000045
In equation (12), λ represents the lagrangian coefficient under the inequality constraint in equations (10) and (11), ω represents the lagrangian coefficient under the inequality constraint in equations (10) and (11), and λ represents the function h (χ)k,m,d) Represents the function g (χ) by ωk,m,d) Lagrange coefficients of (a);
step 5.3, Lagrangian function L (x) according to formula (10)k,m,dλ, ω), obtaining KKT condition of relaxed panoramic video adaptive transmission model as shown in equation (13) -equation (18):
Figure FDA0003493397270000046
g(χk,m,d)≤0 (14)
h(χk,m,d)=0 (15)
λ≠0 (16)
ω≥0 (17)
ωg(χk,m,d)=0 (18)
solving the formula (13) -formula (18) to obtain the optimal solution chi of the relaxed panoramic video self-adaptive transmission modelrelaxAnd an optimal total utility value Grelax(ii) a Wherein the optimal solution χrelaxIs a transmission decision variable χk,t,dThe relaxation optimal solution of (a);
step 5.4, according to the optimal solution chirelaxAnd an optimal total utility value GrelaxAs the initial input parameter of the mixed branch-and-bound method;
step 5.5, defining the branching times in the mixed branch-and-bound method as z, defining the lower bound of the optimal total utility value in the mixed branch-and-bound method as L, and defining the upper bound of the optimal total utility value in the mixed branch-and-bound method as U;
step 5.6, initializing z to be 0;
step 5.7, initializing that L is 0;
step 5.8, initialize U ═ Grelax
Step 5.9, use chizThe optimal solution of the z-th branch is represented, and the corresponding optimal total utility value is recorded as GzThen, will xrelaxIs given as XzAnd taking the optimal solution χ of the z-th branchzAs a root node;
step 5.10, judge χzIf there is a solution not meeting the constraint condition of 0-1, if there is a solution, the X iszThe optimal solution of the relaxation in (1) is divided into a solution satisfying the 0-1 constraint condition and a solution χ not satisfying the 0-1 constraint conditionz(0,1)And executing step 5.12; otherwise, the table is χzThe optimal solution of the non-relaxation panoramic video self-adaptive transmission model;
step 5.11, randomly generating a random number epsilon of the z-th branch in the range of (0,1)zAnd judging that 0 < χz(0,1)<εkWhether the result is true or not; if true, then the constraint "χ" is appliedz(0,1)Adding the obtained value to a non-relaxation panoramic video self-adaptive acquisition and transmission model to form a sub-branch I of the z-th branch; otherwise, the constraint of "χ" is appliedz(0,1)Adding the result 1 into a non-relaxation panoramic video self-adaptive transmission model to form a subbranch II of a z-th branch;
step 5.12, solving relaxation solutions of the sub-branch I and the sub-branch II of the z-th branch by using the KKT condition, and taking the relaxation solutions as the optimal solution chi to the z + 1-th branchz+1And an optimal total utility value Gz+1Therein xz+1The method comprises the following steps: (II) relaxation of subbranch I and subbranch II of the z +1 th branch;
step 5.13, judging the optimal solution chi of the z +1 th branchz+1Whether the constraint condition of 0-1 is met, if yes, the optimal total utility value G is obtainedz+1Finding outMaximum value is given and assigned to L, and χz+1E {0,1 }; otherwise, from the optimal total utility value Gz+1Find the maximum value and assign it to U, and χz+1∈(0,1);
step 5.14, judging whether G_{z+1} < L holds; if so, pruning the branch corresponding to the optimal solution χ_{z+1} of the (z+1)-th branch, assigning z+1 to z, and returning to step 5.10; otherwise, executing step 5.15;
step 5.15, judging whether G_{z+1} > L holds; if so, assigning z+1 to z and returning to step 5.10; otherwise, executing step 5.16;
step 5.16, judging whether G_{z+1} = U holds; if so, the optimal solution χ_{z+1} of the (z+1)-th branch is the optimal solution of the non-relaxed panoramic video adaptive transmission model: assigning χ_{z+1} to the optimal solution χ_{0-1} of the non-relaxed panoramic video adaptive transmission model, and assigning the corresponding G_{z+1} to the optimal total utility value G_{0-1} of the non-relaxed panoramic video adaptive transmission model; otherwise, assigning z+1 to z and returning to step 5.10.
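The following is a minimal, self-contained sketch of the mixed branch-and-bound loop of steps 5.9-5.16, again on a hypothetical linear-utility instance with a bandwidth budget in place of g(χ_{k,m,d}) ≤ 0 and a one-version-per-segment equality in place of h(χ_{k,m,d}) = 0. Each relaxation is solved with an off-the-shelf LP solver rather than the KKT equations (13)-(18); the random threshold ε_z only steers the exploration order here, and the upper bound U is kept at the root relaxation value, both simplifications of steps 5.11 and 5.13. All names (solve_relaxation, utility, rate, budget) are hypothetical.

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical instance: S (tile, GoF) segments, each offered at D code-rate versions.
rng = np.random.default_rng(1)
S, D = 6, 3
utility = rng.uniform(0.5, 1.0, size=(S, D))   # assumed per-version utility
rate = rng.uniform(1.0, 5.0, size=(S, D))      # assumed per-version bit rate
budget = rate.min(axis=1).sum() + 4.0          # assumed bandwidth budget, kept feasible
n = S * D

A_eq = np.zeros((S, n))                        # h(chi) = 0: one version per segment
for s in range(S):
    A_eq[s, s * D:(s + 1) * D] = 1.0

def solve_relaxation(fixed):
    """Relaxed model with some chi entries already fixed to 0 or 1 by branching."""
    bounds = [(fixed.get(i, 0.0), fixed.get(i, 1.0)) for i in range(n)]
    res = linprog(-utility.ravel(), A_ub=rate.ravel()[None, :], b_ub=[budget],
                  A_eq=A_eq, b_eq=np.ones(S), bounds=bounds, method="highs")
    return (res.x, -res.fun) if res.success else (None, -np.inf)

chi_relax, G_relax = solve_relaxation({})      # root relaxation -> (chi_relax, G_relax), step 5.4
L, U = 0.0, G_relax                            # bounds on the optimal total utility, steps 5.5-5.8
best_chi, best_G = None, -np.inf
open_branches = [{}]                           # each entry records the variables fixed so far

while open_branches:                           # iterative form of steps 5.9-5.16
    fixed = open_branches.pop()
    chi, G = solve_relaxation(fixed)
    if chi is None or G < L:                   # step 5.14: prune infeasible or dominated branches
        continue
    frac = [i for i in range(n) if 1e-6 < chi[i] < 1.0 - 1e-6]
    if not frac:                               # chi satisfies the 0-1 constraint (steps 5.10, 5.13)
        if G > best_G:
            best_chi, best_G = np.round(chi), G
            L = max(L, G)                      # raise the lower bound with the new incumbent
        if best_G >= U - 1e-9:                 # step 5.16: incumbent attains the upper bound
            break
        continue
    i = frac[0]                                # branch on one fractional component chi_z^(0,1)
    eps = rng.uniform(0.0, 1.0)                # random threshold eps_z, as in step 5.11
    children = [{**fixed, i: 0.0},             # sub-branch I: add the constraint chi_i = 0
                {**fixed, i: 1.0}]             # sub-branch II: add the constraint chi_i = 1
    if chi[i] < eps:                           # eps only decides which sub-branch is explored first
        children.reverse()
    open_branches.extend(children)             # step 5.12: both sub-branches are solved later

chi_01, G_01 = best_chi, best_G                # 0-1 optimal solution and total utility, step 5.16
print(G_01)
```

Because only one knapsack-type constraint is assumed, the root relaxation has at most a few fractional components and the tree stays small; a richer utility model or additional constraints would only change the body of solve_relaxation, not the branching logic.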
CN202110634899.2A 2021-06-08 2021-06-08 Panoramic video self-adaptive transmission method based on limited user visual angle feedback Active CN113329266B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110634899.2A CN113329266B (en) 2021-06-08 2021-06-08 Panoramic video self-adaptive transmission method based on limited user visual angle feedback

Publications (2)

Publication Number Publication Date
CN113329266A CN113329266A (en) 2021-08-31
CN113329266B (en) 2022-07-05

Family

ID=77419934

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110634899.2A Active CN113329266B (en) 2021-06-08 2021-06-08 Panoramic video self-adaptive transmission method based on limited user visual angle feedback

Country Status (1)

Country Link
CN (1) CN113329266B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114640870B (en) * 2022-03-21 2023-10-03 陕西师范大学 QoE-driven wireless VR video self-adaptive transmission optimization method and system
CN114827750B (en) * 2022-05-31 2023-12-22 脸萌有限公司 Viewing angle prediction method, device, equipment and storage medium
CN115022546B (en) * 2022-05-31 2023-11-14 咪咕视讯科技有限公司 Panoramic video transmission method, device, terminal equipment and storage medium
CN116996661B (en) * 2023-09-27 2024-01-05 中国科学技术大学 Three-dimensional video display method, device, equipment and medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110266714A (en) * 2019-06-28 2019-09-20 合肥工业大学 VR video under a kind of QoE driving adaptively acquires and transmission method
CN111866601A (en) * 2020-07-21 2020-10-30 中国科学技术大学 Cooperative game-based video code rate decision method in mobile marginal scene

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108235131B (en) * 2018-01-30 2020-07-03 重庆邮电大学 Panoramic video self-adaptive transmission method based on DASH
US10735778B2 (en) * 2018-08-23 2020-08-04 At&T Intellectual Property I, L.P. Proxy assisted panoramic video streaming at mobile edge
WO2020069976A1 (en) * 2018-10-01 2020-04-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concepts for improved head motion prediction and efficient encoding of immersive video
CN110166850B (en) * 2019-05-30 2020-11-06 上海交通大学 Method and system for predicting panoramic video watching position by multiple CNN networks
CN112468806B (en) * 2020-11-12 2022-07-26 中山大学 Panoramic video transmission optimization method for cloud VR platform
CN112616014B (en) * 2020-12-09 2022-03-15 福州大学 GAN-based panoramic video adaptive streaming transmission method
CN112822564B (en) * 2021-01-06 2023-03-24 鹏城实验室 Viewpoint-based panoramic video adaptive streaming media transmission method and system

Also Published As

Publication number Publication date
CN113329266A (en) 2021-08-31

Similar Documents

Publication Publication Date Title
CN113329266B (en) Panoramic video self-adaptive transmission method based on limited user visual angle feedback
CN110266714B (en) QoE-driven VR video self-adaptive acquisition and transmission method
CN110225341B (en) Task-driven code stream structured image coding method
US7830961B2 (en) Motion estimation and inter-mode prediction
US4791486A (en) Recursive image encoding/decoding using interpolation within variably sub-divided picture areas
CN110310343B (en) Image processing method and device
CN101584215B (en) Integrated spatial-temporal prediction
EP2343878A1 (en) Pixel prediction value generation procedure automatic generation method, image encoding method, image decoding method, devices using these methods, programs for these methods, and recording medium on which these programs are recorded
KR20220145405A (en) An image encoding/decoding method and apparatus for performing feature quantization/inverse quantization, and a recording medium for storing a bitstream
JP2005294977A (en) Two-path video encoding method and system using sliding window
KR102012906B1 (en) Systems and methods for processing a block of a digital image
CN113141526B (en) Point cloud video self-adaptive transmission method for joint resource allocation under QoE (quality of experience) drive
CN111556315A (en) Data transmission method and device
CN116803082A (en) Device and method for point cloud processing
JP4151963B2 (en) Image server device, client device, moving image distribution method, program, and information recording medium
WO2021092821A1 (en) Adaptively encoding video frames using content and network analysis
EP3284255A1 (en) Encoding a plurality of signals with data rate target of a signal depending on complexity information
US20230281881A1 (en) Video Frame Compression Method, Video Frame Decompression Method, and Apparatus
JPH10200894A (en) Image signal transformation method
US20210176477A1 (en) Electronic apparatus and control method thereof
JP6055268B2 (en) Color conversion device, color restoration device, and program thereof
CN107888917B (en) Image coding and decoding method and device
CN116996661B (en) Three-dimensional video display method, device, equipment and medium
CN117440162B (en) Multimedia interactive teaching method and system
Zhao et al. High efficient VR video coding based on auto projection selection using transferable features

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant