WO2017140230A1 - Procédé et dispositif pour réguler un débit de codes cible - Google Patents
Procédé et dispositif pour réguler un débit de codes cible Download PDFInfo
- Publication number
- WO2017140230A1 WO2017140230A1 PCT/CN2017/073227 CN2017073227W WO2017140230A1 WO 2017140230 A1 WO2017140230 A1 WO 2017140230A1 CN 2017073227 W CN2017073227 W CN 2017073227W WO 2017140230 A1 WO2017140230 A1 WO 2017140230A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sub
- images
- image
- code rate
- encoded
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/154—Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
Definitions
- This document relates to, but is not limited to, the field of video coding technology, and relates to a target rate adjustment method and apparatus.
- the selection of the target bit rate is an important link, which is directly related to the visual quality of the picture.
- the target bit rate of the encoded code is usually controlled based on the network bandwidth to obtain a balance between visual quality and bandwidth usage.
- the code rate controller sets the capacity feedback value of the hypothetical reference decoding buffer based on the network bandwidth, and is used for
- the encoded target code rate is calculated and assigned to maintain a smooth video output stream as much as possible, and to provide a minimum distortion video image decoding visual quality at a minimum bit rate.
- the related art selects the target code rate based on the capacity of the imaginary reference decoding buffer, there is a problem that the selected target code rate is not accurate enough.
- the embodiment of the invention provides a method and a device for adjusting a target bit rate, which can improve the accuracy of the selected target bit rate.
- An embodiment of the present invention provides a target rate adjustment method, where the target rate adjustment method includes:
- adjusting a target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold include:
- the encoded target code rate is reduced.
- the method before the step of adjusting the target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold, the method further includes:
- the encoded wave strength is updated to the calculated average.
- the image frame to be encoded is divided into a plurality of first sub-images, and the plurality of first sub-images are classified according to motion vector values of each of the first sub-images, and each class is calculated.
- the steps of the PSNR value of the first sub-image include:
- a PSNR value for each type of first sub-image is calculated based on each of the mean square errors.
- the method further includes:
- the adjusted target code rate is corrected based on the current network bandwidth to be encoded using the corrected target code rate.
- the step of modifying the adjusted target bit rate based on the current network bandwidth includes:
- the network bandwidth is used as the corrected target code rate.
- the embodiment of the present invention provides a target rate adjustment apparatus, where the target rate adjustment apparatus includes:
- a first calculating module configured to divide the image frame to be encoded into a plurality of first sub-images, and classify the plurality of first sub-images according to motion vector values of each of the first sub-images, and calculate each Peak signal to noise ratio PSNR value of the first sub-image of the class;
- a second calculating module configured to calculate an average value ⁇ of PSNR values of each type of first sub-image, and calculate a standard deviation ⁇ of PSNR values of each type of first sub-image, and correct the ratio of ⁇ to ⁇ by using a preset correction parameter After the wave strength as the code;
- the adjusting module is configured to adjust the target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold, wherein the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold include:
- the encoded target code rate is reduced.
- the target rate adjustment device further includes:
- a recording module configured to record the currently calculated fluctuation intensity, and calculate an average value of the recorded fluctuation strength when the recorded fluctuation intensity reaches a second preset threshold
- An update module is arranged to update the encoded wave strength to the calculated average.
- the first computing module includes:
- a classifying unit configured to divide the image frame to be encoded into a plurality of first sub-images, and calculate a motion vector value of each of the first sub-images relative to the adjacent image frames of the image frame to be encoded, according to each Sorting the plurality of first sub-images by a vector value interval in which the motion vector value of the first sub-image is located;
- a calculating unit configured to divide the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images in the same division manner as the image frame to be encoded, and calculate each of the first sub-images and a mean square error between the second sub-images corresponding to each of the first sub-images; and calculating a PSNR value for each of the first sub-images based on each of the mean square errors.
- the target rate adjustment apparatus further includes a correction module configured to correct the adjusted target code rate based on the current network bandwidth to perform encoding using the corrected target code rate.
- the modifying module corrects the adjusted target code rate based on the current network bandwidth by determining whether the current network bandwidth is greater than the adjusted target code rate; and in the current network. When the bandwidth is less than the adjusted target code rate, the network bandwidth is used as the corrected target code rate.
- the image frame to be encoded is divided into a plurality of first sub-images, and the plurality of the first sub-images are classified according to the motion vector values of the first sub-images, and the first type is calculated.
- the peak signal-to-noise ratio PSNR value of the sub-image, and then the ratio of the standard deviation ⁇ and the average value ⁇ of the PSNR values of the first sub-images are corrected by the preset correction parameters as the encoded fluctuation intensity, and the obtained fluctuation intensity includes Reconstruction distortion of image frames and features of motion compensation.
- the embodiment of the invention when the fluctuation strength is increased in the encoding process, based on the time domain correlation of the adjacent image frames, the next image frame to be encoded of the image frame to be encoded needs more bits for description (ie, encoding). Therefore, compared with the related art, the target code rate is selected by the imaginary value, and the technical solution of the embodiment of the present invention can accurately reflect the bit required for the coding by adjusting the target code rate of the coded by the fluctuation intensity in real time, thereby improving the selected target. The accuracy of the bit rate.
- the embodiment of the invention further provides a computer readable storage medium, wherein the computer readable storage medium stores computer executable instructions, and the computer executable instructions are implemented to implement a target rate adjustment method.
- FIG. 1 is a schematic flowchart of a target bit rate adjustment method according to Embodiment 1 of the present invention
- FIG. 2 is a schematic diagram of motion compensation in a first embodiment of a target code rate adjustment method according to Embodiment 1 of the present invention
- FIG. 3 is a real measurement diagram of target rate adjustment in the first embodiment of the code rate adjustment method according to the first embodiment of the present invention
- FIG. 4 is a schematic diagram showing a refinement flow of calculating a PSNR value of each type of first sub-image in FIG. 1;
- FIG. 5 is a motion vector distribution diagram of a first sub-image of a target bit rate adjustment method according to Embodiment 3 of the present invention.
- FIG. 6 is a schematic diagram of functional blocks of a target bit rate adjustment apparatus according to Embodiment 5 of the present invention.
- FIG. 7 is a schematic diagram of a refinement function module of the first computing module in FIG. 6.
- An embodiment of the present invention provides a target rate adjustment method.
- the target rate adjustment method includes:
- Step S10 dividing the image frame to be encoded into a plurality of first sub-images, and classifying the plurality of first sub-images according to motion vector values of each of the first sub-images, and calculating the first sub-category of each Peak signal to noise ratio PSNR value of the image;
- the target rate adjustment method proposed in this embodiment can be applied to video coding control of streaming media, for example, when encoding, by motion compensation vector calculation, and combining the distortion statistics of the reconstructed image frame to perform coding of the target code rate. Select to accurately select the appropriate target bit rate to reduce the occupation of network bandwidth while ensuring video quality.
- the image frame to be encoded is divided into a plurality of first sub-images in a square shape. It can be understood by those skilled in the art that in other embodiments, the shape of the first sub-image may be selected according to actual needs. For example, the image frame may be divided into a plurality of first sub-images that are rectangular.
- the size of the first sub-image may be set to 64*64.
- the image After dividing the image frame into a plurality of first sub-images, calculating motion vector values of each of the first sub-images, and pairing the plurality of the first ones according to motion vector values of each of the first sub-images One son
- the image is classified, that is, a plurality of the first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located.
- the motion vector value can directly reflect the fluctuation strength of the image.
- the image content of the moving image adjacent to the image frame has time domain correlation.
- motion compensation is performed for each of the first sub-images, a matching block of the first sub-image is searched for in an adjacent image frame of the image frame, and a motion vector of the matching block of the first sub-image is recorded.
- MV i,j ( ⁇ x, ⁇ y)
- a first sub-image is selected in the image frame at time t, a minimum distortion search is performed in the adjacent image frame at time t-1, a matching block with minimum distortion is obtained, and the first sub-child is calculated.
- the vector distance of the matching block of the image and the minimum distortion, as shown in Fig. 2, MV ( ⁇ x, ⁇ y), thereby calculating the motion vector value of the selected first sub-image
- the adjacent image frame of the currently selected image frame refers to the previous image frame in which the image frame is selected on the frame sequence.
- PSNR Peak Signal to Noise Ratio
- Step S20 calculating an average value ⁇ of PSNR values of each type of first sub-image, and calculating a standard deviation ⁇ of PSNR values of each type of first sub-image, and correcting the ratio of ⁇ to ⁇ by using a preset correction parameter as a coded Fluctuation intensity
- the average value of the PSNR values of each type of first sub-image Standard deviation of PSNR values for each type of first sub-image
- each vector value interval corresponds to a k value
- P k is a PSNR value of each type of first sub-image.
- the ratio of ⁇ to ⁇ is corrected using the preset correction parameter ⁇ as the encoded fluctuation intensity V, that is, the fluctuation intensity.
- the ⁇ value is an empirical parameter for normalizing the ratio of ⁇ to ⁇ to increase the processing speed. For example, the ⁇ value is set to 100 in this embodiment.
- a maximum k value k max may also be set, and a first sub-image with a motion vector value greater than k max is divided into k max intervals, where k max is an empirical parameter, for example, this embodiment will k max is set to 9.
- step S30 the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold.
- the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold.
- the encoded target code rate is reduced.
- the encoded target code rate is reduced.
- the first preset threshold may be set according to actual needs. For example, in this embodiment, the first preset threshold is set to 5.
- the step value TBR C of the target rate adjustment is preset. As a basis for each target rate adjustment.
- FIG. 3 a measured view of the target code rate adjustment of the embodiment is provided.
- the target code rate is adjusted according to the change of the fluctuation intensity in real time. It is possible that the low bit occupancy occupies the visual decoding quality of the video image with minimal distortion.
- the encoded target code rate is not adjusted.
- a first preset threshold interval may be further set, when the fluctuation strength V is greater than a maximum value in the first preset threshold interval, increasing a target code rate of the encoding; when the fluctuation strength is When the V is located in the first preset threshold interval, the encoded target code rate is not adjusted; when the fluctuation strength V is smaller than the minimum value in the first preset threshold interval, the encoded target code rate is reduced.
- the first preset interval is set to [4, 5].
- the target rate adjustment method proposed in this embodiment divides an image frame to be encoded into a plurality of first a sub-image, and classifying the plurality of first sub-images according to motion vector values of each of the first sub-images, calculating a peak signal-to-noise ratio PSNR value of each type of first sub-image, and then first The ratio of the standard deviation ⁇ of the PSNR value of the sub-image and the average value ⁇ is corrected by the preset correction parameter as the encoded fluctuation intensity, and the obtained fluctuation intensity includes the reconstruction distortion of the image frame and the characteristics of the motion compensation.
- the target code rate is selected by the imaginary value.
- the technical solution of the embodiment of the present invention can accurately reflect the bit required for the coding by adjusting the target code rate of the coded in real time by the fluctuation strength, and improve the selected target bit rate. Accuracy.
- the target rate adjustment method further includes:
- the encoded wave strength is updated to the calculated average.
- the image content of the moving image adjacent to the image frame has time domain correlation, the image content of the adjacent consecutive multiple image frames tends to be substantially the same, and correspondingly, the variation of the fluctuation intensity of the encoding in a short time is often negligible.
- the fluctuation strength of the encoding is calculated once per frame, the target rate is adjusted once in a long time, and there is a waste of processing resources. Therefore, in this embodiment, a window having a size of ⁇ (the aforementioned second preset threshold, specifically, can be set as needed) is set, and the average fluctuation intensity of the image frame in the window is counted every ⁇ frame to determine whether to perform The adjustment of the target bit rate.
- ⁇ is a positive number greater than 0, for example, the value range can be set to [1, 30].
- the currently calculated fluctuation intensity when the currently calculated fluctuation strength is recorded, the currently calculated fluctuation intensity may also be smoothed based on the last calculated fluctuation intensity to eliminate the influence of noise during the encoding process.
- the current The calculated fluctuation intensity is recorded as: a ⁇ V t-1 +b ⁇ V t ;
- V t-1 represents the last calculated fluctuation intensity
- V t represents the currently calculated fluctuation intensity
- the processing resource consumption is reduced on the basis of ensuring the accuracy of selecting the target code rate.
- step S10 includes:
- Step S101 dividing an image frame to be encoded into a plurality of first sub-images, and calculating a motion vector value of each of the first sub-images of the image frame to be encoded relative to adjacent image frames, according to each of the Sorting the plurality of first sub-images by a vector value interval in which a motion vector value of a sub-image is located;
- a continuous video is composed of a series of image frames.
- the image frame to be encoded is divided into a plurality of first sub-images in a square shape.
- the shape of the first sub-image may be selected according to actual needs.
- the image frame may be divided into a plurality of first sub-images that are rectangular.
- the size of the first sub-image may be set to 64*64.
- a sub-image is classified, that is, a plurality of the first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located.
- the motion vector value can directly reflect the fluctuation strength of the image.
- the image content of the moving image adjacent to the image frame has a time domain correlation.
- motion compensation is performed for each of the first sub-images, a matching block of the first sub-image is searched for in an adjacent image frame of the image frame, and a motion vector of the matching block of the first sub-image is recorded.
- MV i,j ( ⁇ x, ⁇ y)
- a first sub-image is selected in the image frame at time t, and a minimum distortion search is performed in the adjacent image frame at time t-1 to obtain a minimum distortion matching block, and the first first is calculated.
- the calculated motion vector distribution of each of the first sub-images of the image frame is as shown in FIG.
- Step S102 dividing the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images according to the same division manner as the image frame to be encoded, and calculating each of the first sub-images and each a mean square error between the second sub-images corresponding to the first sub-image;
- the next frame of coding is encoded based on the reconstructed frame of the current frame.
- the reconstructed image frame of the image frame to be encoded is divided into a plurality of second sub-images in the same division manner as the image frame to be encoded, and each of the first sub-images is calculated.
- a mean square error between the second sub-images corresponding to each of the first sub-images wherein a mean square error between the first sub-image and its corresponding second sub-image Represents the standard deviation of the first sub-image, Indicates the standard deviation of the second sub-image.
- Step S103 calculating a PSNR value of each class first sub-image based on each of the mean square errors.
- the average distortion D k of the first sub-image of each class is first counted as ⁇ D i,j
- k is an integer, and each of the aforementioned vector value intervals corresponds to a k value.
- the target rate adjustment method further includes:
- the adjusted target code rate is corrected based on the current network bandwidth to be encoded using the corrected target code rate.
- the network bandwidth in this embodiment refers to the network bandwidth available for encoding video transmission, and those skilled in the art may understand that the network bandwidth is dynamically changed, and the network bandwidth is also local resources.
- the problem of preemption therefore, in order to ensure the smoothness of the transmission of the encoded video, in this embodiment, the currently available network bandwidth is obtained, and the adjusted target bit rate is corrected based on the obtained network bandwidth, and the method is adopted. The corrected target bit rate is encoded.
- the correcting the adjusted target bit rate based on the current network bandwidth includes:
- the network bandwidth is used as the corrected target code rate.
- the network bandwidth is smaller than the adjusted target code rate, if the adjusted target bit rate is used for encoding, the encoded video data cannot be smoothly transmitted to the target terminal, resulting in playback. Problems such as Caton affect the user's visual experience. Therefore, in this embodiment, when the current network bandwidth is less than the adjusted target code rate, the network bandwidth is used as the corrected target code rate to ensure that the encoded video data can be smoothly transmitted to Target terminal.
- the target rate adjustment apparatus includes:
- the first calculating module 10 is configured to divide the image frame to be encoded into a plurality of first sub-images, and classify the plurality of first sub-images according to motion vector values of each of the first sub-images, and calculate Peak signal to noise ratio PSNR value of the first sub-image of each class;
- the target rate adjustment apparatus proposed in this embodiment may be applied to video coding control of a streaming media, for example, at the time of encoding, by performing motion compensation vector calculation, and combining the distortion statistics of the reconstructed image frame to perform coding of the target code rate. Select to accurately select the appropriate target bit rate to ensure video quality The amount of time reduces the occupation of network bandwidth.
- the first calculation module 10 divides the image frame to be encoded into a plurality of first sub-images in a square shape. It can be understood by those skilled in the art that in other embodiments, the shape of the first sub-image may be selected according to actual needs. For example, the first calculation module 10 may divide the image frame into a plurality of rectangles. Sub image.
- the size of the first sub-image may be set to 64*64.
- the first calculation module 10 calculates a motion vector value of each of the first sub-images, and performs motion vector value pairs for each of the first sub-images.
- the plurality of first sub-images are classified, that is, a plurality of the first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located. Among them, the motion vector value can directly reflect the fluctuation strength of the image.
- the first calculating module 10 performs motion compensation on each of the first sub-images, searches for matching blocks of the first sub-image in adjacent image frames of the image frame, and records the first sub-image.
- Matching block motion vector MV i,j ( ⁇ x, ⁇ y), Is a motion vector value, where ⁇ x represents the displacement of the first sub-image in the x-axis direction, and ⁇ y represents the displacement of the first sub-image in the y-axis direction.
- a first sub-image is selected in the image frame at time t, and a minimum distortion search is performed in the adjacent image frame at time t-1 to obtain a minimum distortion matching block, and the first first is calculated.
- the vector distance of the sub-image and the minimum distortion matching block, as shown in Fig. 2, MV ( ⁇ x, ⁇ y), thereby calculating the motion vector value of the selected first sub-image
- the adjacent image frame of the currently selected image frame refers to the previous image frame on the frame sequence in which the image frame is selected.
- the first calculation module 10 calculates a Peak Signal to Noise Ratio (PSNR) value of the first sub-image of each class.
- PSNR Peak Signal to Noise Ratio
- the second calculating module 20 is configured to calculate an average value ⁇ of the PSNR values of the first sub-images of each class, and calculate a standard deviation ⁇ of the PSNR values of the first sub-images of each class, and preset a ratio of ⁇ to ⁇ Correct the fluctuation intensity of the code as the code after correction;
- the average value of the PSNR values of the first sub-images of each class Standard deviation of PSNR values for the first sub-image of each class
- each vector value interval corresponds to a k value
- P k is a PSNR value of each type of first sub-image.
- the second calculation module 20 After calculating the ⁇ value and the ⁇ value of each type of sub-image, the second calculation module 20 corrects the ratio of ⁇ to ⁇ using the preset correction parameter ⁇ as the encoded fluctuation intensity V, that is, the fluctuation intensity.
- the ⁇ value is an empirical parameter for normalizing the ratio of ⁇ to ⁇ to increase the processing speed. For example, the ⁇ value is set to 100 in this embodiment.
- a maximum k value k max may also be set, and a first sub-image with a motion vector value greater than k max is divided into k max intervals, where k max is an empirical parameter, for example, this embodiment will k max is set to 9.
- the adjusting module 30 is configured to adjust the target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold, where the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold. : increasing the target code rate of the encoding when the fluctuation strength is greater than the first preset threshold;
- the encoded target code rate is reduced.
- the adjustment module 30 determines whether the fluctuation intensity V is greater than a first preset threshold, and if the fluctuation strength V is greater than the first A predetermined threshold increases the encoded target rate; otherwise, if the fluctuation intensity V is less than or equal to the first predetermined threshold, the encoded target code rate is decreased.
- the first preset threshold is set according to actual needs. For example, in this embodiment, the first preset threshold is set to 5.
- the step value TBR C of the target rate adjustment is preset. The benchmark for each target rate adjustment.
- FIG. 3 a measured view of the target code rate adjustment of the embodiment is provided.
- the target code rate is adjusted according to the change of the fluctuation intensity in real time. It is possible that the low bit occupancy occupies the visual decoding quality of the video image with minimal distortion.
- the encoded target code rate is not adjusted.
- a first preset threshold interval may be further set, when the fluctuation strength V is greater than a maximum value in the first preset threshold interval, increasing a target code rate of the encoding; when the fluctuation strength is When the V is located in the first preset threshold interval, the encoded target code rate is not adjusted; when the fluctuation strength V is smaller than the minimum value in the first preset threshold interval, the encoded target code rate is reduced.
- the first preset interval is set to [4, 5].
- the target rate adjustment apparatus of the present embodiment divides an image frame to be encoded into a plurality of first sub-images, and performs a plurality of the first sub-images according to motion vector values of each of the first sub-images. Classification, calculating the peak signal-to-noise ratio PSNR value of each type of first sub-image, and then comparing the standard deviation ⁇ of the PSNR value of each type of the first sub-image with the average value ⁇ as the encoded fluctuation intensity by using the preset correction parameter
- the resulting wave strength includes the reconstruction distortion of the image frame and the characteristics of motion compensation.
- the target code rate is selected by the imaginary value.
- the technical solution of the embodiment of the present invention can accurately reflect the bit required for the coding by adjusting the target code rate of the coded in real time by the fluctuation strength, and improve the selected target bit rate. Accuracy.
- the target rate adjustment apparatus further includes:
- a recording module configured to record the currently calculated fluctuation intensity, and calculate an average value of the recorded fluctuation strength when the recorded fluctuation intensity reaches a second preset threshold
- An update module is arranged to update the encoded wave strength to the calculated average.
- the recording module After completing the encoding of the image frame to be encoded and calculating the fluctuating intensity of the encoding, the recording module records the fluctuating intensity calculated at the time, and counts the calculated fluctuating intensity in the current window. Average fluctuation intensity of image frames
- the update module updates the encoded wave strength to the calculated average wave strength
- t is an integer multiple of ⁇
- t is used to indicate the position of the corresponding image frame in the sequence of frames.
- the recording module when recording the currently calculated fluctuation strength, may further smooth the currently calculated fluctuation intensity based on the last calculated fluctuation intensity to eliminate the influence of noise in the encoding process, in this embodiment.
- the recording module records the currently calculated fluctuation strength as: a ⁇ V t-1 +b ⁇ V t ;
- V t-1 represents the last calculated fluctuation intensity
- V t represents the currently calculated fluctuation intensity
- the processing resource consumption is reduced on the basis of ensuring the accuracy of selecting the target code rate.
- the first calculating module 10 includes:
- the classification unit 101 is configured to divide the image frame to be encoded into a plurality of first sub-images, and calculate a motion vector value of each of the first sub-images of the image frame to be encoded relative to adjacent image frames, according to each Sorting the plurality of first sub-images by a vector value interval in which the motion vector value of the first sub-image is located;
- a continuous video is composed of a series of image frames.
- the classification unit 101 divides the image frame to be encoded into a plurality of first sub-images in a square shape. It can be understood by those skilled in the art that in other embodiments, the shape of the first sub-image may be selected according to actual needs.
- the classification unit 101 may divide the image frame into a plurality of first sub-images that are rectangular. .
- the size of the first sub-image may be set to 64*64.
- the classifying unit 101 calculates a motion vector value of each of the first sub-images, and pairs the motion vector values of each of the first sub-images
- the first sub-image is classified, that is, the plurality of first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located.
- the motion vector value can directly reflect the fluctuation strength of the image.
- the classifying unit 101 performs motion compensation on each of the first sub-images, searches for matching blocks of the first sub-image in adjacent image frames of the image frame, and records matching blocks of the first sub-image.
- Motion vector MV i,j ( ⁇ x, ⁇ y), Is a motion vector value, where ⁇ x represents the displacement of the first sub-image in the x-axis direction, and ⁇ y represents the displacement of the first sub-image in the y-axis direction.
- the calculated motion vector distribution of each first sub-image of the image frame is as shown in FIG.
- the calculating unit 102 is configured to divide the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images according to the same division manner as the image frame to be encoded, and calculate each of the first sub-images And a mean square error between the second sub-images corresponding to each of the first sub-images; and calculating a PSNR value of each of the first sub-images based on each of the mean square errors.
- the next frame of coding is encoded based on the reconstructed frame of the current frame.
- the calculating unit 102 divides the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images according to the same division manner as the image frame to be encoded, and calculates each of the first a mean square error between a sub-image and a second sub-image corresponding to each of the first sub-images, wherein a mean square error between the first sub-image and its corresponding second sub-image Represents the standard deviation of the first sub-image, Indicates the standard deviation of the second sub-image.
- the calculating unit 102 After calculating the mean square error between each of the first sub-images and the second sub-image corresponding to each of the first sub-images, the calculating unit 102 first counts the average distortion of the first sub-image of each class.
- D k ⁇ D i,j
- the computing unit 102 then calculates the PSNR value of each class first sub-image based on the average distortion of each class first sub-image, and the PSNR value of each class first sub-image.
- bpp represents the number of bits when the pixel point gray value is expressed in binary, as in the 0 to 255 gray level representation, the value of bpp is 8.
- the target rate adjustment apparatus further includes a correction module configured to correct the adjusted target code rate based on the current network bandwidth to adopt the corrected target.
- the code rate is encoded.
- the network bandwidth in this embodiment refers to the network bandwidth available for encoding video transmission, and those skilled in the art may understand that the network bandwidth is dynamically changed, and the network bandwidth is also local resources.
- the correction module acquires the currently available network bandwidth, and corrects the adjusted target code rate based on the acquired network bandwidth. , encoding with the corrected target bit rate.
- the modifying module corrects the adjusted target code rate based on the current network bandwidth by determining whether the current network bandwidth is greater than the adjusted target code rate; and in the current network. When the bandwidth is less than the adjusted target code rate, the network bandwidth is used as the corrected target code rate.
- the correction module uses the network bandwidth as the corrected target code rate to ensure that the encoded video data can be Smooth transfer to the target terminal.
- the embodiment of the invention further provides a computer readable storage medium, wherein the computer readable storage medium stores computer executable instructions, and the computer executable instructions are implemented to implement a target rate adjustment method.
- each module/unit in the above embodiment may be implemented in the form of hardware, for example, by implementing an integrated circuit to implement its corresponding function, or may be implemented in the form of a software function module, for example, executing a program stored in the memory by a processor. / instruction to achieve its corresponding function.
- This application is not limited to any specific combination of hardware and software.
- the above technical solution can more accurately reflect the bits required for encoding, and improve the accuracy of the selected target bit rate.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
L'invention concerne un procédé pour réguler un débit de codes cible. Le procédé pour réguler un débit de codes cible consiste : à diviser une trame d'image à coder en une pluralité de premières sous-images, à classifier la pluralité de premières sous-images selon une valeur de vecteur de mouvement de chacune des premières sous-images, et à calculer une valeur de rapport signal sur bruit de crête (PSNR) de chaque classe de premières sous-images ; à calculer une valeur moyenne de la valeur de PSNR de chaque classe de premières sous-images, à calculer l'écart-type de la valeur de PSNR de chaque classe de premières sous-images, et à corriger le rapport de la valeur moyenne sur l'écart-type par utilisation d'un paramètre de correction préétabli, puis à prendre le rapport comme la puissance de fluctuation du codage ; et à réguler un débit de codes cible pour le codage selon une relation d'amplitude entre la puissance de fluctuation et une première valeur de seuil préréglée, lorsque la puissance de fluctuation est supérieure à la première valeur de seuil préréglée, le débit de codes cible pour le codage étant accru, et lorsque la puissance de fluctuation est inférieure à la première valeur de seuil préréglée, le débit de codes cible pour le codage étant réduit. La solution technique peut améliorer la précision de sélection d'un débit de codes cible.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610086704.4A CN107087192A (zh) | 2016-02-15 | 2016-02-15 | 目标码率调整方法及装置 |
CN201610086704.4 | 2016-02-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017140230A1 true WO2017140230A1 (fr) | 2017-08-24 |
Family
ID=59614136
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2017/073227 WO2017140230A1 (fr) | 2016-02-15 | 2017-02-10 | Procédé et dispositif pour réguler un débit de codes cible |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107087192A (fr) |
WO (1) | WO2017140230A1 (fr) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110166781A (zh) * | 2018-06-22 | 2019-08-23 | 腾讯科技(深圳)有限公司 | 一种视频编码方法、装置和可读介质 |
CN111447339A (zh) * | 2020-03-26 | 2020-07-24 | 西安万像电子科技有限公司 | 图像传输方法和系统 |
CN111787336A (zh) * | 2019-04-04 | 2020-10-16 | 杭州海康威视数字技术股份有限公司 | 码率切换的方法和装置 |
CN113315773A (zh) * | 2021-05-31 | 2021-08-27 | 浙江大华技术股份有限公司 | 一种码率调整方法、装置、电子设备及存储介质 |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109344278B (zh) * | 2018-09-25 | 2020-08-14 | 北京邮电大学 | 一种视觉搜索方法、装置及设备 |
CN111726656B (zh) * | 2020-07-20 | 2022-07-26 | 有半岛(北京)信息科技有限公司 | 一种直播视频的转码方法、装置、服务器和存储介质 |
CN113099241B (zh) * | 2021-03-31 | 2022-11-01 | 北京百度网讯科技有限公司 | 参考帧列表更新方法、装置、设备以及存储介质 |
CN114401404B (zh) * | 2022-03-24 | 2022-07-26 | 深圳比特微电子科技有限公司 | 一种vpu编码策略的实现方法和装置 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020168007A1 (en) * | 2001-04-19 | 2002-11-14 | Sarnoff Corporation | Apparatus and method for allocating bits temporaly between frames in a coding system |
CN101094411A (zh) * | 2007-07-03 | 2007-12-26 | 芯瀚电子技术(上海)有限公司 | 一种视频编码的码率控制方法 |
EP1944980A2 (fr) * | 2007-01-10 | 2008-07-16 | Koninklijke Philips Electronics N.V. | Algorithme de contrôle de débit |
CN102223524A (zh) * | 2010-04-13 | 2011-10-19 | 中兴通讯股份有限公司 | 一种三维小波视频编码帧分组方法及装置 |
CN104065962A (zh) * | 2013-10-28 | 2014-09-24 | 四川大学 | 基于视觉注意的宏块层比特分配优化方法 |
-
2016
- 2016-02-15 CN CN201610086704.4A patent/CN107087192A/zh active Pending
-
2017
- 2017-02-10 WO PCT/CN2017/073227 patent/WO2017140230A1/fr active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020168007A1 (en) * | 2001-04-19 | 2002-11-14 | Sarnoff Corporation | Apparatus and method for allocating bits temporaly between frames in a coding system |
EP1944980A2 (fr) * | 2007-01-10 | 2008-07-16 | Koninklijke Philips Electronics N.V. | Algorithme de contrôle de débit |
CN101094411A (zh) * | 2007-07-03 | 2007-12-26 | 芯瀚电子技术(上海)有限公司 | 一种视频编码的码率控制方法 |
CN102223524A (zh) * | 2010-04-13 | 2011-10-19 | 中兴通讯股份有限公司 | 一种三维小波视频编码帧分组方法及装置 |
CN104065962A (zh) * | 2013-10-28 | 2014-09-24 | 四川大学 | 基于视觉注意的宏块层比特分配优化方法 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110166781A (zh) * | 2018-06-22 | 2019-08-23 | 腾讯科技(深圳)有限公司 | 一种视频编码方法、装置和可读介质 |
CN111787336A (zh) * | 2019-04-04 | 2020-10-16 | 杭州海康威视数字技术股份有限公司 | 码率切换的方法和装置 |
CN111447339A (zh) * | 2020-03-26 | 2020-07-24 | 西安万像电子科技有限公司 | 图像传输方法和系统 |
CN113315773A (zh) * | 2021-05-31 | 2021-08-27 | 浙江大华技术股份有限公司 | 一种码率调整方法、装置、电子设备及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN107087192A (zh) | 2017-08-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2017140230A1 (fr) | Procédé et dispositif pour réguler un débit de codes cible | |
US11277562B2 (en) | Image stabilization using machine learning | |
EP3488388B1 (fr) | Procédé et appareil de traitement de vidéo | |
US10499056B2 (en) | System and method for video processing based on quantization parameter | |
US10856030B1 (en) | Bitrate selection for video streaming | |
US20130027568A1 (en) | Support vector regression based video quality prediction | |
US8401311B2 (en) | Image processing device, method, and program | |
US20210409724A1 (en) | Method and device for bitrate adjustment in encoding process | |
US9606636B2 (en) | Optical processing apparatus, light source luminance adjustment method, and non-transitory computer readable medium thereof | |
US12022096B2 (en) | Human visual system adaptive video coding | |
US10536696B2 (en) | Image encoding device and image encoding method | |
CN111050169B (zh) | 图像编码中量化参数的生成方法、装置及终端 | |
US20050213843A1 (en) | Method and apparatus for calculating moving-image correction-coefficient, moving-image correcting apparatus, and computer product | |
CN112655212A (zh) | 视频编码优化方法、装置及计算机存储介质 | |
CN109219960B (zh) | 视频编码质量平滑度的优化方法、装置、设备及存储介质 | |
US20230388511A1 (en) | Perceptually-aware Online Per-title Encoding for Live Video Streaming | |
US20210075843A1 (en) | Quality Metadata Signaling for Dynamic Adaptive Streaming of Video | |
US10897619B2 (en) | Complexity control of video codec | |
US11825088B2 (en) | Adaptively encoding video frames based on complexity | |
KR20140101745A (ko) | 다수 아티팩트를 고려한 비디오 품질 측정 | |
JP4410039B2 (ja) | 動き検出装置および動き検出方法、画像処理装置 | |
US11622118B2 (en) | Determination of coding modes for video content using order of potential coding modes and block classification | |
US10848772B2 (en) | Histogram-based edge/text detection | |
US20240357138A1 (en) | Human visual system adaptive video coding | |
JP2018152793A (ja) | 画像処理装置及びその制御方法、プログラム、記憶媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17752662 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17752662 Country of ref document: EP Kind code of ref document: A1 |