WO2017140230A1 - Procédé et dispositif pour réguler un débit de codes cible - Google Patents

Procédé et dispositif pour réguler un débit de codes cible Download PDF

Info

Publication number
WO2017140230A1
WO2017140230A1 PCT/CN2017/073227 CN2017073227W WO2017140230A1 WO 2017140230 A1 WO2017140230 A1 WO 2017140230A1 CN 2017073227 W CN2017073227 W CN 2017073227W WO 2017140230 A1 WO2017140230 A1 WO 2017140230A1
Authority
WO
WIPO (PCT)
Prior art keywords
sub
images
image
code rate
encoded
Prior art date
Application number
PCT/CN2017/073227
Other languages
English (en)
Chinese (zh)
Inventor
左雯
胡祥斌
李振纲
王宁
周益民
朱策
罗敏珂
钟敏
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2017140230A1 publication Critical patent/WO2017140230A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion

Definitions

  • This document relates to, but is not limited to, the field of video coding technology, and relates to a target rate adjustment method and apparatus.
  • the selection of the target bit rate is an important link, which is directly related to the visual quality of the picture.
  • the target bit rate of the encoded code is usually controlled based on the network bandwidth to obtain a balance between visual quality and bandwidth usage.
  • the code rate controller sets the capacity feedback value of the hypothetical reference decoding buffer based on the network bandwidth, and is used for
  • the encoded target code rate is calculated and assigned to maintain a smooth video output stream as much as possible, and to provide a minimum distortion video image decoding visual quality at a minimum bit rate.
  • the related art selects the target code rate based on the capacity of the imaginary reference decoding buffer, there is a problem that the selected target code rate is not accurate enough.
  • the embodiment of the invention provides a method and a device for adjusting a target bit rate, which can improve the accuracy of the selected target bit rate.
  • An embodiment of the present invention provides a target rate adjustment method, where the target rate adjustment method includes:
  • adjusting a target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold include:
  • the encoded target code rate is reduced.
  • the method before the step of adjusting the target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold, the method further includes:
  • the encoded wave strength is updated to the calculated average.
  • the image frame to be encoded is divided into a plurality of first sub-images, and the plurality of first sub-images are classified according to motion vector values of each of the first sub-images, and each class is calculated.
  • the steps of the PSNR value of the first sub-image include:
  • a PSNR value for each type of first sub-image is calculated based on each of the mean square errors.
  • the method further includes:
  • the adjusted target code rate is corrected based on the current network bandwidth to be encoded using the corrected target code rate.
  • the step of modifying the adjusted target bit rate based on the current network bandwidth includes:
  • the network bandwidth is used as the corrected target code rate.
  • the embodiment of the present invention provides a target rate adjustment apparatus, where the target rate adjustment apparatus includes:
  • a first calculating module configured to divide the image frame to be encoded into a plurality of first sub-images, and classify the plurality of first sub-images according to motion vector values of each of the first sub-images, and calculate each Peak signal to noise ratio PSNR value of the first sub-image of the class;
  • a second calculating module configured to calculate an average value ⁇ of PSNR values of each type of first sub-image, and calculate a standard deviation ⁇ of PSNR values of each type of first sub-image, and correct the ratio of ⁇ to ⁇ by using a preset correction parameter After the wave strength as the code;
  • the adjusting module is configured to adjust the target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold, wherein the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold include:
  • the encoded target code rate is reduced.
  • the target rate adjustment device further includes:
  • a recording module configured to record the currently calculated fluctuation intensity, and calculate an average value of the recorded fluctuation strength when the recorded fluctuation intensity reaches a second preset threshold
  • An update module is arranged to update the encoded wave strength to the calculated average.
  • the first computing module includes:
  • a classifying unit configured to divide the image frame to be encoded into a plurality of first sub-images, and calculate a motion vector value of each of the first sub-images relative to the adjacent image frames of the image frame to be encoded, according to each Sorting the plurality of first sub-images by a vector value interval in which the motion vector value of the first sub-image is located;
  • a calculating unit configured to divide the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images in the same division manner as the image frame to be encoded, and calculate each of the first sub-images and a mean square error between the second sub-images corresponding to each of the first sub-images; and calculating a PSNR value for each of the first sub-images based on each of the mean square errors.
  • the target rate adjustment apparatus further includes a correction module configured to correct the adjusted target code rate based on the current network bandwidth to perform encoding using the corrected target code rate.
  • the modifying module corrects the adjusted target code rate based on the current network bandwidth by determining whether the current network bandwidth is greater than the adjusted target code rate; and in the current network. When the bandwidth is less than the adjusted target code rate, the network bandwidth is used as the corrected target code rate.
  • the image frame to be encoded is divided into a plurality of first sub-images, and the plurality of the first sub-images are classified according to the motion vector values of the first sub-images, and the first type is calculated.
  • the peak signal-to-noise ratio PSNR value of the sub-image, and then the ratio of the standard deviation ⁇ and the average value ⁇ of the PSNR values of the first sub-images are corrected by the preset correction parameters as the encoded fluctuation intensity, and the obtained fluctuation intensity includes Reconstruction distortion of image frames and features of motion compensation.
  • the embodiment of the invention when the fluctuation strength is increased in the encoding process, based on the time domain correlation of the adjacent image frames, the next image frame to be encoded of the image frame to be encoded needs more bits for description (ie, encoding). Therefore, compared with the related art, the target code rate is selected by the imaginary value, and the technical solution of the embodiment of the present invention can accurately reflect the bit required for the coding by adjusting the target code rate of the coded by the fluctuation intensity in real time, thereby improving the selected target. The accuracy of the bit rate.
  • the embodiment of the invention further provides a computer readable storage medium, wherein the computer readable storage medium stores computer executable instructions, and the computer executable instructions are implemented to implement a target rate adjustment method.
  • FIG. 1 is a schematic flowchart of a target bit rate adjustment method according to Embodiment 1 of the present invention
  • FIG. 2 is a schematic diagram of motion compensation in a first embodiment of a target code rate adjustment method according to Embodiment 1 of the present invention
  • FIG. 3 is a real measurement diagram of target rate adjustment in the first embodiment of the code rate adjustment method according to the first embodiment of the present invention
  • FIG. 4 is a schematic diagram showing a refinement flow of calculating a PSNR value of each type of first sub-image in FIG. 1;
  • FIG. 5 is a motion vector distribution diagram of a first sub-image of a target bit rate adjustment method according to Embodiment 3 of the present invention.
  • FIG. 6 is a schematic diagram of functional blocks of a target bit rate adjustment apparatus according to Embodiment 5 of the present invention.
  • FIG. 7 is a schematic diagram of a refinement function module of the first computing module in FIG. 6.
  • An embodiment of the present invention provides a target rate adjustment method.
  • the target rate adjustment method includes:
  • Step S10 dividing the image frame to be encoded into a plurality of first sub-images, and classifying the plurality of first sub-images according to motion vector values of each of the first sub-images, and calculating the first sub-category of each Peak signal to noise ratio PSNR value of the image;
  • the target rate adjustment method proposed in this embodiment can be applied to video coding control of streaming media, for example, when encoding, by motion compensation vector calculation, and combining the distortion statistics of the reconstructed image frame to perform coding of the target code rate. Select to accurately select the appropriate target bit rate to reduce the occupation of network bandwidth while ensuring video quality.
  • the image frame to be encoded is divided into a plurality of first sub-images in a square shape. It can be understood by those skilled in the art that in other embodiments, the shape of the first sub-image may be selected according to actual needs. For example, the image frame may be divided into a plurality of first sub-images that are rectangular.
  • the size of the first sub-image may be set to 64*64.
  • the image After dividing the image frame into a plurality of first sub-images, calculating motion vector values of each of the first sub-images, and pairing the plurality of the first ones according to motion vector values of each of the first sub-images One son
  • the image is classified, that is, a plurality of the first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located.
  • the motion vector value can directly reflect the fluctuation strength of the image.
  • the image content of the moving image adjacent to the image frame has time domain correlation.
  • motion compensation is performed for each of the first sub-images, a matching block of the first sub-image is searched for in an adjacent image frame of the image frame, and a motion vector of the matching block of the first sub-image is recorded.
  • MV i,j ( ⁇ x, ⁇ y)
  • a first sub-image is selected in the image frame at time t, a minimum distortion search is performed in the adjacent image frame at time t-1, a matching block with minimum distortion is obtained, and the first sub-child is calculated.
  • the vector distance of the matching block of the image and the minimum distortion, as shown in Fig. 2, MV ( ⁇ x, ⁇ y), thereby calculating the motion vector value of the selected first sub-image
  • the adjacent image frame of the currently selected image frame refers to the previous image frame in which the image frame is selected on the frame sequence.
  • PSNR Peak Signal to Noise Ratio
  • Step S20 calculating an average value ⁇ of PSNR values of each type of first sub-image, and calculating a standard deviation ⁇ of PSNR values of each type of first sub-image, and correcting the ratio of ⁇ to ⁇ by using a preset correction parameter as a coded Fluctuation intensity
  • the average value of the PSNR values of each type of first sub-image Standard deviation of PSNR values for each type of first sub-image
  • each vector value interval corresponds to a k value
  • P k is a PSNR value of each type of first sub-image.
  • the ratio of ⁇ to ⁇ is corrected using the preset correction parameter ⁇ as the encoded fluctuation intensity V, that is, the fluctuation intensity.
  • the ⁇ value is an empirical parameter for normalizing the ratio of ⁇ to ⁇ to increase the processing speed. For example, the ⁇ value is set to 100 in this embodiment.
  • a maximum k value k max may also be set, and a first sub-image with a motion vector value greater than k max is divided into k max intervals, where k max is an empirical parameter, for example, this embodiment will k max is set to 9.
  • step S30 the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold.
  • the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold.
  • the encoded target code rate is reduced.
  • the encoded target code rate is reduced.
  • the first preset threshold may be set according to actual needs. For example, in this embodiment, the first preset threshold is set to 5.
  • the step value TBR C of the target rate adjustment is preset. As a basis for each target rate adjustment.
  • FIG. 3 a measured view of the target code rate adjustment of the embodiment is provided.
  • the target code rate is adjusted according to the change of the fluctuation intensity in real time. It is possible that the low bit occupancy occupies the visual decoding quality of the video image with minimal distortion.
  • the encoded target code rate is not adjusted.
  • a first preset threshold interval may be further set, when the fluctuation strength V is greater than a maximum value in the first preset threshold interval, increasing a target code rate of the encoding; when the fluctuation strength is When the V is located in the first preset threshold interval, the encoded target code rate is not adjusted; when the fluctuation strength V is smaller than the minimum value in the first preset threshold interval, the encoded target code rate is reduced.
  • the first preset interval is set to [4, 5].
  • the target rate adjustment method proposed in this embodiment divides an image frame to be encoded into a plurality of first a sub-image, and classifying the plurality of first sub-images according to motion vector values of each of the first sub-images, calculating a peak signal-to-noise ratio PSNR value of each type of first sub-image, and then first The ratio of the standard deviation ⁇ of the PSNR value of the sub-image and the average value ⁇ is corrected by the preset correction parameter as the encoded fluctuation intensity, and the obtained fluctuation intensity includes the reconstruction distortion of the image frame and the characteristics of the motion compensation.
  • the target code rate is selected by the imaginary value.
  • the technical solution of the embodiment of the present invention can accurately reflect the bit required for the coding by adjusting the target code rate of the coded in real time by the fluctuation strength, and improve the selected target bit rate. Accuracy.
  • the target rate adjustment method further includes:
  • the encoded wave strength is updated to the calculated average.
  • the image content of the moving image adjacent to the image frame has time domain correlation, the image content of the adjacent consecutive multiple image frames tends to be substantially the same, and correspondingly, the variation of the fluctuation intensity of the encoding in a short time is often negligible.
  • the fluctuation strength of the encoding is calculated once per frame, the target rate is adjusted once in a long time, and there is a waste of processing resources. Therefore, in this embodiment, a window having a size of ⁇ (the aforementioned second preset threshold, specifically, can be set as needed) is set, and the average fluctuation intensity of the image frame in the window is counted every ⁇ frame to determine whether to perform The adjustment of the target bit rate.
  • is a positive number greater than 0, for example, the value range can be set to [1, 30].
  • the currently calculated fluctuation intensity when the currently calculated fluctuation strength is recorded, the currently calculated fluctuation intensity may also be smoothed based on the last calculated fluctuation intensity to eliminate the influence of noise during the encoding process.
  • the current The calculated fluctuation intensity is recorded as: a ⁇ V t-1 +b ⁇ V t ;
  • V t-1 represents the last calculated fluctuation intensity
  • V t represents the currently calculated fluctuation intensity
  • the processing resource consumption is reduced on the basis of ensuring the accuracy of selecting the target code rate.
  • step S10 includes:
  • Step S101 dividing an image frame to be encoded into a plurality of first sub-images, and calculating a motion vector value of each of the first sub-images of the image frame to be encoded relative to adjacent image frames, according to each of the Sorting the plurality of first sub-images by a vector value interval in which a motion vector value of a sub-image is located;
  • a continuous video is composed of a series of image frames.
  • the image frame to be encoded is divided into a plurality of first sub-images in a square shape.
  • the shape of the first sub-image may be selected according to actual needs.
  • the image frame may be divided into a plurality of first sub-images that are rectangular.
  • the size of the first sub-image may be set to 64*64.
  • a sub-image is classified, that is, a plurality of the first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located.
  • the motion vector value can directly reflect the fluctuation strength of the image.
  • the image content of the moving image adjacent to the image frame has a time domain correlation.
  • motion compensation is performed for each of the first sub-images, a matching block of the first sub-image is searched for in an adjacent image frame of the image frame, and a motion vector of the matching block of the first sub-image is recorded.
  • MV i,j ( ⁇ x, ⁇ y)
  • a first sub-image is selected in the image frame at time t, and a minimum distortion search is performed in the adjacent image frame at time t-1 to obtain a minimum distortion matching block, and the first first is calculated.
  • the calculated motion vector distribution of each of the first sub-images of the image frame is as shown in FIG.
  • Step S102 dividing the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images according to the same division manner as the image frame to be encoded, and calculating each of the first sub-images and each a mean square error between the second sub-images corresponding to the first sub-image;
  • the next frame of coding is encoded based on the reconstructed frame of the current frame.
  • the reconstructed image frame of the image frame to be encoded is divided into a plurality of second sub-images in the same division manner as the image frame to be encoded, and each of the first sub-images is calculated.
  • a mean square error between the second sub-images corresponding to each of the first sub-images wherein a mean square error between the first sub-image and its corresponding second sub-image Represents the standard deviation of the first sub-image, Indicates the standard deviation of the second sub-image.
  • Step S103 calculating a PSNR value of each class first sub-image based on each of the mean square errors.
  • the average distortion D k of the first sub-image of each class is first counted as ⁇ D i,j
  • k is an integer, and each of the aforementioned vector value intervals corresponds to a k value.
  • the target rate adjustment method further includes:
  • the adjusted target code rate is corrected based on the current network bandwidth to be encoded using the corrected target code rate.
  • the network bandwidth in this embodiment refers to the network bandwidth available for encoding video transmission, and those skilled in the art may understand that the network bandwidth is dynamically changed, and the network bandwidth is also local resources.
  • the problem of preemption therefore, in order to ensure the smoothness of the transmission of the encoded video, in this embodiment, the currently available network bandwidth is obtained, and the adjusted target bit rate is corrected based on the obtained network bandwidth, and the method is adopted. The corrected target bit rate is encoded.
  • the correcting the adjusted target bit rate based on the current network bandwidth includes:
  • the network bandwidth is used as the corrected target code rate.
  • the network bandwidth is smaller than the adjusted target code rate, if the adjusted target bit rate is used for encoding, the encoded video data cannot be smoothly transmitted to the target terminal, resulting in playback. Problems such as Caton affect the user's visual experience. Therefore, in this embodiment, when the current network bandwidth is less than the adjusted target code rate, the network bandwidth is used as the corrected target code rate to ensure that the encoded video data can be smoothly transmitted to Target terminal.
  • the target rate adjustment apparatus includes:
  • the first calculating module 10 is configured to divide the image frame to be encoded into a plurality of first sub-images, and classify the plurality of first sub-images according to motion vector values of each of the first sub-images, and calculate Peak signal to noise ratio PSNR value of the first sub-image of each class;
  • the target rate adjustment apparatus proposed in this embodiment may be applied to video coding control of a streaming media, for example, at the time of encoding, by performing motion compensation vector calculation, and combining the distortion statistics of the reconstructed image frame to perform coding of the target code rate. Select to accurately select the appropriate target bit rate to ensure video quality The amount of time reduces the occupation of network bandwidth.
  • the first calculation module 10 divides the image frame to be encoded into a plurality of first sub-images in a square shape. It can be understood by those skilled in the art that in other embodiments, the shape of the first sub-image may be selected according to actual needs. For example, the first calculation module 10 may divide the image frame into a plurality of rectangles. Sub image.
  • the size of the first sub-image may be set to 64*64.
  • the first calculation module 10 calculates a motion vector value of each of the first sub-images, and performs motion vector value pairs for each of the first sub-images.
  • the plurality of first sub-images are classified, that is, a plurality of the first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located. Among them, the motion vector value can directly reflect the fluctuation strength of the image.
  • the first calculating module 10 performs motion compensation on each of the first sub-images, searches for matching blocks of the first sub-image in adjacent image frames of the image frame, and records the first sub-image.
  • Matching block motion vector MV i,j ( ⁇ x, ⁇ y), Is a motion vector value, where ⁇ x represents the displacement of the first sub-image in the x-axis direction, and ⁇ y represents the displacement of the first sub-image in the y-axis direction.
  • a first sub-image is selected in the image frame at time t, and a minimum distortion search is performed in the adjacent image frame at time t-1 to obtain a minimum distortion matching block, and the first first is calculated.
  • the vector distance of the sub-image and the minimum distortion matching block, as shown in Fig. 2, MV ( ⁇ x, ⁇ y), thereby calculating the motion vector value of the selected first sub-image
  • the adjacent image frame of the currently selected image frame refers to the previous image frame on the frame sequence in which the image frame is selected.
  • the first calculation module 10 calculates a Peak Signal to Noise Ratio (PSNR) value of the first sub-image of each class.
  • PSNR Peak Signal to Noise Ratio
  • the second calculating module 20 is configured to calculate an average value ⁇ of the PSNR values of the first sub-images of each class, and calculate a standard deviation ⁇ of the PSNR values of the first sub-images of each class, and preset a ratio of ⁇ to ⁇ Correct the fluctuation intensity of the code as the code after correction;
  • the average value of the PSNR values of the first sub-images of each class Standard deviation of PSNR values for the first sub-image of each class
  • each vector value interval corresponds to a k value
  • P k is a PSNR value of each type of first sub-image.
  • the second calculation module 20 After calculating the ⁇ value and the ⁇ value of each type of sub-image, the second calculation module 20 corrects the ratio of ⁇ to ⁇ using the preset correction parameter ⁇ as the encoded fluctuation intensity V, that is, the fluctuation intensity.
  • the ⁇ value is an empirical parameter for normalizing the ratio of ⁇ to ⁇ to increase the processing speed. For example, the ⁇ value is set to 100 in this embodiment.
  • a maximum k value k max may also be set, and a first sub-image with a motion vector value greater than k max is divided into k max intervals, where k max is an empirical parameter, for example, this embodiment will k max is set to 9.
  • the adjusting module 30 is configured to adjust the target code rate of the encoding according to the magnitude relationship between the fluctuation strength and the first preset threshold, where the target code rate of the encoding is adjusted according to the magnitude relationship between the fluctuation strength and the first preset threshold. : increasing the target code rate of the encoding when the fluctuation strength is greater than the first preset threshold;
  • the encoded target code rate is reduced.
  • the adjustment module 30 determines whether the fluctuation intensity V is greater than a first preset threshold, and if the fluctuation strength V is greater than the first A predetermined threshold increases the encoded target rate; otherwise, if the fluctuation intensity V is less than or equal to the first predetermined threshold, the encoded target code rate is decreased.
  • the first preset threshold is set according to actual needs. For example, in this embodiment, the first preset threshold is set to 5.
  • the step value TBR C of the target rate adjustment is preset. The benchmark for each target rate adjustment.
  • FIG. 3 a measured view of the target code rate adjustment of the embodiment is provided.
  • the target code rate is adjusted according to the change of the fluctuation intensity in real time. It is possible that the low bit occupancy occupies the visual decoding quality of the video image with minimal distortion.
  • the encoded target code rate is not adjusted.
  • a first preset threshold interval may be further set, when the fluctuation strength V is greater than a maximum value in the first preset threshold interval, increasing a target code rate of the encoding; when the fluctuation strength is When the V is located in the first preset threshold interval, the encoded target code rate is not adjusted; when the fluctuation strength V is smaller than the minimum value in the first preset threshold interval, the encoded target code rate is reduced.
  • the first preset interval is set to [4, 5].
  • the target rate adjustment apparatus of the present embodiment divides an image frame to be encoded into a plurality of first sub-images, and performs a plurality of the first sub-images according to motion vector values of each of the first sub-images. Classification, calculating the peak signal-to-noise ratio PSNR value of each type of first sub-image, and then comparing the standard deviation ⁇ of the PSNR value of each type of the first sub-image with the average value ⁇ as the encoded fluctuation intensity by using the preset correction parameter
  • the resulting wave strength includes the reconstruction distortion of the image frame and the characteristics of motion compensation.
  • the target code rate is selected by the imaginary value.
  • the technical solution of the embodiment of the present invention can accurately reflect the bit required for the coding by adjusting the target code rate of the coded in real time by the fluctuation strength, and improve the selected target bit rate. Accuracy.
  • the target rate adjustment apparatus further includes:
  • a recording module configured to record the currently calculated fluctuation intensity, and calculate an average value of the recorded fluctuation strength when the recorded fluctuation intensity reaches a second preset threshold
  • An update module is arranged to update the encoded wave strength to the calculated average.
  • the recording module After completing the encoding of the image frame to be encoded and calculating the fluctuating intensity of the encoding, the recording module records the fluctuating intensity calculated at the time, and counts the calculated fluctuating intensity in the current window. Average fluctuation intensity of image frames
  • the update module updates the encoded wave strength to the calculated average wave strength
  • t is an integer multiple of ⁇
  • t is used to indicate the position of the corresponding image frame in the sequence of frames.
  • the recording module when recording the currently calculated fluctuation strength, may further smooth the currently calculated fluctuation intensity based on the last calculated fluctuation intensity to eliminate the influence of noise in the encoding process, in this embodiment.
  • the recording module records the currently calculated fluctuation strength as: a ⁇ V t-1 +b ⁇ V t ;
  • V t-1 represents the last calculated fluctuation intensity
  • V t represents the currently calculated fluctuation intensity
  • the processing resource consumption is reduced on the basis of ensuring the accuracy of selecting the target code rate.
  • the first calculating module 10 includes:
  • the classification unit 101 is configured to divide the image frame to be encoded into a plurality of first sub-images, and calculate a motion vector value of each of the first sub-images of the image frame to be encoded relative to adjacent image frames, according to each Sorting the plurality of first sub-images by a vector value interval in which the motion vector value of the first sub-image is located;
  • a continuous video is composed of a series of image frames.
  • the classification unit 101 divides the image frame to be encoded into a plurality of first sub-images in a square shape. It can be understood by those skilled in the art that in other embodiments, the shape of the first sub-image may be selected according to actual needs.
  • the classification unit 101 may divide the image frame into a plurality of first sub-images that are rectangular. .
  • the size of the first sub-image may be set to 64*64.
  • the classifying unit 101 calculates a motion vector value of each of the first sub-images, and pairs the motion vector values of each of the first sub-images
  • the first sub-image is classified, that is, the plurality of first sub-images are classified according to a vector value interval in which the motion vector value of each of the first sub-images is located.
  • the motion vector value can directly reflect the fluctuation strength of the image.
  • the classifying unit 101 performs motion compensation on each of the first sub-images, searches for matching blocks of the first sub-image in adjacent image frames of the image frame, and records matching blocks of the first sub-image.
  • Motion vector MV i,j ( ⁇ x, ⁇ y), Is a motion vector value, where ⁇ x represents the displacement of the first sub-image in the x-axis direction, and ⁇ y represents the displacement of the first sub-image in the y-axis direction.
  • the calculated motion vector distribution of each first sub-image of the image frame is as shown in FIG.
  • the calculating unit 102 is configured to divide the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images according to the same division manner as the image frame to be encoded, and calculate each of the first sub-images And a mean square error between the second sub-images corresponding to each of the first sub-images; and calculating a PSNR value of each of the first sub-images based on each of the mean square errors.
  • the next frame of coding is encoded based on the reconstructed frame of the current frame.
  • the calculating unit 102 divides the reconstructed image frame of the image frame to be encoded into a plurality of second sub-images according to the same division manner as the image frame to be encoded, and calculates each of the first a mean square error between a sub-image and a second sub-image corresponding to each of the first sub-images, wherein a mean square error between the first sub-image and its corresponding second sub-image Represents the standard deviation of the first sub-image, Indicates the standard deviation of the second sub-image.
  • the calculating unit 102 After calculating the mean square error between each of the first sub-images and the second sub-image corresponding to each of the first sub-images, the calculating unit 102 first counts the average distortion of the first sub-image of each class.
  • D k ⁇ D i,j
  • the computing unit 102 then calculates the PSNR value of each class first sub-image based on the average distortion of each class first sub-image, and the PSNR value of each class first sub-image.
  • bpp represents the number of bits when the pixel point gray value is expressed in binary, as in the 0 to 255 gray level representation, the value of bpp is 8.
  • the target rate adjustment apparatus further includes a correction module configured to correct the adjusted target code rate based on the current network bandwidth to adopt the corrected target.
  • the code rate is encoded.
  • the network bandwidth in this embodiment refers to the network bandwidth available for encoding video transmission, and those skilled in the art may understand that the network bandwidth is dynamically changed, and the network bandwidth is also local resources.
  • the correction module acquires the currently available network bandwidth, and corrects the adjusted target code rate based on the acquired network bandwidth. , encoding with the corrected target bit rate.
  • the modifying module corrects the adjusted target code rate based on the current network bandwidth by determining whether the current network bandwidth is greater than the adjusted target code rate; and in the current network. When the bandwidth is less than the adjusted target code rate, the network bandwidth is used as the corrected target code rate.
  • the correction module uses the network bandwidth as the corrected target code rate to ensure that the encoded video data can be Smooth transfer to the target terminal.
  • the embodiment of the invention further provides a computer readable storage medium, wherein the computer readable storage medium stores computer executable instructions, and the computer executable instructions are implemented to implement a target rate adjustment method.
  • each module/unit in the above embodiment may be implemented in the form of hardware, for example, by implementing an integrated circuit to implement its corresponding function, or may be implemented in the form of a software function module, for example, executing a program stored in the memory by a processor. / instruction to achieve its corresponding function.
  • This application is not limited to any specific combination of hardware and software.
  • the above technical solution can more accurately reflect the bits required for encoding, and improve the accuracy of the selected target bit rate.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

L'invention concerne un procédé pour réguler un débit de codes cible. Le procédé pour réguler un débit de codes cible consiste : à diviser une trame d'image à coder en une pluralité de premières sous-images, à classifier la pluralité de premières sous-images selon une valeur de vecteur de mouvement de chacune des premières sous-images, et à calculer une valeur de rapport signal sur bruit de crête (PSNR) de chaque classe de premières sous-images ; à calculer une valeur moyenne de la valeur de PSNR de chaque classe de premières sous-images, à calculer l'écart-type de la valeur de PSNR de chaque classe de premières sous-images, et à corriger le rapport de la valeur moyenne sur l'écart-type par utilisation d'un paramètre de correction préétabli, puis à prendre le rapport comme la puissance de fluctuation du codage ; et à réguler un débit de codes cible pour le codage selon une relation d'amplitude entre la puissance de fluctuation et une première valeur de seuil préréglée, lorsque la puissance de fluctuation est supérieure à la première valeur de seuil préréglée, le débit de codes cible pour le codage étant accru, et lorsque la puissance de fluctuation est inférieure à la première valeur de seuil préréglée, le débit de codes cible pour le codage étant réduit. La solution technique peut améliorer la précision de sélection d'un débit de codes cible.
PCT/CN2017/073227 2016-02-15 2017-02-10 Procédé et dispositif pour réguler un débit de codes cible WO2017140230A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610086704.4A CN107087192A (zh) 2016-02-15 2016-02-15 目标码率调整方法及装置
CN201610086704.4 2016-02-15

Publications (1)

Publication Number Publication Date
WO2017140230A1 true WO2017140230A1 (fr) 2017-08-24

Family

ID=59614136

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/073227 WO2017140230A1 (fr) 2016-02-15 2017-02-10 Procédé et dispositif pour réguler un débit de codes cible

Country Status (2)

Country Link
CN (1) CN107087192A (fr)
WO (1) WO2017140230A1 (fr)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110166781A (zh) * 2018-06-22 2019-08-23 腾讯科技(深圳)有限公司 一种视频编码方法、装置和可读介质
CN111447339A (zh) * 2020-03-26 2020-07-24 西安万像电子科技有限公司 图像传输方法和系统
CN111787336A (zh) * 2019-04-04 2020-10-16 杭州海康威视数字技术股份有限公司 码率切换的方法和装置
CN113315773A (zh) * 2021-05-31 2021-08-27 浙江大华技术股份有限公司 一种码率调整方法、装置、电子设备及存储介质

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109344278B (zh) * 2018-09-25 2020-08-14 北京邮电大学 一种视觉搜索方法、装置及设备
CN111726656B (zh) * 2020-07-20 2022-07-26 有半岛(北京)信息科技有限公司 一种直播视频的转码方法、装置、服务器和存储介质
CN113099241B (zh) * 2021-03-31 2022-11-01 北京百度网讯科技有限公司 参考帧列表更新方法、装置、设备以及存储介质
CN114401404B (zh) * 2022-03-24 2022-07-26 深圳比特微电子科技有限公司 一种vpu编码策略的实现方法和装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020168007A1 (en) * 2001-04-19 2002-11-14 Sarnoff Corporation Apparatus and method for allocating bits temporaly between frames in a coding system
CN101094411A (zh) * 2007-07-03 2007-12-26 芯瀚电子技术(上海)有限公司 一种视频编码的码率控制方法
EP1944980A2 (fr) * 2007-01-10 2008-07-16 Koninklijke Philips Electronics N.V. Algorithme de contrôle de débit
CN102223524A (zh) * 2010-04-13 2011-10-19 中兴通讯股份有限公司 一种三维小波视频编码帧分组方法及装置
CN104065962A (zh) * 2013-10-28 2014-09-24 四川大学 基于视觉注意的宏块层比特分配优化方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020168007A1 (en) * 2001-04-19 2002-11-14 Sarnoff Corporation Apparatus and method for allocating bits temporaly between frames in a coding system
EP1944980A2 (fr) * 2007-01-10 2008-07-16 Koninklijke Philips Electronics N.V. Algorithme de contrôle de débit
CN101094411A (zh) * 2007-07-03 2007-12-26 芯瀚电子技术(上海)有限公司 一种视频编码的码率控制方法
CN102223524A (zh) * 2010-04-13 2011-10-19 中兴通讯股份有限公司 一种三维小波视频编码帧分组方法及装置
CN104065962A (zh) * 2013-10-28 2014-09-24 四川大学 基于视觉注意的宏块层比特分配优化方法

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110166781A (zh) * 2018-06-22 2019-08-23 腾讯科技(深圳)有限公司 一种视频编码方法、装置和可读介质
CN111787336A (zh) * 2019-04-04 2020-10-16 杭州海康威视数字技术股份有限公司 码率切换的方法和装置
CN111447339A (zh) * 2020-03-26 2020-07-24 西安万像电子科技有限公司 图像传输方法和系统
CN113315773A (zh) * 2021-05-31 2021-08-27 浙江大华技术股份有限公司 一种码率调整方法、装置、电子设备及存储介质

Also Published As

Publication number Publication date
CN107087192A (zh) 2017-08-22

Similar Documents

Publication Publication Date Title
WO2017140230A1 (fr) Procédé et dispositif pour réguler un débit de codes cible
US11277562B2 (en) Image stabilization using machine learning
EP3488388B1 (fr) Procédé et appareil de traitement de vidéo
US10499056B2 (en) System and method for video processing based on quantization parameter
US10856030B1 (en) Bitrate selection for video streaming
US20130027568A1 (en) Support vector regression based video quality prediction
US8401311B2 (en) Image processing device, method, and program
US20210409724A1 (en) Method and device for bitrate adjustment in encoding process
US9606636B2 (en) Optical processing apparatus, light source luminance adjustment method, and non-transitory computer readable medium thereof
US12022096B2 (en) Human visual system adaptive video coding
US10536696B2 (en) Image encoding device and image encoding method
CN111050169B (zh) 图像编码中量化参数的生成方法、装置及终端
US20050213843A1 (en) Method and apparatus for calculating moving-image correction-coefficient, moving-image correcting apparatus, and computer product
CN112655212A (zh) 视频编码优化方法、装置及计算机存储介质
CN109219960B (zh) 视频编码质量平滑度的优化方法、装置、设备及存储介质
US20230388511A1 (en) Perceptually-aware Online Per-title Encoding for Live Video Streaming
US20210075843A1 (en) Quality Metadata Signaling for Dynamic Adaptive Streaming of Video
US10897619B2 (en) Complexity control of video codec
US11825088B2 (en) Adaptively encoding video frames based on complexity
KR20140101745A (ko) 다수 아티팩트를 고려한 비디오 품질 측정
JP4410039B2 (ja) 動き検出装置および動き検出方法、画像処理装置
US11622118B2 (en) Determination of coding modes for video content using order of potential coding modes and block classification
US10848772B2 (en) Histogram-based edge/text detection
US20240357138A1 (en) Human visual system adaptive video coding
JP2018152793A (ja) 画像処理装置及びその制御方法、プログラム、記憶媒体

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17752662

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17752662

Country of ref document: EP

Kind code of ref document: A1