WO2021057478A1 - Video encoding and decoding method and related apparatus - Google Patents

Video encoding and decoding method and related apparatus

Info

Publication number
WO2021057478A1
Authority
WO
WIPO (PCT)
Prior art keywords
frame
video
resolution
decoding
decoded
Prior art date
Application number
PCT/CN2020/114165
Other languages
English (en)
French (fr)
Inventor
高欣玮
毛煦楠
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司
Publication of WO2021057478A1
Priority to US17/503,164 (US20220078454A1)

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/44008 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/132 Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • H04N19/159 Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • H04N19/172 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • H04N19/176 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • H04N19/59 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
    • H04N19/70 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • H04N21/440218 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4
    • H04N21/440263 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA

Definitions

  • This application relates to the field of audio and video coding and decoding, specifically, to video coding and decoding.
  • Under the condition of limited mobile bandwidth, a codec usually encodes and decodes every video frame at the same resolution, which makes the peak signal-to-noise ratio (PSNR) relatively low at some bandwidths.
  • the embodiments of the present application provide a video coding and decoding method and related devices, so as to at least solve the technical problem of low video coding and decoding efficiency caused by complicated processing operations provided by related technologies.
  • A video decoding method is provided. The method is executed by a video processing device and includes: determining a video frame to be decoded; obtaining, from the video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded, where the first number is the number of blocks using intra-frame decoding in the target reference frame, and the second number is the number of blocks using inter-frame decoding in the target reference frame; determining, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded; and decoding the video frame to be decoded at the target resolution.
  • A video encoding method is provided. The method is executed by a video processing device and includes: determining a video frame to be encoded; obtaining, from the video frames that have been encoded, a first number and a second number of a target reference frame associated with the video frame to be encoded, where the first number is the number of blocks using intra-frame coding in the target reference frame, and the second number is the number of blocks using inter-frame coding in the target reference frame; determining, according to the relationship between the first number and the second number, a target resolution for encoding the video frame to be encoded; and encoding the video frame to be encoded at the target resolution.
  • A video decoding device is provided, configured to: determine a video frame to be decoded; obtain, from the video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded, where the first number is the number of intra-decoded blocks in the target reference frame, and the second number is the number of inter-decoded blocks in the target reference frame; determine, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded; and decode the video frame to be decoded at the target resolution.
  • A video encoding device is provided, configured to: determine a video frame to be encoded; obtain, from the video frames that have been encoded, a first number and a second number of a target reference frame associated with the video frame to be encoded, where the first number is the number of intra-encoded blocks in the target reference frame, and the second number is the number of inter-encoded blocks in the target reference frame; determine, according to the relationship between the first number and the second number, a target resolution for encoding the video frame to be encoded; and encode the video frame to be encoded at the target resolution.
  • A computer-readable storage medium is provided. The storage medium stores a computer program, and the computer program is configured to execute, when run, the video decoding method or the video encoding method in the above aspects.
  • A video processing device is provided, including a processor, a communication interface, a memory, and a communication bus, where:
  • the processor, the communication interface, and the memory communicate with one another through the communication bus;
  • the communication interface is an interface of a communication module;
  • the memory is used for storing program code and transmitting the program code to the processor; and the processor is used for calling the instructions of the program code in the memory to execute the video decoding method or the video encoding method in the above aspects.
  • a computer program product including instructions, which when run on a computer, causes the computer to execute the video decoding method or video encoding method in the above aspect.
  • The first number and the second number of the target reference frame associated with the video frame to be decoded may be obtained from the video frames that have been decoded, where the first number is the number of blocks using intra-frame decoding in the target reference frame, and the second number is the number of blocks using inter-frame decoding in the target reference frame.
  • Fig. 1 is a schematic diagram of an application environment of an optional video decoding method according to an embodiment of the present application.
  • Fig. 2 is a flowchart of an optional video decoding method according to an embodiment of the present application.
  • Fig. 3 is a schematic diagram of an optional video decoding method according to an embodiment of the present application.
  • Fig. 4 is a schematic diagram of another optional video decoding method according to an embodiment of the present application.
  • Fig. 5 is a schematic diagram of yet another optional video decoding method according to an embodiment of the present application.
  • Fig. 6 is a schematic diagram of yet another optional video decoding method according to an embodiment of the present application.
  • Fig. 7 is a flowchart of an optional video encoding method according to an embodiment of the present application.
  • Fig. 8 is a schematic diagram of an optional video encoding and decoding process according to an embodiment of the present application.
  • Fig. 9 is a schematic structural diagram of an optional video decoding device according to an embodiment of the present application.
  • Fig. 10 is a schematic structural diagram of an optional video encoding device according to an embodiment of the present application.
  • Fig. 11 is a schematic structural diagram of an optional electronic device according to an embodiment of the present application.
  • Fig. 12 is a schematic structural diagram of another optional electronic device according to an embodiment of the present application.
  • a video decoding method is provided.
  • The above-mentioned video decoding method may be, but is not limited to being, applied to the application environment shown in FIG. 1.
  • The application environment includes a terminal 102 and a server 104, and the terminal 102 and the server 104 communicate through a network.
  • The terminal 102 may be, but is not limited to, a smartphone, a tablet computer, a notebook computer, or a desktop computer.
  • the aforementioned server 104 may be, but is not limited to, a computer processing device with strong data processing capability and a certain storage space.
  • the video decoding method and video encoding method provided by the embodiments of the present application may be executed by a video processing device that has a video encoding and decoding function, for example, the terminal 102 or the server 104 described above.
  • The video encoding method corresponding to the above-mentioned video decoding method may also be, but is not limited to being, applied to the application environment shown in FIG. 1.
  • The video encoding method provided in this application can be used, but is not limited to being used, in the interaction between the terminal 102 and the server 104 shown in FIG. 1 to adaptively decide the resolution used when each video frame to be encoded is encoded, so that different resolutions are used for different video frames. This not only saves transmission bandwidth, but also ensures the encoding quality of the video frames and avoids distortion.
  • The video decoding method provided in this application can also be used, but is not limited to being used, in the interaction between the terminal 102 and the server 104 shown in FIG. 1 to adaptively decide the resolution used when each video frame to be decoded is decoded, so that different resolutions are used for different video frames. This ensures the decoding quality of the video frames and avoids distortion.
  • the terminal 102 may include but is not limited to the following components: an image processing unit 1021, a processor 1022, a storage medium 1023, a memory 1024, a network interface 1025, a display screen 1026, and an input device 1027.
  • The above-mentioned components can be, but are not limited to being, connected via the system bus 1028.
  • The image processing unit 1021 is used to provide at least the rendering capability of the display interface, and the processor 1022 is used to provide calculation and control capabilities to support the operation of the terminal 102.
  • The storage medium 1023 stores an operating system 1023-2 and a video encoder and/or video decoder 1023-4.
  • the operating system 1023-2 is used to provide control operation instructions, and the video encoder and/or video decoder 1023-4 is used to perform encoding/decoding operations according to the control operation instructions.
  • The memory 1024 provides an operating environment for the video encoder and/or video decoder 1023-4 in the storage medium 1023, and the network interface 1025 is used for network communication with the network interface 1043 in the server 104.
  • The display screen 1026 is used to display application interfaces, such as decoded video; the input device 1027 is used to receive commands or data input by the user.
  • The display screen 1026 and the input device 1027 may be touch screens.
  • A specific terminal or server may include more or fewer components than shown in the figure, may combine some components, or may have a different arrangement of components.
  • the aforementioned server 104 may include but is not limited to the following components: a processor 1041, a memory 1042, a network interface 1043, and a storage medium 1044.
  • The above-mentioned components can be, but are not limited to being, connected via the system bus 1045.
  • the aforementioned storage medium 1044 includes an operating system 1044-1, a database 1044-2, a video encoder and/or a video decoder 1044-3.
  • the above-mentioned processor 1041 is used to provide computing and control capabilities to support the operation of the server 104.
  • the memory 1042 provides an environment for the operation of the video encoder and/or the video decoder 1044-3 in the storage medium 1044.
  • the network interface 1043 communicates with the network interface 1025 of the external terminal 102 through a network connection.
  • the operating system 1044-1 in the aforementioned storage medium is used to provide control operation instructions; the video encoder and/or video decoder 1044-3 is used to perform encoding/decoding operations according to the control operation instructions; and the database 1044-2 is used to store data.
  • The internal structure of the server shown in FIG. 1 is only a block diagram of the part of the structure related to the solution of this application, and does not constitute a limitation on the computer equipment to which the solution is applied. Specific computer equipment may have a different layout of components.
  • the aforementioned network may include, but is not limited to, a wired network.
  • the above-mentioned wired network may include, but is not limited to: a wide area network, a metropolitan area network, and a local area network.
  • A video decoding method is provided. As shown in FIG. 2, the method includes:
  • S204 Obtain a first number and a second number of a target reference frame associated with the video frame to be decoded from the video frames that have been decoded, where the first number is the number of blocks using intra-frame decoding in the target reference frame, and the second number is the number of blocks using inter-frame decoding in the target reference frame.
  • S206 Determine a target resolution for decoding the video frame to be decoded according to the relationship between the first number and the second number.
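  • As a minimal sketch of how the first number and the second number in S204 might be obtained, the following assumes that a decoder exposes, for each block of a decoded reference frame, whether it was intra- or inter-decoded. The `block_modes` list and the `"intra"` / `"inter"` labels are hypothetical; the text does not specify a data layout.

```python
from collections import Counter


def count_block_modes(block_modes):
    """Return (first_number, second_number) for one decoded reference frame.

    block_modes is a hypothetical per-block list of "intra" / "inter" labels.
    """
    counts = Counter(block_modes)
    # first number: intra-decoded blocks; second number: inter-decoded blocks
    return counts["intra"], counts["inter"]


# Illustrative reference frame made of six blocks.
reference_frame_modes = ["intra", "inter", "inter", "intra", "inter", "inter"]
first_number, second_number = count_block_modes(reference_frame_modes)
print(first_number, second_number)
```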
  • The video decoding method shown in FIG. 2 can be, but is not limited to being, used in the video decoder shown in FIG. 1. The decoding of the video frame to be decoded is completed through the interaction of the video decoder with other components.
  • The above-mentioned video decoding method can be, but is not limited to being, applied to application scenarios such as video playback applications, video sharing applications, or video session applications.
  • The video transmitted in the above application scenarios may include, but is not limited to, long video and short video. For example, a long video may be an episode with a long playing time (for example, a playing time greater than 10 minutes) or the picture displayed in a long video session; a short video may be a voice message exchanged between two or more parties, or a video with a short playing time (for example, a playing time less than or equal to 30 seconds) displayed on a sharing platform.
  • The video decoding method provided in this embodiment can be, but is not limited to being, applied to the playback device used to play video in the above application scenarios.
  • After the video frames that have been decoded are obtained, the target resolution used by the video frame to be decoded is adaptively determined based on the relationship between the first number of blocks using intra-frame decoding in the target reference frame and the second number of blocks using inter-frame decoding in the target reference frame.
  • The operation of determining the target resolution adopted by the video frame to be decoded is thereby simplified, which overcomes the problem of low video decoding efficiency in the related art and improves decoding efficiency.
  • In other words, the first number and the second number of the target reference frame associated with the video frame to be decoded are obtained from the video frames that have been decoded, the relationship between the first number and the second number is used to determine the target resolution for decoding the video frame to be decoded, and the target resolution is used to decode the video frame to be decoded. In this way, different resolutions can be adaptively selected for different video frames to be decoded.
  • The target reference frame may be, but is not limited to being, determined from the decoded video frames, and the decoding mode of the reference frame is determined in the following manner:
  • Determining the target resolution for decoding the video frame to be decoded according to the relationship between the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding may include, but is not limited to: obtaining the ratio between the first number and the second number, and determining the target resolution according to the result of comparing the ratio with a predetermined threshold. For example, when the ratio is greater than the predetermined threshold, the first resolution is used to decode the video frame to be decoded; when the ratio is less than the predetermined threshold, the second resolution is used to decode the video frame to be decoded, where the second resolution is greater than the first resolution. In other words, the larger the number of inter-frame decoded blocks and the smaller the ratio, the higher the resolution used.
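  • The ratio rule above can be sketched as follows. This is an illustration under stated assumptions rather than the patented implementation: the function name, the example threshold, and the example resolutions are hypothetical, and the handling of a ratio exactly equal to the threshold and of a reference frame without inter-decoded blocks is not specified by the text.

```python
def choose_resolution_by_ratio(first_number, second_number, threshold,
                               first_resolution, second_resolution):
    """Pick the target resolution from the intra/inter block-count ratio.

    ratio > threshold -> first_resolution
    ratio < threshold -> second_resolution (the greater of the two)

    Assumptions not stated in the text: a ratio equal to the threshold is
    treated like the "less than" case, and a reference frame without any
    inter-decoded blocks is treated as having an infinitely large ratio.
    """
    if second_number == 0:
        ratio = float("inf")
    else:
        ratio = first_number / second_number
    return first_resolution if ratio > threshold else second_resolution


# Hypothetical values for illustration only.
LOWER = (640, 360)    # first resolution
HIGHER = (1280, 720)  # second resolution, greater than the first
THRESHOLD = 0.5

print(choose_resolution_by_ratio(30, 10, THRESHOLD, LOWER, HIGHER))  # ratio 3.0
print(choose_resolution_by_ratio(2, 98, THRESHOLD, LOWER, HIGHER))   # ratio ~0.02
```

  • With these values, a reference frame dominated by intra-decoded blocks selects the first resolution, and one dominated by inter-decoded blocks selects the second.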
  • Determining the target resolution for decoding the video frame to be decoded according to the relationship between the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding may also include, but is not limited to: obtaining the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the reference frame. When the first number is greater than the second number and the difference between the two is greater than a predetermined threshold, the first resolution is used to decode the video frame to be decoded. The second resolution is used to decode the video frame to be decoded, where the second resolution is greater than the first resolution.
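  • A hedged sketch of the difference-based variant follows. The text states the condition for the first resolution only; this sketch assumes that every other case falls back to the second resolution, which is an assumption and not something the text spells out. The function name and example values are hypothetical.

```python
def choose_resolution_by_difference(first_number, second_number, threshold,
                                    first_resolution, second_resolution):
    """Pick the target resolution from the intra/inter block-count difference."""
    if first_number > second_number and first_number - second_number > threshold:
        return first_resolution
    # Assumption: all remaining cases use the second (greater) resolution.
    return second_resolution


# Hypothetical values for illustration only.
print(choose_resolution_by_difference(80, 20, 30, (640, 360), (1280, 720)))
print(choose_resolution_by_difference(55, 45, 30, (640, 360), (1280, 720)))
```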
  • The above-mentioned first resolution may include, but is not limited to, one of the following: the original resolution of the video frame to be decoded, or the highest resolution obtained by up-sampling the video frame to be decoded.
  • The second resolution may include, but is not limited to, the resolution obtained by down-sampling the first resolution according to a predetermined sampling ratio.
  • The sampling ratio can be, but is not limited to being, determined based on the result of comparing the ratio with different thresholds.
  • The different thresholds here can be, but are not limited to, a set of thresholds taken from the numerical interval below the predetermined threshold.
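  • One way to read the threshold set is as a tiered lookup: the ratio is compared against several thresholds below the predetermined threshold, and each tier maps to a sampling ratio. The sketch below is only an illustration; the threshold values, the sampling ratios, and the direction of the mapping (a smaller ratio keeps more resolution) are all assumptions.

```python
from bisect import bisect_right
from fractions import Fraction

# Hypothetical values: the text gives neither thresholds nor sampling ratios.
PREDETERMINED_THRESHOLD = 1.0
SUB_THRESHOLDS = [0.25, 0.5]  # ascending, all below PREDETERMINED_THRESHOLD
SAMPLING_RATIOS = [Fraction(3, 4), Fraction(2, 3), Fraction(1, 2)]  # one per tier


def pick_sampling_ratio(ratio):
    """Map a ratio below the predetermined threshold to a sampling ratio."""
    if ratio >= PREDETERMINED_THRESHOLD:
        raise ValueError("ratio is not below the predetermined threshold")
    return SAMPLING_RATIOS[bisect_right(SUB_THRESHOLDS, ratio)]


def downsample_resolution(resolution, sampling_ratio):
    """Scale a (width, height) pair by the sampling ratio, truncating to int."""
    width, height = resolution
    return int(width * sampling_ratio), int(height * sampling_ratio)


print(downsample_resolution((1920, 1080), pick_sampling_ratio(0.3)))
```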
  • The above-mentioned target reference frame may include, but is not limited to, one of the following: a single reference frame among the video frames that have been decoded, or multiple reference frames among the video frames that have been decoded.
  • The multiple reference frames may include, but are not limited to, multiple consecutive video frames or multiple non-consecutive video frames among the video frames that have been decoded.
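  • When the target reference frame consists of several frames, the two block counts have to be combined in some way. The text does not say how; the sketch below simply sums the per-frame counts, which is an assumption made for illustration, as are the function name and the sample counts.

```python
def aggregate_counts(reference_frames):
    """Sum (intra, inter) block counts over one or more reference frames.

    reference_frames is a hypothetical list of (first_number, second_number)
    pairs, one pair per decoded reference frame. Summation is an assumption;
    the text does not specify how multiple frames are combined.
    """
    first_number = sum(intra for intra, _ in reference_frames)
    second_number = sum(inter for _, inter in reference_frames)
    return first_number, second_number


# A single reference frame, then three (consecutive or not) reference frames.
print(aggregate_counts([(10, 90)]))
print(aggregate_counts([(10, 90), (30, 70), (5, 95)]))
```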
  • adaptively determining the target resolution adopted by the current video frame to be decoded may include, but is not limited to, the following methods:
  • As agreed in the communication standard, the encoding end and the decoding end can use the same method to determine the resolution to be adopted at both ends.
  • The decoding flag bit is used to indicate the target resolution used when decoding the video frame to be decoded.
  • According to the agreement, the decoding device at the decoding end can adaptively decide the target resolution of the video frame to be decoded in the same way as the encoding device, where the target resolution can be determined based on the result of comparing the ratio between the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding with a predetermined threshold.
  • Using the target resolution to decode the video frame to be decoded may include, but is not limited to: after the target resolution to be used for the video frame to be decoded is adaptively determined, decoding the video frame to be decoded at the target resolution using related technologies, as follows:
  • S1 Determine the current decoding mode corresponding to the video frame to be decoded. For example, a flag bit at a preset position in the code stream is read to obtain the current encoding mode information, from which it is determined whether the current decoding mode is the full-resolution mode or the down-sampling mode.
  • The current decoding mode is the same as the current encoding mode.
  • S2 Determine the processing reference frame according to the resolution of the current target reference frame corresponding to the video frame to be decoded and the target resolution of the video frame to be decoded obtained by decision. Specifically, it is determined whether the resolution of the current target reference frame is the same as the target resolution; if they are the same, the current target reference frame is directly used as the processing reference frame; if they are different, step S3 is executed.
  • the preset sampling rule used in decoding is the same as the preset sampling rule used in encoding.
  • The preset sampling rule of the encoding end may be, but is not limited to being, related to the resolution of the currently encoded video frame and to the distance between the current target reference frame and the currently encoded video frame. For example, when the distance D between the current target reference frame and the currently encoded video frame exceeds a preset threshold, 1/M down-sampling is adopted; otherwise, 1/N down-sampling is adopted, where M < N.
  • The decoding end can also use the same method to perform sampling, which speeds up sampling and thereby increases the overall decoding speed. It should be noted that the value of M may change with the distance D, which further improves the flexibility of the sampling parameters.
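  • The distance-based choice between 1/M and 1/N down-sampling can be sketched as below. The function name, the distance threshold, and the values M = 2 and N = 4 are hypothetical and chosen only for illustration; the distance is taken here as a frame-count difference, which the text does not specify.

```python
def pick_downsampling_factor(distance, distance_threshold, m, n):
    """Return the denominator of the down-sampling ratio (M or N).

    A reference frame farther away than the threshold uses 1/M;
    otherwise 1/N is used.
    """
    return m if distance > distance_threshold else n


# Hypothetical parameters for illustration only.
M, N, DISTANCE_THRESHOLD = 2, 4, 3

for d in (1, 3, 5):
    factor = pick_downsampling_factor(d, DISTANCE_THRESHOLD, M, N)
    print(f"distance {d}: 1/{factor} down-sampling")
```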
  • S3 The current target reference frame is sampled to obtain a processing reference frame with the same resolution as the current decoded video frame.
  • the sampling of the current target reference frame includes up-sampling and down-sampling: if the resolution of the current target reference frame is greater than the resolution of the current decoded video frame, the current target reference frame is down-sampled to obtain the processing reference frame; if the resolution of the current target reference frame is smaller than the resolution of the current decoded video frame, the current target reference frame is up-sampled to obtain the processing reference frame.
  • each current target reference frame is sampled according to the resolution of that reference frame and the resolution of the current decoded video frame, to obtain a processing reference frame with the same resolution as the current decoded video frame.
  • the sampling algorithm for sampling the current target reference frame matches the sampling algorithm for down-sampling the reconstructed video frame to obtain the decoded video frame. That is, if the target reference frame is down-sampled, the down-sampling algorithm is the same as the down-sampling algorithm applied to the reconstructed video frame to obtain the decoded video frame; if the target reference frame is up-sampled, the up-sampling algorithm is the matched inverse of that down-sampling algorithm.
  • S4 Decode the currently decoded video frame according to the aforementioned processing reference frame to obtain a reconstructed video frame. Specifically, the prediction residual in the code stream is decoded and superimposed on the corresponding prediction pixels of the processing reference frame to obtain the reconstructed video frame.
  • S5 Process the reconstructed video frame according to the current decoding mode to obtain the decoded video frame. If the current decoding mode is the down-sampling mode, the reconstructed video frame is up-sampled to obtain the decoded video frame; if the current encoding mode information indicates the full-sampling mode, the reconstructed video frame is determined to be the decoded video frame.
  • the algorithm for up-sampling the reconstructed video frame and the algorithm for down-sampling the to-be-encoded video frame by the encoding end to obtain the current encoded video frame are matched opposite sampling algorithms.
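The decoding steps S2 to S5 above can be sketched as follows. This is only a minimal illustration under stated assumptions, not the implementation described in this application: frames are plain 2-D lists of integers, `resample` uses nearest-neighbour sampling as a stand-in for whatever matched sampling algorithm the codec actually uses, prediction is simplified to the co-located pixel of the processing reference frame, and all function names are hypothetical.

```python
def resample(frame, width, height):
    """Nearest-neighbour resampling of a 2-D list to (width, height).
    Stand-in for the codec's matched up-/down-sampling algorithm."""
    src_h, src_w = len(frame), len(frame[0])
    return [
        [frame[y * src_h // height][x * src_w // width] for x in range(width)]
        for y in range(height)
    ]


def processing_reference(ref_frame, target_w, target_h):
    """S2/S3: reuse the reference frame if it already has the target
    resolution, otherwise up- or down-sample it to match."""
    if len(ref_frame) == target_h and len(ref_frame[0]) == target_w:
        return ref_frame
    return resample(ref_frame, target_w, target_h)


def decode_frame(residual, ref_frame, downsampled, full_w, full_h):
    """S4/S5: add the residual to the (co-located) prediction, then
    up-sample the reconstruction if the down-sampling mode was used."""
    h, w = len(residual), len(residual[0])
    pred = processing_reference(ref_frame, w, h)
    recon = [[pred[y][x] + residual[y][x] for x in range(w)] for y in range(h)]
    return resample(recon, full_w, full_h) if downsampled else recon
```

In this sketch the size of the residual stands for the target resolution, so a 2x2 residual decoded against a 4x4 reference first down-samples the reference, and the reconstruction is then up-sampled back to 4x4 when `downsampled` is true.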
  • each pair of decoding blocks in the at least one pair of decoding blocks includes a first decoded block with the first resolution and a second decoded block with the second resolution, the first decoded block and the second decoded block being adjacent decoded blocks; the first resolution of the first decoded block is adjusted to the target resolution, and the second resolution of the second decoded block is adjusted to the target resolution; a first edge pixel set is determined from the first decoded block, and a second edge pixel set is determined from the second decoded block, where the position of the first edge pixel set is adjacent to the position of the second edge pixel set; the first edge pixel set is filtered to obtain a filtered first edge pixel set, and the second edge pixel set is filtered to obtain a filtered second edge pixel set, wherein the pixel value of the i-th pixel
  • the above-mentioned method of adjusting the target resolution includes one of the following:
  • the first resolution is adjusted to the third resolution, and the second resolution is adjusted to the third resolution, where the third resolution is different from the first resolution and different from the second resolution.
  • for the video frame to be decoded, the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the associated target reference frame can be obtained from the video frames that have been decoded; the relationship between the first number and the second number is used to determine the target resolution for decoding the video frame to be decoded, and the video frame to be decoded is decoded at the target resolution.
  • in this way, different resolutions can be adaptively selected for different video frames to be decoded, which simplifies the operation of determining the target resolution used for the video frame to be decoded, thereby overcoming the problem of low video decoding efficiency in related technologies and achieving the effect of improving decoding efficiency.
  • determining the target resolution for decoding the video frame to be decoded according to the relationship between the first number and the second number includes:
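  • when the ratio of the first number to the second number is greater than a predetermined threshold, the video frame to be decoded is decoded using the first resolution;
  • when the ratio of the first number to the second number is less than the predetermined threshold, the video frame to be decoded is decoded using the second resolution, where the second resolution is greater than the first resolution.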
  • the above-mentioned second resolution may include but is not limited to one of the following: the original resolution of the video frame to be decoded, and the highest resolution obtained by up-sampling the video frame to be decoded.
  • the above-mentioned first resolution may include, but is not limited to, the resolution obtained by down-sampling the second resolution according to a predetermined sampling ratio.
  • the sampling ratio can be, but is not limited to, determined based on the comparison result of the ratio and different thresholds.
  • the different thresholds here can be, but are not limited to, a set of thresholds determined from a numerical interval greater than a predetermined threshold.
  • before using the first resolution to decode the video frame to be decoded, the method further includes:
  • a set of thresholds is determined from a numerical interval greater than the predetermined threshold; the ratio of the first number to the second number is compared with each threshold in the set; the sampling ratio is determined according to the comparison result; and the second resolution is down-sampled according to the sampling ratio to determine the first resolution.
  • determining the sampling ratio according to the comparison result includes: when the ratio of the first number to the second number is determined to lie in a target interval, obtaining the target sampling ratio configured for the target interval, where the target interval is bounded by an adjacent first threshold and second threshold in the set of thresholds, and the first threshold is less than the second threshold; down-sampling the second resolution according to the sampling ratio includes: down-sampling the width and height of the second resolution according to the target sampling ratio to obtain the first resolution.
  • assume that the set of thresholds determined from a numerical interval greater than the predetermined threshold includes A2 to A5, where A2 < A3 < A4 < A5. It is further assumed that the ratio of the first number M of blocks using intra-frame decoding to the second number N of blocks using inter-frame decoding in the reference frame is r.
  • the determination process can be as follows:
  • when r is determined to lie between A2 and A3, the sampling ratio can be determined to be 3/4 of the width and height of the high resolution R, and the high resolution R is sampled according to this sampling ratio to obtain the resolution R' used for decoding (such as the first resolution);
  • when r is determined to lie between A3 and A4, the sampling ratio can be determined to be 2/3 of the width and height of the high resolution R, and the high resolution R is sampled according to this sampling ratio to obtain the resolution R' used for decoding (such as the first resolution);
  • when r is determined to lie between A4 and A5, the sampling ratio can be determined to be 1/3 of the width and height of the high resolution R, and the high resolution R is sampled according to this sampling ratio to obtain the resolution R' used for decoding (such as the first resolution).
  • the above sampling ratios are examples, and the intervals corresponding to different thresholds can be set to different values. The value of the sampling ratio and the correspondence between the sampling ratio and the thresholds are not limited.
  • the ratio between the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame is obtained, and the resolution matching the current video frame to be decoded is adaptively selected based on the result of comparing this ratio with a predetermined threshold. This improves the flexibility of the resolution decision and does not require a complicated decision-making process, so that the resolution to be used can be quickly determined for different video frames during decoding, which improves decoding efficiency.
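The threshold comparison described above can be sketched in a few lines. This is a hedged illustration, not the method as claimed: the numeric values of A1 to A5 are hypothetical (the text gives none), the intervals are treated as half-open, and values of r that fall between A1 and A2 or above A5 are clamped to the nearest interval, a case the text does not address. Only the example sampling ratios 3/4, 2/3, and 1/3 come from the text.

```python
from bisect import bisect_right
from fractions import Fraction

# Hypothetical threshold values; the source gives no concrete numbers.
A1 = Fraction(1, 4)
A2_TO_A5 = [Fraction(1, 2), Fraction(1), Fraction(2), Fraction(4)]
# Example sampling ratios from the text, one per interval between thresholds.
SAMPLING_RATIOS = [Fraction(3, 4), Fraction(2, 3), Fraction(1, 3)]


def sampling_ratio(intra_blocks, inter_blocks):
    """Return the width/height scale factor chosen from the ratio r."""
    if inter_blocks == 0:
        # Assumption: with no inter blocks, treat r as above every threshold.
        return SAMPLING_RATIOS[-1]
    r = Fraction(intra_blocks, inter_blocks)
    if r < A1:
        return Fraction(1)  # use the high (second) resolution R unchanged
    # Index of the half-open interval containing r, clamped so that values
    # outside A2..A5 reuse the nearest interval (an assumption).
    i = bisect_right(A2_TO_A5, r) - 1
    return SAMPLING_RATIOS[min(max(i, 0), len(SAMPLING_RATIOS) - 1)]


def target_resolution(width, height, intra_blocks, inter_blocks):
    """Scale the width and height of R by the chosen sampling ratio."""
    s = sampling_ratio(intra_blocks, inter_blocks)
    return int(width * s), int(height * s)
```

With these hypothetical thresholds, a 1920x1080 frame whose reference frame has 30 intra blocks and 10 inter blocks gives r = 3, which selects the 1/3 ratio and a 640x360 decoding resolution.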
  • obtaining the first number and the second number of target reference frames associated with the video frame to be decoded from the video frames that have been decoded includes:
  • S1 Determine one reference frame from the decoded video frames as the target reference frame;
  • S2 Obtain the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame.
  • the above step S1, determining the target reference frame from the decoded video frames, includes: when the video frame to be decoded is the t-th video frame, determining the t-kth video frame as the target reference frame; the above step S2, obtaining the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame, includes: obtaining, from the t-kth video frame, the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding, where k is a predetermined positive integer, t is a positive integer, and t>k.
  • for example, assume that the video frame to be decoded is the t-th video frame and that the t-kth video frame is the target reference frame. From the target reference frame, the first number M1 of blocks using intra-frame decoding (blocks marked with diagonal lines as shown in FIG. 3) and the second number of blocks using inter-frame decoding (as shown in FIG. 3) are obtained, and the ratio r of the first number to the second number is determined.
  • the ratio r is then compared with a predetermined threshold A1: if r is less than A1, the second resolution R is used for decoding; otherwise, a set of thresholds is obtained from a numerical interval greater than A1. Assume that the set of thresholds includes A2 to A5, where A2 < A3 < A4 < A5, and that the comparison determines that r lies between A4 and A5. The sampling ratio can then be determined to be 1/3 of the width and height of the high resolution R (such as the second resolution), and the high resolution R is sampled according to this sampling ratio to obtain the resolution R' used for decoding (such as the first resolution). The resolution R' is then used to decode the t-th video frame.
  • the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame are obtained, and the ratio between the two is used to adaptively select the target resolution for decoding the current video frame to be decoded. In this way, the resolution to be used for the video frame to be decoded can be determined quickly, which overcomes the problem of low decoding efficiency caused by the complicated resolution decision in related technologies.
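As a small illustration of the single-reference-frame case, the sketch below counts intra-decoded and inter-decoded blocks in the t-kth frame. The data layout is an assumption made for the example: each decoded frame is represented only by a list of per-block mode strings, frames are indexed from 0, and the function names are hypothetical.

```python
def count_block_modes(block_modes):
    """Return (intra_count, inter_count) for one frame's block modes."""
    intra = sum(1 for mode in block_modes if mode == "intra")
    inter = sum(1 for mode in block_modes if mode == "inter")
    return intra, inter


def reference_counts(decoded_modes, t, k):
    """Counts for the single target reference frame t-k.

    decoded_modes[i] holds the block modes of decoded frame i (0-based,
    an assumption of this sketch); k is the agreed positive offset.
    """
    if k <= 0 or t - k < 0 or t - k >= len(decoded_modes):
        raise ValueError("reference frame t-k is not available")
    return count_block_modes(decoded_modes[t - k])
```

The two counts returned here play the roles of the first number and the second number whose ratio drives the resolution decision.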
  • obtaining the first number and the second number of target reference frames associated with the video frame to be decoded from the video frames that have been decoded includes:
  • S1 Determine a multi-frame reference frame from the decoded video frames as the target reference frame;
  • S2 Obtain the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame.
  • determining a multi-frame reference frame from the decoded video frames as the target reference frame includes: determining continuous multi-frame video frames or discontinuous multi-frame video frames from the decoded video frames as the target reference frame.
  • the above-mentioned target reference frame may be continuous reference frames. As shown in FIG. 4, the multiple continuous video frames from the t-k1th video frame to the t-k2th video frame are determined as the target reference frame of the video frame to be decoded, and the first number M1 of blocks using intra-frame decoding (blocks marked by slashes as shown in FIG. 4) and the second number M2 of blocks using inter-frame decoding (blocks marked by dots as shown in FIG. 4) are obtained from them.
  • the above-mentioned target reference frame may also be discontinuous reference frames. As shown in FIG. 5, multiple discrete video frames are selected from the t-k1th video frame to the t-k2th video frame, such as a group of video frames including the t-k1th video frame and a group of video frames including the t-k2th video frame, and are determined as the target reference frame of the video frame to be decoded; the first number M1 of blocks using intra-frame decoding (blocks marked by diagonal lines as shown in FIG. 5) and the second number M2 of blocks using inter-frame decoding (blocks marked by dots as shown in FIG. 5) are obtained from them.
  • the total number of blocks in each target reference frame can be the same value, such as N (as shown in FIGS. 4-5), or different values, such as N1, N2... (not shown in FIGS. 4-5); this embodiment does not make any limitation on this.
  • parameters k1 and k2 may be, but are not limited to, parameters agreed in advance with the encoding device of the encoding end, and may also be parameters determined by parsing identification bits at designated positions in the code stream. There is no limitation on this in this embodiment.
  • the above step S1, determining the target reference frame from the decoded video frames, includes: when the video frame to be decoded is the t-th video frame, determining the t-k1th video frame to the t-k2th video frame as the target reference frame; the above step S2, acquiring the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame, includes: obtaining, from the t-k1th video frame to the t-k2th video frame, the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding, where k1 and k2 are predetermined positive integers, t is a positive integer, and t>k1>k2.
  • for example, the target reference frame is the continuous multi-frame video frames from the t-1th video frame to the t-2th video frame, where the total number of blocks in each target reference frame is N.
  • from the second number M2 of blocks using intra-frame decoding (blocks marked with diagonal lines as shown in Figure 6) and the first number N1 of blocks using inter-frame decoding, the ratio r can be determined by, but is not limited to, the following formula:
  • the relationship between the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame is obtained and used to adaptively select the target resolution for decoding the current video frame to be decoded. In this way, the resolution to be used for the video frame to be decoded can be determined quickly, which overcomes the problem of low decoding efficiency caused by the complicated resolution decision in related technologies.
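For the multi-frame case, one plausible reading, consistent with the statement elsewhere in this document that the first and second numbers are totals over the multi-frame reference frame, is to sum the counts over the selected frames before taking the ratio. The sketch below follows that reading and is not a reproduction of the formula referred to above. The `step` parameter, used to model a discontinuous selection, the per-frame list-of-modes layout, and the handling of a zero inter count are all assumptions of this example.

```python
from fractions import Fraction


def multi_frame_ratio(decoded_modes, t, k1, k2, step=1):
    """Ratio of total intra blocks to total inter blocks over the frames
    t-k1 .. t-k2 (inclusive), taking every `step`-th frame.

    step=1 models continuous reference frames; step>1 is one hypothetical
    way to pick discontinuous frames from the same range.
    """
    if not (t > k1 > k2 > 0):
        raise ValueError("expected t > k1 > k2 > 0")
    selected = range(t - k1, t - k2 + 1, step)
    intra = sum(decoded_modes[i].count("intra") for i in selected)
    inter = sum(decoded_modes[i].count("inter") for i in selected)
    # Assumption: the ratio is undefined when no inter blocks exist.
    return Fraction(intra, inter) if inter else None
```

For instance, with t = 4, k1 = 3, and k2 = 1, frames 1 to 3 contribute to the totals when `step` is 1, and only frames 1 and 3 contribute when `step` is 2.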
  • determining the target resolution also includes:
  • the encoding end may also directly write the target resolution determined by the encoding device according to the above-mentioned decision-making method into the code stream in the form of identification bits. Then, the decoding device at the decoding end can directly obtain the determined target resolution after obtaining the above-mentioned decoding flag from the code stream.
  • the decoding flag can be indicated by, but is not limited to, the following syntax elements:
  • the syntax element is used to indicate the relationship between the third resolution and the fourth resolution, where the third resolution is the resolution used for decoding the video frame to be decoded, the fourth resolution is the resolution used for decoding the reference frame of the video frame to be decoded, and the multiple resolutions used for decoding multiple video frames to be decoded include at least two different resolutions; the third resolution corresponding to each region is determined according to the syntax element and the fourth resolution.
  • in this way, the corresponding peak signal-to-noise ratio is relatively large and the distortion is relatively small, which ensures that the peak signal-to-noise ratio changes only within a small range while remaining relatively large, thereby achieving the technical effect of avoiding large fluctuations in the peak signal-to-noise ratio of video encoding and decoding.
  • the target resolution is obtained directly by decoding the identification bits, which simplifies processing at the decoding end, reduces the transmission overhead and the decoding complexity of the decoding end, and thereby achieves the effect of improving decoding efficiency.
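The idea of deriving the frame resolution from a syntax element plus the reference frame's resolution can be illustrated as below. The code table is entirely hypothetical: the text does not define the values of the syntax element or the relationships they encode, so `SCALE_TABLE` and `resolution_from_syntax` are assumptions made only to show the mechanism.

```python
from fractions import Fraction

# Hypothetical code table: syntax element value -> (resolution of the frame
# to be decoded) / (resolution of its reference frame).
SCALE_TABLE = {
    0: Fraction(1),      # same resolution as the reference frame
    1: Fraction(1, 2),   # half the reference width and height
    2: Fraction(2),      # twice the reference width and height
}


def resolution_from_syntax(element, ref_width, ref_height):
    """Derive the third resolution from the syntax element and the fourth
    (reference-frame) resolution."""
    scale = SCALE_TABLE[element]
    return int(ref_width * scale), int(ref_height * scale)
```

Because each frame's resolution is expressed relative to its reference frame, a decoder following this sketch would only need to parse the element and apply the table, rather than repeat the resolution decision.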
  • a video encoding method is provided. As shown in FIG. 7, the method is executed by a video processing device, and the method includes:
  • S702 Determine a video frame to be encoded.
  • S704 Obtain a first number and a second number of target reference frames associated with the video frame to be encoded from the video frames that have been encoded.
  • the first number is the number of blocks using intra-frame coding in the target reference frame
  • the second number is the number of blocks using inter-frame coding in the target reference frame.
  • S706 Determine a target resolution for encoding the to-be-encoded video frame according to the relationship between the first number and the second number.
  • S708 Use the target resolution to encode the to-be-encoded video frame.
  • the video encoding method shown in FIG. 7 can be, but not limited to, used in the video encoder shown in FIG. 1. Through the interaction of the video encoder and other components, the encoding process of the video frame to be encoded is completed.
  • the above-mentioned video encoding method may be applied to, but not limited to, application scenarios such as video playback applications, video sharing applications, or video session applications.
  • the video transmitted in the above application scenarios may include, but is not limited to, long video and short video. For example, a long video may be an episode with a long playing time (for example, a playing time greater than 10 minutes) or the picture displayed in a long video session; a short video may be a voice message exchanged between two or more parties, or a video with a short playing time (for example, a playing time less than or equal to 30 seconds) displayed on a sharing platform.
  • the video encoding method provided in this embodiment can be, but is not limited to, applied to the foregoing application scenarios.
  • for the video frame to be encoded, the relationship between the first number of intra-encoded blocks and the second number of inter-encoded blocks in the reference frame determined from the encoded video frames is used to adaptively determine the target resolution of the video frame to be encoded. This simplifies the operation of determining the target resolution of the video frame to be encoded, overcomes the problem of low video coding efficiency in related technologies, and achieves the effect of improving coding efficiency.
  • the first number of intra-encoded blocks and the second number of inter-encoded blocks in the target reference frame associated with the above-mentioned video frame to be encoded can be obtained from the video frames that have been encoded.
  • different resolutions can be adaptively selected for different video frames to be encoded, which simplifies the operation of determining the target resolution used for the video frame to be encoded, thereby overcoming the problem of low video encoding efficiency in related technologies and achieving the effect of improving encoding efficiency.
  • determining the target resolution for encoding the video frame to be encoded according to the relationship between the first number and the second number includes:
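  • when the ratio of the first number to the second number is greater than a predetermined threshold, the video frame to be encoded is encoded with the first resolution;
  • when the ratio of the first number to the second number is less than the predetermined threshold, the video frame to be encoded is encoded with the second resolution, where the second resolution is greater than the first resolution.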
  • before using the first resolution to encode the video frame to be encoded, the method further includes:
  • a set of thresholds is determined from a numerical interval greater than the predetermined threshold; the ratio of the first number to the second number is compared with each threshold in the set; the sampling ratio is determined according to the comparison result; and the second resolution is down-sampled according to the sampling ratio to determine the first resolution.
  • determining the sampling ratio according to the comparison result includes: when the ratio of the first number to the second number is determined to lie in a target interval, obtaining the target sampling ratio configured for the target interval, where the target interval is bounded by an adjacent first threshold and second threshold in the set of thresholds, and the first threshold is less than the second threshold; down-sampling the second resolution according to the sampling ratio includes: down-sampling the width and height of the second resolution according to the target sampling ratio to obtain the first resolution.
  • for specific embodiments of the above-mentioned video encoding method, reference may be made to, but is not limited to, the specific embodiments of the video decoding method: the processing opposite to the decoding method is applied, so that a matching resolution is adaptively decided for each video frame to be encoded and the frame is encoded at that resolution. This is not repeated in this embodiment.
  • as shown in steps S802 to S830 in the example of FIG. 8: at the encoding end, the video frame to be encoded is obtained, and the resolution used for encoding is selected through the resolution decision. Then, according to the current encoding information and/or image feature information of the video frame to be encoded, the current encoding mode is determined from the optional encoding modes. If it is the full-resolution mode, it is determined that down-sampling encoding is not required, and steps S808-1 to S816-1 are executed: intra/inter prediction is performed on the original-resolution image frame to obtain prediction residuals, transformation/quantization and entropy coding are performed to obtain coded data, and the resolution identifier of the adopted resolution is determined. The above data is further encoded, and in step S820 a code stream is output.
  • if it is the down-sampling mode, steps S810-2 to S818-2 are executed: a down-sampled image frame is obtained, intra/inter prediction is performed on this image frame to obtain prediction residuals, transformation/quantization and entropy coding are performed to obtain coded data, and the resolution identifier of the adopted resolution is determined. The above data is further encoded, and in step S820 a code stream is output.
  • at the decoding end, in steps S822 to S830, an adaptive decision is made to determine the resolution of the video frame to be decoded, and the frame is then decoded to obtain a reconstructed video frame. If it is determined that the down-sampling mode is adopted, the reconstructed video frame is up-sampled to obtain the decoded video frame; otherwise, the decoded video frame is output directly.
  • the foregoing is only an example.
  • the video encoding method and video decoding method provided in this embodiment are applied to the resolution decision process shown in the above figure, and are used at the encoding end and the decoding end to adaptively decide the resolution for different video frames to be encoded or decoded. This simplifies the process of determining the resolution used by each frame during encoding and decoding, thereby achieving the effect of improving encoding and decoding efficiency.
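The two branches of the flow above (full-resolution versus down-sampling, with a resolution identifier carried alongside the coded data) can be sketched as a round trip. This is a deliberately reduced illustration: prediction, transformation/quantization, and entropy coding are omitted, a Python dictionary stands in for the code stream, the sampling factor is fixed at 2, and every function name is hypothetical.

```python
def downsample2(frame):
    """Keep every second pixel in each dimension (1/2 down-sampling)."""
    return [row[::2] for row in frame[::2]]


def upsample2(frame):
    """Repeat each pixel twice in each dimension (2x up-sampling)."""
    out = []
    for row in frame:
        wide = [pixel for pixel in row for _ in (0, 1)]
        out.extend([wide, list(wide)])
    return out


def encode(frame, use_downsampling):
    """Encoding end: pick the branch and record the resolution identifier."""
    payload = downsample2(frame) if use_downsampling else frame
    return {"downsampled": use_downsampling, "payload": payload}


def decode(stream):
    """Decoding end: reconstruct, then up-sample only in down-sampling mode."""
    recon = stream["payload"]
    return upsample2(recon) if stream["downsampled"] else recon
```

In the full-resolution branch the round trip returns the input unchanged; in the down-sampling branch the output has the original dimensions but only the retained samples, repeated.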
  • a video decoding device is provided, and the device includes:
  • the decoding determination module 902 is used to determine the video frame to be decoded;
  • the acquiring module 904 is configured to acquire a first number and a second number of target reference frames associated with the video frame to be decoded from the video frames that have been decoded, where the first number is the number of blocks using intra-frame decoding in the target reference frame, and the second number is the number of blocks using inter-frame decoding in the target reference frame;
  • the first processing module 906 is configured to determine the target resolution for decoding the video frame to be decoded according to the relationship between the first number and the second number;
  • the decoding module 908 is configured to decode the video frame to be decoded using the target resolution.
  • the first processing module 906 includes:
  • the first processing sub-module is used to decode the video frame to be decoded at the first resolution when the ratio of the first number to the second number is greater than a predetermined threshold, and to decode the video frame to be decoded at the second resolution when the ratio of the first number to the second number is less than the predetermined threshold, where the second resolution is greater than the first resolution.
  • the first determining submodule is used to determine a set of thresholds from a numerical interval greater than the predetermined threshold when the ratio of the first number to the second number is greater than the predetermined threshold, before the video frame to be decoded is decoded at the first resolution;
  • the comparison sub-module is used to compare the ratio of the first number to the second number with each threshold included in the set of thresholds;
  • the second determining sub-module is used to determine the sampling ratio according to the comparison result;
  • the sampling sub-module is used to down-sample the second resolution according to the sampling ratio to determine the first resolution.
  • the obtaining module 904 includes:
  • the third determining sub-module is used to determine one reference frame from the decoded video frames as the target reference frame;
  • the first acquisition submodule is used to acquire the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding in the target reference frame.
  • the third determining sub-module includes: (1) the first determining unit, configured to determine the t-kth video frame as the target reference frame when the video frame to be decoded is the t-th video frame;
  • the first acquisition submodule includes: (1) a first acquisition unit for acquiring, from the t-kth video frame, the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding, where k is a predetermined positive integer, t is a positive integer, and t>k.
  • the obtaining module 904 includes:
  • the fourth determining sub-module is used to determine a multi-frame reference frame from the decoded video frame as the target reference frame;
  • the second acquisition submodule is configured to acquire the first number of blocks using intra-frame decoding in the target reference frame and the second number of blocks using inter-frame decoding in the target reference frame.
  • the first number is the total number of blocks using intra-frame decoding in the multi-frame reference frame
  • the second number is the total number of blocks using inter-frame decoding in the multi-frame reference frame.
  • the fourth determining sub-module can be implemented by, but is not limited to, the following step: determining continuous multi-frame video frames or discontinuous multi-frame video frames from the video frames that have been decoded as the target reference frame.
  • the fourth determining sub-module includes: (1) the second determining unit, used to determine the t-k1th video frame to the t-k2th video frame as the target reference frame when the video frame to be decoded is the t-th video frame;
  • the second acquisition sub-module includes: (1) the second acquisition unit, used to acquire, from the t-k1th video frame to the t-k2th video frame, the first number of blocks using intra-frame decoding and the second number of blocks using inter-frame decoding, where k1 and k2 are predetermined positive integers, t is a positive integer, and t>k1>k2.
  • a video encoding device is provided. As shown in FIG. 10, the device includes:
  • the encoding determining module 1002 is used to determine the video frame to be encoded;
  • the acquisition module 1004 is configured to acquire a first number and a second number of target reference frames associated with the video frame to be encoded from a video frame that has been encoded, where the first number is the frame used in the target reference frame The number of intra-encoded blocks, where the second number is the number of inter-encoded blocks in the target reference frame;
  • a processing module 1006 configured to determine the target resolution for encoding the video frame to be encoded according to the relationship between the first number and the second number;
  • the encoding module 1008 is used to encode the to-be-encoded video frame using the target resolution.
  • the processing module 1006 includes:
  • the first processing sub-module is used to encode the video frame to be encoded with the first resolution when the ratio of the first number to the second number is greater than a predetermined threshold, and to encode the video frame to be encoded with the second resolution when the ratio of the first number to the second number is less than the predetermined threshold, where the second resolution is greater than the first resolution.
  • the first determining submodule is used to determine a set of thresholds from a numerical interval greater than the predetermined threshold when the ratio of the first number to the second number is greater than the predetermined threshold, before the video frame to be encoded is encoded with the first resolution;
  • the comparison sub-module is used to compare the ratio of the first quantity to the second quantity with each threshold included in a set of thresholds;
  • the second determining sub-module is used to determine the sampling ratio according to the comparison result
  • the sampling sub-module is used to down-sample the second resolution according to the sampling ratio to determine the first resolution.
  • an electronic device for implementing the foregoing video decoding method.
  • the electronic device includes a memory and a processor, and a computer program is stored in the memory.
  • the processor is configured to execute the steps in any one of the foregoing method embodiments through a computer program.
  • the above-mentioned electronic device may be located in at least one network device among a plurality of network devices in a computer network.
  • the foregoing processor may be configured to execute the following steps through a computer program:
  • S1 Determine the video frame to be encoded;
  • S2 Obtain a first number and a second number of target reference frames associated with the video frame to be encoded from the video frames that have been encoded, where the first number is the number of blocks in the target reference frame that are intra-encoded, and the second number is the number of blocks using inter-frame coding in the target reference frame;
  • S3 Determine the target resolution for encoding the to-be-encoded video frame according to the relationship between the first number and the second number;
  • S4 Use the target resolution to encode the to-be-encoded video frame.
  • the structure shown in FIG. 11 is only for illustration, and the electronic device may also be a smart phone (such as an Android phone, an iOS phone, etc.), a tablet computer, a palmtop computer, and a mobile Internet device (Mobile Internet Devices, MID), PAD and other terminal devices.
  • FIG. 11 does not limit the structure of the above electronic device.
  • the electronic device may also include more or fewer components (such as a network interface, etc.) than shown in FIG. 11, or have a configuration different from that shown in FIG. 11.
  • the memory 1102 can be used to store software programs and modules, such as program instructions/modules corresponding to the video decoding method and device in the embodiments of the present application.
  • the processor 1104 runs the software programs and modules stored in the memory 1102 to execute various functional applications and data processing, that is, to implement the above-mentioned video decoding method.
  • the memory 1102 may include a high-speed random access memory, and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
  • the memory 1102 may further include a memory remotely provided with respect to the processor 1104, and these remote memories may be connected to the terminal through a network.
  • the memory 1102 may be specifically, but not limited to, used to store information such as video frames to be decoded.
  • the foregoing memory 1102 may include, but is not limited to, the decoding determination module 902, the acquisition module 904, the first processing module 906, and the decoding module 908 in the foregoing video decoding device.
  • it may also include, but is not limited to, other module units in the above-mentioned video decoding device, which will not be repeated in this example.
  • the aforementioned transmission device 1106 is used to receive or send data via a network.
  • the foregoing specific examples of the network may include wired networks and wireless networks.
  • the transmission device 1106 includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices and routers via a network cable so as to communicate with the Internet or a local area network.
  • the transmission device 1106 is a radio frequency (RF) module, which is used to communicate with the Internet in a wireless manner.
  • the above-mentioned electronic device further includes: a display 1108 for displaying the decoded video; and a connection bus 1110 for connecting each module component in the above-mentioned electronic device.
  • the electronic device for implementing the foregoing video encoding method.
  • the electronic device includes a memory 1202 and a processor 1204.
  • the memory 1202 stores a computer program.
  • the processor 1204 is configured to execute the steps in any one of the foregoing method embodiments through a computer program.
  • the above-mentioned electronic device may be located in at least one network device among a plurality of network devices in a computer network.
  • the foregoing processor may be configured to execute the following steps through a computer program:
  • S2 Obtain a first number and a second number of a target reference frame associated with the video frame to be encoded from the video frames that have been encoded, where the first number is the number of blocks in the target reference frame that use intra-frame encoding, and the second number is the number of blocks in the target reference frame that use inter-frame encoding;
  • S3 Determine a target resolution for encoding the to-be-encoded video frame according to the relationship between the first number and the second number;
  • the structure shown in FIG. 12 is only for illustration, and the electronic device may also be a smart phone (such as an Android phone, an iOS phone, etc.), a tablet computer, a palmtop computer, and a mobile Internet device (Mobile Internet Devices, MID), PAD and other terminal devices.
  • FIG. 12 does not limit the structure of the above-mentioned electronic device.
  • the electronic device may also include more or fewer components (such as a network interface, etc.) than shown in FIG. 12, or have a configuration different from that shown in FIG. 12.
  • the memory 1202 can be used to store software programs and modules, such as program instructions/modules corresponding to the video encoding method and device in the embodiments of the present application.
  • the processor 1204 runs the software programs and modules stored in the memory 1202 to execute various functional applications and data processing, that is, to implement the above-mentioned video encoding method.
  • the memory 1202 may include a high-speed random access memory, and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
  • the memory 1202 may further include a memory remotely provided with respect to the processor 1204, and these remote memories may be connected to the terminal through a network.
  • the memory 1202 may be specifically, but not limited to, used to store information such as video frames to be encoded.
  • the foregoing memory 1202 may include, but is not limited to, the encoding determination module 1002, the acquisition module 1004, the processing module 1006, and the encoding module 1008 in the foregoing video encoding device.
  • it may also include, but is not limited to, other module units in the above-mentioned video encoding device, which will not be repeated in this example.
  • the aforementioned transmission device 1206 is used to receive or send data via a network.
  • the foregoing specific examples of the network may include wired networks and wireless networks.
  • the transmission device 1206 includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices and routers via a network cable so as to communicate with the Internet or a local area network.
  • the transmission device 1206 is a radio frequency (RF) module, which is used to communicate with the Internet in a wireless manner.
  • the above-mentioned electronic device further includes: a display 1208 for displaying the video before encoding; and a connection bus 1210 for connecting each module component in the above-mentioned electronic device.
  • the embodiment of the present application also provides a storage medium in which a computer program is stored, wherein the computer program is configured to execute the steps in any one of the foregoing method embodiments when running.
  • the aforementioned storage medium may be configured to store a computer program for executing the following steps:
  • S2 Obtain a first number and a second number of a target reference frame associated with the video frame to be decoded from the video frames that have been decoded, where the first number is the number of blocks in the target reference frame that use intra-frame decoding, and the second number is the number of blocks in the target reference frame that use inter-frame decoding;
  • S3 Determine a target resolution for decoding the to-be-decoded video frame according to the relationship between the first number and the second number;
  • the storage medium is also configured to store a computer program for executing the following steps:
  • S2 Obtain a first number and a second number of a target reference frame associated with the video frame to be encoded from the video frames that have been encoded, where the first number is the number of blocks in the target reference frame that use intra-frame encoding, and the second number is the number of blocks in the target reference frame that use inter-frame encoding;
  • S3 Determine the target resolution for encoding the to-be-encoded video frame according to the relationship between the first number and the second number;
  • the storage medium is further configured to store a computer program used to execute the steps included in the method in the above-mentioned embodiment, which will not be repeated in this embodiment.
  • the embodiments of the present application also provide a computer program product including instructions, which when run on a computer, cause the computer to execute the video decoding method or video encoding method provided in the foregoing embodiments.
  • the storage medium may include: a flash disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk or an optical disk, and the like.
  • if the integrated unit in the foregoing embodiment is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in the foregoing computer-readable storage medium.
  • the technical solution of the present application essentially or the part that contributes to the existing technology, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium. It includes several instructions to make one or more computer devices (which may be personal computers, servers, or network devices, etc.) execute all or part of the steps of the methods in the various embodiments of the present application.
  • the disclosed client can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of units is only a logical function division.
  • multiple units or components can be combined or integrated into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, units or modules, and may be in electrical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

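This application discloses a video encoding and decoding method and related apparatus. The method includes: determining a video frame to be decoded; obtaining, from video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded, where the first number is the number of blocks in the target reference frame that use intra-frame decoding and the second number is the number of blocks in the target reference frame that use inter-frame decoding; determining, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded; and decoding the video frame to be decoded at the target resolution. This application solves the technical problem of low video encoding and decoding efficiency caused by the relatively complex processing operations provided by the related art.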

Description

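Video encoding and decoding method and related apparatus
This application claims priority to Chinese Patent Application No. 201910927110.5, entitled "Video encoding and decoding method and apparatus, and storage medium" and filed with the China Patent Office on September 27, 2019, which is incorporated herein by reference in its entirety.
Technical Field
This application relates to the field of audio and video encoding and decoding, and in particular to video encoding and decoding.
Background
With the development of digital media technology and computer technology, video is used in many fields, such as mobile communication, network monitoring, and Internet television. As hardware performance and screen resolution improve, users' demand for high-definition video grows ever stronger.
When mobile bandwidth is limited, a codec usually encodes and decodes all video frames at the same resolution, which makes the peak signal-to-noise ratio (PSNR) relatively low at some bandwidths.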
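Summary
The embodiments of this application provide a video encoding and decoding method and related apparatus, to at least solve the technical problem of low video encoding and decoding efficiency caused by the relatively complex processing operations provided by the related art.
According to one aspect of the embodiments of this application, a video decoding method is provided. The method is performed by a video processing device and includes: determining a video frame to be decoded; obtaining, from video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded, where the first number is the number of blocks in the target reference frame that use intra-frame decoding and the second number is the number of blocks in the target reference frame that use inter-frame decoding; determining, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded; and decoding the video frame to be decoded at the target resolution.
According to another aspect of the embodiments of this application, a video encoding method is further provided. The method is performed by a video processing device and includes: determining a video frame to be encoded; obtaining, from video frames that have been encoded, a first number and a second number of a target reference frame associated with the video frame to be encoded, where the first number is the number of blocks in the target reference frame that use intra-frame encoding and the second number is the number of blocks in the target reference frame that use inter-frame encoding; determining, according to the relationship between the first number and the second number, a target resolution for encoding the video frame to be encoded; and encoding the video frame to be encoded at the target resolution.
According to yet another aspect of the embodiments of this application, a video decoding apparatus is further provided, configured to: determine a video frame to be decoded; obtain, from video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded, where the first number is the number of blocks in the target reference frame that use intra-frame decoding and the second number is the number of blocks in the target reference frame that use inter-frame decoding; determine, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded; and decode the video frame to be decoded at the target resolution.
According to another aspect of the embodiments of this application, a video encoding apparatus is further provided, configured to: determine a video frame to be encoded; obtain, from video frames that have been encoded, a first number and a second number of a target reference frame associated with the video frame to be encoded, where the first number is the number of blocks in the target reference frame that use intra-frame encoding and the second number is the number of blocks in the target reference frame that use inter-frame encoding; determine, according to the relationship between the first number and the second number, a target resolution for encoding the video frame to be encoded; and encode the video frame to be encoded at the target resolution.
According to yet another aspect of the embodiments of this application, a computer-readable storage medium is further provided. The storage medium stores a computer program, and the computer program is configured to perform, when run, the video decoding method or the video encoding method of the foregoing aspects.
According to still another aspect of the embodiments of this application, a video processing device is provided. The video processing device includes:
a processor, a communication interface, a memory, and a communication bus;
where the processor, the communication interface, and the memory communicate with one another through the communication bus, and the communication interface is an interface of a communication module;
the memory is configured to store program code and transmit the program code to the processor; and the processor is configured to invoke instructions of the program code in the memory to perform the video decoding method or the video encoding method of the foregoing aspects.
According to yet another aspect of the embodiments of this application, a computer program product including instructions is provided, which, when run on a computer, causes the computer to perform the video decoding method or the video encoding method of the foregoing aspects.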
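In the embodiments of this application, for a determined video frame to be decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded can be obtained from the video frames that have been decoded, where the first number is the number of blocks in the target reference frame that use intra-frame decoding and the second number is the number of blocks in the target reference frame that use inter-frame decoding. The relationship between the first number and the second number is used to determine the target resolution for decoding the video frame to be decoded, and the video frame is decoded at that target resolution. Different resolutions can thus be selected adaptively for different video frames to be decoded, which simplifies the operation of determining the target resolution, overcomes the low video decoding efficiency of the related art, and improves decoding efficiency. This in turn solves the technical problem of low video encoding and decoding efficiency caused by the relatively complex processing operations provided by the related art.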
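Brief Description of the Drawings
The drawings described here provide a further understanding of this application and constitute a part of it. The exemplary embodiments of this application and their descriptions explain this application and do not unduly limit it. In the drawings:
FIG. 1 is a schematic diagram of an application environment of an optional video decoding method according to an embodiment of this application;
FIG. 2 is a flowchart of an optional video decoding method according to an embodiment of this application;
FIG. 3 is a schematic diagram of an optional video decoding method according to an embodiment of this application;
FIG. 4 is a schematic diagram of another optional video decoding method according to an embodiment of this application;
FIG. 5 is a schematic diagram of yet another optional video decoding method according to an embodiment of this application;
FIG. 6 is a schematic diagram of yet another optional video decoding method according to an embodiment of this application;
FIG. 7 is a flowchart of an optional video encoding method according to an embodiment of this application;
FIG. 8 is a schematic diagram of an optional video encoding and decoding process according to an embodiment of this application;
FIG. 9 is a schematic structural diagram of an optional video decoding apparatus according to an embodiment of this application;
FIG. 10 is a schematic structural diagram of an optional video encoding apparatus according to an embodiment of this application;
FIG. 11 is a schematic structural diagram of an optional electronic device according to an embodiment of this application;
FIG. 12 is a schematic structural diagram of another optional electronic device according to an embodiment of this application.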
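Detailed Description
To help those skilled in the art better understand the solutions of this application, the technical solutions in the embodiments of this application are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of this application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of this application without creative effort shall fall within the protection scope of this application.
It should be noted that the terms "first", "second", and so on in the specification, claims, and drawings of this application are used to distinguish similar objects and do not necessarily describe a specific order or sequence. Data used in this way are interchangeable where appropriate, so that the embodiments described here can be implemented in an order other than the one illustrated or described. In addition, the terms "include" and "have" and any variants of them are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that includes a series of steps or units is not necessarily limited to the steps or units explicitly listed, and may include other steps or units that are not explicitly listed or that are inherent to the process, method, product, or device.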
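According to one aspect of the embodiments of this application, a video decoding method is provided. As an optional implementation, the video decoding method may be, but is not limited to being, applied to the application environment shown in FIG. 1. The application environment includes a terminal 102 and a server 104, which communicate through a network. The terminal 102 may be, but is not limited to, a smartphone, a tablet computer, a notebook computer, a desktop computer, or the like. The server 104 may be, but is not limited to, a computer processing device with strong data processing capability and a certain amount of storage space.
The video decoding method and video encoding method provided in the embodiments of this application can be performed by a video processing device that has video encoding and decoding functions, for example the terminal 102 or the server 104 described above.
It should be noted that the video encoding method corresponding to the foregoing video decoding method may also be, but is not limited to being, applied to the application environment shown in FIG. 1. After a video to be encoded is obtained, the video encoding method provided in this application may be used, through the interaction between the terminal 102 and the server 104 shown in FIG. 1, to adaptively decide the resolution used to encode each video frame of the video to be encoded. Different video frames can thus be encoded at different resolutions, which saves transmission bandwidth while preserving the encoding quality of the video frames and avoiding distortion. Likewise, after a video to be decoded is obtained, the video decoding method provided in this application may be used, through the same interaction, to adaptively decide the resolution used to decode each video frame of the video to be decoded, so that different video frames are decoded at different resolutions, which preserves decoding quality and avoids distortion.
In one embodiment, the terminal 102 may include, but is not limited to, the following components: an image processing unit 1021, a processor 1022, a storage medium 1023, a memory 1024, a network interface 1025, a display screen 1026, and an input device 1027. These components may be connected through, but are not limited to, a system bus 1028. The image processing unit 1021 provides at least the ability to draw the display interface. The processor 1022 provides computing and control capabilities to support the operation of the terminal 102. The storage medium 1023 stores an operating system 1023-2 and a video encoder and/or video decoder 1023-4. The operating system 1023-2 provides control operation instructions, and the video encoder and/or video decoder 1023-4 performs encoding/decoding operations according to those instructions. In addition, the memory 1024 provides a running environment for the video encoder and/or video decoder 1023-4 in the storage medium 1023, and the network interface 1025 performs network communication with the network interface 1043 in the server 104. The display screen 1026 displays application interfaces and the like, such as decoded video, and the input device 1027 receives commands or data entered by the user. For a terminal 102 with a touchscreen, the display screen 1026 and the input device 1027 may be the touchscreen. The internal structure of the terminal shown in FIG. 1 is only a block diagram of the part of the structure related to the solution of this application and does not limit the terminal to which the solution is applied. A specific terminal or server may include more or fewer components than shown in the figure, combine some components, or have a different component arrangement.
In one embodiment, the server 104 may include, but is not limited to, the following components: a processor 1041, a memory 1042, a network interface 1043, and a storage medium 1044. These components may be connected through, but are not limited to, a system bus 1045. The storage medium 1044 includes an operating system 1044-1, a database 1044-2, and a video encoder and/or video decoder 1044-3. The processor 1041 provides computing and control capabilities to support the operation of the server 104. The memory 1042 provides an environment for running the video encoder and/or video decoder 1044-3 in the storage medium 1044. The network interface 1043 communicates with the network interface 1025 of the external terminal 102 through a network connection. The operating system 1044-1 in the storage medium provides control operation instructions, the video encoder and/or video decoder 1044-3 performs encoding/decoding operations according to those instructions, and the database 1044-2 stores data. The internal structure of the server shown in FIG. 1 is only a block diagram of the part of the structure related to the solution of this application and does not limit the computer device to which the solution is applied. A specific computer device may have a different component arrangement.
In one embodiment, the network may include, but is not limited to, a wired network. The wired network may include, but is not limited to, a wide area network, a metropolitan area network, and a local area network. The above is only an example, and this embodiment imposes no limitation on this.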
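According to one aspect of the embodiments of this application, a video decoding method is provided. As shown in FIG. 2, the method includes:
S202: Determine a video frame to be decoded.
S204: Obtain, from video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded.
The first number is the number of blocks in the target reference frame that use intra-frame decoding, and the second number is the number of blocks in the target reference frame that use inter-frame decoding.
S206: Determine, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded.
S208: Decode the video frame to be decoded at the target resolution.
It should be noted that the video decoding method shown in FIG. 2 may be, but is not limited to being, used in the video decoder shown in FIG. 1. The decoding of the video frame to be decoded is completed through the interaction and cooperation of the video decoder with other components.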
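Optionally, in this embodiment, the video decoding method may be, but is not limited to being, applied to scenarios such as video playback applications, video sharing applications, or video session applications. The video transmitted in these scenarios may include, but is not limited to, long videos and short videos. For example, a long video may be an episode with a long playing time (for example, longer than 10 minutes) or the picture displayed in a long video session, and a short video may be a voice message exchanged between two or more parties or a video with a short playing time (for example, no longer than 30 seconds) displayed on a sharing platform. The above is only an example. The video decoding method provided in this embodiment may be, but is not limited to being, applied to a playback device that plays video in these scenarios. After decoded video frames are obtained, the target resolution of the video frame to be decoded is decided adaptively, based on the relationship between the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame determined from those video frames. This simplifies the operation of determining the target resolution, overcomes the low video decoding efficiency of the related art, and improves decoding efficiency.
It should be noted that, in this embodiment, for a determined video frame to be decoded, the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame associated with the video frame to be decoded can be obtained from the video frames that have been decoded. The relationship between the first number and the second number is used to determine the target resolution for decoding the video frame to be decoded, and the video frame is decoded at that target resolution. Different resolutions can thus be selected adaptively for different video frames to be decoded, which simplifies the operation of determining the target resolution, overcomes the low video decoding efficiency of the related art, and improves decoding efficiency.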
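Optionally, in this embodiment, after the video frame to be decoded is determined from the bitstream received from the encoding device, the target reference frame may be, but is not limited to being, determined from the video frames that have been decoded, and the decoding mode of the reference frame may be determined in the following ways:
1) Obtain a preset flag bit in the bitstream and determine, according to the flag bit, the decoding mode used by the target reference frame, such as intra-frame decoding or inter-frame decoding.
2) Decode according to the convention agreed with the encoding device at the encoding end, and after decoding determine the decoding mode used by the decoded target reference frame, such as intra-frame decoding or inter-frame decoding.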
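Optionally, in this embodiment, determining the target resolution for decoding the video frame to be decoded according to the relationship between the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding may include, but is not limited to: obtaining the ratio of the first number to the second number, and determining the target resolution according to the result of comparing the ratio with a predetermined threshold. For example, when the ratio is greater than the predetermined threshold, the video frame to be decoded is decoded at a first resolution; when the ratio is less than the predetermined threshold, the video frame to be decoded is decoded at a second resolution, where the second resolution is greater than the first resolution. In other words, the more blocks use inter-frame decoding, the smaller the ratio and the higher the resolution used.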
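The ratio-based decision described above can be sketched as follows. This is a minimal illustration rather than the implementation of this application: the function name, the `(width, height)` tuples, and the handling of a zero inter-block count and of a ratio exactly equal to the threshold are assumptions, since the text does not specify them.

```python
def choose_resolution_by_ratio(intra_blocks, inter_blocks, threshold,
                               first_resolution, second_resolution):
    """Pick a resolution from the intra/inter block ratio of the reference frame.

    first_resolution is the lower resolution, second_resolution the higher one.
    """
    if inter_blocks == 0:
        # Assumption: with no inter blocks the ratio is treated as unbounded,
        # so the lower (first) resolution is chosen.
        return first_resolution
    ratio = intra_blocks / inter_blocks
    if ratio > threshold:
        return first_resolution
    # The text leaves ratio == threshold unspecified; this sketch groups it
    # with the "less than" case.
    return second_resolution
```

For instance, a reference frame with 30 intra blocks and 10 inter blocks has a ratio of 3, which exceeds a hypothetical threshold of 1.0, so the lower resolution would be chosen.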
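In addition, in this embodiment, determining the target resolution for decoding the video frame to be decoded according to the relationship between the first number and the second number may also include, but is not limited to: obtaining the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the reference frame; when the first number is greater than the second number and the difference between them is greater than a predetermined threshold, decoding the video frame to be decoded at a first resolution; and when the difference between them is less than the predetermined threshold, decoding the video frame to be decoded at a second resolution, where the second resolution is greater than the first resolution. In other words, when the first number is greater than the second number, the difference shrinks as the number of blocks that use inter-frame decoding grows, and the resolution used becomes higher, and vice versa. This way of deciding is only an example, and this embodiment imposes no limitation on it.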
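The difference-based variant can be sketched in the same style. The function name is hypothetical, and treating a difference exactly equal to the threshold like the "less than" case is an assumption, because the text does not cover it.

```python
def choose_resolution_by_difference(intra_blocks, inter_blocks, threshold,
                                    first_resolution, second_resolution):
    """Pick a resolution from the difference between intra and inter block counts.

    first_resolution is the lower resolution, second_resolution the higher one.
    """
    difference = intra_blocks - inter_blocks
    if intra_blocks > inter_blocks and difference > threshold:
        return first_resolution
    # Assumption: every other case, including difference == threshold,
    # falls back to the higher (second) resolution.
    return second_resolution
```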
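It should be noted that, in this embodiment, the second resolution may include, but is not limited to, one of the following: the original resolution of the video frame to be decoded, or the highest resolution obtained by upsampling the video frame to be decoded. The first resolution may be, but is not limited to being, a resolution obtained by downsampling the second resolution at a predetermined sampling ratio. The sampling ratio may be, but is not limited to being, determined according to the result of comparing the ratio with different thresholds. These thresholds may be, but are not limited to being, a set of thresholds determined from a numerical interval greater than the predetermined threshold.
Further, in this embodiment, the target reference frame may include, but is not limited to, one of the following: one reference frame among the video frames that have been decoded, or multiple reference frames among the video frames that have been decoded. The multiple reference frames may include, but are not limited to, multiple consecutive video frames or multiple non-consecutive video frames among the video frames that have been decoded.
Optionally, in this embodiment, the target resolution of the current video frame to be decoded may be decided adaptively in, but not limited to, the following ways:
1) In the manner agreed with the encoding device, obtain the ratio of the first number of blocks that use intra-frame decoding to the second number of blocks that use inter-frame decoding, and adaptively decide the target resolution of the video frame to be decoded based on the result of comparing the ratio with the predetermined threshold. For example, the encoding end and the decoding end may follow the convention in a communication standard and decide the resolution in the same way at both ends.
2) Obtain the bitstream sent by the encoding device and parse it to obtain a decoding flag bit, which indicates the target resolution used to decode the video frame to be decoded.
It should be noted that, in this embodiment, the decoding device at the decoding end may, by agreement, adaptively decide the target resolution of the video frame to be decoded in the same way as the encoding device, where the target resolution may be determined based on the result of comparing the ratio of the first number of blocks that use intra-frame decoding to the second number of blocks that use inter-frame decoding with the predetermined threshold.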
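Optionally, in this embodiment, decoding the video frame to be decoded at the target resolution may include, but is not limited to: after the target resolution for the video frame to be decoded has been decided adaptively, decoding the video frame to be decoded based on that target resolution using the related art:
S1: Determine the current decoding mode corresponding to the video frame to be decoded. For example, a flag bit at a preset position in the bitstream is obtained, and the current encoding mode information is read from the flag bit to determine whether the current decoding mode is the full-resolution mode or the downsampling mode. The current decoding mode is the same as the current encoding mode.
S2: Determine the processing reference frame according to the resolution of the current target reference frame corresponding to the video frame to be decoded and the decided target resolution of the video frame to be decoded. It is checked whether the resolution of the current target reference frame is the same as the target resolution. If it is, the current reference frame is used directly as the processing reference frame; if not, step S3 is performed.
S3: Sample the current target reference frame according to a preset sampling rule to obtain the corresponding processing reference frame. The preset sampling rule used for decoding is the same as the one used for encoding. For example, the preset sampling rule at the encoding end may be, but is not limited to being, related to the resolution of the current encoded video frame and the distance between the current target reference frame and the current encoded video frame. When the distance D between the current target reference frame and the current encoded video frame exceeds a preset threshold, 1/M downsampling is used; otherwise 1/N downsampling is used, where M<N. A target reference frame that is farther away may have less reference value than one that is closer, so for a distant current target reference frame, even if its resolution differs from that of the current encoded video frame, it may be left unsampled or sampled so that its resolution changes only slightly. Correspondingly, the decoding end can use the same method for sampling, which speeds up sampling and in turn the whole decoding process. It should be noted that the value of M may vary with the distance D, which further increases the flexibility of the sampling parameters.
In addition, if the resolution of the current target reference frame differs from that of the current decoded video frame (that is, the target resolution), the current target reference frame is sampled to obtain a processing reference frame with the same resolution as the current decoded video frame. Sampling the current target reference frame includes upsampling and downsampling: if the resolution of the current target reference frame is greater than that of the current decoded video frame, the current target reference frame is downsampled to obtain the processing reference frame; if it is smaller, the current target reference frame is upsampled to obtain the processing reference frame. In one embodiment, if there are multiple current target reference frames, each of them is sampled according to the reference frame resolution and the resolution of the current decoded video frame to obtain a processing reference frame with the same resolution as the current decoded video frame.
Further, in this embodiment, the sampling algorithm used to sample the current target reference frame matches the sampling algorithm used to downsample the reconstructed video frame to obtain the decoded video frame. That is, if the target reference frame is downsampled, the downsampling algorithm is the same as the one used to downsample the reconstructed video frame to obtain the decoded video frame. If the target reference frame is upsampled, the upsampling algorithm is the matching inverse of the downsampling algorithm used to downsample the reconstructed video frame to obtain the decoded video frame.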
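Steps S2 and S3 amount to two small decisions, which can be sketched as follows. This is an illustrative sketch under stated assumptions: both function names are hypothetical, resolutions are `(width, height)` tuples, and comparing resolutions by pixel count is an assumption, since the text does not say how "greater" is measured.

```python
def plan_reference_resampling(reference_size, target_size):
    """Decide how to turn the target reference frame into a processing reference frame.

    Returns "reuse", "downsample", or "upsample".
    """
    if reference_size == target_size:
        return "reuse"  # S2: same resolution, use the reference frame directly
    ref_w, ref_h = reference_size
    tgt_w, tgt_h = target_size
    # Assumption: resolutions are compared by total pixel count.
    if ref_w * ref_h > tgt_w * tgt_h:
        return "downsample"
    return "upsample"


def downsampling_denominator(distance, distance_threshold, m, n):
    """Return M (for 1/M downsampling) for distant reference frames, else N."""
    if not m < n:
        raise ValueError("the rule in the text requires M < N")
    return m if distance > distance_threshold else n
```

A reference frame whose distance exceeds the threshold gets the smaller denominator M, so its resolution changes less than that of a nearby reference frame, which matches the rationale given in S3.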
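S4: Decode the current decoded video frame according to the processing reference frame to obtain a reconstructed video frame. The prediction residual in the bitstream is decoded and added to the corresponding predicted pixels of the processing reference frame to obtain the reconstructed video frame.
S5: Process the reconstructed video frame according to the current decoding mode to obtain the decoded video frame. If the current decoding mode is the downsampling mode, the reconstructed video frame is upsampled to obtain the decoded video frame; if the current encoding mode information indicates the full-sampling mode (that is, the full-resolution mode), the reconstructed video frame is taken as the decoded video frame.
Specifically, the algorithm used to upsample the reconstructed video frame is the matching inverse of the algorithm used at the encoding end to downsample the video frame to be encoded to obtain the current encoded video frame.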
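It should be noted that, in this embodiment, at least one pair of decoding blocks to be reconstructed is determined from the video frame to be decoded, where each pair includes a first decoding block at a first resolution and a second decoding block at a second resolution, and the first and second decoding blocks are adjacent in position. The first resolution of the first decoding block is adjusted to the target resolution, and the second resolution of the second decoding block is adjusted to the target resolution. A first edge pixel set is determined from the first decoding block and a second edge pixel set from the second decoding block, where the position of the first edge pixel set is adjacent to that of the second edge pixel set. The first edge pixel set is filtered to obtain a filtered first edge pixel set, and the second edge pixel set is filtered to obtain a filtered second edge pixel set. A first difference, between the pixel value of the i-th pixel in the filtered first edge pixel set and the pixel value of the j-th pixel in the filtered second edge pixel set that corresponds to the i-th pixel, is smaller than a second difference, between the pixel value of the i-th pixel in the first edge pixel set and the pixel value of the j-th pixel in the second edge pixel set. Here i is a positive integer no greater than the total number of pixels in the first edge pixel set, and j is a positive integer no greater than the total number of pixels in the second edge pixel set.
The resolution is adjusted to the target resolution in one of the following ways:
1) When the target resolution equals the first resolution, adjust the second resolution to the first resolution.
2) When the target resolution equals the second resolution, adjust the first resolution to the second resolution.
3) When the target resolution equals a third resolution, adjust the first resolution to the third resolution and adjust the second resolution to the third resolution, where the third resolution differs from both the first resolution and the second resolution.
Adjusting the resolution of the decoding blocks and applying edge filtering to the edge pixel sets determined from them avoids visible seams in the video during reconstruction, so that the content of the video is restored accurately. This solves the technical problem of video distortion caused by inconsistent resolutions.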
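One way to satisfy the stated edge-filtering condition is to pull each pair of facing edge pixels toward their mean. The sketch below is only an illustration: the text does not name a filter, so the blending filter, the `strength` parameter, and the assumption that the i-th pixel of the first set corresponds to the i-th pixel of the second set (j = i) are all hypothetical.

```python
def smooth_block_edges(first_edge, second_edge, strength=0.5):
    """Blend two adjacent edge pixel rows toward their pairwise mean.

    After filtering, each pair differs by (1 - strength) times its original
    difference, so the difference shrinks whenever the pixels were unequal.
    """
    if len(first_edge) != len(second_edge):
        raise ValueError("edge pixel sets must have the same length")
    if not 0 < strength <= 1:
        raise ValueError("strength must be in (0, 1]")
    filtered_first, filtered_second = [], []
    for a, b in zip(first_edge, second_edge):
        mean = (a + b) / 2
        filtered_first.append(a + strength * (mean - a))
        filtered_second.append(b + strength * (mean - b))
    return filtered_first, filtered_second
```

With `strength=0.5`, the pair (100, 60) becomes (90.0, 70.0), so the difference across the block boundary drops from 40 to 20. Pairs that are already equal are left unchanged, so the strict inequality in the text can only hold for pairs that differ.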
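With the embodiments provided in this application, for a determined video frame to be decoded, the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame associated with the video frame to be decoded can be obtained from the video frames that have been decoded. The relationship between the first number and the second number is used to determine the target resolution for decoding the video frame to be decoded, and the video frame is decoded at that target resolution. Different resolutions can thus be selected adaptively for different video frames to be decoded, which simplifies the operation of determining the target resolution, overcomes the low video decoding efficiency of the related art, and improves decoding efficiency.
As an optional solution, determining the target resolution for decoding the video frame to be decoded according to the relationship between the first number and the second number includes:
1) when the ratio of the first number to the second number is greater than a predetermined threshold, decoding the video frame to be decoded at a first resolution;
2) when the ratio of the first number to the second number is less than the predetermined threshold, decoding the video frame to be decoded at a second resolution, where the second resolution is greater than the first resolution.
It should be noted that the second resolution may include, but is not limited to, one of the following: the original resolution of the video frame to be decoded, or the highest resolution obtained by upsampling the video frame to be decoded. The first resolution may be, but is not limited to being, a resolution obtained by downsampling the second resolution at a predetermined sampling ratio. The sampling ratio may be, but is not limited to being, determined according to the result of comparing the ratio with different thresholds. These thresholds may be, but are not limited to being, a set of thresholds determined from a numerical interval greater than the predetermined threshold.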
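Optionally, in this embodiment, before the video frame to be decoded is decoded at the first resolution, the method further includes:
S1: When the ratio of the first number to the second number is greater than the predetermined threshold, determine a set of thresholds from a numerical interval greater than the predetermined threshold.
S2: Compare the ratio of the first number to the second number with each threshold in the set of thresholds.
S3: Determine the sampling ratio according to the comparison result.
S4: Downsample the second resolution at the sampling ratio to determine the first resolution.
Optionally, in this embodiment, determining the sampling ratio according to the comparison result includes: when the ratio of the first number to the second number is determined to lie in a target interval, obtaining the target sampling ratio configured for the target interval, where the target interval consists of the values between an adjacent first threshold and second threshold in the set of thresholds, and the first threshold is less than the second threshold. Downsampling the second resolution at the sampling ratio includes: downsampling the width and height of the second resolution at the target sampling ratio to obtain the first resolution.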
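The following example illustrates this. Assume that the predetermined threshold is A1 and that the set of thresholds determined from the numerical interval greater than the predetermined threshold includes A2 to A5, where A2<A3<A4<A5. Further assume that the ratio of the first number M of blocks that use intra-frame decoding to the second number N of blocks that use inter-frame decoding in the reference frame is r. The determination may proceed as follows:
When r<A1, the inter-frame motion in the decoded video frames is relatively complex, and the high resolution R (that is, the second resolution) is used to decode the video frame to be decoded. When A1<r≤A2, the sampling ratio can be determined as 3/4 of the width and height of the high resolution R, and R is sampled at this ratio to obtain the resolution R' used for decoding (that is, the first resolution). When A3<r≤A4, the sampling ratio can be determined as 2/3 of the width and height of the high resolution R, and R is sampled at this ratio to obtain the resolution R' used for decoding. When A4<r≤A5, the sampling ratio can be determined as 1/3 of the width and height of the high resolution R, and R is sampled at this ratio to obtain the resolution R' used for decoding.
It should be noted that the above sampling ratios are examples. The intervals defined by different thresholds can be assigned different values, and this embodiment imposes no limitation on the values of the sampling ratios or on the correspondence between sampling ratios and thresholds.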
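The interval lookup in this example can be sketched as a small table search. The names below are hypothetical, the numeric values chosen for A1 to A5 are placeholders (the text gives none), and truncating fractional sizes toward zero is an assumption. Only the three intervals the text lists are configured; a ratio outside them returns `None` instead of a guessed value.

```python
from fractions import Fraction


def pick_sampling_ratio(ratio, intervals):
    """Return the sampling ratio whose interval (low, high] contains ratio.

    intervals is a list of (low, high, sampling_ratio) tuples. Returns None
    when no configured interval matches.
    """
    for low, high, sampling_ratio in intervals:
        if low < ratio <= high:
            return sampling_ratio
    return None


def downsample_resolution(width, height, sampling_ratio):
    """Scale width and height by the sampling ratio.

    Assumption: fractional sizes are truncated toward zero.
    """
    return int(width * sampling_ratio), int(height * sampling_ratio)


# Placeholder thresholds A1..A5; the source does not give numeric values.
A1, A2, A3, A4, A5 = 1, 2, 3, 4, 5
EXAMPLE_INTERVALS = [
    (A1, A2, Fraction(3, 4)),
    (A3, A4, Fraction(2, 3)),
    (A4, A5, Fraction(1, 3)),
]
```

With these placeholder thresholds, a ratio of 1.5 selects 3/4, which turns a hypothetical 1280x720 frame into 960x540.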
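With the embodiments provided in this application, the ratio of the first number of blocks that use intra-frame decoding to the second number of blocks that use inter-frame decoding in the target reference frame is obtained, and a resolution matching the current video frame to be decoded is selected adaptively according to the result of comparing that ratio with the predetermined threshold. This makes the resolution decision more flexible and requires no complex decision process, so the resolution for each video frame can be decided quickly during decoding, which improves decoding efficiency.
As an optional solution, obtaining, from the video frames that have been decoded, the first number and the second number of the target reference frame associated with the video frame to be decoded includes:
S1: Determine one reference frame from the video frames that have been decoded as the target reference frame.
S2: Obtain the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame.
Optionally, in this embodiment, step S1, determining the target reference frame from the video frames that have been decoded, includes: when the video frame to be decoded is the t-th video frame, determining the (t-k)-th video frame as the target reference frame. Step S2, obtaining the first number and the second number in the target reference frame, includes: obtaining, from the (t-k)-th video frame, the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding, where k is a predetermined positive integer, t is a positive integer, and t>k.
This is illustrated with the example shown in FIG. 3. Assume that the video frame to be decoded is the t-th video frame and that the reference frame determined from the video frames that have been decoded is the (t-k)-th video frame. Further assume that, from the (t-k)-th video frame, the first number M1 of blocks that use intra-frame decoding (the blocks marked with diagonal lines in FIG. 3) and the second number M2 of blocks that use inter-frame decoding (the blocks marked with dots in FIG. 3) are obtained, and the ratio r=M1/M2 is calculated.
For example, the ratio r is compared with the predetermined threshold A1. When r<A1, the second resolution R is used for decoding. When r≥A1, a set of thresholds is obtained from the numerical interval greater than A1. Assume that the set of thresholds includes A2 to A5, where A2<A3<A4<A5, and that the comparison shows A4<r≤A5. The sampling ratio can then be determined as 1/3 of the width and height of the high resolution R (that is, the second resolution), and R is sampled at this ratio to obtain the resolution R' used for decoding (that is, the first resolution). The t-th video frame is decoded at the resolution R'.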
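The single-reference-frame case can be sketched as follows. The representation of a frame as a list of per-block mode strings, the function names, and the handling of a frame with no inter blocks are assumptions made for illustration only.

```python
from collections import Counter


def target_reference_index(t, k):
    """Index of the target reference frame for frame t, using offset k (t > k > 0)."""
    if not (k > 0 and t > k):
        raise ValueError("expected positive integers with t > k")
    return t - k


def count_block_modes(block_modes):
    """Return (first_number, second_number) for a frame's per-block modes.

    block_modes is an iterable of "intra" / "inter" strings, one per block.
    """
    counts = Counter(block_modes)
    return counts["intra"], counts["inter"]


def intra_inter_ratio(first_number, second_number):
    """Ratio r = M1 / M2 of intra blocks to inter blocks."""
    if second_number == 0:
        # Assumption: a frame with no inter blocks gives an unbounded ratio.
        return float("inf")
    return first_number / second_number
```

For a hypothetical reference frame with 3 intra blocks and 9 inter blocks, `count_block_modes` returns `(3, 9)` and the ratio is 1/3, which would then be compared with the predetermined threshold A1.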
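With the embodiments provided in this application, after one reference frame is determined from the video frames that have been decoded as the target reference frame, the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame are obtained, so that their ratio can be used to adaptively select the target resolution for decoding the current video frame to be decoded. The resolution for the video frame to be decoded can thus be decided quickly, which overcomes the low decoding efficiency caused by the complex resolution decision operations of the related art.
As an optional solution, obtaining, from the video frames that have been decoded, the first number and the second number of the target reference frame associated with the video frame to be decoded includes:
S1: Determine multiple reference frames from the video frames that have been decoded as the target reference frame.
S2: Obtain the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame, where the first number is the total number of blocks that use intra-frame decoding in the multiple reference frames and the second number is the total number of blocks that use inter-frame decoding in the multiple reference frames.
Optionally, in this embodiment, determining multiple reference frames from the video frames that have been decoded as the target reference frame includes: determining, from the video frames that have been decoded, multiple consecutive video frames or multiple non-consecutive video frames as the target reference frame.
For example, the target reference frame may consist of consecutive reference frames. As shown in FIG. 4, the consecutive video frames from the (t-k1)-th video frame to the (t-k2)-th video frame are determined as the target reference frame of the video frame to be decoded, with the first number M1 of blocks that use intra-frame decoding (the blocks marked with diagonal lines in FIG. 4) and the second number M2 of blocks that use inter-frame decoding (the blocks marked with dots in FIG. 4). The target reference frame may also consist of non-consecutive reference frames. As shown in FIG. 5, multiple discrete video frames are selected from the (t-k1)-th video frame to the (t-k2)-th video frame, such as a group of video frames containing the (t-k1)-th video frame and a group of video frames containing the (t-k2)-th video frame, and are determined as the target reference frame of the video frame to be decoded, with the first number M1 of blocks that use intra-frame decoding (the blocks marked with diagonal lines in FIG. 5) and the second number M2 of blocks that use inter-frame decoding (the blocks marked with dots in FIG. 5).
The total number of blocks in each target reference frame may be the same, for example N for all of them (as shown in FIGS. 4 and 5), or may differ, for example N1, N2, and so on (not shown in the figures). This embodiment imposes no limitation on this.
It should be noted that the parameters k1 and k2 may be, but are not limited to being, parameters agreed in advance with the encoding device at the encoding end, or parameters determined by parsing a flag bit at a specified position in the bitstream. This embodiment imposes no limitation on this.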
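Optionally, in this embodiment, step S1, determining the target reference frame from the video frames that have been decoded, includes: when the video frame to be decoded is the t-th video frame, determining the (t-k1)-th video frame to the (t-k2)-th video frame as the target reference frame. Step S2, obtaining the first number and the second number in the target reference frame, includes: obtaining, from the (t-k1)-th video frame to the (t-k2)-th video frame, the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding, where k1 and k2 are predetermined positive integers, t is a positive integer, and t>k1>k2.
This is illustrated with the example shown in FIG. 6. Assume that the target reference frame consists of the consecutive video frames from the (t-1)-th video frame to the (t-2)-th video frame, and that the total number of blocks in each target reference frame is N. From the (t-1)-th frame, the first number M1 of blocks that use intra-frame decoding (the blocks marked with diagonal lines in FIG. 6) and the second number M2 of blocks that use inter-frame decoding (the blocks marked with dots in FIG. 6) are obtained. From the (t-2)-th frame, the first number N1 of blocks that use intra-frame decoding (the blocks marked with diagonal lines in FIG. 6) and the second number N2 of blocks that use inter-frame decoding (the blocks marked with dots in FIG. 6) are obtained. The ratio r may then be, but is not limited to being, determined with the following formulas:
1) r = (M1/M2)*a + (N1/N2)*b, where a and b are weights needed for the calculation, whose specific values are not limited;
2) r = (M1+N1)/(M2+N2).
The above formulas are only examples, and this embodiment imposes no limitation on them.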
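Both formulas generalize naturally to any number of reference frames, as sketched below. The function names and the list-of-pairs representation are hypothetical, and rejecting a zero inter-block count is an assumption, since the text does not cover that case.

```python
def weighted_ratio(per_frame_counts, weights):
    """Formula 1: weighted sum of per-frame intra/inter ratios.

    per_frame_counts is a list of (intra, inter) pairs, one per reference frame.
    """
    if len(per_frame_counts) != len(weights):
        raise ValueError("one weight is required per reference frame")
    if any(inter == 0 for _, inter in per_frame_counts):
        # Assumption: a frame without inter blocks is rejected, not guessed.
        raise ValueError("each reference frame needs at least one inter block")
    return sum(w * intra / inter
               for (intra, inter), w in zip(per_frame_counts, weights))


def pooled_ratio(per_frame_counts):
    """Formula 2: total intra blocks divided by total inter blocks."""
    total_intra = sum(intra for intra, _ in per_frame_counts)
    total_inter = sum(inter for _, inter in per_frame_counts)
    if total_inter == 0:
        raise ValueError("the reference frames contain no inter blocks")
    return total_intra / total_inter
```

For two hypothetical frames with (M1, M2) = (4, 12) and (N1, N2) = (6, 10), the pooled ratio is 10/22, while equal weights a = b = 0.5 give 0.5*(4/12) + 0.5*(6/10). The two formulas generally produce different values, so the encoding end and the decoding end would need to use the same one.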
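With the embodiments provided in this application, after multiple reference frames are determined from the video frames that have been decoded as the target reference frame, the relationship between the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame is obtained, so as to adaptively select the target resolution for decoding the current video frame to be decoded. The resolution for the video frame to be decoded can thus be decided quickly, which overcomes the low decoding efficiency caused by the complex resolution decision operations of the related art.
As an optional solution, determining the target resolution further includes:
S1: Parse the bitstream to obtain the decoding flag bit corresponding to the video frame to be decoded.
S2: Obtain the target resolution indicated by the decoding flag bit.
It should be noted that, to simplify the determination of the target resolution at the decoding end, the encoding end may also write the target resolution that the encoding device has already determined in the above manner directly into the bitstream as a flag bit. After parsing the decoding flag bit from the bitstream, the decoding device at the decoding end can directly obtain the determined target resolution.
The decoding flag bit may be, but is not limited to being, indicated by the following syntax elements:
1) Obtain the syntax element corresponding to each video frame to be decoded, where the syntax element indicates the resolution used to decode that video frame, and the multiple resolutions used to decode multiple video frames to be decoded include at least two different resolutions.
2) Obtain the syntax element corresponding to each of multiple video frames to be decoded, where the syntax element indicates the relationship between a third resolution and a fourth resolution, the third resolution is the resolution used to decode the video frame to be decoded, the fourth resolution is the resolution used to decode the reference frame of the video frame to be decoded, and the multiple resolutions used to decode the multiple video frames to be decoded include at least two different resolutions; then determine the third resolution corresponding to each region according to the syntax element and the fourth resolution.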
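In this way, whether the transmission bandwidth is relatively small or relatively large, the corresponding peak signal-to-noise ratio is relatively high and the distortion relatively small. The peak signal-to-noise ratio therefore varies within a small range while remaining relatively high, which avoids large fluctuations in the peak signal-to-noise ratio when the video is encoded and decoded.
With the embodiments provided in this application, the target resolution needed for decoding is obtained directly from the decoding flag bit, which simplifies processing at the decoding end, reduces the overhead during transmission and the decoding complexity at the decoding end, and thus improves decoding efficiency.
According to another aspect of the embodiments of this application, a video encoding method is provided. As shown in FIG. 7, the method is performed by a video processing device and includes:
S702: Determine a video frame to be encoded.
S704: Obtain, from video frames that have been encoded, a first number and a second number of a target reference frame associated with the video frame to be encoded.
The first number is the number of blocks in the target reference frame that use intra-frame encoding, and the second number is the number of blocks in the target reference frame that use inter-frame encoding.
S706: Determine, according to the relationship between the first number and the second number, a target resolution for encoding the video frame to be encoded.
S708: Encode the video frame to be encoded at the target resolution.
It should be noted that the video encoding method shown in FIG. 7 may be, but is not limited to being, used in the video encoder shown in FIG. 1. The encoding of the video frame to be encoded is completed through the interaction and cooperation of the video encoder with other components.
Optionally, in this embodiment, the video encoding method may be, but is not limited to being, applied to scenarios such as video playback applications, video sharing applications, or video session applications. The video transmitted in these scenarios may include, but is not limited to, long videos and short videos. For example, a long video may be an episode with a long playing time (for example, longer than 10 minutes) or the picture displayed in a long video session, and a short video may be a voice message exchanged between two or more parties or a video with a short playing time (for example, no longer than 30 seconds) displayed on a sharing platform. The above is only an example. The video encoding method provided in this embodiment may be, but is not limited to being, applied to these scenarios. After encoded video frames are obtained, the target resolution of the video frame to be encoded is decided adaptively, based on the relationship between the first number of blocks that use intra-frame encoding and the second number of blocks that use inter-frame encoding in the reference frame determined from those video frames. This simplifies the operation of determining the target resolution, overcomes the low video encoding efficiency of the related art, and improves encoding efficiency.
It should be noted that, in this embodiment, for a determined video frame to be encoded, the first number of blocks that use intra-frame encoding and the second number of blocks that use inter-frame encoding in the target reference frame associated with the video frame to be encoded can be obtained from the video frames that have been encoded. The relationship between the first number and the second number is used to determine the target resolution for encoding the video frame to be encoded, and the video frame is encoded at that target resolution. Different resolutions can thus be selected adaptively for different video frames to be encoded, which simplifies the operation of determining the target resolution, overcomes the low video encoding efficiency of the related art, and improves encoding efficiency.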
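As an optional solution, determining the target resolution for encoding the video frame to be encoded according to the relationship between the first number and the second number includes:
1) when the ratio of the first number to the second number is greater than a predetermined threshold, encoding the video frame to be encoded at a first resolution;
2) when the ratio of the first number to the second number is less than the predetermined threshold, encoding the video frame to be encoded at a second resolution, where the second resolution is greater than the first resolution.
Optionally, in this embodiment, before the video frame to be encoded is encoded at the first resolution, the method further includes:
S1: When the ratio of the first number to the second number is greater than the predetermined threshold, determine a set of thresholds from a numerical interval greater than the predetermined threshold.
S2: Compare the ratio of the first number to the second number with each threshold in the set of thresholds.
S3: Determine the sampling ratio according to the comparison result.
S4: Downsample the second resolution at the sampling ratio to determine the first resolution.
Optionally, in this embodiment, determining the sampling ratio according to the comparison result includes: when the ratio of the first number to the second number is determined to lie in a target interval, obtaining the target sampling ratio configured for the target interval, where the target interval consists of the values between an adjacent first threshold and second threshold in the set of thresholds, and the first threshold is less than the second threshold. Downsampling the second resolution at the sampling ratio includes: downsampling the width and height of the second resolution at the target sampling ratio to obtain the first resolution.
In this embodiment, for specific embodiments of the video encoding method, reference may be made to, but is not limited to, the specific embodiments of the video decoding method. The inverse of the processing used in the decoding method is applied to adaptively decide a matching resolution for each video frame to be encoded. Details are not repeated here.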
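This is illustrated with steps S802 to S830 in the example shown in FIG. 8. At the encoding end, the video frame to be encoded is obtained, and the resolution used to encode it is selected through the resolution decision. The current encoding mode is then determined from the optional encoding modes according to the current encoding information and/or image feature information of the video frame to be encoded. If it is the full-resolution mode, no downsampling is needed, and steps S808-1 to S816-1 are performed: intra-frame/inter-frame prediction is applied to the original-resolution image frame to obtain the prediction residual, followed by transform/quantization and entropy encoding, to obtain the encoded data, and the resolution identifier of the resolution used is determined. The data is then encoded and, in step S820, the bitstream is output.
If it is the downsampling mode, downsampling is needed, and steps S810-2 to S818-2 are performed: the downsampled image frame is obtained, intra-frame/inter-frame prediction is applied to it to obtain the prediction residual, followed by transform/quantization and entropy encoding, to obtain the encoded data, and the resolution identifier of the resolution used is determined. The data is then encoded and, in step S820, the bitstream is output.
The bitstream is then input to the decoding end, as in steps S822 to S830: an adaptive decision is made for the video frame to be decoded to determine its resolution, and the frame is then decoded to obtain the reconstructed video frame. If the downsampling mode is used, the reconstructed video frame is upsampled to obtain the decoded video frame; otherwise the decoded video frame is output directly.
The above is only an example. The video encoding method and video decoding method provided in this embodiment are applied to the resolution decision process shown in the figure, so that the encoding end and the decoding end adaptively decide the resolution for each video frame to be encoded or decoded. This simplifies the process of determining the resolution of each frame during encoding and decoding and thus improves encoding and decoding efficiency.
It should be noted that, for brevity, each of the foregoing method embodiments is described as a series of combined actions. However, those skilled in the art should know that this application is not limited by the described order of actions, because according to this application some steps may be performed in another order or simultaneously. Those skilled in the art should also know that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by this application.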
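According to yet another aspect of the embodiments of this application, a video decoding apparatus for implementing the foregoing video decoding method is further provided. As shown in FIG. 9, the apparatus includes:
1) a decoding determination module 902, configured to determine a video frame to be decoded;
2) an acquisition module 904, configured to obtain, from video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded, where the first number is the number of blocks in the target reference frame that use intra-frame decoding and the second number is the number of blocks in the target reference frame that use inter-frame decoding;
3) a first processing module 906, configured to determine, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded;
4) a decoding module 908, configured to decode the video frame to be decoded at the target resolution.
For specific embodiments, refer to the examples shown in the foregoing video decoding method; details are not repeated here.
As an optional solution, the first processing module 906 includes:
1) a first processing sub-module, configured to decode the video frame to be decoded at a first resolution when the ratio of the first number to the second number is greater than a predetermined threshold, and further configured to decode the video frame to be decoded at a second resolution when the ratio of the first number to the second number is less than the predetermined threshold, where the second resolution is greater than the first resolution.
For specific embodiments, refer to the examples shown in the foregoing video decoding method; details are not repeated here.
As an optional solution, the apparatus further includes:
1) a first determining sub-module, configured to determine, before the video frame to be decoded is decoded at the first resolution and when the ratio of the first number to the second number is greater than the predetermined threshold, a set of thresholds from a numerical interval greater than the predetermined threshold;
2) a comparison sub-module, configured to compare the ratio of the first number to the second number with each threshold in the set of thresholds;
3) a second determining sub-module, configured to determine the sampling ratio according to the comparison result;
4) a sampling sub-module, configured to downsample the second resolution at the sampling ratio to determine the first resolution.
For specific embodiments, refer to the examples shown in the foregoing video decoding method; details are not repeated here.
As an optional solution, the acquisition module 904 includes:
1) a third determining sub-module, configured to determine one reference frame from the video frames that have been decoded as the target reference frame;
2) a first acquisition sub-module, configured to obtain the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame.
For specific embodiments, refer to the examples shown in the foregoing video decoding method; details are not repeated here.
As an optional solution,
1) the third determining sub-module includes: (1) a first determining unit, configured to determine the (t-k)-th video frame as the target reference frame when the video frame to be decoded is the t-th video frame;
2) the first acquisition sub-module includes: (1) a first acquisition unit, configured to obtain, from the (t-k)-th video frame, the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding, where k is a predetermined positive integer, t is a positive integer, and t>k.
For specific embodiments, refer to the examples shown in the foregoing video decoding method; details are not repeated here.
As an optional solution, the acquisition module 904 includes:
1) a fourth determining sub-module, configured to determine multiple reference frames from the video frames that have been decoded as the target reference frame;
2) a second acquisition sub-module, configured to obtain the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding in the target reference frame, where the first number is the total number of blocks that use intra-frame decoding in the multiple reference frames and the second number is the total number of blocks that use inter-frame decoding in the multiple reference frames.
Optionally, in this embodiment, the fourth determining sub-module may be, but is not limited to being, implemented through the following step: determining, from the video frames that have been decoded, multiple consecutive video frames or multiple non-consecutive video frames as the target reference frame.
For specific embodiments, refer to the examples shown in the foregoing video decoding method; details are not repeated here.
As an optional solution,
1) the fourth determining sub-module includes: (1) a second determining unit, configured to determine the (t-k1)-th video frame to the (t-k2)-th video frame as the target reference frame when the video frame to be decoded is the t-th video frame;
2) the second acquisition sub-module includes: (1) a second acquisition unit, configured to obtain, from the (t-k1)-th video frame to the (t-k2)-th video frame, the first number of blocks that use intra-frame decoding and the second number of blocks that use inter-frame decoding, where k1 and k2 are predetermined positive integers, t is a positive integer, and t>k1>k2.
For specific embodiments, refer to the examples shown in the foregoing video decoding method; details are not repeated here.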
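According to yet another aspect of the embodiments of this application, a video encoding apparatus is provided. As shown in FIG. 10, the apparatus includes:
1) an encoding determination module 1002, configured to determine a video frame to be encoded;
2) an acquisition module 1004, configured to obtain, from video frames that have been encoded, a first number and a second number of a target reference frame associated with the video frame to be encoded, where the first number is the number of blocks in the target reference frame that use intra-frame encoding and the second number is the number of blocks in the target reference frame that use inter-frame encoding;
3) a processing module 1006, configured to determine, according to the relationship between the first number and the second number, a target resolution for encoding the video frame to be encoded;
4) an encoding module 1008, configured to encode the video frame to be encoded at the target resolution.
For specific embodiments, refer to the examples shown in the foregoing video encoding method; details are not repeated here.
As an optional solution, the processing module 1006 includes:
1) a first processing sub-module, configured to encode the video frame to be encoded at a first resolution when the ratio of the first number to the second number is greater than a predetermined threshold, and further configured to encode the video frame to be encoded at a second resolution when the ratio of the first number to the second number is less than the predetermined threshold, where the second resolution is greater than the first resolution.
For specific embodiments, refer to the examples shown in the foregoing video encoding method; details are not repeated here.
As an optional solution, the apparatus further includes:
1) a first determining sub-module, configured to determine, before the video frame to be encoded is encoded at the first resolution and when the ratio of the first number to the second number is greater than the predetermined threshold, a set of thresholds from a numerical interval greater than the predetermined threshold;
2) a comparison sub-module, configured to compare the ratio of the first number to the second number with each threshold in the set of thresholds;
3) a second determining sub-module, configured to determine the sampling ratio according to the comparison result;
4) a sampling sub-module, configured to downsample the second resolution at the sampling ratio to determine the first resolution.
For specific embodiments, refer to the examples shown in the foregoing video encoding method; details are not repeated here.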
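According to yet another aspect of the embodiments of this application, an electronic device for implementing the foregoing video decoding method is further provided. As shown in FIG. 11, the electronic device includes a memory and a processor. The memory stores a computer program, and the processor is configured to perform the steps in any one of the foregoing method embodiments through the computer program.
Optionally, in this embodiment, the electronic device may be located in at least one of multiple network devices in a computer network.
Optionally, in this embodiment, the processor may be configured to perform the following steps through the computer program:
S1: Determine a video frame to be decoded.
S2: Obtain, from video frames that have been decoded, a first number and a second number of a target reference frame associated with the video frame to be decoded, where the first number is the number of blocks in the target reference frame that use intra-frame decoding and the second number is the number of blocks in the target reference frame that use inter-frame decoding.
S3: Determine, according to the relationship between the first number and the second number, a target resolution for decoding the video frame to be decoded.
S4: Decode the video frame to be decoded at the target resolution.
Optionally, a person of ordinary skill in the art can understand that the structure shown in FIG. 11 is only illustrative. The electronic device may also be a terminal device such as a smartphone (for example, an Android phone or an iOS phone), a tablet computer, a palmtop computer, a mobile Internet device (MID), or a PAD. FIG. 11 does not limit the structure of the electronic device. For example, the electronic device may include more or fewer components (such as a network interface) than shown in FIG. 11, or have a configuration different from that shown in FIG. 11.
The memory 1102 may be used to store software programs and modules, such as the program instructions/modules corresponding to the video decoding method and apparatus in the embodiments of this application. The processor 1104 runs the software programs and modules stored in the memory 1102 to execute various functional applications and data processing, that is, to implement the foregoing video decoding method. The memory 1102 may include a high-speed random access memory and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 1102 may further include memories located remotely from the processor 1104, and these remote memories may be connected to the terminal through a network. Examples of such a network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations of them. The memory 1102 may specifically be, but is not limited to being, used to store information such as video frames to be decoded. As an example, as shown in FIG. 11, the memory 1102 may include, but is not limited to, the decoding determination module 902, the acquisition module 904, the first processing module 906, and the decoding module 908 of the foregoing video decoding apparatus. It may also include, but is not limited to, other module units of the video decoding apparatus, which are not repeated in this example.
Optionally, the transmission device 1106 is used to receive or send data via a network. Specific examples of the network may include wired networks and wireless networks. In one example, the transmission device 1106 includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices and a router through a network cable so as to communicate with the Internet or a local area network. In one example, the transmission device 1106 is a radio frequency (RF) module, which communicates with the Internet wirelessly.
In addition, the electronic device further includes: a display 1108, configured to display the decoded video; and a connection bus 1110, configured to connect the module components of the electronic device.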
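According to yet another aspect of the embodiments of this application, an electronic device for implementing the foregoing video encoding method is further provided. As shown in FIG. 12, the electronic device includes a memory 1202 and a processor 1204. The memory 1202 stores a computer program, and the processor 1204 is configured to perform the steps in any one of the foregoing method embodiments through the computer program.
Optionally, in this embodiment, the electronic device may be located in at least one of multiple network devices in a computer network.
Optionally, in this embodiment, the processor may be configured to perform the following steps through the computer program:
S1: Determine a video frame to be encoded.
S2: Obtain, from video frames that have been encoded, a first number and a second number of a target reference frame associated with the video frame to be encoded, where the first number is the number of blocks in the target reference frame that use intra-frame encoding and the second number is the number of blocks in the target reference frame that use inter-frame encoding.
S3: Determine, according to the relationship between the first number and the second number, a target resolution for encoding the video frame to be encoded.
S4: Encode the video frame to be encoded at the target resolution.
Optionally, a person of ordinary skill in the art can understand that the structure shown in FIG. 12 is only illustrative. The electronic device may also be a terminal device such as a smartphone (for example, an Android phone or an iOS phone), a tablet computer, a palmtop computer, a mobile Internet device (MID), or a PAD. FIG. 12 does not limit the structure of the electronic device. For example, the electronic device may include more or fewer components (such as a network interface) than shown in FIG. 12, or have a configuration different from that shown in FIG. 12.
The memory 1202 may be used to store software programs and modules, such as the program instructions/modules corresponding to the video encoding method and apparatus in the embodiments of this application. The processor 1204 runs the software programs and modules stored in the memory 1202 to execute various functional applications and data processing, that is, to implement the foregoing video encoding method. The memory 1202 may include a high-speed random access memory and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 1202 may further include memories located remotely from the processor 1204, and these remote memories may be connected to the terminal through a network. Examples of such a network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations of them. The memory 1202 may specifically be, but is not limited to being, used to store information such as video frames to be encoded. As an example, as shown in FIG. 12, the memory 1202 may include, but is not limited to, the encoding determination module 1002, the acquisition module 1004, the processing module 1006, and the encoding module 1008 of the foregoing video encoding apparatus. It may also include, but is not limited to, other module units of the video encoding apparatus, which are not repeated in this example.
Optionally, the transmission device 1206 is used to receive or send data via a network. Specific examples of the network may include wired networks and wireless networks. In one example, the transmission device 1206 includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices and a router through a network cable so as to communicate with the Internet or a local area network. In one example, the transmission device 1206 is a radio frequency (RF) module, which communicates with the Internet wirelessly.
In addition, the electronic device further includes: a display 1208, configured to display the video before encoding; and a connection bus 1210, configured to connect the module components of the electronic device.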
本申请的实施例还提供了一种存储介质,该存储介质中存储有计算机程序,其中,该计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。
可选地,在本实施例中,上述存储介质可以被设置为存储用于执行以下步骤的计算机程序:
S1,确定待解码视频帧;
S2,从已完成解码的视频帧中,获取待解码视频帧关联的目标参考帧的第一数量和第二数量,所述第一数量为所述目标参考帧中采用帧内解码的块的数量,所述第二数量为所述目标参考帧中采用帧间解码的块的数量;
S3,根据第一数量和第二数量的关系确定用于对待解码视频帧进行解码的目标分辨率;
S4,采用目标分辨率对待解码视频帧进行解码。
可选地,存储介质还被设置为存储用于执行以下步骤的计算机程序:
S1,确定待编码视频帧;
S2,从已完成编码的视频帧中,获取待编码视频帧关联的目标参考帧的第一数量和第二数量,所述第一数量为所述目标参考帧中采用帧内编码的块的数量,所述第二数量为所述目标参考帧中采用帧间编码的块的数量;
S3,根据第一数量和第二数量的关系确定用于对待编码视频帧进行编码的目标分辨率;
S4,采用目标分辨率对待编码视频帧进行编码。
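Optionally, the storage medium is further configured to store a computer program for performing the steps included in the methods of the foregoing embodiments, which are not described again in this embodiment.
The embodiments of this application further provide a computer program product including instructions that, when run on a computer, cause the computer to perform the video decoding method or the video encoding method provided in the foregoing embodiments.
Optionally, in this embodiment, a person of ordinary skill in the art may understand that all or some of the steps of the methods in the foregoing embodiments may be completed by a program instructing hardware related to a terminal device. The program may be stored in a computer-readable storage medium, and the storage medium may include a flash drive, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, an optical disc, or the like.
The sequence numbers of the foregoing embodiments of this application are for description only and do not indicate the relative merits of the embodiments.
When the integrated unit in the foregoing embodiments is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in the foregoing computer-readable storage medium. Based on this understanding, the technical solution of this application essentially, or the part contributing to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for causing one or more computer devices (which may be personal computers, servers, network devices, or the like) to perform all or some of the steps of the methods in the embodiments of this application.
In the foregoing embodiments of this application, the description of each embodiment has its own emphasis. For a part not detailed in one embodiment, refer to the related descriptions of other embodiments.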
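In the several embodiments provided in this application, it should be understood that the disclosed client may be implemented in other manners. The apparatus embodiments described above are merely illustrative. For example, the division into units is merely a division by logical function, and other divisions are possible in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, units, or modules, and may be electrical or in other forms.
Units described as separate components may or may not be physically separate, and components displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of this application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
The foregoing are merely preferred implementations of this application. It should be noted that a person of ordinary skill in the art may make several improvements and refinements without departing from the principle of this application, and these improvements and refinements shall also fall within the protection scope of this application.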

Claims (17)

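  1. A video decoding method, performed by a video processing device, the method comprising:
    determining a to-be-decoded video frame;
    obtaining, from video frames that have been decoded, a first quantity and a second quantity of a target reference frame associated with the to-be-decoded video frame, the first quantity being the number of intra-decoded blocks in the target reference frame, and the second quantity being the number of inter-decoded blocks in the target reference frame;
    determining, according to the relationship between the first quantity and the second quantity, a target resolution for decoding the to-be-decoded video frame; and
    decoding the to-be-decoded video frame at the target resolution.
  2. The method according to claim 1, wherein the determining, according to the relationship between the first quantity and the second quantity, a target resolution for decoding the to-be-decoded video frame comprises:
    decoding the to-be-decoded video frame at a first resolution when the ratio of the first quantity to the second quantity is greater than a predetermined threshold; and
    decoding the to-be-decoded video frame at a second resolution when the ratio of the first quantity to the second quantity is less than the predetermined threshold, wherein the second resolution is greater than the first resolution.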
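  3. The method according to claim 2, further comprising, before the decoding the to-be-decoded video frame at a first resolution:
    determining, when the ratio of the first quantity to the second quantity is greater than the predetermined threshold, a group of thresholds from a value range greater than the predetermined threshold;
    comparing the ratio of the first quantity to the second quantity with each threshold in the group of thresholds;
    determining a sampling ratio according to a result of the comparison; and
    downsampling the second resolution according to the sampling ratio to determine the first resolution.
  4. The method according to claim 3, wherein the determining a sampling ratio according to a result of the comparison comprises:
    obtaining, when the ratio of the first quantity to the second quantity is determined to fall within a target interval, a target sampling ratio configured for the target interval, wherein the target interval comprises the values between an adjacent first threshold and second threshold in the group of thresholds, and the first threshold is less than the second threshold; and
    the downsampling the second resolution according to the sampling ratio comprises:
    downsampling the width and height of the second resolution according to the target sampling ratio to obtain the first resolution.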
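  5. The method according to claim 1, wherein the obtaining, from video frames that have been decoded, a first quantity and a second quantity of a target reference frame associated with the to-be-decoded video frame comprises:
    determining one reference frame from the video frames that have been decoded as the target reference frame; and
    obtaining the first quantity of intra-decoded blocks and the second quantity of inter-decoded blocks in the target reference frame.
  6. The method according to claim 5, wherein the determining one reference frame from the video frames that have been decoded as the target reference frame comprises:
    determining, when the to-be-decoded video frame is the t-th video frame, the (t-k)-th video frame as the target reference frame; and
    the obtaining the first quantity of intra-decoded blocks and the second quantity of inter-decoded blocks in the target reference frame comprises:
    obtaining, from the (t-k)-th video frame, the first quantity of intra-decoded blocks and the second quantity of inter-decoded blocks, wherein k is a predetermined positive integer, t is a positive integer, and t>k.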
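  7. The method according to claim 1, wherein the obtaining, from video frames that have been decoded, a first quantity and a second quantity of a target reference frame associated with the to-be-decoded video frame comprises:
    determining a plurality of reference frames from the video frames that have been decoded as the target reference frame; and
    obtaining the first quantity of intra-decoded blocks in the target reference frame and the second quantity of inter-decoded blocks in the target reference frame, the first quantity being the total number of intra-decoded blocks in the plurality of reference frames, and the second quantity being the total number of inter-decoded blocks in the plurality of reference frames.
  8. The method according to claim 7, wherein the determining a plurality of reference frames from the video frames that have been decoded as the target reference frame comprises:
    determining, from the video frames that have been decoded, a plurality of consecutive video frames or a plurality of non-consecutive video frames as the target reference frame.
  9. The method according to claim 7, wherein the determining a plurality of reference frames from the video frames that have been decoded as the target reference frame comprises:
    determining, when the to-be-decoded video frame is the t-th video frame, the (t-k1)-th video frame to the (t-k2)-th video frame as the target reference frame; and
    the obtaining the first quantity of intra-decoded blocks in the target reference frame and the second quantity of inter-decoded blocks in the target reference frame comprises:
    obtaining, from the (t-k1)-th video frame to the (t-k2)-th video frame, the first quantity of intra-decoded blocks and the second quantity of inter-decoded blocks, wherein k1 and k2 are predetermined positive integers, t is a positive integer, and t>k1>k2.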
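  10. The method according to claim 1, wherein the target resolution is further determined in the following manner:
    parsing a bitstream to obtain a decoding flag bit corresponding to the to-be-decoded video frame; and
    obtaining the target resolution indicated by the decoding flag bit.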
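  11. A video encoding method, performed by a video processing device, the method comprising:
    determining a to-be-encoded video frame;
    obtaining, from video frames that have been encoded, a first quantity and a second quantity of a target reference frame associated with the to-be-encoded video frame, the first quantity being the number of intra-encoded blocks in the target reference frame, and the second quantity being the number of inter-encoded blocks in the target reference frame;
    determining, according to the relationship between the first quantity and the second quantity, a target resolution for encoding the to-be-encoded video frame; and
    encoding the to-be-encoded video frame at the target resolution.
  12. The method according to claim 11, wherein the determining, according to the relationship between the first quantity and the second quantity, a target resolution for encoding the to-be-encoded video frame comprises:
    encoding the to-be-encoded video frame at a first resolution when the ratio of the first quantity to the second quantity is greater than a predetermined threshold; and
    encoding the to-be-encoded video frame at a second resolution when the ratio of the first quantity to the second quantity is less than the predetermined threshold, wherein the second resolution is greater than the first resolution.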
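  13. A video decoding apparatus, comprising:
    a decoding determining module, configured to determine a to-be-decoded video frame;
    an obtaining module, configured to obtain, from video frames that have been decoded, a first quantity and a second quantity of a target reference frame associated with the to-be-decoded video frame, the first quantity being the number of intra-decoded blocks in the target reference frame, and the second quantity being the number of inter-decoded blocks in the target reference frame;
    a first processing module, configured to determine, according to the relationship between the first quantity and the second quantity, a target resolution for decoding the to-be-decoded video frame; and
    a decoding module, configured to decode the to-be-decoded video frame at the target resolution.
  14. A video encoding apparatus, comprising:
    an encoding determining module, configured to determine a to-be-encoded video frame;
    an obtaining module, configured to obtain, from video frames that have been encoded, a first quantity and a second quantity of a target reference frame associated with the to-be-encoded video frame, the first quantity being the number of intra-encoded blocks in the target reference frame, and the second quantity being the number of inter-encoded blocks in the target reference frame;
    a processing module, configured to determine, according to the relationship between the first quantity and the second quantity, a target resolution for encoding the to-be-encoded video frame; and
    an encoding module, configured to encode the to-be-encoded video frame at the target resolution.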
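  15. A computer-readable storage medium, storing a computer program, wherein the computer program is configured to perform, when run, the video decoding method according to any one of claims 1 to 10 or the video encoding method according to claim 11 or 12.
  16. A video processing device, comprising:
    a processor, a communication interface, a memory, and a communication bus;
    wherein the processor, the communication interface, and the memory communicate with one another through the communication bus, and the communication interface is an interface of a communication module;
    the memory is configured to store program code and transmit the program code to the processor; and
    the processor is configured to invoke instructions of the program code in the memory to perform the video decoding method according to any one of claims 1 to 10 or the video encoding method according to claim 11 or 12.
  17. A computer program product comprising instructions that, when run on a computer, cause the computer to perform the video decoding method according to any one of claims 1 to 10 or the video encoding method according to claim 11 or 12.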
PCT/CN2020/114165 2019-09-27 2020-09-09 Video encoding and decoding method and related apparatus WO2021057478A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US17/503,164 US20220078454A1 (en) 2019-09-27 2021-10-15 Video encoding method, video decoding method, and related apparatuses

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910927110.5A CN110677721B (zh) 2019-09-27 2019-09-27 Video encoding and decoding method and apparatus, and storage medium
CN201910927110.5 2019-09-27

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US17/503,164 Continuation US20220078454A1 (en) 2019-09-27 2021-10-15 Video encoding method, video decoding method, and related apparatuses

Publications (1)

Publication Number Publication Date
WO2021057478A1 (zh) 2021-04-01

Family

ID=69079643

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/114165 WO2021057478A1 (zh) 2019-09-27 2020-09-09 Video encoding and decoding method and related apparatus

Country Status (3)

Country Link
US (1) US20220078454A1 (zh)
CN (1) CN110677721B (zh)
WO (1) WO2021057478A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110650357B (zh) * 2019-09-27 2023-02-10 腾讯科技(深圳)有限公司 Video decoding method and apparatus

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
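CN102438135A (zh) * 2011-01-14 2012-05-02 展讯通信(上海)有限公司 Downsampling decoding method and apparatus for high-resolution video on small-size display devices
CN102461169A (zh) * 2009-06-12 2012-05-16 微软公司 Motion-based dynamic resolution multiple bit rate video encoding
US20140241420A1 * 2013-02-28 2014-08-28 Divx, Llc Systems and Methods of Encoding Multiple Video Streams for Adaptive Bitrate Streaming
CN105959700A (zh) * 2016-05-31 2016-09-21 腾讯科技(深圳)有限公司 Video image encoding method and apparatus
CN108769681A (zh) * 2018-06-20 2018-11-06 腾讯科技(深圳)有限公司 Video encoding and decoding methods and apparatuses, computer device, and storage medium
CN108924553A (zh) * 2018-06-20 2018-11-30 腾讯科技(深圳)有限公司 Video encoding and decoding methods and apparatuses, computer device, and storage medium
CN109660796A (zh) * 2018-11-09 2019-04-19 建湖云飞数据科技有限公司 Method for encoding video frames
CN110636312A (zh) * 2019-09-27 2019-12-31 腾讯科技(深圳)有限公司 Video encoding and decoding method and apparatus, and storage medium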

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102111620B (zh) * 2009-12-23 2013-08-07 富士通株式会社 Frame rate downsampling transcoding method and apparatus
KR101776809B1 (ko) * 2010-03-09 2017-09-08 스프레드트럼 커뮤니케이션즈 (상하이) 컴퍼니 리미티드 Adaptive video decoding circuitry and techniques
JP2012205619A (ja) * 2011-03-29 2012-10-25 Olympus Medical Systems Corp Image processing device, control device, endoscope device, image processing method, and image processing program
EP2890131A1 (en) * 2013-12-31 2015-07-01 Patents Factory Ltd. Sp. z o.o. Video coding with different spatial resolutions for intra-coded frames and inter-coded frames
CN107155107B (zh) * 2017-03-21 2018-08-03 腾讯科技(深圳)有限公司 Video encoding method and apparatus, and video decoding method and apparatus
CN108833918B (zh) * 2018-06-20 2021-09-17 腾讯科技(深圳)有限公司 Video encoding method, decoding method, apparatus, computer device, and storage medium
US11166028B2 (en) * 2019-02-01 2021-11-02 Comcast Cable Communications, Llc Methods and systems for providing variable bitrate content

Also Published As

Publication number Publication date
US20220078454A1 (en) 2022-03-10
CN110677721B (zh) 2022-09-13
CN110677721A (zh) 2020-01-10

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20868728

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20868728

Country of ref document: EP

Kind code of ref document: A1