WO2017211306A1 - Method for decoding a video data compressed code stream, and method and device for encoding video data - Google Patents

Method for decoding a video data compressed code stream, and method and device for encoding video data

Info

Publication number
WO2017211306A1
Authority
WO
WIPO (PCT)
Prior art keywords
sampling format
sampling
decoding
encoding
image
Prior art date
Application number
PCT/CN2017/087482
Other languages
English (en)
French (fr)
Inventor
林涛
李明
吴钊
吴平
Original Assignee
同济大学
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 同济大学, 中兴通讯股份有限公司
Publication of WO2017211306A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132 Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Definitions

  • the present invention relates to the field of data processing, and in particular to a method for decoding a video data compressed code stream, and a method and device for encoding video data.
  • a data set (e.g., a file, a frame of an image, or a video sequence) is a collection of data elements (e.g., bytes, bits, or pixels).
  • the data set is divided into subsets, each having a predetermined shape and size (i.e., number of elements) and called a coding block (from the decoding perspective, a decoding block; collectively, a codec block), and is encoded or decoded in units of codec blocks.
  • the coding block being encoded is referred to as the current coding block, and the decoding block being decoded is referred to as the current decoding block.
  • the current coding block or the current decoding block is collectively referred to as the current codec block, or simply the current block.
  • the data element (element for short) being encoded or decoded is referred to as the current encoding data element or the current decoding data element, collectively the current data element, or simply the current element.
  • an element consists of N components (usually 1 ≤ N ≤ 5), so the data set and the codec block also consist of N components.
  • for example, the elements of a frame of an image, that is, its pixels, are arranged in a rectangle with a size (resolution) of 1920 (width) × 1080 (height), and each consists of three components: an R (red) component, a G (green) component, and a B (blue) component.
  • both the data set and the codec block as the encoding object have only one fixed sampling format and size.
  • in the sampling format called 4:4:4, all three components of the data set have the same sampling rate and size (i.e., the number of component samples).
  • a sampling format called 4:2:0 is usually used, in which the sampling rate and size of two components (the D component and the E component) of a data set (such as an image or video) having a rectangular shape and three components are each one quarter of those of the other component (the F component).
  • in this format, one D component D[i][j] and one E component E[i][j] correspond to four (2×2) F components F[2i][2j], F[2i+1][2j], F[2i][2j+1], F[2i+1][2j+1].
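The 4:2:0 index correspondence can be checked with a short sketch. The function names `luma_positions_420` and `plane_sizes_420` are hypothetical and not part of the source, and the sketch assumes even width and height; it only illustrates the 2×2 mapping and the one-quarter sample count stated in the text.

```python
def luma_positions_420(i, j):
    """Indices of the four F samples that share D[i][j] and E[i][j] in 4:2:0."""
    return [(2 * i, 2 * j), (2 * i + 1, 2 * j),
            (2 * i, 2 * j + 1), (2 * i + 1, 2 * j + 1)]


def plane_sizes_420(width, height):
    """Sample counts of the F, D and E components (assumes even dimensions)."""
    f_count = width * height
    d_count = (width // 2) * (height // 2)
    return {"F": f_count, "D": d_count, "E": d_count}


sizes = plane_sizes_420(1920, 1080)
print(luma_positions_420(0, 0))
print(sizes["F"] // sizes["D"])  # ratio of F samples to D samples
```

For a 1920 × 1080 frame, the F component holds four times as many samples as the D or E component, which matches the one-quarter relationship.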
  • the D component sum
  • in the sampling format called 4:2:2, the sampling rate and size of two components (the D component and the E component) of a data set (such as an image or video) having a rectangular shape and three components are each one half of those of the other component (the F component).
  • in this format, one D component D[i][j] and one E component E[i][j] correspond to two (2×1) F components.
  • the F, D, and E components described above are the Y, U, and V components, respectively.
  • the F, D, and E components described above are G, B, and R components, respectively.
  • alternatively, the data set and the codec block as the encoding object have multiple sampling formats and sizes, but encoding objects of different sampling formats and sizes are encoded in the same encoding mode, or each of the different encoding modes always encodes encoding objects of a single sampling format and size.
  • different sampling formats mean that at least one component of the data set and/or codec block has a different sampling rate and size (ie, the number of component samples). Different sampling rates and sizes are converted to each other by upsampling operations or downsampling operations.
  • the upsampling operation is an operation that increases the number of samples.
  • the downsampling operation is an operation that reduces the number of samples. Therefore, different sampling formats mean that at least one component of the data set and/or the codec block has a different number of samples.
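A minimal sketch of converting one component between sample counts follows. The source does not specify any filter, so the 2×2 rounded averaging in `downsample_2x2` and the sample replication in `upsample_2x2` are assumptions chosen for brevity, and both function names are hypothetical. The input plane is assumed to have even width and height.

```python
def downsample_2x2(plane):
    """Reduce the sample count by 4: each output sample is the rounded mean of a 2x2 group."""
    height, width = len(plane), len(plane[0])
    return [
        [(plane[y][x] + plane[y][x + 1] + plane[y + 1][x] + plane[y + 1][x + 1] + 2) // 4
         for x in range(0, width, 2)]
        for y in range(0, height, 2)
    ]


def upsample_2x2(plane):
    """Increase the sample count by 4: each input sample is replicated into a 2x2 group."""
    out = []
    for row in plane:
        wide = [value for value in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


full = [[1, 3, 5, 7],
        [1, 3, 5, 7]]
small = downsample_2x2(full)      # 8 samples become 2
restored = upsample_2x2(small)    # 2 samples become 8 again
print(small, restored)
```

The round trip restores the sample count but not necessarily the original values, which is why the choice of sampling format affects both bit rate and fidelity.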
  • a single sampling format and size, together with an inherently single encoding method, greatly limit the improvement of data compression efficiency.
  • the embodiments of the invention provide a method for decoding a video data compressed code stream, and a method and a device for encoding video data, so as to at least solve the technical problem in the related art that efficiency is too low when a single format and decoding mode are used.
  • a method for decoding a video data compressed code stream is provided, comprising: parsing the video data compressed code stream to acquire sampling format information and/or decoding mode information; selecting, according to the sampling format information and/or decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a predetermined plurality of sampling formats and decoding modes; and decoding the decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.
  • the video data compressed code stream comprises a compressed code stream of at least one of the following kinds of data: one-dimensional data; two-dimensional data; multi-dimensional data of more than two dimensions; images; image sequences; video; audio; files; bytes; bits; pixels; data consisting of three components; an image having a rectangular shape; an image sequence having a rectangular shape; an image consisting of three components; an image sequence consisting of three components; a video consisting of three components; an image consisting of an R component, a G component, and a B component; an image sequence consisting of an R component, a G component, and a B component; a video consisting of an R component, a G component, and a B component; an image consisting of one luminance component and two chrominance components; an image sequence consisting of one luminance component and two chrominance components; a video consisting of one luminance component and two chrominance components; and a coded block of data.
  • the decoding block is a decoding region of an image, where the decoding region includes at least one of: a sub-image of the image, a macroblock, a largest coding unit (LCU), a coding tree unit (CTU), a coding unit (CU), a sub-region of a CU, a prediction unit (PU), and a transform unit (TU).
  • the multiple sampling formats include a primary sampling format and other sampling formats, wherein the other sampling formats are obtained by applying a sampling operation to the primary sampling format.
  • the video data compressed code stream is a compressed code stream of an image or image sequence having a rectangular shape and three components.
  • the multiple sampling formats are the 4:4:4 sampling format and the 4:2:0 sampling format; or the 4:4:4 sampling format and the 4:2:2 sampling format; or the 4:2:2 sampling format and the 4:2:0 sampling format.
  • the decoding mode corresponding to the 4:2:0 sampling format includes: generating a data version in the 4:2:0 sampling format, and converting that data version into a data version in the 4:4:4 or 4:2:2 sampling format by an upsampling operation, wherein generating the data version in the 4:2:0 sampling format comprises: performing intra prediction from the neighboring pixels of the decoding block, and/or performing inter prediction from the neighboring images of the decoded image.
  • the decoding mode corresponding to the 4:4:4 or 4:2:2 sampling format includes: generating a data version in the 4:4:4 or 4:2:2 sampling format by a prediction operation, and converting that data version into a data version in the 4:2:0 sampling format by a downsampling operation.
  • the decoding mode includes at least one of: performing intra prediction according to neighboring pixels of the decoding block; performing inter prediction according to neighboring images of the decoded image; performing inter-frame transform according to neighboring images of the decoded image; scaling; general-purpose string prediction; palette decoding; dictionary decoding; entropy decoding.
  • the method further includes: parsing the video data compressed code stream, and obtaining a first flag bit from one of the following positions: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, a CTU header, a CU header, and a decoding block header, wherein the first flag bit indicates that decoding using multiple sampling formats and/or corresponding decoding modes is allowed.
  • the method further includes: parsing the video data compressed code stream, and acquiring a second flag bit from at least one of the following positions: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a decoding block header, wherein the second flag bit indicates that decoding blocks using the 4:4:4 sampling format and/or the corresponding string prediction decoding mode are allowed.
  • the method further includes: parsing the video data compressed code stream, and acquiring a third flag bit from at least one of the following positions: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a decoding block header, wherein the third flag bit indicates that decoding blocks using the 4:2:2 sampling format and/or the corresponding string prediction decoding mode are allowed.
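As an illustration of reading the three flag bits, the sketch below parses them from the start of a header. The source only names the positions where the flags may appear; the one-bit width, the consecutive layout, and the names `BitReader`, `parse_format_flags`, and the dictionary keys are all assumptions made for this example.

```python
class BitReader:
    """Reads single bits, most significant bit first, from a bytes object."""

    def __init__(self, data):
        self.data = data
        self.pos = 0

    def read_bit(self):
        byte = self.data[self.pos // 8]
        bit = (byte >> (7 - self.pos % 8)) & 1
        self.pos += 1
        return bit


def parse_format_flags(reader):
    """Hypothetical layout: first, second and third flag bits stored back to back."""
    return {
        "multi_format_allowed": bool(reader.read_bit()),   # first flag bit
        "allow_444_string": bool(reader.read_bit()),       # second flag bit
        "allow_422_string": bool(reader.read_bit()),       # third flag bit
    }


header = bytes([0b10100000])
flags = parse_format_flags(BitReader(header))
print(flags)
```

A real bitstream would define where each flag sits within the parameter set or header; the point here is only that each flag gates whether a given sampling format and decoding mode may appear later in the stream.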
  • one of the predetermined plurality of sampling formats and decoding modes corresponds to a predetermined value k, and the video data compressed code stream contains, for the decoding block, a direct, indirect, or mixed direct-and-indirect sampling format and corresponding decoding mode identification code.
  • the direct sampling format and corresponding decoding mode identification code consists of one or more bit strings in the video data compressed code stream;
  • the indirect sampling format and corresponding decoding mode identification code is an identification code derived from the decoding mode parameter and/or from syntax elements of the video data compressed code stream other than the syntax element corresponding to the decoding mode parameter;
  • the mixed direct-and-indirect sampling format and corresponding decoding mode identification code is partly direct and partly indirect.
  • the sampling format and corresponding decoding mode identification code is obtained from the video data compressed code stream in one of the following syntax element orders: a decoding block header information syntax element, a sampling format and corresponding decoding mode identification code syntax element, an additional decoding block header information syntax element, and a decoding block data syntax element; or a decoding block header information syntax element, one part of the sampling format and corresponding decoding mode identification code syntax element, an additional decoding block header information syntax element, one part of the decoding block data syntax element, another part of the sampling format and corresponding decoding mode identification code syntax element, and another part of the decoding block data syntax element; wherein, when the value of the identification code syntax element equals a specified value, the decoding block is decoded using the sampling format and corresponding decoding mode associated with that specified value.
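The difference between a direct and an indirect identification code can be sketched as follows. The table contents, the value assignments for k, and the names `FORMAT_MODE_TABLE`, `direct_code`, `indirect_code`, and `select` are hypothetical; the source does not fix any particular mapping.

```python
# Hypothetical table: each predetermined value k names one
# (sampling format, decoding mode) pair.
FORMAT_MODE_TABLE = {
    0: ("4:4:4", "string_prediction"),
    1: ("4:2:0", "intra_prediction"),
    2: ("4:2:0", "inter_prediction"),
}


def direct_code(bits):
    """Direct form: the code is a bit string read from the stream itself."""
    return int(bits, 2)


def indirect_code(decoding_mode):
    """Indirect form: the code is derived from an already parsed parameter."""
    for k, (_, mode) in FORMAT_MODE_TABLE.items():
        if mode == decoding_mode:
            return k
    raise ValueError("unknown decoding mode: " + decoding_mode)


def select(k):
    """Return the sampling format and decoding mode associated with value k."""
    return FORMAT_MODE_TABLE[k]


print(select(direct_code("01")))
print(select(indirect_code("string_prediction")))
```

A mixed code would combine both paths, for example reading some bits directly and inferring the remainder from other syntax elements.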
  • a method of encoding video data is provided, comprising: selecting a first sampling format from a predetermined plurality of sampling formats, and selecting an encoding mode corresponding to the first sampling format from a predetermined plurality of encoding modes; and encoding the coding block of the video data using the selected first sampling format and the selected encoding mode to generate a video data compressed code stream, wherein the video data compressed code stream includes syntax elements corresponding to the first sampling format and/or the encoding mode.
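The encoder-side selection can be sketched as a search over candidate pairs. The source does not state a selection criterion, so the cost function `toy_cost` below is a purely illustrative assumption, as are the candidate list and the names `CANDIDATES` and `choose_format_and_mode`.

```python
# Hypothetical candidate (sampling format, encoding mode) pairs.
CANDIDATES = [
    ("4:4:4", "string_prediction"),
    ("4:2:0", "intra_prediction"),
]


def toy_cost(block, sampling_format, coding_mode):
    """Illustrative cost only: string prediction is cheap when few distinct values occur."""
    distinct = len({value for row in block for value in row})
    return distinct if coding_mode == "string_prediction" else 4


def choose_format_and_mode(block, cost=toy_cost):
    """Pick the lowest-cost pair and report the identifier to signal in the stream."""
    k, (fmt, mode) = min(enumerate(CANDIDATES),
                         key=lambda item: cost(block, *item[1]))
    return {"format_mode_id": k, "sampling_format": fmt, "coding_mode": mode}


flat_block = [[7, 7], [7, 9]]            # two distinct values
busy_block = [[1, 2, 3], [4, 5, 6]]      # six distinct values
print(choose_format_and_mode(flat_block))
print(choose_format_and_mode(busy_block))
```

Whatever criterion an encoder actually uses, the chosen pair must be written to the stream as a syntax element so the decoder can make the same selection.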
  • the video data comprises at least one of the following: one-dimensional data; two-dimensional data; multi-dimensional data of more than two dimensions; images; image sequences; video; audio; files; bytes; bits; pixels; data consisting of three components; an image having a rectangular shape; an image sequence having a rectangular shape; an image consisting of three components; an image sequence consisting of three components; a video consisting of three components; an image consisting of an R component, a G component, and a B component; an image sequence consisting of an R component, a G component, and a B component; a video consisting of an R component, a G component, and a B component; an image consisting of one luminance component and two chrominance components; an image sequence consisting of one luminance component and two chrominance components; a video consisting of one luminance component and two chrominance components; and a coded block of data.
  • the coding block is a coding region of an image, where the coding region includes at least one of: a sub-image of the image, a macroblock, a largest coding unit (LCU), a coding tree unit (CTU), a coding unit (CU), a sub-region of a CU, a prediction unit (PU), and a transform unit (TU).
  • the multiple sampling formats include a primary sampling format and other sampling formats, wherein the other sampling formats are obtained by applying a sampling operation to the primary sampling format.
  • the video data is an image or image sequence having a rectangular shape and three components.
  • the multiple sampling formats are the 4:4:4 sampling format and the 4:2:0 sampling format; or the 4:4:4 sampling format and the 4:2:2 sampling format; or the 4:2:2 sampling format and the 4:2:0 sampling format.
  • the encoding mode corresponding to the 4:2:0 sampling format includes: generating a data version in the 4:2:0 sampling format, and converting that data version into a data version in the 4:4:4 or 4:2:2 sampling format by an upsampling operation, wherein generating the data version in the 4:2:0 sampling format comprises: performing intra prediction from the neighboring pixels of the coding block, and/or performing inter prediction from the neighboring images of the coded image.
  • the encoding mode corresponding to the 4:4:4 or 4:2:2 sampling format includes: generating a data version in the 4:4:4 or 4:2:2 sampling format by a prediction operation, and converting that data version into a data version in the 4:2:0 sampling format by a downsampling operation.
  • the coding manner includes at least one of: performing intra prediction according to neighboring pixels of the coding block; performing inter prediction according to neighboring images of the coded image; and performing frame according to neighboring images of the coded image.
  • the method further includes: including a first flag bit in one of the following positions of the video data compressed code stream: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, a CTU header, a CU header, and a coding block header, wherein the first flag bit indicates that encoding using multiple sampling formats and/or corresponding encoding modes is allowed.
  • the method further includes: including a second flag bit in at least one of the following positions of the video data compressed code stream: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a coding block header, wherein the second flag bit indicates that coding blocks using the 4:4:4 sampling format and/or the corresponding string prediction encoding mode are allowed.
  • the method further includes: including a third flag bit in at least one of the following positions of the video data compressed code stream: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a coding block header, wherein the third flag bit indicates that coding blocks using the 4:2:2 sampling format and/or the corresponding string prediction encoding mode are allowed.
  • one of the predetermined plurality of sampling formats and encoding modes corresponds to a predetermined value k, and a direct, indirect, or mixed direct-and-indirect sampling format and corresponding encoding mode identification code is set for the coding block, the identification code being included in the video data compressed code stream.
  • the direct sampling format and corresponding encoding mode identification code consists of one or more bit strings in the video data compressed code stream; the indirect sampling format and corresponding encoding mode identification code is an identification code derived from the selected encoding mode parameter and/or from syntax elements of the video data compressed code stream other than the syntax element corresponding to the encoding mode parameter.
  • the mixed direct-and-indirect sampling format and corresponding encoding mode identification code is partly direct and partly indirect.
  • the sampling format and corresponding encoding mode identification code is present in the video data compressed code stream in one of the following syntax element orders: a coding block header information syntax element, a sampling format and corresponding encoding mode identification code syntax element, an additional coding block header information syntax element, and a coding block data syntax element; or a coding block header information syntax element, one part of the sampling format and corresponding encoding mode identification code syntax element, an additional coding block header information syntax element, one part of the coding block data syntax element, another part of the sampling format and corresponding encoding mode identification code syntax element, and another part of the coding block data syntax element; wherein, when the value of the identification code syntax element equals a specified value, the coding block is encoded using the sampling format and corresponding encoding mode associated with that specified value.
  • a decoding apparatus for a video data compressed code stream is provided, comprising: a parsing module configured to parse the video data compressed code stream and obtain sampling format information and/or decoding mode information; a selection module configured to select, according to the sampling format information and/or decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a predetermined plurality of sampling formats and decoding modes; and a decoding module configured to decode the decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.
  • an encoding apparatus for video data is provided, comprising: a selection module configured to select a first sampling format from a predetermined plurality of sampling formats, and to select an encoding mode corresponding to the first sampling format from a predetermined plurality of encoding modes; and an encoding module configured to encode the coding block of the video data using the selected first sampling format and the selected encoding mode to generate a video data compressed code stream, wherein the video data compressed code stream includes syntax elements corresponding to the first sampling format and/or the encoding mode.
  • a storage medium is also provided.
  • the storage medium is configured to store program code for performing the following steps:
  • decoding the decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.
  • a storage medium is also provided.
  • the storage medium is arranged to store program code for performing the following steps:
  • the embodiments of the present invention select a sampling format and a corresponding decoding mode from among a predetermined plurality of sampling formats and decoding modes, thereby solving the technical problem in the related art that efficiency is too low when a single format and decoding mode are used, and increasing the decoding rate.
  • FIG. 1 is a flowchart of a method of decoding a video data compressed code stream according to an embodiment of the present invention.
  • FIG. 2 is a flowchart of a method of encoding video data according to an embodiment of the present invention.
  • FIG. 3 is a structural block diagram of a decoding apparatus for a video data compressed code stream according to an embodiment of the present invention.
  • FIG. 4 is a block diagram showing the structure of an encoding apparatus for video data according to an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of an encoding method according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of a decoding method according to an embodiment of the present invention.
  • FIG. 1 is a flowchart of a method for decoding a video data compressed code stream according to an embodiment of the present invention. As shown in FIG. 1, the process includes the following steps:
  • Step S102: parse the video data compressed code stream to acquire sampling format information and/or decoding mode information;
  • Step S104: select, according to the sampling format information and/or the decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a predetermined plurality of sampling formats and decoding modes;
  • Step S106: decode the decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.
  • through the above steps, a sampling format and a corresponding decoding mode are selected from among a predetermined plurality of sampling formats and decoding modes, thereby solving the technical problem in the related art that efficiency is too low when decoding with a single format and decoding mode, and improving the decoding rate.
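Steps S102 to S106 can be sketched end to end on a toy stream. Everything concrete here is an assumption for illustration: the one-byte identifier at the start of the block, the table `DECODER_TABLE`, and the two stand-in decoders `decode_full` and `decode_half` (the latter doubles each sample to mimic decoding followed by upsampling).

```python
def decode_full(payload):
    """Stand-in decoder for a full-rate format: samples are taken as stored."""
    return list(payload)


def decode_half(payload):
    """Stand-in decoder for a reduced-rate format: decode, then upsample by replication."""
    return [value for value in payload for _ in range(2)]


# Hypothetical mapping from the parsed identifier to (sampling format, decoder).
DECODER_TABLE = {
    0: ("4:4:4", decode_full),
    1: ("4:2:0", decode_half),
}


def decode_block(stream):
    identifier, payload = stream[0], stream[1:]        # S102: parse the stream
    sampling_format, decoder = DECODER_TABLE[identifier]  # S104: select format and mode
    return sampling_format, decoder(payload)           # S106: decode the block


print(decode_block(bytes([0, 5, 9])))
print(decode_block(bytes([1, 5, 9])))
```

Because the selection is made per block, neighboring blocks in the same image can use different sampling formats and decoding modes.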
  • the foregoing steps may be executed by a decoder or by a video processing device, such as a video receiving end or a video rendering device, but are not limited thereto.
  • the video data compressed code stream comprises a compressed code stream of at least one of the following kinds of data: one-dimensional data; two-dimensional data; multi-dimensional data of more than two dimensions; images; image sequences; video; audio; files; bytes; bits; pixels; data consisting of three components; an image having a rectangular shape; an image sequence having a rectangular shape; an image consisting of three components; an image sequence consisting of three components; a video consisting of three components; an image consisting of an R component, a G component, and a B component; an image sequence consisting of an R component, a G component, and a B component; a video consisting of an R component, a G component, and a B component; an image consisting of one luminance component and two chrominance components; an image sequence consisting of one luminance component and two chrominance components; and a video consisting of one luminance component and two chrominance components.
  • the decoding block is a decoding region of the image, where the decoding region includes at least one of the following: a sub-image of the image, a macroblock, a Largest Coding Unit (LCU), a Coding Tree Unit (CTU), a Coding Unit (CU), a sub-region of a CU, a Prediction Unit (PU), and a Transform Unit (TU).
  • the multiple sampling formats include a primary sampling format and other sampling formats, wherein the other sampling formats are obtained by applying a sampling operation to the primary sampling format.
  • the video data compressed code stream is a compressed code stream of an image or image sequence having a rectangular shape and three components.
  • the multiple sampling formats are the 4:4:4 sampling format and the 4:2:0 sampling format; or the 4:4:4 sampling format and the 4:2:2 sampling format; or the 4:2:2 sampling format and the 4:2:0 sampling format.
  • the decoding mode corresponding to the 4:2:0 sampling format includes: generating a data version in the 4:2:0 sampling format, and converting that data version into a data version in the 4:4:4 or 4:2:2 sampling format by an upsampling operation, wherein generating the data version in the 4:2:0 sampling format includes: performing intra prediction from neighboring pixels of the decoding block, and/or performing inter prediction from neighboring images of the decoded image; the decoding mode corresponding to the 4:4:4 or 4:2:2 sampling format includes: generating a data version in the 4:4:4 or 4:2:2 sampling format by a prediction operation, and converting that data version into a data version in the 4:2:0 sampling format by a downsampling operation.
  • the decoding mode includes at least one of: performing intra prediction according to neighboring pixels of the decoding block; performing inter prediction according to an adjacent image of the decoded image; performing interframe transform according to the adjacent image of the decoded image; scaling; prediction; palette decoding; dictionary decoding; entropy decoding.
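  • parsing the video data compressed code stream further includes: obtaining a first flag bit from one of the following positions: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, a CTU header, a CU header, and a decoding block header, wherein the first flag bit is used to indicate that decoding using multiple sampling formats and/or corresponding decoding modes is allowed.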
  • parsing the video data compressed code stream further includes: obtaining a second flag bit from at least one of the following positions: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a decoding block header, wherein the second flag bit is used to indicate that decoding blocks using the 4:4:4 sampling format and/or the corresponding string prediction decoding mode are allowed.
  • parsing the video data compressed code stream further includes: obtaining a third flag bit from at least one of the following positions: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a decoding block header, wherein the third flag bit is used to indicate that decoding blocks using the 4:2:2 sampling format and/or the corresponding string prediction decoding mode are allowed.
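The three flag bits described above can be read as gating which sampling formats a decoding block may use. The following is a minimal sketch of that reading, not the signalling defined by the application: the field names, the assumption that 4:2:0 is always available as a base format, and the assumption that the second and third flags only matter when the first flag is set are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class HeaderFlags:
    # Hypothetical field names; the text only calls these the first,
    # second and third flag bits.
    multi_format_enabled: bool  # first flag bit
    allow_444_string: bool      # second flag bit
    allow_422_string: bool      # third flag bit


def allowed_formats(flags, base_format="4:2:0"):
    """Return the set of sampling formats a decoding block may use.

    Assumption: the base format is always allowed, and the second and
    third flags are only honoured when the first flag is set.
    """
    formats = {base_format}
    if flags.multi_format_enabled:
        if flags.allow_444_string:
            formats.add("4:4:4")
        if flags.allow_422_string:
            formats.add("4:2:2")
    return formats
```

A decoder could consult such a set before interpreting any per-block identification code, so that codes for disabled formats are never parsed.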
  • each of the predetermined plurality of sampling formats and decoding modes corresponds to a predetermined value k, and the video data compressed code stream carries, for the decoding block, a sampling format and corresponding decoding mode identification code in direct, indirect, or mixed direct-and-indirect form, the value of the identification code corresponding to one of the predetermined values k.
  • the direct sampling format and corresponding decoding mode identification code consists of one or more bit strings in the video data compressed code stream; the indirect sampling format and corresponding decoding mode identification code is derived from parameters other than the decoding mode parameters.
  • the sampling format and corresponding decoding mode identification code is obtained from the following positions of the video data compressed code stream:
  • decoding block header information syntax element, partial sampling format and corresponding decoding mode identification code syntax element, additional decoding block header information syntax element, partial decoding block data syntax element, another partial sampling format and corresponding decoding mode identification code syntax element, another partial decoding block data syntax element;
  • the decoding block is decoded using the sampling format corresponding to the specified value and the corresponding decoding mode.
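To make the direct and indirect identification codes concrete, the sketch below reads a one-bit code from a byte string in the direct case and accepts a value already implied by other parameters in the indirect case. The table contents, the one-bit code width, and the names `BitReader`, `FORMAT_TABLE`, and `select_format` are illustrative assumptions, not syntax taken from the application.

```python
class BitReader:
    """Reads bits MSB-first from a bytes object."""

    def __init__(self, data):
        self.data = data
        self.pos = 0  # current bit position

    def read_bits(self, n):
        value = 0
        for _ in range(n):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value


# Hypothetical table: predetermined value k -> (sampling format, decoding mode).
FORMAT_TABLE = {
    0: ("4:2:0", "block_prediction"),
    1: ("4:4:4", "string_prediction"),
}


def select_format(reader, implied_id=None):
    """Return the (format, mode) pair for the current decoding block.

    Direct case: the identification code is read from the stream.
    Indirect case: the caller passes a value derived from other parameters,
    and no bits are consumed.
    """
    k = implied_id if implied_id is not None else reader.read_bits(1)
    return FORMAT_TABLE[k]
```

A mixed direct-and-indirect code would combine both paths, for example by reading some bits and deriving the rest.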
  • FIG. 2 is a flowchart of a method for encoding video data according to an embodiment of the present invention. As shown in FIG. 2, the process includes the following steps:
  • Step S202: select a first sampling format from a plurality of predetermined sampling formats, and select an encoding mode corresponding to the first sampling format from a plurality of predetermined encoding modes;
  • Step S204: encode the coding block of the video data using the selected first sampling format and the selected encoding mode to generate a video data compressed code stream, where the video data compressed code stream includes the first sampling format and/or the encoding mode, and syntax elements corresponding to the first sampling format and/or the encoding mode.
  • the execution body of the foregoing steps may be an encoder, a video processing device, such as a video sending end, a video distribution device, etc., but is not limited thereto.
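The two steps of FIG. 2 can be sketched as a small function. This is only an illustration under stated assumptions: the selection policy is passed in by the caller, the encoders are toy stand-ins, and the output dictionary stands in for the compressed code stream and its syntax elements.

```python
def encode_block(block, encoders, select_format):
    """Two-step flow of FIG. 2 for a single coding block."""
    # Step S202: choose the first sampling format and its encoding mode.
    sampling_format = select_format(block, sorted(encoders))
    encode = encoders[sampling_format]
    # Step S204: encode the block and emit the format as a syntax element.
    return {"sampling_format": sampling_format, "payload": encode(block)}


# Toy encoders, purely illustrative; real modes would predict and transform.
TOY_ENCODERS = {
    "4:2:0": lambda block: block[::2],   # pretend subsampling
    "4:4:4": lambda block: list(block),  # keep every sample
}
```

For example, `encode_block([9, 8, 7, 6], TOY_ENCODERS, lambda block, formats: formats[0])` selects the first listed format and returns its payload together with the chosen format.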
  • the video data comprises at least one of the following: one-dimensional data, two-dimensional data, multi-dimensional data of more than two dimensions, images, image sequences, video, audio, files, bytes, bits, pixels, data consisting of three components, an image having a rectangular shape, a sequence of images having a rectangular shape, an image consisting of three components, an image sequence consisting of three components, a video consisting of three components, an image consisting of an R component, a G component, and a B component, an image sequence consisting of an R component, a G component, and a B component, a video consisting of an R component, a G component, and a B component, an image consisting of one luminance component and two chrominance components, an image sequence consisting of one luminance component and two chrominance components, and a video consisting of one luminance component and two chrominance components.
  • the coding block is a coding region of the image, where the coding region includes at least one of the following: a sub-image of the image, a macroblock, a largest coding unit (LCU), a coding tree unit (CTU), a coding unit (CU), a sub-region of a CU, a prediction unit (PU), and a transform unit (TU).
  • the multiple sampling formats include a primary sampling format and other sampling formats, wherein the other sampling formats are sampling formats obtained by sampling operations of the primary sampling format.
  • the video data is an image or a sequence of images having a rectangular shape and three components.
  • the multiple sampling formats are the 4:4:4 sampling format and the 4:2:0 sampling format; or the 4:4:4 sampling format and the 4:2:2 sampling format; or the 4:2:2 sampling format and the 4:2:0 sampling format.
  • the encoding mode corresponding to the 4:2:0 sampling format includes: generating a data version in the 4:2:0 sampling format, and converting that data version into a data version in the 4:4:4 or 4:2:2 sampling format by an upsampling operation, wherein the data version in the 4:2:0 sampling format is generated by intra prediction from neighboring pixels of the coding block and/or by inter prediction from an adjacent image of the coded image; the encoding mode corresponding to the 4:4:4 or 4:2:2 sampling format includes: generating a data version in the 4:4:4 or 4:2:2 sampling format by a prediction operation, and converting that data version into a data version in the 4:2:0 sampling format by a downsampling operation.
  • the encoding mode includes at least one of: performing intra prediction according to neighboring pixels of the coding block; performing inter prediction according to an adjacent image of the coded image; performing interframe transform according to the adjacent image of the coded image; performing quantization; universal string prediction; palette coding; dictionary coding; hybrid coding; entropy coding.
  • this embodiment further includes: including a first flag bit in one of the following parts of the video data compressed code stream: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, a CTU header, a CU header, and a coding block header, wherein the first flag bit is used to indicate that encoding using a plurality of sampling formats and/or corresponding encoding modes is allowed.
  • this embodiment further includes: including a second flag bit in at least one of the following parts of the video data compressed code stream: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a coding block header, wherein the second flag bit is used to indicate that coding blocks using the 4:4:4 sampling format and/or the corresponding string prediction encoding mode are allowed.
  • this embodiment further includes: including a third flag bit in at least one of the following parts of the video data compressed code stream: a sequence parameter set, an image parameter set, a sequence header, a slice header, an image header, and a coding block header, wherein the third flag bit is used to indicate that coding blocks using the 4:2:2 sampling format and/or the corresponding string prediction encoding mode are allowed.
  • each of the predetermined plurality of sampling formats and encoding modes corresponds to a predetermined value k; a sampling format and corresponding encoding mode identification code, in direct, indirect, or mixed direct-and-indirect form, is set for the coding block, and the identification code is included in the video data compressed code stream.
  • the direct sampling format and corresponding encoding mode identification code consists of one or more bit strings in the video data compressed code stream; the indirect sampling format and corresponding encoding mode identification code is derived from encoding parameters other than the selected encoding mode parameter and/or from other syntax elements of the video data compressed code stream; the mixed direct-and-indirect sampling format and corresponding encoding mode identification code is partly direct and partly indirect.
  • the sampling format and the identifier of the corresponding encoding manner are present in the video data compressed code stream in the following manner: the encoding block header information syntax element, the sampling format and the corresponding encoding mode identifier syntax element, the additional encoding block header information syntax element, Encoding block data syntax element; or encoding block header information syntax element, partial sampling format and corresponding encoding mode identification code syntax element, additional encoding block header information syntax element, partial encoding block data syntax element, another partial sampling format, and corresponding encoding mode identification code The syntax element, another part of the coding block data syntax element; wherein, when the value of the identification code of the identification code syntax element is equal to the specified value, it indicates that the coding block is encoded by using a sampling format corresponding to the specified value and a corresponding coding manner.
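The first of the two syntax-element orderings above (block header information, identification code, additional header information, block data) can be sketched with a small bit writer. The field widths of 4, 1, 3, and 8 bits and the names `BitWriter` and `write_coding_block` are assumptions made for this illustration; the application does not fix them in this passage.

```python
class BitWriter:
    """Collects bits MSB-first and packs them into bytes."""

    def __init__(self):
        self.bits = []

    def write(self, value, n):
        for shift in range(n - 1, -1, -1):
            self.bits.append((value >> shift) & 1)

    def to_bytes(self):
        # Pad with zero bits up to the next byte boundary.
        padded = self.bits + [0] * (-len(self.bits) % 8)
        out = bytearray()
        for i in range(0, len(padded), 8):
            byte = 0
            for b in padded[i:i + 8]:
                byte = (byte << 1) | b
            out.append(byte)
        return bytes(out)


def write_coding_block(writer, header, format_id, extra_header, samples):
    writer.write(header, 4)        # coding block header information
    writer.write(format_id, 1)     # sampling format / encoding mode identification code
    writer.write(extra_header, 3)  # additional coding block header information
    for sample in samples:         # coding block data
        writer.write(sample, 8)
```

The second ordering would interleave a further identification-code part and a further data part after the first data part.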
  • a decoding device for the video data compression code stream and an encoding device for the video data are provided.
  • the devices are used to implement the foregoing embodiments and preferred embodiments; what has already been described is not repeated here.
  • as used below, the term "module" refers to a combination of software and/or hardware that implements a predetermined function.
  • although the apparatus described in the following embodiments is preferably implemented in software, implementation in hardware, or in a combination of software and hardware, is also possible and contemplated.
  • FIG. 3 is a structural block diagram of a decoding apparatus for a video data compressed code stream according to an embodiment of the present invention. As shown in FIG. 3, the apparatus includes:
  • the parsing module 30 is configured to parse the video data compressed code stream, and obtain sampling format information and/or decoding mode information;
  • the selecting module 32 is configured to select, according to the sampling format information and/or the decoding mode information, among the predetermined plurality of sampling formats and decoding modes, the first sampling format and the decoding mode corresponding to the first sampling format;
  • the decoding module 34 is configured to decode the decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.
  • FIG. 4 is a structural block diagram of an encoding apparatus for video data according to an embodiment of the present invention. As shown in FIG. 4, the apparatus includes:
  • the selecting module 40 is configured to select a first sampling format from a predetermined plurality of sampling formats, and select an encoding manner corresponding to the first sampling format from among a plurality of predetermined encoding modes;
  • the encoding module 42 is configured to encode the coding block of the video data using the selected first sampling format and the selected encoding mode to generate a video data compressed code stream, where the video data compressed code stream includes the first sampling format and/or the encoding mode, and syntax elements corresponding to the first sampling format and/or the encoding mode.
  • each of the above modules may be implemented by software or hardware. For the latter, this may be achieved in, but is not limited to, the following ways: the above modules are all located in the same processor; or the above modules are located in different processors in any combination.
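The three modules of the decoding apparatus in FIG. 3 can be pictured as three small classes wired together. This is a structural sketch only: the stream is assumed to be already demultiplexed into a dictionary, and the class names, the table, and the toy decoders are hypothetical.

```python
class ParsingModule:
    """Module 30: extracts sampling format / decoding mode information."""

    def parse(self, stream):
        # Assumption: the stream is already demultiplexed into a dict.
        return stream["format_id"], stream["payload"]


class SelectingModule:
    """Module 32: maps the parsed information to a format and decoding mode."""

    def __init__(self, table):
        self.table = table

    def select(self, format_id):
        return self.table[format_id]


class DecodingModule:
    """Module 34: decodes the block with the selected format and mode."""

    def __init__(self, decoders):
        self.decoders = decoders

    def decode(self, sampling_format, mode, payload):
        samples = self.decoders[mode](payload)
        return {"sampling_format": sampling_format, "samples": samples}


def run_decoder(stream, parser, selector, decoder):
    format_id, payload = parser.parse(stream)
    sampling_format, mode = selector.select(format_id)
    return decoder.decode(sampling_format, mode, payload)
```

Because the modules only communicate through return values, they could equally run in one processor or be split across several, as the text allows.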
  • This embodiment is an optional embodiment of the present invention, and supplements and elaborates on the solutions of the present application:
  • the present embodiment provides a data compression method using multiple (ie, two or more) sampling formats and corresponding encoding modes.
  • the data set and the codec block each have K (K>1) versions with K different sampling formats and, correspondingly, K sets of codec modes; when a codec block is encoded or decoded, an appropriate one of the K versions is selected and encoded or decoded using the corresponding codec mode.
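One way to picture choosing among the K versions is to encode each version with its own mode and keep the cheapest result. The text does not state the selection criterion at this point, so the cost function, the run-length toy encoder, and the names below are assumptions made purely for illustration.

```python
def run_length(samples):
    """Toy encoder: collapse repeated samples into (value, run) pairs."""
    runs = []
    for s in samples:
        if runs and runs[-1][0] == s:
            runs[-1] = (s, runs[-1][1] + 1)
        else:
            runs.append((s, 1))
    return runs


def choose_version(versions, encoders, cost):
    """Encode every version with its own mode and keep the cheapest one.

    versions: sampling format -> block samples in that format
    encoders: sampling format -> encoding function for that format
    cost:     hypothetical cost function of (samples, coded)
    """
    best = None
    for sampling_format, samples in versions.items():
        coded = encoders[sampling_format](samples)
        c = cost(samples, coded)
        if best is None or c < best[0]:
            best = (c, sampling_format, coded)
    return best[1], best[2]
```

With a flat block, a run-based mode on the full-resolution version can cost less than a plain copy of the subsampled version, which is the kind of trade-off the per-block choice is meant to exploit.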
  • the first technical feature of this embodiment is to encode and decode one codec block by using one of a plurality of sampling formats (i.e., a plurality of data versions having different sampling formats) and a corresponding codec mode.
  • one codec block is coded and decoded using one of two sampling formats and a corresponding codec mode.
  • the data set and its elements consist of 3 components.
  • the data set is an image having a rectangular shape.
  • the data set is a sequence of images having a rectangular shape.
  • the data set is an image consisting of 3 components.
  • the data set is a sequence of images consisting of 3 components.
  • the data set is a video consisting of 3 components.
  • the data set is an image composed of an R component, a G component, and a B component.
  • the data set is a video composed of an R component, a G component, and a B component.
  • the data set is an image composed of a Y luminance component, a U chrominance component, and a V chrominance component.
  • the data set is a video composed of a Y luminance component, a U chrominance component, and a V chrominance component.
  • the two sampling formats are a 4:4:4 sampling format and a 4:2:0 sampling format.
  • the two sampling formats are a 4:4:4 sampling format and a 4:2:2 sampling format.
  • the two sampling formats are a 4:2:0 sampling format and a 4:2:2 sampling format.
  • one of the plurality of sampling formats is a main sampling format, and the other sampling formats are sampling formats obtained from the main sampling format by a downsampling operation.
  • the data version of one sampling format generated in the codec is converted into a data version of another sampling format by a sampling format conversion operation.
  • the sample format conversion operation includes a resampling operation and/or an upsampling operation and/or a downsampling operation.
  • the codec mode corresponding to one sampling format includes a block prediction operation, and/or a transform operation; and the codec mode corresponding to another sampling format includes a string prediction operation.
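String prediction is not defined in detail in this passage. A common reading, assumed here, is that a run of samples is reproduced by copying from already reconstructed samples at a given offset, much like LZ77-style matching. The operation format (`"literal"` and `"copy"` tuples) and the function name are hypothetical.

```python
def decode_strings(ops):
    """Rebuild a sample sequence from literal and copy operations.

    ("literal", value)       appends one sample.
    ("copy", offset, length) copies `length` samples starting `offset`
                             positions back; the source may overlap the
                             samples being written.
    """
    out = []
    for op in ops:
        if op[0] == "literal":
            out.append(op[1])
        else:
            _, offset, length = op
            start = len(out) - offset
            for i in range(length):
                out.append(out[start + i])
    return out
```

Such copying suits content with exactly repeating patterns, which is consistent with the text pairing string prediction with the higher-resolution sampling format.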
  • the data set is an image having a rectangular shape; the codec mode corresponding to one sampling format includes an operation of intra prediction from neighboring pixels of the current codec block, and/or a transform operation; the codec mode corresponding to another sampling format may include a string prediction operation.
  • the data set is a sequence of images having a rectangular shape; the codec mode corresponding to one sampling format includes an operation of intra prediction from neighboring pixels of the current codec block, and/or an operation of inter prediction from an adjacent image of the current codec image, and/or a transform operation; the codec mode corresponding to another sampling format may include a string prediction operation.
  • the data set is a sequence of images having a rectangular shape; the codec mode corresponding to the 4:2:0 sampling format includes an operation of intra prediction from neighboring pixels of the current codec block, and/or an operation of inter prediction from an adjacent image of the current codec image, and/or a transform operation; the codec mode corresponding to the 4:4:4 sampling format may include a string prediction operation.
  • the data set is a sequence of images having a rectangular shape; the codec mode corresponding to the 4:2:0 sampling format includes an operation of intra prediction from neighboring pixels of the current codec block, and/or an operation of inter prediction from an adjacent image of the current codec image, and/or a transform operation, and the generated data version in the 4:2:0 sampling format is converted into a data version in the 4:4:4 sampling format by an upsampling operation; the codec mode corresponding to the 4:4:4 sampling format may include a string prediction operation, and the generated data version in the 4:4:4 sampling format is converted into a data version in the 4:2:0 sampling format by a downsampling operation.
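The upsampling conversion from 4:2:0 to 4:4:4 can be illustrated on a single chroma plane. The sketch below replicates each chroma sample into a 2x2 block (nearest-neighbour upsampling); the filter actually intended by the application is not given here, so sample replication is an assumption, as is the function name.

```python
def upsample_chroma_420_to_444(chroma):
    """Expand an M x N chroma plane to 2M x 2N by sample replication."""
    out = []
    for row in chroma:
        wide = [s for s in row for _ in range(2)]  # repeat horizontally
        out.append(wide)
        out.append(list(wide))                     # repeat vertically
    return out
```

The luminance plane is untouched by this conversion, since 4:2:0 and 4:4:4 differ only in chroma resolution.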
  • the most basic feature of the encoding method or apparatus of this embodiment is that, according to the characteristics of the current coding block, the current coding block is adaptively encoded using one of a predetermined plurality of sampling formats and the corresponding encoding mode, generating a compressed data code stream that contains at least the sampling format and/or the identification code corresponding to the sampling format, together with the other information required for decoding.
  • FIG. 5 is a schematic diagram of an encoding method according to an embodiment of the present invention.
  • one coding block is encoded using one of two sampling formats and a corresponding coding method.
  • the data set and its elements consist of 3 components.
  • the data set is an image having a rectangular shape.
  • the data set is a sequence of images having a rectangular shape.
  • the data set is an image consisting of 3 components.
  • the data set is a sequence of images consisting of 3 components.
  • the data set is a video consisting of 3 components.
  • the data set is an image composed of an R component, a G component, and a B component.
  • the data set is a video composed of an R component, a G component, and a B component.
  • the data set is an image composed of a Y luminance component, a U chrominance component, and a V chrominance component.
  • the data set is a video composed of a Y luminance component, a U chrominance component, and a V chrominance component.
  • the two sampling formats are a 4:4:4 sampling format and a 4:2:0 sampling format.
  • the two sampling formats are the 4:4:4 sampling format and the 4:2:2 sampling format.
  • the two sampling formats are a 4:2:0 sampling format and a 4:2:2 sampling format.
  • one of the plurality of sampling formats is a main sampling format, and the other sampling formats are sampling formats obtained from the main sampling format by a downsampling operation.
  • the data version of one of the sampling formats generated in the encoding is converted to a data version of the other sampling format by a sampling format conversion operation.
  • the sample format conversion operation includes a resampling operation and/or an upsampling operation and/or a downsampling operation.
  • the encoding mode corresponding to one sampling format includes a block prediction operation and/or a transform operation; and the encoding mode corresponding to another sampling format includes a string prediction operation.
  • the data set is an image having a rectangular shape; the encoding mode corresponding to one sampling format includes an operation of intra prediction from neighboring pixels of the current coding block, and/or a transform operation; the encoding mode corresponding to another sampling format includes a string prediction operation.
  • the data set is a sequence of images having a rectangular shape, and the encoding manner corresponding to one sampling format includes an operation of intra prediction from neighboring pixels of the current encoded block, and/or inter-frame from adjacent images of the currently encoded image Predicted operations, and/or transform operations; encoding methods corresponding to another sampling format include string prediction operations.
  • the data set is a sequence of images having a rectangular shape; the encoding mode corresponding to the 4:2:0 sampling format includes an operation of intra prediction from neighboring pixels of the current coding block, and/or an operation of inter prediction from an adjacent image of the current coded image, and/or a transform operation; the encoding mode corresponding to the 4:4:4 sampling format includes a string prediction operation.
  • the data set is a sequence of images having a rectangular shape; the encoding mode corresponding to the 4:2:0 sampling format includes an operation of intra prediction from neighboring pixels of the current coding block, and/or an operation of inter prediction from an adjacent image of the current coded image, and/or a transform operation, and the resulting data version in the 4:2:0 sampling format is converted into a data version in the 4:4:4 sampling format by an upsampling operation; the encoding mode corresponding to the 4:4:4 sampling format includes a string prediction operation, and the resulting data version in the 4:4:4 sampling format is converted into a data version in the 4:2:0 sampling format by a downsampling operation.
  • the most basic feature of the decoding method or apparatus of this embodiment is to parse the compressed data code stream, obtain information on the sampling format and/or the corresponding encoding mode, and, according to that information, decode the current decoding block using one of a predetermined plurality of sampling formats and the corresponding decoding mode.
  • FIG. 6 is a schematic diagram of a decoding method according to an embodiment of the present invention.
  • one decoding block is decoded using one of two sampling formats and the corresponding decoding mode.
  • the data set and its elements consist of 3 components.
  • the data set is an image having a rectangular shape.
  • the data set is a sequence of images having a rectangular shape.
  • the data set is an image consisting of 3 components.
  • the data set is a sequence of images consisting of 3 components.
  • the data set is a video consisting of 3 components.
  • the data set is an image composed of an R component, a G component, and a B component.
  • the data set is a video composed of an R component, a G component, and a B component.
  • the data set is an image composed of a Y luminance component, a U chrominance component, and a V chrominance component.
  • the data set is a video composed of a Y luminance component, a U chrominance component, and a V chrominance component.
  • the two sampling formats are a 4:4:4 sampling format and a 4:2:0 sampling format.
  • the two sampling formats are a 4:4:4 sampling format and a 4:2:2 sampling format.
  • the two sampling formats are a 4:2:0 sampling format and a 4:2:2 sampling format.
  • one of the plurality of sampling formats is a main sampling format, and the other sampling formats are sampling formats obtained from the main sampling format by a downsampling operation.
  • the data version of one of the sampling formats generated in the decoding is converted to the data version of the other sampling format by the sampling format conversion operation.
  • the sample format conversion operation includes a resampling operation and/or an upsampling operation and/or a downsampling operation.
  • the decoding mode corresponding to one sampling format includes a block prediction operation and/or a transform operation; and the decoding mode corresponding to another sampling format includes a string prediction operation.
  • the data set is an image having a rectangular shape; the decoding mode corresponding to one sampling format includes an operation of intra prediction from neighboring pixels of the current decoding block, and/or a transform operation; the decoding mode corresponding to another sampling format includes a string prediction operation.
  • the data set is a sequence of images having a rectangular shape; the decoding mode corresponding to one sampling format includes an operation of intra prediction from neighboring pixels of the current decoding block, and/or an operation of inter prediction from an adjacent image of the current decoded image, and/or a transform operation; the decoding mode corresponding to another sampling format includes a string prediction operation.
  • the data set is a sequence of images having a rectangular shape; the decoding mode corresponding to the 4:2:0 sampling format includes an operation of intra prediction from neighboring pixels of the current decoding block, and/or an operation of inter prediction from an adjacent image of the current decoded image, and/or a transform operation; the decoding mode corresponding to the 4:4:4 sampling format includes a string prediction operation.
  • the data set is a sequence of images having a rectangular shape; the decoding mode corresponding to the 4:2:0 sampling format includes an operation of intra prediction from neighboring pixels of the current decoding block, and/or an operation of inter prediction from an adjacent image of the current decoded image, and/or a transform operation, and the generated data version in the 4:2:0 sampling format is converted into a data version in the 4:4:4 sampling format by an upsampling operation; the decoding mode corresponding to the 4:4:4 sampling format includes a string prediction operation, and the generated data version in the 4:4:4 sampling format is converted into a data version in the 4:2:0 sampling format by a downsampling operation.
  • an encoding method or apparatus for compressing data comprising at least steps or modules for performing the following functions and operations:
  • the embodiment further provides a decoding method or apparatus for compressed data, comprising at least steps or modules for performing the following functions and operations: parsing a compressed data code stream to obtain information on the sampling format and/or the corresponding encoding mode, and, according to that information, decoding one decoding block using one of a predetermined plurality of sampling formats and the corresponding decoding mode.
  • This embodiment is applicable to encoding and decoding of lossy compression of data, and the embodiment is also applicable to encoding and decoding of data for lossless compression.
  • This embodiment is applicable to encoding and decoding of one-dimensional data such as character string data or byte string data, and the present embodiment is equally applicable to encoding and decoding of two-dimensional or above data such as image or video data.
  • the data includes one or a combination of the following types of data: one-dimensional data; two-dimensional data; multi-dimensional data; images; sequences of images; video; audio; files; bytes; bits;
  • the coding block or the decoding block is one coding region or one decoding region of the image, including the following cases: a sub-image of the image, a macroblock, a largest coding unit LCU, a coding tree unit CTU, a coding unit CU, a sub-region of a CU, a prediction unit PU, and a transform unit TU.
  • sampling format is one of the following sampling formats:
  • the codec mode includes one or a combination of the following operations:
  • the plurality of sampling formats are one of the following situations:
  • the data is one of the following types of data.
  • An image sequence consisting of an R component, a G component, and a B component
  • a video consisting of an R component, a G component, and a B component
  • An image sequence consisting of a Y luminance component, a U chrominance component, and a V chrominance component;
  • a video consisting of a Y luminance component, a U chrominance component, and a V chrominance component
  • Variants of the above various data include variant data that undergoes one of the following operations or a combination thereof: predicted prediction residual, transformed transform domain data, differentially processed differential data, quantized quantized data, The inverse quantized data, the inverse transformed data, the deblocking filtered data, the sample offset compensated data, and the adaptively modified filtered data.
  • the data is an image composed of three components
  • the plurality of sampling formats are two sampling formats
  • the two sampling formats are one of the following situations:
  • one of the plurality of sampling formats is a main sampling format, and the other sampling formats are sampling formats obtained from the main sampling format by a downsampling operation.
  • a data version of a sampling format generated in a codec is converted into a data version of another sampling format by a sampling format conversion operation.
  • the sampling format conversion operation includes a resampling operation and/or an upsampling operation and/or a downsampling operation.
  • a codec mode corresponding to one sampling format includes a block prediction operation, and/or a transform operation; and a codec mode corresponding to another sampling format includes a string prediction operation.
  • the data is an image having a rectangular shape; the codec mode corresponding to one sampling format includes an intra prediction operation from neighboring pixels of the current codec block, and/or a transform operation; the codec mode corresponding to another sampling format includes a string prediction operation.
  • the data is a sequence of images having a rectangular shape; the codec mode corresponding to one sampling format includes an intra prediction operation from neighboring pixels of the current codec block, and/or an inter prediction operation from an adjacent image of the current codec image, and/or a transform operation; the codec mode corresponding to another sampling format includes a string prediction operation.
  • the data is a sequence of images having a rectangular shape and three components; the plurality of sampling formats are two sampling formats, namely the 4:4:4 sampling format and the 4:2:0 sampling format; the codec mode corresponding to the 4:2:0 sampling format includes an intra prediction operation from neighboring pixels of the current codec block, and/or an inter prediction operation from an adjacent image of the current codec image, and/or a transform operation; the codec mode corresponding to the 4:4:4 sampling format includes a string prediction operation.
  • the data is a sequence of images or images having a rectangular shape and three components, the plurality of sampling formats being two sampling formats, the two sampling formats being 4:4:4 sampling format and 4:2:0 sampling format, the codec mode corresponding to the 4:2:0 sampling format includes intra prediction operation from neighboring pixels of the current codec block, and/or Performing an inter prediction operation from a neighboring image of the current codec image, and/or a transform operation, and generating a data version of the 4:2:0 sampling format is converted into a data version of the 4:4:4 sampling format by an upsampling operation;
  • the codec mode corresponding to the 4:4:4 sampling format includes a string prediction operation, and the generated data version of the 4:4:4 sampling format is converted into a data version of the 4:2:0 sampling format by a downsampling operation.
  • the codec mode corresponding to the 4:4:4 sampling format includes a string prediction operation, and the generated data version of the 4:4:4 sampling format
  • i = 0 to M-1
  • j = 0 to N-1
  • R is equal to 0 (truncation) or 2 (rounding).
  • sequence parameter set: usually a syntax element of the sequence parameter set that is directly present or implicitly derived
  • picture parameter set: usually a syntax element of the picture parameter set that is directly present or implicitly derived
  • sequence header: usually a syntax element of the sequence header that is directly present or implicitly derived
  • slice header: usually a syntax element of the slice header that is directly present or implicitly derived
  • picture header: usually a syntax element of the picture header that is directly present or implicitly derived
  • CTU header: usually a syntax element of the CTU header that is directly present or implicitly derived
  • CU header: usually a syntax element of the CU header that is directly present or implicitly derived
  • codec block header: usually a syntax element of the codec block header that is directly present or implicitly derived.
  • sequence parameter set: usually a syntax element of the sequence parameter set that is directly present or implicitly derived
  • picture parameter set: usually a syntax element of the picture parameter set that is directly present or implicitly derived
  • sequence header: usually a syntax element of the sequence header that is directly present or implicitly derived
  • slice header: usually a syntax element of the slice header that is directly present or implicitly derived
  • picture header: usually a syntax element of the picture header that is directly present or implicitly derived.
  • the predetermined sampling formats and corresponding codec modes are each represented by one of several predetermined values; one sampling format and its corresponding codec mode correspond to one predetermined value k; each codec block has, in the compressed video data bitstream, a direct, indirect, or mixed direct-indirect sampling-format-and-codec-mode identification code; if that identification code equals k, the codec block is encoded or decoded using the sampling format and corresponding codec mode that correspond to k.
  • the direct sampling-format-and-codec-mode identification code consists of one or more bit strings (binary symbol strings) in the compressed video data bitstream.
  • the indirect sampling-format-and-codec-mode identification code is derived from other codec parameters and/or other syntax elements of the compressed video data bitstream.
  • the mixed direct-indirect sampling-format-and-codec-mode identification code is partly direct (that is, consisting of one or more bit strings in the compressed video data bitstream) and partly indirect (that is, derived from other codec parameters and/or other syntax elements of the compressed video data bitstream).
  • in the encoding method or apparatus or the decoding method or apparatus, the sampling-format-and-codec-mode identification code syntax element, which indicates the sampling format and corresponding codec mode of the codec block, is present in the compressed video data bitstream of the codec block in one of the following forms:
  • codec block header information syntax elements, sampling-format-and-codec-mode identification code syntax element, more codec block header information syntax elements, codec block data syntax elements;
  • codec block header information syntax elements, part of the sampling-format-and-codec-mode identification code syntax element, more codec block header information syntax elements, part of the codec block data syntax elements, the other part of the identification code syntax element, the other part of the codec block data syntax elements;
  • when the identification code takes a value, the codec block is encoded or decoded using the sampling format and corresponding codec mode that correspond to that value.
  • a codec mode corresponding to one sampling format includes a prediction operation, and/or a prediction compensation operation, and/or a deblocking filtering operation, and/or a sample offset compensation operation, and/or an adaptive correction filtering operation;
  • a codec mode corresponding to another sampling format includes a transform operation, and/or a quantization operation, and/or an inverse quantization operation (scaling operation), and/or an inverse transform operation.
  • a codec mode corresponding to one sampling format includes a block prediction operation, and/or a string prediction operation, and/or a prediction compensation operation; a codec mode corresponding to another sampling format includes a transform operation, and/or a quantization operation, and/or an inverse quantization operation, and/or an inverse transform operation.
  • Embodiments of the present invention also provide a storage medium.
  • the foregoing storage medium may be configured to store program code for performing the following steps:
  • according to the sampling format information and/or the decoding mode information, selecting a first sampling format and a decoding mode corresponding to the first sampling format from among a plurality of predetermined sampling formats and decoding modes;
  • the foregoing storage medium may include, but is not limited to, a USB flash drive, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, an optical disc, and other media capable of storing program code.
  • the processor, according to the program code stored in the storage medium, parses the compressed video data bitstream and obtains sampling format information and/or decoding mode information;
  • the processor, according to the program code stored in the storage medium, selects, according to the sampling format information and/or the decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a plurality of predetermined sampling formats and decoding modes;
  • the processor, according to the program code stored in the storage medium, decodes the decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.
  • the modules or steps of the present invention described above can be implemented by a general-purpose computing device; they can be concentrated on a single computing device or distributed across a network of multiple computing devices. Optionally, they may be implemented by program code executable by a computing device, so that they may be stored in a storage device and executed by the computing device; in some cases the steps shown or described may be performed in an order different from that given here, or they may be fabricated separately as individual integrated circuit modules, or several of the modules or steps may be fabricated as a single integrated circuit module.
  • the invention is not limited to any specific combination of hardware and software.
  • the foregoing technical solution provided by the embodiments of the present invention selects a sampling format and a corresponding decoding mode from among a plurality of predetermined sampling formats and decoding modes, which solves the technical problem in the related art that decoding with a single sampling format and decoding mode is too inefficient, and improves the decoding rate.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

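The present invention provides a method and apparatus for decoding a compressed video data bitstream and for encoding video data. The decoding method comprises: parsing the compressed video data bitstream to obtain sampling format information and/or decoding mode information; selecting, according to the sampling format information and/or decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a plurality of predetermined sampling formats and decoding modes; and decoding a decoding block using the first sampling format and the decoding mode corresponding to the first sampling format. The invention solves the technical problem in the related art that decoding with a single sampling format and decoding mode is too inefficient.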

Description

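Method and apparatus for decoding a compressed video data bitstream and for encoding video data
Technical Field
The present invention relates to the field of data processing, and in particular to a method and apparatus for decoding a compressed video data bitstream and for encoding video data.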
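Background
As human society enters the era of big data, cloud computing, mobile computing, cloud-mobile computing, ultra-high-definition (4K) and super-ultra-high-definition (8K) video resolution, 4G/5G communication, and virtual reality, data compression with an ultra-high compression ratio and very high quality for all kinds of data, including big data, image data, and video data, has become an indispensable technology.
A data set is a collection of data elements (for example, bytes, bits, pixels). When a data set (for example, a file, a frame of image, a video sequence) arranged in a certain shape and having a certain number of elements (that is, a certain sampling format) is encoded (and correspondingly decoded) for data compression, the data set is usually divided into a number of subsets, which are blocks of a predetermined shape and size (that is, number of elements), called coding blocks (from the decoding point of view, decoding blocks; collectively, codec blocks), and encoding or decoding proceeds block by block with the codec block as the unit. At any moment, the coding block being encoded is called the current coding block, and the decoding block being decoded is called the current decoding block. The current coding block or current decoding block is collectively called the current codec block, or simply the current block. A data element (element for short) being encoded or decoded is called the current encoding data element or current decoding data element, collectively the current data element, or simply the current element. An element consists of N components (usually 1 ≤ N ≤ 5), so data sets and codec blocks also consist of N components. For example, the elements of a frame of image, namely pixels, are arranged in a rectangle with a size (resolution) of 1920 (width) × 1080 (height) and consist of 3 components: either a G (green) component, a B (blue) component, and an R (red) component, or a Y (luma) component, a U (Cb chroma) component, and a V (Cr chroma) component.
In the related art, the data sets and codec blocks to be encoded have only one fixed sampling format and size. For example, computer-generated images containing graphics and text usually use the sampling format called 4:4:4, in which all 3 components of the data set have the same sampling rate and size (that is, number of component samples). Natural images and video captured by a camera usually use the sampling format called 4:2:0, in which, for a data set (such as an image or video) with a rectangular shape and 3 components, the sampling rate and size of 2 components (the D component and the E component) are each one quarter of those of the other component (the F component). In this case, one D-component sample D[i][j] and one E-component sample E[i][j] correspond to four (2×2) F-component samples F[2i][2j], F[2i+1][2j], F[2i][2j+1], F[2i+1][2j+1]. If the resolution of the F component is 2M×2N, that is, the F component of the data set is F = {F[i][j]: i = 0 to 2M-1, j = 0 to 2N-1}, then the D and E components each have a resolution of M×N, that is, D = {D[i][j]: i = 0 to M-1, j = 0 to N-1} and E = {E[i][j]: i = 0 to M-1, j = 0 to N-1}. There is also a sampling format called 4:2:2, in which, for a data set (such as an image or video) with a rectangular shape and 3 components, the sampling rate and size of 2 components (the D component and the E component) are each one half of those of the other component (the F component). In this case, in the horizontal direction of the data set, one D-component sample D[i][j] and one E-component sample E[i][j] correspond to two (2×1) F-component samples F[2i][j] and F[2i+1][j]. If the resolution of the F component is 2M×N, that is, F = {F[i][j]: i = 0 to 2M-1, j = 0 to N-1}, then the D and E components each have a resolution of M×N, that is, D = {D[i][j]: i = 0 to M-1, j = 0 to N-1} and E = {E[i][j]: i = 0 to M-1, j = 0 to N-1}. In images and video using the YUV color format, the F, D, and E components above are the Y, U, and V components, respectively. In images and video using the RGB color format, they are the G, B, and R components, respectively. In the prior art, even where the data sets and codec blocks to be encoded come in several sampling formats and sizes, these objects are encoded either with one and the same encoding mode, or with different encoding modes that each always encode an object of a single sampling format and size. Here, different sampling formats means that at least one component of the data set and/or codec block has a different sampling rate and size (that is, number of component samples). Different sampling rates and sizes are converted into one another by upsampling or downsampling operations. An upsampling operation increases the number of samples; a downsampling operation reduces it. Different sampling formats therefore means that at least one component of the data set and/or codec block has a different number of samples.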
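The component sizes described above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the names `SUBSAMPLING`, `component_sizes`, and `sample_count` do not come from the patent, and sizes are given as (width, height) with the first factor applying to the horizontal direction, as in the 4:2:2 description.

```python
# Subsampling factors (horizontal, vertical) of the D and E components
# relative to the F component. Names here are illustrative assumptions.
SUBSAMPLING = {
    "4:4:4": (1, 1),  # D and E have the same size as F
    "4:2:2": (2, 1),  # D and E are halved horizontally
    "4:2:0": (2, 2),  # D and E are halved in both directions
}


def component_sizes(fmt, f_width, f_height):
    """Return the (width, height) of the F, D and E components."""
    sx, sy = SUBSAMPLING[fmt]
    if f_width % sx or f_height % sy:
        raise ValueError("F size must be a multiple of the subsampling factor")
    d_size = (f_width // sx, f_height // sy)
    return (f_width, f_height), d_size, d_size


def sample_count(fmt, f_width, f_height):
    """Total number of component samples in one data set."""
    return sum(w * h for w, h in component_sizes(fmt, f_width, f_height))


if __name__ == "__main__":
    for fmt in SUBSAMPLING:
        print(fmt, component_sizes(fmt, 1920, 1080), sample_count(fmt, 1920, 1080))
```

For a 1920 × 1080 frame, the 4:2:0 version carries half as many samples as the 4:4:4 version, which is why converting between formats changes the amount of data a codec mode has to handle.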
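For data sets that mix content of several kinds, such as screen content images and video that combine computer-generated graphics and text with camera-captured natural images and video, and virtual reality images and video, a single sampling format and size and an essentially single encoding mode severely limit improvements in data compression efficiency.
No effective solution to the above problems in the related art has yet been found.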
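Summary of the Invention
Embodiments of the present invention provide a method and apparatus for decoding a compressed video data bitstream and for encoding video data, so as to at least solve the technical problem in the related art that decoding with a single sampling format and decoding mode is too inefficient.
According to an embodiment of the present invention, a method for decoding a compressed video data bitstream is provided, comprising: parsing the compressed video data bitstream to obtain sampling format information and/or decoding mode information; selecting, according to the sampling format information and/or decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a plurality of predetermined sampling formats and decoding modes; and decoding a decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.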
可选地,所述视频数据压缩码流包括以下至少之一信息的数据压缩码流:一维数据,二维数据,大于二维的多维数据,图像,图像的序列,视频,音频,文件,字节,比特,像素,由三个分量组成的数据,具有矩形形状的图像,具有矩形形状的图像的序列,由三个分量组成的图像,由三个分量组成的图像序列,由三个分量组成的视频,由R分量、G分量、B分量组成的图像,由R分量、G分量、B分量组成的图像序列,由R分量、G分量、B分量组成的视频,由一个亮度分量和两个色度分量组成的图像,由一个亮度分量两个色度分量组成的图像序列,由一个亮度分量两个色度分量组成的视频,数据的编码块。
可选地,所述解码块是图像的解码区域,其中,所述解码区域包括以下至少之一:图像的子图像、宏块、最大编码单元LCU、编码树单元CTU、 编码单元CU、CU的子区域、预测单元PU、变换单元TU。
可选地,所述多种采样格式包括主采样格式和其他采样格式,其中,所述其他采样格式是所述主采样格式经过采样操作得到的采样格式。
可选地,所述视频数据压缩码流具有矩形形状和三个分量的图像或图像的序列的数据压缩码流。
可选地,所述多种采样格式是4:4:4采样格式和4:2:0采样格式;或者,所述多种采样格式是4:4:4采样格式和4:2:2采样格式;或者,所述多种采样格式是4:2:2采样格式和4:2:0采样格式。
可选地,与所述4:2:0采样格式相应的解码方式包括:产生4:2:0采样格式的数据版本,对所述4:2:0采样格式的数据版本经过上采样操作转换为4:4:4或4:2:2采样格式的数据版本,其中,产生所述4:2:0采样格式的数据版本方法包括:根据所述当解码块的邻近像素进行帧内预测的操作产生4:2:0采样格式的数据版本,和/或,根据解码图像的邻近图像进行帧间预测的操作产生4:2:0采样格式的数据版本;与所述4:4:4或4:2:2采样格式相应的解码方式包括:根据预测操作产生4:4:4或4:2:2采样格式的数据版本,对所述4:4:4或4:2:2采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本,其中,所述预测操作包括串预测操作。
可选地,所述解码方式包括以下至少之一:根据所述解码块的邻近像素进行帧内预测;根据所述解码图像的邻近图像进行帧间预测;根据所述解码图像的邻近图像进行帧间变换;缩放scaling;通用串预测;调色板解码;字典解码;熵解码。
可选地,还包括:解析所述视频数据压缩码流,从以下之一的位置获得第一标志位:序列参数集,图像参数集,序列头,条带头,图像头,CTU头,CU头,解码块头,其中,所述第一标志位用于指示允许采用多种采样格式和/或相应解码方式进行解码。
可选地,还包括:解析所述视频数据压缩码流,从以下至少之一的位置获取第二标志位:序列参数集,图像参数集,序列头,条带头,图像头、 解码块头,其中,所述第二标志位用于指示允许使用采用4:4:4采样格式和/或相应串预测解码方式的解码块。
可选地,还包括:解析所述视频数据压缩码流,从以下至少之一的位置获取第三标志位:序列参数集,图像参数集,序列头,条带头,图像头、解码块头,其中,所述第三标志位用于指示允许使用采用4:2:2采样格式和/或相应串预测解码方式的解码块。
可选地,所述预定的多种采样格式和解码方式中的一种采样格式和解码方式对应于一个预定的值k,从所述视频数据压缩码流中,为所述解码块获取直接或间接或直接间接混合的采样格式和相应解码方式标识码。
可选地,所述直接的采样格式和相应解码方式标识码由所述视频数据压缩码流中的一个或多个位串所组成;所述间接的采样格式和相应解码方式标识码是除所述解码方式参数之外的其他解码参数和/或所述视频数据压缩码流的除所述解码方式参数对应的语法元素之外的其他语法元素导出的采样格式和相应解码方式标识码;所述直接间接混合的采样格式和相应解码方式标识码是部分直接部分间接混合的采样格式和相应解码方式标识码。
可选地,从所述视频数据压缩码流的以下位置获取所述采样格式和对应所述解码方式的标识码:所述解码块头信息语法元素、采样格式和相应解码方式标识码语法元素、额外的解码块头信息语法元素、解码块数据语法元素;或所述解码块头信息语法元素、部分采样格式和相应解码方式标识码语法元素、额外的解码块头信息语法元素、部分解码块数据语法元素、另一部分采样格式和相应解码方式标识码语法元素、另一部分解码块数据语法元素;其中,所述标识码语法元素的标识码的取值等于指定值时,表示采用与所述指定值对应的采样格式和相应解码方式对所述解码块进行解码。
根据本发明的另一个实施例,提供了一种视频数据的编码方法,包括:从预定的多种采样格式中选择第一采样格式,以及从预定的多种编码方式 之中选择与所述第一采样格式对应的编码方式;使用选择的第一采样格式和选择的编码方式对视频数据的编码块进行编码产生视频数据压缩码流,其中,所述视频数据压缩码流包含:第一采样格式和/或编码方式,与第一采样格式和/或编码方式对应的语法元素。
可选地,所述视频数据包括以下至少之一:一维数据,二维数据,大于二维的多维数据,图像,图像的序列,视频,音频,文件,字节,比特,像素,由三个分量组成的数据,具有矩形形状的图像,具有矩形形状的图像的序列,由三个分量组成的图像,由三个分量组成的图像序列,由三个分量组成的视频,由R分量、G分量、B分量组成的图像,由R分量、G分量、B分量组成的图像序列,由R分量、G分量、B分量组成的视频,由一个亮度分量和两个色度分量组成的图像,由一个亮度分量两个色度分量组成的图像序列,由一个亮度分量两个色度分量组成的视频,数据的编码块。
可选地,所述编码块是图像的编码区域,其中,所述编码区域包括以下至少之一:图像的子图像、宏块、最大编码单元LCU、编码树单元CTU、编码单元CU、CU的子区域、预测单元PU、变换单元TU。
可选地,所述多种采样格式包括主采样格式和其他采样格式,其中,所述其他采样格式是所述主采样格式经过采样操作得到的采样格式。
可选地,所述视频数据是具有矩形形状和三个分量的图像或图像的序列。
可选地,所述多种采样格式是4:4:4采样格式和4:2:0采样格式;或者,所述多种采样格式是4:4:4采样格式和4:2:2采样格式;或者,所述多种采样格式是4:2:2采样格式和4:2:0采样格式。
可选地,与所述4:2:0采样格式相应的编码方式包括:产生4:2:0采样格式的数据版本,对所述4:2:0采样格式的数据版本经过上采样操作转换为4:4:4或4:2:2采样格式的数据版本,其中,产生所述4:2:0采样格式的数据版本方法包括:根据所述当编码块的邻近像素进行帧内预测的操作产 生4:2:0采样格式的数据版本,和/或,根据所述当编码图像的邻近图像进行帧间预测的操作产生4:2:0采样格式的数据版本;与所述4:4:4或4:2:2采样格式相应的编码方式包括:根据预测操作产生4:4:4或4:2:2采样格式的数据版本,对所述4:4:4或4:2:2采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本,其中,所述预测操作包括串预测操作。
可选地,所述编码方式包括以下至少之一:根据所述编码块的邻近像素进行帧内预测;根据所述编码图像的邻近图像进行帧间预测;根据所述编码图像的邻近图像进行帧间变换;量化;通用串预测;调色板编码;字典编码;混合编码Hybrid coding;熵编码。
可选地,还包括:在所述视频数据压缩码流的以下之一部分包含第一标志位:序列参数集,图像参数集,序列头,条带头,图像头,CTU头,CU头,编码块头,其中,所述第一标志位用于指示允许采用多种采样格式和/或相应编码方式进行编码。
可选地,还包括:在所述视频数据压缩码流的以下至少之一部分包含第二标志位:序列参数集,图像参数集,序列头,条带头,图像头、编码块头,其中,所述第二标志位用于指示允许使用采用4:4:4采样格式和/或相应串预测编码方式的编码块。
可选地,还包括:在所述视频数据压缩码流的以下至少之一部分包含第三标志位:序列参数集,图像参数集,序列头,条带头,图像头、编码块头,其中,所述第三标志位用于指示允许使用采用4:2:2采样格式和/或相应串预测编码方式的编码块。
可选地,所述预定的多种采样格式和编码方式中的一种采样格式和编码方式对应于一个预定的值k,为所述编码块设置直接或间接或直接间接混合的采样格式和相应编码方式标识码,将所述编码方式标识码包含在所述视频数据压缩码流中。
可选地,所述直接的采样格式和相应编码方式标识码由所述视频数据压缩码流中的一个或多个位串所组成;所述间接的采样格式和相应编码方 式标识码是除所述选择的编码方式参数之外的其他编码参数和/或所述视频数据压缩码流的除所述语法元素之外的其他语法元素导出的采样格式和相应编码方式标识码;所述直接间接混合的采样格式和相应编码方式标识码是部分直接部分间接混合的采样格式和相应编码方式标识码。
可选地,所述采样格式和对应所述编码方式的标识码使用下列方式存在于所述视频数据压缩码流中:所述编码块头信息语法元素、采样格式和相应编码方式标识码语法元素、额外的编码块头信息语法元素、编码块数据语法元素;或所述编码块头信息语法元素、部分采样格式和相应编码方式标识码语法元素、额外的编码块头信息语法元素、部分编码块数据语法元素、另一部分采样格式和相应编码方式标识码语法元素、另一部分编码块数据语法元素;其中,所述标识码语法元素的标识码的取值等于指定值时,表示采用与所述指定值对应的采样格式和相应编码方式对所述编码块进行编码。
根据本发明的另一个实施例,提供了一种视频数据压缩码流的解码装置,包括:解析模块,设置为解析视频数据压缩码流,获取采样格式信息和/或解码方式信息;选择模块,设置为根据所述采样格式信息和/或解码方式信息,在预定的多种采样格式和解码方式之中,选择第一采样格式和与所述第一采样格式相应的解码方式;解码模块,设置为采用所述第一采样格式和所述第一采样格式相应的解码方式对解码块进行解码。
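According to another embodiment of the present invention, an apparatus for encoding video data is provided, comprising: a selection module configured to select a first sampling format from a plurality of predetermined sampling formats and to select, from a plurality of predetermined encoding modes, an encoding mode corresponding to the first sampling format; and an encoding module configured to encode a coding block of the video data using the selected first sampling format and the selected encoding mode to generate a compressed video data bitstream, wherein the compressed video data bitstream contains: the first sampling format and/or encoding mode, and syntax elements corresponding to the first sampling format and/or encoding mode.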
根据本发明的又一个实施例,还提供了一种存储介质。该存储介质设 置为存储用于执行以下步骤的程序代码:
解析视频数据压缩码流,获取采样格式信息和/或解码方式信息;
根据所述采样格式信息和/或解码方式信息,在预定的多种采样格式和解码方式之中,选择第一采样格式和与所述第一采样格式相应的解码方式;
采用所述第一采样格式和所述第一采样格式相应的解码方式对解码块进行解码。
根据本发明的又一个实施例,还提供了一种存储介质。该存储介质设置为存储用于执行以下步骤的程序代码:
从预定的多种采样格式中选择第一采样格式,以及从预定的多种编码方式之中选择与所述第一采样格式对应的编码方式;
使用选择的第一采样格式和选择的编码方式对视频数据的编码块进行编码产生视频数据压缩码流,其中,所述视频数据压缩码流包含:第一采样格式和/或编码方式,与第一采样格式和/或编码方式对应的语法元素。
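Through the embodiments of the present invention, a sampling format and a corresponding decoding mode are selected from among a plurality of predetermined sampling formats and decoding modes, which solves the technical problem in the related art that decoding with a single sampling format and decoding mode is too inefficient, and improves the decoding rate.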
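Brief Description of the Drawings
The drawings described here provide a further understanding of the present invention and form part of this application. The illustrative embodiments of the present invention and their description serve to explain the invention and do not unduly limit it. In the drawings:
FIG. 1 is a flowchart of a method for decoding a compressed video data bitstream according to an embodiment of the present invention;
FIG. 2 is a flowchart of a method for encoding video data according to an embodiment of the present invention;
FIG. 3 is a structural block diagram of an apparatus for decoding a compressed video data bitstream according to an embodiment of the present invention;
FIG. 4 is a structural block diagram of an apparatus for encoding video data according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of an encoding method according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of a decoding method according to an embodiment of the present invention.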
具体实施方式
下文中将参考附图并结合实施例来详细说明本发明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。
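Embodiment 1
This embodiment provides a method for decoding a compressed video data bitstream. FIG. 1 is a flowchart of the method according to an embodiment of the present invention. As shown in FIG. 1, the flow includes the following steps:
Step S102: parse the compressed video data bitstream to obtain sampling format information and/or decoding mode information;
Step S104: according to the sampling format information and/or decoding mode information, select a first sampling format and a decoding mode corresponding to the first sampling format from among a plurality of predetermined sampling formats and decoding modes;
Step S106: decode the decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.
Through the above steps, a sampling format and a corresponding decoding mode are selected from among a plurality of predetermined sampling formats and decoding modes, which solves the technical problem in the related art that decoding with a single sampling format and decoding mode is too inefficient, and improves the decoding rate.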
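Steps S102 to S106 can be sketched as a small dispatch routine in Python. This is a minimal sketch, not the patented implementation: the dictionary-based "stream", the two placeholder decoders, and all function and key names are assumptions introduced for illustration, and a real decoder would parse entropy-coded syntax elements rather than read dictionary fields.

```python
from typing import Callable, Dict, Tuple


# Placeholder decoders (hypothetical); they only tag the payload.
def decode_block_prediction(payload: bytes) -> str:
    return f"block-predicted:{payload.hex()}"


def decode_string_prediction(payload: bytes) -> str:
    return f"string-predicted:{payload.hex()}"


# Predetermined (sampling format, decoding mode) pairs; the pairing follows
# the 4:2:0 / 4:4:4 example in the text, the key names are assumptions.
DECODERS: Dict[Tuple[str, str], Callable[[bytes], str]] = {
    ("4:2:0", "block_prediction"): decode_block_prediction,
    ("4:4:4", "string_prediction"): decode_string_prediction,
}


def parse_stream(stream: dict) -> Tuple[str, str]:
    """S102: obtain sampling format and decoding mode information."""
    return stream["sampling_format"], stream["decoding_mode"]


def select_decoder(info: Tuple[str, str]) -> Callable[[bytes], str]:
    """S104: select one of the predetermined format/mode pairs."""
    if info not in DECODERS:
        raise ValueError(f"not a predetermined format/mode pair: {info}")
    return DECODERS[info]


def decode_block(stream: dict) -> str:
    """S106: decode the block with the selected format and mode."""
    decoder = select_decoder(parse_stream(stream))
    return decoder(stream["payload"])


if __name__ == "__main__":
    block = {"sampling_format": "4:4:4",
             "decoding_mode": "string_prediction",
             "payload": b"\x01\x02"}
    print(decode_block(block))
```

Keeping the format/mode pairs in one table mirrors the idea that the set of combinations is predetermined and that the bitstream only signals which one applies to a given block.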
可选地,上述步骤的执行主体可以为解码器,视频处理设备,如视频接收端,视频呈现设备等,但不限于此。
可选的,视频数据压缩码流包括以下至少之一信息的数据压缩码流:一维数据,二维数据,大于二维的多维数据,图像,图像的序列,视频,音频,文件,字节,比特,像素,由三个分量组成的数据,具有矩形形状 的图像,具有矩形形状的图像的序列,由三个分量组成的图像,由三个分量组成的图像序列,由三个分量组成的视频,由R分量、G分量、B分量组成的图像,由R分量、G分量、B分量组成的图像序列,由R分量、G分量、B分量组成的视频,由一个亮度分量和两个色度分量组成的图像,由一个亮度分量两个色度分量组成的图像序列,由一个亮度分量两个色度分量组成的视频,数据的编码块。
可选的,解码块是图像的解码区域,其中,解码区域包括以下至少之一:图像的子图像、宏块、最大编码单元(The Largest Coding Unit,简称为LCU)、编码树单元(Coding Tree Unit,简称为CTU)、编码单元(Coding Unit,简称为CU)、CU的子区域、预测单元PU(Prediction Unit,简称为PU)、变换单元(Transform Unit,简称为TU)。
可选的,多种采样格式包括主采样格式和其他采样格式,其中,其他采样格式是主采样格式经过采样操作得到的采样格式。
可选的,视频数据压缩码流具有矩形形状和三个分量的图像或图像的序列的数据压缩码流。
可选的,多种采样格式是4:4:4采样格式和4:2:0采样格式;或者,多种采样格式是4:4:4采样格式和4:2:2采样格式;或者,多种采样格式是4:2:2采样格式和4:2:0采样格式。对应的,与4:2:0采样格式相应的解码方式包括:产生4:2:0采样格式的数据版本,对4:2:0采样格式的数据版本经过上采样操作转换为4:4:4或4:2:2采样格式的数据版本,其中,产生4:2:0采样格式的数据版本方法包括:根据当解码块的邻近像素进行帧内预测的操作产生4:2:0采样格式的数据版本,和/或,根据当解码图像的邻近图像进行帧间预测的操作产生4:2:0采样格式的数据版本;与4:4:4或4:2:2采样格式相应的解码方式包括:根据预测操作产生4:4:4或4:2:2采样格式的数据版本,对4:4:4或4:2:2采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本,具体的,所述预测操作可以是串预测操作。
可选的,解码方式包括以下至少之一:根据解码块的邻近像素进行帧内预测;根据解码图像的邻近图像进行帧间预测;根据解码图像的邻近图像进行帧间变换;缩放scaling;通用串预测;调色板解码;字典解码;熵解码。
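Optionally, parsing the compressed video data bitstream further comprises obtaining a first flag from one of the following locations: sequence parameter set, picture parameter set, sequence header, slice header, picture header, CTU header, CU header, decoding block header, wherein the first flag indicates that decoding with multiple sampling formats and/or corresponding decoding modes is allowed.
Optionally, parsing the compressed video data bitstream further comprises obtaining a second flag from at least one of the following locations: sequence parameter set, picture parameter set, sequence header, slice header, picture header, decoding block header, wherein the second flag indicates that decoding blocks using the 4:4:4 sampling format and/or the corresponding string prediction decoding mode are allowed.
Optionally, parsing the compressed video data bitstream further comprises obtaining a third flag from at least one of the following locations: sequence parameter set, picture parameter set, sequence header, slice header, picture header, decoding block header, wherein the third flag indicates that decoding blocks using the 4:2:2 sampling format and/or the corresponding string prediction decoding mode are allowed.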
可选的,预定的多种采样格式和解码方式中的一种采样格式和解码方式对应于一个预定的值k,从视频数据压缩码流中,为解码块获取直接或间接或直接间接混合的采样格式和相应解码方式标识码。
在本实施例中,直接的采样格式和相应解码方式标识码由视频数据压缩码流中的一个或多个位串所组成;间接的采样格式和相应解码方式标识码是除解码方式参数之外的其他解码参数和/或视频数据压缩码流的除解码方式参数对应的语法元素之外的其他语法元素导出的采样格式和相应解码方式标识码;直接间接混合的采样格式和相应解码方式标识码是部分直接部分间接混合的采样格式和相应解码方式标识码。
可选的,从视频数据压缩码流的以下位置获取采样格式和对应解码方式的标识码:
解码块头信息语法元素、采样格式和相应解码方式标识码语法元素、额外的解码块头信息语法元素、解码块数据语法元素;或
解码块头信息语法元素、部分采样格式和相应解码方式标识码语法元素、额外的解码块头信息语法元素、部分解码块数据语法元素、另一部分采样格式和相应解码方式标识码语法元素、另一部分解码块数据语法元素;
其中,标识码语法元素的标识码的取值等于指定值时,表示采用与指定值对应的采样格式和相应解码方式对解码块进行解码。
在本实施例中提供了一种视频数据的编码方法,图2是根据本发明实施例的视频数据的编码方法的流程图,如图2所示,该流程包括如下步骤:
步骤S202,从预定的多种采样格式中选择第一采样格式,以及从预定的多种编码方式之中选择与第一采样格式对应的编码方式;
步骤S204,使用选择的第一采样格式和选择的编码方式对视频数据的编码块进行编码产生视频数据压缩码流,其中,视频数据压缩码流包含:第一采样格式和/或编码方式,与第一采样格式和/或编码方式对应的语法元素。
可选地,上述步骤的执行主体可以为编码器,视频处理设备,如视频发送端,视频分发设备等,但不限于此。
可选的,视频数据包括以下至少之一:一维数据,二维数据,大于二维的多维数据,图像,图像的序列,视频,音频,文件,字节,比特,像素,由三个分量组成的数据,具有矩形形状的图像,具有矩形形状的图像的序列,由三个分量组成的图像,由三个分量组成的图像序列,由三个分量组成的视频,由R分量、G分量、B分量组成的图像,由R分量、G分量、B分量组成的图像序列,由R分量、G分量、B分量组成的视频,由一个亮度分量和两个色度分量组成的图像,由一个亮度分量两个色度分量组成的图像序列,由一个亮度分量两个色度分量组成的视频,数据的编码块。
可选的,编码块是图像的编码区域,其中,编码区域包括以下至少之 一:图像的子图像、宏块、最大编码单元LCU、编码树单元CTU、编码单元CU、CU的子区域、预测单元PU、变换单元TU。
可选的,多种采样格式包括主采样格式和其他采样格式,其中,其他采样格式是主采样格式经过采样操作得到的采样格式。
Optionally, the video data is an image or a sequence of images having a rectangular shape and three components.
可选的,多种采样格式是4:4:4采样格式和4:2:0采样格式;或者,多种采样格式是4:4:4采样格式和4:2:2采样格式;或者,多种采样格式是4:2:2采样格式和4:2:0采样格式。对应的,与4:2:0采样格式相应的编码方式包括:产生4:2:0采样格式的数据版本,对4:2:0采样格式的数据版本经过上采样操作转换为4:4:4或4:2:2采样格式的数据版本,其中,产生4:2:0采样格式的数据版本方法包括:根据当编码块的邻近像素进行帧内预测的操作产生4:2:0采样格式的数据版本,和/或,根据当编码图像的邻近图像进行帧间预测的操作产生4:2:0采样格式的数据版本;与4:4:4或4:2:2采样格式相应的编码方式包括:根据预测操作产生4:4:4或4:2:2采样格式的数据版本,对4:4:4或4:2:2采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本,具体的,所述预测操作可以是串预测操作。
可选的,编码方式包括以下至少之一:根据编码块的邻近像素进行帧内预测;根据编码图像的邻近图像进行帧间预测;根据编码图像的邻近图像进行帧间变换;量化;通用串预测;调色板编码;字典编码;混合编码Hybrid coding;熵编码。
可选的,本实施例还包括:在视频数据压缩码流的以下之一部分包含第一标志位:序列参数集,图像参数集,序列头,条带头,图像头,CTU头,CU头,编码块头,其中,第一标志位用于指示允许采用多种采样格式和/或相应编码方式进行编码。
可选的,本实施例还包括:在视频数据压缩码流的以下至少之一部分包含第二标志位:序列参数集,图像参数集,序列头,条带头,图像头、 编码块头,其中,第二标志位用于指示允许使用采用4:4:4采样格式和/或相应串预测编码方式的编码块。
可选的,本实施例还包括:在视频数据压缩码流的以下至少之一部分包含第三标志位:序列参数集,图像参数集,序列头,条带头,图像头、编码块头,其中,第三标志位用于指示允许使用采用4:2:2采样格式和/或相应串预测编码方式的编码块。
可选的,预定的多种采样格式和编码方式中的一种采样格式和编码方式对应于一个预定的值k,为编码块设置直接或间接或直接间接混合的采样格式和相应编码方式标识码,将编码方式标识码包含在视频数据压缩码流中。
可选的,直接的采样格式和相应编码方式标识码由视频数据压缩码流中的一个或多个位串所组成;间接的采样格式和相应编码方式标识码是除选择的编码方式参数之外的其他编码参数和/或视频数据压缩码流的除语法元素之外的其他语法元素导出的采样格式和相应编码方式标识码;直接间接混合的采样格式和相应编码方式标识码是部分直接部分间接混合的采样格式和相应编码方式标识码。
可选的,采样格式和对应编码方式的标识码使用下列方式存在于视频数据压缩码流中:编码块头信息语法元素、采样格式和相应编码方式标识码语法元素、额外的编码块头信息语法元素、编码块数据语法元素;或编码块头信息语法元素、部分采样格式和相应编码方式标识码语法元素、额外的编码块头信息语法元素、部分编码块数据语法元素、另一部分采样格式和相应编码方式标识码语法元素、另一部分编码块数据语法元素;其中,标识码语法元素的标识码的取值等于指定值时,表示采用与指定值对应的采样格式和相应编码方式对编码块进行编码。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到根据上述实施例的方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理 解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。
实施例2
在本实施例中还提供了一种视频数据压缩码流的解码装置,视频数据的编码装置,该装置用于实现上述实施例及优选实施方式,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施例所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。
图3是根据本发明实施例的视频数据压缩码流的解码装置的结构框图,如图3所示,该装置包括:
解析模块30,设置为解析视频数据压缩码流,获取采样格式信息和/或解码方式信息;
选择模块32,设置为根据采样格式信息和/或解码方式信息,在预定的多种采样格式和解码方式之中,选择第一采样格式和与第一采样格式相应的解码方式;
解码模块34,设置为采用第一采样格式和第一采样格式相应的解码方式对解码块进行解码。
图4是根据本发明实施例的视频数据的编码装置的结构框图,如图4所示,该装置包括:
选择模块40,设置为从预定的多种采样格式中选择第一采样格式,以及从预定的多种编码方式之中选择与第一采样格式对应的编码方式;
编码模块42,设置为使用选择的第一采样格式和选择的编码方式对视频数据的编码块进行编码产生视频数据压缩码流,其中,视频数据压缩码流包含:第一采样格式和/或编码方式,与第一采样格式和/或编码方式对 应的语法元素。
需要说明的是,上述各个模块是可以通过软件或硬件来实现的,对于后者,可以通过以下方式实现,但不限于此:上述模块均位于同一处理器中;或者,上述各个模块以任意组合的形式分别位于不同的处理器中。
实施例3
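This embodiment is an optional embodiment of the present invention and supplements and describes in detail the solution of the present application: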
为了解决对由多种特性的内容混合而成的数据集的压缩中的这一问题,本实施例提供了一种采用多种(即两种或以上)采样格式和相应编码方式的数据压缩方法和装置:数据集和编解码块有K(K>1)个分别具有K种不同采样格式的版本,相应地有K套编解码方式;在对一个编解码块进行编解码时,选择对所述K个版本之一使用相应的编解码方式进行编解码。
本实施例的首要技术特征是采用多种采样格式(即多种具有不同采样格式的数据版本)和相应的编解码方式之一对一个编解码块进行编解码。
优选地,采用两种采样格式和相应的编解码方式之一对一个编解码块进行编解码。
优选地,数据集及其元素由3个分量组成。
优选地,数据集是具有矩形形状的图像。
优选地,数据集是具有矩形形状的图像的序列。
优选地,数据集是由3个分量组成的图像。
优选地,数据集是由3个分量组成的图像序列。
优选地,数据集是由3个分量组成的视频。
优选地,数据集是由R分量、G分量、B分量组成的图像。
优选地,数据集是由R分量、G分量、B分量组成的视频。
优选地,数据集是由Y亮度分量、U色度分量、V色度分量组成的图像。
本实施例中,优选地,数据集是由Y亮度分量、U色度分量、V色度分量组成的视频。
本实施例中,优选地,两种采样格式是4:4:4采样格式和4:2:0采样格式。
本实施例中,优选地,两种采样格式是4:4:4采样格式和4:2:2采样格式。
本实施例中,优选地,两种采样格式是4:2:0采样格式和4:2:2采样格式。
本实施例中,优选地,多种采样格式中的一种是主采样格式,而其他采样格式则是所述主采样格式经过下采样操作得到的采样格式。
本实施例中,优选地,在编解码中产生的一种采样格式的数据版本,经过采样格式转换操作转换为其他采样格式的数据版本。
本实施例中,优选地,采样格式转换操作包括重采样操作和/或上采样操作和/或下采样操作。
本实施例中,优选地,与一种采样格式相应的编解码方式包括块预测操作,和/或变换操作;与另一种采样格式相应的编解码方式包括串预测操作。
本实施例中,优选地,数据集是具有矩形形状的图像,与一种采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或变换操作;与另一种采样格式相应的编解码方式可包括串预测操作。
本实施例中,优选地,数据集是具有矩形形状的图像的序列,与一种采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或从当前编解码图像的邻近图像进行帧间预测的操作,和/ 或变换操作;与另一种采样格式相应的编解码方式可包括串预测操作。
本实施例中,优选地,数据集是具有矩形形状的图像的序列,与4:2:0采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或从当前编解码图像的邻近图像进行帧间预测的操作,和/或变换操作;与4:4:4采样格式相应的编解码方式可包括串预测操作。
本实施例中,优选地,数据集是具有矩形形状的图像的序列,与4:2:0采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或从当前编解码图像的邻近图像进行帧间预测的操作,和/或变换操作,产生的4:2:0采样格式的数据版本经过上采样操作转换为4:4:4采样格式的数据版本;与4:4:4采样格式相应的编解码方式可包括串预测操作,产生的4:4:4采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本。
本实施例的编码方法或装置的最基本的特有技术特征是根据一个当前编码块的特性自适应地采用预定的多种采样格式和相应编码方式之一对所述当前编码块进行编码,产生至少含采样格式,和/或与采样格式相应编码方式的标识码的信息及其对应的解码时需要的其他信息的压缩数据码流。
图5是根据本发明实施例的编码方法的一个示意图。优选地,采用两种采样格式和相应的编码方式之一对一个编码块进行编码。优选地,数据集及其元素由3个分量组成。优选地,数据集是具有矩形形状的图像。优选地,数据集是具有矩形形状的图像的序列。优选地,数据集是由3个分量组成的图像。优选地,数据集是由3个分量组成的图像序列。优选地,数据集是由3个分量组成的视频。优选地,数据集是由R分量、G分量、B分量组成的图像。优选地,数据集是由R分量、G分量、B分量组成的视频。优选地,数据集是由Y亮度分量、U色度分量、V色度分量组成的图像。优选地,数据集是由Y亮度分量、U色度分量、V色度分量组成的视频。优选地,两种采样格式是4:4:4采样格式和4:2:0采样格式。优选地, 两种采样格式是4:4:4采样格式和4:2:2采样格式。优选地,两种采样格式是4:2:0采样格式和4:2:2采样格式。优选地,多种采样格式中的一种是主采样格式,而其他采样格式则是所述主采样格式经过下采样操作得到的采样格式。优选地,在编码中产生的一种采样格式的数据版本,经过采样格式转换操作转换为其他采样格式的数据版本。优选地,采样格式转换操作包括重采样操作和/或上采样操作和/或下采样操作。优选地,与一种采样格式相应的编码方式包括块预测操作和/或变换操作;与另一种采样格式相应的编码方式包括串预测操作。优选地,数据集是具有矩形形状的图像,与一种采样格式相应的编码方式包括从当前编码块的邻近像素进行帧内预测的操作,和/或变换操作;与另一种采样格式相应的编码方式包括串预测操作。优选地,数据集是具有矩形形状的图像序列,与一种采样格式相应的编码方式包括从当前编码块的邻近像素进行帧内预测的操作,和/或从当前编码图像的邻近图像进行帧间预测的操作,和/或变换操作;与另一种采样格式相应的编码方式包括串预测操作。优选地,数据集是具有矩形形状的图像序列,与4:2:0采样格式相应的编码方式包括从当前编码块的邻近像素进行帧内预测的操作,和/或从当前编码图像的邻近图像进行帧间预测的操作,和/或变换操作;与4:4:4采样格式相应的编码方式包括串预测操作。优选地,数据集是具有矩形形状的图像序列,与4:2:0采样格式相应的编码方式包括从当前编码块的邻近像素进行帧内预测的操作,和/或从当前编码图像的邻近图像进行帧间预测的操作,和/或变换操作,产生的4:2:0采样格式的数据版本经过上采样操作转换为4:4:4采样格式的数据版本;与4:4:4采样格式相应的编码方式包括串预测操作,产生的4:4:4采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本。
本实施例的解码方法或装置的最基本的特有技术特征是解析压缩数据码流,获取采样格式和/或相应编码方式的信息,根据所述采样格式和/或相应编码方式的信息采用预定的多种采样格式和相应解码方式之一对一个当前解码块进行解码。
图6是根据本发明实施例的解码方法的一个示意图。优选地,采用两 种采样格式和相应的解码方式之一对一个解码块进行解码。优选地,数据集及其元素由3个分量组成。优选地,数据集是具有矩形形状的图像。优选地,数据集是具有矩形形状的图像的序列。优选地,数据集是由3个分量组成的图像。优选地,数据集是由3个分量组成的图像序列。优选地,数据集是由3个分量组成的视频。优选地,数据集是由R分量、G分量、B分量组成的图像。优选地,数据集是由R分量、G分量、B分量组成的视频。优选地,数据集是由Y亮度分量、U色度分量、V色度分量组成的图像。优选地,数据集是由Y亮度分量、U色度分量、V色度分量组成的视频。优选地,两种采样格式是4:4:4采样格式和4:2:0采样格式。优选地,两种采样格式是4:4:4采样格式和4:2:2采样格式。优选地,两种采样格式是4:2:0采样格式和4:2:2采样格式。优选地,多种采样格式中的一种是主采样格式,而其他采样格式则是所述主采样格式经过下采样操作得到的采样格式。优选地,在解码中产生的一种采样格式的数据版本,经过采样格式转换操作转换为其他采样格式的数据版本。优选地,采样格式转换操作包括重采样操作和/或上采样操作和/或下采样操作。优选地,与一种采样格式相应的解码方式包括块预测操作和/或变换操作;与另一种采样格式相应的解码方式包括串预测操作。优选地,数据集是具有矩形形状的图像,与一种采样格式相应的解码方式包括从当前解码块的邻近像素进行帧内预测的操作,和/或变换操作;与另一种采样格式相应的解码方式包括串预测操作。优选地,数据集是具有矩形形状的图像序列,与一种采样格式相应的解码方式包括从当前解码块的邻近像素进行帧内预测的操作,和/或从当前解码图像的邻近图像进行帧间预测的操作,和/或变换操作;与另一种采样格式相应的解码方式包括串预测操作。优选地,数据集是具有矩形形状的图像序列,与4:2:0采样格式相应的解码方式包括从当前解码块的邻近像素进行帧内预测的操作,和/或从当前解码图像的邻近图像进行帧间预测的操作,和/或变换操作;与4:4:4采样格式相应的解码方式包括串预测操作。优选地,数据集是具有矩形形状的图像序列,与4:2:0采样格式相应的解码方式包括从当前解码块的邻近像素进行帧内预测的操作,和/或从 当前解码图像的邻近图像进行帧间预测的操作,和/或变换操作,产生的4:2:0采样格式的数据版本经过上采样操作转换为4:4:4采样格式的数据版本;与4:4:4采样格式相应的解码方式包括串预测操作,产生的4:4:4采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本。
根据本实施例的一个方面,提供了一种对数据进行压缩的编码方法或装置,至少包括完成下列功能和操作的步骤或模块:
自适应地选择预定的多种采样格式和相应编码方式之中的一种采样格式和相应编码方式对一个编码块进行编码,产生至少含采样格式和/或相应编码方式的信息及其语法元素的压缩数据码流。
本实施例还提供了一种对数据进行压缩的解码方法或装置,至少包括完成下列功能和操作的步骤或模块:解析压缩数据码流,获取采样格式和/或相应编码方式的信息,根据所述采样格式和/或相应编码方式的信息采用预定的多种采样格式和相应解码方式之中的一种采样格式和相应解码方式对一个解码块进行解码。
本实施例适用于对数据进行有损压缩的编码和解码,本实施例也同样适用于数据进行无损压缩的编码和解码。本实施例适用于一维数据如字符串数据或字节串数据的编码和解码,本实施例也同样适用于二维或以上数据如图像或视频数据的编码和解码。
本实施例中,数据包括下列类型的数据之一或其组合:一维数据;二维数据;多维数据;图像;图像的序列;视频;音频;文件;字节;比特;像素。
本实施例中,在数据是图像、图像的序列、视频等的情形,编码块或解码块是图像的一个编码区域或一个解码区域,包括以下情形:图像的子图像、宏块、最大编码单元LCU、编码树单元CTU、编码单元CU、CU的子区域、预测单元PU、变换单元TU。
本实施例中,所述采样格式是下列采样格式之一:
4:4:4采样格式;
或者
4:2:2采样格式;
或者
4:2:0采样格式。
本实施例中,所述编解码方式包括下列操作之一或其组合:
1)从当前编解码块的邻近像素进行帧内预测;
2)从当前编解码图像的邻近图像进行帧间预测;
3)变换和对应的逆变换;
4)量化和对应的反量化;
5)通用串预测;
6)调色板编码和对应的解码;
7)字典编码和对应的解码;
8)Hybrid coding;
9)熵编码和对应的熵解码。
以下是本实施例的更多的实施细节或变体,包括多个实例。
实例1
所述编码方法或装置或解码方法或装置中,所述多种采样格式是下列情形之一:
两种采样格式;
或者
三种采样格式;
或者
四种采样格式。
实例2
所述编码方法或装置或解码方法或装置中,所述数据是下列类型的数据之一。
由3个分量组成的数据;
或者
具有矩形形状的图像;
或者
具有矩形形状的图像的序列;
或者
由3个分量组成的图像;
或者
由3个分量组成的图像序列;
或者
由3个分量组成的视频;
或者
由R分量、G分量、B分量组成的图像;
或者
由R分量、G分量、B分量组成的图像序列;
或者
由R分量、G分量、B分量组成的视频;
或者
由Y亮度分量、U色度分量、V色度分量组成的图像;
或者
由Y亮度分量、U色度分量、V色度分量组成的图像序列;
或者
由Y亮度分量、U色度分量、V色度分量组成的视频;
或者
以上各种数据的一个编解码块;
或者
以上各种数据的变体,包括经过下列操作之一或其组合的变体数据:经过预测的预测残差、经过变换的变换域数据、经过差分运算的差分数据、经过量化的量化数据、经过反量化的数据、经过反变换的数据、经过去块效应滤波的数据、经过样值偏移补偿的数据、经过自适应修正滤波的数据。
实例3
所述编码方法或装置或解码方法或装置中,所述数据是由3个分量组成的图像,所述多种采样格式是两种采样格式,所述两种采样格式是下列情形之一:
4:4:4采样格式和4:2:0采样格式;
或者
4:4:4采样格式和4:2:2采样格式;
或者
4:2:0采样格式和4:2:2采样格式。
实例4
所述编码方法或装置或解码方法或装置中,所述多种采样格式中的一种是主采样格式,而其他采样格式则是所述主采样格式经过下采样操作得到的采样格式。
实例5
所述编码方法或装置或解码方法或装置中,在编解码中产生的一种采样格式的数据版本,经过采样格式转换操作转换为其他采样格式的数据版本。
实例6
实例5所述编码方法或装置或解码方法或装置中,所述采样格式转换操作包括重采样操作和/或上采样操作和/或下采样操作。
实例7
所述编码方法或装置或解码方法或装置中,与一种采样格式相应的编解码方式包括块预测操作,和/或变换操作;与另一种采样格式相应的编解码方式包括串预测操作。
实例8
所述编码方法或装置或解码方法或装置中,所述数据是具有矩形形状的图像,与一种采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或变换操作;与另一种采样格式相应的编解码方式包括串预测操作。
实例9
所述编码方法或装置或解码方法或装置中,所述数据是具有矩形形状的图像的序列,与一种采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或从当前编解码图像的邻近图像进行帧间预测的操作,和/或变换操作;与另一种采样格式相应的编解码方式包括串预测操作。
实例10
所述编码方法或装置或解码方法或装置中,所述数据是具有矩形形状和3个分量的图像的序列,所述多种采样格式是两种采样格式,所述两种采样格式是4:4:4采样格式和4:2:0采样格式,与所述4:2:0采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或从当前编解码图像的邻近图像进行帧间预测的操作,和/或变换操作;与所述4:4:4采样格式相应的编解码方式包括串预测操作。
实例11
所述编码方法或装置或解码方法或装置中,所述数据是具有矩形形状和3个分量的图像或图像的序列,所述多种采样格式是两种采样格式,所述两种采样格式是4:4:4采样格式和4:2:0采样格式,与所述4:2:0采样格式相应的编解码方式包括从当前编解码块的邻近像素进行帧内预测的操作,和/或从当前编解码图像的邻近图像进行帧间预测的操作,和/或变换操作,产生的4:2:0采样格式的数据版本经过上采样操作转换为4:4:4采样格式的数据版本;与所述4:4:4采样格式相应的编解码方式包括串预测操作,产生的4:4:4采样格式的数据版本经过下采样操作转换为4:2:0采样格式的数据版本。
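Example 12
In the encoding method or apparatus or the decoding method or apparatus, the data is an image or a sequence of images having a rectangular shape and 3 components; the plurality of sampling formats are two sampling formats, namely the 4:4:4 sampling format and the 4:2:0 sampling format. The codec mode corresponding to the 4:2:0 sampling format includes an intra prediction operation from neighboring pixels of the current codec block, and/or an inter prediction operation from a neighboring image of the current codec image, and/or a transform operation. The D component D420 = {D420[i][j]: i = 0 to M-1, j = 0 to N-1} and the E component E420 = {E420[i][j]: i = 0 to M-1, j = 0 to N-1} of the generated data version in the 4:2:0 sampling format are converted by the following upsampling operation into the D component D444 = {D444[i][j]: i = 0 to 2M-1, j = 0 to 2N-1} and the E component E444 = {E444[i][j]: i = 0 to 2M-1, j = 0 to 2N-1} of the data version in the 4:4:4 sampling format, respectively: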
D444[2i][2j]=D420[i][j]
D444[2i+1][2j]=D420[i][j]
D444[2i][2j+1]=D420[i][j]
D444[2i+1][2j+1]=D420[i][j]
E444[2i][2j]=E420[i][j]
E444[2i+1][2j]=E420[i][j]
E444[2i][2j+1]=E420[i][j]
E444[2i+1][2j+1]=E420[i][j]
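where i = 0 to M-1 and j = 0 to N-1. The codec mode corresponding to the 4:4:4 sampling format includes a string prediction operation. The D component D444 = {D444[i][j]: i = 0 to 2M-1, j = 0 to 2N-1} and the E component E444 = {E444[i][j]: i = 0 to 2M-1, j = 0 to 2N-1} of the generated data version in the 4:4:4 sampling format are converted by the following downsampling operation into the D component D420 = {D420[i][j]: i = 0 to M-1, j = 0 to N-1} and the E component E420 = {E420[i][j]: i = 0 to M-1, j = 0 to N-1} of the data version in the 4:2:0 sampling format, respectively: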
D420[i][j]=(D444[2i][2j]+D444[2i+1][2j]+D444[2i][2j+1]+D444[2i+1][2j+1]+R)>>2
E420[i][j]=(E444[2i][2j]+E444[2i+1][2j]+E444[2i][2j+1]+E444[2i+1][2j+1]+R)>>2
where i = 0 to M-1, j = 0 to N-1, and R equals 0 (truncation) or 2 (rounding).
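The upsampling and downsampling formulas of Example 12 translate directly into code. The Python sketch below applies them to one component plane (D or E) stored as a list of lists indexed [i][j]; the function names are illustrative and not taken from the patent.

```python
def upsample_420_to_444(c420):
    """Replicate each 4:2:0 sample into a 2x2 block of the 4:4:4 plane."""
    m, n = len(c420), len(c420[0])
    c444 = [[0] * (2 * n) for _ in range(2 * m)]
    for i in range(m):
        for j in range(n):
            v = c420[i][j]
            c444[2 * i][2 * j] = v
            c444[2 * i + 1][2 * j] = v
            c444[2 * i][2 * j + 1] = v
            c444[2 * i + 1][2 * j + 1] = v
    return c444


def downsample_444_to_420(c444, r=2):
    """Average each 2x2 block; r = 0 truncates, r = 2 rounds."""
    if r not in (0, 2):
        raise ValueError("R must be 0 (truncation) or 2 (rounding)")
    m, n = len(c444) // 2, len(c444[0]) // 2
    return [
        [
            (c444[2 * i][2 * j] + c444[2 * i + 1][2 * j]
             + c444[2 * i][2 * j + 1] + c444[2 * i + 1][2 * j + 1] + r) >> 2
            for j in range(n)
        ]
        for i in range(m)
    ]


if __name__ == "__main__":
    d420 = [[10, 20], [30, 40]]
    d444 = upsample_420_to_444(d420)
    print(d444)
    print(downsample_444_to_420(d444))
```

Because the upsampling step only replicates samples, downsampling an upsampled plane returns the original plane for either value of R; the choice of R matters only when the four samples in a 2×2 block differ.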
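Example 13
In the encoding method or apparatus or the decoding method or apparatus, a flag indicating that encoding and decoding with multiple sampling formats and/or corresponding codec modes is allowed is present at one or more of the following places in the compressed video data bitstream:
1) sequence parameter set; usually a syntax element of the sequence parameter set that is directly present or implicitly derived;
2) picture parameter set; usually a syntax element of the picture parameter set that is directly present or implicitly derived;
3) sequence header; usually a syntax element of the sequence header that is directly present or implicitly derived;
4) slice header; usually a syntax element of the slice header that is directly present or implicitly derived;
5) picture header; usually a syntax element of the picture header that is directly present or implicitly derived;
6) CTU header; usually a syntax element of the CTU header that is directly present or implicitly derived;
7) CU header; usually a syntax element of the CU header that is directly present or implicitly derived;
8) codec block header; usually a syntax element of the codec block header that is directly present or implicitly derived.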
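Example 14
In the encoding method or apparatus or the decoding method or apparatus, a flag indicating that codec blocks using the 4:4:4 sampling format and/or the corresponding string prediction codec mode are allowed is present at one or more of the following places in the compressed video data bitstream:
1) sequence parameter set; usually a syntax element of the sequence parameter set that is directly present or implicitly derived;
2) picture parameter set; usually a syntax element of the picture parameter set that is directly present or implicitly derived;
3) sequence header; usually a syntax element of the sequence header that is directly present or implicitly derived;
4) slice header; usually a syntax element of the slice header that is directly present or implicitly derived;
5) picture header; usually a syntax element of the picture header that is directly present or implicitly derived.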
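Example 15
In the encoding method or apparatus or the decoding method or apparatus, the predetermined sampling formats and corresponding codec modes are each represented by one of several predetermined values; one sampling format and its corresponding codec mode correspond to one predetermined value k. Each codec block has, in the compressed video data bitstream, a direct, indirect, or mixed direct-indirect sampling-format-and-codec-mode identification code.
If the sampling-format-and-codec-mode identification code equals k, then
{
encode or decode the codec block using the sampling format and corresponding codec mode that correspond to k
}
A direct sampling-format-and-codec-mode identification code consists of one or more bit strings (binary symbol strings) in the compressed video data bitstream. An indirect sampling-format-and-codec-mode identification code is derived from other codec parameters and/or other syntax elements of the compressed video data bitstream. A mixed direct-indirect sampling-format-and-codec-mode identification code is partly direct (that is, consisting of one or more bit strings in the compressed video data bitstream) and partly indirect (that is, derived from other codec parameters and/or other syntax elements of the compressed video data bitstream).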
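The distinction between a direct and an indirect identification code can be sketched as follows in Python. Everything concrete here is an assumption made for illustration: the two-entry table, the one-bit code width, the bit string represented as a text string of '0' and '1', and in particular the derivation rule in `indirect_id`, which the patent does not specify.

```python
# Hypothetical mapping from the value k to a sampling format and codec mode.
MODE_TABLE = {
    0: ("4:2:0", "intra/inter prediction and transform"),
    1: ("4:4:4", "string prediction"),
}


def direct_id(bits: str, pos: int, width: int = 1):
    """Direct code: read `width` bits at `pos`; return (k, new position)."""
    if pos + width > len(bits):
        raise ValueError("bit string exhausted")
    return int(bits[pos:pos + width], 2), pos + width


def indirect_id(other_params: dict) -> int:
    """Indirect code: derive k from other parameters.

    The rule below (a hypothetical 'string_flag' implies k = 1) is an
    assumption for illustration only.
    """
    return 1 if other_params.get("string_flag") else 0


def resolve_mode(bits: str, pos: int, other_params: dict, signalled: bool):
    """Return ((sampling format, codec mode), new bit position)."""
    if signalled:
        k, pos = direct_id(bits, pos)
    else:
        k = indirect_id(other_params)
    return MODE_TABLE[k], pos


if __name__ == "__main__":
    print(resolve_mode("1", 0, {}, signalled=True))
    print(resolve_mode("", 0, {"string_flag": True}, signalled=False))
```

A mixed direct-indirect code would combine both paths, for example by reading some bits and deriving the remaining part of k from other parameters; that variant is left out here to keep the sketch short.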
实例16
所述编码方法或装置或解码方法或装置中,用来表示所述编解码块的采样格式和相应编解码方式的采样格式和相应编解码方式标识码语法元素以下列形式存在于所述编解码块的视频数据压缩码流中:
编解码块头信息语法元素、采样格式和相应编解码方式标识码语法元素、更多的编解码块头信息语法元素、编解码块数据语法元素;
编解码块头信息语法元素、部分采样格式和相应编解码方式标识码语法元素、更多的编解码块头信息语法元素、部分编解码块数据语法元素、另一部分采样格式和相应编解码方式标识码语法元素、另一部分编解码块数据语法元素;
其中,采样格式和相应编解码方式标识码取一个值时,采用与所述值对应的那种采样格式和相应编解码方式对所述编解码块进行编解码。
实例17
所述编码方法或装置或解码方法或装置中,与一种采样格式相应的编解码方式包括预测操作,和/或预测补偿操作,和/或去块效应滤波操作,和/或样值偏移补偿操作,和/或自适应修正滤波操作;与另一种采样格式相应的编解码方式包括变换操作,和/或量化操作,和/或反量化操作(缩放scaling操作),和/或反变换操作。
实例18
所述编码方法或装置或解码方法或装置中,与一种采样格式相应的编 解码方式包括块预测操作,和/或串预测操作,和/或预测补偿操作;与另一种采样格式相应的编解码方式包括变换操作,和/或量化操作,和/或反量化操作,和/或反变换操作。
实施例4
本发明的实施例还提供了一种存储介质。可选地,在本实施例中,上述存储介质可以被设置为存储用于执行以下步骤的程序代码:
S1,解析视频数据压缩码流,获取采样格式信息和/或解码方式信息;
S2,根据所述采样格式信息和/或解码方式信息,在预定的多种采样格式和解码方式之中,选择第一采样格式和与所述第一采样格式相应的解码方式;
S3,采用所述第一采样格式和所述第一采样格式相应的解码方式对解码块进行解码。
可选地,在本实施例中,上述存储介质可以包括但不限于:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。
可选地,在本实施例中,处理器根据存储介质中已存储的程序代码执行解析视频数据压缩码流,获取采样格式信息和/或解码方式信息;
可选地,在本实施例中,处理器根据存储介质中已存储的程序代码执行根据所述采样格式信息和/或解码方式信息,在预定的多种采样格式和解码方式之中,选择第一采样格式和与所述第一采样格式相应的解码方式;
可选地,在本实施例中,处理器根据存储介质中已存储的程序代码执行采用所述第一采样格式和所述第一采样格式相应的解码方式对解码块进行解码。
可选地,本实施例中的具体示例可以参考上述实施例及可选实施方式中所描述的示例,本实施例在此不再赘述。
显然,本领域的技术人员应该明白,上述的本发明的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,可选地,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本发明不限制于任何特定的硬件和软件结合。
以上所述仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。
工业实用性
本发明实施例提供的上述技术方案,在预定的多种采样格式和解码方式之中选择采样格式和相应的解码方式,解决了相关技术中采用单一的采用格式和解码方式进行解码时效率过低的技术问题,提高了解码速率。

Claims (34)

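  1. A method for decoding a compressed video data bitstream, comprising:
    parsing the compressed video data bitstream to obtain sampling format information and/or decoding mode information;
    selecting, according to the sampling format information and/or decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a plurality of predetermined sampling formats and decoding modes;
    decoding a decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.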
  2. 根据权利要求1所述的方法,其中,所述视频数据压缩码流包括以下至少之一信息的数据压缩码流:
    一维数据,二维数据,大于二维的多维数据,图像,图像的序列,视频,音频,文件,字节,比特,像素,由三个分量组成的数据,具有矩形形状的图像,具有矩形形状的图像的序列,由三个分量组成的图像,由三个分量组成的图像序列,由三个分量组成的视频,由R分量、G分量、B分量组成的图像,由R分量、G分量、B分量组成的图像序列,由R分量、G分量、B分量组成的视频,由一个亮度分量和两个色度分量组成的图像,由一个亮度分量两个色度分量组成的图像序列,由一个亮度分量两个色度分量组成的视频,数据的编码块。
  3. 根据权利要求1所述的方法,其中,所述解码块是图像的解码区域,其中,所述解码区域包括以下至少之一:图像的子图像、宏块、最大编码单元LCU、编码树单元CTU、编码单元CU、CU的子区域、预测单元PU、变换单元TU。
  4. 根据权利要求1所述的方法,其中,所述多种采样格式包括主采样格式和其他采样格式,其中,所述其他采样格式是所述主采样格式经过采样操作得到的采样格式。
  5. 根据权利要求1所述的方法,其中,所述视频数据压缩码流具有矩形形状和三个分量的图像或图像的序列的数据压缩码流。
  6. 根据权利要求1所述的方法,其中,所述多种采样格式是4:4:4采样格式和4:2:0采样格式;或者,所述多种采样格式是4:4:4采样格式和4:2:2采样格式;或者,所述多种采样格式是4:2:2采样格式和4:2:0采样格式。
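  7. The method according to claim 6, wherein
    the decoding mode corresponding to the 4:2:0 sampling format comprises: generating a data version in the 4:2:0 sampling format, and converting the data version in the 4:2:0 sampling format by an upsampling operation into a data version in the 4:4:4 or 4:2:2 sampling format, wherein generating the data version in the 4:2:0 sampling format comprises: generating it by an intra prediction operation from neighboring pixels of the decoding block, and/or by an inter prediction operation from neighboring images of the decoded image;
    the decoding mode corresponding to the 4:4:4 or 4:2:2 sampling format comprises: generating a data version in the 4:4:4 or 4:2:2 sampling format by a prediction operation, and converting the data version in the 4:4:4 or 4:2:2 sampling format by a downsampling operation into a data version in the 4:2:0 sampling format.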
  8. 根据权利要求7所述的方法,其中,所述解码方式包括以下至少之一:根据所述解码块的邻近像素进行帧内预测;根据所述解码图像的邻近图像进行帧间预测;变换;缩放scaling;通用串预测;调色板解码;字典解码;熵解码。
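  9. The method according to claim 1, further comprising: parsing the compressed video data bitstream and obtaining a first flag from one of the following positions: a sequence parameter set, a picture parameter set, a sequence header, a slice header, a picture header, a CTU header, a CU header, and a decoding block header, wherein the first flag indicates that decoding with a plurality of sampling formats and/or corresponding decoding modes is allowed.
  10. The method according to claim 1, further comprising: parsing the compressed video data bitstream and obtaining a second flag from at least one of the following positions: a sequence parameter set, a picture parameter set, a sequence header, a slice header, a picture header, and a decoding block header, wherein the second flag indicates that decoding blocks using the 4:4:4 sampling format and/or the corresponding string prediction decoding mode are allowed.
  11. The method according to claim 1, further comprising: parsing the compressed video data bitstream and obtaining a third flag from at least one of the following positions: a sequence parameter set, a picture parameter set, a sequence header, a slice header, a picture header, and a decoding block header, wherein the third flag indicates that decoding blocks using the 4:2:2 sampling format and/or the corresponding string prediction decoding mode are allowed.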
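  12. The method according to claim 1, wherein one sampling format and decoding mode among the plurality of predetermined sampling formats and decoding modes corresponds to a predetermined value k, and a direct, indirect, or mixed direct-indirect identification code of the sampling format and corresponding decoding mode is obtained for the decoding block from the compressed video data bitstream.
  13. The method according to claim 12, wherein
    the direct identification code of the sampling format and corresponding decoding mode consists of one or more bit strings in the compressed video data bitstream;
    the indirect identification code of the sampling format and corresponding decoding mode is an identification code derived from decoding parameters other than the decoding mode parameter and/or from syntax elements of the compressed video data bitstream other than the syntax element corresponding to the decoding mode parameter;
    the mixed direct-indirect identification code of the sampling format and corresponding decoding mode is an identification code that is partly direct and partly indirect.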
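  14. The method according to claim 1, wherein the identification code of the sampling format and the corresponding decoding mode is obtained from the following positions in the compressed video data bitstream:
    the decoding block header information syntax element, the sampling format and corresponding decoding mode identification code syntax element, an additional decoding block header information syntax element, and the decoding block data syntax element; or
    the decoding block header information syntax element, a part of the sampling format and corresponding decoding mode identification code syntax element, an additional decoding block header information syntax element, a part of the decoding block data syntax element, another part of the sampling format and corresponding decoding mode identification code syntax element, and another part of the decoding block data syntax element;
    wherein, when the value of the identification code in the identification code syntax element equals a specified value, the decoding block is decoded using the sampling format and corresponding decoding mode that correspond to the specified value.
  15. The method according to claim 7, wherein the prediction operation comprises a string prediction operation.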
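  16. A method for encoding video data, comprising:
    selecting a first sampling format from among a plurality of predetermined sampling formats, and selecting an encoding mode corresponding to the first sampling format from among a plurality of predetermined encoding modes;
    encoding a coding block of the video data using the selected first sampling format and the selected encoding mode to generate a compressed video data bitstream, wherein the compressed video data bitstream contains: the first sampling format and/or the encoding mode, and syntax elements corresponding to the first sampling format and/or the encoding mode.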
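  17. The method according to claim 16, wherein the video data comprises at least one of the following:
    one-dimensional data, two-dimensional data, multi-dimensional data of more than two dimensions, an image, a sequence of images, video, audio, a file, a byte, a bit, a pixel, data composed of three components, an image having a rectangular shape, a sequence of images having a rectangular shape, an image composed of three components, an image sequence composed of three components, a video composed of three components, an image composed of an R component, a G component, and a B component, an image sequence composed of an R component, a G component, and a B component, a video composed of an R component, a G component, and a B component, an image composed of one luma component and two chroma components, an image sequence composed of one luma component and two chroma components, a video composed of one luma component and two chroma components, and a coding block of data.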
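  18. The method according to claim 16, wherein the coding block is a coding region of an image, and the coding region comprises at least one of the following: a sub-image of the image, a macroblock, a largest coding unit (LCU), a coding tree unit (CTU), a coding unit (CU), a sub-region of a CU, a prediction unit (PU), and a transform unit (TU).
  19. The method according to claim 16, wherein the plurality of sampling formats comprise a primary sampling format and other sampling formats, and the other sampling formats are sampling formats obtained from the primary sampling format through a sampling operation.
  20. The method according to claim 16, wherein the video data is an image, or a sequence of images, having a rectangular shape and three components.
  21. The method according to claim 16, wherein the plurality of sampling formats are a 4:4:4 sampling format and a 4:2:0 sampling format; or the plurality of sampling formats are a 4:4:4 sampling format and a 4:2:2 sampling format; or the plurality of sampling formats are a 4:2:2 sampling format and a 4:2:0 sampling format.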
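  22. The method according to claim 21, wherein
    the encoding mode corresponding to the 4:2:0 sampling format comprises: generating a data version in the 4:2:0 sampling format, and converting the data version in the 4:2:0 sampling format into a data version in the 4:4:4 or 4:2:2 sampling format through an upsampling operation, wherein generating the data version in the 4:2:0 sampling format comprises: generating the data version in the 4:2:0 sampling format by an intra prediction operation on neighboring pixels of the coding block, and/or generating the data version in the 4:2:0 sampling format by an inter prediction operation on neighboring images of the coded image;
    the encoding mode corresponding to the 4:4:4 or 4:2:2 sampling format comprises: generating a data version in the 4:4:4 or 4:2:2 sampling format by a prediction operation, and converting the data version in the 4:4:4 or 4:2:2 sampling format into a data version in the 4:2:0 sampling format through a downsampling operation.
  23. The method according to claim 22, wherein the encoding mode comprises at least one of the following: intra prediction based on neighboring pixels of the coding block; inter prediction based on neighboring images of the coded image; transform; quantization; general string prediction; palette coding; dictionary coding; hybrid coding; and entropy coding.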
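  24. The method according to claim 16, further comprising: including a first flag in one of the following parts of the compressed video data bitstream: a sequence parameter set, a picture parameter set, a sequence header, a slice header, a picture header, a CTU header, a CU header, and a coding block header, wherein the first flag indicates that encoding with a plurality of sampling formats and/or corresponding encoding modes is allowed.
  25. The method according to claim 16, further comprising: including a second flag in at least one of the following parts of the compressed video data bitstream: a sequence parameter set, a picture parameter set, a sequence header, a slice header, a picture header, and a coding block header, wherein the second flag indicates that coding blocks using the 4:4:4 sampling format and/or the corresponding encoding mode are allowed.
  26. The method according to claim 16, further comprising: including a third flag in at least one of the following parts of the compressed video data bitstream: a sequence parameter set, a picture parameter set, a sequence header, a slice header, a picture header, and a coding block header, wherein the third flag indicates that coding blocks using the 4:2:2 sampling format and/or the corresponding encoding mode are allowed.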
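  27. The method according to claim 16, wherein one sampling format and encoding mode among the plurality of predetermined sampling formats and encoding modes corresponds to a predetermined value k, a direct, indirect, or mixed direct-indirect identification code of the sampling format and corresponding encoding mode is set for the coding block, and the encoding mode identification code is included in the compressed video data bitstream.
  28. The method according to claim 27, wherein
    the direct identification code of the sampling format and corresponding encoding mode consists of one or more bit strings in the compressed video data bitstream;
    the indirect identification code of the sampling format and corresponding encoding mode is an identification code derived from encoding parameters other than the selected encoding mode parameter and/or from syntax elements of the compressed video data bitstream other than said syntax elements;
    the mixed direct-indirect identification code of the sampling format and corresponding encoding mode is an identification code that is partly direct and partly indirect.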
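  29. The method according to claim 16, wherein the identification code of the sampling format and the corresponding encoding mode is present in the compressed video data bitstream in one of the following arrangements:
    the coding block header information syntax element, the sampling format and corresponding encoding mode identification code syntax element, an additional coding block header information syntax element, and the coding block data syntax element; or
    the coding block header information syntax element, a part of the sampling format and corresponding encoding mode identification code syntax element, an additional coding block header information syntax element, a part of the coding block data syntax element, another part of the sampling format and corresponding encoding mode identification code syntax element, and another part of the coding block data syntax element;
    wherein, when the value of the identification code in the identification code syntax element equals a specified value, the coding block is encoded using the sampling format and corresponding encoding mode that correspond to the specified value.
  30. The method according to claim 22, wherein the prediction operation comprises a string prediction operation.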
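  31. An apparatus for decoding a compressed video data bitstream, comprising:
    a parsing module, configured to parse the compressed video data bitstream and obtain sampling format information and/or decoding mode information;
    a selection module, configured to select, according to the sampling format information and/or the decoding mode information, a first sampling format and a decoding mode corresponding to the first sampling format from among a plurality of predetermined sampling formats and decoding modes;
    a decoding module, configured to decode a decoding block using the first sampling format and the decoding mode corresponding to the first sampling format.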
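  32. An apparatus for encoding video data, comprising:
    a selection module, configured to select a first sampling format from among a plurality of predetermined sampling formats, and to select an encoding mode corresponding to the first sampling format from among a plurality of predetermined encoding modes;
    an encoding module, configured to encode a coding block of the video data using the selected first sampling format and the selected encoding mode to generate a compressed video data bitstream, wherein the compressed video data bitstream contains: the first sampling format and/or the encoding mode, and syntax elements corresponding to the first sampling format and/or the encoding mode.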
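  33. A storage medium comprising a stored program, wherein the program, when run, performs the method according to any one of claims 1 to 30.
  34. A processor configured to run a program, wherein the program, when run, performs the method according to any one of claims 1 to 30.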
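
Claims 6, 7, 21, and 22 refer to the 4:4:4, 4:2:2, and 4:2:0 sampling formats and to upsampling and downsampling between them. As a rough illustration only, the sketch below computes chroma plane sizes for each format and converts one chroma plane between 4:4:4 and 4:2:0. The filters are assumptions chosen for simplicity (2x2 averaging for downsampling, sample replication for upsampling); the application does not specify them, and the function names are hypothetical.

```python
from typing import Dict, List, Tuple

Plane = List[List[int]]

# Horizontal and vertical chroma subsampling factors of each sampling format.
SUBSAMPLING: Dict[str, Tuple[int, int]] = {
    "4:4:4": (1, 1),
    "4:2:2": (2, 1),
    "4:2:0": (2, 2),
}


def chroma_size(width: int, height: int, fmt: str) -> Tuple[int, int]:
    """Return (width, height) of one chroma plane, rounding up for odd sizes."""
    sx, sy = SUBSAMPLING[fmt]
    return ((width + sx - 1) // sx, (height + sy - 1) // sy)


def downsample_444_to_420(plane: Plane) -> Plane:
    """Average each 2x2 group with rounding; assumes even width and height."""
    h, w = len(plane), len(plane[0])
    return [
        [
            (plane[y][x] + plane[y][x + 1]
             + plane[y + 1][x] + plane[y + 1][x + 1] + 2) // 4
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]


def upsample_420_to_444(plane: Plane) -> Plane:
    """Replicate every sample into a 2x2 group (nearest-neighbour upsampling)."""
    out: Plane = []
    for row in plane:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out
```

With these definitions, a 1920x1080 picture has 960x540 chroma planes in 4:2:0 and 960x1080 chroma planes in 4:2:2. Downsampling followed by upsampling is lossy in general, which is one reason a codec might keep some blocks in 4:4:4.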
PCT/CN2017/087482 2016-06-08 2017-06-07 视频数据压缩码流的解码、视频数据的编码方法及装置 WO2017211306A1 (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN201610401154 2016-06-08
CN201610401154.0 2016-06-08
CN201710143873 2017-03-12
CN201710143873.1 2017-03-12

Publications (1)

Publication Number Publication Date
WO2017211306A1 true WO2017211306A1 (zh) 2017-12-14

Family

ID=60578383

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/087482 WO2017211306A1 (zh) 2016-06-08 2017-06-07 视频数据压缩码流的解码、视频数据的编码方法及装置

Country Status (2)

Country Link
CN (1) CN107483942B (zh)
WO (1) WO2017211306A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113395515A (zh) * 2021-04-08 2021-09-14 同济大学 对分量下采样格式数据进行点预测的编码解码方法及装置
CN115037927A (zh) * 2022-05-07 2022-09-09 同济大学 融合全色度与混合色度的图像编码和解码方法及其应用

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109063091B (zh) * 2018-07-26 2021-06-15 成都大学 混合编码的数据迁移方法、数据迁移装置和存储介质
CN109379630B (zh) * 2018-11-27 2021-03-12 Oppo广东移动通信有限公司 视频处理方法、装置、电子设备及存储介质
CN113163212B (zh) * 2020-01-07 2024-08-13 腾讯科技(深圳)有限公司 视频解码方法及装置、视频编码方法及装置、介质和设备
CN111314778B (zh) * 2020-03-02 2021-09-07 北京小鸟科技股份有限公司 基于多种压缩制式的编解码融合处理方法、系统及装置
CN112929624B (zh) * 2021-01-21 2023-02-17 杭州雾联科技有限公司 一种编码方法、装置、电子设备及计算机可读存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013102418A1 (en) * 2012-01-04 2013-07-11 Mediatek Singapore Pte. Ltd. Method and apparatus of luma-based chroma intra prediction
CN104919804A (zh) * 2012-10-01 2015-09-16 微软技术许可有限责任公司 帧封装和解封较高分辨率色度采样格式
CN104995918A (zh) * 2013-01-08 2015-10-21 微软公司 用于将帧的格式转换成色度子采样格式的方法
CN105264888A (zh) * 2014-03-04 2016-01-20 微软技术许可有限责任公司 用于对色彩空间、色彩采样率和/或比特深度自适应切换的编码策略

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050129130A1 (en) * 2003-12-10 2005-06-16 Microsoft Corporation Color space coding framework
CN101420614B (zh) * 2008-11-28 2010-08-18 同济大学 一种混合编码与字典编码整合的图像压缩方法及装置
CN103918269B (zh) * 2012-01-04 2017-08-01 联发科技(新加坡)私人有限公司 色度帧内预测方法及装置
JP6126234B2 (ja) * 2012-11-12 2017-05-10 エルジー エレクトロニクス インコーポレイティド 信号送受信装置及び信号送受信方法
US10397607B2 (en) * 2013-11-01 2019-08-27 Qualcomm Incorporated Color residual prediction for video coding
CN104853211A (zh) * 2014-02-16 2015-08-19 上海天荷电子信息有限公司 使用多种形式的参考像素存储空间的图像压缩方法和装置
CN104853209B (zh) * 2014-02-16 2020-09-29 同济大学 图像编码、解码方法及装置

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113395515A (zh) * 2021-04-08 2021-09-14 同济大学 对分量下采样格式数据进行点预测的编码解码方法及装置
CN113395515B (zh) * 2021-04-08 2022-06-14 同济大学 对分量下采样格式数据进行点预测的编码解码方法及装置
CN115037927A (zh) * 2022-05-07 2022-09-09 同济大学 融合全色度与混合色度的图像编码和解码方法及其应用

Also Published As

Publication number Publication date
CN107483942A (zh) 2017-12-15
CN107483942B (zh) 2023-07-14

Similar Documents

Publication Publication Date Title
WO2017211306A1 (zh) 视频数据压缩码流的解码、视频数据的编码方法及装置
CN105556963B (zh) 用于hevc范围扩展的残差差分脉冲编码调制方法
EP3416389B1 (en) Encoding and decoding method and device for data compression
KR100968652B1 (ko) 이미지 프레임들의 비-프레임-에지 블록들을 표현함에있어서의 강화된 압축
WO2021004152A1 (zh) 图像分量的预测方法、编码器、解码器以及存储介质
US9497461B2 (en) Method and apparatus for encoding frequency transformed block using frequency mask table, and method and apparatus for encoding/decoding video using same
JP2017535218A (ja) 高効率ビデオ符号化(hevc)画面コンテンツ符号化(scc)における改善されたパレットモード
CN104396245A (zh) 用于对图像进行编码或解码的方法和装置
WO2021238540A1 (zh) 图像编码方法、图像解码方法及相关装置
KR20070009486A (ko) 영상 부호화 및 복호화 방법과 장치
US20080175494A1 (en) Methods and Systems for Inter-Layer Image Prediction
JP2015019152A (ja) 画像符号化装置、画像符号化方法及びプログラム、画像復号装置、画像復号方法及びプログラム
WO2023040600A1 (zh) 图像编码方法、图像解码方法、装置、电子设备及介质
AU2003291058B2 (en) Apparatus and method for multiple description encoding
TW202127893A (zh) 用於視頻編碼中的參考圖片重採樣的參考圖片縮放比
JP2022548354A (ja) ビデオ復号方法、ビデオ符号化方法、装置、機器及び記憶媒体
TW202106013A (zh) 用於視訊寫碼中之自適應迴路濾波器之剪切索引寫碼
CN113613008A (zh) 一种视频编解码的方法、装置、电子设备及存储介质
JP2024506156A (ja) ビデオ符号化のための残差および係数の符号化
WO2017137006A1 (zh) 数据压缩的编码、解码方法及装置
CN118632028A (zh) 用于视频编解码的残差和系数编解码
CN116803077A (zh) 用于视频编解码的残差和系数编解码
CN114786019B (zh) 图像预测方法、编码器、解码器以及存储介质
CN108574845B (zh) 动态采用多种采样格式的数据压缩方法和装置
CN111416975B (zh) 预测模式确定方法和装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17809745

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17809745

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 18.07.2019)

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 18/07/2019)
