US10432938B2 - Method and device for video coding and decoding - Google Patents

Method and device for video coding and decoding Download PDF

Info

Publication number
US10432938B2
US10432938B2 US15/418,496 US201715418496A US10432938B2 US 10432938 B2 US10432938 B2 US 10432938B2 US 201715418496 A US201715418496 A US 201715418496A US 10432938 B2 US10432938 B2 US 10432938B2
Authority
US
United States
Prior art keywords
slice
slices
current
data
slice set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
US15/418,496
Other versions
US20170142414A1 (en
Inventor
Xiaozhen ZHENG
Zhen MAO
Ziyuan WANG
Yun He
Xiaozhong Xu
Jianhua Zheng
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Huawei Technologies Co Ltd
Original Assignee
Tsinghua University
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/CN2008/072362 external-priority patent/WO2009033432A1/en
Application filed by Tsinghua University, Huawei Technologies Co Ltd filed Critical Tsinghua University
Priority to US15/418,496 priority Critical patent/US10432938B2/en
Publication of US20170142414A1 publication Critical patent/US20170142414A1/en
Assigned to TSINGHUA UNIVERSITY, HUAWEI TECHNOLOGIES CO., LTD. reassignment TSINGHUA UNIVERSITY ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: WANG, ZIYUAN, HE, YUN, ZHENG, XIAOZHEN, XU, XIAOZHONG, MAO, ZHEN, ZHENG, JIANHUA
Priority to US16/573,148 priority patent/US10939112B2/en
Application granted granted Critical
Publication of US10432938B2 publication Critical patent/US10432938B2/en
Priority to US17/164,502 priority patent/US11659184B2/en
Priority to US18/302,761 priority patent/US12382046B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • H04N19/126Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/174Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/436Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream

Definitions

  • the invention relates to coding and decoding technologies, and in particular, to a method and device for video coding and decoding.
  • a picture may be divided into several image blocks or macroblocks, and motion estimation is carried out based on blocks or macroblocks.
  • the residual after motion estimation is transformed and quantized.
  • entropy coding is performed on the quantized coefficient and the information obtained after the entropy coding is written into a bit stream. After a picture is divided into macroblocks, the macroblocks will be organized into slices.
  • the decoder After the decoder receives video packets from the encoder, the decoder will organize contents in the data packets according to a certain rule and obtain a reconstructed picture after dequantization, detranformation and motion compensation.
  • slice division methods include:
  • MPEG-2 Moving Picture Experts Group 2
  • a slice is defined as a series of any number of macroblocks.
  • a block group includes at least one macroblock and no overlap should exist between any two block groups. The first and the last macroblocks in a block group must be in one macroblock row.
  • Block groups appear in a bit stream from left to right and from the top down following the optical grating scanning sequence starting with the upper left corner of a picture.
  • MPEG-2 requires the first macroblock and the last macroblock appear in a same macroblock row, which restricts the flexibility of block groups in length and spatial distribution.
  • One block group can include macroblocks in at most one horizontal row so that the flexibility of a coding/decoding system is restricted.
  • a slice is defined as an integer number of macroblocks or macroblock pairs arranged in the optical grating scanning sequence inside a specific slice set. Although a slice contains an integer number of macroblocks or macroblock pairs arranged in the optical grating scanning sequence inside a slice group, the macroblocks or macroblock pairs are not necessarily arranged continuously in the optical grating scanning sequence within a picture.
  • a macroblock address is obtained through the address of the first macroblock and the mapping from the macroblock to a slice group.
  • a slice group is defined as a subset of macroblocks or macroblock pairs in the picture. To divide a picture into slice groups is to organize certain macroblocks in the picture into a slice group according to the mapping from macroblocks to slice groups, which means macroblocks are organized according to certain spatial mappings.
  • mapping from macroblocks to slice groups is stored in the picture header, an extra storage cost is imposed on the encoder and the decoder. Besides, the encoder must know the spatial mappings of macroblocks in advance, which is not good for real-time coding.
  • the embodiments of the invention provide a method and device for video coding and decoding, to improve the video transmission performance and simplify the implementation of the coding and decoding system.
  • a video coding method includes: (1) dividing a picture to be encoded into several slices, each containing macroblocks continuous in a scanning sequence in the picture; (2) dividing the slices contained in the picture into one or more slice sets, each containing one or more slices; and (3) encoding slices in the slice sets according to slice and slice set division information to get a coded bit stream of the picture.
  • a video encoder includes: (1) a slice dividing unit, configured to divide a picture to be encoded into several slices, each containing macroblocks continuous in a scanning sequence in the picture; (2) a slice set dividing unit, configured to divide slices in the picture into one or more slice sets according to attribute information of the slices obtained by the slice dividing unit, where each slice set contains one or more slices; and (3) an encoding unit, configured to encode slices in the slice sets divided by the slice set dividing unit to get a coded bit stream of the picture.
  • a video decoding method includes: (1) obtaining slice and slice set division information from a bit stream to be decoded, where a slice includes macroblocks continuous in a scanning sequence in a picture and a slice set includes one or more slices; and (2) decoding the bit stream according to the obtained slice and slice set division information.
  • a video decoder includes: (1) a slice information obtaining unit, configured to obtain slice and slice set division information from a bit stream to be decoded, where a slice includes macroblocks continuous in a scanning sequence in a picture and a slice set includes one or more slices; and (2) a picture reconstructing unit, configured to: decode the bit stream according to the slice and slice set division information obtained by the slice information obtaining unit and reconstruct the picture.
  • slices and slice sets are divided appropriately so that the encoder and decoder can perform coding and decoding flexibly according to actual needs and therefore the performance in video transmission is improved. Furthermore, the implementation of coding and decoding is simple and the complexity of the coding and decoding systems is reduced.
  • FIG. 1 is a schematic drawing of the division of slices and slice sets according to an embodiment of the invention
  • FIG. 2 shows a structure of an encoder according to an embodiment of the invention.
  • FIG. 3 shows a structure of a decoder according to an embodiment of the invention.
  • the video coding method includes: dividing a picture to be encoded into several slices, each containing continuous macroblocks in the picture in the scanning sequence, such as the optical grating scanning sequence or another predetermined scanning sequence; dividing slices in the picture into one or more slice sets according to slice attributes, where one slice set includes multiple slices; encoding the slices in a slice set according to slice and slice set division information, such as slice set serial number and slice position information, to get a coded bit stream of the picture.
  • the attributes may include but are not limited to one or more of the following items: network transport condition, spatial continuity of slices, picture characteristics, spatial resolution, content of the picture or region, and picture coding quality.
  • a slice set enable flag may be written in the coded bit stream if necessary.
  • the flag indicates whether slice sets are divided during the coding of the current picture. When the method in the embodiment of the invention is used, the flag is set to be valid; otherwise the flag is set to be invalid.
  • the coding of slices in a slice set may be at least one of the following coding operations:
  • the slice set identification information includes but is not limited to slice set serial number allocated for slice sets or the slice_set_index allocated for slices, or slice distance index indicating the number of slices that are not in the slice set between two slices adjacent in the scanning sequence in the slice set, or start macroblock and number of macroblocks of the slice set.
  • Encoding information that the reference data used in the inter-frame coding of a slice is the data in the same slice set of the current slice or all data available in the picture (the information may be called the first data usage indicator), so that the decoder can choose reference data for decoding of the slice according to the information.
  • Encoding information that the data used in the coding of macroblocks in a slice is the data in the current slice or data of other slices in the same slice set (the information may be called the second data usage indicator), so that the decoder can choose data for decoding of the macroblocks in the slice according to the information.
  • the first slice may be the first slice in the optical grating scanning sequence of the slice set.
  • an embodiment of the invention provides a video decoding method.
  • the method includes: the decoder receives a bit stream to be decoded and obtains division information of slices and slice sets in the bit stream; and the decoder decodes the bit stream according to the obtained division information of slices and slice sets.
  • the decoder must first obtain the slice set enable flag in the coded bit stream and the division information of slices and slice sets is not obtained until the decoder determines that the current picture is divided into slice sets according to the flag.
  • the decoder decodes the bit stream in at least one of the following methods:
  • the reference data used in the inter-frame coding of a slice is the data in the same slice set of the current slice or all data available in the picture (the first data usage indicator) and performing decoding according to the information.
  • the parameters of other slices in the slice set may be set according to the coding parameters of the first slice.
  • the first slice may be the first slice in the optical grating scanning sequence of the slice set.
  • region specific coding and decoding of pictures An important application in video coding and decoding is the region specific coding and decoding of pictures, where one picture is divided into several regions and data (like macroblocks) in a region is unnecessarily continuous in the optical grating scanning sequence but often has certain common features, such as spatial continuity or the same and similar image attributes.
  • the region specific coding of pictures may be applied to region based quality specific coding and resolution specific coding of pictures or used with the video watermark technique.
  • region based picture coding is achieved because of the flexible division and combination of picture slices.
  • region specific picture coding is realized through flexible combination of slices and the processing of coding and decoding scheme of the combined slices and the syntax elements used in coding and decoding.
  • appropriate parameters or flags for identification of slice coding and decoding features may be added to the slice, sequence header and/or picture header information and the flags or parameters are flexibly combined to implement the region specific coding.
  • the solution provided in the embodiment of the invention reduces the hardware complexity of the coding system and the decoding system and is well compatible with the existing coding and decoding systems.
  • A, B 0 , C, B 1 , D, B 2 and E are slices; Row 0 to Row 7 are eight macroblock rows in a picture, where 0-7 are vertical positions of the macroblock rows in the picture counted by macroblocks; Col 0 to Col 7 are eight macroblock columns in the picture, where 0-7 are horizontal positions of the macroblock columns in the picture counted by macroblocks.
  • a slice is a number of macroblocks continuous in the optical grating scanning sequence and includes unnecessarily an integer number of macroblock rows.
  • the slice does not require that the start and end macroblocks be in one macroblock row.
  • the optical grating scanning is a scanning method that converts two-dimensional data to one-dimensional data, where scanning may start from the upper left corner of a region and proceed from left to right and from the top down.
  • slice set is a set of slices possessing common attributes including but not limited to one or more of the following items: spatial continuity of slices in a picture, picture characteristics, spatial resolution, content of the picture or region, and coding quality of the picture.
  • the division of slice sets is decided by the encoder and the decoder can know the division mode of slice sets in the encoder according to the picture header information or slice set information.
  • a slice set is a loose combination of slices in a picture, where the slice coding and decoding modes are decided by the syntax element added to the video bit stream in the embodiments of the invention.
  • the syntax element is stored in the start position of the picture header or slice.
  • the syntax element may be a flag or parameter added in the embodiments of the invention.
  • the encoder and the decoder will be able to decide the mode to process slices in a slice set, thus realizing region specific coding through the processing of slices in the slice set.
  • slice specific coding may include one or more of the following means:
  • slice_set_index a parameter that identifies a slice group to a slice to specify the slice set where the slice belongs.
  • the parameter is called slice_set_index.
  • slices having the same slice_set_index belong to one slice set.
  • Adding a flag (the first data usage indicator) to a slice to specify whether data of other slices belonging to a same slice set can be used when data of the current slice is coded or decoded. For example, when the flag is set to 1, it indicates that the data of other slices belonging to the same slice set can be used when data of the current slice is coded or decoded; when the flag is set to 0, it indicates that the data of other slices belonging to the same slice set cannot be used when data of the current slice is coded or decoded.
  • a number of slices in a picture form a loose slice set by one or more of the above means. Accordingly, when slice data in the slice set is coded or decoded, any of the following means may be used:
  • data used when the current slice is coded or decoded is independent of other slice data in the same slice set and independent of slice data in other slice sets.
  • data used when the current slice is coded or decoded may come from other slice data in the same slice set but will not come from slice data in other slice sets.
  • the encoder may set slice set parameters (such as slice_set_index, slice association flag and slice set syntax element indicator) for each slice according to specific needs by means of the above three slice based coding methods. If region specific coding or flexible packing is necessary, an appropriate slice set parameter can be set so that certain consistency is kept in the coding and decoding of slices in one slice set. This means region specific coding can be realized through the use of slice sets. Accordingly, the decoder provided in the embodiments of the invention may analyze the syntax element related to the slice set through the slice set elements to know the attributes of the slice and the slice set. For example, the decoded picture may be displayed by regions, or a region of the picture can be decoded independently in the decoding process.
  • slice set parameters such as slice_set_index, slice association flag and slice set syntax element indicator
  • Slice set syntax set a set that includes a number of syntax elements related to slice coding, such as quantization step size.
  • the encoder may write one or more of the following parameters or flags in a coded bit stream during the coding process:
  • the flag occupies one bit and indicates whether slice sets are used in the picture; for example, the value 1 indicates slice sets are not used and 0 indicates slice sets are used.
  • slice_set_disable may also be added to the sequence header, slice header or macroblock header, and so on.
  • slice_vertical_position is an 8-bit unsigned integer configured to identify the vertical position of the first macroblock of the slice in the picture, and the position is measured by macroblocks. If the vertical_size of the picture is larger than 2800 bits, a 3-bit parameter slice_vertical_position_extension may be further added to the start code of the slice.
  • the parameters slice_vertical_position_extension and slice_vertical_position together specify the vertical position of the slice in the picture.
  • slice_set_index may be a 6-bit unsigned integer, allocated for each slice when the value of slice_set_disable (such as 0) specifies that the use of slice sets is enabled.
  • slice_horizontal_position may be an 8-bit unsigned integer indicating the macroblock measured horizontal position of the first macroblock of the slice in the picture.
  • slice_horizontal_position_extension may be a 3-bit unsigned integer, which may not appear in the bit stream if the horizontal size of the picture is smaller than or equal to 4096 bits; otherwise if the horizontal size of the picture is larger than 4096 bits, the parameter may be set to specify together with slice_horizontal_position the horizontal position of the slice in the picture.
  • slice_connect 1 bit, when the value of slice_set_disable (such as 0) indicates the use of slice sets, indicating whether there are other slices in the current slice set that are not decoded after the current slice is decoded.
  • slice_set_syntax syntax indicator of the slice set, where the syntax elements are associated with picture data coding, such as the quantization step size and the weighted prediction coefficient; it may be a 1-bit flag, indicating whether the current slice carries a slice set syntax set. For example, the value 1 indicates the presence of a slice set syntax set in the current slice and the value 0 indicates the presence of no slice set syntax set in the current slice.
  • the syntax elements in the slice set syntax set required by the coding and decoding of the current slice are the same as the first slice in the same slice set whose slice_set_syntax is 1 and precedes the current slice in the optical grating scanning sequence.
  • slice_set_reference the first data usage indicator, which may be a 1-bit flag, indicating the way to obtain reference data of the slice set; for example, the value 1 indicates only slice set data that has the same slice_set_index as the current slice set in the picture can be used as reference for coding and decoding of the data within the current slice set, and the value 0 indicates that all data available in the picture can be used as reference for coding and decoding of the data within the current slice set.
  • the second data usage indicator which may be a 1-bit flag, indicating data can be used for coding and decoding of macroblocks in the current slice; for example, the value 1 indicates that only data in the slice can be used for the coding and decoding of macroblocks in the current slice and the value 0 indicates slice data of the same slice_set_index in the picture can be used for coding and decoding of macroblocks in the current slice provided that the slices used for coding of the current slice precede the current slice in the scanning sequence (such as the optical grating scanning sequence) and follow the previous slice whose slice_independency is 1.
  • slice_independency of the first slice in the optical grating scanning sequence in the slice set is 1 and the slice_independency of the current slice is 1, if slice_set_syntax is present, the value of slice_set_syntax is 1.
  • first_of_slice_set_flag indicating whether the current slice is the first slice in the current slice set in the optical grating scanning sequence.
  • slice_set_index can be used alone in a slice without slice_horizontal_position, slice_set_syntax, slice_horizontal_position_extension, slice_connect, slice_set_reference, and slice_independency.
  • slice_set_index is used alone in a slice, the encoder and the decoder divide slice sets according to slice_set_index values.
  • data in the current picture and the current slice and data of slices with the same slice_set_index can be used for coding and decoding of data in the current slice but data of slices with a slice_set_index different from that of the current slice cannot be used.
  • parameters identifying the position of a slice in a picture include: slice_horizontal_position and slice_horizontal_position_extension that identify the horizontal position of the slice and slice_vertical_position and slice_vertical_position_extension that specify the vertical position of the slice.
  • the parameters may also exist in the picture header or sequence header, in which case they can be used with slice_set_index, with a mapping table in the picture header or sequence header to identify the position of each slice in the picture and its slice_set_index value.
  • slice_set_index may be used together with one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency, slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension, slice_vertical_position_extension and first_of_slice_set_flag to realize the slice set function.
  • slice_set_index must be present in a slice, or a sequence header or a picture header to realize the division of slice sets.
  • Data in the current picture and the current slice and data of slices with the same slice_set_index can be used for coding and decoding of data in the current data but data of slices with a slice_set_index different from that of the current slice cannot be used.
  • One or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency and first_of_slice_set_flag can be used jointly with slice_set_index.
  • slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension and slice_vertical_position_extension that identify the position of a slice may be used jointly with slice_set_index and one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency and first_of_slice_set_flag.
  • the coding process using the parameters or flags includes:
  • Step 1 The picture to be coded is shown in FIG. 1 .
  • A, B 0 , C, B 1 , D, B 2 and E are slices; Row 0 to Row 7 are eight macroblocks in the picture where 0-7 represent the vertical positions of the macroblock rows in the picture; Col 0 to Col 7 are eight macroblock columns in the picture, where 0-7 represent the horizontal positions of the macroblock columns in the picture; A, C, and D belong to one slice set, and B 0 , B 1 , and B 2 belong to another slice set while slice E belongs to a third slice set.
  • Step 2 Let the slice_set_index of A, C and D be 0, the slice_set_index of B 0 , B 1 and B 2 be 1 and the slice_set_index of E be 2.
  • Step 3 Set the slice_connect, slice_independency and slice_set_syntax of each slice.
  • slice_connect, slice_independency and slice_set_syntax of the slices may be set as in Table 1:
  • Slices B 0 , B 1 and B 2 belong to one slice set and their slice_connect, slice_independency and slice_set_syntax may be set as in Table 2:
  • Slice E belongs to a slice set alone and its slice_connect, slice_independency and slice_set_syntax may be set to 0, 1, and 1.
  • the slice set syntax elements of B 1 /B 2 and C/D are the same as the slice set syntax elements of slices whose first_of_slice_set_flag is 1.
  • the first_of_slice_set_flag of slices B 0 and A is 1.
  • Step 4 According to definitions of the parameters, the slice_vertical_position and slice_horizontal_position of slices A, B 0 , C, B 1 , D, B 2 , and E are as follows:
  • Slice_vertical_position values are: Slice A: 0, Slice B 0 : 3, Slice C: 3, Slice B 1 : 4, Slice D: 4, Slice B 2 : 5 and Slice E: 5; and
  • Slice A 0, Slice B 0 : 2, Slice C: 7, Slice B 1 : 2, Slice D: 5, Slice B 2 : 2 and Slice E: 6.
  • Step 5 Encoding the picture according to the parameters or flags which are also encoded into the bit stream sent to the decoder.
  • Step 6 The decoder performs decoding according to the parameters or flags in the bit stream received from the encoder to reconstruct the picture shown in FIG. 1 .
  • the decoding of any macroblock inside slice A, C or D can use data of the other two slices; the decoding of any macroblock inside slice B 0 , B 1 or B 2 can use data of the other two slices; the decoding of macroblocks in slice E cannot use data of other slices.
  • the decoding may be done by any of the following means:
  • Decoding Slice A Because A is the first slice in the picture, the decoding method may be that defined in a standard (like MPEG-2) in the conventional art.
  • Decoding Slice B 0 Because the slice_set_index of B 0 is different from that of A, the decoding of macroblocks in B 0 cannot use the data of A.
  • Decoding Slice C Because the slice_set_index of C is different from that of B 0 , the decoding of macroblocks in C cannot use the data of B 0 ; because the slice_set_index of C is the same as that of A and the slice_independency of C is 0, the decoding of macroblocks in C can use the data of A.
  • Decoding Slice B 1 Because the slice_set_index of B 1 is different from that of C, the decoding of macroblocks in B 1 cannot use the data of C; because the slice_set_index of B 1 is the same as that of B 0 and the slice_independency of B 1 is 0, the decoding of macroblocks in B 1 can use the data of B 0 .
  • Decoding Slice D Because the slice_set_index of D is different from that of B 1 , the decoding of macroblocks in C cannot use the data of B 1 ; because the slice_set_index of D is the same as that of C and the slice_independency of D is 0, the decoding of macroblocks in D can use the data of C.
  • Decoding Slice B 2 Because the slice_set_index of B 2 is different from that of D, the decoding of macroblocks in B 2 cannot use the data of D; because the slice_set_index of B 2 is the same as that of B 1 and the slice_independency of B 2 is 0, the decoding of macroblocks in B 2 can use the data of B 1 .
  • Decoding Slice E Because the slice_set_index of E is different from that of B 2 , the decoding of macroblocks in E cannot use the data of B 2 ; because the slice_set_index of E is different from that of D, the decoding of macroblocks in E can use the data of D.
  • a controlling unit may display one or more of other slice sets.
  • the contents of the slices can be spliced: the splicing is performed in the ascending sequence of slice_vertical_position in the slices; if the slice_vertical_position of slices is the same, the splicing is performed according to the slice_horizontal_position sequence. For example, because the slice_vertical_position of slice A is 0, the data of slice A is put in the reconstructed picture region ahead of the data of other slices; because the slice_vertical_position of slice B 0 is the same as that of slice C while the slice_horizontal_position of slice B 0 is smaller than that of slice C, the data of slice B 0 is put in the reconstructed picture region before the data of slice C. In this way, the seven slices A, B 0 , C, B 1 , D, B 2 and E will be spliced to reconstruct the picture shown in FIG. 1 after the coding and decoding processing.
  • slices A, C, and D, slices B 0 , B 1 and B 2 , and slice E belong to three different slice sets, and are decoded respectively in three threads, slices A, C, and D, slices B 0 , B 1 and B 2 , and slice E can be displayed separately.
  • the slice set syntax set of the slice is the same as the slice set syntax set of a slice preceding the current slice in the optical grating scanning sequence whose slice_set_syntax is 1.
  • the slice set syntax set of the slice is the same as the slice set syntax set of the slice in the current slice set whose first_of_slice_set_flag is 1.
  • a picture is divided into slices according to the value of slice_set_index.
  • the second embodiment of the invention provides a slice set dividing method based on slice_distance. The specific solution is as follows:
  • the slice_distance is an integer number indicating the number of slices that are not in the slice set between two slices adjacent in the optical grating scanning sequence in the slice set.
  • the current slice is the 10 th slice in the picture in the optical grating scanning sequence and the slice_distance of the current slice is 3.
  • the slice in the same slice set and preceding the current slice in the optical grating scanning sequence is the 6 th slice in the picture in the optical grating scanning sequence. This means, the 6 th and the 10 th slices in the optical grating scanning sequence are in the same slice set.
  • the decoder is able to know the division of slice sets in the picture according to the slice distance.
  • slice_distance may also be used with one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency, slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension, and slice_vertical_position_extension.
  • slice_connect may also be used with one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency, slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension, and slice_vertical_position_extension.
  • the slice_distance parameter may be used in a slice alone.
  • the coding and decoding of data of a slice in the slice set may use data of other slices in the same slice set, but data outside the current slice set cannot be used.
  • the division of slice sets is achieved by writing the position of the first macroblock of each slice set in a picture, the position of the first macroblock in each slice in a slice set, and the number of continuous macroblocks included in the slice set, in the sequence header or picture header.
  • slice set parameters may include one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency, slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension and slice_vertical_position_extension.
  • the specific use of the parameters is the same as that described in the first embodiment.
  • the division of slice sets may be achieved by writing the position coordinates of the slice set in the sequence header of the picture, the picture header or the first slice in the slice set and instructing that no slice start code is present in other slices in the slice set.
  • the coordinates of the upper left corner of the region covered by the slice set and the length and width of the region may be written in the sequence header, picture header or the first slice of the slice set.
  • a 1-bit flag is written in the sequence header, picture header or slice set, indicating whether a slice start code is present in slices other than the first slice in the optical grating scanning sequence in the slice set.
  • the flag may be set to 1, indicating that all slices in the slice set carry a slice start code, and to 0, indicating that the first slice in the optical grating scanning sequence in the slice set carries a slice start code but other slices do not carry the slice start code.
  • the start position of slices other than the first slice in the optical grating scanning sequence in the slice set may be determined according to the position of the upper left corner of the slice set, the length and width of the slice set and the height of the slice (may be the height of one or more macroblocks) in the sequence header, picture header or slice set. For example, when it is determined that the currently coded macroblock is the last macroblock in the slice in the optical grating scanning sequence, the macroblock to be coded next to the current macroblock (in the optical grating scanning sequence) is the first macroblock to be coded in the next slice in the slice set according to the optical grating scanning sequence.
  • the decoding of the slice set is performed by determining the start or end of slices other than the first slice in the optical grating scanning sequence according to the position of the upper left corner of the slice set, the length and width of the slice set and the height of the slice in the sequence header, picture header or slice set.
  • slice_horizontal_position If no slice start code is present in a slice in the slice set, the slice will not carry such syntax elements as slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension, and slice_vertical_position_extension.
  • syntax elements as slice_connect, slice_set_syntax, slice_set_reference, and slice_independency may be present. The use of the syntax elements is the same as that described in the first embodiment.
  • the slice set technique is applied to resolution based coding.
  • a new syntax element slice_resolution_ratio is added on the basis of the first embodiment of the invention, indicating the ratio of the resolution used for the coding and decoding of intra-slice data in a slice set to the original resolution of the picture where the slice set belongs.
  • the decoder may obtain from the bit stream the ratio of the resolution used for the coding of the intra-slice data in the slice set to the original resolution of the slice and perform decoding according to the ratio.
  • the resolution of the picture represented by FIG. 1 is 128 ⁇ 128, which means there are 128 pixels each in the horizontal direction and the vertical direction of the picture.
  • the slice_resolution_ratio of the slice set where B 0 , B 1 and B 2 belong is 0.5, it indicates that slice data in the slice set is encoded at the resolution of (128 ⁇ 0.5) ⁇ (128 ⁇ 0.5).
  • the resolution of the data is first reduced to half of the original resolution via downsampling before the method in the first embodiment of the invention is used to perform the coding. Accordingly, the decoder decodes the data of B 0 , B 1 and B 2 in the method described in the first embodiment of the invention and then increases the resolution of the obtained data to double via upsampling.
  • the slice set technique is applied to quality based coding.
  • a new syntax element slice_bit_depth is added on the basis of the first embodiment of the invention, indicating the number of bits used for each pixel when intra-slice data in a slice set is coded or decoded. This means the number of bits required for the coding of each piece of slice data in the coded slice set is also coded in the coding process. For example, if the slice_bit_depth of the slice set where B 0 , B 1 and B 2 belong in the first embodiment of the invention is 10, it indicates 10 bits are used to store a pixel value when B 0 , B 1 and B 2 are coded or decoded.
  • the decoder may obtain the number of storage bits required for each piece of decoded data when slice data in the slice set is decoded and perform decoding according to the number of bits.
  • ROM/RAM read-only memory/random access memory
  • magnetic disk or a compact disk.
  • a slice set may also include quantization parameters related to a designated quantization method.
  • the specific process includes: the encoder decides whether slices in the slice set are encoded with a designated quantization method and writes in the slice one or more of the following quantization parameters: a parameter indicating that a designated quantization matrix is used for slices in the current slice set, an index of the designated quantization matrix used for slices in the current slice set, an enable flag indicating adaptive weighted quantization of macroblocks is used for slices in the slice set, a weighted quantization parameter used for the slice set, and the quantization matrix used for slices in the current slice set.
  • the decoder obtains quantization parameter information used for slices in the current slice set and gets one or more of the following parameters according to the quantization parameter information and performs decoding accordingly.
  • the obtaining of quantization parameter information includes one or more of the following: obtaining the parameter indicating the use of a designated quantization matrix for slices in the current slice set; or obtaining the index indicating use of the quantization matrix for slices in the current slice set; or obtaining the enable flag indicating the use of adaptive weighted quantization of macroblocks for slices in the current slice set; or obtaining the weighted quantization parameter used for slices in the current slice set; or obtaining the quantization matrix used for slices in the current slice set.
  • a weighted quantization matrix is used for coding and a 1-bit weighted matrix enable flag is written in a slice, indicating whether a weighted quantization matrix is used for coding. If the value of the enable flag is 1, 2-bit information is written in the slice, indicating the weighted matrix index used for coding of the slice in the current slice set. If the index value is 0, it indicates that the slice in the slice set uses the same weighted quantization matrix as the picture header or a weighted matrix is preset at the encoder; the value 1, 2, or 3 indicates the use of the weighted quantization matrix or weighted quantization matrix model whose index is 1, 2 or 3.
  • a weighted quantization parameter or weighted quantization matrix may be written in the slice.
  • the decoder resolves slice parameters in the slice set to obtain the weighted matrix enable flag which indicates whether a weighted matrix is used for coding of the slice in the slice set. If the value of the enable flag is 1, the decoder reads 2-bit more information from the slice to get the index of the weighted matrix used for coding of the slice in the current slice set. If the index value is 0, it indicates that the slice in the slice set uses the same weighted quantization matrix as the picture header or a weighted matrix is preset at the decoder; the value 1, 2, or 3 indicates the use of the weighted quantization matrix or weighted quantization matrix model whose index is 1, 2 or 3.
  • the decoder may read the weighted quantization parameter or weighted quantization matrix from the slice.
  • the method in the embodiment of the invention guarantees the use of a designated quantization matrix for a specified region in a picture to adjust the subjective quality of the region and guarantees the coding efficiency of the entire picture as well.
  • the embodiment of the invention also provides a video encoder and video decoder.
  • the structures of the encoder and decoders will be described in detail.
  • the scanning sequence used for sorting the macroblocks or slices in the embodiments of the invention may also be any other scanning sequence determined according a certain rule.
  • FIG. 2 A structure of the encoder provided in the embodiment of the invention is shown in FIG. 2 , including the following units:
  • a slice dividing unit 21 configured to divide a picture to be encoded into several slices, each containing macroblocks continuous in the optical grating scanning sequence in the picture;
  • a slice set dividing unit 22 configured to divide slices in the picture into one or more slice sets according to attribute information of the slices divided by the slice dividing unit 21 , each containing one or more slices;
  • an encoding unit 23 configured to encode slices in the slice sets divided by the slice set dividing unit 22 to get a coded bit stream of the picture.
  • the encoder may further include an attribute information obtaining unit 24 , configured to: obtain one or more of the items, spatial continuity of slices, picture feature, spatial resolution, content of the picture or region, and picture coding quality, as the attribute information and send the information to the slice set dividing unit 22 .
  • an attribute information obtaining unit 24 configured to: obtain one or more of the items, spatial continuity of slices, picture feature, spatial resolution, content of the picture or region, and picture coding quality, as the attribute information and send the information to the slice set dividing unit 22 .
  • the encoding unit 23 may include, without limitation, one or more of the following units: (1) an enable flag encoding unit 231 , configured to write a slice set enable flag in the coded bit stream obtained by the encoding unit 23 , where the flag indicates whether slice sets are divided in the coding process of the current picture so that the decoder knows whether the division of slice sets is performed; (2) a position parameter encoding unit 232 , configured to encode at least one of the horizontal position parameter and vertical position parameter of slices in the slice set so that the position information of the slices in the picture is sent to the decoder; (3) an identification information encoding unit 233 , configured to encode slice set identification information of the slice set so that the slice set division information of the encoder is sent to the decoder and that the decoder knows the mappings of slice sets and slices, for example, which slice set the current slice belongs to or which slices a slice set includes; (4) a slice set syntax indicator encoding unit 234 , configured to encode the information whether slice set syntax elements
  • the encoding unit 23 may further include at least one of the following processing units: (1) a resolution information encoding unit 239 , configured to encode the ratio of the resolution used for coding of intra-slice data in the slice set to the original resolution of the slice; and (2) a quality information encoding unit 240 , configured to encode the number of bits used by each piece of encoded data when the slice data in the slice set is encoded.
  • a resolution information encoding unit 239 configured to encode the ratio of the resolution used for coding of intra-slice data in the slice set to the original resolution of the slice
  • a quality information encoding unit 240 configured to encode the number of bits used by each piece of encoded data when the slice data in the slice set is encoded.
  • the embodiment of the invention also provides a decoder, of which a specific structure is shown in FIG. 3 .
  • the decoder includes the following processing units:
  • a bit stream receiving unit 31 configured to receive a bit stream to be decoded.
  • a slice information obtaining unit 32 configured to obtain slice and slice set division information from the bit stream to be decoded received by the bit stream receiving unit 31 , where a slice includes macroblocks continuous in the optical grating scanning sequence in the picture and a slice set includes one or more slices.
  • the slice information obtaining unit 32 may include, without limitation, one or more of the following units: (1) a position parameter obtaining unit 321 , configured to obtain at least one of the horizontal position parameter and vertical position parameter of all slices in the slice sets in the bit stream; (2) an identification information obtaining unit 322 , configured to obtain slice set identification information of the slice sets in the bit stream; (3) a slice set syntax indicator obtaining unit 323 , configured to obtain the information whether slice set syntax elements are present in the current slice; (4) a slice connect flag obtaining unit 324 , configured to obtain the information whether the current slice in the bit stream is the last slice in the current slice set; (5) a first data usage indicator obtaining unit 325 , configured to obtain the information that the reference data used in the coding of the slices in the bit stream is the data in the current slice set or all data in the picture; (6) a first data usage indicator obtaining unit 326 , configured to obtain the information that the data used in the coding of macroblocks in the bit stream is the data in
  • a picture reconstructing unit 33 configured to: decode the bit stream according to the slice and slice set division information obtained by the slice information obtaining unit 32 and reconstruct the picture.
  • the decoder may further include the following units: (1) an enable flag obtaining unit 328 , configured to obtain the slice set enable flag in the coded bit stream, where the flag indicates whether slice sets are divided in the coding process of the current picture; and (2) an information obtainment notifying unit 329 , configured to notify the slice information obtaining unit 32 after determining that the current picture is divided into slice sets.
  • the slice information obtaining unit 32 may further include at least one of the following processing units: (1) a resolution information obtaining unit 330 , configured to obtain the ratio of the resolution used for coding of intra-slice data in the slice set to the original resolution of the slice; and (2) a quality information obtaining unit 331 , configured to obtain from the bit stream the number of storage units required for each piece of decoded data when the slice data in the slice set is encoded as the quality information.
  • a resolution information obtaining unit 330 configured to obtain the ratio of the resolution used for coding of intra-slice data in the slice set to the original resolution of the slice
  • a quality information obtaining unit 331 configured to obtain from the bit stream the number of storage units required for each piece of decoded data when the slice data in the slice set is encoded as the quality information.
  • the embodiments of the invention through flexible combination of slices and the processing of the coding and decoding of the slice set made up of multiple slices and the syntax elements used in the coding and decoding, region specific, resolution specific and quality specific picture coding and decoding are realized to increase the flexibility of video coding and decoding. Moreover, the video coding and decoding solution provided by the embodiments of the invention is easy to implement. Relative to the conventional art, the solution will effectively reduce the hardware complexity of the coding system and the decoding system. In addition, in the embodiments of the invention, an appropriate slice set enable flag is added so that the solution provided by the embodiments of the invention is well compatible with the existing coding and decoding systems.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computing Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method and device for coding and decoding are disclosed. The method includes: dividing a picture to be encoded into several slices, each containing macroblocks continuous in a designated scanning sequence in the picture; dividing slices in the picture into one or more slice sets according to attribute information of the slices, each slice set containing one or more slices; and encoding the slices in the slice sets according to slice and slice set division information to get a coded bit stream of the picture. The decoding method includes: obtaining slice and slice set division information from a bit stream to be decoded and decoding the bit stream according to the obtained slice and slice set division information. The invention improves the performance of video transmission effectively and realizes region based coding. The implementation of coding and decoding is simple and the complexity of coding and decoding systems is reduced.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation of U.S. patent application Ser. No. 14/451,083, filed on Aug. 4, 2014, which is a continuation of U.S. patent application Ser. No. 12/724,076, filed on Mar. 15, 2010, which is a continuation of International Patent Application No. PCT/CN2008/072362, filed on Sep. 12, 2008, which claims priority to Chinese Patent Application No. 200710153924.5, filed on Sep. 14, 2007, and Chinese Patent Application No. 200810096456.7, filed on May 9, 2008. The afore-mentioned patent applications are hereby incorporated by reference in their entireties.
FIELD OF THE INVENTION
The invention relates to coding and decoding technologies, and in particular, to a method and device for video coding and decoding.
BACKGROUND
In a video coding/decoding system, a picture may be divided into several image blocks or macroblocks, and motion estimation is carried out based on blocks or macroblocks. The residual after motion estimation is transformed and quantized. Finally, entropy coding is performed on the quantized coefficient and the information obtained after the entropy coding is written into a bit stream. After a picture is divided into macroblocks, the macroblocks will be organized into slices.
In a video coding/decoding system, after the decoder receives video packets from the encoder, the decoder will organize contents in the data packets according to a certain rule and obtain a reconstructed picture after dequantization, detranformation and motion compensation.
In the conventional art, in a video coding/decoding system, slice division methods include:
1. Method Defined by Moving Picture Experts Group 2 (MPEG-2)
According to MPEG-2, a slice (block group) is defined as a series of any number of macroblocks. A block group includes at least one macroblock and no overlap should exist between any two block groups. The first and the last macroblocks in a block group must be in one macroblock row. Block groups appear in a bit stream from left to right and from the top down following the optical grating scanning sequence starting with the upper left corner of a picture.
During the implementation of the invention, the inventor discovers at least the following weaknesses in the conventional art:
(1) MPEG-2 requires the first macroblock and the last macroblock appear in a same macroblock row, which restricts the flexibility of block groups in length and spatial distribution. One block group can include macroblocks in at most one horizontal row so that the flexibility of a coding/decoding system is restricted.
(2) In certain scenarios, coding and decoding of a macroblock cannot use information of adjacent macroblocks and thus the efficiency of coding and decoding is low.
(3) Because block groups must appear in a bit stream in the sequence of being met, a picture cannot be sliced or encoded based on multiple macroblock rows or based on regions or objects.
2. Method Defined by H.264
In the H.264 standard, a slice is defined as an integer number of macroblocks or macroblock pairs arranged in the optical grating scanning sequence inside a specific slice set. Although a slice contains an integer number of macroblocks or macroblock pairs arranged in the optical grating scanning sequence inside a slice group, the macroblocks or macroblock pairs are not necessarily arranged continuously in the optical grating scanning sequence within a picture. A macroblock address is obtained through the address of the first macroblock and the mapping from the macroblock to a slice group. A slice group is defined as a subset of macroblocks or macroblock pairs in the picture. To divide a picture into slice groups is to organize certain macroblocks in the picture into a slice group according to the mapping from macroblocks to slice groups, which means macroblocks are organized according to certain spatial mappings.
During the implementation of the invention, the inventor finds at least the following weaknesses in the conventional art:
(1) Because the mapping from macroblocks to slice groups is stored in the picture header, an extra storage cost is imposed on the encoder and the decoder. Besides, the encoder must know the spatial mappings of macroblocks in advance, which is not good for real-time coding.
(2) Because of the mapping of macroblocks, it is unnecessary for macroblocks in one slice to be continuous in the optical grating scanning sequence so that the encoder and the decoder have to provide an extra storage space to store macroblocks that are not encoded and decoded in a slice.
(3) Because macroblocks in one slice or one slice group are not continuous in the optical grating scanning sequence, the encoder and the decoder must continuously search for and locate the physical addresses of adjacent macroblocks, which causes an additional computing cost.
SUMMARY
The embodiments of the invention provide a method and device for video coding and decoding, to improve the video transmission performance and simplify the implementation of the coding and decoding system.
A video coding method includes: (1) dividing a picture to be encoded into several slices, each containing macroblocks continuous in a scanning sequence in the picture; (2) dividing the slices contained in the picture into one or more slice sets, each containing one or more slices; and (3) encoding slices in the slice sets according to slice and slice set division information to get a coded bit stream of the picture.
A video encoder includes: (1) a slice dividing unit, configured to divide a picture to be encoded into several slices, each containing macroblocks continuous in a scanning sequence in the picture; (2) a slice set dividing unit, configured to divide slices in the picture into one or more slice sets according to attribute information of the slices obtained by the slice dividing unit, where each slice set contains one or more slices; and (3) an encoding unit, configured to encode slices in the slice sets divided by the slice set dividing unit to get a coded bit stream of the picture.
A video decoding method includes: (1) obtaining slice and slice set division information from a bit stream to be decoded, where a slice includes macroblocks continuous in a scanning sequence in a picture and a slice set includes one or more slices; and (2) decoding the bit stream according to the obtained slice and slice set division information.
A video decoder includes: (1) a slice information obtaining unit, configured to obtain slice and slice set division information from a bit stream to be decoded, where a slice includes macroblocks continuous in a scanning sequence in a picture and a slice set includes one or more slices; and (2) a picture reconstructing unit, configured to: decode the bit stream according to the slice and slice set division information obtained by the slice information obtaining unit and reconstruct the picture.
With the solution provided by the embodiments of the invention, slices and slice sets are divided appropriately so that the encoder and decoder can perform coding and decoding flexibly according to actual needs and therefore the performance in video transmission is improved. Furthermore, the implementation of coding and decoding is simple and the complexity of the coding and decoding systems is reduced.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a schematic drawing of the division of slices and slice sets according to an embodiment of the invention;
FIG. 2 shows a structure of an encoder according to an embodiment of the invention; and
FIG. 3 shows a structure of a decoder according to an embodiment of the invention.
DETAILED DESCRIPTION OF EMBODIMENTS
In order to make the technical solution, objectives and merits of the invention clearer, a detailed description of the invention is given below by reference to accompanying drawings and preferred embodiments.
The video coding method according to an embodiment of the invention includes: dividing a picture to be encoded into several slices, each containing continuous macroblocks in the picture in the scanning sequence, such as the optical grating scanning sequence or another predetermined scanning sequence; dividing slices in the picture into one or more slice sets according to slice attributes, where one slice set includes multiple slices; encoding the slices in a slice set according to slice and slice set division information, such as slice set serial number and slice position information, to get a coded bit stream of the picture. The attributes may include but are not limited to one or more of the following items: network transport condition, spatial continuity of slices, picture characteristics, spatial resolution, content of the picture or region, and picture coding quality.
In the embodiment of the invention, for ease of compatibility with other technologies, a slice set enable flag may be written in the coded bit stream if necessary. The flag indicates whether slice sets are divided during the coding of the current picture. When the method in the embodiment of the invention is used, the flag is set to be valid; otherwise the flag is set to be invalid.
Further, in the embodiment of the invention, the coding of slices in a slice set may be at least one of the following coding operations:
(1) Encoding at least one item of the horizontal position parameter and vertical position parameter of each slice in the slice set so that the decoder knows the position of the slice in the reconstructed picture and thus is able to decode the slices.
(2) Encoding identification information of the slice set so that the decoder can identify the slices in the set or identify the slice set of a slice. The slice set identification information includes but is not limited to slice set serial number allocated for slice sets or the slice_set_index allocated for slices, or slice distance index indicating the number of slices that are not in the slice set between two slices adjacent in the scanning sequence in the slice set, or start macroblock and number of macroblocks of the slice set.
(3) Encoding information whether a current slice is the last slice in a slice set so that the decoder obtains all slices in the slice set reliably.
(4) Encoding information whether the current slice includes slice set syntax elements so that the decoder is able to determine whether the current slice contains syntax elements according to the information and further determine the syntax elements that can currently be used.
(5) Encoding information that the reference data used in the inter-frame coding of a slice is the data in the same slice set of the current slice or all data available in the picture (the information may be called the first data usage indicator), so that the decoder can choose reference data for decoding of the slice according to the information.
(6) Encoding information that the data used in the coding of macroblocks in a slice is the data in the current slice or data of other slices in the same slice set (the information may be called the second data usage indicator), so that the decoder can choose data for decoding of the macroblocks in the slice according to the information.
(7) Encoding data that is used in the coding of macroblocks in the current slice or data of other slices in the current slice set.
(8) Encoding information whether the current slice is the first slice in the slice set, so that the coding parameters of other slices in the slice set can be set according to the coding parameters of the first slice. The first slice may be the first slice in the optical grating scanning sequence of the slice set.
(9) Encoding information that slices in the current slice set use a designated quantization matrix and encode relevant parameters of the quantization matrix.
In accordance with the above video coding method, an embodiment of the invention provides a video decoding method. The method includes: the decoder receives a bit stream to be decoded and obtains division information of slices and slice sets in the bit stream; and the decoder decodes the bit stream according to the obtained division information of slices and slice sets.
If a slice set enable flag is written in the bit stream coded by the encoder, the decoder must first obtain the slice set enable flag in the coded bit stream and the division information of slices and slice sets is not obtained until the decoder determines that the current picture is divided into slice sets according to the flag.
The decoder decodes the bit stream in at least one of the following methods:
(1) Obtaining at least one item of the horizontal position parameter and vertical position parameter of each slice in the slice set from the bit stream and perform decoding according to the parameter.
(2) Obtaining slice set identification information in the bit stream and performing decoding according to the slice set identification information.
(3) Obtaining the information whether a current slice is the last slice in a current slice set in the bit stream and performing decoding according to the information.
(4) Obtaining the information whether the current slice contains syntax elements in the bit stream and perform decoding according to the information.
(5) Obtaining the information that the reference data used in the inter-frame coding of a slice is the data in the same slice set of the current slice or all data available in the picture (the first data usage indicator) and performing decoding according to the information.
(6) Obtaining the information that the data used in the coding of macroblocks in a slice is the data in the current slice or data of other slices in the same slice set (the second data usage indicator) and performing decoding according to the information.
(7) Obtaining the information that the data used in the coding of macroblocks in the slice is the data of the current slice or data of other slices in the same slice set and performing decoding according to the information.
(8) Obtaining the information whether the current slice is the first slice in the slice set and performing decoding according to the information. The parameters of other slices in the slice set may be set according to the coding parameters of the first slice. The first slice may be the first slice in the optical grating scanning sequence of the slice set.
(9) Obtaining quantization parameters used by the slices in the current slice set and performing decoding accordingly.
An important application in video coding and decoding is the region specific coding and decoding of pictures, where one picture is divided into several regions and data (like macroblocks) in a region is unnecessarily continuous in the optical grating scanning sequence but often has certain common features, such as spatial continuity or the same and similar image attributes. The region specific coding of pictures may be applied to region based quality specific coding and resolution specific coding of pictures or used with the video watermark technique. In the embodiment of the invention, region based picture coding is achieved because of the flexible division and combination of picture slices.
In the embodiment of the invention, region specific picture coding is realized through flexible combination of slices and the processing of coding and decoding scheme of the combined slices and the syntax elements used in coding and decoding. In particular, appropriate parameters or flags for identification of slice coding and decoding features may be added to the slice, sequence header and/or picture header information and the flags or parameters are flexibly combined to implement the region specific coding. Compared with the conventional art, the solution provided in the embodiment of the invention reduces the hardware complexity of the coding system and the decoding system and is well compatible with the existing coding and decoding systems.
To help understand the invention, the embodiments of the invention will be described in detail with reference to the accompanying drawings.
As shown in FIG. 1, A, B0, C, B1, D, B2 and E are slices; Row 0 to Row 7 are eight macroblock rows in a picture, where 0-7 are vertical positions of the macroblock rows in the picture counted by macroblocks; Col 0 to Col 7 are eight macroblock columns in the picture, where 0-7 are horizontal positions of the macroblock columns in the picture counted by macroblocks.
A slice is a number of macroblocks continuous in the optical grating scanning sequence and includes unnecessarily an integer number of macroblock rows. The slice does not require that the start and end macroblocks be in one macroblock row. The optical grating scanning is a scanning method that converts two-dimensional data to one-dimensional data, where scanning may start from the upper left corner of a region and proceed from left to right and from the top down.
The embodiments of the invention adopt the concept of slice set, which is a set of slices possessing common attributes including but not limited to one or more of the following items: spatial continuity of slices in a picture, picture characteristics, spatial resolution, content of the picture or region, and coding quality of the picture. The division of slice sets is decided by the encoder and the decoder can know the division mode of slice sets in the encoder according to the picture header information or slice set information.
A slice set is a loose combination of slices in a picture, where the slice coding and decoding modes are decided by the syntax element added to the video bit stream in the embodiments of the invention. Normally, the syntax element is stored in the start position of the picture header or slice. The syntax element may be a flag or parameter added in the embodiments of the invention.
With the addition of a syntax element to a slice to identify the slice feature, the encoder and the decoder will be able to decide the mode to process slices in a slice set, thus realizing region specific coding through the processing of slices in the slice set.
In implementation of the embodiments of the invention, slice specific coding may include one or more of the following means:
(1) Adding a parameter that identifies a slice group to a slice to specify the slice set where the slice belongs. The parameter is called slice_set_index. In one picture, slices having the same slice_set_index belong to one slice set.
(2) Adding a flag (the first data usage indicator) to a slice, to specify whether data of other slices belonging to a same slice set can be used when data of the current slice is coded or decoded. For example, when the flag is set to 1, it indicates that the data of other slices belonging to the same slice set can be used when data of the current slice is coded or decoded; when the flag is set to 0, it indicates that the data of other slices belonging to the same slice set cannot be used when data of the current slice is coded or decoded. Optionally, there may be other ways to set the flag.
(3) Adding a flag (syntax element flag) to a slice, to specify whether the particular syntax element (like quantization step size) used in the current slice is the same as the syntax element in other slices in the same slice set.
A number of slices in a picture form a loose slice set by one or more of the above means. Accordingly, when slice data in the slice set is coded or decoded, any of the following means may be used:
(1) Independent coding and decoding
In this mode, data used when the current slice is coded or decoded is independent of other slice data in the same slice set and independent of slice data in other slice sets.
(2) Dependent coding and decoding
In this mode, data used when the current slice is coded or decoded may come from other slice data in the same slice set but will not come from slice data in other slice sets.
In the embodiments of the invention, the encoder may set slice set parameters (such as slice_set_index, slice association flag and slice set syntax element indicator) for each slice according to specific needs by means of the above three slice based coding methods. If region specific coding or flexible packing is necessary, an appropriate slice set parameter can be set so that certain consistency is kept in the coding and decoding of slices in one slice set. This means region specific coding can be realized through the use of slice sets. Accordingly, the decoder provided in the embodiments of the invention may analyze the syntax element related to the slice set through the slice set elements to know the attributes of the slice and the slice set. For example, the decoded picture may be displayed by regions, or a region of the picture can be decoded independently in the decoding process.
Exemplary embodiments will be described to further explain the invention.
For ease of description, the concepts concerned in the embodiments are explained as follows:
(1) Slice set syntax set: a set that includes a number of syntax elements related to slice coding, such as quantization step size.
(2) Flexible slice set: certain slices in a picture are grouped in a certain way to form a specific set and the coding and decoding of a slice in the slice set is controlled by specific syntax elements in the slice set and the slice.
Based on the above definitions, several exemplary embodiments are described to further clarify the invention.
Embodiment 1
In the first embodiment of the invention, to realize the coding of a slice set, the encoder may write one or more of the following parameters or flags in a coded bit stream during the coding process:
(1) Adding slice_set_disable to the picture header parameter set: the flag occupies one bit and indicates whether slice sets are used in the picture; for example, the value 1 indicates slice sets are not used and 0 indicates slice sets are used.
In addition to the picture header information, slice_set_disable may also be added to the sequence header, slice header or macroblock header, and so on.
(2) Naming the last 8 bits of the start code of a slice “slice_vertical_position”, which is an 8-bit unsigned integer configured to identify the vertical position of the first macroblock of the slice in the picture, and the position is measured by macroblocks. If the vertical_size of the picture is larger than 2800 bits, a 3-bit parameter slice_vertical_position_extension may be further added to the start code of the slice. The parameters slice_vertical_position_extension and slice_vertical_position together specify the vertical position of the slice in the picture.
(3) Adding one or more of the following parameters or flags after the slice start code:
slice_set_index: may be a 6-bit unsigned integer, allocated for each slice when the value of slice_set_disable (such as 0) specifies that the use of slice sets is enabled.
slice_horizontal_position: may be an 8-bit unsigned integer indicating the macroblock measured horizontal position of the first macroblock of the slice in the picture.
slice_horizontal_position_extension: may be a 3-bit unsigned integer, which may not appear in the bit stream if the horizontal size of the picture is smaller than or equal to 4096 bits; otherwise if the horizontal size of the picture is larger than 4096 bits, the parameter may be set to specify together with slice_horizontal_position the horizontal position of the slice in the picture.
slice_connect: 1 bit, when the value of slice_set_disable (such as 0) indicates the use of slice sets, indicating whether there are other slices in the current slice set that are not decoded after the current slice is decoded.
slice_set_syntax: syntax indicator of the slice set, where the syntax elements are associated with picture data coding, such as the quantization step size and the weighted prediction coefficient; it may be a 1-bit flag, indicating whether the current slice carries a slice set syntax set. For example, the value 1 indicates the presence of a slice set syntax set in the current slice and the value 0 indicates the presence of no slice set syntax set in the current slice. Besides, the syntax elements in the slice set syntax set required by the coding and decoding of the current slice are the same as the first slice in the same slice set whose slice_set_syntax is 1 and precedes the current slice in the optical grating scanning sequence.
slice_set_reference: the first data usage indicator, which may be a 1-bit flag, indicating the way to obtain reference data of the slice set; for example, the value 1 indicates only slice set data that has the same slice_set_index as the current slice set in the picture can be used as reference for coding and decoding of the data within the current slice set, and the value 0 indicates that all data available in the picture can be used as reference for coding and decoding of the data within the current slice set.
slice_independency: the second data usage indicator, which may be a 1-bit flag, indicating data can be used for coding and decoding of macroblocks in the current slice; for example, the value 1 indicates that only data in the slice can be used for the coding and decoding of macroblocks in the current slice and the value 0 indicates slice data of the same slice_set_index in the picture can be used for coding and decoding of macroblocks in the current slice provided that the slices used for coding of the current slice precede the current slice in the scanning sequence (such as the optical grating scanning sequence) and follow the previous slice whose slice_independency is 1. In the case that the slice_independency of the first slice in the optical grating scanning sequence in the slice set is 1 and the slice_independency of the current slice is 1, if slice_set_syntax is present, the value of slice_set_syntax is 1.
first_of_slice_set_flag: indicating whether the current slice is the first slice in the current slice set in the optical grating scanning sequence.
Of the above parameters or flags, slice_set_index can be used alone in a slice without slice_horizontal_position, slice_set_syntax, slice_horizontal_position_extension, slice_connect, slice_set_reference, and slice_independency. When slice_set_index is used alone in a slice, the encoder and the decoder divide slice sets according to slice_set_index values. In this case, data in the current picture and the current slice and data of slices with the same slice_set_index can be used for coding and decoding of data in the current slice but data of slices with a slice_set_index different from that of the current slice cannot be used.
Of the above parameters or flags, parameters identifying the position of a slice in a picture include: slice_horizontal_position and slice_horizontal_position_extension that identify the horizontal position of the slice and slice_vertical_position and slice_vertical_position_extension that specify the vertical position of the slice. The parameters may also exist in the picture header or sequence header, in which case they can be used with slice_set_index, with a mapping table in the picture header or sequence header to identify the position of each slice in the picture and its slice_set_index value.
In the embodiments of the invention, slice_set_index may be used together with one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency, slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension, slice_vertical_position_extension and first_of_slice_set_flag to realize the slice set function. slice_set_index must be present in a slice, or a sequence header or a picture header to realize the division of slice sets. Data in the current picture and the current slice and data of slices with the same slice_set_index can be used for coding and decoding of data in the current data but data of slices with a slice_set_index different from that of the current slice cannot be used. One or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency and first_of_slice_set_flag can be used jointly with slice_set_index. The four parameters slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension and slice_vertical_position_extension that identify the position of a slice may be used jointly with slice_set_index and one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency and first_of_slice_set_flag.
Based on the above parameters or flags, when the video encoder decides that region specific coding is necessary, the coding process using the parameters or flags includes:
Step 1: The picture to be coded is shown in FIG. 1. Set slice_set_disable to be valid, for example to 0, to indicate that flexible slice set technology is used in the current picture.
In the picture shown in FIG. 1, A, B0, C, B1, D, B2 and E are slices; Row 0 to Row 7 are eight macroblocks in the picture where 0-7 represent the vertical positions of the macroblock rows in the picture; Col 0 to Col 7 are eight macroblock columns in the picture, where 0-7 represent the horizontal positions of the macroblock columns in the picture; A, C, and D belong to one slice set, and B0, B1, and B2 belong to another slice set while slice E belongs to a third slice set.
Step 2: Let the slice_set_index of A, C and D be 0, the slice_set_index of B0, B1 and B2 be 1 and the slice_set_index of E be 2.
Step 3: Set the slice_connect, slice_independency and slice_set_syntax of each slice.
Because slices A, C and D belong to one slice set, the slice_connect, slice_independency and slice_set_syntax of the slices may be set as in Table 1:
TABLE 1
slice_connect slice_independency slice_set_syntax
A 1 1 1
C 1 0 0
D 0 0 0
Slices B0, B1 and B2 belong to one slice set and their slice_connect, slice_independency and slice_set_syntax may be set as in Table 2:
TABLE 2
slice_connect slice_independency slice_set_syntax
B0 1 1 1
B1 1 0 0
B2 0 0 0
Slice E belongs to a slice set alone and its slice_connect, slice_independency and slice_set_syntax may be set to 0, 1, and 1.
In the two tables, because slices A, B0 and E are the first slice in their respective slice sets in the optical grating scanning sequence, their slice_set_syntax is 1 and their first_of_slice_set_flag is 1 while the first_of_slice_set_flag of other slices is 0, and a slice set syntax set is present in the three slices. Because the slice_set_syntax of B1, B2, C and D is 0, no slice set syntax set is present in the four slices and the slice set syntax of B1 and B2 is the same as that of B0 and the slice set syntax of C and D is the same as that of A. Optionally, the slice set syntax elements of B1/B2 and C/D are the same as the slice set syntax elements of slices whose first_of_slice_set_flag is 1. In this embodiment of the invention, the first_of_slice_set_flag of slices B0 and A is 1.
Step 4: According to definitions of the parameters, the slice_vertical_position and slice_horizontal_position of slices A, B0, C, B1, D, B2, and E are as follows:
Their slice_vertical_position values are: Slice A: 0, Slice B0: 3, Slice C: 3, Slice B1: 4, Slice D: 4, Slice B2: 5 and Slice E: 5; and
Their slice_horizontal_position values are: Slice A: 0, Slice B0: 2, Slice C: 7, Slice B1: 2, Slice D: 5, Slice B2: 2 and Slice E: 6.
Step 5: Encoding the picture according to the parameters or flags which are also encoded into the bit stream sent to the decoder.
Step 6: The decoder performs decoding according to the parameters or flags in the bit stream received from the encoder to reconstruct the picture shown in FIG. 1.
According to the above settings and the definition of slice set, in the decoder, with respect to the picture shown in FIG. 1, the decoding of any macroblock inside slice A, C or D can use data of the other two slices; the decoding of any macroblock inside slice B0, B1 or B2 can use data of the other two slices; the decoding of macroblocks in slice E cannot use data of other slices.
Corresponding to the encoder, after the decoder resolves the slice set parameters from the bit stream, the decoding may be done by any of the following means:
(1) Single-thread decoding
In this mode, slices are decoded in the optical grating scanning sequence. The specific process is:
Decoding Slice A: Because A is the first slice in the picture, the decoding method may be that defined in a standard (like MPEG-2) in the conventional art.
Decoding Slice B0: Because the slice_set_index of B0 is different from that of A, the decoding of macroblocks in B0 cannot use the data of A.
Decoding Slice C: Because the slice_set_index of C is different from that of B0, the decoding of macroblocks in C cannot use the data of B0; because the slice_set_index of C is the same as that of A and the slice_independency of C is 0, the decoding of macroblocks in C can use the data of A.
Decoding Slice B1: Because the slice_set_index of B1 is different from that of C, the decoding of macroblocks in B1 cannot use the data of C; because the slice_set_index of B1 is the same as that of B0 and the slice_independency of B1 is 0, the decoding of macroblocks in B1 can use the data of B0.
Decoding Slice D: Because the slice_set_index of D is different from that of B1, the decoding of macroblocks in C cannot use the data of B1; because the slice_set_index of D is the same as that of C and the slice_independency of D is 0, the decoding of macroblocks in D can use the data of C.
Decoding Slice B2: Because the slice_set_index of B2 is different from that of D, the decoding of macroblocks in B2 cannot use the data of D; because the slice_set_index of B2 is the same as that of B1 and the slice_independency of B2 is 0, the decoding of macroblocks in B2 can use the data of B1.
Decoding Slice E: Because the slice_set_index of E is different from that of B2, the decoding of macroblocks in E cannot use the data of B2; because the slice_set_index of E is different from that of D, the decoding of macroblocks in E can use the data of D.
After slices A, B0, C, B1, D, B0 and E are decoded, because B0, B1 and B2 are in one slice set, B0, B1 and B2 can be displayed alone without A, C, D and E; alternatively, a controlling unit may display one or more of other slice sets.
(2) Parallel decoding
In this mode, multiple threads run simultaneously to decode the slices. Supposing three threads run to decode the picture shown in FIG. 1, the decoding process is as follows:
Suppose three threads a, b, and c simultaneously decode the coded bit stream of the picture shown in FIG. 1. According to the slice_set_index of the slices in FIG. 1, slices A, C and D with the same slice_set_index are placed in thread A, and slices B0, B1 and B2 with the same slice_set_index are placed in thread B, and slice E is placed in thread C. The decoding process of each thread is the same as the decoding of each slice in single-thread mode.
After threads A, B and C decode the data of all slices, the contents of the slices can be spliced: the splicing is performed in the ascending sequence of slice_vertical_position in the slices; if the slice_vertical_position of slices is the same, the splicing is performed according to the slice_horizontal_position sequence. For example, because the slice_vertical_position of slice A is 0, the data of slice A is put in the reconstructed picture region ahead of the data of other slices; because the slice_vertical_position of slice B0 is the same as that of slice C while the slice_horizontal_position of slice B0 is smaller than that of slice C, the data of slice B0 is put in the reconstructed picture region before the data of slice C. In this way, the seven slices A, B0, C, B1, D, B2 and E will be spliced to reconstruct the picture shown in FIG. 1 after the coding and decoding processing.
Because, slices A, C, and D, slices B0, B1 and B2, and slice E belong to three different slice sets, and are decoded respectively in three threads, slices A, C, and D, slices B0, B1 and B2, and slice E can be displayed separately.
In the above two decoding methods, if the slice_set_syntax of a slice is 0, the slice set syntax set of the slice is the same as the slice set syntax set of a slice preceding the current slice in the optical grating scanning sequence whose slice_set_syntax is 1. Optionally, the slice set syntax set of the slice is the same as the slice set syntax set of the slice in the current slice set whose first_of_slice_set_flag is 1.
Embodiment 2
In the first embodiment of the invention, a picture is divided into slices according to the value of slice_set_index. In addition to this, the second embodiment of the invention provides a slice set dividing method based on slice_distance. The specific solution is as follows:
The slice_distance is an integer number indicating the number of slices that are not in the slice set between two slices adjacent in the optical grating scanning sequence in the slice set. For example, the current slice is the 10th slice in the picture in the optical grating scanning sequence and the slice_distance of the current slice is 3. Then, it is known that the slice in the same slice set and preceding the current slice in the optical grating scanning sequence is the 6th slice in the picture in the optical grating scanning sequence. This means, the 6th and the 10th slices in the optical grating scanning sequence are in the same slice set. The decoder is able to know the division of slice sets in the picture according to the slice distance.
Further, slice_distance may also be used with one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency, slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension, and slice_vertical_position_extension. The specific use of the parameters is the same as that described in the first embodiment.
Or, the slice_distance parameter may be used in a slice alone. In this case, the coding and decoding of data of a slice in the slice set may use data of other slices in the same slice set, but data outside the current slice set cannot be used.
Embodiment 3
In the third embodiment of the invention, the division of slice sets is achieved by writing the position of the first macroblock of each slice set in a picture, the position of the first macroblock in each slice in a slice set, and the number of continuous macroblocks included in the slice set, in the sequence header or picture header.
In the embodiment, slice set parameters may include one or more of the parameters slice_connect, slice_set_syntax, slice_set_reference, slice_independency, slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension and slice_vertical_position_extension. The specific use of the parameters is the same as that described in the first embodiment.
Embodiment 4
In the fourth embodiment, when a slice set covers a rectangle region, the division of slice sets may be achieved by writing the position coordinates of the slice set in the sequence header of the picture, the picture header or the first slice in the slice set and instructing that no slice start code is present in other slices in the slice set.
Specifically, the coordinates of the upper left corner of the region covered by the slice set and the length and width of the region may be written in the sequence header, picture header or the first slice of the slice set. Further, a 1-bit flag is written in the sequence header, picture header or slice set, indicating whether a slice start code is present in slices other than the first slice in the optical grating scanning sequence in the slice set. For example, the flag may be set to 1, indicating that all slices in the slice set carry a slice start code, and to 0, indicating that the first slice in the optical grating scanning sequence in the slice set carries a slice start code but other slices do not carry the slice start code.
If the above start code flag indicates that no slice start code is present in slices other than the first slice in the optical grating scanning sequence in the slice set, the start position of slices other than the first slice in the optical grating scanning sequence in the slice set may be determined according to the position of the upper left corner of the slice set, the length and width of the slice set and the height of the slice (may be the height of one or more macroblocks) in the sequence header, picture header or slice set. For example, when it is determined that the currently coded macroblock is the last macroblock in the slice in the optical grating scanning sequence, the macroblock to be coded next to the current macroblock (in the optical grating scanning sequence) is the first macroblock to be coded in the next slice in the slice set according to the optical grating scanning sequence.
Corresponding to the coding method, the decoding of the slice set is performed by determining the start or end of slices other than the first slice in the optical grating scanning sequence according to the position of the upper left corner of the slice set, the length and width of the slice set and the height of the slice in the sequence header, picture header or slice set.
If no slice start code is present in a slice in the slice set, the slice will not carry such syntax elements as slice_horizontal_position, slice_vertical_position, slice_horizontal_position_extension, and slice_vertical_position_extension. However, such syntax elements as slice_connect, slice_set_syntax, slice_set_reference, and slice_independency may be present. The use of the syntax elements is the same as that described in the first embodiment.
Embodiment 5
In the fifth embodiment of the invention, the slice set technique is applied to resolution based coding. Specifically, a new syntax element slice_resolution_ratio is added on the basis of the first embodiment of the invention, indicating the ratio of the resolution used for the coding and decoding of intra-slice data in a slice set to the original resolution of the picture where the slice set belongs. Accordingly, the decoder may obtain from the bit stream the ratio of the resolution used for the coding of the intra-slice data in the slice set to the original resolution of the slice and perform decoding according to the ratio.
For example, the resolution of the picture represented by FIG. 1 is 128×128, which means there are 128 pixels each in the horizontal direction and the vertical direction of the picture. If the slice_resolution_ratio of the slice set where B0, B1 and B2 belong is 0.5, it indicates that slice data in the slice set is encoded at the resolution of (128×0.5)×(128×0.5). Before data in B0, B1 and B2 is encoded, the resolution of the data is first reduced to half of the original resolution via downsampling before the method in the first embodiment of the invention is used to perform the coding. Accordingly, the decoder decodes the data of B0, B1 and B2 in the method described in the first embodiment of the invention and then increases the resolution of the obtained data to double via upsampling.
Embodiment 6
In the sixth embodiment of the invention, the slice set technique is applied to quality based coding. Specifically, a new syntax element slice_bit_depth is added on the basis of the first embodiment of the invention, indicating the number of bits used for each pixel when intra-slice data in a slice set is coded or decoded. This means the number of bits required for the coding of each piece of slice data in the coded slice set is also coded in the coding process. For example, if the slice_bit_depth of the slice set where B0, B1 and B2 belong in the first embodiment of the invention is 10, it indicates 10 bits are used to store a pixel value when B0, B1 and B2 are coded or decoded.
Accordingly, the decoder may obtain the number of storage bits required for each piece of decoded data when slice data in the slice set is decoded and perform decoding according to the number of bits.
Those skilled in the art may understand that all or part of the steps in the foregoing embodiments may be performed by hardware by following instructions of a program. The program may be stored in a computer readable storage medium such as a read-only memory/random access memory (ROM/RAM), a magnetic disk or a compact disk.
Embodiment 7
In the seventh embodiment of the invention, in addition to the information described in the first embodiment, a slice set may also include quantization parameters related to a designated quantization method. The specific process includes: the encoder decides whether slices in the slice set are encoded with a designated quantization method and writes in the slice one or more of the following quantization parameters: a parameter indicating that a designated quantization matrix is used for slices in the current slice set, an index of the designated quantization matrix used for slices in the current slice set, an enable flag indicating adaptive weighted quantization of macroblocks is used for slices in the slice set, a weighted quantization parameter used for the slice set, and the quantization matrix used for slices in the current slice set. The decoder obtains quantization parameter information used for slices in the current slice set and gets one or more of the following parameters according to the quantization parameter information and performs decoding accordingly. The obtaining of quantization parameter information includes one or more of the following: obtaining the parameter indicating the use of a designated quantization matrix for slices in the current slice set; or obtaining the index indicating use of the quantization matrix for slices in the current slice set; or obtaining the enable flag indicating the use of adaptive weighted quantization of macroblocks for slices in the current slice set; or obtaining the weighted quantization parameter used for slices in the current slice set; or obtaining the quantization matrix used for slices in the current slice set.
For example, a weighted quantization matrix is used for coding and a 1-bit weighted matrix enable flag is written in a slice, indicating whether a weighted quantization matrix is used for coding. If the value of the enable flag is 1, 2-bit information is written in the slice, indicating the weighted matrix index used for coding of the slice in the current slice set. If the index value is 0, it indicates that the slice in the slice set uses the same weighted quantization matrix as the picture header or a weighted matrix is preset at the encoder; the value 1, 2, or 3 indicates the use of the weighted quantization matrix or weighted quantization matrix model whose index is 1, 2 or 3. Optionally, a weighted quantization parameter or weighted quantization matrix may be written in the slice.
Accordingly, the decoder resolves slice parameters in the slice set to obtain the weighted matrix enable flag which indicates whether a weighted matrix is used for coding of the slice in the slice set. If the value of the enable flag is 1, the decoder reads 2-bit more information from the slice to get the index of the weighted matrix used for coding of the slice in the current slice set. If the index value is 0, it indicates that the slice in the slice set uses the same weighted quantization matrix as the picture header or a weighted matrix is preset at the decoder; the value 1, 2, or 3 indicates the use of the weighted quantization matrix or weighted quantization matrix model whose index is 1, 2 or 3. Optionally, the decoder may read the weighted quantization parameter or weighted quantization matrix from the slice.
The method in the embodiment of the invention guarantees the use of a designated quantization matrix for a specified region in a picture to adjust the subjective quality of the region and guarantees the coding efficiency of the entire picture as well.
The embodiment of the invention also provides a video encoder and video decoder. The structures of the encoder and decoders will be described in detail.
In addition to the optical grating scanning sequence, the scanning sequence used for sorting the macroblocks or slices in the embodiments of the invention may also be any other scanning sequence determined according a certain rule.
A structure of the encoder provided in the embodiment of the invention is shown in FIG. 2, including the following units:
(1) a slice dividing unit 21, configured to divide a picture to be encoded into several slices, each containing macroblocks continuous in the optical grating scanning sequence in the picture;
(2) a slice set dividing unit 22, configured to divide slices in the picture into one or more slice sets according to attribute information of the slices divided by the slice dividing unit 21, each containing one or more slices; and
(3) an encoding unit 23, configured to encode slices in the slice sets divided by the slice set dividing unit 22 to get a coded bit stream of the picture.
Optionally, the encoder may further include an attribute information obtaining unit 24, configured to: obtain one or more of the items, spatial continuity of slices, picture feature, spatial resolution, content of the picture or region, and picture coding quality, as the attribute information and send the information to the slice set dividing unit 22.
Optionally, in the encoder, the encoding unit 23 may include, without limitation, one or more of the following units: (1) an enable flag encoding unit 231, configured to write a slice set enable flag in the coded bit stream obtained by the encoding unit 23, where the flag indicates whether slice sets are divided in the coding process of the current picture so that the decoder knows whether the division of slice sets is performed; (2) a position parameter encoding unit 232, configured to encode at least one of the horizontal position parameter and vertical position parameter of slices in the slice set so that the position information of the slices in the picture is sent to the decoder; (3) an identification information encoding unit 233, configured to encode slice set identification information of the slice set so that the slice set division information of the encoder is sent to the decoder and that the decoder knows the mappings of slice sets and slices, for example, which slice set the current slice belongs to or which slices a slice set includes; (4) a slice set syntax indicator encoding unit 234, configured to encode the information whether slice set syntax elements are present in the current slice so that the decoder determines the slice set syntax elements used by the current slice according to the information; (5) a slice connect flag encoding unit 235, configured to encode the information whether the current slice is the last slice in the current slice set; (6) a first data usage indicator encoding unit 236, configured to encode the information that the reference data used in the coding of the slice is the data in the current slice set or all data in the picture; (7) a second data usage indicator encoding unit 237, configured to encode the information that the data used in the coding of macroblocks in the current slice is the data in the current slice set or all data in the picture; and (8) a second slice connect flag encoding unit 238, configured to encode the information whether the current slice is the first slice in the current slice set.
Optionally, the encoding unit 23 may further include at least one of the following processing units: (1) a resolution information encoding unit 239, configured to encode the ratio of the resolution used for coding of intra-slice data in the slice set to the original resolution of the slice; and (2) a quality information encoding unit 240, configured to encode the number of bits used by each piece of encoded data when the slice data in the slice set is encoded.
The embodiment of the invention also provides a decoder, of which a specific structure is shown in FIG. 3. As shown in FIG. 3, the decoder includes the following processing units:
(1) a bit stream receiving unit 31, configured to receive a bit stream to be decoded.
(2) a slice information obtaining unit 32, configured to obtain slice and slice set division information from the bit stream to be decoded received by the bit stream receiving unit 31, where a slice includes macroblocks continuous in the optical grating scanning sequence in the picture and a slice set includes one or more slices.
Optionally, in the decoder, the slice information obtaining unit 32 may include, without limitation, one or more of the following units: (1) a position parameter obtaining unit 321, configured to obtain at least one of the horizontal position parameter and vertical position parameter of all slices in the slice sets in the bit stream; (2) an identification information obtaining unit 322, configured to obtain slice set identification information of the slice sets in the bit stream; (3) a slice set syntax indicator obtaining unit 323, configured to obtain the information whether slice set syntax elements are present in the current slice; (4) a slice connect flag obtaining unit 324, configured to obtain the information whether the current slice in the bit stream is the last slice in the current slice set; (5) a first data usage indicator obtaining unit 325, configured to obtain the information that the reference data used in the coding of the slices in the bit stream is the data in the current slice set or all data in the picture; (6) a first data usage indicator obtaining unit 326, configured to obtain the information that the data used in the coding of macroblocks in the bit stream is the data in the current slice set or all data in the picture; and (7) a second slice connect flag obtaining unit 327, configured to obtain the information whether the current slice in the bit stream is the first slice in the current slice set.
(3) a picture reconstructing unit 33, configured to: decode the bit stream according to the slice and slice set division information obtained by the slice information obtaining unit 32 and reconstruct the picture.
Optionally, the decoder may further include the following units: (1) an enable flag obtaining unit 328, configured to obtain the slice set enable flag in the coded bit stream, where the flag indicates whether slice sets are divided in the coding process of the current picture; and (2) an information obtainment notifying unit 329, configured to notify the slice information obtaining unit 32 after determining that the current picture is divided into slice sets.
Optionally, the slice information obtaining unit 32 may further include at least one of the following processing units: (1) a resolution information obtaining unit 330, configured to obtain the ratio of the resolution used for coding of intra-slice data in the slice set to the original resolution of the slice; and (2) a quality information obtaining unit 331, configured to obtain from the bit stream the number of storage units required for each piece of decoded data when the slice data in the slice set is encoded as the quality information.
To sum up, in the embodiments of the invention, through flexible combination of slices and the processing of the coding and decoding of the slice set made up of multiple slices and the syntax elements used in the coding and decoding, region specific, resolution specific and quality specific picture coding and decoding are realized to increase the flexibility of video coding and decoding. Moreover, the video coding and decoding solution provided by the embodiments of the invention is easy to implement. Relative to the conventional art, the solution will effectively reduce the hardware complexity of the coding system and the decoding system. In addition, in the embodiments of the invention, an appropriate slice set enable flag is added so that the solution provided by the embodiments of the invention is well compatible with the existing coding and decoding systems.
Although the invention has been described through some exemplary embodiments, the invention is not limited to such embodiments. It is apparent that those skilled in the art can make various modifications and substitutions to the invention without departing from the spirit and scope of the invention. In view of the foregoing, it is intended that the invention cover modifications and variations of this invention provided they fall within the scope of the following claims and their equivalents.

Claims (22)

What is claimed is:
1. A video coding method comprising:
dividing a picture to be encoded into several slices, each containing continuous blocks;
grouping the slices contained in the picture into one or more slice sets, each containing one or more of the slices; and
encoding slices in the slice sets according to slice and slice set division information to get a coded bit stream of the picture, wherein the information includes whether one or more slice set syntax elements are present in a current slice;
wherein when none of the slice set syntax elements is present in the current slice, using syntax set syntax elements for encoding the current slice that are
a) the same as any slice set syntax elements of a slice containing one or more slice set syntax elements that occurs before the current slice in a current slice set, or
b) the same as any slice set syntax elements of a first slice in the current slice set.
2. The method according to claim 1, further comprising:
writing a slice set enable flag in the coded bit stream, indicating whether the slices are grouped into one or more slice sets in a coding process of a current picture.
3. The method according to claim 1, wherein encoding slices in the slice sets comprises at least one of the following coding actions:
a) encoding at least one of a horizontal position parameter and a vertical position parameter of each slice in the slice sets;
b) encoding slice set identification information of the slice sets;
c) encoding information that a current slice is a first slice in a current slice set;
d) encoding information that the current slice is a last slice in the current slice set;
e) encoding information that reference data used in inter-frame coding of a slice is data in slices in a same slice set as the current slice in the picture or all data available in the picture; and
f) encoding information that data used in coding of blocks in a slice is data of the current slice or data of other slices in the same slice set as the current slice.
4. The method according to claim 3, wherein the slice set identification information comprises at least one of the following items:
a) slice set index allocated for each slice set;
b) slice distance, indicating a quantity of slices that are not in the slice set between two slices adjacent in the slice set; and
c) start block and a quantity of blocks in the slice set.
5. The method according to claim 3, wherein if the data used in the coding of blocks of a slice is data of the slice set of the current slice, slices that can be used for coding of the data of the current slice precede the current slice in the scanning sequence and follow a previous slice in the slice set which is encoded using the data in the slice set of the current slice.
6. The method according to claim 1, wherein the encoding slices in the slice sets further comprises:
encoding information that a designated quantization method is used for slices in a slice set and encoding one or more of the following quantization parameters:
a parameter indicating use of a designated quantization matrix for slices in a current slice set;
an index of a quantization matrix used for the slices in the current slice set;
an enable flag indicating use of block adaptive weighted quantization for the slices in the current slice set;
a weighted quantization parameter used for the slices in the current slice set; and
the quantization matrix used for the slices in the current slice set.
7. The method according to claim 1, wherein data used in coding of a slice comprises data from other slices in the slice set where the current slice belongs and does not comprise data of slices outside the slice set of the current slice.
8. A video decoding method comprising:
obtaining slice and slice set division information from a bit stream to be decoded;
decoding the bit stream according to the obtained slice and slice set division information, wherein the obtained information includes whether a current slice contains slice set syntax elements in the bit stream; and
when none of the slice set syntax elements is present in a slice, using the syntax set syntax elements for decoding the current slice that are
a) the same as any slice set syntax elements of a slice containing one or more slice set syntax element that occurs before the current slice in a current slice set, or
b) the same as any slice set syntax elements of a first slice in the current slice set.
9. The method according to claim 8, wherein obtaining the slice and slice set division information in the bit stream comprises:
obtaining a slice set enable flag which indicates whether slices are grouped into one or more slice sets in coding of a current picture in the bit stream to be decoded; and
obtaining the slice and slice set division information in the bit stream after determining that the current picture is divided into the slice sets.
10. The method according to claim 8, wherein decoding the bit stream comprises at least one of the following decoding actions:
a) obtaining at least one of a horizontal position parameter and a vertical position parameter of each slice in the slice sets in the bit stream and performing decoding according to the parameter;
b) obtaining slice set identification information of the slice sets in the bit stream and performing decoding according to the slice set identification information;
c) obtaining first information whether a current slice is a last slice in a current slice set in the bit stream and performing decoding according to the first information;
d) obtaining second information whether the current slice is a first slice in the current slice set in the bit stream and performing decoding according to the second information;
e) obtaining fourth information in the bit stream that reference data used in inter-frame coding of a slice is data in slices in a same slice set as the current slice in the picture or all data available in the picture; and
f) obtaining fifth information in the bit stream that data used in coding of blocks in a slice is data of the current slice or data of other slices in the same slice set as the current slice.
11. The method according to claim 10, wherein the slice set identification information comprises at least one of the following items:
a) slice set index allocated for each slice set;
b) slice set index allocated for each slice;
c) slice distance, indicating a quantity of slices that are not in the slice set between two slices adjacent in a scan order in the slice set; and
d) start block and a quantity of blocks in the slice set.
12. The method according to claim 8, wherein decoding the bit stream further comprises:
obtaining quantization parameters used for slices in a current slice set and getting one or more of the following parameters, according to the quantization parameters and performing decoding accordingly:
a) a parameter indicating use of a designated quantization matrix for slices in the current slice set;
b) an index of the quantization matrix used for the slices in the current slice set;
c) an enable flag indicating use of block adaptive weighted quantization for the slices in the current slice set;
d) a weighted quantization parameter used for the slices in the current slice set; and
e) a quantization matrix used for the slices in the current slice set.
13. The method according to claim 8, further comprising obtaining position information and shape information of the slice set from a sequence header of the picture, a picture header or a first slice in the slice set, and determining start positions of slices in the slice set.
14. The method according to claim 8, further comprising at least one of the following:
a) obtaining, from the bit stream, a ratio of a resolution used for coding of data inside the slice in the slice set and an original resolution of the slice and decoding the bit stream according to the ratio; and
b) obtaining a quantity of storage bits required for each piece of decoded data when slice data in the slice set is decoded and decoding the bit stream according to the quantity of the storage bits.
15. The method according to claim 8, wherein data used in coding comprises data from other slices in the slice set where the current slice belongs.
16. A non-transitory computer readable medium storing instructions for execution by a video decoder, the instructions comprising:
instructions for obtaining slice and slice set division information from a bit stream to be decoded, wherein a slice comprises blocks continuous in a scanning sequence in a picture, and a slice set comprises one or more slices; and
instructions for decoding the bit stream according to the slice and slice set division information obtained by the slice information obtaining unit and reconstruct the picture; and
instructions for one or both of the following:
a) obtaining, from the bit stream, a ratio of a resolution used for coding of data inside the slice in the slice set and an original resolution of the slice and decoding the data according to the ratio; and
b) obtaining a quantity of storage bits required for each piece of decoded data when slice data in the slice set is decoded and decoding the data according to the quantity of the storage bits.
17. The non-transitory computer readable medium storing instructions for execution by the video decoder according to claim 16, the instructions further comprising:
instructions for obtaining a slice set enable flag in the bit stream to be decoded, where the flag indicates whether slices are grouped into one or more slice sets in coding of a current picture; and
instructions for providing notification after determining that the current picture is divided into the slice sets.
18. The non-transitory computer readable medium storing instructions for execution by the video decoder according to claim 16, the instructions further comprising:
instructions for obtaining at least one of a horizontal position parameter and a vertical position parameter of all slices in slice sets in the bit stream;
instructions for obtaining slice set identification information of the slice sets in the bit stream;
instructions for obtaining information whether a current slice in the bit stream is a last slice in a current slice set;
instructions for obtaining information whether slice set syntax elements are present in the current slice;
instructions for obtaining information that reference data used in coding of a slice is data in slices in a same slice set as the current slice in the picture or all data available in the picture;
instructions for obtaining information that data used in coding of blocks in the bit stream is data in the current slice set or data in a slice set where the current slice belongs;
instructions for obtaining information whether the current slice in the bit stream is a first slice in the current slice set;
instructions for obtaining a ratio of the resolution used for coding of data inside a slice in the slice set to an original resolution of the slice; and
instructions for obtaining from the bit stream a quantity of storage units required for each piece of decoded data when slice data in the slice set is encoded as quality information.
19. A video decoding method comprising:
obtaining slice and slice set division information from a bit stream to be decoded; and
decoding the bit stream according to the obtained slice and slice set division information, wherein the decoding includes one or both of the following:
a) obtaining, from the bit stream, a ratio of a resolution used for coding of data inside the slice in the slice set and an original resolution of the slice and decoding the data according to the ratio; and
b) obtaining a quantity of storage bits required for each piece of decoded data when slice data in the slice set is decoded and decoding the data according to the quantity of the storage bits.
20. The method according to claim 19, wherein decoding the bit stream further comprises obtaining quantization parameters used for slices in the current slice set and getting one or more of the following parameters, according to the quantization parameters and performing decoding accordingly:
a) a parameter indicating use of a designated quantization matrix for slices in the current slice set;
b) an index of the quantization matrix used for the slices in the current slice set;
c) an enable flag indicating use of block adaptive weighted quantization for the slices in the current slice set;
d) a weighted quantization parameter used for the slices in the current slice set; and
e) a quantization matrix used for the slices in the current slice set.
21. A video coder comprising a processor and a memory, wherein the memory includes instructions executed by the processor so as to configure to:
divide a picture to be encoded into several slices, each containing macroblocks continuous in a scanning sequence in the picture;
group the slices contained in the picture into one or more slice sets, each containing one or more of the slices; and
encode slices in the slice sets according to slice and slice set division information to get a coded bit stream of the picture, wherein the information includes whether one or more slice set syntax elements are present in a current slice;
 wherein when none of the slice set syntax elements is present in the current slice, using syntax set syntax elements for encoding the current slice that are
a) the same as any slice set syntax elements of a slice containing one or more slice set syntax elements that occurs before the current slice in a current slice set, or
b) the same as any slice set syntax elements of a first slice in the current slice set.
22. A video coder comprising a processor and a memory, wherein the memory includes instructions executed by the processor so as to configure to:
obtain slice and slice set division information from a bit stream to be decoded;
decode the bit stream according to the obtained slice and slice set division information, wherein the obtained information includes whether a current slice contains slice set syntax elements in the bit stream; and
when none of the slice set syntax elements is present in a slice, using the syntax set syntax elements for decoding the current slice that are
a) the same as any slice set syntax elements of a slice containing one or more slice set syntax element that occurs before the current slice in a current slice set, or
the same as any slice set syntax elements of a first slice in the current slice set.
US15/418,496 2007-09-14 2017-01-27 Method and device for video coding and decoding Active US10432938B2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US15/418,496 US10432938B2 (en) 2007-09-14 2017-01-27 Method and device for video coding and decoding
US16/573,148 US10939112B2 (en) 2007-09-14 2019-09-17 Method and device for video coding and decoding
US17/164,502 US11659184B2 (en) 2007-09-14 2021-02-01 Method and device for video coding and decoding
US18/302,761 US12382046B2 (en) 2007-09-14 2023-04-18 Method and device for video coding and decoding

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
CN200710153924 2007-09-14
CN200710153924.5 2007-09-14
CN200710153924 2007-09-14
CN200810096456.7 2008-05-09
CN2008100964567A CN101389021B (en) 2007-09-14 2008-05-09 Video encoding/decoding method and apparatus
CN200810096456 2008-09-09
PCT/CN2008/072362 WO2009033432A1 (en) 2007-09-14 2008-09-12 A method and apparatus for video encoding and decoding
US12/724,076 US8831107B2 (en) 2007-09-14 2010-03-15 Method and device for video coding and decoding
US14/451,083 US9769482B2 (en) 2007-09-14 2014-08-04 Method and device for video coding and decoding
US15/418,496 US10432938B2 (en) 2007-09-14 2017-01-27 Method and device for video coding and decoding

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US14/451,083 Continuation US9769482B2 (en) 2007-09-14 2014-08-04 Method and device for video coding and decoding

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US16/573,148 Continuation US10939112B2 (en) 2007-09-14 2019-09-17 Method and device for video coding and decoding

Publications (2)

Publication Number Publication Date
US20170142414A1 US20170142414A1 (en) 2017-05-18
US10432938B2 true US10432938B2 (en) 2019-10-01

Family

ID=40478151

Family Applications (5)

Application Number Title Priority Date Filing Date
US12/724,076 Active 2029-11-26 US8831107B2 (en) 2007-09-14 2010-03-15 Method and device for video coding and decoding
US14/451,083 Active US9769482B2 (en) 2007-09-14 2014-08-04 Method and device for video coding and decoding
US15/418,496 Active US10432938B2 (en) 2007-09-14 2017-01-27 Method and device for video coding and decoding
US16/573,148 Active US10939112B2 (en) 2007-09-14 2019-09-17 Method and device for video coding and decoding
US17/164,502 Active US11659184B2 (en) 2007-09-14 2021-02-01 Method and device for video coding and decoding

Family Applications Before (2)

Application Number Title Priority Date Filing Date
US12/724,076 Active 2029-11-26 US8831107B2 (en) 2007-09-14 2010-03-15 Method and device for video coding and decoding
US14/451,083 Active US9769482B2 (en) 2007-09-14 2014-08-04 Method and device for video coding and decoding

Family Applications After (2)

Application Number Title Priority Date Filing Date
US16/573,148 Active US10939112B2 (en) 2007-09-14 2019-09-17 Method and device for video coding and decoding
US17/164,502 Active US11659184B2 (en) 2007-09-14 2021-02-01 Method and device for video coding and decoding

Country Status (2)

Country Link
US (5) US8831107B2 (en)
CN (1) CN101389021B (en)

Families Citing this family (65)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007067990A2 (en) * 2005-12-08 2007-06-14 Vidyo, Inc. Systems and methods for error resilience and random access in video communication systems
US8693538B2 (en) 2006-03-03 2014-04-08 Vidyo, Inc. System and method for providing error resilience, random access and rate control in scalable video communications
US8462856B2 (en) * 2007-01-09 2013-06-11 Vidyo, Inc. Systems and methods for error resilience in video communication systems
CN101707716B (en) * 2009-11-19 2011-05-11 清华大学 Video coder and coding method
JP5805665B2 (en) 2010-01-22 2015-11-04 トムソン ライセンシングThomson Licensing Data pruning for video compression using Example-based super-resolution
JP5911809B2 (en) 2010-01-22 2016-04-27 トムソン ライセンシングThomson Licensing Sampling-based super-resolution video encoding and decoding method and apparatus
CN102215396A (en) * 2010-04-09 2011-10-12 华为技术有限公司 Video coding and decoding methods and systems
CN103281527B (en) * 2010-04-15 2015-12-23 华为技术有限公司 Image coding/decoding method and relevant apparatus
US9047218B2 (en) * 2010-04-26 2015-06-02 Cleversafe, Inc. Dispersed storage network slice name verification
EP2614645A1 (en) * 2010-09-10 2013-07-17 Thomson Licensing Video decoding using example - based data pruning
WO2012033972A1 (en) 2010-09-10 2012-03-15 Thomson Licensing Methods and apparatus for pruning decision optimization in example-based data pruning compression
US9338477B2 (en) 2010-09-10 2016-05-10 Thomson Licensing Recovering a pruned version of a picture in a video sequence for example-based data pruning using intra-frame patch similarity
US8656138B2 (en) 2010-10-06 2014-02-18 Cleversafe, Inc. Efficiently accessing an encoded data slice utilizing a memory bin
KR20130105870A (en) * 2010-10-27 2013-09-26 브이아이디 스케일, 인크. Systems and methods for adaptive video coding
US10244239B2 (en) 2010-12-28 2019-03-26 Dolby Laboratories Licensing Corporation Parameter set for picture segmentation
CN103404138B (en) * 2011-01-05 2017-01-18 皇家飞利浦电子股份有限公司 Video encoding and decoding device and method for retaining PPG-related information
CN102595119B (en) * 2011-01-14 2014-03-12 华为技术有限公司 Stripe coding method and device thereof as well as stripe decoding method and device thereof
CN102595123B (en) * 2011-01-14 2014-06-04 华为技术有限公司 Stripe coding method and device thereof as well as stripe decoding method and device thereof
WO2012105959A1 (en) * 2011-02-01 2012-08-09 Vidyo, Inc. Spatial scalability using redundant pictures and slice groups
US8934530B2 (en) 2011-02-01 2015-01-13 Vidyo, Inc. Spatial scalability using redundant pictures and slice groups
US8891889B2 (en) 2011-04-13 2014-11-18 Huawei Technologies Co., Ltd. Image encoding and decoding methods and related devices
US9060173B2 (en) * 2011-06-30 2015-06-16 Sharp Kabushiki Kaisha Context initialization based on decoder picture buffer
DK4152749T3 (en) 2011-07-15 2024-07-15 Ge Video Compression Llc Sample array encoding for low latency
US9584819B2 (en) * 2011-10-24 2017-02-28 Qualcomm Incorporated Grouping of tiles for video coding
US9247258B2 (en) * 2011-10-26 2016-01-26 Qualcomm Incorporated Unified design for picture partitioning schemes
EP2745512B1 (en) 2012-01-03 2019-10-23 HFI Innovation Inc. Method and apparatus for block-based significance map and significance group flag context selection
CN103220507B (en) * 2012-01-19 2018-04-20 中兴通讯股份有限公司 A kind of video coding-decoding method and system
TWI597969B (en) * 2012-01-30 2017-09-01 三星電子股份有限公司 Video decoding apparatus
SG10201505821WA (en) * 2012-01-30 2015-08-28 Samsung Electronics Co Ltd Method and apparatus for video encoding for each spatial sub-area, and method and apparatus for video decoding for each spatial sub-area
CN103260022A (en) * 2012-02-21 2013-08-21 安凯(广州)微电子技术有限公司 Low-power-consumption video decoding method and device
WO2013144144A1 (en) * 2012-03-30 2013-10-03 Panasonic Corporation Syntax and semantics for adaptive loop filter and sample adaptive offset
SG10201607842VA (en) * 2012-04-13 2016-11-29 Mitsubishi Electric Corp Image encoding device, image decoding device, image encoding method and image decoding method
US20130343465A1 (en) * 2012-06-26 2013-12-26 Qualcomm Incorporated Header parameter sets for video coding
US9332269B2 (en) * 2012-06-27 2016-05-03 Broadcom Corporation Slice overhead coding
CN104429073B (en) * 2012-09-06 2018-01-19 太阳专利托管公司 Method for encoding images/coding/decoding method and device, arrangement for encoding
KR102072832B1 (en) * 2012-09-26 2020-02-03 벨로스 미디어 인터내셔널 리미티드 Image encoding method, image decoding method, image encoding device, image decoding device, and image encoding/decoding device
SG11201600223UA (en) * 2013-07-19 2016-02-26 Sony Corp Information processing device and method
US10283091B2 (en) * 2014-10-13 2019-05-07 Microsoft Technology Licensing, Llc Buffer optimization
CN105872539B (en) 2015-02-08 2020-01-14 同济大学 Image encoding method and apparatus, and image decoding method and apparatus
WO2016124154A1 (en) * 2015-02-08 2016-08-11 同济大学 Image coding method and apparatus, and image decoding method and apparatus
EP3259848A4 (en) 2015-04-10 2018-10-24 Red.Com, Llc Video camera with rate control video compression
CN104935933B (en) * 2015-06-05 2019-11-26 广东中星微电子有限公司 A kind of video coding-decoding method
US10097765B2 (en) 2016-04-20 2018-10-09 Samsung Electronics Co., Ltd. Methodology and apparatus for generating high fidelity zoom for mobile video
US10034026B2 (en) 2016-04-22 2018-07-24 Akila Subramaniam Device for and method of enabling the processing of a video stream
CN107517385B (en) * 2016-06-16 2020-02-21 华为技术有限公司 A kind of video image encoding and decoding method and device
US10805611B2 (en) * 2016-10-18 2020-10-13 Mediatek Inc. Method and apparatus of constrained sequence header
CN115174905B (en) 2017-01-20 2025-05-13 世宗大学校产学协力团 Image decoding method, image encoding method and bit stream generation method
WO2019010233A1 (en) 2017-07-05 2019-01-10 Red. Com, Llc Video image data processing in electronic devices
CN111264059B (en) * 2017-10-25 2021-11-12 株式会社索思未来 Encoding method, decoding method, encoding device, decoding device, and computer-readable recording medium
WO2019080136A1 (en) * 2017-10-28 2019-05-02 华为技术有限公司 Video encoding and decoding method and device
EP3847817B1 (en) 2018-09-14 2024-05-15 Huawei Technologies Co., Ltd. Slicing and tiling in video coding
WO2020140908A1 (en) * 2018-12-31 2020-07-09 Beijing Bytedance Network Technology Co., Ltd. Mapping between distance index and distance in merge with mvd
CN113273187B (en) 2019-01-10 2024-07-05 北京字节跳动网络技术有限公司 Affine-based Merge with Motion Vector Difference (MVD)
US12113997B2 (en) * 2019-01-22 2024-10-08 Tencent America LLC Method for tile group identification
WO2020185146A1 (en) * 2019-03-11 2020-09-17 Telefonaktiebolaget Lm Ericsson (Publ) Video coding comprising rectangular tile group signaling
PL3989578T3 (en) * 2019-06-19 2024-02-26 Sony Group Corporation Image processing device, and image processing method
BR112020025895A2 (en) 2019-06-24 2021-12-28 Ericsson Telefon Ab L M Methods, computer program, carrier, decoder and encoder.
EP4085622A1 (en) * 2019-12-31 2022-11-09 Koninklijke KPN N.V. Partial output of a decoded picture buffer in video coding
CN115191114B (en) 2020-02-14 2025-06-13 抖音视界有限公司 Sub-picture signaling in video bitstream
CN112422969A (en) * 2020-10-20 2021-02-26 西安万像电子科技有限公司 Image coding and decoding method and device
EP4268457A1 (en) 2020-12-28 2023-11-01 Koninklijke KPN N.V. Partial output of a decoded picture buffer in video coding
EP4268462A1 (en) 2020-12-28 2023-11-01 Koninklijke KPN N.V. Prioritized decoding and output of parts of a picture in video coding
CN113973210B (en) * 2021-10-25 2022-09-20 腾讯科技(深圳)有限公司 Media file packaging method, device, equipment and storage medium
CN116074499B (en) 2021-11-11 2024-10-29 杭州海康威视数字技术股份有限公司 Video encoding and decoding method and device
CN116405694B (en) * 2022-03-18 2024-10-29 杭州海康威视数字技术股份有限公司 Encoding and decoding method, device and equipment

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001086002A (en) 1999-09-17 2001-03-30 Sanyo Electric Co Ltd Data encoding device
US20030112867A1 (en) 2001-11-27 2003-06-19 Miska Hannuksela Video coding and decoding
US20050053158A1 (en) * 2003-09-07 2005-03-10 Microsoft Corporation Slice-layer in video codec
US20060078209A1 (en) 2004-10-12 2006-04-13 Canon Kabushiki Kaisha Image coding apparatusa and method
WO2007019790A1 (en) 2005-08-15 2007-02-22 Huawei Technologies Co., Ltd. Method and apparatus for encoding coefficient block
JP2007174568A (en) 2005-12-26 2007-07-05 Sanyo Electric Co Ltd Encoding method
US20080151997A1 (en) 2006-12-22 2008-06-26 Qualcomm Incorporated Techniques for content adaptive video frame slicing and non-uniform access unit coding
US20080170624A1 (en) 2007-01-12 2008-07-17 Mitsubishi Electric Corporation Image encoding device and image encoding method
US7436888B2 (en) * 2003-01-30 2008-10-14 Samsung Electronics Co., Ltd. Method and apparatus for redundant image encoding and decoding
US7924625B2 (en) 2006-11-27 2011-04-12 Sandisk Corporation Segmented bitscan for verification of programming
US7924925B2 (en) * 2006-02-24 2011-04-12 Freescale Semiconductor, Inc. Flexible macroblock ordering with reduced data traffic and power consumption
US8005149B2 (en) 2006-07-03 2011-08-23 Unisor Design Services Ltd. Transmission of stream video in low latency

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7813431B2 (en) * 2002-05-20 2010-10-12 Broadcom Corporation System, method, and apparatus for decoding flexibility ordered macroblocks
US7769088B2 (en) * 2003-05-28 2010-08-03 Broadcom Corporation Context adaptive binary arithmetic code decoding engine
AU2006303886B2 (en) * 2005-10-12 2011-11-17 Interdigital Vc Holdings, Inc. Region of interest H .264 scalable video coding

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001086002A (en) 1999-09-17 2001-03-30 Sanyo Electric Co Ltd Data encoding device
US20030112867A1 (en) 2001-11-27 2003-06-19 Miska Hannuksela Video coding and decoding
CN1593065A (en) 2001-11-27 2005-03-09 诺基亚有限公司 Video encoding and decoding of foreground and background wherein picture is divided into slice
US20070183676A1 (en) 2001-11-27 2007-08-09 Nokia Corporation Video coding and decoding
US7436888B2 (en) * 2003-01-30 2008-10-14 Samsung Electronics Co., Ltd. Method and apparatus for redundant image encoding and decoding
US20050053158A1 (en) * 2003-09-07 2005-03-10 Microsoft Corporation Slice-layer in video codec
US20060078209A1 (en) 2004-10-12 2006-04-13 Canon Kabushiki Kaisha Image coding apparatusa and method
WO2007019790A1 (en) 2005-08-15 2007-02-22 Huawei Technologies Co., Ltd. Method and apparatus for encoding coefficient block
JP2007174568A (en) 2005-12-26 2007-07-05 Sanyo Electric Co Ltd Encoding method
US7924925B2 (en) * 2006-02-24 2011-04-12 Freescale Semiconductor, Inc. Flexible macroblock ordering with reduced data traffic and power consumption
US8005149B2 (en) 2006-07-03 2011-08-23 Unisor Design Services Ltd. Transmission of stream video in low latency
US7924625B2 (en) 2006-11-27 2011-04-12 Sandisk Corporation Segmented bitscan for verification of programming
US20080151997A1 (en) 2006-12-22 2008-06-26 Qualcomm Incorporated Techniques for content adaptive video frame slicing and non-uniform access unit coding
US20080170624A1 (en) 2007-01-12 2008-07-17 Mitsubishi Electric Corporation Image encoding device and image encoding method

Non-Patent Citations (9)

* Cited by examiner, † Cited by third party
Title
"Information technology-Advanced coding of audio and video-Part 2: Video," GB/T20090.2, pp. i-94, (2006).
"Information technology-Generic coding of moving pictures and associated audio information: Video," ISO/IEC 13818-2, Amendment 5, pp. i-202, International Organization for Standardization, Geneva, Switzerland (Apr. 15, 2000).
"Joint Committee Draft (CD)," Joint Video Team (JVT) of ISO/IEC MPEG and ITU-TVCEG, 3rd Meeting, Fairfax, Virginia, USA, Draft ISO/IEC 14496-10, JVT-C167, pp. i-132,International Organization for Standardization and International Telecommunication Union, Geneva, Switzerland (May 6-10, 2002).
"Series H: Audiovisual and Multimedia Systems Infrastructure of audiovisual services-Coding of moving video; Amendment 2: New profiles for professional applications," Recommendation ITU-T H.264, pp. 1-75, International Telecommunication Union, Geneva, Switzerland (Apr. 2007).
"Series H: Audiovisual and Multimedia Systems; Infrastructure of audiovisual services-Coding of moving video; Advanced video coding for generic audiovisual services," Recommendation ITU-T H.264, pp. 1-669, International Telecommunication Union, Geneva, Switzerland (Mar. 2010).
"Information technology—Advanced coding of audio and video—Part 2: Video," GB/T20090.2, pp. i-94, (2006).
"Information technology—Generic coding of moving pictures and associated audio information: Video," ISO/IEC 13818-2, Amendment 5, pp. i-202, International Organization for Standardization, Geneva, Switzerland (Apr. 15, 2000).
"Series H: Audiovisual and Multimedia Systems Infrastructure of audiovisual services—Coding of moving video; Amendment 2: New profiles for professional applications," Recommendation ITU-T H.264, pp. 1-75, International Telecommunication Union, Geneva, Switzerland (Apr. 2007).
"Series H: Audiovisual and Multimedia Systems; Infrastructure of audiovisual services—Coding of moving video; Advanced video coding for generic audiovisual services," Recommendation ITU-T H.264, pp. 1-669, International Telecommunication Union, Geneva, Switzerland (Mar. 2010).

Also Published As

Publication number Publication date
US9769482B2 (en) 2017-09-19
CN101389021A (en) 2009-03-18
US10939112B2 (en) 2021-03-02
US20210160504A1 (en) 2021-05-27
US20170142414A1 (en) 2017-05-18
US11659184B2 (en) 2023-05-23
US20140341278A1 (en) 2014-11-20
US8831107B2 (en) 2014-09-09
US20100189181A1 (en) 2010-07-29
CN101389021B (en) 2010-12-22
US20200014927A1 (en) 2020-01-09

Similar Documents

Publication Publication Date Title
US11659184B2 (en) Method and device for video coding and decoding
US11290717B2 (en) Encoding and decoding methods and apparatuses with multiple image block division manners
US12382046B2 (en) Method and device for video coding and decoding
US20230013104A1 (en) Encoding and Decoding Pictures Based on Tile Group ID
US10805644B2 (en) Device and method for entropy encoding and decoding
US9100648B2 (en) Method and apparatus for decoding a video signal
US12355954B2 (en) Image encoding/decoding method and apparatus for determining prediction mode of chroma block by referring to luma sample position, and method for transmitting bitstream
CN113678456B (en) Method and apparatus for signaling information about chroma formats
US20250142095A1 (en) Image encoding/decoding method and device using palette mode, and method for transmitting bitstream
CN118474369A (en) Encoding device, decoding device, and data transmitting device
US20240388697A1 (en) Encoding and decoding method and apparatus, and device therefor
JP7664205B2 (en) Improved tile address signaling in video encoding and decoding - Patents.com
CN119383359A (en) Intra-frame prediction method and device based on MPM list
CN114930847B (en) Transformation-based image coding method and device
CN117528081A (en) Image encoding/decoding apparatus and apparatus for transmitting data
CN117395415A (en) Image encoding and decoding apparatus and apparatus for transmitting data
CN105828080B (en) Image coding and decoding method and device
CN107911705B (en) Method for analyzing AVS2.0 key parameter and checking error frame after decoding
CN117714715A (en) Image decoding apparatus, image encoding apparatus, and data transmitting apparatus
KR20200059216A (en) Motion compensation reference frame compression
CN117857819A (en) Image encoding device, image decoding device, and data transmitting device

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

AS Assignment

Owner name: HUAWEI TECHNOLOGIES CO., LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ZHENG, XIAOZHEN;MAO, ZHEN;WANG, ZIYUAN;AND OTHERS;SIGNING DATES FROM 20100310 TO 20100408;REEL/FRAME:048530/0079

Owner name: TSINGHUA UNIVERSITY, CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ZHENG, XIAOZHEN;MAO, ZHEN;WANG, ZIYUAN;AND OTHERS;SIGNING DATES FROM 20100310 TO 20100408;REEL/FRAME:048530/0079

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4