WO2021133909A1 - Picture header intra random access picture and gradual decoder refresh signaling in video coding - Google Patents

Picture header intra random access picture and gradual decoder refresh signaling in video coding Download PDF

Info

Publication number
WO2021133909A1
WO2021133909A1 PCT/US2020/066838 US2020066838W WO2021133909A1 WO 2021133909 A1 WO2021133909 A1 WO 2021133909A1 US 2020066838 W US2020066838 W US 2020066838W WO 2021133909 A1 WO2021133909 A1 WO 2021133909A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
nal unit
gdr
pictures
picture header
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2020/066838
Other languages
English (en)
French (fr)
Inventor
Muhammed Zeyd Coban
Vadim Seregin
Adarsh Krishnan RAMASUBRAMONIAN
Marta Karczewicz
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Priority to KR1020227020175A priority Critical patent/KR20220112785A/ko
Priority to EP20839536.8A priority patent/EP4082207B1/en
Priority to CN202080087362.2A priority patent/CN114846802B/zh
Priority to BR112022011752A priority patent/BR112022011752A2/pt
Priority to JP2022537062A priority patent/JP7664253B2/ja
Priority to PH1/2022/551264A priority patent/PH12022551264A1/en
Publication of WO2021133909A1 publication Critical patent/WO2021133909A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/174Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/188Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a video data packet, e.g. a network abstraction layer [NAL] unit

Definitions

  • FIG. 4 is a flowchart illustrating an example method for encoding a current block.
  • FIG. 5 is a flowchart illustrating an example method for decoding a current block of video data.
  • a video encoder may generate encoded pictures for a set of pictures of the video data.
  • the video encoder may include a picture header NAL unit in a bitstream that comprises the encoded pictures.
  • the picture header NAL unit includes a syntax element that indicates that a picture associated with the picture header NAL unit must be either an IRAP or a Gradual Decoder Refresh (GDR) picture. Because of this syntax element, a device performing random access may directly identify the picture header NAL unit as being associated with an IRAP or GDR picture without needing to search backward in the bitstream to find this picture header NAL unit.
  • GDR Gradual Decoder Refresh
  • Video source 104 of source device 102 may include a video capture device, such as a video camera, a video archive containing previously captured raw video, and/or a video feed interface to receive video from a video content provider.
  • video source 104 may generate computer graphics-based data as the source video, or a combination of live video, archived video, and computer-generated video.
  • video encoder 200 encodes the captured, pre-captured, or computer-generated video data.
  • Video encoder 200 may rearrange the pictures from the received order (sometimes referred to as “display order”) into a coding order for coding.
  • Video encoder 200 may generate a bitstream including encoded video data.
  • Source device 102 may then output the encoded video data via output interface 108 onto computer-readable medium 110 for reception and/or retrieval by, e.g., input interface 122 of destination device 116.
  • Display device 118 may represent any of a variety of display devices such as a cathode ray tube (CRT), a liquid crystal display (LCD), a plasma display, an organic light emitting diode (OLED) display, or another type of display device.
  • CTR cathode ray tube
  • LCD liquid crystal display
  • plasma display a plasma display
  • OLED organic light emitting diode
  • video encoder 200 and video decoder 300 may use a single QTBT or MTT structure to represent each of the luminance and chrominance components, while in other examples, video encoder 200 and video decoder 300 may use two or more QTBT or MTT structures, such as one QTBT/MTT structure for the luminance component and another QTBT/MTT structure for both chrominance components (or two QTBT/MTT structures for respective chrominance components).
  • Video encoder 200 and video decoder 300 may be configured to use quadtree partitioning per HEVC, QTBT partitioning, MTT partitioning, or other partitioning structures. For purposes of explanation, the description of the techniques of this disclosure is presented with respect to QTBT partitioning. However, it should be understood that the techniques of this disclosure may also be applied to video coders configured to use quadtree partitioning, or other types of partitioning as well.
  • a slice may be an integer number of bricks of a picture that may be exclusively contained in a single network abstraction layer (NAL) unit.
  • NAL network abstraction layer
  • a slice includes either a number of complete tiles or only a consecutive sequence of complete bricks of one tile.
  • video encoder 200 may select an intra-prediction mode to generate the prediction block.
  • Some examples of VVC provide sixty-seven intra prediction modes, including various directional modes, as well as planar mode and DC mode.
  • video encoder 200 selects an intra-prediction mode that describes neighboring samples to a current block (e.g., a block of a CU) from which to predict samples of the current block. Such samples may generally be above, above and to the left, or to the left of the current block in the same picture as the current block, assuming video encoder 200 codes CTUs and CUs in raster scan order (left to right, top to bottom).
  • Video encoder 200 encodes data representing the prediction mode for a current block.
  • video encoder 200 may calculate residual data for the block.
  • the residual data such as a residual block, represents sample by sample differences between the block and a prediction block for the block, formed using the corresponding prediction mode.
  • Video encoder 200 may apply one or more transforms to the residual block, to produce transformed data in a transform domain instead of the sample domain.
  • video encoder 200 may apply a discrete cosine transform (DCT), an integer transform, a wavelet transform, or a conceptually similar transform to residual video data.
  • DCT discrete cosine transform
  • an integer transform an integer transform
  • wavelet transform or a conceptually similar transform
  • video encoder 200 may generate a bitstream including encoded video data, e.g., syntax elements describing partitioning of a picture into blocks (e.g., CUs) and prediction and/or residual information for the blocks.
  • video decoder 300 may receive the bitstream and decode the encoded video data.
  • a bitstream may comprise a sequence of network abstraction layer (NAL) units.
  • NAL unit is a syntax structure containing an indication of the type of data in the NAL unit and bytes containing that data in the form of a raw byte sequence payload (RBSP) interspersed as necessary with emulation prevention bits.
  • Each of the NAL units may include a NAL unit header and may encapsulate a RBSP.
  • the NAL unit header may include a syntax element indicating a NAL unit type code.
  • the NAL unit type code specified by the NAL unit header of a NAL unit indicates the type of the NAL unit.
  • An RBSP may be a syntax structure containing an integer number of bytes that is encapsulated within a NAL unit. In some instances, an RBSP includes zero bits.
  • each NAL unit includes a syntax element (e.g., nal_unit_type) that indicates a NAL unit type of the NAL unit.
  • video decoder 300 may identify, based on the NAL unit type of a NAL unit, the NAL unit as being associated with one of a plurality of picture types. These picture types may include Instantaneous Decoding Refresh (IDR) pictures, Clean Random Access (CRA) pictures, Temporal Sub-Layer Access (TSA) pictures, Broken Link Access (BLA) pictures and encoded pictures that are not IDR, CRA, or TSA pictures.
  • IDR Instantaneous Decoding Refresh
  • CRA Clean Random Access
  • TSA Temporal Sub-Layer Access
  • BLA Broken Link Access
  • picture headers are mandatory for each picture.
  • PHs are not mandatory for each picture.
  • Slice layer specific NUTs can be signalled at PH as PH NUT types and the slice layer NUTs can be replaced by a SLICE NUT indication with IRAP or GDR indication signaled in the slice header when mixed nalu types in pic flag is equal to 1.
  • the slice header level NAL unit types are derived from associated PH NUT types except for mixed nal unit case.
  • VVC Draft 7 changes BEGIN
  • FIG. 3 is a block diagram illustrating an example video decoder 300 that may perform the techniques of this disclosure.
  • FIG. 3 is provided for purposes of explanation and is not limiting on the techniques as broadly exemplified and described in this disclosure.
  • this disclosure describes video decoder 300 according to the techniques of VVC and HEVC.
  • the techniques of this disclosure may be performed by video coding devices that are configured to other video coding standards.
  • Aspect 8H A computer-readable storage medium having stored thereon instructions that, when executed, cause one or more processors to perform the method of any of aspects 1A-3F.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
PCT/US2020/066838 2019-12-23 2020-12-23 Picture header intra random access picture and gradual decoder refresh signaling in video coding Ceased WO2021133909A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
KR1020227020175A KR20220112785A (ko) 2019-12-23 2020-12-23 비디오 코딩에서의 픽처 헤더 인트라 랜덤 액세스 픽처 및 점진적 디코더 리프레시 시그널링
EP20839536.8A EP4082207B1 (en) 2019-12-23 2020-12-23 Picture header intra random access picture and gradual decoder refresh signaling in video coding
CN202080087362.2A CN114846802B (zh) 2019-12-23 2020-12-23 视频译码中的图片报头帧内随机访问图片和渐进解码器刷新信令
BR112022011752A BR112022011752A2 (pt) 2019-12-23 2020-12-23 Sinalização de imagem de acesso aleatório intra e atualização gradual do decodificador de cabeçalho de imagem na codificação de vídeo
JP2022537062A JP7664253B2 (ja) 2019-12-23 2020-12-23 ビデオコーディング中でのピクチャヘッダイントラランダムアクセスピクチャおよび漸次デコーダリフレッシュのシグナリング
PH1/2022/551264A PH12022551264A1 (en) 2019-12-23 2020-12-23 Picture header intra random access picture and gradual decoder refresh signaling in video coding

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201962953035P 2019-12-23 2019-12-23
US62/953,035 2019-12-23
US17/130,759 US11356705B2 (en) 2019-12-23 2020-12-22 Picture header intra random access picture and gradual decoder refresh signaling in video coding
US17/130,759 2020-12-22

Publications (1)

Publication Number Publication Date
WO2021133909A1 true WO2021133909A1 (en) 2021-07-01

Family

ID=76437367

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2020/066838 Ceased WO2021133909A1 (en) 2019-12-23 2020-12-23 Picture header intra random access picture and gradual decoder refresh signaling in video coding

Country Status (9)

Country Link
US (1) US11356705B2 (https=)
EP (1) EP4082207B1 (https=)
JP (1) JP7664253B2 (https=)
KR (1) KR20220112785A (https=)
CN (1) CN114846802B (https=)
BR (1) BR112022011752A2 (https=)
PH (1) PH12022551264A1 (https=)
TW (1) TWI868286B (https=)
WO (1) WO2021133909A1 (https=)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210203971A1 (en) * 2019-12-27 2021-07-01 Alibaba Group Holding Limited Methods and systems for performing gradual decoding refresh processing on pictures
US11812062B2 (en) 2019-12-27 2023-11-07 Bytedance Inc. Syntax for signaling video subpictures
US11936917B2 (en) 2020-01-09 2024-03-19 Bytedance Inc. Processing of filler data units in video streams

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4044598B1 (en) * 2019-11-05 2025-03-05 LG Electronics Inc. Method and device for processing image information for image/video coding
US11395007B2 (en) * 2019-12-12 2022-07-19 Tencent America LLC Method for signaling dependent and independent picture header
CN114902656B (zh) 2019-12-26 2025-03-21 字节跳动有限公司 对编解码比特流中的视频层的信令的约束
JP7425204B2 (ja) 2019-12-26 2024-01-30 バイトダンス インコーポレイテッド ビデオビットストリームにおける仮想参照デコーダパラメータのシグナリングに対する制約
KR102928668B1 (ko) * 2020-03-05 2026-02-20 노키아 테크놀로지스 오와이 혼성 nal 유닛 타입에 기반하는 영상 부호화/복호화 방법, 장치 및 비트스트림을 전송하는 방법
EP4142286A4 (en) * 2020-04-24 2024-01-10 Atins Inc. METHOD AND APPARATUS FOR DECODING VIDEO
KR20250031318A (ko) * 2023-08-28 2025-03-07 가온그룹 주식회사 비디오 부호화 및 복호화를 위한 개선된 단계적 복호화기 갱신 방법 및 장치

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040260827A1 (en) * 2003-06-19 2004-12-23 Nokia Corporation Stream switching based on gradual decoder refresh
US9979958B2 (en) * 2012-04-20 2018-05-22 Qualcomm Incorporated Decoded picture buffer processing for random access point pictures in video sequences
US9860529B2 (en) * 2013-07-16 2018-01-02 Qualcomm Incorporated Processing illumination compensation for video coding
US9854270B2 (en) * 2013-12-19 2017-12-26 Qualcomm Incorporated Device and method for scalable coding of video information
US20150264404A1 (en) * 2014-03-17 2015-09-17 Nokia Technologies Oy Method and apparatus for video coding and decoding
US10306253B2 (en) * 2015-10-14 2019-05-28 Qualcomm Incorporated Signaling of parameter sets in files of multi-layer bitstreams
US10679415B2 (en) * 2017-07-05 2020-06-09 Qualcomm Incorporated Enhanced signaling of regions of interest in container files and video bitstreams
US11418813B2 (en) * 2019-09-20 2022-08-16 Tencent America LLC Signaling of inter layer prediction in video bitstream
EP4074024A4 (en) * 2020-01-09 2023-04-05 ByteDance Inc. SIGNALING THE PRESENCE OF INTER-LAYER REFERENCE IMAGES
US11706428B2 (en) * 2020-04-06 2023-07-18 Tencent America LLC Method for signaling picture header in coded video stream
US11558630B2 (en) * 2020-05-20 2023-01-17 Tencent America LLC Techniques for random access point indication and picture output in coded video stream

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
BROSS ET AL.: "Joint Video Experts Team (JVET) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, 16th Meeting: Geneva", October 2019, article "Versatile Video Coding (Draft 7", pages: 1 - 11
COBAN (QUALCOMM) M ET AL: "AHG9: On picture header IRAP/GDR signalling", no. JVET-Q0154, 28 December 2019 (2019-12-28), XP030222719, Retrieved from the Internet <URL:http://phenix.int-evry.fr/jvet/doc_end_user/documents/17_Brussels/wg11/JVET-Q0154-v1.zip JVET-Q0154.docx> [retrieved on 20191228] *
NISHI (PANASONIC) T ET AL: "AHG9: Unified signalling of PTL and HRD parameters in VPS", no. JVET-Q0047, 18 December 2019 (2019-12-18), XP030222423, Retrieved from the Internet <URL:http://phenix.int-evry.fr/jvet/doc_end_user/documents/17_Brussels/wg11/JVET-Q0047-v1.zip JVET-Q0047_based_on_JVET-P2001-vE.docx> [retrieved on 20191218] *
RYU (SAMSUNG) G ET AL: "Simplified NAL Unit Header and IRAP pictures", no. JVET-L0064, 10 October 2018 (2018-10-10), XP030195329, Retrieved from the Internet <URL:http://phenix.int-evry.fr/jvet/doc_end_user/documents/12_Macao/wg11/JVET-L0064-v5.zip JVET-L0064_r4.docx> [retrieved on 20181010] *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210203971A1 (en) * 2019-12-27 2021-07-01 Alibaba Group Holding Limited Methods and systems for performing gradual decoding refresh processing on pictures
US11758171B2 (en) * 2019-12-27 2023-09-12 Alibaba Group Holding Limited Methods and systems for performing gradual decoding refresh processing on pictures
US11812062B2 (en) 2019-12-27 2023-11-07 Bytedance Inc. Syntax for signaling video subpictures
US12244846B2 (en) 2019-12-27 2025-03-04 Alibaba Group Holding Limited Methods and systems for performing gradual decoding refresh processing on pictures
US11936917B2 (en) 2020-01-09 2024-03-19 Bytedance Inc. Processing of filler data units in video streams
US11956476B2 (en) 2020-01-09 2024-04-09 Bytedance Inc. Constraints on value ranges in video bitstreams
US11968405B2 (en) 2020-01-09 2024-04-23 Bytedance Inc. Signalling of high level syntax indication
US11985357B2 (en) 2020-01-09 2024-05-14 Bytedance Inc. Signalling of the presence of inter-layer reference pictures
US12506903B2 (en) 2020-01-09 2025-12-23 Bytedance Inc. Processing of filler data units in video streams

Also Published As

Publication number Publication date
TW202133616A (zh) 2021-09-01
EP4082207B1 (en) 2024-01-17
CN114846802A (zh) 2022-08-02
US11356705B2 (en) 2022-06-07
EP4082207C0 (en) 2024-01-17
US20210195248A1 (en) 2021-06-24
JP7664253B2 (ja) 2025-04-17
CN114846802B (zh) 2025-06-17
KR20220112785A (ko) 2022-08-11
BR112022011752A2 (pt) 2022-08-30
EP4082207A1 (en) 2022-11-02
TWI868286B (zh) 2025-01-01
PH12022551264A1 (en) 2023-11-20
JP2023517426A (ja) 2023-04-26

Similar Documents

Publication Publication Date Title
EP4082207B1 (en) Picture header intra random access picture and gradual decoder refresh signaling in video coding
KR20230008733A (ko) 비디오 코딩에서의 파라미터 세트 신택스 엘리먼트들 및 변수들
WO2021062225A1 (en) Quantization parameter signaling for joint chroma residual mode in video coding
WO2021055559A1 (en) Subpicture signaling in high-level syntax for video coding
EP3997878B1 (en) Memory constraint for adaptation parameter sets for video coding
EP4186235B1 (en) Deblocking filter parameter signaling
KR20220073755A (ko) 비디오 코딩을 위한 변환 스킵에서 잔차 값들을 위한 코딩 스킴 시그널링
US11785205B2 (en) Shared decoder picture buffer for multiple layers
WO2021237020A1 (en) General constraint information signaling in video coding
WO2021183654A1 (en) Coded video sequence start access unit in video coding
US11825073B2 (en) High level syntax for video with mixed NAL unit types
EP3981154A1 (en) Spatial scalability support in video encoding and decoding
US20210368192A1 (en) Determining whether to code picture header data of pictures of video data in slice headers
EP3987777B1 (en) Decoded picture buffer indexing
EP4111690A1 (en) Signaling constraints and sequence parameter set sharing in video coding
WO2020263916A1 (en) Gradual random access (gra) signalling in video coding
WO2020257557A1 (en) Maximum allowed block size for bdpcm mode

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20839536

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 20227020175

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2022537062

Country of ref document: JP

Kind code of ref document: A

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112022011752

Country of ref document: BR

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2020839536

Country of ref document: EP

Effective date: 20220725

ENP Entry into the national phase

Ref document number: 112022011752

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20220614

WWG Wipo information: grant in national office

Ref document number: 202080087362.2

Country of ref document: CN

WWG Wipo information: grant in national office

Ref document number: 202247030681

Country of ref document: IN