CN104704842B - Hypothetical reference decoder parameter syntax structure - Google Patents

Publication number
CN104704842B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201380051430.XA
Other languages
Chinese (zh)
Other versions
CN104704842A (en)
Inventor
王益魁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/188: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding, the unit being a video data packet, e.g. a network abstraction layer [NAL] unit
    • H04N19/30: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/46: Embedding additional information in the video signal during the compression process
    • H04N19/463: Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
    • H04N19/70: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Error Detection And Correction (AREA)

Abstract

A video encoder signals, in an encoded video bitstream, a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. A video decoder or other device decodes the VPS from the encoded video bitstream and performs an operation using the HRD parameters of at least one of the HRD parameter syntax structures.

Description

Hypothetical reference decoder parameter syntax structure
This application claims the benefit of U.S. Provisional Patent Application No. 61/711,098, filed October 8, 2012, the entire content of which is incorporated herein by reference.
Technical field
The present invention relates to video encoding and video decoding.
Background
Digital video capabilities can be incorporated into a wide range of devices, including digital televisions, digital direct broadcast systems, wireless broadcast systems, personal digital assistants (PDAs), laptop or desktop computers, tablet computers, e-book readers, digital cameras, digital recording devices, digital media players, video gaming devices, video game consoles, cellular or satellite radio telephones, so-called "smartphones," video teleconferencing devices, video streaming devices, and the like. Digital video devices implement video compression techniques, such as those described in the standards defined by MPEG-2, MPEG-4, ITU-T H.263, ITU-T H.264/MPEG-4 Part 10, Advanced Video Coding (AVC), the High Efficiency Video Coding (HEVC) standard presently under development, and extensions of such standards. By implementing such video compression techniques, video devices may transmit, receive, encode, decode, and/or store digital video information more efficiently.
Video compression techniques perform spatial (intra-picture) prediction and/or temporal (inter-picture) prediction to reduce or remove redundancy inherent in video sequences. For block-based video coding, a video slice (i.e., a video frame or a portion of a video frame) may be partitioned into video blocks. Video blocks in an intra-coded (I) slice of a picture are encoded using spatial prediction with respect to reference samples in neighboring blocks in the same picture. Video blocks in an inter-coded (P or B) slice of a picture may use spatial prediction with respect to reference samples in neighboring blocks in the same picture or temporal prediction with respect to reference samples in other reference pictures. Pictures may be referred to as frames, and reference pictures may be referred to as reference frames.
Spatial or temporal prediction results in a predictive block for a block to be coded. Residual data represents pixel differences between the original block to be coded and the predictive block. An inter-coded block is encoded according to a motion vector that points to a block of reference samples forming the predictive block, and the residual data indicates the difference between the coded block and the predictive block. An intra-coded block is encoded according to an intra-coding mode and the residual data. For further compression, the residual data may be transformed from the pixel domain to a transform domain, resulting in residual coefficients, which may then be quantized. The quantized coefficients, initially arranged in a two-dimensional array, may be scanned to produce a one-dimensional vector of coefficients, and entropy coding may be applied to achieve even more compression.
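The quantize-then-scan step described above can be sketched as follows. This is a minimal illustration only: the uniform quantization step, the truncation toward zero, and the diagonal scan order are assumptions chosen for the example and are not specified in the text, and the function names are hypothetical.

```python
from typing import List


def quantize(block: List[List[int]], step: int) -> List[List[int]]:
    """Uniformly quantize coefficients, truncating toward zero (assumed rule)."""
    return [[int(coeff / step) for coeff in row] for row in block]


def diagonal_scan(block: List[List[int]]) -> List[int]:
    """Flatten a 2-D coefficient array into a 1-D vector, one anti-diagonal at a time."""
    n_rows, n_cols = len(block), len(block[0])
    vector = []
    for diagonal in range(n_rows + n_cols - 1):
        for row in range(n_rows):
            col = diagonal - row
            if 0 <= col < n_cols:
                vector.append(block[row][col])
    return vector


# Hypothetical 3x3 block of transform coefficients.
coefficients = [[36, 13, 1], [9, -2, 0], [4, 0, 0]]
scanned = diagonal_scan(quantize(coefficients, step=4))
```

Scanning along diagonals tends to group the nonzero low-frequency coefficients at the start of the vector and the zeros at the end, which is what makes the subsequent entropy coding effective.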
A multi-view coding bitstream may be generated by encoding views, e.g., from multiple perspectives. Some three-dimensional (3D) video standards have been developed that make use of multi-view coding aspects. For example, different views may carry left-eye and right-eye views to support 3D video. Alternatively, some 3D video coding processes may apply so-called multi-view plus depth coding. In multi-view plus depth coding, a 3D video bitstream may contain not only texture view components but also depth view components. For example, each view may comprise one texture view component and one depth view component.
Summary
In general, the present invention describes the signaling of hypothetical reference decoder (HRD) parameters. For example, a video encoder may signal, in a bitstream, a video parameter set (VPS) that includes a plurality of HRD parameter syntax structures, each of which includes a set of one or more HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the bitstream. The common set of HRD parameters is common to all sub-layers of the bitstream. A video decoder or other device may decode the VPS from the bitstream and may perform an operation using the HRD parameters of at least one of the HRD parameter syntax structures.
In one example, the present invention describes a method of decoding video data. The method comprises decoding, from an encoded video bitstream, a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. The method also comprises performing an operation using the HRD parameters of at least one of the HRD parameter syntax structures.
In another example, the present invention describes a video decoding device comprising one or more processors configured to decode, from an encoded video bitstream, a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. The one or more processors are also configured to perform an operation using the HRD parameters of at least one of the HRD parameter syntax structures.
In another example, the present invention describes a video decoding device comprising means for decoding, from an encoded video bitstream, a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of the encoded video bitstream. The video decoding device also comprises means for performing an operation using the HRD parameters of at least one of the HRD parameter syntax structures.
In another example, the present invention describes a computer-readable storage medium that stores instructions that, when executed by a video decoding device, configure the video decoding device to decode, from an encoded video bitstream, a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. When executed, the instructions further configure the video decoding device to perform an operation using the HRD parameters of at least one of the HRD parameter syntax structures.
In another example, the present invention describes a method of encoding video data. The method comprises generating a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of an encoded video bitstream. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. The method also comprises signaling the VPS in the encoded video bitstream.
In another example, the present invention describes a video encoding device comprising one or more processors configured to generate a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of an encoded video bitstream. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. The one or more processors are also configured to signal the VPS in the encoded video bitstream.
In another example, the present invention describes a video encoding device comprising means for generating a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of an encoded video bitstream. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. The video encoding device also comprises means for signaling the VPS in the encoded video bitstream.
In another example, the present invention describes a computer-readable storage medium that stores instructions that, when executed by a video encoding device, configure the video encoding device to generate a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters. For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of an encoded video bitstream. The common set of HRD parameters is common to all sub-layers of the encoded video bitstream. When executed, the instructions further configure the video encoding device to signal the VPS in the encoded video bitstream.
The details of one or more examples of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages will be apparent from the description, the drawings, and the claims.
Brief description of the drawings
Fig. 1 is a block diagram illustrating an example video coding system that may utilize the techniques described in the present invention.
Fig. 2 is a block diagram illustrating an example video encoder that may implement the techniques described in the present invention.
Fig. 3 is a block diagram illustrating an example video decoder that may implement the techniques described in the present invention.
Fig. 4 is a flowchart illustrating an example operation of a video encoder, in accordance with one or more techniques of the present invention.
Fig. 5 is a flowchart illustrating an example operation of a device, in accordance with one or more techniques of the present invention.
Fig. 6 is a flowchart illustrating an example operation of a video encoder, in accordance with one or more techniques of the present invention.
Fig. 7 is a flowchart illustrating an example operation of a device, in accordance with one or more techniques of the present invention.
Fig. 8 is a flowchart illustrating an example operation of a video encoder, in accordance with one or more techniques of the present invention.
Fig. 9 is a flowchart illustrating an example operation of a device, in accordance with one or more techniques of the present invention.
Detailed description
A video encoder may generate a bitstream that includes encoded video data. Because the bitstream includes encoded video data, the bitstream may be referred to herein as an encoded video bitstream. The bitstream may comprise a series of network abstraction layer (NAL) units. The NAL units may include video coding layer (VCL) NAL units and non-VCL NAL units. The VCL NAL units may include coded slices of pictures. A non-VCL NAL unit may include a video parameter set (VPS), a sequence parameter set (SPS), a picture parameter set (PPS), supplemental enhancement information (SEI), or other types of data. A VPS is a syntax structure that may contain syntax elements that apply to zero or more entire coded video sequences. An SPS is a syntax structure that may contain syntax elements that apply to zero or more entire coded video sequences. A single VPS may be applicable to multiple SPSs. A PPS is a syntax structure that may contain syntax elements that apply to zero or more entire coded pictures. A single SPS may be applicable to multiple PPSs.
A device, such as a content delivery network (CDN) device, a media-aware network element (MANE), a video encoder, or a video decoder, may extract a sub-bitstream from the bitstream. The device may perform the sub-bitstream extraction process by removing certain NAL units from the bitstream. The resulting sub-bitstream includes the remaining, non-removed NAL units of the bitstream. In some examples, video data decoded from the sub-bitstream may have a lower frame rate and/or may represent fewer views than the original bitstream.
Video coding standards may include various features to support the sub-bitstream extraction process. For example, the video data of the bitstream may be divided into a set of layers. For each of the layers, data in a lower layer may be decoded without reference to data in any higher layer. A NAL unit encapsulates data of only a single layer. Thus, NAL units encapsulating data of the highest remaining layer of the bitstream may be removed from the bitstream without affecting the decodability of data in the remaining layers of the bitstream. In scalable video coding (SVC), higher layers may include enhancement data that improve the quality of pictures in lower layers (quality scalability), enlarge the spatial format of pictures in lower layers (spatial scalability), or increase the temporal rate of pictures in lower layers (temporal scalability). In multi-view coding (MVC) and three-dimensional video (3DV) coding, higher layers may include additional views.
Each NAL unit may include a header and a payload. The header of a NAL unit may include a nuh_reserved_zero_6bits syntax element. The nuh_reserved_zero_6bits syntax element of a NAL unit is equal to 0 if the NAL unit relates to a base layer in MVC, 3DV coding, or SVC. Data in the base layer of a bitstream may be decoded without reference to data in any other layer of the bitstream. If the NAL unit does not relate to a base layer in MVC, 3DV, or SVC, the nuh_reserved_zero_6bits syntax element may have other, nonzero values. Specifically, if a NAL unit does not relate to a base layer in MVC, 3DV, or SVC, the nuh_reserved_zero_6bits syntax element of the NAL unit specifies a layer identifier that identifies the layer associated with the NAL unit.
Furthermore, some pictures within a layer may be decoded without reference to other pictures within the same layer. Thus, NAL units encapsulating data of certain pictures of a layer may be removed from the bitstream without affecting the decodability of other pictures in the layer. For example, pictures with even picture order count (POC) values may be decodable without reference to pictures with odd POC values. Removing NAL units encapsulating data of such pictures may reduce the frame rate of the bitstream. A subset of pictures within a layer that may be decoded without reference to other pictures within the layer may be referred to herein as a "sub-layer" or a "temporal sub-layer."
NAL units may include nuh_temporal_id_plus1 syntax elements. The nuh_temporal_id_plus1 syntax element of a NAL unit may specify a temporal identifier of the NAL unit. If the temporal identifier of a first NAL unit is less than the temporal identifier of a second NAL unit, the data encapsulated by the first NAL unit may be decoded without reference to the data encapsulated by the second NAL unit.
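As an illustration of how these two header fields might be read, the following sketch parses a NAL unit header. It assumes the two-byte layout used in HEVC drafts of this period (a 1-bit forbidden_zero_bit, a 6-bit nal_unit_type, the 6-bit nuh_reserved_zero_6bits, and the 3-bit nuh_temporal_id_plus1); the layout is not stated in the text above, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NalHeader:
    nal_unit_type: int
    layer_id: int      # value of nuh_reserved_zero_6bits (0 for the base layer)
    temporal_id: int   # nuh_temporal_id_plus1 minus 1


def parse_nal_header(data: bytes) -> NalHeader:
    """Parse an assumed two-byte NAL unit header (layout is an assumption)."""
    if len(data) < 2:
        raise ValueError("a NAL unit header needs two bytes")
    word = (data[0] << 8) | data[1]
    if word >> 15:
        raise ValueError("forbidden_zero_bit must be 0")
    nal_unit_type = (word >> 9) & 0x3F
    layer_id = (word >> 3) & 0x3F
    temporal_id_plus1 = word & 0x07
    if temporal_id_plus1 == 0:
        raise ValueError("nuh_temporal_id_plus1 must not be 0")
    return NalHeader(nal_unit_type, layer_id, temporal_id_plus1 - 1)


# Header bytes for type 1, layer identifier 2, nuh_temporal_id_plus1 equal to 3.
header = parse_nal_header(bytes([0x02, 0x13]))
```

Storing the temporal identifier as "plus 1" keeps the last header byte from being all zeros, which is why the parser subtracts one to recover the identifier itself.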
Each operating point of a bitstream is associated with a set of layer identifiers (i.e., a set of nuh_reserved_zero_6bits values) and a temporal identifier. The set of layer identifiers may be denoted OpLayerIdSet, and the temporal identifier may be denoted TemporalID. If the layer identifier of a NAL unit is in the set of layer identifiers of an operating point, and the temporal identifier of the NAL unit is less than or equal to the temporal identifier of the operating point, the NAL unit is associated with the operating point. An operating point representation is the bitstream subset (i.e., sub-bitstream) associated with an operating point. The operating point representation of an operating point may include each NAL unit that is associated with the operating point. The operating point representation does not include VCL NAL units that are not associated with the operating point.
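The association rule above can be sketched as a simple filter. This is an illustration only: the `NalUnit` type and both function names are hypothetical, and the sketch drops every non-associated NAL unit, whereas the text only requires that non-associated VCL NAL units be excluded.

```python
from typing import Iterable, List, NamedTuple, Set


class NalUnit(NamedTuple):
    layer_id: int       # value of nuh_reserved_zero_6bits
    temporal_id: int    # nuh_temporal_id_plus1 minus 1
    is_vcl: bool
    payload: bytes = b""


def is_associated(nal: NalUnit, op_layer_id_set: Set[int], op_temporal_id: int) -> bool:
    """A NAL unit belongs to an operating point if its layer is in OpLayerIdSet
    and its temporal identifier does not exceed the operating point's TemporalID."""
    return nal.layer_id in op_layer_id_set and nal.temporal_id <= op_temporal_id


def extract_operating_point(
    bitstream: Iterable[NalUnit], op_layer_id_set: Set[int], op_temporal_id: int
) -> List[NalUnit]:
    """Return the operating point representation (a sub-bitstream), order preserved."""
    return [n for n in bitstream if is_associated(n, op_layer_id_set, op_temporal_id)]


# Hypothetical bitstream: two layers, two temporal sub-layers.
bitstream = [
    NalUnit(0, 0, True), NalUnit(0, 1, True),
    NalUnit(1, 0, True), NalUnit(1, 1, True),
]
base_low_rate = extract_operating_point(bitstream, {0}, 0)
```

A CDN device or MANE could apply such a filter to forward only the base layer at the lowest temporal rate, which reduces the bit rate sent to the client.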
An external source may specify a set of target layer identifiers for an operating point. For example, a content delivery network (CDN) device may specify the set of target layer identifiers. In this example, the CDN device may use the set of target layer identifiers to identify an operating point. The CDN device may then extract the operating point representation of the operating point and forward the operating point representation, instead of the original bitstream, to a client device. Extracting the operating point representation and forwarding it to the client device may reduce the bit rate of the bitstream.
Furthermore, video coding standards specify video buffering models. A video buffering model may also be referred to as a "hypothetical reference decoder" or "HRD." The HRD describes how data is to be buffered for decoding and how decoded data is buffered for output. For example, the HRD describes the operation of a coded picture buffer ("CPB") and a decoded picture buffer ("DPB") in a video decoder. The CPB is a first-in first-out buffer containing access units in the decoding order specified by the HRD. The DPB is a buffer that holds decoded pictures for reference, output reordering, or output delay as specified by the HRD.
A video encoder may signal a set of HRD parameters. The HRD parameters control various aspects of the HRD. The HRD parameters may include an initial CPB removal delay, a CPB size, a bit rate, an initial DPB output delay, and a DPB size. These HRD parameters may be coded in a hrd_parameters() syntax structure specified in a VPS and/or an SPS. The HRD parameters may also be specified in buffering period SEI messages or picture timing SEI messages.
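To make the role of the CPB-related parameters concrete, the sketch below checks a sequence of access units against a heavily simplified buffer model. It is not the normative HRD: it assumes bits arrive at a constant bit rate from time zero, that access units are removed instantaneously at a fixed frame interval after the initial removal delay, and that the function and parameter names are hypothetical.

```python
from typing import Sequence


def check_cpb(
    au_sizes_bits: Sequence[int],
    bit_rate: float,               # bits per second entering the CPB
    cpb_size_bits: int,            # CPB capacity
    initial_removal_delay: float,  # seconds before the first access unit is removed
    frame_rate: float,             # access units removed per second
) -> str:
    """Return "ok", or describe the first overflow or underflow (simplified model)."""
    total_bits = sum(au_sizes_bits)
    removed_bits = 0
    for index, size in enumerate(au_sizes_bits):
        removal_time = initial_removal_delay + index / frame_rate
        arrived_bits = min(bit_rate * removal_time, total_bits)
        fullness = arrived_bits - removed_bits  # peak fullness, just before removal
        if fullness > cpb_size_bits:
            return f"overflow at access unit {index}"
        if fullness < size:
            return f"underflow at access unit {index}"
        removed_bits += size
    return "ok"


# Hypothetical stream: three 1000-bit access units at 30 frames per second.
result = check_cpb([1000, 1000, 1000], bit_rate=30000, cpb_size_bits=4000,
                   initial_removal_delay=0.1, frame_rate=30)
```

With the same stream, an initial removal delay that is too short leads to underflow (the first access unit has not fully arrived), and a CPB that is too small leads to overflow, which is why these values have to match the sub-bitstream actually being decoded.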
As explained above, an operating point representation may have a different frame rate and/or bit rate than the original bitstream. This is because the operating point representation may not include some pictures and/or some of the data of the original bitstream. Hence, if the video decoder were to remove data from the CPB and/or the DPB at a particular rate when processing the original bitstream, and were to remove data from the CPB and/or the DPB at the same rate when processing an operating point representation, the video decoder might remove too much or too little data from the CPB and/or the DPB. Accordingly, the video encoder may signal different sets of HRD parameters for different operating points. In the emerging High Efficiency Video Coding (HEVC) standard, the video encoder may signal sets of HRD parameters in a VPS, or the video encoder may signal sets of HRD parameters in an SPS.
A set of HRD parameters optionally includes a set of information that is common to all temporal sub-layers. A temporal sub-layer is a temporal scalable layer of a temporal scalable bitstream consisting of VCL NAL units with a particular temporal identifier and the associated non-VCL NAL units. In addition to the set of common information, a set of HRD parameters may include a set of syntax elements that are specific to individual temporal sub-layers. Because the set of common information is common to multiple sets of HRD parameters, it may be unnecessary to signal the set of common information in multiple sets of HRD parameters. In some proposals for HEVC, the common information may be present in a set of HRD parameters when the set of HRD parameters is the first set of HRD parameters in a VPS, or when the set of HRD parameters is associated with the first operating point.
However, when there are multiple sets of HRD parameters in a VPS, it may be desirable to have multiple different sets of common information for the sets of HRD parameters. This may be especially true when there are large numbers of HRD parameter syntax structures in a VPS. Thus, it may be desirable to have sets of common information in HRD parameter syntax structures other than the first HRD parameter syntax structure.
The techniques of the present invention provide a design that allows the common information of HRD parameter syntax structures to be explicitly signaled for any HRD parameter syntax structure. In other words, the techniques of the present invention allow the information that is common to all sub-layers to be explicitly signaled for any hrd_parameters() syntax structure. This may improve coding efficiency.
Thus, in accordance with one or more techniques of the present invention, a device, such as a video decoder or other device, may determine, based at least in part on a syntax element in a VPS that includes a plurality of HRD parameter syntax structures, whether a particular HRD parameter syntax structure in the VPS includes a set of HRD parameters that are common to each sub-layer of a bitstream. The device may decode the syntax element from the VPS. One or more HRD parameter syntax structures may occur in coding order in the VPS before the particular HRD parameter syntax structure. In response to determining that the particular HRD parameter syntax structure includes the set of HRD parameters that are common to each sub-layer of the bitstream, the device may perform an operation using the particular HRD parameter syntax structure, including the set of HRD parameters that are common to each sub-layer of the bitstream.
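A decoder-side view of this per-structure flag might look like the sketch below. The names `HrdParameters`, `common_present`, and `resolve_common_sets` are hypothetical, the parameter dictionaries are placeholders, and the rule that a structure without its own common set reuses the one from the preceding structure is an assumption made for the example; the passage above does not state how such a structure obtains its common information.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class HrdParameters:
    # Hypothetical name for the per-structure syntax element signaled in the VPS.
    common_present: bool
    common: Optional[Dict[str, int]] = None                       # common to all sub-layers
    sub_layers: List[Dict[str, int]] = field(default_factory=list)  # sub-layer-specific info


def resolve_common_sets(structures: List[HrdParameters]) -> List[Dict[str, int]]:
    """Return the common set that applies to each structure, in VPS order.

    Assumption: a structure whose flag is 0 inherits the most recent common set.
    """
    resolved = []
    current = None
    for index, hrd in enumerate(structures):
        if hrd.common_present:
            if hrd.common is None:
                raise ValueError(f"structure {index} signals a common set but has none")
            current = hrd.common
        elif current is None:
            raise ValueError(f"structure {index} has no earlier common set to reuse")
        resolved.append(current)
    return resolved


# Hypothetical VPS with four structures; the third carries a new common set.
vps_hrd = [
    HrdParameters(True, {"bit_rate_scale": 1}),
    HrdParameters(False),
    HrdParameters(True, {"bit_rate_scale": 2}),
    HrdParameters(False),
]
common_sets = resolve_common_sets(vps_hrd)
```

Under this reading, only structures whose common information actually changes pay the cost of signaling it, while any structure, not just the first, is allowed to carry one.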
Furthermore, a video encoder may generate scalable nesting SEI messages. A scalable nesting SEI message contains one or more SEI messages. SEI messages nested in a scalable nesting SEI message may include HRD parameters or other information associated with an operating point. Some proposals for HEVC do not allow one SEI message to apply to multiple operating points. This may decrease bit rate efficiency because it may cause a video encoder to signal multiple SEI messages containing the same information. Accordingly, the techniques of the present invention allow one SEI message to apply to multiple operating points. For example, a scalable nesting SEI message may include syntax elements that specify multiple operating points applicable to the SEI messages nested within the scalable nesting SEI message.
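The saving can be illustrated with a small data model in which a single nesting message lists every operating point its nested messages apply to. All type and function names here are hypothetical, and nested messages are represented as plain strings for brevity.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Tuple


@dataclass(frozen=True)
class OperatingPoint:
    layer_ids: FrozenSet[int]
    temporal_id: int


@dataclass(frozen=True)
class ScalableNestingSei:
    operating_points: Tuple[OperatingPoint, ...]  # every operating point the nested messages apply to
    nested_messages: Tuple[str, ...]              # placeholder for nested SEI payloads


def messages_for(nestings: Iterable[ScalableNestingSei], op: OperatingPoint) -> List[str]:
    """Collect the nested SEI messages applicable to one operating point."""
    found = []
    for nesting in nestings:
        if op in nesting.operating_points:
            found.extend(nesting.nested_messages)
    return found


op_a = OperatingPoint(frozenset({0}), 0)
op_b = OperatingPoint(frozenset({0, 1}), 1)
# One nesting message serves both operating points instead of two duplicates.
nestings = [ScalableNestingSei((op_a, op_b), ("buffering_period",))]
for_a = messages_for(nestings, op_a)
for_b = messages_for(nestings, op_b)
```

Both operating points resolve to the same nested message even though it is signaled only once.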
In addition, like other types of NAL units, an SEI NAL unit includes a NAL unit header and a NAL unit body. The NAL unit body of an SEI NAL unit may include an SEI message, such as a scalable nesting SEI message or another type of SEI message. Like other NAL units, the NAL unit header of an SEI NAL unit may include a nuh_reserved_zero_6bits syntax element and a nuh_temporal_id_plus1 syntax element. However, in some proposals for HEVC, the nuh_reserved_zero_6bits syntax element and/or the nuh_temporal_id_plus1 syntax element of the NAL unit header of an SEI NAL unit are not used to determine the operating point applicable to the SEI message (or SEI messages) encapsulated by the SEI NAL unit. These syntax elements of the SEI NAL unit header could nonetheless be reused to reduce the number of signaled bits. Hence, in accordance with the techniques of the present invention, a syntax element may be signaled in a scalable nesting SEI message to indicate whether the operating point applicable to the nested SEI messages in an SEI NAL unit is the operating point indicated by the layer identification information in the NAL unit header of the SEI NAL unit. The layer identification information in the NAL unit header of an SEI NAL unit may include the nuh_reserved_zero_6bits value and the nuh_temporal_id_plus1 value of the NAL unit header.
Fig. 1 is a block diagram illustrating an example video coding system 10 that may utilize the techniques of the present invention. As used herein, the term "video coder" refers generically to both video encoders and video decoders. In the present invention, the terms "video coding" or "coding" may refer generically to video encoding or video decoding.
As shown in Fig. 1, video coding system 10 includes a source device 12 and a destination device 14. Source device 12 generates encoded video data. Accordingly, source device 12 may be referred to as a video encoding device or a video encoding apparatus. Destination device 14 may decode the encoded video data generated by source device 12. Accordingly, destination device 14 may be referred to as a video decoding device or a video decoding apparatus. Source device 12 and destination device 14 may be examples of video coding devices or video coding apparatuses.
Source device 12 and destination device 14 may comprise a wide range of devices, including desktop computers, mobile computing devices, notebook (e.g., laptop) computers, tablet computers, set-top boxes, telephone handsets such as so-called "smart" phones, televisions, cameras, display devices, digital media players, video gaming consoles, in-car computers, or the like.
Destination device 14 may receive encoded video data from source device 12 via a channel 16. Channel 16 may comprise one or more media or devices capable of moving the encoded video data from source device 12 to destination device 14. In one example, channel 16 may comprise one or more communication media that enable source device 12 to transmit encoded video data directly to destination device 14 in real time. In this example, source device 12 may modulate the encoded video data according to a communication standard, such as a wireless communication protocol, and may transmit the modulated video data to destination device 14. The one or more communication media may include wireless and/or wired communication media, such as a radio frequency (RF) spectrum or one or more physical transmission lines. The one or more communication media may form part of a packet-based network, such as a local area network, a wide-area network, or a global network (e.g., the Internet). The one or more communication media may include routers, switches, base stations, or other equipment that facilitates communication from source device 12 to destination device 14.
In another example, channel 16 may include a storage medium that stores encoded video data generated by source device 12. In this example, destination device 14 may access the storage medium, e.g., via disk access or card access. The storage medium may include a variety of locally accessed data storage media such as Blu-ray discs, DVDs, CD-ROMs, flash memory, or other suitable digital storage media for storing encoded video data.
In a further example, channel 16 may include a file server or another intermediate storage device that stores encoded video data generated by source device 12. In this example, destination device 14 may access encoded video data stored at the file server or other intermediate storage device via streaming or download. The file server may be a type of server capable of storing encoded video data and transmitting the encoded video data to destination device 14. Example file servers include web servers (e.g., for a website), file transfer protocol (FTP) servers, network attached storage (NAS) devices, and local disk drives.
Destination device 14 may access the encoded video data through a standard data connection, such as an Internet connection. Example types of data connections may include wireless channels (e.g., Wi-Fi connections), wired connections (e.g., DSL, cable modem, etc.), or combinations of both that are suitable for accessing encoded video data stored on a file server. The transmission of encoded video data from the file server may be a streaming transmission, a download transmission, or a combination of both.
The techniques of the present invention are not limited to wireless applications or settings. The techniques may be applied to video coding in support of a variety of multimedia applications, such as over-the-air television broadcasts, cable television transmissions, satellite television transmissions, streaming video transmissions, e.g., via the Internet, encoding of video data for storage on a data storage medium, decoding of video data stored on a data storage medium, or other applications. In some examples, video coding system 10 may be configured to support one-way or two-way video transmission to support applications such as video streaming, video playback, video broadcasting, and/or video telephony.
Fig. 1 is merely an example, and the techniques of the present invention may apply to video coding settings (e.g., video encoding or video decoding) that do not necessarily include any data communication between the encoding and decoding devices. In other examples, data is retrieved from a local memory, streamed over a network, or the like. A video encoding device may encode and store data to memory, and/or a video decoding device may retrieve and decode data from memory. In many examples, the encoding and decoding is performed by devices that do not communicate with one another, but simply encode data to memory and/or retrieve and decode data from memory.
In the example of Fig. 1, source device 12 includes a video source 18, a video encoder 20, and an output interface 22. In some examples, output interface 22 may include a modulator/demodulator (modem) and/or a transmitter. Video source 18 may include a video capture device such as a video camera, a video archive containing previously captured video data, a video feed interface to receive video data from a video content provider, and/or a computer graphics system for generating video data, or a combination of such sources of video data.
Video encoder 20 may encode video data from video source 18. In some examples, source device 12 directly transmits the encoded video data to destination device 14 via output interface 22. In other examples, the encoded video data may also be stored onto a storage medium or a file server for later access by destination device 14 for decoding and/or playback.
In the example of Fig. 1, destination device 14 includes an input interface 28, a video decoder 30, and a display device 32. In some examples, input interface 28 includes a receiver and/or a modem. Input interface 28 may receive encoded video data over channel 16. Display device 32 may be integrated with or may be external to destination device 14. In general, display device 32 displays decoded video data. Display device 32 may comprise a variety of display devices, such as a liquid crystal display (LCD), a plasma display, an organic light emitting diode (OLED) display, or another type of display device.
Video encoder 20 and video decoder 30 each may be implemented as any of a variety of suitable circuitry, such as one or more microprocessors, digital signal processors (DSPs), application-specific integrated circuits (ASICs), field-programmable gate arrays (FPGAs), discrete logic, hardware, or any combinations thereof. When the techniques are implemented partially in software, a device may store instructions for the software in a suitable, non-transitory computer-readable storage medium and may execute the instructions in hardware using one or more processors to perform the techniques of the present invention. Any of the foregoing (including hardware, software, a combination of hardware and software, etc.) may be considered to be one or more processors. Each of video encoder 20 and video decoder 30 may be included in one or more encoders or decoders, either of which may be integrated as part of a combined encoder/decoder (CODEC) in a respective device.
The present invention may generally refer to video encoder 20 "signaling" certain information to another device, such as video decoder 30. The term "signaling" may generally refer to the communication of syntax elements and/or other data used to decode the compressed video data. Such communication may occur in real time or near real time. Alternatively, such communication may occur over a span of time, such as might occur when syntax elements are stored to a computer-readable storage medium in an encoded bitstream at the time of encoding; the syntax elements then may be retrieved by a decoding device at any time after being stored to this medium.
In some examples, video encoder 20 and video decoder 30 operate according to a video compression standard, such as ISO/IEC MPEG-4 Visual and ITU-T H.264 (also known as ISO/IEC MPEG-4 AVC), including its Scalable Video Coding (SVC) extension, Multiview Video Coding (MVC) extension, and MVC-based 3DV extension. In some instances, any bitstream conforming to MVC-based 3DV always contains a sub-bitstream that is compliant with an MVC profile, e.g., the stereo high profile. Furthermore, there is an ongoing effort to generate a three-dimensional video (3DV) coding extension to H.264/AVC, namely AVC-based 3DV. In other examples, video encoder 20 and video decoder 30 may operate according to ITU-T H.261, ISO/IEC MPEG-1 Visual, ITU-T H.262 or ISO/IEC MPEG-2 Visual, and ITU-T H.264, ISO/IEC Visual.
In other examples, video encoder 20 and video decoder 30 may operate according to the High Efficiency Video Coding (HEVC) standard presently under development by the Joint Collaborative Team on Video Coding (JCT-VC) of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Motion Picture Experts Group (MPEG). A draft of the upcoming HEVC standard, referred to as "HEVC Working Draft 8," is described in Bross et al., "High Efficiency Video Coding (HEVC) text specification draft 8," Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 10th Meeting, Stockholm, Sweden, July 2012, which as of June 13, 2013 is available from http://phenix.int-evry.fr/jct/doc_end_user/documents/10_Stockholm/wg11/JCTVC-J1003-v8.zip. Another draft of the upcoming HEVC standard, referred to as "HEVC Working Draft 9," is described in Bross et al., "High Efficiency Video Coding (HEVC) text specification draft 9," Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 11th Meeting, Shanghai, China, October 2012, which as of June 13, 2013 is available from http://phenix.int-evry.fr/jct/doc_end_user/documents/11_Shanghai/wg11/JCTVC-K1003-v13.zip. Furthermore, there are ongoing efforts to produce SVC, MVC, and 3DV extensions for HEVC. The 3DV extension of HEVC may be referred to as HEVC-based 3DV or HEVC-3DV.
In HEVC and other video coding standards, a video sequence typically includes a series of pictures. Pictures may also be referred to as "frames." A picture may include three sample arrays, denoted S_L, S_Cb, and S_Cr. S_L is a two-dimensional array (i.e., a block) of luma samples. S_Cb is a two-dimensional array of Cb chrominance samples. S_Cr is a two-dimensional array of Cr chrominance samples. Chrominance samples may also be referred to herein as "chroma" samples. In other instances, a picture may be monochrome and may only include an array of luma samples.
To generate an encoded representation of a picture, video encoder 20 may generate a set of coding tree units (CTUs). Each of the CTUs may be a coding tree block of luma samples, two corresponding coding tree blocks of chroma samples, and syntax structures used to code the samples of the coding tree blocks. A coding tree block may be an N×N block of samples. A CTU may also be referred to as a "tree block" or a "largest coding unit" (LCU). The CTUs of HEVC may be broadly analogous to the macroblocks of other standards, such as H.264/AVC. However, a CTU is not necessarily limited to a particular size and may include one or more coding units (CUs). A slice may include an integer number of CTUs ordered consecutively in the raster scan.
To generate a coded CTU, video encoder 20 may recursively perform quad-tree partitioning on the coding tree blocks of a CTU to divide the coding tree blocks into coding blocks, hence the name "coding tree units." A coding block is an N×N block of samples. A CU may be a coding block of luma samples and two corresponding coding blocks of chroma samples of a picture that has a luma sample array, a Cb sample array, and a Cr sample array, and syntax structures used to code the samples of the coding blocks. Video encoder 20 may partition a coding block of a CU into one or more prediction blocks. A prediction block may be a rectangular (i.e., square or non-square) block of samples on which the same prediction is applied. A prediction unit (PU) of a CU may be a prediction block of luma samples, two corresponding prediction blocks of chroma samples of a picture, and syntax structures used to predict the prediction block samples. Video encoder 20 may generate predictive luma, Cb, and Cr blocks for the luma, Cb, and Cr prediction blocks of each PU of the CU.
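The recursive quad-tree partitioning described above can be sketched as follows. This is a minimal illustration only: the function name quadtree_partition, the should_split callback (standing in for whatever split decision an encoder makes), and the min_size parameter are hypothetical and are not taken from HEVC.

```python
def quadtree_partition(x, y, size, min_size, should_split):
    """Recursively split a square block into four equal quadrants.

    Returns the leaf blocks as (x, y, size) tuples. `should_split` is a
    caller-supplied decision function; a real encoder would typically base
    that decision on rate-distortion cost.
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        leaves = []
        for dy in (0, half):          # top row of quadrants, then bottom row
            for dx in (0, half):      # left quadrant, then right quadrant
                leaves.extend(
                    quadtree_partition(x + dx, y + dy, half, min_size, should_split)
                )
        return leaves
    return [(x, y, size)]
```

For instance, splitting only the 64×64 root yields four 32×32 coding blocks, and the areas of the leaves always sum to the area of the original coding tree block.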
Video encoder 20 may use intra prediction or inter prediction to generate the predictive blocks for a PU. If video encoder 20 uses intra prediction to generate the predictive blocks of a PU, video encoder 20 may generate the predictive blocks of the PU based on decoded samples of the picture associated with the PU.
If video encoder 20 uses inter prediction to generate the predictive blocks of a PU, video encoder 20 may generate the predictive blocks of the PU based on decoded samples of one or more pictures other than the picture associated with the PU. Video encoder 20 may use uni-prediction or bi-prediction to generate the predictive blocks of a PU. When video encoder 20 uses uni-prediction to generate the predictive blocks for a PU, the PU may have a single motion vector. When video encoder 20 uses bi-prediction to generate the predictive blocks for a PU, the PU may have two motion vectors.
After video encoder 20 generates predictive luma, Cb, and Cr blocks for one or more PUs of a CU, video encoder 20 may generate a luma residual block for the CU. Each sample in the CU's luma residual block indicates a difference between a luma sample in one of the CU's predictive luma blocks and a corresponding sample in the CU's original luma coding block. In addition, video encoder 20 may generate a Cb residual block for the CU. Each sample in the CU's Cb residual block may indicate a difference between a Cb sample in one of the CU's predictive Cb blocks and a corresponding sample in the CU's original Cb coding block. Video encoder 20 may also generate a Cr residual block for the CU. Each sample in the CU's Cr residual block may indicate a difference between a Cr sample in one of the CU's predictive Cr blocks and a corresponding sample in the CU's original Cr coding block.
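As a simple illustration of the sample-wise difference described above, the following sketch computes a residual block from an original block and a predictive block held as nested lists. The name residual_block and the list-of-rows representation are assumptions made for this example; the sign convention (original minus prediction) is the usual one but is likewise assumed here.

```python
def residual_block(original, predictive):
    """Return original minus prediction, sample by sample.

    Both arguments are lists of rows of integer sample values and must have
    the same dimensions.
    """
    if len(original) != len(predictive):
        raise ValueError("blocks must have the same number of rows")
    return [
        [o - p for o, p in zip(orig_row, pred_row)]
        for orig_row, pred_row in zip(original, predictive)
    ]
```

The same function applies unchanged to luma, Cb, and Cr blocks, since each residual is formed independently per component.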
Furthermore, video encoder 20 may use quad-tree partitioning to decompose the luma, Cb, and Cr residual blocks of a CU into one or more luma, Cb, and Cr transform blocks. A transform block may be a rectangular block of samples on which the same transform is applied. A transform unit (TU) of a CU may be a transform block of luma samples, two corresponding transform blocks of chroma samples, and syntax structures used to transform the transform block samples. Thus, each TU of a CU may be associated with a luma transform block, a Cb transform block, and a Cr transform block. The luma transform block associated with the TU may be a sub-block of the CU's luma residual block. The Cb transform block may be a sub-block of the CU's Cb residual block. The Cr transform block may be a sub-block of the CU's Cr residual block.
Video encoder 20 may apply one or more transforms to a luma transform block of a TU to generate a luma coefficient block for the TU. A coefficient block may be a two-dimensional array of transform coefficients. A transform coefficient may be a scalar quantity. Video encoder 20 may apply one or more transforms to a Cb transform block of a TU to generate a Cb coefficient block for the TU. Video encoder 20 may apply one or more transforms to a Cr transform block of a TU to generate a Cr coefficient block for the TU.
After generating a coefficient block (e.g., a luma coefficient block, a Cb coefficient block, or a Cr coefficient block), video encoder 20 may quantize the coefficient block. Quantization generally refers to a process in which transform coefficients are quantized to possibly reduce the amount of data used to represent the transform coefficients, providing further compression. After video encoder 20 quantizes a coefficient block, video encoder 20 may entropy encode syntax elements indicating the quantized transform coefficients. For example, video encoder 20 may perform Context-Adaptive Binary Arithmetic Coding (CABAC) on the syntax elements indicating the quantized transform coefficients. Video encoder 20 may output the entropy-encoded syntax elements in a bitstream.
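To make the data reduction concrete, the following sketch applies a plain uniform scalar quantizer to a coefficient block. This is not the quantizer defined by HEVC; the single step parameter and the function names quantize and dequantize are assumptions chosen only to show how quantization maps coefficients to a smaller set of levels and how a decoder would scale them back.

```python
def quantize(coefficients, step):
    """Map each coefficient to the nearest multiple of `step`, as a level index.

    Magnitudes are rounded half up and the sign is restored afterwards.
    """
    def level(c):
        magnitude = (abs(c) + step // 2) // step
        return magnitude if c >= 0 else -magnitude

    return [[level(c) for c in row] for row in coefficients]


def dequantize(levels, step):
    """Inverse of `quantize` up to the rounding error introduced by it."""
    return [[lv * step for lv in row] for row in levels]
```

With a step of 10, the coefficients 100, -37, 4, and 0 become the levels 10, -4, 0, and 0. Small coefficients collapse to zero, which is where most of the compression comes from, and the difference between the original and dequantized values is the quantization error.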
Video encoder 20 may output a bitstream that includes the entropy-encoded syntax elements. The bitstream may include a sequence of bits that forms a representation of coded pictures and associated data. The bitstream may comprise a sequence of network abstraction layer (NAL) units. Each of the NAL units includes a NAL unit header and encapsulates a raw byte sequence payload (RBSP). The NAL unit header may include a syntax element that indicates a NAL unit type code. The NAL unit type code specified by the NAL unit header of a NAL unit indicates the type of the NAL unit. An RBSP may be a syntax structure containing an integer number of bytes that is encapsulated within a NAL unit. In some instances, an RBSP includes zero bits.
Different types of NAL units may encapsulate different types of RBSPs. For example, a first type of NAL unit may encapsulate an RBSP for a picture parameter set (PPS), a second type of NAL unit may encapsulate an RBSP for a coded slice, a third type of NAL unit may encapsulate an RBSP for SEI, and so on. NAL units that encapsulate RBSPs for video coding data (as opposed to RBSPs for parameter sets and SEI messages) may be referred to as video coding layer (VCL) NAL units.
Video decoder 30 may receive a bitstream generated by video encoder 20. In addition, video decoder 30 may parse the bitstream to decode syntax elements from the bitstream. Video decoder 30 may reconstruct the pictures of the video data based at least in part on the syntax elements decoded from the bitstream. The process to reconstruct the video data may be generally reciprocal to the process performed by video encoder 20. For instance, video decoder 30 may use motion vectors of PUs to determine predictive blocks for the PUs of a current CU. In addition, video decoder 30 may inverse quantize transform coefficient blocks associated with TUs of the current CU. Video decoder 30 may perform inverse transforms on the transform coefficient blocks to reconstruct transform blocks associated with the TUs of the current CU. Video decoder 30 may reconstruct the coding blocks of the current CU by adding the samples of the predictive blocks for PUs of the current CU to corresponding samples of the transform blocks of the TUs of the current CU. By reconstructing the coding blocks for each CU of a picture, video decoder 30 may reconstruct the picture.
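The final reconstruction step, adding predictive samples to the corresponding samples of the reconstructed transform (residual) block, can be sketched as follows. Clipping the sum to the valid sample range for a given bit depth is an assumption added for this example; the name reconstruct_block and the default bit depth of 8 are likewise hypothetical.

```python
def reconstruct_block(predictive, residual, bit_depth=8):
    """Add residual samples to predictive samples and clip to the sample range."""
    max_value = (1 << bit_depth) - 1
    return [
        [min(max(p + r, 0), max_value) for p, r in zip(pred_row, res_row)]
        for pred_row, res_row in zip(predictive, residual)
    ]
```

Because quantization is lossy, the residual available to the decoder generally differs slightly from the one the encoder computed, so the reconstructed block approximates rather than equals the original.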
In multi-view coding, there may be multiple views of the same scene from different viewpoints. The term "access unit" is used to refer to the set of pictures that correspond to the same time instance. Thus, video data may be conceptualized as a series of access units occurring over time. A "view component" may be a coded representation of a view in a single access unit. In the present invention, a "view" may refer to a sequence of view components associated with the same view identifier.
Multi-view coding supports inter-view prediction. Inter-view prediction is similar to the inter prediction used in HEVC and may use the same syntax elements. However, when a video coder performs inter-view prediction on a current video unit (such as a PU), video encoder 20 may use, as a reference picture, a picture that is in the same access unit as the current video unit, but in a different view. In contrast, conventional inter prediction only uses pictures in different access units as reference pictures.
In multi-view coding, a view may be referred to as a "base view" if a video decoder (e.g., video decoder 30) can decode pictures in the view without reference to pictures in any other view. When coding a picture in one of the non-base views, a video coder (such as video encoder 20 or video decoder 30) may add a picture into a reference picture list if the picture is in a different view but within the same time instance (i.e., access unit) as the picture that the video coder is currently coding. Like other inter prediction reference pictures, the video coder may insert an inter-view prediction reference picture at any position of a reference picture list.
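A minimal sketch of the reference picture list handling described above follows. Pictures are represented as dictionaries with hypothetical "view_id" and "time" keys, and the function names are invented for this illustration; no particular list construction process from a standard is implied.

```python
def inter_view_candidates(current, decoded_pictures):
    """Pictures in the same access unit (same time instance) but another view."""
    return [
        p for p in decoded_pictures
        if p["time"] == current["time"] and p["view_id"] != current["view_id"]
    ]


def build_reference_list(temporal_refs, inter_view_refs, insert_at):
    """Insert inter-view reference pictures at a chosen position in the list."""
    ref_list = list(temporal_refs)
    ref_list[insert_at:insert_at] = inter_view_refs
    return ref_list
```

The insert_at argument reflects that an inter-view reference picture may be placed at any position of the list, just like any other inter prediction reference picture.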
Video coding standards specify video buffering models. In H.264/AVC and HEVC, the buffering model is referred to as a "hypothetical reference decoder" or "HRD." In HEVC Working Draft 8, the HRD is described in Annex C.
The HRD describes how data is to be buffered for decoding and how decoded data is buffered for output. For instance, the HRD describes the operation of a coded picture buffer ("CPB"), a decoded picture buffer ("DPB"), and a video decoding process. The CPB is a first-in first-out buffer containing access units in the decoding order specified by the HRD. The DPB is a buffer holding decoded pictures for reference, output reordering, or output delay as specified by the HRD. The behaviors of the CPB and DPB may be mathematically specified. The HRD may directly impose constraints on timing, buffer sizes, and bit rates. Furthermore, the HRD may indirectly impose constraints on various bitstream characteristics and statistics.
In H.264/AVC and HEVC, bitstream conformance and decoder conformance are specified as parts of the HRD specification. In other words, the HRD model specifies tests to determine whether a bitstream conforms to a standard and tests to determine whether a decoder conforms to the standard. Although the HRD is named as some kind of decoder, video encoders typically use the HRD to guarantee bitstream conformance, while video decoders typically do not need the HRD.
H.264/AVC and HEVC both specify two types of bitstream or HRD conformance, namely Type I and Type II. A Type I bitstream is a NAL unit stream containing only the VCL NAL units and filler data NAL units for all access units in the bitstream. A Type II bitstream is a NAL unit stream that contains, in addition to the VCL NAL units and filler data NAL units for all access units in the bitstream, at least one of the following: additional non-VCL NAL units other than filler data NAL units; and all leading_zero_8bits, zero_byte, start_code_prefix_one_3bytes, and trailing_zero_8bits syntax elements that form a byte stream from the NAL unit stream.
When a device performs a bitstream conformance test that determines whether a bitstream conforms to a video coding standard, the device may select an operating point of the bitstream. The device may then determine a set of HRD parameters applicable to the selected operating point. The device may use the set of HRD parameters applicable to the selected operating point to configure the behavior of the HRD. More particularly, the device may use the applicable set of HRD parameters to configure the behaviors of particular components of the HRD, such as a hypothetical stream scheduler (HSS), the CPB, a decoding process, the DPB, and so on. Subsequently, the HSS may inject coded video data of the bitstream into the CPB of the HRD according to a particular schedule. Furthermore, the device may invoke a decoding process that decodes the coded video data in the CPB. The decoding process may output decoded pictures to the DPB. As the device moves data through the HRD, the device may determine whether a particular set of constraints remains satisfied. For example, the device may determine whether an overflow or underflow condition occurs in the CPB or DPB while the HRD is decoding the operating point representation of the selected operating point. The device may select and process each operating point of the bitstream in this manner. If no operating point of the bitstream causes the constraints to be violated, the device may determine that the bitstream conforms to the video coding standard.
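The overflow and underflow check described above can be illustrated with a greatly simplified buffer model. The sketch below assumes that bits arrive at a constant bit rate starting at time zero and that each access unit is removed instantaneously at its removal time; it does not reproduce the arrival and removal arithmetic of Annex C, and the function name check_cpb and its parameters are hypothetical.

```python
def check_cpb(au_sizes, removal_times, bit_rate, cpb_size):
    """Classify a schedule as "ok", "overflow", or "underflow".

    au_sizes      -- size of each access unit in bits, in decoding order
    removal_times -- removal time of each access unit in seconds
    bit_rate      -- constant delivery rate into the buffer, bits per second
    cpb_size      -- buffer capacity in bits
    """
    total_bits = sum(au_sizes)
    removed_bits = 0
    for size, t in zip(au_sizes, removal_times):
        delivered = min(bit_rate * t, total_bits)
        # Fullness peaks just before a removal, so checking here is sufficient.
        if delivered - removed_bits > cpb_size:
            return "overflow"
        # The access unit must have fully arrived before it can be removed.
        if delivered < removed_bits + size:
            return "underflow"
        removed_bits += size
    return "ok"
```

A conformance tester along these lines would run the check once per operating point, each time with the bit rate and CPB size taken from the set of HRD parameters applicable to that operating point.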
Both H.264/AVC and HEVC specify two types of decoder conformance, namely output timing decoder conformance and output order decoder conformance. A decoder claiming conformance to a specific profile, tier, and level is able to successfully decode all bitstreams that conform to the bitstream conformance requirements of a video coding standard, such as HEVC. In the present invention, a "profile" may refer to a subset of the bitstream syntax. "Tiers" and "levels" may be specified within each profile. A level of a tier may be a specified set of constraints imposed on values of the syntax elements in the bitstream. These constraints may be simple limits on values. Alternatively, the constraints may take the form of constraints on arithmetic combinations of values (e.g., picture width multiplied by picture height multiplied by number of pictures decoded per second). Typically, a level specified for a lower tier is more constrained than a level specified for a higher tier.
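A constraint on an arithmetic combination of values, such as the one mentioned above, can be checked as in the following sketch. The function name within_level_limit and the max_luma_sample_rate parameter are hypothetical, and any numeric limit passed in is supplied by the caller rather than taken from this description.

```python
def within_level_limit(width, height, pictures_per_second, max_luma_sample_rate):
    """Check picture width x picture height x pictures decoded per second
    against a level limit expressed in samples per second."""
    return width * height * pictures_per_second <= max_luma_sample_rate
```

For example, with an illustrative limit of 66,846,720 samples per second, a 1920×1080 sequence at 30 pictures per second (62,208,000 samples per second) satisfies the constraint, while the same resolution at 60 pictures per second does not.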
When a device performs a decoder conformance test to determine whether a decoder under test (DUT) conforms to a video coding standard, the device may provide, to both the HRD and the DUT, a bitstream that conforms to the video coding standard. The HRD may process the bitstream in the manner described above with regard to the bitstream conformance test. The device may determine that the DUT conforms to the video coding standard if the order of decoded pictures output by the DUT matches the order of decoded pictures output by the HRD. Moreover, the device may determine that the DUT conforms to the video coding standard if the timing with which the DUT outputs decoded pictures matches the timing with which the HRD outputs the decoded pictures.
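The two comparisons described above can be sketched as follows, with each output sequence represented as a list of (picture identifier, output time) pairs. The function names and the tolerance parameter are assumptions made for this illustration; an actual conformance specification may compare timing in a different way, for example by allowing a fixed delay.

```python
def output_order_conforms(hrd_output, dut_output):
    """True when the DUT outputs the same pictures in the same order as the HRD."""
    return [pid for pid, _ in hrd_output] == [pid for pid, _ in dut_output]


def output_timing_conforms(hrd_output, dut_output, tolerance=0.0):
    """True when the order matches and each output time agrees within `tolerance`."""
    if not output_order_conforms(hrd_output, dut_output):
        return False
    return all(
        abs(t_hrd - t_dut) <= tolerance
        for (_, t_hrd), (_, t_dut) in zip(hrd_output, dut_output)
    )
```

Separating the two checks mirrors the distinction between output order decoder conformance and output timing decoder conformance.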
In the H.264/AVC and HEVC HRD models, decoding or CPB removal may be access unit based. That is, the HRD is assumed to decode complete access units at one time and to remove complete access units from the CPB. Furthermore, in the H.264/AVC and HEVC HRD models, it is assumed that picture decoding is instantaneous. Video encoder 20 may signal, in picture timing SEI messages, decoding times to start decoding of access units. In practical applications, if a conforming video decoder strictly follows the signaled decoding times to start decoding access units, the earliest possible time to output a particular decoded picture is equal to the decoding time of that particular picture plus the time needed for decoding that particular picture. However, in the real world, the time needed for decoding a picture cannot be equal to zero.
HRD parameters may control various aspects of the HRD. In other words, the HRD may rely on the HRD parameters. The HRD parameters may include an initial CPB removal delay, a CPB size, a bit rate, an initial DPB output delay, and a DPB size. Video encoder 20 may signal these HRD parameters in a hrd_parameters() syntax structure specified in a video parameter set (VPS) and/or a sequence parameter set (SPS). Individual VPSs and/or SPSs may include multiple hrd_parameters() syntax structures for different sets of HRD parameters. In some examples, video encoder 20 may signal HRD parameters in buffering period SEI messages or picture timing SEI messages.
As explained above, an operating point of a bitstream is associated with a set of layer identifiers (i.e., a set of nuh_reserved_zero_6bits values) and a temporal identifier. An operating point representation may include each NAL unit that is associated with the operating point. An operating point representation may have a different frame rate and/or bit rate than the original bitstream. This is because the operating point representation may not include some pictures and/or some of the data of the original bitstream. Hence, if video decoder 30 were to remove data from the CPB and/or the DPB at a particular rate when processing the original bitstream, and were to remove data from the CPB and/or the DPB at the same rate when processing an operating point representation, video decoder 30 may remove too much or too little data from the CPB and/or the DPB. Accordingly, video encoder 20 may signal different sets of HRD parameters for different operating points. For instance, video encoder 20 may include, in a VPS, multiple hrd_parameters() syntax structures that include HRD parameters for different operating points.
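The relationship between an operating point and its operating point representation can be sketched as a filter over NAL units. In the sketch below, each NAL unit is a dictionary with hypothetical "layer_id" and "temporal_id" keys (corresponding to nuh_reserved_zero_6bits and nuh_temporal_id_plus1 minus 1), and the function name is invented for this illustration.

```python
def extract_operating_point(nal_units, layer_ids, highest_temporal_id):
    """Keep only the NAL units that belong to the given operating point.

    A NAL unit is kept when its layer identifier is in `layer_ids` and its
    temporal identifier does not exceed `highest_temporal_id`.
    """
    allowed = set(layer_ids)
    return [
        nal for nal in nal_units
        if nal["layer_id"] in allowed and nal["temporal_id"] <= highest_temporal_id
    ]
```

Because the result generally contains fewer pictures and fewer bits than the original bitstream, it is natural for it to need its own set of HRD parameters.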
In HEVC Working Draft 8, a set of HRD parameters optionally includes a set of information that is common for all temporal sub-layers. In other words, a set of HRD parameters may optionally include a set of common syntax elements that are applicable to operating points that include any temporal sub-layers. A temporal sub-layer may be a temporal scalable layer of a temporal scalable bitstream consisting of VCL NAL units with a particular value of TemporalId and the associated non-VCL NAL units. In addition to the set of common information, a set of HRD parameters may include a set of syntax elements that are specific to individual temporal sub-layers. For instance, the hrd_parameters() syntax structure may optionally include a set of information that is common for all sub-layers and always includes sub-layer-specific information. Because the set of common information is common to multiple sets of HRD parameters, it may be unnecessary to signal the set of common information in multiple sets of HRD parameters. Rather, in HEVC Working Draft 8, the common information may be present in a set of HRD parameters when the set of HRD parameters is the first set of HRD parameters in a VPS, or when the set of HRD parameters is associated with a first operating point index. For instance, HEVC Working Draft 8 supports the presence of common information when either the hrd_parameters() syntax structure is the first hrd_parameters() syntax structure in the VPS or the hrd_parameters() syntax structure is associated with a first operating point index.
Following table 1 is the example syntactic structure of hrd_parameters () syntactic structure in HEVC.
Table 1 --- HRD parameters
In the example of Table 1 above and in the other syntax tables of this disclosure, a syntax element with type descriptor ue(v) may be a variable-length unsigned integer encoded using 0th-order exponential Golomb (Exp-Golomb) coding with the left bit first. In the example of Table 1 and in the following tables, a syntax element with a descriptor of the form u(n), where n is a non-negative integer, is an unsigned value of length n bits.
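For purposes of explanation, the two descriptors may be parsed as in the following minimal sketch. The BitReader class and its method names are hypothetical helpers introduced for this illustration and are not taken from any HEVC reference software.

```python
class BitReader:
    """Reads bits from a bytes object, leftmost (most significant) bit first."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # current position, counted in bits

    def u(self, n: int) -> int:
        """u(n): unsigned value occupying n bits."""
        value = 0
        for _ in range(n):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value

    def ue(self) -> int:
        """ue(v): unsigned 0th-order Exp-Golomb code."""
        leading_zeros = 0
        while self.u(1) == 0:   # count zeros up to the first 1 bit
            leading_zeros += 1
        # codeNum = 2^leading_zeros - 1 + the next leading_zeros bits
        return (1 << leading_zeros) - 1 + self.u(leading_zeros)
```

For instance, the bit strings 1, 010, 011, and 00100 decode as ue(v) values 0, 1, 2, and 3, so small values cost few bits.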
In the example syntax of Table 1, the syntax elements in the "if( commonInfPresentFlag ) { ... }" block are the common information of the HRD parameters syntax structure. In other words, the common information of a set of HRD parameters may include the syntax elements timing_info_present_flag, num_units_in_tick, time_scale, nal_hrd_parameters_present_flag, vcl_hrd_parameters_present_flag, sub_pic_cpb_params_present_flag, tick_divisor_minus2, du_cpb_removal_delay_length_minus1, bit_rate_scale, cpb_size_scale, initial_cpb_removal_delay_length_minus1, cpb_removal_delay_length_minus1, and dpb_output_delay_length_minus1.
In addition, in the example of Table 1, the syntax elements fixed_pic_rate_flag[i], pic_duration_in_tc_minus1[i], low_delay_hrd_flag[i], and cpb_cnt_minus1[i] may be a set of sub-layer-specific HRD parameters. In other words, these syntax elements of the hrd_parameters() syntax structure may be applicable only to operating points that include a specific sub-layer. Thus, in addition to the optionally included common information, the HRD parameters of an hrd_parameters() syntax structure may include a set of sub-layer-specific HRD parameters that is specific to a particular sub-layer of the bitstream.
The fixed_pic_rate_flag[i] syntax element may indicate that, when HighestTid is equal to i, the temporal distance between the HRD output times of any two consecutive pictures in output order is constrained in a specific way. HighestTid may be a variable that identifies the highest temporal sub-layer (for example, of an operating point). The pic_duration_in_tc_minus1[i] syntax element may specify, when HighestTid is equal to i, the temporal distance, in clock ticks, between the HRD output times of any consecutive pictures in output order in the coded video sequence. The low_delay_hrd_flag[i] syntax element may specify the HRD operational mode, when HighestTid is equal to i, as specified in Annex C of HEVC Working Draft 8. The cpb_cnt_minus1[i] syntax element may specify the number of alternative CPB specifications in the bitstream of the coded video sequence when HighestTid is equal to i, where one alternative CPB specification refers to one particular CPB operation with a particular set of CPB parameters.
Video encoder 20 may use SEI messages to include, in the bitstream, metadata that is not required for correct decoding of the sample values of pictures. However, video decoder 30 or other devices may use the metadata included in SEI messages for various other purposes. For example, video decoder 30 or another device may use the metadata in SEI messages for picture output timing, picture display, loss detection, and error concealment.
Video encoder 20 may include one or more SEI NAL units in an access unit. In other words, any number of SEI NAL units may be associated with an access unit. Furthermore, each SEI NAL unit may contain one or more SEI messages. The HEVC standard describes the syntax and semantics of various types of SEI messages. However, the HEVC standard does not describe the handling of SEI messages, because SEI messages do not affect the normative decoding process. One reason to have SEI messages in the HEVC standard is to enable supplemental data to be interpreted identically by different systems that use HEVC. Specifications and systems that use HEVC may require video encoders to generate certain SEI messages or may define specific handling of particular types of received SEI messages. Table 2 below lists the SEI messages specified in HEVC and briefly describes their purposes.
Table 2 --- the general introduction of SEI message
U.S. Provisional Patent Application 61/705,102, filed September 24, 2012, describes various methods for signaling and selecting HRD parameters, including signaling and selecting delay information and timing information in SEI messages. Hannuksela et al., "AHG9: Operation points in VPS and nesting SEI," Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11, 11th Meeting, Shanghai, China, 10 to 19 October 2012, document no. JCTVC-K0180v1, which as of June 13, 2013 is available from http://phenix.int-evry.fr/jct/doc_end_user/documents/11_Shanghai/wg11/JCTVC-K0180-v1.zip, provides another method for signaling HRD parameters as well as a mechanism for nesting SEI messages.
Existing techniques for signaling HRD parameters have several problems or shortcomings. For example, the existing techniques may not allow a set of HRD parameters to be shared by multiple operating points. However, when the number of operating points is high, producing a different set of HRD parameters for each operating point may be a burden on video encoder 20 or another unit that attempts to ensure the conformance of a bitstream. Rather, the conformance of the bitstream may be ensured by making sure that each operating point is associated with a set of HRD parameters, while a particular set of HRD parameters may be shared by multiple operating points. One or more techniques of this disclosure may provide a design that allows one set of HRD parameters to be shared by multiple operating points. In other words, a single set of HRD parameters may be applicable to multiple operating points. This design may allow video encoder 20 or another unit that attempts to ensure the conformance of a bitstream to trade off between complexity and performance.
In another example of the problems or shortcomings of the existing techniques for signaling HRD parameters, when there are multiple sets of HRD parameters in a VPS, it may be desirable to have multiple different sets of common information for the sets of HRD parameters. This may be especially true when there is a large number of HRD parameters syntax structures in a VPS. Thus, it may be desirable to have sets of common information in HRD parameters syntax structures other than the first HRD parameters syntax structure. For instance, to provide improved performance when there are multiple hrd_parameters() syntax structures in a VPS, especially when the total number of hrd_parameters() syntax structures is relatively high, it may be desirable to have common information for hrd_parameters() syntax structures that differs from the common information of the first hrd_parameters() syntax structure or from the common information of the first operating point index.
One or more techniques of this disclosure provide a design that allows the common information of a set of HRD parameters to be explicitly signaled for any set of HRD parameters. For instance, the techniques of this disclosure may allow information that is common to all sub-layers to be explicitly signaled for any hrd_parameters() syntax structure.
In this way, video encoder 20 may signal, in a bitstream, a VPS that includes a plurality of HRD parameters syntax structures, each of which includes HRD parameters. For each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the bitstream. The common set of HRD parameters is common to all sub-layers of the bitstream.
Similarly, video decoder 30 or another device may decode, from a bitstream, a VPS that includes a plurality of HRD parameters syntax structures, each of which includes HRD parameters. For each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures, the VPS may further include a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters. Video decoder 30 or the other device may perform an operation using the HRD parameters of at least one of the HRD parameters syntax structures.
In addition, existing methods for nesting SEI messages may have several problems or shortcomings. For example, the existing techniques for signaling HRD parameters may not allow one SEI message to apply to multiple operating points. The techniques of this disclosure may provide a design that allows one SEI message to apply to multiple operating points.
In particular, a scalable nesting SEI message may include syntax elements that specify multiple operating points applicable to the SEI messages nested within the scalable nesting SEI message. In other words, a scalable nesting SEI message may provide a mechanism for associating SEI messages with bitstream subsets (for example, operating point representations) or with specific layers and sub-layers.
In this way, video encoder 20 may generate a scalable nesting SEI message that includes a plurality of syntax elements that identify a plurality of operating points to which a nested SEI message encapsulated by the scalable nesting SEI message applies. Furthermore, video encoder 20 may signal the scalable nesting SEI message in a bitstream.
In this way, during a video coding process, video decoder 30 or another device may decode, from a scalable nesting SEI message, a plurality of syntax elements that identify the operating points to which a nested SEI message encapsulated by the scalable nesting SEI message applies. Furthermore, video decoder 30 or the other device may perform an operation based at least in part on one or more syntax elements of the nested SEI message.
Another example of the problems or shortcomings of the existing techniques for nesting SEI messages relates to the fact that the existing techniques do not use the value of the layer identifier syntax element (for example, nuh_reserved_zero_6bits) in the current SEI NAL unit to determine the operating points applicable to the scalable nested SEI messages encapsulated by the current SEI NAL unit.
The techniques of this disclosure provide a design that signals whether the operating point applicable to nested SEI messages in an SEI NAL unit is the operating point indicated by the layer identification information in the NAL unit header of the SEI NAL unit. The layer identification information in the NAL unit header of the SEI NAL unit may include the value of nuh_reserved_zero_6bits and the value of nuh_temporal_id_plus1 of the NAL unit header. In other words, the techniques of this disclosure may provide a design that uses the layer identification information (for example, the values of nuh_reserved_zero_6bits and nuh_temporal_id_plus1) in the NAL unit header of the current SEI NAL unit (that is, the SEI NAL unit containing the scalable nesting SEI message) by signaling whether the nested SEI messages apply to a default operating point identified by that layer identification information.
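As an illustration, the layer identification information may be read from a NAL unit header as sketched below. The sketch assumes a two-byte header laid out as forbidden_zero_bit (1 bit), nal_unit_type (6 bits), nuh_reserved_zero_6bits (6 bits), and nuh_temporal_id_plus1 (3 bits); this layout is an assumption not stated in this section, and the function names are hypothetical.

```python
def parse_nal_unit_header(header: bytes) -> dict:
    """Split an assumed 16-bit NAL unit header into its fields."""
    value = int.from_bytes(header[:2], "big")
    return {
        "forbidden_zero_bit": (value >> 15) & 0x1,
        "nal_unit_type": (value >> 9) & 0x3F,
        "nuh_reserved_zero_6bits": (value >> 3) & 0x3F,
        "nuh_temporal_id_plus1": value & 0x7,
    }


def default_operating_point_ids(header: bytes) -> tuple:
    """Return the (layer identifier, temporal identifier) pair from the header."""
    fields = parse_nal_unit_header(header)
    layer_id = fields["nuh_reserved_zero_6bits"]
    temporal_id = fields["nuh_temporal_id_plus1"] - 1
    return layer_id, temporal_id
```

A device could use the returned pair to identify the default operating point when the scalable nesting SEI message signals that the nested SEI messages apply to it.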
In this way, video encoder 20 may include, in a scalable nesting SEI message encapsulated by an SEI NAL unit, a syntax element that indicates whether a nested SEI message encapsulated by the scalable nesting SEI message is applicable to a default sub-bitstream. The default sub-bitstream may be an operating point representation of an operating point defined by a layer identifier specified in the NAL unit header of the SEI NAL unit and a temporal identifier specified in the NAL unit header. Furthermore, video encoder 20 may output a bitstream that includes the scalable nesting SEI message.
Similarly, a device, such as video decoder 30 or another device, may determine, based at least in part on a syntax element in a scalable nesting SEI message encapsulated by an SEI NAL unit, whether a nested SEI message encapsulated by the scalable nesting SEI message is applicable to a default sub-bitstream. As described above, the default sub-bitstream may be an operating point representation of an operating point defined by a layer identifier specified in the NAL unit header of the SEI NAL unit and a temporal identifier specified in the NAL unit header. When the nested SEI message is applicable to the default sub-bitstream, the device may use the nested SEI message in an operation on the default sub-bitstream. For example, the nested SEI message may include one or more HRD parameters. In this example, the device may use the one or more HRD parameters to perform a bitstream conformance test that determines whether the default sub-bitstream conforms to a video coding standard (for example, HEVC). Alternatively, in this example, the device may use the one or more HRD parameters to determine whether video decoder 30 satisfies a decoder conformance test.
In another example of the problems or shortcomings of the existing methods for nesting SEI messages, the explicit coding of layer identifiers is inefficient. The techniques of this disclosure may increase the efficiency of explicit coding of layer identifiers through differential coding or coding using flags.
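The idea of differential coding may be illustrated with the following sketch, which is only one possible scheme and is not the syntax of this disclosure: the first layer identifier is kept as-is and each later identifier is replaced by its distance from the previous one. The function names are hypothetical.

```python
def encode_layer_ids(layer_ids):
    """Return the smallest identifier followed by the gaps between sorted neighbors."""
    ordered = sorted(set(layer_ids))
    if not ordered:
        return []
    deltas = [ordered[0]]
    for previous, current in zip(ordered, ordered[1:]):
        deltas.append(current - previous)
    return deltas


def decode_layer_ids(deltas):
    """Rebuild the sorted identifiers by accumulating the gaps."""
    layer_ids = []
    total = 0
    for delta in deltas:
        total += delta
        layer_ids.append(total)
    return layer_ids
```

Because the gaps between neighboring identifiers tend to be small, a variable-length code such as ue(v) may represent them in fewer bits than fixed-length identifiers.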
FIG. 2 is a block diagram illustrating an example video encoder 20 that may implement the techniques of this disclosure. FIG. 2 is provided for purposes of explanation and should not be considered limiting of the techniques as broadly exemplified and described in this disclosure. For purposes of explanation, this disclosure describes video encoder 20 in the context of HEVC coding. However, the techniques of this disclosure may be applicable to other coding standards or methods.
In the example of FIG. 2, video encoder 20 includes a prediction processing unit 100, a residual generation unit 102, a transform processing unit 104, a quantization unit 106, an inverse quantization unit 108, an inverse transform processing unit 110, a reconstruction unit 112, a filter unit 114, a decoded picture buffer 116, and an entropy encoding unit 118. Prediction processing unit 100 includes an inter-prediction processing unit 120 and an intra-prediction processing unit 126. Inter-prediction processing unit 120 includes a motion estimation unit 122 and a motion compensation unit 124. In other examples, video encoder 20 may include more, fewer, or different functional units.
Video encoder 20 may receive video data. Video encoder 20 may encode each CTU in a slice of a picture of the video data. Each of the CTUs may be associated with equally sized luma coding tree blocks (CTBs) and corresponding CTBs of the picture. As part of encoding a CTU, prediction processing unit 100 may perform quadtree partitioning to divide the CTBs of the CTU into progressively smaller blocks. The smaller blocks may be coding blocks of CUs. For example, prediction processing unit 100 may partition a CTB associated with a CTU into four equally sized sub-blocks, partition one or more of the sub-blocks into four equally sized sub-sub-blocks, and so on.
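The recursive division described above may be sketched as follows. The split decision is passed in as a callback because this section does not say how the encoder decides; the function name, the callback, and the minimum block size of 8 are assumptions made for illustration.

```python
def quadtree_partition(x, y, size, should_split, min_size=8):
    """Return the leaf blocks (x, y, size) of a quadtree rooted at (x, y).

    should_split(x, y, size) is a caller-supplied decision function.
    """
    if size > min_size and should_split(x, y, size):
        half = size // 2
        leaves = []
        # Visit the four equally sized sub-blocks in raster order.
        for dy in (0, half):
            for dx in (0, half):
                leaves.extend(
                    quadtree_partition(x + dx, y + dy, half, should_split, min_size)
                )
        return leaves
    return [(x, y, size)]
```

For example, splitting a 64×64 block once yields four 32×32 blocks; splitting one of those again yields seven leaves in total.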
Video encoder 20 may encode the CUs of a CTU to generate encoded representations of the CUs (that is, coded CUs). As part of encoding a CU, prediction processing unit 100 may partition the coding blocks associated with the CU among one or more PUs of the CU. Thus, each PU may be associated with a luma prediction block and corresponding chroma prediction blocks. Video encoder 20 and video decoder 30 may support PUs having various sizes. As indicated above, the size of a CU may refer to the size of the luma coding block of the CU, and the size of a PU may refer to the size of the luma prediction block of the PU. Assuming that the size of a particular CU is 2N×2N, video encoder 20 and video decoder 30 may support PU sizes of 2N×2N or N×N for intra prediction, and symmetric PU sizes of 2N×2N, 2N×N, N×2N, N×N, or similar for inter prediction. Video encoder 20 and video decoder 30 may also support asymmetric partitioning for PU sizes of 2N×nU, 2N×nD, nL×2N, and nR×2N for inter prediction.
Inter-prediction processing unit 120 may generate predictive data for a PU by performing inter prediction on each PU of a CU. The predictive data for the PU may include predictive blocks of the PU and motion information for the PU. Inter-prediction processing unit 120 may perform different operations for a PU of a CU depending on whether the PU is in an I slice, a P slice, or a B slice. In an I slice, all PUs are intra predicted. Hence, if the PU is in an I slice, inter-prediction processing unit 120 does not perform inter prediction on the PU. Thus, for blocks encoded in I-mode, the predictive block is formed using spatial prediction from previously encoded neighboring blocks within the same frame.
If a PU is in a P slice, motion estimation unit 122 may search the reference pictures in a list of reference pictures (for example, "RefPicList0") for a reference region for the PU. The reference region for the PU may be a region, within a reference picture, that contains sample blocks that most closely correspond to the sample blocks of the PU. Motion estimation unit 122 may generate a reference index that indicates the position in RefPicList0 of the reference picture containing the reference region for the PU. In addition, motion estimation unit 122 may generate a motion vector that indicates a spatial displacement between a coding block of the PU and a reference location associated with the reference region. For instance, the motion vector may be a two-dimensional vector that provides an offset from the coordinates in the current picture to coordinates in a reference picture. Motion estimation unit 122 may output the reference index and the motion vector as the motion information of the PU. Motion compensation unit 124 may generate the predictive blocks of the PU based on actual or interpolated samples at the reference location indicated by the motion vector of the PU.
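A simple way to find a region that "most closely corresponds" to a block is an exhaustive search that minimizes the sum of absolute differences (SAD). The sketch below uses that criterion as an assumption; this section does not specify the matching metric or search strategy, and the function names are hypothetical. Pictures are plain lists of rows of integer samples, and only integer-sample displacements are considered.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


def extract(picture, x, y, size):
    """Copy the size-by-size block whose top-left sample is at (x, y)."""
    return [row[x:x + size] for row in picture[y:y + size]]


def motion_search(current, reference, x, y, size, search_range):
    """Return ((dx, dy), cost) of the best match within +/- search_range."""
    target = extract(current, x, y, size)
    height, width = len(reference), len(reference[0])
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            rx, ry = x + dx, y + dy
            # Skip candidates that fall outside the reference picture.
            if rx < 0 or ry < 0 or rx + size > width or ry + size > height:
                continue
            cost = sad(target, extract(reference, rx, ry, size))
            if best is None or cost < best[0]:
                best = (cost, (dx, dy))
    return best[1], best[0]
```

The returned (dx, dy) pair plays the role of the motion vector: it is the offset from the block's coordinates in the current picture to the matching coordinates in the reference picture.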
If a PU is in a B slice, motion estimation unit 122 may perform uni-prediction or bi-prediction for the PU. To perform uni-prediction for the PU, motion estimation unit 122 may search the reference pictures of RefPicList0 or of a second reference picture list ("RefPicList1") for a reference region for the PU. Motion estimation unit 122 may output, as the motion information of the PU, a reference index that indicates the position in RefPicList0 or RefPicList1 of the reference picture that contains the reference region, a motion vector that indicates a spatial displacement between a prediction block of the PU and a reference location associated with the reference region, and one or more prediction direction indicators that indicate whether the reference picture is in RefPicList0 or RefPicList1. Motion compensation unit 124 may generate the predictive blocks of the PU based at least in part on actual or interpolated samples at the reference region indicated by the motion vector of the PU.
To perform bi-directional inter prediction for a PU, motion estimation unit 122 may search the reference pictures in RefPicList0 for a reference region for the PU and may also search the reference pictures in RefPicList1 for another reference region for the PU. Motion estimation unit 122 may generate reference indexes that indicate the positions in RefPicList0 and RefPicList1 of the reference pictures that contain the reference regions. In addition, motion estimation unit 122 may generate motion vectors that indicate spatial displacements between the reference locations associated with the reference regions and a prediction block of the PU. The motion information of the PU may include the reference indexes and the motion vectors of the PU. Motion compensation unit 124 may generate the predictive blocks of the PU based at least in part on actual or interpolated samples at the reference regions indicated by the motion vectors of the PU.
Intra-prediction processing unit 126 may generate predictive data for a PU by performing intra prediction on the PU. The predictive data for the PU may include predictive blocks for the PU and various syntax elements. Intra-prediction processing unit 126 may perform intra prediction on PUs in I slices, P slices, and B slices.
To perform intra prediction on a PU, intra-prediction processing unit 126 may use multiple intra prediction modes to generate multiple sets of predictive data for the PU. Intra-prediction processing unit 126 may generate a predictive block for the PU based on samples of neighboring PUs. Assuming a left-to-right, top-to-bottom encoding order for PUs, CUs, and CTUs, the neighboring PUs may be above, above and to the right, above and to the left, or to the left of the PU. Intra-prediction processing unit 126 may use various numbers of intra prediction modes, for example, 33 directional intra prediction modes. In some examples, the number of intra prediction modes may depend on the size of the prediction blocks of the PU.
Prediction processing unit 100 may select the predictive data for the PUs of a CU from among the predictive data generated by inter-prediction processing unit 120 for the PUs and the predictive data generated by intra-prediction processing unit 126 for the PUs. In some examples, prediction processing unit 100 selects the predictive data for the PUs of the CU based on rate/distortion metrics of the sets of predictive data. The predictive blocks of the selected predictive data may be referred to herein as the selected predictive blocks.
Residual generation unit 102 may generate the luma, Cb, and Cr residual blocks of a CU based on the luma, Cb, and Cr coding blocks of the CU and the selected predictive luma, Cb, and Cr blocks of the PUs of the CU. For instance, residual generation unit 102 may generate the residual blocks of the CU such that each sample in the residual blocks has a value equal to the difference between a sample in a coding block of the CU and the corresponding sample in the corresponding selected predictive block of a PU of the CU.
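The sample-wise difference may be sketched as follows, together with the inverse step that adds a residual back to a prediction. Blocks are plain lists of rows of integers, and the function names are hypothetical.

```python
def residual_block(coding_block, predictive_block):
    """Residual sample = coding block sample - corresponding predictive sample."""
    return [
        [c - p for c, p in zip(coding_row, predictive_row)]
        for coding_row, predictive_row in zip(coding_block, predictive_block)
    ]


def reconstruct_block(residual, predictive_block):
    """Add the residual back to the prediction to recover the samples."""
    return [
        [r + p for r, p in zip(residual_row, predictive_row)]
        for residual_row, predictive_row in zip(residual, predictive_block)
    ]
```

Without quantization the round trip is exact; in an actual encoder the residual passes through transform and quantization first, so the reconstruction is generally only approximate.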
Transform processing unit 104 may perform quadtree partitioning to partition the residual blocks of a CU into transform blocks associated with the TUs of the CU. Thus, a TU may be associated with a luma transform block and two corresponding chroma transform blocks. The sizes and positions of the luma and chroma transform blocks of the TUs of a CU may or may not be based on the sizes and positions of the prediction blocks of the PUs of the CU. A quadtree structure known as a "residual quadtree" (RQT) may include nodes associated with each of the regions. The TUs of a CU may correspond to the leaf nodes of the RQT.
Transform processing unit 104 may generate a transform coefficient block for each TU of a CU by applying one or more transforms to the transform blocks of the TU. Transform processing unit 104 may apply various transforms to a transform block associated with a TU. For example, transform processing unit 104 may apply a discrete cosine transform (DCT), a directional transform, or a conceptually similar transform to a transform block. In some examples, transform processing unit 104 does not apply a transform to a transform block. In such examples, the transform block may be treated as a transform coefficient block.
Quantization unit 106 may quantize the transform coefficients in a coefficient block. The quantization process may reduce the bit depth associated with some or all of the transform coefficients. For example, an n-bit transform coefficient may be rounded down to an m-bit transform coefficient during quantization, where n is greater than m. Quantization unit 106 may quantize a coefficient block associated with a TU of a CU based on a quantization parameter (QP) value associated with the CU. Video encoder 20 may adjust the degree of quantization applied to the coefficient blocks associated with a CU by adjusting the QP value associated with the CU. Quantization may introduce loss of information; thus, quantized transform coefficients may have lower precision than the original ones.
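The bit-depth reduction in the example above may be illustrated for non-negative coefficients with a simple right shift. This is a deliberately simplified sketch: it does not model the QP-dependent step size of an actual encoder, and the function names are hypothetical.

```python
def reduce_bit_depth(coefficient: int, n: int, m: int) -> int:
    """Round a non-negative n-bit value down to an m-bit value (n > m)."""
    if n <= m:
        raise ValueError("n must be greater than m")
    return coefficient >> (n - m)   # discards the n - m least significant bits


def restore_bit_depth(level: int, n: int, m: int) -> int:
    """Scale an m-bit value back to the n-bit range; the discarded bits stay zero."""
    return level << (n - m)
```

Reducing the 10-bit value 733 to 8 bits gives 183, and scaling 183 back gives 732 rather than 733, which shows the loss of precision that quantization introduces.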
Inverse quantization unit 108 and inverse transform processing unit 110 may apply inverse quantization and inverse transforms, respectively, to a coefficient block to reconstruct a residual block from the coefficient block. Reconstruction unit 112 may add the reconstructed residual block to corresponding samples from one or more predictive blocks generated by prediction processing unit 100 to produce a reconstructed transform block associated with a TU. By reconstructing the transform blocks of each TU of a CU in this way, video encoder 20 may reconstruct the coding blocks of the CU.
Filter unit 114 may perform one or more deblocking operations to reduce blocking artifacts in the coding blocks associated with a CU. Decoded picture buffer 116 may store the reconstructed coding blocks after filter unit 114 performs the one or more deblocking operations on the reconstructed coding blocks. Inter-prediction processing unit 120 may use a reference picture that contains the reconstructed coding blocks to perform inter prediction on PUs of other pictures. In addition, intra-prediction processing unit 126 may use reconstructed coding blocks in decoded picture buffer 116 to perform intra prediction on other PUs in the same picture as the CU.
Entropy encoding unit 118 may receive data from other functional units of video encoder 20. For example, entropy encoding unit 118 may receive coefficient blocks from quantization unit 106 and may receive syntax elements from prediction processing unit 100. Entropy encoding unit 118 may perform one or more entropy encoding operations on the data to generate entropy-encoded data. For example, entropy encoding unit 118 may perform a context-adaptive variable length coding (CAVLC) operation, a CABAC operation, a variable-to-variable (V2V) length coding operation, a syntax-based context-adaptive binary arithmetic coding (SBAC) operation, a Probability Interval Partitioning Entropy (PIPE) coding operation, an Exponential-Golomb encoding operation, or another type of entropy encoding operation on the data. Video encoder 20 may output a bitstream that includes the entropy-encoded data generated by entropy encoding unit 118. For instance, the bitstream may include data that represents an RQT for a CU.
As indicated above, the techniques of this disclosure may provide a design that allows the common information of HRD parameters syntax structures to be explicitly signaled for any HRD parameters syntax structure in a VPS. To enable the common information of HRD parameters syntax structures to be explicitly signaled for any HRD parameters syntax structure in a VPS, video encoder 20 may generate VPS syntax structures that conform to the example syntax shown in Table 3 below.
Table 3 --- VPS syntactic structures
The italicized portions of Table 3 indicate differences between the syntax of Table 3 and the corresponding table from HEVC Working Draft 8. Furthermore, in the example syntax of Table 3, the num_ops_minus1 syntax element specifies the number of operation_point() syntax structures present in the VPS. The hrd_applicable_ops_minus1[i] syntax element specifies the number of operating points to which the i-th hrd_parameters() syntax structure applies. The hrd_op_idx[i][j] syntax element specifies the j-th operating point to which the i-th hrd_parameters() syntax structure in the VPS applies. As briefly mentioned above, the techniques of this disclosure may allow one set of HRD parameters to be shared by multiple operating points. The hrd_applicable_ops_minus1[i] and hrd_op_idx[i][j] syntax elements may serve to indicate the operating points to which a set of HRD parameters applies. In some examples where multiple operating points are not allowed to be applicable to a single set of HRD parameters, the hrd_applicable_ops_minus1[i] and hrd_op_idx[i][j] syntax elements are omitted from Table 3.
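For purposes of illustration, a decoder-side lookup from operating point index to HRD parameters syntax structure might be built as sketched below. The sketch assumes, following the usual "_minus1" naming convention, that the i-th structure applies to hrd_applicable_ops_minus1[i] + 1 operating points; the function name and the list-based representation of the decoded syntax elements are hypothetical.

```python
def hrd_index_by_operating_point(hrd_applicable_ops_minus1, hrd_op_idx):
    """Map each operating point index to the index i of the
    hrd_parameters() syntax structure that applies to it."""
    mapping = {}
    for i, count_minus1 in enumerate(hrd_applicable_ops_minus1):
        # Assumed: the signaled value is one less than the number of entries.
        for j in range(count_minus1 + 1):
            mapping[hrd_op_idx[i][j]] = i
    return mapping
```

In this representation, several operating point indexes may map to the same structure index, which is how one set of HRD parameters is shared by multiple operating points.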
In the example syntax of Table 3, the VPS may include a set of common parameters present flags (that is, syntax elements), denoted in Table 3 as cprms_present_flag[i]. A cprms_present_flag[i] syntax element equal to 1 specifies that the HRD parameters that are common to all sub-layers are present in the i-th hrd_parameters() syntax structure in the VPS. A cprms_present_flag[i] syntax element equal to 0 specifies that the HRD parameters that are common to all sub-layers are not present in the i-th hrd_parameters() syntax structure in the VPS and are instead derived to be the same as those of the (i−1)-th hrd_parameters() syntax structure in the VPS.
cprms_present_flag[0] may be inferred to be equal to 1. That is, a device may automatically determine (that is, infer) that the first hrd_parameters() syntax structure (in coding order) in the VPS includes the HRD parameters that are common to all sub-layers. Consequently, the first HRD parameters syntax structure signaled in the VPS includes a common set of HRD parameters. One or more subsequent HRD parameters syntax structures in the VPS may include different common sets of HRD parameters.
As briefly mentioned above, the techniques of this disclosure may allow the common information of HRD parameters syntax structures (that is, the HRD parameters that are common to each of the sub-layers) to be explicitly signaled for any HRD parameters syntax structure. The cprms_present_flag[i] syntax element of Table 3 may enable video decoder 30 or another device to determine which of the HRD parameters syntax structures include a set of HRD parameters that are common to each of the sub-layers. Thus, while the first HRD parameters syntax structure may always include the common set of HRD parameters, one or more HRD parameters syntax structures signaled in the VPS do not include the common set of HRD parameters. A device may use the cprms_present_flag[i] syntax elements to determine which of the HRD parameters syntax structures of the VPS include common sets of HRD parameters.
An HRD parameters syntax structure (for example, an hrd_parameters() syntax structure) may include a set of sub-layer-specific HRD parameters regardless of whether the HRD parameters syntax structure includes the HRD parameters that are common to all sub-layers. When video decoder 30 or another device determines that a particular HRD parameters syntax structure does not include the common set of HRD parameters, video decoder 30 or the other device may perform an operation using the common set of HRD parameters associated with a previous HRD parameters syntax structure and the set of sub-layer-specific HRD parameters of the particular HRD parameters syntax structure. The previous HRD parameters syntax structure may be a set of HRD parameters signaled in the VPS before, in coding order, the particular HRD parameters syntax structure. If the previous HRD parameters syntax structure includes a common set of HRD parameters, the common set of HRD parameters associated with the previous HRD parameters syntax structure is the common set of HRD parameters included in the previous HRD parameters syntax structure. If the previous HRD parameters syntax structure does not include a common set of HRD parameters, a device may determine that the common set of HRD parameters associated with the previous HRD parameters syntax structure is the common set of HRD parameters associated with an HRD parameters syntax structure that precedes, in coding order, the previous HRD parameters syntax structure.
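The inheritance rule described above may be sketched as a single pass over the HRD parameters syntax structures in coding order. The dictionary layout, the key names other than cprms_present_flag, and the function name are assumptions made for illustration only.

```python
def resolve_common_sets(hrd_structs):
    """Return, for each structure in coding order, the common set of HRD
    parameters associated with it.

    Each entry is a dict with a "cprms_present_flag" value and, when the
    common set is present, a "common" dict holding the common parameters.
    """
    resolved = []
    current_common = None
    for i, struct in enumerate(hrd_structs):
        # cprms_present_flag[0] is inferred to be equal to 1.
        present = True if i == 0 else bool(struct["cprms_present_flag"])
        if present:
            current_common = struct["common"]
        # Otherwise the common set of the preceding structure carries over.
        resolved.append(current_common)
    return resolved
```

Because the most recent common set is carried forward, a structure whose predecessor also lacks a common set still resolves to the nearest earlier structure that signaled one.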
As mentioned above, a device may perform an operation using the common set of HRD parameters and the sub-layer-specific HRD parameters. During this operation, the device may manage the operation of the CPB according to one or more of the HRD parameters, decode video data, and manage decoded pictures in the DPB according to one or more of the HRD parameters. In another example, the common set of HRD parameters and the sub-layer-specific HRD parameters may be used to perform a bitstream conformance test or a decoder conformance test.
Furthermore, in some examples, a scalable nesting SEI message provides a mechanism for associating SEI messages with bitstream subsets (e.g., operation point representations) or with specific layers and sub-layers. In some such examples, a scalable nesting SEI message may contain one or more SEI messages. An SEI message contained in a scalable nesting SEI message may be referred to as a nested SEI message. An SEI message not contained in a scalable nesting SEI message may be referred to as a non-nested SEI message. In some examples, a nested SEI message in a scalable nesting SEI message may include a set of HRD parameters.
In some examples, there are several restrictions on which types of messages may be nested. For example, a buffering period SEI message and an SEI message of any other type may not be nested in the same scalable nesting SEI message. A buffering period SEI message may indicate the initial delays for HRD operation. In another example, a picture timing SEI message and an SEI message of any other type may not be nested in the same scalable nesting SEI message. A picture timing SEI message may indicate a picture output time and a picture/sub-picture removal time for HRD operation. In other examples, a picture timing SEI message and a sub-picture timing SEI message may be nested in the same scalable nesting SEI message. A sub-picture timing SEI message may provide CPB removal delay information for the decoding unit associated with the SEI message.
As indicated above, one or more techniques of the present invention allow one SEI message to apply to multiple operation points. Furthermore, one or more techniques of the present invention may enable video encoder 20 to signal whether an operation point applicable to a nested SEI message in an SEI NAL unit is the operation point indicated by the layer identification information in the NAL unit header of the SEI NAL unit. In addition, one or more techniques of the present invention may increase the efficiency of explicit coding of layer identifiers through differential coding. The example syntax shown in Table 4 below, and the accompanying semantics, may implement these techniques.
Table 4: Scalable nesting SEI message
In the example of Table 4, italicized portions may indicate differences from HEVC Working Draft 8. Specifically, in the example syntax of Table 4, a bitstream_subset_flag syntax element equal to 0 specifies that the SEI messages nested in the scalable nesting SEI message apply to specific layers and sub-layers. A bitstream_subset_flag syntax element equal to 1 specifies that the SEI messages nested in the scalable nesting SEI message apply to a sub-bitstream resulting from the sub-bitstream extraction process of sub-clause 10.1 of HEVC Working Draft 8, with inputs specified by the syntax elements of the scalable nesting SEI message as described below. Sub-clause 10.1 of HEVC Working Draft 8 describes an operation for extracting a sub-bitstream (i.e., an operation point representation) from a bitstream. Specifically, sub-clause 10.1 provides that the sub-bitstream is derived by removing from the bitstream all NAL units with temporal identifiers (e.g., TemporalId) greater than tIdTarget or with layer identifiers (e.g., nuh_reserved_zero_6bits) whose values are not in targetDecLayerIdSet. tIdTarget and targetDecLayerIdSet are parameters of the bitstream extraction process. In some examples, if the nested SEI messages are buffering period SEI messages, picture timing SEI messages, or sub-picture timing SEI messages, the bitstream_subset_flag syntax element is equal to 1. Otherwise, in such examples, the bitstream_subset_flag syntax element is equal to 0.
Furthermore, in the example syntax of Table 4, if the bitstream_subset_flag syntax element is equal to 1, the scalable nesting SEI message includes a default_op_applicable_flag syntax element. A default_op_applicable_flag syntax element equal to 1 specifies that the nested SEI messages (i.e., the SEI messages nested in the scalable nesting SEI message) apply to a default sub-bitstream. The default sub-bitstream is the output of the sub-bitstream extraction process of sub-clause 10.1 of HEVC Working Draft 8 with two inputs: tIdTarget equal to the temporal identifier (TemporalId) of the current SEI NAL unit, and targetDecLayerIdSet consisting of all values of nuh_reserved_zero_6bits in the range of 0 to the nuh_reserved_zero_6bits of the current SEI NAL unit, inclusive. Thus, the default sub-bitstream may be the bitstream derived by removing from the bitstream all NAL units with temporal identifiers greater than the temporal identifier of the current SEI NAL unit or with layer identifiers outside the range of 0 to the layer identifier (e.g., nuh_reserved_zero_6bits) of the current SEI NAL unit, inclusive. For instance, the default sub-bitstream may be a subset of the bitstream that does not include VCL NAL units of the bitstream that have layer identifiers greater than the layer identifier indicated by the layer identifier syntax element of the NAL unit header, or that have temporal identifiers greater than the temporal identifier indicated by the temporal layer identifier syntax element (e.g., nuh_temporal_id_plus1) of the NAL unit header. A default_op_applicable_flag syntax element equal to 0 specifies that the nested SEI messages do not apply to the default sub-bitstream.
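As a rough sketch of the extraction rule and of the default sub-bitstream described above, the following example filters a list of NAL units by temporal identifier and layer identifier. The NalUnit type and both function names are hypothetical and only model the two header fields that matter here; this is not the normative process of sub-clause 10.1.

```python
from typing import NamedTuple


class NalUnit(NamedTuple):
    """Hypothetical model of the two NAL unit header fields used here."""
    layer_id: int      # value of nuh_reserved_zero_6bits
    temporal_id: int   # TemporalId, i.e. nuh_temporal_id_plus1 - 1


def extract_sub_bitstream(nal_units, t_id_target, target_dec_layer_id_set):
    """Keep NAL units whose TemporalId does not exceed tIdTarget and
    whose layer identifier is in targetDecLayerIdSet; drop the rest."""
    return [
        n for n in nal_units
        if n.temporal_id <= t_id_target and n.layer_id in target_dec_layer_id_set
    ]


def default_sub_bitstream(nal_units, sei_nal_unit):
    """Default sub-bitstream for the SEI NAL unit: tIdTarget is the SEI
    NAL unit's TemporalId, and the layer set is 0..its layer id, inclusive."""
    layers = set(range(sei_nal_unit.layer_id + 1))
    return extract_sub_bitstream(nal_units, sei_nal_unit.temporal_id, layers)


bitstream = [NalUnit(0, 0), NalUnit(0, 1), NalUnit(1, 0), NalUnit(2, 0), NalUnit(1, 2)]
sei = NalUnit(layer_id=1, temporal_id=1)
default = default_sub_bitstream(bitstream, sei)
```

Here the NAL unit in layer 2 and the NAL unit with TemporalId 2 are removed, leaving the first three units.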
In the example syntax of Table 4, if the bitstream_subset_flag syntax element is equal to 1, the scalable nesting SEI message includes a nesting_num_ops_minus1 syntax element. The nesting_num_ops_minus1 syntax element, plus 1, specifies the number of nesting_op_idx[i] syntax elements in the scalable nesting SEI message. Thus, if the nesting_num_ops_minus1 syntax element, plus 1, is greater than 0, the nesting_num_ops_minus1 syntax element may indicate whether the scalable nesting SEI message includes a plurality of syntax elements that identify multiple operation points to which the nested SEI messages apply. In this way, a device may decode, from the scalable nesting SEI message, a syntax element (nesting_num_ops_minus1) that indicates the number of operation points to which the nested SEI messages apply. When the nesting_num_ops_minus1 syntax element is not present, the value of nesting_num_ops_minus1 may be inferred to be equal to 0. Thus, if the bitstream_subset_flag syntax element is equal to 0, the scalable nesting SEI message does not include nesting_op_idx[i] syntax elements.
A nesting_op_flag syntax element equal to 0 specifies that the set nestingLayerIdSet[0] is specified by the all_layers_flag syntax element and, when present, the nesting_layer_id_delta[i] syntax elements for all values of i in the range of 0 to nesting_num_layers_minus1, inclusive. The nestingLayerIdSet[] array holds layer identifiers. A nesting_op_flag syntax element equal to 1 specifies that the set nestingLayerIdSet[i] is specified by the nesting_op_idx[i] syntax element. When not present, the value of nesting_op_flag is inferred to be equal to 1.
The nesting_max_temporal_id_plus1[i] syntax element specifies the variable maxTemporalId[i]. In the example syntax of Table 4, the value of the nesting_max_temporal_id_plus1[i] syntax element is greater than the value of the nuh_temporal_id_plus1 syntax element of the current SEI NAL unit (i.e., the NAL unit that contains the scalable nesting SEI message). The variable maxTemporalId[i] is set equal to nesting_max_temporal_id_plus1[i] - 1.
The nesting_op_idx[i] syntax element is used to specify the set nestingLayerIdSet[i]. The set nestingLayerIdSet[i] may consist of op_layer_id[nesting_op_idx][i] for all values of i in the range of 0 to op_num_layer_id_values_minus1[nesting_op_idx], inclusive. The active VPS may specify the op_layer_id[][] and op_num_layer_id_values_minus1[] values.
Furthermore, in the example syntax of Table 4, an all_layers_flag syntax element equal to 0 specifies that the set nestingLayerIdSet[0] consists of nestingLayerId[i] for all values of i in the range of 0 to nesting_num_layers_minus1, inclusive. The variable nestingLayerId[i] is described below. An all_layers_flag syntax element equal to 1 specifies that the set nestingLayerIdSet consists of all values of nuh_reserved_zero_6bits present in the current access unit that are equal to or greater than the nuh_reserved_zero_6bits of the current SEI NAL unit.
The nesting_num_layers_minus1 syntax element, plus 1, specifies the number of nesting_layer_id_delta[i] syntax elements in the scalable nesting SEI message. When i is equal to 0, the nesting_layer_id_delta[i] syntax element specifies the difference between the first (i.e., the 0-th) nuh_reserved_zero_6bits value included in the set nestingLayerIdSet[0] and the nuh_reserved_zero_6bits syntax element of the current SEI NAL unit. When i is greater than 0, the nesting_layer_id_delta[i] syntax element specifies the difference between the i-th and the (i-1)-th nuh_reserved_zero_6bits values included in the set nestingLayerIdSet[0].
The variable nestingLayerId[i] may be derived as follows, where nuh_reserved_zero_6bits is from the NAL unit header of the current SEI NAL unit.
nestingLayerId[0] = nuh_reserved_zero_6bits + nesting_layer_id_delta[0]
for (i = 1; i <= nesting_num_layers_minus1; i++)
    nestingLayerId[i] = nestingLayerId[i - 1] + nesting_layer_id_delta[i]
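The differential derivation above amounts to a running sum seeded with the layer identifier of the current SEI NAL unit. A minimal sketch follows; the function name and the sample values are made up for illustration.

```python
def nesting_layer_ids_from_deltas(nuh_reserved_zero_6bits, nesting_layer_id_delta):
    """Rebuild nestingLayerId[] from the signaled deltas.

    The list length plays the role of nesting_num_layers_minus1 + 1.
    """
    nesting_layer_id = [nuh_reserved_zero_6bits + nesting_layer_id_delta[0]]
    for i in range(1, len(nesting_layer_id_delta)):
        nesting_layer_id.append(nesting_layer_id[i - 1] + nesting_layer_id_delta[i])
    return nesting_layer_id


# Illustrative values: the SEI NAL unit is in layer 2 and three deltas are signaled.
layer_ids = nesting_layer_ids_from_deltas(2, [1, 2, 3])
```

With these sample inputs the derived layer identifiers are 3, 5, and 8. Because only the differences are signaled, small deltas can stand in for larger absolute identifiers, which is the coding-efficiency point made in the text.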
The set nestingLayerIdSet[0] is set to consist of nestingLayerId[i] for all values of i in the range of 0 to nesting_num_layers_minus1, inclusive. When the bitstream_subset_flag syntax element is equal to 0, the nested SEI messages apply to NAL units with nuh_reserved_zero_6bits included in the set nestingLayerIdSet[0] or equal to the nuh_reserved_zero_6bits of the current SEI NAL unit, and with nuh_temporal_id_plus1 in the range from the nuh_temporal_id_plus1 of the current SEI NAL unit to maxTemporalId[0] + 1, inclusive. When the bitstream_subset_flag syntax element is equal to 1, the nested SEI messages apply to the output of the sub-bitstream extraction process of sub-clause 10.1 of HEVC Working Draft 8 with inputs tIdTarget equal to maxTemporalId[i] and targetDecLayerIdSet equal to nestingLayerIdSet[i], for each value of i in the range of 0 to nesting_num_ops_minus1, inclusive; and when the default_op_applicable_flag syntax element is equal to 1, the nested SEI messages also apply to the default sub-bitstream. Each extracted sub-bitstream may result from removing all NAL units with temporal identifiers greater than maxTemporalId[i] or with layer identifiers not in nestingLayerIdSet[i], for i in the range of 0 to nesting_num_ops_minus1.
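For the case where bitstream_subset_flag is equal to 0, the applicability condition above can be written as a simple predicate over the header fields of a candidate NAL unit. This is a sketch only; the function and parameter names are hypothetical.

```python
def nested_sei_applies(nal_layer_id, nal_temporal_id_plus1,
                       sei_layer_id, sei_temporal_id_plus1,
                       nesting_layer_id_set, max_temporal_id):
    """Return True if a nested SEI message applies to the given NAL unit
    when bitstream_subset_flag is equal to 0.

    Layer condition: the NAL unit's layer id is in nestingLayerIdSet[0]
    or equals the layer id of the current SEI NAL unit.
    Temporal condition: nuh_temporal_id_plus1 lies between that of the
    SEI NAL unit and maxTemporalId[0] + 1, inclusive.
    """
    layer_ok = nal_layer_id in nesting_layer_id_set or nal_layer_id == sei_layer_id
    temporal_ok = sei_temporal_id_plus1 <= nal_temporal_id_plus1 <= max_temporal_id + 1
    return layer_ok and temporal_ok
```

For example, with an SEI NAL unit in layer 1 with nuh_temporal_id_plus1 equal to 1, a layer set of {3, 5}, and maxTemporalId[0] equal to 2, a NAL unit in layer 3 with nuh_temporal_id_plus1 equal to 2 is covered, while a NAL unit in layer 4 is not.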
In this way, for at least one respective operation point of the plurality of operation points to which the nested SEI messages apply, a device (e.g., video encoder 20, video decoder 30, or another device such as a content delivery network device) may decode, from the scalable nesting SEI message, a first syntax element (e.g., nesting_max_temporal_id_plus1[i]) and a second syntax element (e.g., nesting_op_idx[i]). Furthermore, the device may determine, based at least in part on the first syntax element, a maximum temporal identifier of the respective operation point. The device may determine, based at least in part on the second syntax element, a set of layer identifiers of the respective operation point.
In the example of Table 4, the nesting_zero_bit syntax element is equal to 0. The nesting_zero_bit syntax element may serve to ensure that the scalable nesting SEI message is byte aligned. The scalable nesting SEI message may be byte aligned when the number of bits in the scalable nesting SEI message is divisible by 8.
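As a small illustration of the alignment rule, the number of zero bits needed to reach the next byte boundary follows directly from the bit count written so far. The helper name below is hypothetical, and the exact position at which Table 4 inserts nesting_zero_bit elements is not modeled.

```python
def zero_bits_for_byte_alignment(bits_written):
    """Number of nesting_zero_bit elements (each equal to 0) needed so
    that the total bit count becomes divisible by 8."""
    return (-bits_written) % 8
```

A message that has used 13 bits would need 3 zero bits, and one that has used 16 bits would need none.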
Furthermore, in the example of Table 4, the sei_message() syntax structures include SEI messages. Thus, a device may decode, from the scalable nesting SEI message, a plurality of nested SEI messages encapsulated by the scalable nesting SEI message. Each of the nested SEI messages may apply to all of the operation points identified by the plurality of syntax elements (e.g., nesting_max_temporal_id_plus1[i], nesting_op_idx[i], and so on).
In an alternative example, the scalable nesting SEI message may conform to the example syntax of Table 5 below. In the example syntax of Table 5, in accordance with one or more techniques of the present invention, the scalable nesting SEI message may increase the efficiency of explicit coding of layer identifiers through the use of coding flags.
Table 5: Scalable nesting SEI message
In the example of Table 5, italicized portions show differences from HEVC Working Draft 8. As shown in Table 5, the bitstream_subset_flag, default_op_applicable_flag, nesting_num_ops_minus1, nesting_max_temporal_id_plus1, nesting_op_idx[i], and nesting_zero_bit syntax elements may have the same semantics as those described above with respect to Table 4.
Furthermore, in the example of Table 5, the variable minLayerId is set equal to nuh_reserved_zero_6bits + 1, where nuh_reserved_zero_6bits is from the NAL unit header of the current SEI NAL unit. A nesting_op_flag syntax element equal to 0 specifies that the set nestingLayerIdSet[0] is specified by the all_layers_flag syntax element and, when present, the nesting_layer_id_included_flag[i] syntax elements for all values of i in the range of 0 to nesting_max_layer_id - minLayerId - 1, inclusive. A nesting_op_flag syntax element equal to 1 specifies that the set nestingLayerIdSet[i] is specified by the nesting_op_idx[i] syntax element. When the nesting_op_flag syntax element is not present, the value of nesting_op_flag is inferred to be equal to 1.
In the example of Table 5, an all_layers_flag syntax element equal to 0 specifies that the set nestingLayerIdSet[0] consists of nestingLayerId[i] for all values of i in the range of 0 to nesting_max_layer_id - minLayerId, inclusive. The variable nestingLayerId[i] is described below. In the example of Table 5, an all_layers_flag syntax element equal to 1 specifies that the set nestingLayerIdSet consists of all values of nuh_reserved_zero_6bits present in the current access unit that are greater than or equal to the nuh_reserved_zero_6bits syntax element of the current SEI NAL unit.
Furthermore, in the example of Table 5, the nesting_max_layer_id syntax element specifies the greatest value of nuh_reserved_zero_6bits in the set nestingLayerIdSet[0]. A nesting_layer_id_included_flag[i] syntax element equal to 1 specifies that the value of nuh_reserved_zero_6bits equal to i + minLayerId is included in the set nestingLayerIdSet[0]. A nesting_layer_id_included_flag[i] syntax element equal to 0 specifies that the value of nuh_reserved_zero_6bits equal to i + minLayerId is not included in the set nestingLayerIdSet[0].
The variable nestingNumLayersMinus1 and the variables nestingLayerId[i], for i in the range of 0 to nestingNumLayersMinus1, inclusive, may be derived as follows:
for (i = 0, j = 0; i < nesting_max_layer_id; i++)
    if (nesting_layer_id_included_flag[i])
        nestingLayerId[j++] = i + minLayerId
nestingLayerId[j] = nesting_max_layer_id
nestingNumLayersMinus1 = j
The set nestingLayerIdSet[0] may be set to consist of nestingLayerId[i] for all values of i in the range of 0 to nestingNumLayersMinus1, inclusive.
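The flag-based derivation for Table 5 can be sketched as follows. The function name is hypothetical, and the sketch assumes the flag list holds exactly one flag for each candidate layer identifier from minLayerId up to nesting_max_layer_id - 1, which matches the index range given for nesting_layer_id_included_flag[i]; the maximum layer identifier itself is always appended.

```python
def nesting_layer_ids_from_flags(nuh_reserved_zero_6bits, nesting_max_layer_id,
                                 nesting_layer_id_included_flag):
    """Rebuild nestingLayerId[] and nestingNumLayersMinus1 from per-layer flags.

    Assumption: the flag list covers layer ids minLayerId through
    nesting_max_layer_id - 1, one flag per candidate layer id.
    """
    min_layer_id = nuh_reserved_zero_6bits + 1
    nesting_layer_id = [
        i + min_layer_id
        for i, flag in enumerate(nesting_layer_id_included_flag)
        if flag
    ]
    nesting_layer_id.append(nesting_max_layer_id)  # always part of the set
    nesting_num_layers_minus1 = len(nesting_layer_id) - 1
    return nesting_layer_id, nesting_num_layers_minus1


# Illustrative values: SEI NAL unit in layer 1, so minLayerId is 2.
# Flags cover layer ids 2, 3, 4, 5; the maximum layer id is 6.
ids, num_minus1 = nesting_layer_ids_from_flags(1, 6, [1, 0, 0, 1])
```

With these sample inputs the set contains layer identifiers 2, 5, and 6, and nestingNumLayersMinus1 is 2.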
When the bitstream_subset_flag syntax element is equal to 0, the nested SEI messages may apply to NAL units with nuh_reserved_zero_6bits included in the set nestingLayerIdSet[0] or equal to the nuh_reserved_zero_6bits syntax element of the current SEI NAL unit, and with nuh_temporal_id_plus1 in the range from the nuh_temporal_id_plus1 syntax element of the current SEI NAL unit to maxTemporalId[0] + 1, inclusive.
When the bitstream_subset_flag syntax element of the scalable nesting SEI message is equal to 1, the nested SEI messages may apply to the output of the sub-bitstream extraction process of sub-clause 10.1 with inputs tIdTarget equal to maxTemporalId[i] and targetDecLayerIdSet equal to nestingLayerIdSet[i], for each value of i in the range of 0 to nesting_num_ops_minus1, inclusive; and when default_op_applicable_flag is equal to 1, the nested SEI messages also apply to the default sub-bitstream.
Fig. 3 is a block diagram illustrating an example video decoder 30 that is configured to implement the techniques of the present invention. Fig. 3 is provided for purposes of explanation and does not limit the techniques as broadly exemplified and described in the present invention. For purposes of explanation, the present invention describes video decoder 30 in the context of HEVC coding. However, the techniques of the present invention may be applicable to other coding standards or methods.
In the example of Fig. 3, video decoder 30 includes an entropy decoding unit 150, a prediction processing unit 152, an inverse quantization unit 154, an inverse transform processing unit 156, a reconstruction unit 158, a filter unit 160, and a decoded picture buffer 162. Prediction processing unit 152 includes a motion compensation unit 164 and an intra-prediction processing unit 166. In other examples, video decoder 30 may include more, fewer, or different functional components.
A coded picture buffer (CPB) 151 may receive and store encoded video data (e.g., NAL units) of a bitstream. Entropy decoding unit 150 may receive NAL units from CPB 151 and parse the NAL units to decode syntax elements. Entropy decoding unit 150 may entropy decode entropy-encoded syntax elements in the NAL units. Prediction processing unit 152, inverse quantization unit 154, inverse transform processing unit 156, reconstruction unit 158, and filter unit 160 may generate decoded video data based on the syntax elements extracted from the bitstream.
The NAL units of the bitstream may include coded slice NAL units. As part of decoding the bitstream, entropy decoding unit 150 may extract and entropy decode syntax elements from the coded slice NAL units. Each of the coded slices may include a slice header and slice data. The slice header may contain syntax elements pertaining to a slice. The syntax elements in the slice header may include a syntax element that identifies a PPS associated with the picture that contains the slice.
In addition to decoding syntax elements from the bitstream, video decoder 30 may perform a reconstruction operation on a non-partitioned CU. To perform the reconstruction operation on a non-partitioned CU, video decoder 30 may perform a reconstruction operation on each TU of the CU. By performing the reconstruction operation for each TU of the CU, video decoder 30 may reconstruct the residual blocks of the CU.
As part of performing a reconstruction operation on a TU of a CU, inverse quantization unit 154 may inverse quantize (i.e., de-quantize) the coefficient blocks associated with the TU. Inverse quantization unit 154 may use a QP value associated with the CU of the TU to determine a degree of quantization and, likewise, a degree of inverse quantization for inverse quantization unit 154 to apply. That is, the compression ratio, i.e., the ratio of the number of bits used to represent the original sequence to the number used to represent the compressed sequence, may be controlled by adjusting the value of the QP used when quantizing transform coefficients. The compression ratio may also depend on the method of entropy coding employed.
After inverse quantization unit 154 inverse quantizes a coefficient block, inverse transform processing unit 156 may apply one or more inverse transforms to the coefficient block in order to generate a residual block associated with the TU. For example, inverse transform processing unit 156 may apply an inverse DCT, an inverse integer transform, an inverse Karhunen-Loeve transform (KLT), an inverse rotational transform, an inverse directional transform, or another inverse transform to the coefficient block.
If a PU is encoded using intra prediction, intra-prediction processing unit 166 may perform intra prediction to generate predictive blocks for the PU. Intra-prediction processing unit 166 may use an intra prediction mode to generate the predictive luma, Cb, and Cr blocks for the PU based on the prediction blocks of spatially neighboring PUs. Intra-prediction processing unit 166 may determine the intra prediction mode for the PU based on one or more syntax elements decoded from the bitstream.
Prediction processing unit 152 may construct a first reference picture list (RefPicList0) and a second reference picture list (RefPicList1) based on syntax elements extracted from the bitstream. Furthermore, if a PU is encoded using inter prediction, entropy decoding unit 150 may extract motion information for the PU. Motion compensation unit 164 may determine, based on the motion information of the PU, one or more reference regions for the PU. Motion compensation unit 164 may generate, based on sample blocks at the one or more reference blocks for the PU, predictive luma, Cb, and Cr blocks for the PU.
Reconstruction unit 158 may use the luma, Cb, and Cr transform blocks associated with the TUs of a CU and the predictive luma, Cb, and Cr blocks of the PUs of the CU (i.e., either intra-prediction data or inter-prediction data, as applicable) to reconstruct the luma, Cb, and Cr coding blocks of the CU. For example, reconstruction unit 158 may add samples of the luma, Cb, and Cr transform blocks to corresponding samples of the predictive luma, Cb, and Cr blocks to reconstruct the luma, Cb, and Cr coding blocks of the CU.
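The sample-wise addition performed by the reconstruction unit can be illustrated for a single block as follows. This is a simplified sketch with a hypothetical function name and made-up sample values. A practical decoder would typically also clip the result to the valid sample range; that step is not described in the passage above and is omitted here.

```python
def reconstruct_block(transform_block, predictive_block):
    """Add each residual sample of the transform block to the
    corresponding sample of the predictive block."""
    return [
        [residual + predicted for residual, predicted in zip(res_row, pred_row)]
        for res_row, pred_row in zip(transform_block, predictive_block)
    ]


# Made-up 2x2 residual and prediction samples for one colour component.
residual = [[1, -2], [0, 3]]
prediction = [[100, 100], [50, 50]]
coding_block = reconstruct_block(residual, prediction)
```

The same operation would be applied separately to the luma, Cb, and Cr blocks.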
Filter unit 160 may perform a deblocking operation to reduce blocking artifacts associated with the luma, Cb, and Cr coding blocks of the CU. Video decoder 30 may store the luma, Cb, and Cr coding blocks of the CU in decoded picture buffer 162. Decoded picture buffer 162 may provide reference pictures for subsequent motion compensation, intra prediction, and presentation on a display device, such as display device 32 of Fig. 1. For instance, video decoder 30 may perform, based on the luma, Cb, and Cr blocks in decoded picture buffer 162, intra prediction or inter prediction operations on PUs of other CUs. In this way, video decoder 30 may extract, from the bitstream, transform coefficient levels of a significant luma coefficient block; inverse quantize the transform coefficient levels; apply a transform to the transform coefficient levels to generate a transform block; generate, based at least in part on the transform block, a coding block; and output the coding block for display.
Fig. 4 is a flowchart illustrating an example operation 200 of video encoder 20, in accordance with one or more techniques of the present invention. In the example of Fig. 4, video encoder 20 may generate a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters (202). For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters in addition to a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the bitstream, where the common set of HRD parameters is common to all sub-layers of the bitstream. Furthermore, video encoder 20 may signal the VPS in the bitstream (204).
Fig. 5 is a flowchart illustrating an example operation 250 of a device, in accordance with one or more techniques of the present invention. Operation 250 may be performed by video encoder 20, video decoder 30, or another device. As illustrated in the example of Fig. 5, the device may decode, from a bitstream, a VPS that includes a plurality of HRD parameter syntax structures, each of which includes HRD parameters (252). For each respective HRD parameter syntax structure in the plurality of HRD parameter syntax structures, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameter syntax structure include a common set of HRD parameters.
Furthermore, the device may perform an operation using the HRD parameters of at least one of the HRD parameter syntax structures (254). In some examples, the bitstream may comprise an operation point representation of a particular operation point, a particular HRD parameter syntax structure may be applicable to the particular operation point, and the device may perform the operation using the HRD parameters of the particular HRD parameter syntax structure. For example, the device may use the HRD parameters to perform a bitstream conformance test that determines whether an operation point applicable to the HRD parameter syntax structure conforms to a video coding standard (e.g., HEVC). In another example, the device may use the HRD parameters to perform a decoder conformance test.
The common set of HRD parameters may be common to all sub-layers of the bitstream. In some examples, the HRD parameters of each HRD parameter syntax structure include a set of sub-layer-specific HRD parameters that is specific to a particular sub-layer of the bitstream. In some examples, each of the sets of sub-layer-specific HRD parameters includes a syntax element (e.g., a syntax element indicating a temporal distance between the HRD output times of any two consecutive pictures in output order, or a syntax element indicating a number of alternative coded picture buffer specifications in the bitstream of a coded video sequence). In some examples, when the device determines that a particular HRD parameter syntax structure does not include a common set of HRD parameters, the device may perform the operation using a common set of HRD parameters associated with a previous HRD parameter syntax structure and the set of sub-layer-specific HRD parameters of the particular HRD parameter syntax structure.
Fig. 6 is a flowchart illustrating an example operation 300 of video encoder 20, in accordance with one or more techniques of the present invention. As illustrated in the example of Fig. 6, video encoder 20 may generate a scalable nesting SEI message that includes a plurality of syntax elements identifying a plurality of operation points to which a nested SEI message encapsulated by the scalable nesting SEI message applies (302). Furthermore, video encoder 20 may signal the scalable nesting SEI message in a bitstream (304).
Fig. 7 is a flowchart illustrating an example operation 350 of a device, in accordance with one or more techniques of the present invention. Video encoder 20, video decoder 30, or another device may perform operation 350. As illustrated in the example of Fig. 7, the device may decode, from a scalable nesting SEI message, a plurality of syntax elements that identify a plurality of operation points to which a nested SEI message encapsulated by the scalable nesting SEI message applies (352). In some examples, the device may decode, from the scalable nesting SEI message, a syntax element (e.g., nesting_num_ops_minus1) indicating whether the scalable nesting SEI message includes the plurality of syntax elements that identify the operation points.
Furthermore, the device may use one or more syntax elements of the nested SEI message to perform an operation regarding any of the operation points to which the nested SEI message applies (354). For example, the device may use syntax elements of the nested SEI message in a bitstream conformance test that determines whether any of the operation points to which the nested SEI message applies conform to a video coding standard (e.g., HEVC). In another example, the device may use syntax elements of the nested SEI message to perform a decoder conformance test.
Fig. 8 is a flowchart illustrating an example operation 400 of video encoder 20, in accordance with one or more techniques of the present invention. As illustrated in the example of Fig. 8, video encoder 20 may include, in a scalable nesting SEI message encapsulated by an SEI NAL unit, a syntax element (e.g., default_op_applicable_flag) that indicates whether a nested SEI message encapsulated by the scalable nesting SEI message applies to a default sub-bitstream (402). The default sub-bitstream is an operation point representation of the operation point defined by the layer identifier specified in the NAL unit header of the SEI NAL unit and the temporal identifier specified in the NAL unit header. A first syntax element in the NAL unit header (e.g., nuh_reserved_zero_6bits) may indicate the layer identifier, and a second syntax element in the NAL unit header (e.g., nuh_temporal_id_plus1) may indicate the temporal identifier.
In the example of Fig. 8, video encoder 20 may include, in the scalable nesting SEI message, one or more additional syntax elements that identify a temporal identifier of an additional operation point and a maximum layer identifier of the additional operation point (404). Furthermore, video encoder 20 may signal the scalable nesting SEI message in a bitstream (406). In some examples, the syntax element that indicates whether the nested SEI message encapsulated by the scalable nesting SEI message applies to the default sub-bitstream may be referred to as a first syntax element, and video encoder 20 may include a second syntax element (e.g., bitstream_subset_flag) in the scalable nesting SEI message. The second syntax element may indicate whether the nested SEI messages encapsulated by the scalable nesting SEI message apply to a sub-bitstream extracted from the bitstream or to specific layers and sub-layers of the bitstream. Video encoder 20 may include the first syntax element only when the second syntax element indicates that the nested SEI messages apply to a sub-bitstream extracted from the bitstream.
Fig. 9 is a flowchart illustrating an example operation 450 of a device, in accordance with one or more techniques of this disclosure. Video encoder 20, video decoder 30, or another device may perform operation 450. As illustrated in the example of Fig. 9, the device may determine, based at least in part on a first syntax element (e.g., bitstream_subset_flag) of a scalable nesting SEI message, whether a nested SEI message encapsulated by the scalable nesting SEI message applies to a sub-bitstream extracted from a bitstream (452). In response to determining that the nested SEI message encapsulated by the scalable nesting SEI message applies to a sub-bitstream extracted from the bitstream ("Yes" of 452), the device may decode a default operation point syntax element (e.g., default_op_applicable_flag) in the scalable nesting SEI message (454). The default operation point syntax element may indicate whether the nested SEI message encapsulated by the scalable nesting SEI message is applicable to a default sub-bitstream.
The default sub-bitstream may be an operation point representation of an operation point defined by a layer identifier specified in the NAL unit header of the SEI NAL unit and a temporal identifier specified in the NAL unit header. In some examples, a first syntax element in the NAL unit header (e.g., nuh_reserved_zero_6bits) indicates the layer identifier, and a second syntax element in the NAL unit header (e.g., nuh_reserved_temporal_id_plus1) indicates the temporal identifier. The default sub-bitstream may be a subset of the bitstream, and the default sub-bitstream does not include VCL NAL units of the bitstream that have either a layer identifier greater than the layer identifier indicated by the first syntax element of the NAL unit header or a temporal identifier greater than the temporal identifier indicated by the second syntax element of the NAL unit header.
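The exclusion rule for the default sub-bitstream can be illustrated with a short filter. This is a sketch under assumptions: the `NalUnit` record and the function `default_sub_bitstream` are hypothetical, the identifiers are taken as already parsed from the NAL unit header, and non-VCL NAL units are simply passed through because the text above only states which VCL NAL units are excluded.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class NalUnit:
    """Hypothetical, already-parsed view of a NAL unit header."""
    layer_id: int     # layer identifier (signalled via nuh_reserved_zero_6bits)
    temporal_id: int  # temporal identifier (signalled via nuh_reserved_temporal_id_plus1)
    is_vcl: bool      # True for VCL NAL units


def default_sub_bitstream(nal_units: Iterable[NalUnit], sei_nal: NalUnit) -> List[NalUnit]:
    """Drop VCL NAL units above the layer or temporal identifier of the SEI NAL unit."""
    kept = []
    for nal in nal_units:
        exceeds = nal.layer_id > sei_nal.layer_id or nal.temporal_id > sei_nal.temporal_id
        # Assumption: only VCL NAL units are filtered; other NAL units are kept as-is.
        if nal.is_vcl and exceeds:
            continue
        kept.append(nal)
    return kept
```

For instance, with an SEI NAL unit carrying layer identifier 0 and temporal identifier 1, a VCL NAL unit with layer identifier 1 or temporal identifier 2 is left out of the result.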
Furthermore, the device may determine, based at least in part on a syntax element (e.g., default_op_applicable_flag) in the scalable nesting SEI message encapsulated by the SEI NAL unit, whether the nested SEI message encapsulated by the scalable nesting SEI message is applicable to the default sub-bitstream of the bitstream (456). In some examples, the scalable nesting SEI message encapsulates a plurality of nested SEI messages. In such examples, the device may determine, based on the syntax element (e.g., default_op_applicable_flag), whether each of the nested SEI messages in the scalable nesting SEI message is applicable to the default sub-bitstream.
When the nested SEI message is applicable to the default sub-bitstream ("Yes" of 456), the device may use the nested SEI message in an operation on the default sub-bitstream (458). For example, the nested SEI message may include a set of HRD parameters. In this example, the device may use the HRD parameters in the nested SEI message in an operation that tests whether the default sub-bitstream conforms to a video coding standard (e.g., HEVC). In another example, the device may use the HRD parameters in the nested SEI message in a decoder conformance test. In another example, the device may use the nested SEI message in a decoding operation on the default sub-bitstream. In another example, an initial CPB removal delay may be used to guide a system in setting up an appropriate initial end-to-end delay, and DPB output times may be used to derive RTP timestamps when the video is transported over RTP.
Otherwise, when the nested SEI message is not applicable to the default sub-bitstream ("No" of 456), or when the scalable nesting SEI message does not apply to a sub-bitstream extracted from the bitstream ("No" of 452), the device does not use the nested SEI message in an operation on the default sub-bitstream (460). For example, the device may determine, based on one or more additional syntax elements in the scalable nesting SEI message (e.g., nesting_max_temporal_id_plus1[i], nesting_op_idx[i], etc.), a temporal identifier of a second operation point and a maximum layer identifier of the second operation point. In this example, the device may use the nested SEI message in an operation on an additional sub-bitstream, the additional sub-bitstream being an operation point representation of the second operation point.
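The branching of Fig. 9 can be summarised in a few lines. The function `operation_450` and its callable parameter are hypothetical; the callable stands in for decoding default_op_applicable_flag, so that the sketch reflects that the flag is decoded only on the "Yes" branch of 452. The return value is simply the reference numeral of the step that is reached.

```python
from typing import Callable


def operation_450(bitstream_subset_flag: bool,
                  decode_default_op_applicable_flag: Callable[[], bool]) -> int:
    """Return 458 if the nested SEI message is used on the default sub-bitstream, else 460."""
    # Step 452: does the nested SEI message apply to an extracted sub-bitstream?
    if not bitstream_subset_flag:
        return 460  # "No" of 452: not used on the default sub-bitstream
    # Step 454: the default operation point flag is decoded only on this branch.
    default_op_applicable_flag = decode_default_op_applicable_flag()
    # Step 456: is the nested SEI message applicable to the default sub-bitstream?
    return 458 if default_op_applicable_flag else 460
```

Passing the flag as a callable rather than a value is a design choice of this sketch; it makes the conditional decoding of step 454 visible and testable.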
In one or more examples, the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over a computer-readable medium as one or more instructions or code, and executed by a hardware-based processing unit. Computer-readable media may include computer-readable storage media, which correspond to tangible media such as data storage media, or communication media, including any medium that facilitates transfer of a computer program from one place to another, e.g., according to a communication protocol. In this manner, computer-readable media generally may correspond to (1) tangible computer-readable storage media that are non-transitory or (2) a communication medium such as a signal or carrier wave. Data storage media may be any available media that can be accessed by one or more computers or one or more processors to retrieve instructions, code, and/or data structures for implementation of the techniques described in this disclosure. A computer program product may include a computer-readable medium.
By way of example, and not limitation, such computer-readable storage media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, flash memory, or any other medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer. Also, any connection is properly termed a computer-readable medium. For example, if instructions are transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. It should be understood, however, that computer-readable storage media and data storage media do not include connections, carrier waves, signals, or other transitory media, but are instead directed to non-transitory, tangible storage media. Disk and disc, as used herein, include compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk, and Blu-ray disc, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
Instructions may be executed by one or more processors, such as one or more digital signal processors (DSPs), general purpose microprocessors, application-specific integrated circuits (ASICs), field-programmable logic arrays (FPGAs), or other equivalent integrated or discrete logic circuitry. Accordingly, the term "processor," as used herein, may refer to any of the foregoing structures or any other structure suitable for implementation of the techniques described herein. In addition, in some aspects, the functionality described herein may be provided within dedicated hardware and/or software modules configured for encoding and decoding, or incorporated in a combined codec. Also, the techniques could be fully implemented in one or more circuits or logic elements.
The techniques of this disclosure may be implemented in a wide variety of devices or apparatuses, including a wireless handset, an integrated circuit (IC), or a set of ICs (e.g., a chipset). Various components, modules, or units are described in this disclosure to emphasize functional aspects of devices configured to perform the disclosed techniques, but do not necessarily require realization by different hardware units. Rather, as described above, various units may be combined in a codec hardware unit or provided by a collection of interoperative hardware units, including one or more processors as described above, in conjunction with suitable software and/or firmware.
Various examples have been described. These and other examples are within the scope of the following claims.

Claims (32)

1. A method of decoding video data, the method comprising:
decoding, from an encoded video bitstream, a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of the encoded video bitstream, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
performing an operation using the HRD parameters of at least one of the HRD parameters syntax structures.
2. The method of claim 1, further comprising:
determining that a particular HRD parameters syntax structure in the plurality of HRD parameters syntax structures does not include the common set of HRD parameters, and
wherein performing the operation using the HRD parameters of at least one of the HRD parameters syntax structures comprises performing the operation using the common set of HRD parameters included in a previous HRD parameters syntax structure in the plurality of HRD parameters syntax structures and additional HRD parameters of the particular HRD parameters syntax structure.
3. The method of claim 1, wherein each of the HRD parameters syntax structures always includes a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream.
4. The method of claim 1, wherein a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS.
5. The method of claim 1, wherein at least one HRD parameters syntax structure of the HRD parameters syntax structures included in the VPS does not include the common set of HRD parameters.
6. The method of claim 1, wherein each of the HRD parameters syntax structures is applicable to an operation point of the encoded video bitstream.
7. The method of claim 1, wherein at least one of the HRD parameters syntax structures is applicable to a plurality of operation points of the encoded video bitstream.
8. A video decoding device comprising:
a memory configured to store data, the data comprising an encoded video bitstream; and
one or more processors configured to:
decode, from the encoded video bitstream, a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of the encoded video bitstream, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
perform an operation using the HRD parameters of at least one of the HRD parameters syntax structures.
9. The video decoding device of claim 8, wherein the one or more processors are further configured to:
determine that a particular HRD parameters syntax structure in the plurality of HRD parameters syntax structures does not include the common set of HRD parameters, and
perform the operation using the common set of HRD parameters included in a previous HRD parameters syntax structure in the plurality of HRD parameters syntax structures and additional HRD parameters of the particular HRD parameters syntax structure.
10. The video decoding device of claim 8, wherein each of the HRD parameters syntax structures always includes a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream.
11. The video decoding device of claim 8, wherein a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS.
12. The video decoding device of claim 8, wherein at least one HRD parameters syntax structure of the HRD parameters syntax structures included in the VPS does not include the common set of HRD parameters.
13. The video decoding device of claim 8, wherein each of the HRD parameters syntax structures is applicable to an operation point of the encoded video bitstream.
14. The video decoding device of claim 8, wherein at least one of the HRD parameters syntax structures is applicable to a plurality of operation points of the encoded video bitstream.
15. A video decoding device comprising:
means for decoding, from an encoded video bitstream, a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of the encoded video bitstream, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
means for performing an operation using the HRD parameters of at least one of the HRD parameters syntax structures.
16. The video decoding device of claim 15, wherein:
a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS, and
each of the HRD parameters syntax structures is applicable to an operation point of the encoded video bitstream.
17. A non-transitory computer-readable storage medium storing instructions that, when executed by a video decoding device, configure the video decoding device to:
decode, from an encoded video bitstream, a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of the encoded video bitstream, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
perform an operation using the HRD parameters of at least one of the HRD parameters syntax structures.
18. The non-transitory computer-readable storage medium of claim 17, wherein:
a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS, and
each of the HRD parameters syntax structures is applicable to an operation point of the encoded video bitstream.
19. A method of encoding video data, the method comprising:
generating a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of an encoded video bitstream that comprises encoded pictures of the video data, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
signaling the VPS in the encoded video bitstream.
20. The method of claim 19, wherein each of the HRD parameters syntax structures always includes a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream.
21. The method of claim 19, wherein a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS.
22. The method of claim 19, wherein at least one HRD parameters syntax structure of the HRD parameters syntax structures included in the VPS does not include the common set of HRD parameters.
23. A video encoding device comprising:
a data storage medium configured to store video data; and
one or more processors configured to:
generate a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of an encoded video bitstream that comprises encoded pictures of the video data, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
signal the VPS in the encoded video bitstream.
24. The video encoding device of claim 23, wherein each of the HRD parameters syntax structures always includes a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream.
25. The video encoding device of claim 23, wherein a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS.
26. The video encoding device of claim 23, wherein at least one HRD parameters syntax structure of the HRD parameters syntax structures included in the VPS does not include the common set of HRD parameters.
27. A video encoding device comprising:
means for generating a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of an encoded video bitstream that comprises encoded pictures of video data, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
means for signaling the VPS in the encoded video bitstream.
28. The video encoding device of claim 27, wherein:
a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS, and
each of the HRD parameters syntax structures is applicable to an operation point of the encoded video bitstream.
29. The video encoding device of claim 27, wherein each of the HRD parameters syntax structures always includes a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream.
30. A non-transitory computer-readable storage medium storing instructions that, when executed by a video encoding device, configure the video encoding device to:
generate a video parameter set (VPS) that includes a plurality of hypothetical reference decoder (HRD) parameters syntax structures that each include HRD parameters, wherein for each respective HRD parameters syntax structure in the plurality of HRD parameters syntax structures other than a first HRD parameters syntax structure included in the VPS, the VPS further includes a syntax element indicating whether the HRD parameters of the respective HRD parameters syntax structure include a common set of HRD parameters, wherein the common set of HRD parameters is common to all sub-layers of an encoded video bitstream that comprises encoded pictures of video data, and wherein the first HRD parameters syntax structure included in the VPS includes the common set of HRD parameters; and
signal the VPS in the encoded video bitstream.
31. The non-transitory computer-readable storage medium of claim 30, wherein:
a subsequent HRD parameters syntax structure included in the VPS includes a common set of HRD parameters that differs from the common set of HRD parameters included in the first HRD parameters syntax structure included in the VPS, and
each of the HRD parameters syntax structures is applicable to an operation point of the encoded video bitstream.
32. The non-transitory computer-readable storage medium of claim 30, wherein each of the HRD parameters syntax structures always includes a set of sub-layer-specific HRD parameter information specific to a particular sub-layer of the encoded video bitstream.
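By way of illustration only, and without limiting the claims, the arrangement of claims 1 and 2 might be modelled as below. Everything named here is hypothetical: the class `HrdSyntaxStructure`, the function `resolve_common_sets`, and the dictionary keys used for parameters are not taken from any standard. The sketch also assumes that "a previous HRD parameters syntax structure" means the nearest preceding structure that carries a common set.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class HrdSyntaxStructure:
    """Hypothetical model of one HRD parameters syntax structure in a VPS."""
    # Common set of HRD parameters; None when the VPS indicates it is not included.
    common: Optional[Dict[str, int]]
    # Sub-layer-specific HRD parameter information, always present.
    sub_layer: List[Dict[str, int]]


def resolve_common_sets(structures: List[HrdSyntaxStructure]) -> List[Dict[str, int]]:
    """Return, for each structure, the common set that applies to it."""
    if not structures or structures[0].common is None:
        # The first structure in the VPS includes the common set.
        raise ValueError("first HRD parameters syntax structure must carry a common set")
    resolved: List[Dict[str, int]] = []
    current = structures[0].common
    for s in structures:
        if s.common is not None:
            current = s.common  # this structure carries its own common set
        # Otherwise reuse the common set of the nearest preceding structure
        # that carried one (assumption stated in the lead-in).
        resolved.append(current)
    return resolved
```

A structure that omits the common set thus costs only its sub-layer-specific information, while still resolving to a complete set of HRD parameters.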
CN201380051430.XA 2012-10-08 2013-09-20 Hypothetical reference decoder parameter syntax structure Active CN104704842B (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201261711098P 2012-10-08 2012-10-08
US61/711,098 2012-10-08
US13/954,712 US9319703B2 (en) 2012-10-08 2013-07-30 Hypothetical reference decoder parameter syntax structure
US13/954,712 2013-07-30
PCT/US2013/060906 WO2014058598A1 (en) 2012-10-08 2013-09-20 Hypothetical reference decoder parameter syntax structure

Publications (2)

Publication Number Publication Date
CN104704842A CN104704842A (en) 2015-06-10
CN104704842B true CN104704842B (en) 2018-04-20

Family

ID=50432659

Family Applications (3)

Application Number Title Priority Date Filing Date
CN201380051435.2A Active CN104685892B (en) 2012-10-08 2013-09-20 Method, apparatus, and readable storage medium for processing video data
CN201380051430.XA Active CN104704842B (en) 2012-10-08 2013-09-20 Hypothetical reference decoder parameter syntax structure
CN201380051423.XA Active CN104685891B (en) 2012-10-08 2013-09-20 Identification of operation points applicable to nested supplemental enhancement information messages in video coding

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN201380051435.2A Active CN104685892B (en) 2012-10-08 2013-09-20 Method, apparatus, and readable storage medium for processing video data

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201380051423.XA Active CN104685891B (en) 2012-10-08 2013-09-20 Identification of operation points applicable to nested supplemental enhancement information messages in video coding

Country Status (30)

Country Link
US (3) US9380317B2 (en)
EP (3) EP2904782B1 (en)
JP (3) JP6062558B2 (en)
KR (3) KR101677867B1 (en)
CN (3) CN104685892B (en)
AP (1) AP3952A (en)
AR (2) AR094449A1 (en)
AU (2) AU2013330372B2 (en)
BR (2) BR112015007763B1 (en)
CA (2) CA2885807C (en)
CO (1) CO7350643A2 (en)
DK (1) DK2904782T3 (en)
EC (1) ECSP15018127A (en)
ES (2) ES2727814T3 (en)
HK (2) HK1207775A1 (en)
HU (2) HUE038492T2 (en)
IL (2) IL237949B (en)
MA (1) MA37971B1 (en)
MX (1) MX341437B (en)
MY (2) MY168739A (en)
PH (2) PH12015500625A1 (en)
PT (1) PT2904782T (en)
RU (2) RU2643463C2 (en)
SA (1) SA515360254B1 (en)
SG (2) SG11201501832UA (en)
SI (1) SI2904782T1 (en)
TW (2) TWI533674B (en)
UA (2) UA116998C2 (en)
WO (3) WO2014058598A1 (en)
ZA (1) ZA201502498B (en)

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104185992A (en) * 2012-02-08 2014-12-03 汤姆逊许可公司 Method and apparatus for using an ultra-low delay mode of a hypothetical reference decoder
US9912941B2 (en) 2012-07-02 2018-03-06 Sony Corporation Video coding system with temporal layers and method of operation thereof
US10110890B2 (en) 2012-07-02 2018-10-23 Sony Corporation Video coding system with low delay and method of operation thereof
US9380317B2 (en) 2012-10-08 2016-06-28 Qualcomm Incorporated Identification of operation points applicable to nested SEI message in video coding
EP2907318A1 (en) 2012-10-09 2015-08-19 Cisco Technology, Inc. Output management of prior decoded pictures at picture format transitions in bitstreams
EP2907308B1 (en) * 2012-10-09 2019-08-14 Cisco Technology, Inc. Providing a common set of parameters for sub-layers of coded video
US9374585B2 (en) * 2012-12-19 2016-06-21 Qualcomm Incorporated Low-delay buffering model in video coding
EP3058735B1 (en) * 2013-10-14 2019-05-08 Nokia Technologies Oy Multi-layer hypothetical reference decoder
CN105981387B (en) * 2013-10-15 2019-09-27 诺基亚技术有限公司 Method, apparatus, and computer-readable storage medium for processing video
KR101930896B1 (en) * 2013-10-22 2018-12-19 후아웨이 테크놀러지 컴퍼니 리미티드 Image decoding device and image decoding method
US10063867B2 (en) * 2014-06-18 2018-08-28 Qualcomm Incorporated Signaling HRD parameters for bitstream partitions
US9918091B2 (en) * 2014-06-20 2018-03-13 Qualcomm Incorporated Systems and methods for assigning a minimum value to a syntax structure in a parameter set
US10432951B2 (en) * 2014-06-24 2019-10-01 Qualcomm Incorporated Conformance and inoperability improvements in multi-layer video coding
US9800898B2 (en) 2014-10-06 2017-10-24 Microsoft Technology Licensing, Llc Syntax structures indicating completion of coded regions
WO2016180486A1 (en) * 2015-05-12 2016-11-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Composite scalable video streaming
US10129558B2 (en) * 2015-09-21 2018-11-13 Qualcomm Incorporated Supplement enhancement information (SEI) messages for high dynamic range and wide color gamut video coding
US10244249B2 (en) 2015-09-21 2019-03-26 Qualcomm Incorporated Fixed point implementation of range adjustment of components in video coding
US10349067B2 (en) * 2016-02-17 2019-07-09 Qualcomm Incorporated Handling of end of bitstream NAL units in L-HEVC file format and improvements to HEVC and L-HEVC tile tracks
US10397443B2 (en) * 2016-03-01 2019-08-27 Qualcomm Incorporated Methods and systems for generating color remapping information supplemental enhancement information messages for video
CA3020757C (en) * 2016-04-13 2024-02-13 Cryos Technologies Inc. Membrane-based foot imaging apparatus including a camera for monitoring foot positioning
UA124806C2 (en) 2016-10-05 2021-11-24 Долбі Леборетеріз Лайсенсінг Корпорейшн Source color volume information messaging
US10728559B2 (en) * 2017-07-07 2020-07-28 Qualcomm Incorporated Precision of computation and signaling of dynamic range adjustment and color remapping information
WO2019065587A1 (en) * 2017-09-29 2019-04-04 Sharp Kabushiki Kaisha Systems and methods for signaling information associated with a constituent picture
KR102284378B1 (en) * 2018-02-07 2021-08-03 엘지전자 주식회사 Method for transmitting signal in wireless communication system supporting unlicensed band and apparatus supporting thereof
CN109905715B (en) * 2019-02-26 2021-07-06 北京世纪好未来教育科技有限公司 Code stream conversion method and system for inserting SEI data
CN113892271A (en) * 2019-05-30 2022-01-04 夏普株式会社 System and method for signaling hypothetical reference decoder parameters in video coding
US12022122B2 (en) * 2019-06-19 2024-06-25 Sharp Kabushiki Kaisha Systems and methods for signaling decoded picture buffer information in video coding
CN114009051B (en) * 2019-06-27 2023-07-18 华为技术有限公司 Hypothetical reference decoder for V-PCC
EP3977746A4 (en) 2019-07-05 2022-08-03 Huawei Technologies Co., Ltd. Video coding bitstream extraction with identifier signaling
JP7425185B2 (en) * 2019-09-24 2024-01-30 華為技術有限公司 Scalable nesting SEI messages for specified layers
JP2022550320A (en) * 2019-09-24 2022-12-01 ホアウェイ・テクノロジーズ・カンパニー・リミテッド Simplifying SEI Message Dependencies in Video Coding
JP7419507B2 (en) * 2019-09-24 2024-01-22 ホアウェイ・テクノロジーズ・カンパニー・リミテッド Scalable nesting SEI messages for OLS
KR20220063260A (en) * 2019-09-24 2022-05-17 후아웨이 테크놀러지 컴퍼니 리미티드 Picture timing and decoding unit information for temporal scalability
CN117528101A (en) * 2019-09-24 2024-02-06 华为技术有限公司 Sequence level HRD parameters
EP4062635A4 (en) 2019-12-26 2022-12-28 ByteDance Inc. Constraints on signaling of video layers in coded bitstreams
WO2021134018A1 (en) 2019-12-26 2021-07-01 Bytedance Inc. Signaling of decoded picture buffer parameters in layered video
KR20220113404A (en) 2019-12-27 2022-08-12 바이트댄스 아이엔씨 Syntax for signaling video subpictures
WO2021142369A1 (en) 2020-01-09 2021-07-15 Bytedance Inc. Signalling of the wavefront parallel processing
KR20230015382A (en) 2020-05-22 2023-01-31 바이트댄스 아이엔씨 Constraints on Picture Types in Video Bitstream Processing
BR122023020999A2 (en) * 2020-05-22 2024-02-27 Ge Video Compression, Llc APPARATUS AND METHOD FOR RECEIVING VIDEO DATA STREAM, VIDEO DATA STREAM, VIDEO ENCODER AND DECODER, AND METHOD FOR ENCODING A VIDEO INTO A VIDEO DATA STREAM
WO2021237123A1 (en) 2020-05-22 2021-11-25 Bytedance Inc. Sei message handling in video sub-bitstream extraction process
AU2022271427B2 (en) * 2020-05-22 2024-01-18 Ge Video Compression, Llc Video encoder, video decoder, methods for encoding and decoding and video data stream for realizing advanced video coding concepts
KR20230020425A (en) 2020-06-09 2023-02-10 바이트댄스 아이엔씨 Scalable Nesting of Supplementary Enhancement Information Messages in Video Coding
CN115699773A (en) 2020-06-09 2023-02-03 字节跳动有限公司 Signaling picture timing information in coded video
JP2023529823A (en) * 2020-06-09 2023-07-12 バイトダンス インコーポレイテッド Sub-bitstream extraction of multilayer video bitstreams
US11962936B2 (en) 2020-09-29 2024-04-16 Lemon Inc. Syntax for dependent random access point indication in video bitstreams
US20230412846A1 (en) * 2020-12-17 2023-12-21 Lg Electronics Inc. Method and device for generating/receiving media file including nal unit information, and method for transmitting media file
US20240205460A1 (en) * 2021-04-12 2024-06-20 Telefonaktiebolaget Lm Ericsson (Publ) Message referencing

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1421096A (en) * 2000-12-04 2003-05-28 皇家菲利浦电子有限公司 Recording arrangement for error-tolerant recording of information signal
CN101485208A (en) * 2006-07-05 2009-07-15 汤姆森许可贸易公司 Methods and apparatus for multi-view video encoding and decoding
CN101960856A (en) * 2008-02-26 2011-01-26 佳能株式会社 Moving image encoding apparatus and moving image encoding method

Family Cites Families (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW262619B (en) 1995-06-06 1995-11-11 United Microelectronics Corp Shrinking device for code table of variable length decoder
TWI259378B (en) 2004-02-26 2006-08-01 Cablesoft Tech Inc An automatic addressing control method for MSO CATV system
US8615038B2 (en) 2004-12-06 2013-12-24 Nokia Corporation Video coding, decoding and hypothetical reference decoder
BRPI0605857A (en) 2005-04-19 2007-12-18 Coding Tech Ab energy-dependent quantization for efficient coding of spatial audio parameters
WO2007042914A1 (en) * 2005-10-11 2007-04-19 Nokia Corporation Efficient decoded picture buffer management for scalable video coding
AU2006300881B2 (en) 2005-10-11 2011-03-17 Nokia Technologies Oy System and method for efficient scalable stream adaptation
TWI432035B (en) * 2006-01-11 2014-03-21 Nokia Corp Backward-compatible aggregation of pictures in scalable video coding
US20070230564A1 (en) * 2006-03-29 2007-10-04 Qualcomm Incorporated Video processing with scalability
US8699583B2 (en) * 2006-07-11 2014-04-15 Nokia Corporation Scalable video coding and decoding
US7860147B2 (en) 2006-08-16 2010-12-28 Harris Corporation Method of communicating and associated transmitter using coded orthogonal frequency division multiplexing (COFDM)
WO2008046243A1 (en) 2006-10-16 2008-04-24 Thomson Licensing Method and device for encoding a data stream, method and device for decoding a data stream, video indexing system and image retrieval system
US20080089411A1 (en) 2006-10-16 2008-04-17 Nokia Corporation Multiple-hypothesis cross-layer prediction
US20080095228A1 (en) * 2006-10-20 2008-04-24 Nokia Corporation System and method for providing picture output indications in video coding
AU2007342468B2 (en) * 2007-01-05 2011-11-24 Interdigital Vc Holdings, Inc. Hypothetical reference decoder for scalable video coding
EP2123044A1 (en) 2007-01-08 2009-11-25 Thomson Licensing Methods and apparatus for video stream splicing
GB0700381D0 (en) * 2007-01-09 2007-02-14 Mitsubishi Electric Inf Tech Generalised Hypothetical Reference Decoder for Scalable Video Coding with Bitstream Rewriting
WO2008126059A2 (en) * 2007-04-17 2008-10-23 Nokia Corporation Feedback based scalable video coding
US8761265B2 (en) * 2007-04-17 2014-06-24 Thomson Licensing Hypothetical reference decoder for multiview video coding
US20100142613A1 (en) 2007-04-18 2010-06-10 Lihua Zhu Method for encoding video data in a scalable manner
KR101663438B1 (en) * 2007-04-18 2016-10-06 톰슨 라이센싱 Coding systems
US20090003431A1 (en) * 2007-06-28 2009-01-01 Lihua Zhu Method for encoding video data in a scalable manner
KR20100030648A (en) 2007-06-26 2010-03-18 노키아 코포레이션 System and method for indicating temporal layer switching points
US8369415B2 (en) * 2008-03-06 2013-02-05 General Instrument Corporation Method and apparatus for decoding an enhanced video stream
FR2932050B1 (en) 2008-06-03 2010-05-21 Canon Kk METHOD AND DEVICE FOR TRANSMITTING VIDEO DATA
RU2492585C2 (en) 2008-07-16 2013-09-10 Нокиа Корпорейшн Method and apparatus for track and track subset grouping
WO2010021665A1 (en) 2008-08-20 2010-02-25 Thomson Licensing Hypothetical reference decoder
JP5072893B2 (en) * 2009-03-25 2012-11-14 株式会社東芝 Image encoding method and image decoding method
JP5267886B2 (en) 2009-04-08 2013-08-21 ソニー株式会社 REPRODUCTION DEVICE, RECORDING MEDIUM, AND INFORMATION PROCESSING METHOD
JP5614901B2 (en) 2009-05-01 2014-10-29 Thomson Licensing 3DV reference picture list
US8948241B2 (en) 2009-08-07 2015-02-03 Qualcomm Incorporated Signaling characteristics of an MVC operation point
CN103119934B (en) 2010-07-20 2017-02-22 诺基亚技术有限公司 A media streaming apparatus
US9131033B2 (en) 2010-07-20 2015-09-08 Qualcomm Incorporated Providing sequence data sets for streaming video data
US9716920B2 (en) * 2010-08-05 2017-07-25 Qualcomm Incorporated Signaling attributes for network-streamed video data
US9635355B2 (en) 2011-07-28 2017-04-25 Qualcomm Incorporated Multiview video coding
US10244257B2 (en) 2011-08-31 2019-03-26 Nokia Technologies Oy Video coding and decoding
US9591361B2 (en) 2011-09-07 2017-03-07 Qualcomm Incorporated Streaming of multimedia data from multiple sources
US9998726B2 (en) * 2012-06-20 2018-06-12 Nokia Technologies Oy Apparatus, a method and a computer program for video coding and decoding
US9351005B2 (en) 2012-09-24 2016-05-24 Qualcomm Incorporated Bitstream conformance test in video coding
US8989508B2 (en) * 2012-09-28 2015-03-24 Sharp Kabushiki Kaisha Electronic device for signaling a sub-picture buffer parameter
US9380317B2 (en) 2012-10-08 2016-06-28 Qualcomm Incorporated Identification of operation points applicable to nested SEI message in video coding

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
《BoG on high-level syntax for extension planning》;Ye-Kui Wang;《Joint Collaborative Team on 3D Video Coding Extension Development》;20120720;Section2.2-2.4 *
《Proposed Editorial Improvements for high efficiency video coding(HEVC) Text Specification Draft 8》;Benjamin Bross;《Joint Collaborative Team on Video Coding (JCT-VC)》;20120912;Section7.3-7.4以及SectionE.1.2 *

Also Published As

Publication number Publication date
JP6077124B2 (en) 2017-02-08
KR20150067319A (en) 2015-06-17
SG11201501833PA (en) 2015-05-28
CN104685892B (en) 2019-02-22
CN104685891B (en) 2018-07-20
KR101677867B1 (en) 2016-11-18
BR112015007761A2 (en) 2017-07-04
HK1209550A1 (en) 2016-04-01
US20140098894A1 (en) 2014-04-10
EP2904787A1 (en) 2015-08-12
MY168739A (en) 2018-11-29
KR101697886B1 (en) 2017-01-18
BR112015007763B1 (en) 2022-08-02
JP2015532551A (en) 2015-11-09
RU2015117436A (en) 2016-11-27
AR094449A1 (en) 2015-08-05
TWI565299B (en) 2017-01-01
MA37971B1 (en) 2016-10-31
PH12015500742A1 (en) 2015-05-25
AU2013330284B2 (en) 2017-06-08
RU2015117520A (en) 2016-11-27
HUE043479T2 (en) 2019-08-28
CO7350643A2 (en) 2015-08-10
US9154785B2 (en) 2015-10-06
MX341437B (en) 2016-08-18
HK1207775A1 (en) 2016-02-05
ECSP15018127A (en) 2015-12-31
SA515360254B1 (en) 2016-08-15
CA2885670C (en) 2017-07-18
SG11201501832UA (en) 2015-05-28
RU2633100C2 (en) 2017-10-11
AU2013330284A1 (en) 2015-04-23
BR112015007763A2 (en) 2017-07-04
US20140098895A1 (en) 2014-04-10
EP2904782A1 (en) 2015-08-12
CN104685891A (en) 2015-06-03
SI2904782T1 (en) 2018-05-31
EP2904784A1 (en) 2015-08-12
PH12015500625B1 (en) 2015-05-11
AU2013330372B2 (en) 2017-07-06
JP2015536114A (en) 2015-12-17
US20140098896A1 (en) 2014-04-10
JP6062558B2 (en) 2017-01-18
TW201429256A (en) 2014-07-16
ES2727814T3 (en) 2019-10-18
WO2014058599A1 (en) 2014-04-17
MY172252A (en) 2019-11-20
HUE038492T2 (en) 2018-10-29
PH12015500625A1 (en) 2015-05-11
UA116998C2 (en) 2018-06-11
AU2013330372A1 (en) 2015-04-16
CA2885670A1 (en) 2014-04-17
US9319703B2 (en) 2016-04-19
CA2885807C (en) 2018-04-03
WO2014058598A1 (en) 2014-04-17
DK2904782T3 (en) 2018-04-30
IL237949B (en) 2019-10-31
AP2015008363A0 (en) 2015-04-30
CA2885807A1 (en) 2014-04-17
TW201436536A (en) 2014-09-16
ZA201502498B (en) 2017-09-27
RU2643463C2 (en) 2018-02-01
KR20150056877A (en) 2015-05-27
AR093288A1 (en) 2015-05-27
IL237949A0 (en) 2015-05-31
JP6013614B2 (en) 2016-10-25
TWI533674B (en) 2016-05-11
EP2904787B1 (en) 2019-02-27
MX2015004383A (en) 2015-06-10
US9380317B2 (en) 2016-06-28
CN104704842A (en) 2015-06-10
ES2663692T3 (en) 2018-04-16
EP2904782B1 (en) 2018-01-17
CN104685892A (en) 2015-06-03
MA37971A1 (en) 2016-03-31
JP2015537420A (en) 2015-12-24
WO2014058600A1 (en) 2014-04-17
AP3952A (en) 2016-12-21
BR112015007761B1 (en) 2022-08-09
PT2904782T (en) 2018-04-23
IL237948A (en) 2016-11-30
PH12015500742B1 (en) 2015-05-25
UA116363C2 (en) 2018-03-12
EP2904784B1 (en) 2018-12-19
KR20150067318A (en) 2015-06-17
KR101719935B1 (en) 2017-03-24

Similar Documents

Publication Publication Date Title
CN104704842B (en) The syntactic structure of hypothetical reference decoder parameter
CN105612752B (en) Support of multi-mode extraction for multi-layer video codecs
CN104054347B (en) Indication of use of wavefront parallel processing in video coding
CN104919802B (en) Non-nested supplemental enhancement information messages in video coding
CN104813671B (en) Method and apparatus for processing video data
CN104365105B (en) External pictures in video coding
CN104471943B (en) Parameter sets in video coding
CN104704829B (en) Method and apparatus for processing video data, and computer-readable storage medium
CN106464936A (en) Method and device for decoding multi-layer video data by determining the processing core of the decoder based on partition containing one or more layers
CN108141616A (en) Parameter set signaling in files of multi-layer bitstreams
CN104471942A (en) Reusing Parameter Sets For Video Coding

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code (Ref country code: HK; Ref legal event code: DE; Ref document number: 1209550; Country of ref document: HK)

GR01 Patent grant
REG Reference to a national code (Ref country code: HK; Ref legal event code: GR; Ref document number: 1209550; Country of ref document: HK)