CN105122816A - Decoding of inter-layer reference picture set and reference picture list construction - Google Patents

Decoding of inter-layer reference picture set and reference picture list construction Download PDF

Info

Publication number
CN105122816A
CN105122816A CN201480020037.9A CN201480020037A CN105122816A CN 105122816 A CN105122816 A CN 105122816A CN 201480020037 A CN201480020037 A CN 201480020037A CN 105122816 A CN105122816 A CN 105122816A
Authority
CN
China
Prior art keywords
layer
image
inter
sublayer
maximum quantity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201480020037.9A
Other languages
Chinese (zh)
Inventor
萨琴·G·德希潘德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US13/857,990 external-priority patent/US9532067B2/en
Application filed by Sharp Corp filed Critical Sharp Corp
Publication of CN105122816A publication Critical patent/CN105122816A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/187Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • H04N19/463Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission

Abstract

The invention relates to decoding of inter-layer reference picture set and reference picture list construction. A method for video coding is described. Signaling of a maximum number of sub-layers for inter-layer prediction is obtained. A sub-layer non-reference picture is also obtained. It is determined whether a value of a temporal identifier of the sub-layer non-reference picture is greater than the maximum number of sub-layers for inter-layer prediction minus 1. The sub-layer non-reference picture is marked as "unused for reference" if the value of the temporal identifier of the sub-layer non-reference picture is greater than the maximum number of sub-layers for inter-layer prediction minus 1. In some cases a sub-layer non-reference picture is also obtained. It is determined whether a value of a temporal identifier of the sub-layer non-reference picture is greater than the maximum number of sub-layers for inter-layer prediction. The sub-layer non-reference picture is marked as "unused for reference" if the value of the temporal identifier of the sub-layer non-reference picture is greater than the maximum number of sub-layers for inter-layer prediction.

Description

The decoding of inter-layer reference image set and reference picture list build
Technical field
The disclosure relates in general to electronic equipment.More specifically, the disclosure relates to the decode system of inter-layer reference image set and method and reference picture list and builds.
Background technology
The more and more less and function of electronic equipment from strength to strength, to satisfy the demands of consumers and to provide portability and convenience.Consumer becomes and depends on electronic equipment and require more function.Some examples of electronic equipment comprise: desktop computer, laptop computer, cellular phone, smart phone, media player, integrated circuit etc.
Some electronic equipments for the treatment of with display Digital Media.Such as, nowadays portable electric appts allows almost in any position consume digital media that consumer can go.In addition, some electronic equipments can provide download or the flow transmission of digital media content, use and amusement for consumer.
Being widely current of Digital Media proposes problems.Such as, for storing, sending and playback, represent that high quality digital media proposes lot of challenges efficiently.Can find out according to this discussion, more efficiently the system and method for representative digit media may be useful.
Summary of the invention
By reference to the accompanying drawings, consider following specific descriptions of the present invention, above-mentioned and other object of the present invention, feature, advantage will be easier to understand.
One embodiment of the present of invention disclose a kind of method for video coding, comprising: the signaling (signaling) obtaining the maximum quantity of inter-layer prediction sublayer; Start the decoding process for inter-layer reference image set (RPS); Obtain the value of the time identifier of interlayer image; Determine that the maximum quantity whether value of time identifier is greater than inter-layer prediction sublayer subtracts 1; And if the maximum quantity that the value of the time identifier of interlayer image is not more than inter-layer prediction sublayer subtracts 1, then added in inter-layer reference image set (RPS) by interlayer image.
An alternative embodiment of the invention discloses a kind of electronic equipment being arranged to Video coding, comprising: processor; With the memory of processor telecommunication, store instruction in which memory and can be performed with the signaling that obtains the maximum quantity of inter-layer prediction sublayer; Start the decoding process for inter-layer reference image set (RPS); Obtain the value of the time identifier of interlayer image; Determine that the maximum quantity whether value of time identifier is greater than inter-layer prediction sublayer subtracts 1; And if the maximum quantity that the value of the time identifier of interlayer image is not more than inter-layer prediction sublayer subtracts 1, then added in inter-layer reference image set (RPS) by interlayer image.
Accompanying drawing explanation
Fig. 1 is the block diagram of the Video coding illustrated between multiple electronic equipment.
Fig. 2 is the block diagram of the image tagged module used in the system and method for the present invention.
Fig. 3 is the flow chart of the method for marking sublayer non-reference picture.
Fig. 4 illustrates the block diagram being marked as the additional image of " being not used in reference " using system and method for the present invention.
Fig. 5 is the block diagram that inter-layer reference image set (RPS) update module is shown.
Fig. 6 is the flow chart for the method for reference diagram image set (RPS) (RefPicSetInterLayer) between update step.
Fig. 7 is the block diagram of a kind of configuration of the encoder illustrated on electronic equipment.
Fig. 8 is the block diagram of a kind of configuration of the decoder illustrated on electronic equipment.
Fig. 9 shows and can send each assembly used in electronic equipment.
Figure 10 is the block diagram that each assembly that can use in reception electronic equipment is shown.
Embodiment
Disclose a kind of method for video coding.Obtain the signaling of the maximum quantity of inter-layer prediction sublayer.Start the decoding process for inter-layer reference image set (RPS).Obtain the value of the time identifier of interlayer image.Determine that the maximum quantity whether value of time identifier is greater than inter-layer prediction sublayer subtracts 1.If the maximum quantity that the value of the time identifier of the interlayer image of the time identifier of interlayer image is not more than inter-layer prediction sublayer subtracts 1, then add interlayer image to inter-layer reference image set (RPS).
Inter-layer reference image set (RPS) may be used for reference picture list and builds.Interlayer image can comprise the value of the layer identifier corresponding with the layer of the direct reference layer as current layer.Interlayer image can comprise the picture numbers equal with the picture numbers of present image.If the value of the maximum quantity of inter-layer prediction sublayer is 0, and interlayer image is non-RAP (random access points) image, then can not add interlayer image to inter-layer reference image set (RPS).If the value of the maximum quantity of inter-layer prediction sublayer is 0, and interlayer image is RAP (random access points) image, then interlayer image can be added in inter-layer reference image set (RPS).
The maximum quantity of inter-layer prediction sublayer can have index [LayderIdInVps [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]]].Alternatively, the maximum quantity of inter-layer prediction sublayer can have index [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]].The maximum quantity of inter-layer prediction sublayer can also have index [layer_id_in_nuh [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]]].The maximum quantity of inter-layer prediction sublayer can also have index [layer_id_in_nuh [i]].
The maximum quantity of inter-layer prediction sublayer can also have index [i].Inter-layer reference image list RefPicSetInterLayer can have index [NumInterLayerRPSPics [LayerIdInVps [nuh_layer_id]]].NumInterLayerRPSPics [LayerIdInVps [nuh_layer_id]] can be derived as different from NumDirectRefLayers [LayerIdInVps [nuh_layer_id]].
Also disclose the electronic equipment being arranged to video decode.Described electronic equipment comprises processor and the memory with processor telecommunication.The instruction stored in memory can be performed, to obtain the signaling of the maximum quantity of inter-layer prediction sublayer.The instruction stored in memory can also be performed, to start the decoding process for inter-layer reference image set (RPS).The instruction stored in memory can also be performed, to obtain the value of the time identifier of interlayer image.Store instruction in memory can also be performed, to determine that the maximum quantity whether value of time identifier is greater than inter-layer prediction sublayer subtracts 1.The instruction stored in memory can also be performed, if the maximum quantity that the value of the time identifier of interlayer image is not more than inter-layer prediction sublayer subtracts 1, is then added in inter-layer reference image set (RPS) by interlayer image.
With reference now to accompanying drawing, describe each configuration, wherein similar Reference numeral can the similar element of deixis.In this paper accompanying drawing, various different configuration widely can be arranged and be designed to the general system and method described and illustrate.Therefore, below the detailed description of the multiple configurations shown in accompanying drawing is not intended to limit to the scope of claim, only as the signal of system and method.
Fig. 1 is the block diagram of the Video coding illustrated between multiple electronic equipment 102a-b.Show the first electronic equipment 102a and the second electronic equipment 102b.However, it is noted that in some configurations, the one or more Characteristic and function described in conjunction with the first electronic equipment 102a and the second electronic equipment 102b can be combined in Single Electron equipment 102.Each electronic equipment 102 can be configured to encode to video and/or decode.In one configuration, each electronic equipment can meet efficient video coding (HEVC) standard.HEVC is the follow-up video compression standard of H.264/MPEG-4AVC (advanced video coding), and it provides the data compression rate of better video quality and Geng Gao.The electronic equipment 102 meeting HEVC standard can comprise additional image mark function, inter-layer reference image set (RPS) (RefPicSetInterLayer) 120 more New function and reference picture list constructing function.Herein, image is the array of the array chroma sample corresponding to two of the array of brightness (luma) sample of monochrome format or the luma samples of 4: 2: 0,4: 2: 2 and 4: 4: 4 color format.
First electronic equipment 102a can comprise video encoder 182, and video encoder 182 comprises enhancement layer encoder 106 and base layer encoder 109.Hereafter discuss composition graphs 7 in detail enhancement layer encoder 106 and base layer encoder 109.Each element (i.e. enhancement layer encoder 106 and base layer encoder 109) that first electronic equipment 102a comprises can be implemented as hardware, software or the combination of the two.First electronic equipment 102a can obtain input picture 104.In some configurations, input picture 104 can use imageing sensor to catch on the first electronic equipment 102a, or obtains from memory, or receives from another electronic equipment 102.In one configuration, video encoder 182 can meet scalable efficient video standard (SHVC) or multi views efficient video coding standard (MV-HEVC).
Enhancement layer encoder 106 can be encoded to input picture 104, to produce coded data.Such as, enhancement layer encoder 106 can be encoded to input picture 104 sequence (as video).In one configuration, enhancement layer encoder 106 can be efficient video coding (HEVC) encoder.In another arrangement, enhancement layer encoder 106 can be scalable efficient video standard (SHVC) encoder or multi views efficient video coding standard (MV-HEVC) encoder.Coded data can be included in the enhancement layer video bit-streams 110 of coding.Enhancement layer encoder 106 can generate expense (overhead) signaling based on input picture 104.
Base layer encoder 109 also can be encoded to input picture 104.In one configuration, base layer encoder 109 can use the identical input picture 104 used with enhancement layer encoder 106.In another arrangement, base layer encoder 109 can use the input picture of different from the input picture 104 that enhancement layer encoder 106 uses (but similar).Such as, for signal to noise ratio (SNR) scalability (also referred to as quality scalability), enhancement layer encoder 106 and base layer encoder 109 can use identical input picture 104.As another example, for spatial scalability, base layer encoder 109 can use down and adopt image.In another example, for multi views scalability, base layer encoder 109 can use different view image.Base layer encoder 109 can produce the coded data be included in the base layer video bit-stream 107 of coding.Base layer encoder 109 also can be scalable efficient video (SHVC) encoder or multi views efficient video coding (MV-HEVC) encoder.
The enhancement layer video bit-streams 110 of coding and the base layer video bit-stream 107 of coding each can comprise coded data based on input picture 104.In one example, the enhancement layer video bit-streams 110 of coding and the base layer video bit-stream 107 of coding can comprise coded image data.In some configurations, the enhancement layer video bit-streams 110 of coding and/or the base layer video bit-stream 107 of coding can also comprise overhead data, such as sequence parameter set (SPS) information, picture parameter set (PPS) information, video parameter collection (VPS) information, slice header information etc.
First electronic equipment 102a can to the maximum quantity (max_sublayer_for_ilp_plus1) 108 predicting sublayer between the second electronic equipment 102b providing layer.The maximum quantity (max_sublayer_for_ilp_plus1) 108 of inter-layer prediction sublayer can notify (that is, video parameter collection raw bytes net load (RBSP) being used in the F.7.4.3.1 part definition of JCTVC-L1008 is semantic) by signal in VPS extension syntax structure.
The maximum quantity (max_sublayer_for_ilp_plus1) 108 of inter-layer prediction sublayer can be notified by signal in the base layer video bit-stream 107 of coding or the enhancement layer video bit-streams 110 of encoding.In one configuration, can to the maximum quantity (max_sublayer_for_ilp_plus1) 108 predicting sublayer between electronic equipment 102b providing layer in overhead data (such as sequence parameter set (SPS) information, picture parameter set (PPS) information, video parameter collection (VPS) information, slice header information etc.).In another arrangement, can be separated in " metadata " bit stream or file to the maximum quantity (max_sublayer_for_ilp_plus1) 108 predicting sublayer between the second electronic equipment 102b providing layer.
It is " being not used in reference " that second electronic equipment 102b can use the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 108 to determine whether image tagged.Second electronic equipment 102b can also use the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 108 to add image to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.RefPicSetInterLayer can refer to inter-layer reference image list.Second electronic equipment 102b can also use the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 108 to build reference picture list (RefPicList0RefPicList1).
The enhancement layer video bit-streams 110 of coding can be provided to the second electronic equipment 102b.Similarly, the base layer video bit-stream 107 of coding can be provided to the second electronic equipment 102b.Second electronic equipment 102b can comprise Video Decoder 112 and base layer decoder 113.Video Decoder 112 can comprise enhancement layer decoder 115.In one configuration, while the enhancement layer video bit-streams 110 of enhancement layer decoder 115 to coding is decoded, the base layer video bit-stream 107 of base layer decoder 113 to coding is decoded.Hereafter discuss composition graphs 8 in detail base layer decoder 113 and enhancement layer decoder 115 further.In one configuration, Video Decoder 112 can meet scalable efficient video coding (SHVC) standard.In another arrangement, Video Decoder 112 can meet multi views efficient video coding (MV-HEVC) standard.Base layer decoder 113 and enhancement layer decoder 115 each can be efficient video coding (HEVC) decoder.Base layer decoder 112 and enhancement layer decoder 115 can also be scalable efficient video coding (SHVC) decoder or multi views efficient video coding (MV-HEVC) decoder.
In one example, wired or wireless chain road direction second electronic equipment 102b can be used to send the enhancement layer video bit-streams 110 of coding and the base layer video bit-stream 107 of coding.In some cases, this can complete on network, and network is such as the Internet, local area network (LAN) (LAN) or the network of other types for carrying out communicating at equipment room.Should be noted that, in some configurations, encoder (namely, enhancement layer encoder 106 and base layer encoder 109) and decoder is (namely, Video Decoder 112, base layer decoder 113 and enhancement layer decoder 115) can be implemented on identical electronic equipment 102 (that is, the first electronic equipment 102a and the second electronic equipment 102b can be a part for Single Electron equipment 102).Such as, encoder is being realized in the situation of same electronic device 102, the enhancement layer video bit-streams 110 of coding and base layer video bit-stream 107 pairs of Video Decoders 112 of coding can be able to used by various mode.Such as, the enhancement layer video bit-streams 110 of coding and the base layer video bit-stream 107 of coding can be provided to Video Decoder 112 by bus, or storage obtains for Video Decoder 112 in memory.
Video Decoder 112 (such as base layer decoder 113 and enhancement layer decoder 115) can be implemented as hardware, software or the combination of the two.In one configuration, Video Decoder 112 can be HEVC decoder.The enhancement layer video bit-streams 110 that Video Decoder 112 can obtain (such as receiving) encodes and the base layer video bit-stream 107 of coding.Then, Video Decoder 112 can generate one or more decoded picture 116 based on the base layer video bit-stream 107 of the enhancement layer video bit-streams 110 of coding and coding.Decoded picture 116 can be shown, playback, be stored in memory and/or send to another equipment etc.
Video Decoder 112 can comprise image tagged module 114.Some image tagged can be " being not used in reference " by image tagged module 114.The image being labeled as " being not used in reference " will be not used as between image or the reference picture of inter-layer prediction.Advantage additional image being labeled as " being not used in reference " to reduce decoded picture buffer (DPB) size/memory.Hereinafter composition graphs 2-4 discusses image tagged module 114 further in detail.
Video Decoder 112 can also comprise inter-layer reference image set (RPS) update module 118.Video Decoder 112 can use inter-layer reference image set (RPS) update module 118 to carry out reference diagram image set (RPS) (RefPicSetInterLayer) 120 between update step.Such as, inter-layer reference image set (RPS) update module 118 can use the signaling of the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 108 to determine whether interlayer image to add in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.Hereinafter composition graphs 5-6 discusses inter-layer reference image set (RPS) update module 118 further in detail.
In some configurations, the second electronic equipment 102b can export decoded picture 116.In one example, decoded picture 116 can be sent to another equipment or send it back the first electronic equipment 102a.Decoded picture 116 can store or otherwise remain on the second electronic equipment 102b.In another example, the second electronic equipment 102b can show decoded picture 116.In other configurations, based on the coding performed bit stream 110 and other operations, decoded picture 116 can comprise the element of the input picture 104 with different attribute.In some configurations, decoded picture 116 can be included in the image stream with the resolution different from input picture 104, form, technical specification or other character.
By intermediate equipment (not shown), bit stream 110 can be relayed to the second electronic equipment 102b from the first electronic equipment 102a.Such as, intermediate equipment can receive bit stream 110 from the first electronic equipment 102a, and bit stream 110 is relayed to the second electronic equipment 102b.
It should be noted that one or more element of comprising in the electronic device 102 or parts can be implemented as hardware.Such as, these elements or the one or more of parts can be implemented as chip, circuit or nextport hardware component NextPort etc.Function as herein described or method can be implemented as hardware and/or use hardware to perform.Such as, one or more method as herein described can use chipset, application specific integrated circuit (ASIC), large scale integrated circuit (LSI) or integrated circuit etc. to realize and/or realize wherein.
Fig. 2 is the block diagram of the image tagged module 214 used in the system and method for the present invention.The image tagged module 214 of Fig. 2 can be the one configuration of the image tagged module 114 of Fig. 1.Image tagged module 214 can be a part for the Video Decoder 112 on electronic equipment 102.
Image tagged module 214 can comprise sublayer non-reference picture 222.Herein, the time subset of scalability layer is not called layer, but is called sublayer or time sublayer.Sublayer is the time scalability layer of time scalable bit stream, comprises video coding layer (VCL) network abstract layer (NAL) unit with time identifier particular value and the non-VCLNAL unit be associated.Sublayer non-reference picture 222 is images of the sample of the inter-layer prediction comprised in the decoding process of the successive image that can not be used for identical sublayer in decoding order.The sample of sublayer non-reference picture 222 may be used for the inter-layer prediction in the decoding process of the successive image of higher sublayer in decoding order.
Sublayer non-reference picture 222 can be received from the first electronic equipment 102a via bit stream 110.Each sublayer non-reference picture 222 can comprise time identifier (TemporalId) 224.Image tagged module 214 can also comprise the maximum quantity (max_sublayer_for_ilp_plus1_208 of inter-layer prediction sublayer.In some cases, the maximum quantity (max_sublayer_for_ilp_plus1) 208 of inter-layer prediction sublayer equals max_sublayer_for_ilp_plus1-1, and is not equal to max_sublayer_for_ilp_plus1.In addition, compare syntax and semantics as herein described, by adding plus1 or plus2 (adding) or adding minus1 or minus2 (subtracting), various syntactic element and semanteme thereof can be changed.For each layer, a value of the maximum quantity (max_sublayer_for_ilp_plus1) 208 of sublayer can be predicted between transmit layer.Thus, max_sublayer_for_ilp_plus1 [i] scope is from 0 to vps_max_layers_minus1.JCTVC_L0449 defines for the signal notice sublayer of inter-layer prediction and the syntax and semantics of the use of Stochastic accessing dot image, as with following table 1:
Table 1
Herein, Stochastic accessing refers to the action starting bit stream decoding process at the point being different from stream starting point.This decoding can start at random access points (RAP) image place.Non-RAP image refers to the image of nonrandom access point (RAP) image.In some cases, RAP image alternatively can be called internal random access point image (IRAP).Similarly, at this moment non-RAP image can be called non-IRAP image.Max_sublayer_for_ilp_plus1 [i] equals 0 instruction, and the non-RAP image that layer identifier (nuh_layer_id) 236 equals layer identifier syntax element value layer_id_in_nuh [i] 226 is not used as the reference of inter-layer prediction.Max_sublayer_for_ilp_plus1 [i] is greater than 0 instruction, and layer identifier (nuh_layer_id) 236 equals layer identifier syntax element value layer_id_in_nuh [i] 226 and the reference of time identifier (TemporalId) 224 is greater than the image of max_sublayer_for_ilp_plus1 [i]-1 it goes without doing inter-layer prediction.When there is not maximum quantity (max_sublayer_for_ilp_plus1) of inter-layer prediction sublayer, maximum quantity (max_sublayer_for_ilp_plus1 [i]) 208 of inter-layer prediction sublayer is indefinite.
In another embodiment, as i < vps_max_layers_minus1, the circulation of signal notice max_sublayer_for_ilp_plus1 [i] can terminate, as follows:
for(i=0;i<vps_max_layers_minus1;i++){
max_sublayer_for_ilp_plus1[i] u(3)
}
The mark process to sublayer non-reference picture 222 is described in JCTVC-L1008, JCTVC-L0452 and JCTVC-L0453.But this mark process does not use the signaling of the maximum quantity (max_sublayer_for_ilp_plus1) 208 of inter-layer prediction sublayer.Be used for the new method of the image of inter-layer prediction by usage flag, can benefit be realized.When usage flag is used for the new method of the image of inter-layer prediction, use based on the sublayer of each layer of inter-layer prediction and the precedence categories of RAP image, the sublayer non-reference picture 222 of destination layer can be labeled as " being not used in reference ".
The decoding process defined is provided in as JCTVC-L1008 below in F.8 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" process of F.8 decoding "
" process of F.8.1 generally decoding "
For illustrating in sub-clause 8.1, apply following additional content.
When present image has the nuh_layer_id being greater than 0, apply following content.
-according to the value of separate_colour_plane_flag, build process of decoding as follows:
If-separate_colour_plane_flag equals 0, then call once following process of decoding, and using present image as output.
-otherwise (separate_colour_plane_flag equals 1), call process of decoding for less than three times.The input of this decoding process is all NAL unit of the coded image with identical colour_plane_id value.Specify the decoding process to the NAL unit with particular value colour_plane_id value, make only there is the CVS with the monochrome format of this specific colour_plane_id value in the bitstream.One that in three decoding process, output of each process is assigned in three sampling arrays of present image, the NAL unit that colour_plane_id equals 0 is assigned to S l, the NAL unit that colour_plane_id equals 0,1 and 2 is assigned to S l, S cband S cr.
Note-when separate_colour_plane_flag equals 1 and chroma_format_idc equals 3, variable ChromaArrayType is derived as 0.In decoding process, obtain this variate-value, cause the operation (when chroma_format_idc equal 0) identical with monochrome image
--carry out following decode operation for present image CurrPic.
-for present image in decoding order first section the decoding of cutting slice header, be invoked at sub-clause F.8.1.1 in specify for starting the decoding process of decoding to the coded image that nuh_layer_id is greater than 0.
If-ViewId [nuh_layer_id] is greater than 0, be then invoked at sub-clause G.8.1 in the decoding process being greater than the coded image of 0 for nuh_layer_id of specifying.
-otherwise, when DependencyId [nuh_layer_id] is greater than 0, be invoked at the decoding process being greater than the coded image of 0 for nuh_layer_id of specifying in sub-clause X.X.X.
-after all sections of current Graphics are all decoded, be invoked at sub-clause F.8.1.2 in the decoding process for terminating to be greater than nuh_layer_id the decoding of the coded image of 0 of specifying.
" F.8.1.1 for starting the decoding process of decoding to the coded image that nuh_layer_id is greater than 0 "
The each image related in this sub-clause is complete coded image.
For present image CurrPic, carry out following decode operation:
1. in sub-clause 8.2, specify the decoding to NAL unit.
2. the process in sub-clause 8.3 specifies following use to cut the decoding process of slice layer and syntactic element above:
-deriving in sub-clause 8.3.1 relates to the variable sum functions of image sequence number.This is only called when the first section section for image.The conforming PicOrderCntVal of requirement of bit stream should remain unchanged in addressed location.
-for the equal image of nuh_layer_id and CurrPic, be invoked at the decoding process for RPS performed in sub-clause 8.3.2, wherein reference picture can be labeled as " being not used in reference " or " for long term reference ".This only needs called when the first section section for image.
-when CurrPic be BLA image or NoRaslOutputFlag equal the CRA image of 1 time, be invoked at the decoding process for generating unavailable reference picture of specifying in sub-clause 8.3.3, this is only called when the first section section for image.
" F.8.1.2 for terminating decoding process nuh_layer_id being greater than to the decoding of the coded image of 0 "
PicOutputFlag arranges as follows:
If-present image is RASL image and the NoRaslOutputFlag associating IRAP image equals 1, PicOutputFlag is set to 0.
-otherwise, PicOutputFlag is set to pic_output_flag.
Apply following content:
-decoded picture is labeled as " for short term reference ".
-when TemporalId equals HighestTid, be invoked at sub-clause F.8.1.2.1 in specify for inter-layer prediction the mark process of unwanted sublayer non-reference picture, and the latestDecLayerId equaling nuh_layer_id is as input.
" F.8.1.2.1 for inter-layer prediction the mark process of unwanted sublayer non-reference picture "
The input of this process is:
The latestDecLayerId of-nuh_layer_id value
The output of this process is:
-likely upgrade ground some decoded pictures are labeled as " being not used in reference "
Attention-this process by between image or inter-layer prediction institute unwanted picture be labeled as " being not used in reference ".When TemporalId is less than HighestTid, present image can be used for the reference of inter picture prediction, and can never call this process.
TargetDecLayerIdList, numTargetDecLayers and latestDecIdx are as follows for derivation variable:
Designated layer identifier list TargetDecLayerIdList in the following way, described layer identifier list TargetDecLayerIdList specifies the list of the nuh_layer_id value of decoded NAL unit according to the ascending order of nuh_layer_id value:
If-certain external device (ED) can be used for arranging TargetDecLayerIdList, then TargetDecLayerIdList is set by this external device (ED).
-otherwise, if call decoding process in bit stream uniformity test, then correspondingly TargetDecLayerIdList is set.
-otherwise TargetDecLayerIdList only comprises the nuh_layer_id that equals 0.
-numTargetDecLayers is set to the quantity of the entry equaled in TargetDecLayerIdList.
-the i that equals latestDecLayerId for TargetDecLayerIdList [i], is set to the value of i by latestDecIdx.
As mentioned above, separate_colour_plane_flag refers to the mark indicated the quantity in the seperate color face for coded image.Term colour_plane_id refers to the identifier of color component.Term ChromaArrayType refers to the type of chrominance arrays.Term S l, S cband S crrefer to sampling array.Term NAL refers to network abstract layer (NAL).Term PicOrderCntVal refers to the image sequence number of present image.CurrPic refers to present image.Whether term NoRaslOutputFlag refers to and is used to indicate Stochastic accessing jump leading (randomaccessskippedleading) (RASL) image is the mark exporting (and whether this image can be correctly decoded).Term pic_output_flag refers in association slice header may exist syntactic element.Term TargetDecLayerIdList is the ascending order according to nuh_layer_id value, specifies the layer identifier list of the nuh_layer_id value list of decoded NAL unit.Term NumNegativePics specifies the quantity of the entry of the image sequence number value in stRpsIdx candidate's short-term reference picture collection (RPS) with the image sequence number value being greater than present image.
Whether term UsedByCurrPicS0 specifies in stRpsIdx candidate short-term RPS i-th entry of the image sequence number value with the image sequence number value being less than present image for the reference of present image.Whether term UsedByCurrPicS1 specifies i-th entry in current candidate short-term RPS with the image sequence number value of the image sequence number value being greater than present image for the reference of present image.
Term num_long_term_sps specifies the quantity of the entry in the long-term RPS derived based on candidate's long term reference image of specifying in activation SPS in present image.Term num_long_term_pics specifies the quantity of the entry in the long-term RPS of direct signal notice in slice header in present image.Whether term UsedByCurrPicLt specifies i-th entry in the long-term RPS of present image for the reference of present image.
Layer_id_in_nuh [i] specifies the value of the syntactic element nuh_layer_id in the VCLNAL unit of i-th layer.When not existing, infer that the value of layer_id_in_nuh [i] equals i.Variables L ayerIdInVps [layer_id_in_nuh [i]] is set to equal i.The direct reference layer of direct_dependency_flag [i] [j] assigned indexes equaling 0 is the layer of j not to be index the be layer of i.The direct reference layer of variable direct_dependency_flag [i] [j] assigned indexes equaling 1 is the layer of j can be index the be layer of i.For i and j within the scope of 0 to vps_max_layers_minus1, when direct_dependency_flag [i] [j] does not exist, infer that direct_dependency_flag [i] [j] equals 0.Derivation variable NumDirectRefLayers [i] and RefLayerId [i] [j], as follows:
for(i=1;i<=vps_max_layers_minus1;i++)
for(j=0,NumDirectRefLayers[i]=0;j<i;j++)
if(direct_dependency_flag[i][j]==1)
RefLayerId[i][NumDirectRefLayers[i]++]=layer_id_in_nuh[j]
There is the dimension_id syntactic element corresponding with i-th dimension of scalability shown in table " scalability Id is to the mapping of dimension of scalability " (MappingofScalabiltyIdtoscalabilitydimensions) in scalability_mask [i] instruction equaling 1.There is not the dimension_id syntactic element corresponding with i-th dimension of scalability in variable scalability_mask [i] instruction equaling 0.The mapping of scalability ld to dimension of scalability is provided with following table F-1:
Table F-1: scalability Id is to the mapping of dimension of scalability
Scalability_Task index Dimension of scalability Scalability Id maps
0 Multi views View Id
1 Space/SNR scalability Dependence Id
2-15 Reserved
Table F-1
Dimension_id [i] [j] specifies the identifier of the jth dimension of scalability type existed in i-th layer.When not existing, infer that the value of dimension_id [i] [j] is for equaling 0.Quantity for the position of the expression of dimension_id [i] [j] is dimension_id_len_minus1 [j]+1.When splitting_flag equals 1, bit stream coherence request dimension_id [i] [j] should equal ((layer_id_in_nuh [i] & ((1 < < dimBitOffset [j+1])-1)) > > dimBitOffset [j]).
Variable ScalabilityId [i] [smIdx] specifies the identifier of smldx dimension of scalability type in i-th layer, variable V iewId [layer_id_in_nuh [i]] specifies the view identifier of i-th layer, variables D ependencyId [layer_id_in_nuh [i]] specifies the space/SNR scalability identifier of i-th layer, and above-mentioned variable of deriving is as follows:
HighestTid is the highest time identifier (TemporalId) existed in bit stream.PicOutputFlag is based on image type (such as, whether image is that Stochastic accessing jumps leading image) and the variable that arranges based on the syntactic element pic_output_flag of signal notice.
In one configuration, F.8.1.2.1 part can comprise language image 222 being labeled as " being not used in reference " of table 2.
Table 2
In table 2, each sublayer non-reference picture 222 has the time identifier (TemporalId) 224 of definition.By the time identifier (TemporalId) 224 of image 222 compared with the maximum quantity (max_sublayer_for_ilp_plus1) 208 of the inter-layer prediction sublayer of designated layer (i.e. LayerIdInVps [TargetDecLayerIdList [i]]).Therefore, the index of max_sublayer_for_ilp_plus1 is LayerIdInVps [TargetDecLayerIdList [i]].If the time identifier of image 222 (TemporalId) 224 is greater than the max_sublayer_for_ilp_plus1208-1 of designated layer, then this image 222 is labeled as " being not used in reference ".In table 2, TargetDecLayerIdList refers to destination layer identifier list.Therefore, at marking phase, even if image belongs to be used as the layer of reference layer by layer any in destination layer identifier list, be also " being not used in reference " by this image tagged.
In another arrangement, F.8.1.2.1 part can comprise language image 222 being labeled as " being not used in reference " of table 3.
Table 3
Similar with table 2, in table 3, each image 222 has the time identifier (TemporalId) 224 of definition.But, in table 3, by the time identifier (TemporalId) of image 222 compared with max_sublayer_for_ilp_plus1 [i].If the time identifier of image 222 (TemporalId) is greater than max_sublayer_for_ilp_plus1 [i]-1, then image 222 is labeled as " being not used in reference ".In table 3, TargetDecLayerIdList refers to destination layer identifier list.Therefore, at marking phase, even if image belongs to be used as the layer of reference layer by layer any in destination layer identifier list, be also " being not used in reference " by this image tagged.
In another configuration, F.8.1.2.1 part can comprise language image 222 being labeled as " being not used in reference " of table 4
Table 4
Language in table 4 and table 2 similar, difference be that table 4 does not comprise the concrete syntax about time identifier (TemporalId) 224 in markup language.In this situation, when marking image 222, additional examination is not carried out to the time identifier (TemporalId) 224 of image 222 at this one-phase.
In another arrangement, F.8.1.2.1 part can comprise language image 222 being labeled as " being not used in reference " of table 5.
Table 5
Language in table 5 and table 3 similar, difference be that table 5 does not comprise the concrete syntax about time identifier (TemporalId) 224 in markup language.In this situation, when marking image 222, additional examination is not carried out to the time identifier (TemporalId) 224 of image 222 at this one-phase.
Fig. 3 is the flow chart of the method 300 for marking sublayer non-reference picture 222.Method 300 can be performed by electronic equipment 102.In one configuration, method 300 can be performed by the Video Decoder 112 on electronic equipment 102.Electronic equipment 102 can obtain the signaling of the maximum quantity (max_sublayer_for_ilp_plus1) 208 of 302 inter-layer prediction sublayers.As mentioned above, can via bit stream 110 to the maximum quantity (max_sublayer_for_ilp_plus1) 208 predicting sublayer between electronic equipment 102 providing layer.
Electronic equipment 102 can also obtain 304 sublayer non-reference picture 222.Sublayer non-reference picture 222 can be provided to electronic equipment 102 via bit stream 110.Electronic equipment 102 can determine whether the value of the time identifier (TemporalId) 224 of 306 sublayer non-reference picture 222 is greater than maximum quantity (max_sublayer_for_ilp_plus1) 208-1 of inter-layer prediction sublayer.In one configuration, electronic equipment 102 can use the language of above-mentioned table 2, table 3, one of table 4 and table 5, by the time identifier (TemporalId) 224 of sublayer non-reference picture 222 compared with the maximum quantity (max_sublayer_for_ilp_plus1) 208 of inter-layer prediction sublayer.
If the time identifier of sublayer non-reference picture 222 (TemporalId) 224 is greater than maximum quantity (max_sublayer_for_ilp_plus1) 208-1 of inter-layer prediction sublayer, then sublayer non-reference picture 222 can be marked 308 for " being not used in reference " by electronic equipment 102, even if these images belong to the layer being used as reference layer by any layer in destination layer identifier list.Therefore, sublayer non-reference picture 222 will be not used in inter-layer prediction.If the time identifier of sublayer non-reference picture 222 (TemporalId) 224 is not more than maximum quantity (max_sublayer_for_ilp_plus1) 208-1 of inter-layer prediction sublayer, then method 300 can terminate.In other words, if the value of the time identifier of sublayer non-reference picture 222 (TemporalId) 224 is less than or equal to maximum quantity (max_sublayer_for_ilp_plus1) 208-1 of inter-layer prediction sublayer, then sublayer non-reference picture 222 is not labeled as " being not used in reference ", and if sublayer non-reference picture 222 belongs to the layer being used as reference layer by any layer in destination layer identifier list, then they can be used for inter-layer prediction.
(not shown) in another scene, can also be labeled as belonging to " being not used in reference " by the sublayer non-reference picture 222 of any layer in destination layer identifier list as the layer of reference layer.In some cases, only when the highest time identifier that the time identifier (TemporalId) 224 of sublayer non-reference picture 222 equals to exist in bit stream, the step for marking sublayer non-reference picture 222 described in Fig. 3 is just performed.
Fig. 4 is the block diagram that the additional image 432 being marked as " being not used in reference " using system and method for the present invention is shown.Employ three layers (basic unit and two enhancement layers) comprising time sublayer in the illustrated example.In the second enhancement layer EL2, by the standard of JCTVC-L1008, JCTVC-L0452 and JCTVC-L0453 definition, multiple image 430 is marked as " being not used in reference ".In the first enhancement layer EL1, based on the maximum quantity (max_sublayer_for_ilp_plus1) 208 of inter-layer prediction sublayer, additional image 432 is marked as " being not used in reference ".By additional image 432 being labeled as " being not used in reference ", decoded picture buffer (DPB) size/memory can be reduced.
Fig. 5 is the block diagram that inter-layer reference image set (RPS) update module 518 is shown.Inter-layer reference image set (RPS) update module 518 of Fig. 5 can be the one configuration of inter-layer reference image set (RPS) update module 118 of Fig. 1.Inter-layer reference image set (RPS) update module 518 can be a part for the Video Decoder 112 on electronic equipment 102.Video Decoder 112 can use inter-layer reference image set (RPS) update module 518, with reference diagram image set (RPS) (RefPicSetInterLayer) 120 between update step.
Inter-layer reference image set (RPS) update module 518 can comprise interlayer image 534.In one configuration, interlayer image 534 can be non-RAP (random access points) image or random access points (RAP) image.Interlayer image 534 can be the image received from another electronic equipment 102 via bit stream 110 and 107.Inter-layer reference image set (RPS) update module 518 can determine whether to add interlayer image 524 to inter-layer reference image set (RPS) 120.
Interlayer image 534 can comprise layer identifier (nuh_layer_id) 536, time identifier (TemporalId) 538 and image sequence number (POC) 553.If the layer identifier of interlayer image 534 (nuh_layer_id) 536 corresponds to the layer as the direct reference layer of present image, and the image sequence of interlayer image No. 534 (POC) 553 equals image sequence number (POC) 561 of present image (nuh_layer_id) 559, then perform additional examination (for determining whether interlayer image 534 to be added in inter-layer reference image set (RPS) 120).
Compared with the maximum quantity (max_sublayer_for_ilp_plus1) 508 of the inter-layer prediction sublayer that the time identifier (TemporalId) 538 of interlayer image 534 notifies with signal by performed additional examination.If the value of the time identifier of interlayer image 534 (TemporalId) 538 is greater than maximum quantity (max_sublayer_for_ilp_plus1) 508-1 of inter-layer prediction sublayer, then interlayer image 534 is not added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.Similarly, if the value of the time identifier of interlayer image 534 (TemporalId) 538 is less than or equal to maximum quantity (max_sublayer_for_ilp_plus1) 508-1 of inter-layer prediction sublayer, then interlayer image 534 is added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.Additionally, if the value of the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 508 is, unless then interlayer image is random access points (RAP) image, interlayer image 534 is not added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.
Semanteme as defined at JCTVC-1008 (underscore represent system and method for the present invention the change that increases) is provided below in G7.4.7.2 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G7.4.7.2 is semantic "
For sub-clause F.7.4.7.2 and the illustrating of its whole sub-clause, the following amendment of application.
Derivation variable NumPocTotalCurr, as follows:
In sample code provided above, NumInterLayerRPSPics substitutes NumDirectRefLayers.
The alternative configurations (underscore represent system and method for the present invention the change that increases) for determining variable NumPocTotalCurr120 as defined in JCTVC-L1008 is provided below in G7.4.7.3 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G7.4.7.3 is semantic "
For sub-clause F.7.4.7.2 and the illustrating of its whole sub-clause, the following amendment of application.
Derivation variable NumInterLayerRPSPics, as follows:
Derivation variable NumPocTotalCurr, as follows:
In sample code provided above, the derivation of NumInterLayerRPSPics is new, and when calculating NumPocTotalCurr, NumInterLayerRPSPics substitutes NumDirectRefLayers.
Decoding process as defined in JCTVC-L1008 (underscore represent system and method for the present invention the change that increases) is provided below in G.2 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" process of G.2 decoding "
" be G.2.1 greater than the decoding process of the coded image of 0 for nuh_layer_id "
For present image CurrPic, decode processing operations is as follows:
1. in sub-clause 8.2, specify the decoding to NAL unit.
2. sub-clause G.8.1.1 and G.8.3.4 in process specify and use the decoding process of cutting slice layer and syntactic element above below:
-before first section of present image is decoded, call sub-clause G.8.1.1.
-when the decoding process of each P or B section starts, call sub-clause G.8.3.4 in the decoding process built for reference picture list of specifying, for the derivation of reference picture list 1 (RefPicList1) when reference picture list 0 (RefPicList0) and decoding B section.
3. the process in sub-clause 8.4,8.5,8.6 and 8.7 specifies the decoding process of the syntactic element used in all syntactic structure layers.The coded slice of bit stream coherence request image should comprise the section segment data of each code tree unit of image, makes image to be divided into the division of section, section is divided into the division of section section and section section is divided into the division of code tree unit that each forms the part of Iamge Segmentation.
4. after all sections of present image are all decoded, be invoked at sub-clause G.8.1.2 in the mark process for terminating to be greater than nuh_layer_id the decoding of the coded image of 0 of specifying.
" G.2.1.1 to the decoding process of inter-layer reference image set "
The output of this process is the renewal list RefPicSetInterLayer of interlayer image.
First empty list RefPicSetInterLayer, then derive as follows:
To in the amendment of G.2.1.1 part, when determining whether interlayer image 534 to be added to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120, consider time identifier (TemporalId) 538.If the value of the time identifier of interlayer image 534 (TemporalId) 538 is less than or equal to the value-1 of the maximum quantity (max_sublayer_for_ilp_plus1) 508 of inter-layer prediction sublayer, then interlayer image 534 is added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120, and for inter-layer prediction.In addition, if the value-1 of the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 508 is 0, if then interlayer image 534 is RAP images, interlayer image 534 is added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.
If the value of the time identifier of interlayer image 534 (TemporalId) 538 is greater than maximum quantity (max_sublayer_for_ilp_plus1) 508-1 of inter-layer prediction sublayer, interlayer image 534 is not added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120, and interlayer image 534 is not used in inter-layer prediction.In addition, if the value-1 of the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 508 is 0, if then interlayer image 534 is not RAP image, interlayer image 534 is not added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.Only when the value of layer identifier (nuh_layer_id) 536 is corresponding with the floor of the direct reference layer as present image and the image sequence of interlayer image numbers 553 equals image sequence number (POC) 561 of present image (nuh_layer_id) 559, the value of the time identifier (TemporalId) 538 of interlayer image 534 is compared with the value of the maximum quantity (max_sublayer_for_ilp_plus1) 508 of inter-layer prediction sublayer.
The alternative configurations (underscore represent system and method for the present invention the change that increases) for determining whether interlayer image 534 to add to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120 defined is provided in as JCTVC-L1008 in following G.2.1.2 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G.2.1.2 to the decoding process of inter-layer reference image set "
The output of this process is the renewal list RefPicSetInterLayer of interlayer image.
First empty list RefPicSetInterLayer, then derive as follows:
With G.2.1.1 described in similar to the decoding process of inter-layer reference image set (RPS), the G.2.1.2 described decoding process to inter-layer reference image set (RPS) uses the maximum quantity (max_sublayer_for_ilp_plus1) 508 of inter-layer prediction sublayer to determine whether interlayer image 534 to add in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.
The another kind configuration (underscore represent system and method for the present invention the change that increases) for determining whether interlayer image 534 to add to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120 defined is provided in as JCTVC-L1008 in following G.2.1.3 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G.2.1.3 to the decoding process of inter-layer reference image set "
The output of this process is the renewal list RefPicSetInterLayer of interlayer image.
First empty list RefPicSetInterLayer, then derive as follows:
With G.2.1.1 and G.2.1.2 described in similar to the decoding process of inter-layer reference image set (RPS), the G.2.1.3 described decoding process to inter-layer reference image set (RPS) uses the maximum quantity (max_sublayer_for_ilp_plus1) 508 of inter-layer prediction sublayer to determine whether interlayer image 534 to add in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.
Another configuration (underscore represent system and method for the present invention the change that increases) for determining whether interlayer image 534 to add to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120 defined is provided in as JCTVC-L1008 in following G.2.1.4 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G.2.1.4 to the decoding process of inter-layer reference image set "
The output of this process is the renewal list RefPicSetInterLayer of interlayer image.
First empty list RefPicSetInterLayer, then derive as follows:
With G.2.1.1, G.2.1.2 and G.2.1.3 described in similar to the decoding process of inter-layer reference image set (RPS), the G.2.1.4 described decoding process to inter-layer reference image set (RPS) uses the maximum quantity (max_sublayer_for_ilp_plus1) 508 of inter-layer prediction sublayer to determine whether interlayer image 534 to add in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.
The another kind configuration (underscore represent system and method for the present invention the change that increases) for determining whether interlayer image 534 to add to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120 defined is provided in as JCTVC-L1008 in following G.2.1.5 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G.2.1.5 to the decoding process of inter-layer reference image set "
The output of this process is the renewal list RefPicSetInterLayer of interlayer image.
First empty list RefPicSetInterLayer, then derive as follows:
With G.2.1.1, G.2.1.2, G.2.1.3 and G.2.1.4 described in similar to the decoding process of inter-layer reference image set (RPS), the G.2.1.5 described decoding process to inter-layer reference image set (RPS) uses the maximum quantity (max_sublayer_for_ilp_plus1) 508 of inter-layer prediction sublayer to determine whether interlayer image 534 to add in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.
The another kind configuration (underscore represent system and method for the present invention the change that increases) for determining whether interlayer image 534 to add to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120 defined is provided in as JCTVC-L1008 in following G.2.1.6 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G.2.1.6 to the decoding process of inter-layer reference image set "
The output of this process is the renewal list RefPicSetInterLayer of interlayer image.
First empty list RefPicSetInterLayer, then derive as follows:
With G.2.1.1, G.2.1.2, G.2.1.3, G.2.1.4 and G.2.1.5 described in similar to the decoding process of inter-layer reference image set (RPS), the G.2.1.6 described decoding process to inter-layer reference image set (RPS) uses the maximum quantity (max_sublayer_for_ilp_plus1) 508 of inter-layer prediction sublayer to determine whether interlayer image 534 to add in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.
The mark process (underscore represent system and method for the present invention the change that increases) for terminating the decoding to coded image defined is provided in as JCTVC-L1008 in following G.2.1.7 part.
" G.2.1.7 for terminating mark process nuh_layer_id being greater than to the decoding of the coded image of 0 "
The output of this process is:
-for the mark of " for the short term reference " that may upgrade of some decoded pictures.
Apply as follows:
for(i=0;i< NumInterLayerRPSPics[LayerIdInVps[nuh_layer_id]];i++)
RefPicSetInterLayer[i]ismarkedas″usedforshort-termreference″
When the decoding process of each P and B section starts, perform reference picture list and build.The decoding process (underscore represent system and method for the present invention the change that increases) of building for reference picture list of defining is provided in as JCTVC-L1008 in following G.2.1.8 part.Similar process also has appointment in JCTVC-L0452 and JCTVC-L0453.
" G.2.1.8 for the decoding process of reference picture structure "
This process is called when the decoding process of each P and B section starts.
Addressing reference key is carried out by the reference key of specifying in sub-clause 8.5.3.2.1.Reference key is the index in reference picture list.When the P that decodes cuts into slices, there is single reference picture list RefPicList0.When the B that decodes cuts into slices, except RefPicList0, also there is the second independent reference image list RefPicList1.
When the decoding process of each section starts, derivation reference picture list RefPicList0, and for the RefPicList1 that B cuts into slices, as follows:
Variable NumRpsCurrTempList0 is set to equal Max (num_ref_idx_l0_active_minus1+1, NumPocTotalCurr), and builds list RefPicListTemp0, as follows:
Build list RefPicList0, as follows:
When section is B section, variable NumRpsCurrTempList1 is set to equal Max (num_ref_idx_l1_active_minus1+1, NumPocTotalCurr), and builds list RefPicListTemp1, as follows:
When section is B section, build list RefPicList1, as follows:
for(rIdx=0;rIdx<=num_ref_idx_l1_ative_minus1;rIdx++)(8-11)RefPicList1[rIdx]=ref_pic_list_modification_flag_l1?RefPicListTemp1[list_entry_l1[rIdx]]:RefPicListTemp1[rIdx]
In G.2.1.8 part, the quantity (NumInterLayerRPSPics) of inter-layer reference image set (RPS) image is used to carry out the quantity (NumDirectRefLayers) of alternative direct reference layer.
Fig. 6 is the flow chart for the method 600 of reference diagram image set (RPS) (RefPicSetInterLayer) 120 between update step.Method 600 can be performed by electronic equipment 102.In one configuration, method 600 can be performed by the Video Decoder 112 on electronic equipment 102.Electronic equipment 102 can obtain the signaling of the maximum quantity (max_sublayer_for_ilp_plus1) 508 of 602 inter-layer prediction sublayers.In one configuration, electronic equipment 102 can obtain the signaling of the maximum quantity (max_sublayer_for_ilp_plus1) 508 of 602 inter-layer prediction sublayers from another electronic equipment 102 via bit stream 110.
Electronic equipment 102 can start 604 for the decoding process of inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.Electronic equipment 102 can obtain the value of the time identifier (TemporalId) 538 of 606 interlayer images 534, and described interlayer image 534 has the layer identifier (nuh_layer_id) 536 corresponding with the floor of the direct reference layer as current layer and has image sequence number (POC) 553 equal with the image sequence of present image (nuh_layer_id) No. 559 (POC) 561.
Electronic equipment 102 can determine whether the value of the time identifier (TemporalId) 538 of 608 interlayer images 534 is less than or equal to maximum quantity (max_sublayer_for_ilp_plus1) 508-1 of inter-layer prediction sublayer.If the value of the time identifier of interlayer image 534 (TemporalId) 538 is less than or equal to maximum quantity (max_sublayer_for_ilp_plus1) 508-1 of inter-layer prediction sublayer, then interlayer image 534 can add in 610 to inter-layer reference image set (RPS) (RefPicSetInterLayer) 120 by electronic equipment 102.In addition, if the value-1 of the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 508 is 0, if then interlayer image 534 is RAP images, interlayer image 534 is added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.Then, electronic equipment 102 can use 612 inter-layer reference image set (RPS) (RefPicSetInterLayer) 120, builds (as above G.2.1.7 as described in part) for reference picture list.
If the value of the time identifier of interlayer image 534 (TemporalId) 538 is not less than or is also not equal to maximum quantity (max_sublayer_for_ilp_plus1) 508-1 of inter-layer prediction sublayer (such as, the value of the time identifier (TemporalId) 538 of interlayer image 534 is greater than maximum quantity (max_sublayer_for_ilp_plus1) 508-1 of inter-layer prediction sublayer), then interlayer image 534 is not added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.In addition, if the value-1 of the maximum quantity of inter-layer prediction sublayer (max_sublayer_for_ilp_plus1) 508 is 0, if then interlayer image 534 is not RAP image, interlayer image 534 is not added in inter-layer reference image set (RPS) (RefPicSetInterLayer) 120.Then, electronic equipment 102 can use 612 inter-layer reference image set (RPS) (RefPicSetInterLayer) 120, builds (as above G.2.1.7 as described in part) for reference picture list.
Fig. 7 shows the block diagram of a kind of configuration of the video encoder 782 on electronic equipment 702.The video encoder 782 of Fig. 7 can be the one configuration of the video encoder 182 of Fig. 1.Video encoder 782 can comprise enhancement layer encoder 706, base layer encoder 709, resolution rise module 770 and output interface 780.
Enhancement layer encoder 706 can comprise the video input 781 receiving input picture 704.The output of video input 781 can be supplied to the adder/subtracter 783 of the output receiving prediction selection 750.The output of adder/subtracter 783 can be supplied to transform and quantization module 752.Conversion and the output of quantization modules 752 can be supplied to entropy code module 748 and convergent-divergent and inverse transform module 772.After execution entropy code, the output of entropy code module 748 can be supplied to output interface 780.Output interface 780 can the base layer video bit-stream 707 of output encoder and the enhancement layer video bit-streams 710 of coding.
The output of convergent-divergent and inverse transform module 772 can be supplied to adder 779.Adder 779 can also receive the output of prediction selection 750.The output of adder 779 can be supplied to (deblocking) module 751 of deblocking.The output of deblocking module 751 can be supplied to reference buffer storage 794.The output of reference buffer storage 794 can be supplied to motion compensating module 754.The output of motion compensating module 754 can be supplied to prediction selection 750.The output of reference buffer storage 794 can also be supplied to internal predictor 756.The output of internal predictor 756 can be supplied to prediction selection 750.Prediction selection 750 can also receive the output that resolution raises module 770.
Base layer encoder 709 can comprise the video input 762 receiving down-sampling input picture or alternative view input picture or identical input picture 703 (that is, identical with the input picture 704 that enhancement layer encoder 706 receives).The output of video input 762 can be supplied to coded prediction circulation 764.Entropy code 766 can be arranged in the output of coded prediction circulation 764.The output of coded prediction circulation 764 can also be supplied to reference buffer storage 768.Reference buffer storage 768 can circulate to coded prediction and 764 provide feedback.The output of reference buffer storage 768 can also be supplied to resolution and raise module 770.Once perform entropy code 766, export and just can be supplied to output interface 780.
Fig. 8 is the block diagram of a kind of configuration of the Video Decoder 812 illustrated on electronic equipment 802.The Video Decoder 812 of Fig. 8 can be the one configuration of the Video Decoder 112 of Fig. 1.Video Decoder 812 can comprise enhancement layer decoder 815 and base layer decoder 813.Video Decoder 812 can also comprise interface 889 and resolution raises 870.
Interface 889 can received code video flowing 885.Encoded video streams 885 can comprise base layer encoder video flowing and enhancement layer coding video flowing.Base layer encoder video flowing can send with enhancement layer coding video flowing with being separated or merging.Interface 889 can provide part or all of encoded video streams 885 to the entropy decoder module 886 in base layer decoder 813.The output of entropy decoder module 886 can be supplied to decoding prediction loop 887.The output of decoding prediction loop 887 can be supplied to reference buffer storage 888.Reference buffer storage can provide feedback to decoding prediction loop 887.Reference buffer storage 888 can also export the base layer video 884 of decoding.
Interface 889 can also provide part or all of encoded video streams 885 to the entropy decoder module 890 in enhancement layer decoder 815.The output of entropy decoder module 890 can be supplied to inverse quantization module 891.The output of inverse quantization module 891 can be supplied to adder 892.The output of inverse quantization module 891 and prediction can select the output of module 895 to be added by adder 892.The output of adder 892 can be supplied to deblocking module 893.The output of deblocking module 893 can be supplied to reference buffer storage 894.Reference buffer storage 894 can export the enhancement-layer video 882 of decoding.
The output of reference buffer storage 894 can also be supplied to internal predictor 897.Enhancement layer decoder 815 can comprise motion compensation 896.Motion compensation 896 can perform after resolution raises 870.Prediction selects module 895 can receive the output of internal predictor 897 and the output of motion compensation 896.
Fig. 9 shows and can send the various assemblies used in electronic equipment 902.One or more electronic installation 102 as herein described can realize by the transmission electronic equipment 902 according to Fig. 9.
Send electronic equipment 902 and comprise the processor 939 controlling the operation sending electronic equipment 902.Processor 939 also can be called as CPU (CPU).The equipment that memory 933 can comprise read-only memory (ROM), random access memory (RAM) maybe can store any type of information, and provide instruction 935a (such as executable instruction) and data 937a to processor 939.A part for memory 933 can also comprise nonvolatile RAM (NVRAM).Memory 933 can carry out electronic communication with processor 939.
Instruction 935b and data 937b also can reside in processor 939.Be loaded into instruction 935b in processor 939 and/or data 937b can also comprise and loading so that the data 935a being performed by processor 939 or processed and/or data 937a from memory 933.Instruction 935b can be performed to realize one or more method disclosed herein by processor 939.
Send electronic equipment 902 can comprise for carrying out with other electronic equipments (such as receiving electronic equipment) one or more communication interfaces 941 of communicating.Communication interface 941 can based on cable communicating technology, wireless communication technology or the two.The example of communication interface 941 comprises serial port, parallel port, USB (USB), Ethernet Adaptation Unit, IEEE1394 bus interface, small computer system interface (SCSI) bus interface, infrared ray (IR) communication port, Bluetooth wireless communication adapter and the transceiver etc. according to third generation partner program (3GPP) technical specification.
Send electronic equipment 902 and can comprise one or more output equipment 945 and one or more input equipment 943.The example of output equipment 945 comprises loud speaker, printer etc.A kind of output equipment sent in electronic equipment 902 that can be included in is display device 947.The display device 947 used in conjunction with configuration disclosed herein can utilize any suitable image projection technology, such as cathode ray tube (CRT), liquid crystal display (LCD), light-emitting diode (LED), gas plasma, electroluminescence etc.Display controller 949 can be provided, text, figure and/or moving image that the data transaction for being stored in memory 933 becomes (as required) to show on display 947.The example of input equipment 943 comprises keyboard, mouse, microphone, remote control equipment, button, joystick, trace ball, touch pad, touch-screen, light pen etc.
The various assemblies sending electronic installation 902 are coupled by bus system 951, and bus system 951 can comprise power bus, control signal bus, status signal bus in addition and data/address bus.But, for the sake of clarity, in Fig. 9, various bus is shown for bus system 951.Transmission electronic equipment 902 shown in Fig. 9 is functional block diagrams, instead of the list of concrete assembly.
Figure 10 is the block diagram that the various assemblies that can use in reception electronic equipment 1002 are shown.One or more electronic installation 102 can realize by the transmission electronic equipment 1002 according to Figure 10.
Receive electronic equipment 1002 and comprise the processor 1039 controlling the operation receiving electronic equipment 1002.Processor 1039 also can be called as CPU.Memory 1033 can comprise the equipment that ROM, RAM maybe can store any type of information, and provides instruction 1035a (such as executable instruction) and data 1037a to processor 1039.A part for memory 1033 can also comprise NVRAM.Memory 1033 can carry out electronic communication with processor 1039.
Instruction 1035b and data 1037b also can reside in processor 1039.Be loaded into instruction 1035b in processor 1039 and/or data 1037b can also comprise and loading so that the data 1035a being performed by processor 1039 or processed and/or data 1037a from memory 1033.Instruction 1035b can be performed to realize one or more method 200,300,400,500 disclosed herein by processor 1039.
Receive electronic equipment 1002 can comprise for carrying out with other electronic equipments (such as sending electronic equipment) one or more communication interfaces 1041 of communicating.Communication interface 1041 can based on cable communicating technology, wireless communication technology or the two.The example of communication interface 1041 comprises serial port, parallel port, USB, Ethernet Adaptation Unit, IEEE1394 bus interface, scsi bus interface, IR communication port, Bluetooth wireless communication adapter and the transceiver etc. according to 3GPP technical specification.
Receive electronic equipment 1002 and can comprise one or more output equipment 1045 and one or more input equipment 1043.The example of output equipment 1045 comprises loud speaker, printer etc.A kind of output equipment received in electronic equipment 1002 that can be included in is display device 1047.The display device 1047 used in conjunction with configuration disclosed herein can utilize any suitable image projection technology, such as CRT, LCD, LED, gas plasma, electroluminescence etc.Display controller 1049 can be provided, text, figure and/or moving image that the data transaction for being stored in memory 1033 becomes (as required) to show on display 1047.The example of input equipment 1043 comprises keyboard, mouse, microphone, remote control equipment, button, joystick, trace ball, touch pad, touch-screen, light pen etc.
The various assemblies receiving electronic installation 1002 are coupled by bus system 1051, and bus system 1051 can comprise power bus, control signal bus, status signal bus in addition and data/address bus.But, for the sake of clarity, in Figure 10, various bus is shown for bus system 1051.Reception electronic equipment 1002 shown in Figure 10 is functional block diagrams, instead of the list of concrete assembly.
Term " computer-readable medium " refers to can by any usable medium of computer or processor access.Term used herein " computer-readable medium " can represent non-transient and tangible computer and/or processor readable medium.By way of example and not limitation, computer-readable or processor readable medium can comprise RAM, ROM, EEPROM, CD-ROM or other optical disc storage, magnetic disc store or other magnetic storage apparatus or can be used for carrying and storing instruction or data structure form required program code and can by any other medium of computer or processor access.Disk used herein and CD comprise compact disk (CD), laser disk, CD, digital versatile disc (DVD), floppy disk and Blu-ray (registered trade mark) dish, the wherein usual magnetically rendering data of disk, and CD laser carrys out rendering data optically.
It should be pointed out that one or more method as described herein can use hardware to realize and/or implement.Such as, one or more method as herein described or scheme can use chipset, ASIC, LSI or integrated circuit etc. to realize and/or implement.
Each method disclosed herein comprises one or more method step for realizing described method or action.Method step and/or action can exchange and/or be combined into a step mutually, and do not depart from the scope of claim.In other words, unless for making described method proper operation require concrete step or action, can modify to the order of concrete steps and/or action and/or use, and not departing from the scope of claim.
Should be appreciated that claim is not defined to the accurate configuration and assembly illustrated above.Amendment can be made to the layout of system as herein described, method and apparatus, operation and details, and not depart from the scope of claim.

Claims (21)

1. a method for video coding, comprising:
Obtain the signaling of the maximum quantity of inter-layer prediction sublayer;
Start the decoding process for inter-layer reference image set (RPS);
Obtain the value of the time identifier of interlayer image;
Determine that the maximum quantity whether value of described time identifier is greater than described inter-layer prediction sublayer subtracts 1; And
If the maximum quantity that the value of the time identifier of described interlayer image is not more than described inter-layer prediction sublayer subtracts 1, then described interlayer image is added in described inter-layer reference image set (RPS).
2. method according to claim 1, also comprises and described inter-layer reference image set (RPS) being built for reference picture list.
3. method according to claim 1, wherein, described interlayer image comprises the value of the layer identifier corresponding with the layer of the direct reference layer as current layer.
4. method according to claim 1, wherein, described interlayer image comprises and the image sequence of present image number equal image sequence number.
5. method according to claim 1, wherein, if the value of the maximum quantity of described inter-layer prediction sublayer is 0, and described interlayer image is non-random access points (RAP) image, then do not added in described inter-layer reference image set (RPS) by described interlayer image.
6. method according to claim 1, wherein, if the value of the maximum quantity of described inter-layer prediction sublayer is 0, and described interlayer image is random access points (RAP) image, then added in described inter-layer reference image set (RPS) by described interlayer image.
7. method according to claim 1, wherein, the maximum quantity of described inter-layer prediction sublayer has index [LayderIdInVps [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]]].
8. method according to claim 1, wherein, the maximum quantity of described inter-layer prediction sublayer has index [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]].
9. method according to claim 1, wherein, the maximum quantity of described inter-layer prediction sublayer has index [layer_id_in_nuh [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]]].
10. method according to claim 1, wherein, the maximum quantity of described inter-layer prediction sublayer has index [layer_id_in_nuh [i]].
11. methods according to claim 1, wherein, the maximum quantity of described inter-layer prediction sublayer has index [i].
12. methods according to claim 1, wherein, inter-layer reference image list RefPicSetInterLayer has index [NumInterLayerRPSPics [LayerIdInVps [nuh_layer_id]]].
13. methods according to claim 12, wherein, NumInterLayerRPSPics [LayerIdInVps [nuh_layer_id]] is derived as different from NumDirectRefLayers [LayerIdInVps [nuh_layer_id]].
14. 1 kinds of electronic equipments being arranged to Video coding, comprising:
Processor;
With the memory of described processor telecommunication, the instruction wherein stored in which memory can be performed with:
Obtain the signaling of the maximum quantity of inter-layer prediction sublayer;
Start the decoding process for inter-layer reference image set (RPS);
Obtain the value of the time identifier of interlayer image;
Determine that the maximum quantity whether value of described time identifier is greater than described inter-layer prediction sublayer subtracts 1; And
If the maximum quantity that the value of the time identifier of described interlayer image is not more than described inter-layer prediction sublayer subtracts 1, then described interlayer image is added in described inter-layer reference image set (RPS).
15. electronic equipments according to claim 14, also comprise and described inter-layer reference image set (RPS) being built for reference picture list.
16. electronic equipments according to claim 14, wherein, described interlayer image comprises the value of the layer identifier corresponding with the layer of the direct reference layer as current layer.
17. electronic equipments according to claim 14, wherein, described interlayer image comprises and the image sequence of present image number equal image sequence number.
18. electronic equipments according to claim 14, wherein, if the value of the maximum quantity of described inter-layer prediction sublayer is 0, and described interlayer image is non-random access points (RAP) image, then do not added in inter-layer reference image set (RPS) by described interlayer image.
19. electronic equipments according to claim 14, if the value of the maximum quantity of described inter-layer prediction sublayer is 0, and described interlayer image is random access points (RAP) image, then described interlayer image is added in described inter-layer reference image set (RPS).
20. electronic equipments according to claim 14, wherein, the maximum quantity of described inter-layer prediction sublayer has index [LayderIdInVps [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]]].
21. electronic equipment according to claim 14, wherein, the maximum quantity of described inter-layer prediction sublayer has index [RefLayerId [LayerIdInVps [nuh_layer_id]] [i]].
CN201480020037.9A 2013-04-05 2014-04-02 Decoding of inter-layer reference picture set and reference picture list construction Pending CN105122816A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US13/857,990 2013-04-05
US13/857,990 US9532067B2 (en) 2013-04-05 2013-04-05 Decoding of inter-layer reference picture set and reference picture list construction
US201361818804P 2013-05-02 2013-05-02
US61/818,804 2013-05-02
PCT/JP2014/001923 WO2014162739A1 (en) 2013-04-05 2014-04-02 Decoding of inter-layer reference picture set and reference picture list construction

Publications (1)

Publication Number Publication Date
CN105122816A true CN105122816A (en) 2015-12-02

Family

ID=51658047

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201480020037.9A Pending CN105122816A (en) 2013-04-05 2014-04-02 Decoding of inter-layer reference picture set and reference picture list construction

Country Status (5)

Country Link
EP (1) EP2982123A4 (en)
JP (1) JP2016519853A (en)
CN (1) CN105122816A (en)
HK (1) HK1215835A1 (en)
WO (1) WO2014162739A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112740705A (en) * 2018-09-21 2021-04-30 夏普株式会社 System and method for signaling reference pictures in video coding
CN114554196A (en) * 2018-08-17 2022-05-27 华为技术有限公司 Reference picture management in video coding

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20130116782A (en) 2012-04-16 2013-10-24 한국전자통신연구원 Scalable layer description for scalable coded video bitstream
US20150256838A1 (en) * 2012-09-30 2015-09-10 Sharp Kabushiki Kaisha Signaling scalability information in a parameter set
US9325997B2 (en) 2012-11-16 2016-04-26 Huawei Technologies Co., Ltd Signaling scalability information in a parameter set
US9426468B2 (en) 2013-01-04 2016-08-23 Huawei Technologies Co., Ltd. Signaling layer dependency information in a parameter set
KR20150009424A (en) * 2013-07-15 2015-01-26 한국전자통신연구원 Method and apparatus for image encoding and decoding using inter-layer prediction based on temporal sub-layer information

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101356822A (en) * 2006-01-10 2009-01-28 汤姆逊许可公司 Method and apparatus for constructing reference picture lists for scalable video coding svc.
CN101444104A (en) * 2006-03-27 2009-05-27 诺基亚公司 Reference picture marking in scalable video encoding and decoding
CN103024397A (en) * 2013-01-07 2013-04-03 华为技术有限公司 Method and device for determining time domain motion vector predictor

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101356822A (en) * 2006-01-10 2009-01-28 汤姆逊许可公司 Method and apparatus for constructing reference picture lists for scalable video coding svc.
CN101444104A (en) * 2006-03-27 2009-05-27 诺基亚公司 Reference picture marking in scalable video encoding and decoding
CN103024397A (en) * 2013-01-07 2013-04-03 华为技术有限公司 Method and device for determining time domain motion vector predictor

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HANNUKSELA M M ET AL: "《Design considered for signalling inter-layer prediction indication》", 《103. MPEG MEETING;21-1-2013 - 25-1-2013;GENEVA;(MOTION PICTURE EXPERT GROUP OR ISO/IEC JTC1/SC29/WG11)》 *
JIANLE CHEN: "《SHVC Draft Text 1》", 《12.JCT-VC MEETING;103.MPEG MEETING;14-1-2013 - 23-1-2013;GENEVA;(JOINT COLLABORATIVE TEAM ON VIDEO CODING OF ISO/IEC JCT1/SC29/WG11 AND ITU-T SG.16)》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114554196A (en) * 2018-08-17 2022-05-27 华为技术有限公司 Reference picture management in video coding
CN114554196B (en) * 2018-08-17 2023-04-28 华为技术有限公司 Method, apparatus and system for decoding decoded code stream
US11758123B2 (en) 2018-08-17 2023-09-12 Huawei Technologies Co., Ltd. Reference picture management in video coding
CN112740705A (en) * 2018-09-21 2021-04-30 夏普株式会社 System and method for signaling reference pictures in video coding

Also Published As

Publication number Publication date
EP2982123A1 (en) 2016-02-10
JP2016519853A (en) 2016-07-07
WO2014162739A1 (en) 2014-10-09
EP2982123A4 (en) 2016-09-07
HK1215835A1 (en) 2016-09-15

Similar Documents

Publication Publication Date Title
US10104390B2 (en) Marking pictures for inter-layer prediction
US9532067B2 (en) Decoding of inter-layer reference picture set and reference picture list construction
CN105122816A (en) Decoding of inter-layer reference picture set and reference picture list construction
RU2668723C2 (en) Method and equipment for coding and decoding video signals
JP6386532B2 (en) Coding output layer set video data
JP7096896B2 (en) Methods and equipment for video coding
JP6272630B2 (en) Video encoding and decoding method and technical apparatus
CN110708541B (en) Video decoding method, video decoder, apparatus and storage medium
JP2021517392A5 (en)
CN108650514A (en) Tile is aligned signaling and consistency constraint
CN105379273A (en) Signaling indications and constraints
JP2018530968A (en) Video coding using helper data for spatial intra prediction
JP2019134444A (en) Method for decoding image and apparatus using the same
US20150312568A1 (en) Method and technical equipment for video encoding and decoding
CN112135134B (en) Video decoding method, video decoding device, computer equipment and storage medium
WO2014162747A1 (en) Reference picture set signaling and restriction on an electronic device
CA3150261A1 (en) Method and apparatus of high-level signaling for weighted prediction
JP2023521609A (en) Method, computer program and apparatus for video coding
WO2023060023A1 (en) Method, apparatus and medium for video processing
JP2023518453A (en) Encoders, decoders and corresponding methods
WO2023056358A1 (en) Method, apparatus and medium for video processing
EP2887673A1 (en) Method for coding a sequence of pictures and method for decoding a bitstream and corresponding devices

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1215835

Country of ref document: HK

WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20151202

WD01 Invention patent application deemed withdrawn after publication
REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1215835

Country of ref document: HK