CN101401433A - Method and apparatus for decoding/encoding of a video signal - Google Patents

Method and apparatus for decoding/encoding of a video signal

Publication number
CN101401433A
Authority
CN
China
Prior art keywords
image
information
nal unit
picture
flag information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 200780008303
Other languages
Chinese (zh)
Inventor
朴胜煜
全柄文
朴志皓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics Inc
Publication of CN101401433A

Abstract

In decoding a scalable video signal using a partial picture reference on a temporal domain and a scalable domain, the present invention provides a method including obtaining a first partial picture on a first temporal point, and decoding a full picture referring to the first partial picture, the full picture being on a second temporal point, the second temporal point being located after the first temporal point, wherein a level of the first partial picture on a scalable domain is lower than a level of the full picture on the scalable domain.

Description

Method and apparatus for decoding/encoding a video signal
Technical Field
The present invention relates to a video signal coding scheme.
Background Art
Compression coding refers to a series of signal processing techniques for transmitting digitized information over a communication circuit or storing it in a form suitable for a storage medium. Targets of compression coding include audio, video, characters, and the like. In particular, a technique for performing compression coding on video is called video sequence compression. A video sequence is generally characterized by spatial redundancy and temporal redundancy.
A scalable-video-coded bitstream can be decoded partially and selectively. For example, a decoder of low complexity can decode the base layer, and a bitstream at a low data rate can be extracted for transmission over a network of limited capacity. To generate higher-resolution images gradually, the picture quality of the sequence needs to be improved step by step.
Summary of the Invention
Technical Object
An object of the present invention is to improve the coding efficiency of a video signal.
Technical Solution
Accordingly, the present invention is directed to a method of coding a video signal that substantially obviates one or more problems due to limitations and disadvantages of the related art.
An object of the present invention is to define a syntax for codec compatibility, thereby improving compatibility between different types of codecs.
Another object of the present invention is to define a syntax for recomposition of a scalable-video-coded bitstream, thereby improving compatibility between codecs.
Another object of the present invention is to restrict a syntax indicating whether to store a reference base picture at an appropriate position, thereby improving compatibility between codecs.
Another object of the present invention is to define, at an appropriate position, a syntax indicating whether to store a reference base picture, thereby managing the decoded picture buffer efficiently.
Another object of the present invention is to perform decoded picture marking efficiently by means of a syntax, defined at an appropriate position, indicating whether to store a reference base picture.
Another object of the present invention is to provide a decoding method that minimizes problems in decoding a video signal caused by errors generated during transmission.
Another object of the present invention is to provide a method of managing the decoded picture buffer by a decoding scheme that minimizes problems in decoding a video signal caused by errors generated during transmission.
Advantageous Effects
Accordingly, the present invention provides the following effects or advantages.
First, in coding a video signal, the present invention can improve compatibility between different types of codecs by defining a syntax for codec compatibility. For example, a syntax structure for transforming a scalable-video-coded bitstream into a bitstream coded by an AVC (Advanced Video Coding) codec enhances compatibility between codecs.
Second, the present invention manages the decoded picture buffer (DPB) more efficiently, reducing the burden imposed on the DPB. Coding speed can therefore be improved.
Third, by using various kinds of configuration information about a scalable video sequence, the present invention enables more efficient coding.
Description of Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention.
In the drawings:
Fig. 1 is a schematic block diagram of a scalable video coding system according to the present invention;
Fig. 2 is a diagram of configuration information on a scalable sequence that can be added to a scalable-video-coded bitstream according to an embodiment of the present invention;
Fig. 3 is a diagram of various scalability structures of scalable video coding, used to explain a process of storing and using a reference base picture according to an embodiment of the present invention;
Fig. 4 is a flowchart of a process for storing a reference base picture according to an embodiment of the present invention;
Fig. 5 is a diagram of a syntax structure for storing and marking a reference base picture according to an embodiment of the present invention;
Fig. 6 is a diagram of a syntax structure for storing and marking a reference base picture according to an embodiment of the present invention; and
Figs. 7 to 12 are diagrams of syntax structures for obtaining flag information indicating whether to store a current NAL unit in a buffer, according to respective embodiments of the present invention.
Best Mode
Additional features and advantages of the invention will be set forth in the description that follows, and in part will be apparent from the description or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims as well as the appended drawings.
To achieve these and other advantages, and in accordance with the purpose of the present invention as embodied and broadly described, in decoding a scalable video signal using a partial picture reference on a temporal domain and a scalable domain, the present invention includes: obtaining a first partial picture at a first temporal point; and decoding a full picture with reference to the first partial picture, the full picture being at a second temporal point located after the first temporal point; wherein the level of the first partial picture on the scalable domain is lower than the level of the full picture on the scalable domain.
Preferably, the method further includes decoding the full picture using scalable decoding of a second partial picture, the second partial picture being at the second temporal point and corresponding to the first partial picture at the first temporal point.
Preferably, the method further includes obtaining restriction flag information for restricting a syntax corresponding to partial reference information in the bitstream.
To further achieve these and other advantages, and in accordance with the purpose of the present invention, a method according to the present invention includes: obtaining a quality level of a current NAL unit; determining whether the current NAL unit contains a slice of a reference picture; when the current NAL unit corresponds to the lowest quality level and contains a slice of the reference picture, obtaining first flag information indicating whether to store the current NAL unit in a buffer; and marking a decoded picture as a reference base picture based on the first flag information, wherein the current NAL unit is included in the decoded picture.
To further achieve these and other advantages, and in accordance with the purpose of the present invention, an apparatus for decoding a video signal according to the present invention includes: an identification information checking unit that checks the quality level of a current NAL unit and checks whether the current NAL unit contains a slice of a reference picture; and a decoded picture buffer unit that, if the check by the identification information checking unit shows that the current NAL unit corresponds to the lowest quality level and contains a slice of the reference picture, marks the decoded picture containing the NAL unit as a reference base picture based on first flag information indicating whether to store the current NAL unit in a buffer.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed.
Mode for Invention
Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings.
First of all, compression coding of video signal data takes into account spatial redundancy, temporal redundancy, scalable redundancy, and inter-view redundancy. Compression coding that considers scalable redundancy is one embodiment of the present invention, but the technical idea of the invention is equally applicable to temporal redundancy, spatial redundancy, inter-view redundancy, and the like.
In this specification, "coding" includes both the concepts of encoding and decoding and can be interpreted flexibly according to the technical idea and scope of the present invention.
In the bit sequence configuration of a video signal, there is a separate layer structure called the NAL (Network Abstraction Layer) between the VCL (Video Coding Layer), which handles the moving picture encoding process itself, and the lower-level system that transmits and stores the encoded information. The output of the encoding process is VCL data, which is mapped into NAL units before transmission or storage. Each NAL unit contains compressed video data or an RBSP (Raw Byte Sequence Payload: the result data of moving picture compression), which is the data corresponding to header information.
A NAL unit basically consists of two parts: a NAL unit header and an RBSP. The NAL unit header includes flag information (nal_ref_idc) indicating whether the NAL unit contains a slice of a reference picture, and an identifier (nal_unit_type) indicating the type of the NAL unit. The compressed original data is stored in the RBSP, and RBSP trailing bits are appended to the end of the RBSP so that its length is a multiple of 8 bits. NAL unit types include the IDR (Instantaneous Decoding Refresh) picture, SPS (Sequence Parameter Set), PPS (Picture Parameter Set), SEI (Supplemental Enhancement Information), and so on.
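As a non-limiting illustration, the two header fields named above can be read from the first byte of a NAL unit as sketched below in Python. The sketch assumes the one-byte header layout of H.264/AVC (1 bit forbidden_zero_bit, 2 bits nal_ref_idc, 5 bits nal_unit_type), which is taken from that standard rather than from this description; the names NalHeader, parse_nal_header, and carries_reference_slice are hypothetical.

```python
from typing import NamedTuple


class NalHeader(NamedTuple):
    forbidden_zero_bit: int
    nal_ref_idc: int
    nal_unit_type: int


def parse_nal_header(first_byte: int) -> NalHeader:
    """Split the one-byte H.264/AVC NAL unit header into its three fields."""
    if not 0 <= first_byte <= 0xFF:
        raise ValueError("the NAL unit header must fit in a single byte")
    return NalHeader(
        forbidden_zero_bit=(first_byte >> 7) & 0x1,  # top bit
        nal_ref_idc=(first_byte >> 5) & 0x3,         # next two bits
        nal_unit_type=first_byte & 0x1F,             # low five bits
    )


def carries_reference_slice(header: NalHeader) -> bool:
    """A non-zero nal_ref_idc signals data belonging to a reference picture."""
    return header.nal_ref_idc != 0


# 0x67 is a commonly seen first byte of a sequence parameter set NAL unit.
print(parse_nal_header(0x67))
```

The bit masks keep the parser independent of any bitstream library, so the same function can be reused wherever a NAL unit's first byte is available.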
Accordingly, if the information indicating the type of the NAL unit (nal_unit_type) indicates a scalable-video-coded slice, coding efficiency can be improved by adding various kinds of configuration information relevant to scalable coding. For instance, it is possible to add flag information indicating whether the current access unit is an instantaneous decoding refresh (hereinafter IDR) access unit, dependency identification information indicating spatial scalability, quality identification information, flag information indicating whether a reference base picture is used as a reference picture, priority identification information, and the like. These kinds of configuration information can also be used to manage the decoded picture buffer more efficiently, as explained in detail with reference to Fig. 2 below.
In the standard, requirements for various profiles and levels are specified so that a target product can be implemented at an appropriate cost. A decoder must satisfy the requirements determined by the corresponding profile and level. Thus the two concepts "profile" and "level" are defined to indicate a function or parameter expressing how large a range of compressed sequences the decoder can handle. The profile identifier (profile_idc) is a flag indicating the profile on which a bitstream is based. For instance, in H.264/AVC, a profile identifier of 66 means that the bitstream is based on the Baseline profile, 77 means the Main profile, and 88 means the Extended profile. The profile identifier is included in the sequence parameter set.
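The three profile_idc values listed above can be captured in a small lookup, sketched here in Python for illustration only. The table holds just the values named in this description; the names PROFILE_NAMES and describe_profile are hypothetical, and any other identifier is reported as unknown rather than guessed.

```python
# profile_idc values named in the description (H.264/AVC).
PROFILE_NAMES = {
    66: "Baseline",
    77: "Main",
    88: "Extended",
}


def describe_profile(profile_idc: int) -> str:
    """Return the profile name for a profile_idc, or a marker if it is not listed."""
    return PROFILE_NAMES.get(profile_idc, "not listed in this sketch")


for idc in (66, 77, 88):
    print(idc, describe_profile(idc))
```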
Therefore, to handle a scalable sequence, it is necessary to identify whether the input bitstream conforms to a profile for scalable sequences. If it does, syntax needs to be added so that at least one piece of additional information for the scalable sequence can be transmitted. Here, the profile for scalable sequences is a profile mode for handling scalable video, defined as an extension of H.264/AVC.
Because SVC is an extension of the conventional AVC technology, adding syntax as additional information for the SVC mode is more efficient than adding syntax unconditionally. For instance, coding efficiency can be improved if information about the scalable sequence is added only when the AVC profile identifier indicates a profile for scalable sequences.
The sequence parameter set is header information containing information that applies to the coding of the whole sequence, such as profile and level. An entire compressed moving picture, that is, a sequence, must start with a sequence header. The sequence parameter set, which corresponds to that header information, should therefore reach the decoder before the data that depends on it. In other words, the sequence parameter set RBSP serves as the header information for the result data of moving picture compression. Once a bitstream is input, the profile identifier first identifies which of the plural profiles the input bitstream is based on.
Various embodiments of an efficient video signal decoding method are explained below.
Fig. 1 is a schematic block diagram of a scalable video coding system according to the present invention.
To provide sequences optimized for various communication environments and terminals, the sequences supplied to terminals must be diversified. Supplying a sequence optimized for each terminal means preparing a separate sequence source for every combination of parameter values, including the number of frames transmitted per second, resolution, the number of bits per pixel, and so on. Such optimization therefore places a burden on the content provider.
Accordingly, the content provider encodes the original sequence into compressed sequence data of a high bit rate. On receiving a sequence request from a terminal, the content provider decodes the original sequence, re-encodes it into sequence data suited to the sequence processing capability of the terminal, and then supplies the encoded data to the terminal. Because this transcoding involves an encoding-decoding-encoding process, a time delay in supplying the sequence is unavoidable, and complicated hardware and algorithms are additionally required.
Scalable video coding (SVC) is a coding scheme that encodes a video signal at the best picture quality so that a partial sequence of the resulting picture sequence can also be decoded and presented as a sequence. Here, a partial sequence is a sequence made up of frames selected intermittently from the whole sequence. For a picture sequence encoded by SVC, the sequence size can be reduced for a low bit rate using spatial scalability, and the picture quality of the sequence can be lowered using quality scalability. A picture sequence with a small screen and/or a low frame rate can be called a base layer, and a sequence with a relatively large screen and/or a relatively high frame rate can be called an enhanced or enhancement layer.
A picture sequence encoded by the scalable scheme described above allows a sequence of low picture quality to be presented by receiving and processing only a partial sequence. If the bit rate is lowered, however, the picture quality degrades considerably.
To address this degradation, a separate auxiliary picture sequence for a low bit rate can be provided, for example a picture sequence with a small screen and/or a low frame rate. Such an auxiliary sequence can be called a base layer, and the main picture sequence can be called an enhanced or enhancement layer.
The scalable video coding system is explained in detail below.
First, the scalable video coding system includes an encoder 102 and a decoder 110.
The encoder 102 includes a base layer encoding unit 104, an enhancement layer encoding unit 106, and a multiplexing unit 108. The decoder 110 can include a demultiplexing unit 112, a base layer decoding unit 114, and an enhancement layer decoding unit 116.
The base layer encoding unit 104 can generate a base layer bitstream by compressing an input sequence signal X(n).
The enhancement layer encoding unit 106 can generate an enhancement layer bitstream using the input sequence signal X(n) and information generated by the base layer encoding unit 104.
The multiplexing unit 108 can then generate a scalable bitstream using the base layer bitstream and the enhancement layer bitstream.
The generated scalable bitstream is transmitted to the decoder 110 over a given channel. The demultiplexing unit 112 of the decoder 110 can separate the transmitted scalable bitstream into an enhancement layer bitstream and a base layer bitstream.
The base layer decoding unit 114 receives the base layer bitstream and decodes it into an output sequence signal Xb(n).
The enhancement layer decoding unit 116 receives the enhancement layer bitstream and decodes it into an output sequence signal Xe(n) with reference to the signal reconstructed by the base layer decoding unit 114. Here, the output sequence signal Xb(n) has lower picture quality or resolution than the output sequence signal Xe(n).
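The data flow between the multiplexing unit 108 and the unit of the decoder 110 that separates the scalable bitstream can be pictured with the toy Python sketch below. It performs no actual video coding: payloads are opaque byte strings, and the layer tags "base" and "enh" as well as the function names multiplex and demultiplex are assumptions made purely for illustration.

```python
from typing import Iterable, List, Tuple

# (layer tag, payload); the tag values are hypothetical, not bitstream syntax.
Packet = Tuple[str, bytes]


def multiplex(base: Iterable[bytes], enhancement: Iterable[bytes]) -> List[Packet]:
    """Interleave base layer and enhancement layer payloads into one stream."""
    stream: List[Packet] = []
    for base_payload, enh_payload in zip(base, enhancement):
        stream.append(("base", base_payload))
        stream.append(("enh", enh_payload))
    return stream


def demultiplex(stream: Iterable[Packet]) -> Tuple[List[bytes], List[bytes]]:
    """Separate a scalable stream back into its base and enhancement parts."""
    base: List[bytes] = []
    enhancement: List[bytes] = []
    for tag, payload in stream:
        (base if tag == "base" else enhancement).append(payload)
    return base, enhancement


scalable_stream = multiplex([b"b0", b"b1"], [b"e0", b"e1"])
base_only, _ = demultiplex(scalable_stream)  # a low-complexity decoder may stop here
print(base_only)
```

A decoder that only needs Xb(n) can discard the second list, which mirrors the partial decoding described for low-complexity terminals.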
In scalable video coding, when a particular picture is encoded as an enhancement layer and transmitted, part of the enhancement layer bitstream may be damaged during transmission. Because the decoder 110 then decodes the picture using the damaged enhancement layer bitstream, the decoded sequence differs from the original sequence in picture quality. The problem becomes more serious if the affected picture is a reference picture needed to decode another picture having the lowest temporal level.
Pictures having the lowest temporal level therefore need to be managed more efficiently. This is explained in detail below with reference to Fig. 3 and Fig. 4.
According to an embodiment of the present invention, the decoded picture buffer (DPB) enables scalable storing or marking of full pictures and partial pictures in scalable video coding. Here, a full picture is a picture having the highest quality level, and a partial picture is a picture having the lowest quality level. Alternatively, full picture and partial picture can be defined to mean pictures of relatively high and relatively low quality level, respectively.
For example, if the quality level is divided into five levels (0 to 4), a partial picture can be a picture whose quality level is 0 to 3, and a full picture can be a picture whose quality level is 4. Alternatively, only quality level 0 may correspond to a partial picture.
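The two conventions in the five-level example can be written out as a short Python sketch. The function name classify_picture, the lowest_only switch, and the label "intermediate" for levels that the second convention leaves unnamed are all assumptions introduced for illustration.

```python
def classify_picture(quality_level: int, max_level: int = 4,
                     lowest_only: bool = False) -> str:
    """Classify a picture as 'full' or 'partial' from its quality level.

    lowest_only=False: levels 0..max_level-1 are partial, max_level is full.
    lowest_only=True:  only level 0 is partial; levels in between are left
                       unnamed by the description and reported as 'intermediate'.
    """
    if not 0 <= quality_level <= max_level:
        raise ValueError("quality level out of range")
    if quality_level == max_level:
        return "full"
    if lowest_only and quality_level != 0:
        return "intermediate"
    return "partial"


for level in range(5):
    print(level, classify_picture(level), classify_picture(level, lowest_only=True))
```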
Meanwhile, a partial picture at a first temporal point needs to be stored in advance so that it can be used as a reference picture. To decode a partial or full picture at a second temporal point located after the first temporal point, the partial picture at the first temporal point can then be used as a reference picture. The full picture or the partial picture at the first temporal point can also be used as a reference picture adaptively.
Fig. 2 is a diagram of configuration information on a scalable sequence that can be added to a scalable-video-coded bitstream according to an embodiment of the present invention.
Fig. 2 shows an example of the structure of a NAL unit to which configuration information on a scalable sequence can be added.
A NAL unit basically consists of a NAL unit header and an RBSP (raw byte sequence payload: the result data of moving picture compression).
The NAL unit header can include identification information (nal_ref_idc) indicating whether the NAL unit contains a slice of a reference picture, and information (nal_unit_type) indicating the type of the NAL unit.
Under certain conditions, an extension region of the NAL unit header can also be included.
For example, if the information indicating the NAL unit type relates to scalable video coding or indicates a prefix NAL unit, the NAL unit can include the extension region of the NAL unit header. Specifically, if nal_unit_type equals 20 or 14, the NAL unit can include the extension region of the NAL unit header. Depending on flag information (svc_mvc_flag) that identifies whether the bitstream is an SVC bitstream, configuration information for a multi-view sequence can be added to the extension region of the NAL unit header.
As another example, if the information indicating the NAL unit type indicates a subset sequence parameter set, the RBSP can include information on the subset sequence parameter set. Specifically, if nal_unit_type equals 15, the RBSP can include information on the subset sequence parameter set. In this case, the subset sequence parameter set can include an extension region of the sequence parameter set depending on profile information. For example, if the profile information (profile_idc) indicates a profile relevant to scalable video coding, the subset sequence parameter set can include the extension region of the sequence parameter set. Alternatively, the sequence parameter set itself can include the extension region depending on the profile information. The extension region of the sequence parameter set can include restriction flag information for restricting a specific syntax for codec compatibility.
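The nal_unit_type conditions in the two examples above amount to a simple dispatch, sketched below in Python. Only the values stated in this description (14, 20, and 15) are used; the function name and the dictionary keys are hypothetical, and the further dependence on svc_mvc_flag and profile_idc is deliberately left out.

```python
def extension_regions(nal_unit_type: int) -> dict:
    """Report which extension-related structures a NAL unit type may carry."""
    return {
        # Prefix NAL unit (14) or scalable-coded slice (20).
        "nal_header_extension": nal_unit_type in (14, 20),
        # Subset sequence parameter set (15).
        "subset_sequence_parameter_set": nal_unit_type == 15,
    }


for nal_type in (5, 14, 15, 20):
    print(nal_type, extension_regions(nal_type))
```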
Various kinds of configuration information on a scalable sequence, for example information that can be included in the extension region of the NAL unit header or in the extension region of the sequence parameter set, are explained in detail below.
First, the identification information indicating spatial scalability is information that identifies the dependency of a NAL unit. For example, the dependency varies with spatial resolution. In Fig. 3, the pictures within Spa_Layer0 can share one resolution, as can the pictures within Spa_Layer1. The pictures in Spa_Layer0 can include pictures obtained by downsampling the pictures in Spa_Layer1.
Specifically, if the information identifying the dependency of a NAL unit is named dependency_id, the pictures in Spa_Layer0 have dependency_id = 0 and the pictures in Spa_Layer1 have dependency_id = 1.
The dependency identification information can be defined in various ways. NAL units having the same value of the information identifying dependency can be represented as a dependency representation.
Quality identification information is information that identifies the quality of a NAL unit. For example, a single picture can be encoded into pictures of different quality. In Fig. 3, the pictures in Spa_Layer0 and Spa_Layer1 can each be encoded into pictures that differ from one another in quality.
Specifically, if the information identifying the quality of a NAL unit is named quality_id, pictures B1, B2, ..., B10 can be set to quality_id = 0 and pictures Q1, Q2, ..., Q10 can be set to quality_id = 1. That is, pictures B1, B2, ..., B10 are the pictures of the lowest picture quality, and are called base pictures. Pictures Q1, Q2, ..., Q10 can include pictures B1, B2, ..., B10 and have better picture quality than them. The quality identification information can be defined in various ways; for example, it can be represented as 16 levels.
Meanwhile, a single layer can be defined by the information identifying dependency together with the quality identification information. In this case, NAL units having the same values of both can be represented as a layer representation.
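Grouping by these two identifiers can be illustrated with the Python sketch below, which collects NAL units sharing dependency_id into dependency representations and NAL units sharing both dependency_id and quality_id into layer representations. The NalUnit class, the unit names, and the sample identifier values are hypothetical and are not taken from Fig. 3.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class NalUnit:
    name: str           # hypothetical label for the sketch
    dependency_id: int
    quality_id: int


def dependency_representations(units: Iterable[NalUnit]) -> Dict[int, List[str]]:
    """Group NAL unit names by dependency_id."""
    groups: Dict[int, List[str]] = defaultdict(list)
    for unit in units:
        groups[unit.dependency_id].append(unit.name)
    return dict(groups)


def layer_representations(units: Iterable[NalUnit]) -> Dict[Tuple[int, int], List[str]]:
    """Group NAL unit names by the pair (dependency_id, quality_id)."""
    groups: Dict[Tuple[int, int], List[str]] = defaultdict(list)
    for unit in units:
        groups[(unit.dependency_id, unit.quality_id)].append(unit.name)
    return dict(groups)


sample = [NalUnit("u0", 0, 0), NalUnit("u1", 0, 1), NalUnit("u2", 1, 0), NalUnit("u3", 0, 0)]
print(dependency_representations(sample))
print(layer_representations(sample))
```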
Identification information indicating temporal scalability is information that identifies the temporal level of a NAL unit. The temporal level can be explained using a hierarchical B picture structure.
For example, picture (B1, Q1) and picture (B3, Q3) in Spa_Layer0 can have the same temporal level Tem_Layer0. If picture (B5, Q5) refers to picture (B1, Q1) and picture (B3, Q3), then picture (B5, Q5) can have a temporal level Tem_Layer1 higher than the temporal level Tem_Layer0 of those pictures. Likewise, if picture (B7, Q7) refers to picture (B1, Q1) and picture (B5, Q5), then picture (B7, Q7) can have a temporal level Tem_Layer2 higher than the temporal level Tem_Layer1 of picture (B5, Q5). All NAL units within a single access unit can have the same temporal level. In the case of an IDR access unit, the temporal level value can be 0.
Flag information indicating whether a reference base picture is used as a reference picture indicates whether the reference base picture or the decoded picture is used as a reference picture in the inter-prediction process. This flag information can have the same value for NAL units in the same layer, that is, for NAL units having the same information identifying dependency.
Priority identification information is information that identifies the priority of a NAL unit. It can be used to provide inter-layer or inter-picture extensibility. For example, sequences at various temporal and spatial levels can be provided to a user by means of the priority identification information, so that the user can view only a sequence at a specific time and space, or only a sequence that meets some other restriction condition.
The priority information can be configured in various ways according to its reference condition. It can also be configured arbitrarily, without relying on a specific reference, and it can be determined by the decoder.
The configuration information in the extension region of the NAL unit header can include flag information indicating whether the current access unit is an IDR access unit.
Fig. 3 is a diagram of various scalability structures of scalable video coding, used to explain a process of storing and using a reference base picture according to an embodiment of the present invention.
First, for temporal scalability, the layers of a video sequence can be determined by frame rate.
Referring to Fig. 3, moving upward within each layer corresponds to a higher temporal scalable layer, that is, a higher frame rate.
Temporal scalable video coding can be implemented by applying the concept of hierarchical B pictures or hierarchical P pictures to H.264 video coding. For example, when predicting picture (B5, Q5), which belongs to temporal level Tem_Layer1, pictures (B7, Q7) and (B9, Q9) cannot be used as reference pictures because they belong to temporal level Tem_Layer2, whose value is greater than that of Tem_Layer1. Pictures (B1, Q1) and (B3, Q3), however, belong to the lower temporal level Tem_Layer0 and can therefore be used as reference pictures.
Consequently, a picture belonging to a given temporal layer can be decoded independently of whether pictures belonging to higher temporal layers are decoded. Once the decodable level is determined from the capability of the decoder, an H.264-compatible video signal at the corresponding frame rate can be decoded.
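The reference restriction and the independent decodability described above can be sketched in Python as follows. The temporal levels are those stated for pictures B1, B3, B5, B7, and B9; treating a candidate of equal temporal level as permissible is an assumption of the sketch, since the description only addresses higher and lower levels, and the function names are hypothetical.

```python
from typing import Dict, List

# Temporal levels as stated in the description (0 = Tem_Layer0, and so on).
TEMPORAL_LEVEL: Dict[str, int] = {"B1": 0, "B3": 0, "B5": 1, "B7": 2, "B9": 2}


def allowed_references(current: str, candidates: List[str],
                       levels: Dict[str, int] = TEMPORAL_LEVEL) -> List[str]:
    """Keep candidates whose temporal level does not exceed that of the current picture."""
    return [c for c in candidates
            if c != current and levels[c] <= levels[current]]


def extract_sub_sequence(pictures: List[str], max_level: int,
                         levels: Dict[str, int] = TEMPORAL_LEVEL) -> List[str]:
    """Drop every picture above max_level; the remainder stays decodable."""
    return [p for p in pictures if levels[p] <= max_level]


print(allowed_references("B5", ["B1", "B3", "B7", "B9"]))
print(extract_sub_sequence(["B1", "B3", "B5", "B7", "B9"], max_level=1))
```

Because no retained picture references a dropped one under this rule, a decoder of limited capability can select max_level to match the frame rate it can handle.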
Spatial scalability in the following key-drawing 3.Spa_Layer0 comprises identical resolution respectively with image among the Spa_Layer1.The image of Spa_Layer0 is by the image among the Spa_Layer1 is carried out the image that down-sampling obtains.For example, be set to dependency_id about the information of the identification dependence of NAL unit, the image among the Spa_Layer0 can be set to dependency_id and equal 0, and the image among the Spa_Layer1 can be set to dependency_id and equal 1.
Quality scalability is explained below. Each layer on the spatial axis can contain pictures that differ from one another in quality. For example, if the information identifying the quality of a NAL unit is denoted quality_id, pictures B1, B2, ..., B10 can be assigned quality_id equal to 0 and pictures Q1, Q2, ..., Q10 quality_id equal to 1. That is, pictures B1, B2, ..., B10 have the lowest picture quality, whereas pictures Q1, Q2, ..., Q10 have higher picture quality than B1, B2, ..., B10. The quality identification information can be defined in various ways. For example, it can be expressed as 16 levels.
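As an illustration of how the two identifiers label the pictures of Fig. 3, the sketch below tags a picture with a dependency_id and a quality_id. The `LayerIds` class, the `label` and `is_base_quality` helpers, and the value range 0 to 15 (one reading of the 16 levels mentioned above) are assumptions made for this sketch.

```python
from dataclasses import dataclass

QUALITY_LEVELS = 16  # the text mentions 16 levels; the range 0..15 is assumed


@dataclass(frozen=True)
class LayerIds:
    dependency_id: int  # 0 for Spa_Layer0, 1 for Spa_Layer1
    quality_id: int     # 0 is the lowest quality

    def __post_init__(self):
        if not 0 <= self.quality_id < QUALITY_LEVELS:
            raise ValueError("quality_id out of range")


def label(picture_name: str, spatial_layer: int) -> LayerIds:
    # B pictures carry the lowest quality, Q pictures the next level up.
    quality_id = 0 if picture_name.startswith("B") else 1
    return LayerIds(dependency_id=spatial_layer, quality_id=quality_id)


def is_base_quality(ids: LayerIds) -> bool:
    # Only pictures at the lowest quality level can be base pictures.
    return ids.quality_id == 0
```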
The process of storing a reference base picture and decoding with the stored reference base picture according to an embodiment of the invention is explained below.
For the pictures shown in Fig. 3, the decoding order can be set to B1, Q1, B2, Q2, B3, Q3, ..., B10, Q10 (1 → 2 → 3 → 4 → ... → 9 → 10). If the picture currently to be decoded is B4, pictures B1, Q1, B2, Q2, B3 and Q3 have already been decoded. Picture B4 has the lowest temporal level and the lowest quality level. Picture B4 can reference picture B2, which is a base picture. Picture B2 must therefore be stored in the decoded picture buffer.
In this case, when picture B2 is decoded, flag information is needed to indicate that B2 will be stored in the decoded picture buffer for use by a picture coded later (for example, picture B4). For example, if the current NAL unit corresponds to a reference base picture, the flag information indicating whether to store the current NAL unit in the buffer can be defined as store_ref_base_pic_flag. A mark indicating whether picture B2 will be used as a base picture may also be needed. Thus, after picture B2 is decoded, the decoded picture buffer can mark B2 as a reference base picture. After these steps, when picture B4 is decoded, B4 can, according to store_ref_base_pic_flag, use as its reference picture the picture B2 that is stored in the decoded picture buffer and marked as a reference base picture.
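The store-then-reference sequence for B2 and B4 can be sketched as below. This is a simplified model under stated assumptions: the `DecodedPictureBuffer` class and its methods are hypothetical, and the buffer tracks only the names of pictures kept as reference base pictures, not actual sample data.

```python
class DecodedPictureBuffer:
    """Toy buffer that remembers which pictures were kept as reference base pictures."""

    def __init__(self):
        self._reference_base = set()

    def on_picture_decoded(self, name, store_ref_base_pic_flag):
        # Keep and mark the base picture only when the flag asks for it.
        if store_ref_base_pic_flag:
            self._reference_base.add(name)

    def get_reference_base(self, name):
        # Return the name if the picture is stored and marked, otherwise None.
        return name if name in self._reference_base else None


dpb = DecodedPictureBuffer()
dpb.on_picture_decoded("B2", store_ref_base_pic_flag=1)  # kept for later use
dpb.on_picture_decoded("B3", store_ref_base_pic_flag=0)  # not kept
reference_for_b4 = dpb.get_reference_base("B2")
```

When B4 is decoded, the lookup succeeds only because B2 was stored and marked at the time it was decoded.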
According to another embodiment of the invention, the following explains the process of obtaining the flag information indicating whether to store the current NAL unit in the buffer when the current NAL unit corresponds to the lowest quality level and contains a slice of a reference picture.
For example, this flag information may be a syntax element used only for a scalable-video-coded bitstream. Other information that can restrict this flag information is therefore needed for codec compatibility. Alternatively, other information that can restrict this flag information is needed so that the bitstream format can be converted. For example, flag information for rewriting a scalable-video-coded bitstream can be defined to satisfy codec compatibility.
For compatibility with an earlier codec, for example when a scalable-video-coded bitstream is to be decoded by an AVC codec, the scalable-video-coded bitstream is rewritten into an AVC bitstream. In that case, the restriction flag information can restrict syntax information that applies only to the scalable-video-coded bitstream. With this restriction, a scalable-video-coded bitstream can be converted into an AVC bitstream by a simple conversion process. For example, the restriction flag information can be expressed as slice_header_retriction_flag. It can be obtained from a sequence parameter set or a subset sequence parameter set, or from the extension area of a subset sequence parameter set.
Syntax elements used only by a specific codec can be restricted. For example, when the current NAL unit corresponds to the lowest quality level and contains a slice of a reference picture, the restriction flag information can be used at the slice header to restrict the flag information indicating whether to store the current NAL unit in the buffer. Specifically, store_ref_base_pic_flag can be obtained only when slice_header_retriction_flag = 0. If slice_header_retriction_flag = 1, store_ref_base_pic_flag cannot be obtained. This makes the slice header of the scalable video bitstream equivalent to the header of an AVC bitstream, so that decoding by an AVC codec becomes possible.
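The gating of store_ref_base_pic_flag by slice_header_retriction_flag can be sketched as a conditional read. The function name, the iterator-based bit source, and the use of `None` for "not present" are assumptions of this sketch; the text does not say what value, if any, is inferred when the flag is absent.

```python
def read_store_ref_base_pic_flag(slice_header_retriction_flag, bits):
    """Read the flag from `bits` (an iterator of 0/1 values) only when allowed."""
    if slice_header_retriction_flag == 0:
        # The SVC-only syntax element is present in the slice header.
        return next(bits)
    # Restricted: the element is absent, so no bit is consumed and the
    # slice header keeps the same layout as an AVC slice header.
    return None


header_bits = iter([1, 0])
present = read_store_ref_base_pic_flag(0, header_bits)
absent = read_store_ref_base_pic_flag(1, header_bits)
```

Because the restricted case consumes no bits, the rest of the header parses exactly as it would without the SVC-only element.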
Fig. 4 is a flowchart of the process of storing a reference base picture according to an embodiment of the invention.
The following description first uses Fig. 3. When picture B4, the current picture, is to be decoded, B4 corresponds to a picture with the lowest temporal level and the lowest quality level. That is, picture B4 can correspond to a base picture represented by the base representation. Picture B4 can therefore reference picture B2 as a base picture. To reference picture B2, B2 must be stored in the decoded picture buffer beforehand.
The following describes the process of storing picture B2 in the decoded picture buffer so that B2 can be used as a reference picture.
When picture B2 is decoded, quality identification information can be obtained from the extension area of the current NAL unit header. When the quality identification information indicates the lowest value, as described for Fig. 2, the current NAL unit of picture B2 can correspond to a base picture. It is therefore necessary to check, from the obtained quality identification information, whether the current NAL unit of picture B2 corresponds to a base picture (S410).
Because picture B2 will be used as a reference picture, this can be signaled in the NAL unit header. For example, identification information (nal_ref_idc) indicating whether the current NAL unit contains a slice of a reference picture can be obtained. From this identification information, it is necessary to check whether the current NAL unit of picture B2 contains a slice of a reference picture (S420). If the current NAL unit is a base picture and contains a slice of a reference picture, the current NAL unit can correspond to a reference base picture.
Thus, if the current NAL unit corresponds to the lowest quality level and contains a slice of a reference picture, the reference picture marking process is carried out. In the picture marking process of the decoded picture buffer, the reference base picture can additionally be marked as a reference base picture. In this case, flag information indicating whether to store the reference base picture can be obtained. For this flag information to be obtainable, no other flag information may restrict it. For example, it is necessary to check the restriction flag information that restricts specific syntax for codec compatibility (S430).
The restriction flag information can be obtained from the extension area of a subset sequence parameter set. Specifically, suppose the restriction flag information for rewriting a scalable-video-coded bitstream into an AVC bitstream for codec compatibility is slice_header_retriction_flag. This restriction flag information can indicate whether specific syntax referring to the sequence parameter set is present in the slice header. Based on this restriction flag information, the flag information indicating whether to store the current NAL unit in the buffer can be obtained (S440).
According to the flag information indicating whether to store the current NAL unit in the buffer, if the current NAL unit is stored and is not an IDR picture, the marking process for the decoded base-layer reference picture can be carried out.
According to the flag information indicating whether to store the current NAL unit in the buffer, if the current NAL unit is stored and the decoded picture containing the stored NAL unit is marked as a reference base picture, the video signal can be decoded using that reference base picture (S450). For example, if picture B2 is stored according to the flag information and marked as a reference base picture, B2 can be used as a reference picture when decoding picture B4, the current picture.
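Steps S410 to S440 and the non-IDR condition can be collected into one decision function. This is a sketch under assumptions rather than the patented procedure: the `NalUnit` class and `should_mark_reference_base` are hypothetical names, the lowest quality level is taken to be quality_id equal to 0, and a nonzero nal_ref_idc is taken to mean that the NAL unit contains a slice of a reference picture.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class NalUnit:
    quality_id: int
    nal_ref_idc: int
    is_idr: bool
    store_ref_base_pic_flag: Optional[int]  # None when absent from the header


def should_mark_reference_base(nal: NalUnit, slice_header_retriction_flag: int) -> bool:
    if nal.quality_id != 0:                 # S410: not the lowest quality level
        return False
    if nal.nal_ref_idc == 0:                # S420: no slice of a reference picture
        return False
    if slice_header_retriction_flag != 0:   # S430: SVC-only syntax is restricted
        return False
    if not nal.store_ref_base_pic_flag:     # S440: flag absent or zero
        return False
    # The base-layer marking process applies only to non-IDR pictures.
    return not nal.is_idr


b2 = NalUnit(quality_id=0, nal_ref_idc=1, is_idr=False, store_ref_base_pic_flag=1)
b2_is_marked = should_mark_reference_base(b2, slice_header_retriction_flag=0)
```

Ordering the checks this way mirrors the flowchart: each early return corresponds to one branch that leaves the picture unmarked.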
Fig. 5 is a diagram of a syntax structure for storing and marking a reference base picture according to an embodiment of the invention.
Fig. 5 shows an example of a syntax structure for the technical concept of the flowchart explained in Fig. 4.
First, it is necessary to check in the slice header, from the quality identification information, whether the current NAL unit corresponds to a base picture (S510).
From the identification information nal_ref_idc, which indicates whether the current NAL unit contains a slice of a reference picture, it is necessary to check whether the current NAL unit contains a slice of a reference picture (S520).
If the current NAL unit is a base picture and contains a slice of a reference picture, the current NAL unit can correspond to a reference base picture. Thus, if the current NAL unit corresponds to a reference base picture, the reference picture marking process is carried out (S530). In the picture marking process of the decoded picture buffer, the reference base picture can additionally be marked as a reference base picture.
In this case, flag information indicating whether to store the reference base picture can be obtained. To obtain this flag information, it is necessary to check the restriction flag information that restricts specific syntax for codec compatibility (S540).
The restriction flag information can be obtained from the extension area of a subset sequence parameter set. For example, the restriction flag information for rewriting a scalable-video-coded bitstream into an AVC bitstream for codec compatibility can be set as slice_header_retriction_flag. The restriction flag information can indicate whether specific syntax referring to the sequence parameter set is present in the slice header.
Based on this restriction flag information, the flag information indicating whether to store the reference base picture can be obtained (S550).
According to the flag information indicating whether to store the reference base picture, if the reference base picture is stored and is not an IDR picture (S560), the marking process for the decoded base-layer reference picture can be carried out (S570).
Alternatively, flag information indicating whether the reference base picture or the decoded picture is used as a reference picture in the inter prediction process can be checked. As a result of the check, if the reference base picture is used as a reference picture and is not an IDR picture (S560), the marking process for the decoded base-layer reference picture can be carried out (S570).
Fig. 6 is a diagram of a syntax structure for storing and marking a reference base picture according to an embodiment of the invention.
When a signal is processed in units of NAL units, another NAL unit preceding the current NAL unit can be used. This other NAL unit is called a prefix NAL unit.
The prefix NAL unit can be used to convey to the base layer information that applies only to SVC, while maintaining compatibility between the base layer bitstream and an AVC codec.
For example, the flag information explained for Fig. 5, indicating whether to store the reference base picture, can be contained in the prefix NAL unit. Specifically, from the identification information nal_ref_idc, which indicates whether the current NAL unit contains a slice of a reference picture, it is necessary to check whether the current NAL unit contains a slice of a reference picture. If the current NAL unit contains a slice of a reference picture, the flag information indicating whether to store the reference base picture can be obtained.
According to the flag information indicating whether to store the reference base picture, if the reference base picture is stored and the reference picture is not an IDR picture, the marking process for the decoded base-layer reference picture can be carried out.
Alternatively, flag information indicating whether the reference base picture or the decoded picture is used as a reference picture in the inter-prediction process can be checked. As a result of the check, if the reference base picture is used as a reference picture and is not an IDR picture, the marking process for the decoded base-layer reference picture can be carried out.
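The prefix NAL mechanism can be sketched as a pass over a NAL unit sequence in which each prefix NAL unit supplies the flag for the NAL unit that follows it. The dictionary layout, the "kind" labels and the function name are hypothetical, and treating a nonzero nal_ref_idc as "contains a slice of a reference picture" is an assumption of this sketch.

```python
def attach_prefix_flags(nal_units):
    """Pair each non-prefix NAL unit with the flag carried by its prefix, if any."""
    result = []
    pending_prefix = None
    for nal in nal_units:
        if nal["kind"] == "prefix":
            # Remember the prefix; it applies to the next NAL unit only.
            pending_prefix = nal
            continue
        flag = None
        if pending_prefix is not None and nal["nal_ref_idc"] != 0:
            # The flag is read only for NAL units holding a reference slice.
            flag = pending_prefix.get("store_ref_base_pic_flag")
        result.append((nal["name"], flag))
        pending_prefix = None
    return result


stream = [
    {"kind": "prefix", "store_ref_base_pic_flag": 1},
    {"kind": "slice", "name": "B2", "nal_ref_idc": 1},
    {"kind": "slice", "name": "Q2", "nal_ref_idc": 1},
    {"kind": "prefix", "store_ref_base_pic_flag": 1},
    {"kind": "slice", "name": "B3", "nal_ref_idc": 0},
]
flags = attach_prefix_flags(stream)
```

A decoder that does not understand prefix NAL units can skip them and still parse the remaining units, which is how the base layer stays readable by an AVC codec.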
Figs. 7 to 12 are diagrams of syntax structures for obtaining the flag information indicating whether to store the current NAL unit in the buffer, according to embodiments of the invention.
In the embodiment shown in Fig. 7, the flag information for storing the reference base picture can be defined as store_base_ref_flag. This flag information can be obtained from the slice header (S710).
A predetermined condition can be imposed for obtaining this flag information. For example, the condition can be that the slice type is not PR, that is, not a slice type indicating an enhanced quality level; this can be called the case of the lowest quality level.
Information indicating whether specific syntax of the current slice is restricted for codec compatibility can be checked. For example, other information that can restrict this flag information can be checked so that the bitstream format can be converted.
In addition, flag information for rewriting a scalable-video-coded bitstream can be defined to satisfy codec compatibility.
As described above, if the slice type is not PR and the flag information is not restricted by the restriction flag information, the flag information for storing the reference base picture can be obtained.
In the embodiment shown in Fig. 8, another piece of flag information can be defined in order to obtain the flag information indicating whether to store the current NAL unit in the buffer. For example, flag information indicating whether the reference base picture is used as a reference picture can be defined. This flag information indicates whether the reference base picture or the decoded picture is used as a reference picture in the inter prediction process. For NAL units having the same dependency identification information, this flag information can have the same value. This flag information can be defined in the extension area of the NAL unit header (S810).
It can be checked whether the current slice corresponds to the base layer and has the lowest quality level (S820).
It can be checked whether the current slice is used as a reference picture (S830).
It can be checked whether the reference base picture is used as a reference picture (S840). If the reference base picture is used as a reference picture, the flag information indicating whether to store the current NAL unit in the buffer can be obtained (S850).
After the current NAL unit is stored in the buffer, if the type of the NAL unit is related to SVC (S860), the process of marking the decoded picture containing the current NAL unit as a reference base picture can be carried out (S870).
The flag information indicating whether to store the current NAL unit in the buffer can also be obtained from the slice header under certain conditions. For example, if the slice type indicates a slice of an enhanced quality level, the slice corresponds to the first part when the quality layer is divided, and the reference base picture is used as a reference picture (S880), the flag information indicating whether to store the current NAL unit in the buffer can be obtained (S890).
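The NAL-header path of Fig. 8 (S820 to S850) amounts to three checks in sequence. The sketch below covers only that path, not the slice-header path (S880 to S890); the function and parameter names are hypothetical, and the chain of conditions is one reading of the steps as listed.

```python
def may_obtain_store_flag_fig8(is_base_layer, quality_id,
                               slice_is_reference, base_used_as_reference):
    """Return True when the store flag may be read on the NAL-header path."""
    if not (is_base_layer and quality_id == 0):  # S820: base layer, lowest quality
        return False
    if not slice_is_reference:                   # S830: slice used as reference
        return False
    # S840: the reference base picture must itself be used as a reference
    # before the store flag is obtained (S850).
    return bool(base_used_as_reference)


fig8_ok = may_obtain_store_flag_fig8(True, 0, True, True)
```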
In the embodiment shown in Fig. 9, the flag information indicating whether to store the current NAL unit in the buffer (S930) can be defined in a different way. For example, this flag information can be defined in the extension area of the NAL unit header together with another piece of flag information indicating whether the reference base picture is used as a reference picture (S910).
In the embodiment shown in Fig. 10, another piece of flag information is used to obtain the flag information indicating whether to store the current NAL unit in the buffer. For example, flag information indicating whether the reference base picture is used as a reference picture can be defined. This flag information can be defined in the extension area of the NAL unit header (S1010).
It can be checked whether the current slice corresponds to the base layer and whether its quality level is the lowest (S1020).
It can be checked whether the current slice is used as a reference picture (S1030).
If the current slice is used as a reference picture, the flag information indicating whether to store the current NAL unit in the buffer can be obtained (S1040).
If the current NAL unit is stored in the buffer and the type of the current NAL unit is related to SVC (S1050), the process of marking the decoded picture containing the current NAL unit as a reference base picture can be carried out (S1060).
The flag information indicating whether to store the current NAL unit in the buffer can also be obtained from the slice header under certain conditions. For example, if the slice type does not correspond to a slice of an enhanced quality level, the specific syntax of the current slice is not restricted for codec compatibility, and the reference base picture is used as a reference picture (S1070), the flag information indicating whether to store the current NAL unit in the buffer can be obtained (S1080).
In the embodiment shown in Fig. 11, the flag information indicating whether to store the current NAL unit in the buffer can be obtained using another piece of flag information. For example, flag information indicating whether the reference base picture is used as a reference picture can be used.
For example, if the slice type does not correspond to a slice of an enhanced quality level and the reference base picture is used as a reference picture, the flag information indicating whether to store the current NAL unit in the buffer can be obtained (a).
If the reference base picture is used as a reference picture and the specific syntax of the current slice is not restricted for codec compatibility, the flag information indicating whether to store the current NAL unit in the buffer can be obtained (b).
After checking whether the reference base picture is used as a reference picture, the flag information indicating whether to store the current NAL unit in the buffer can be obtained (c).
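The three alternatives (a), (b) and (c) of Fig. 11 differ only in which extra condition accompanies the check on the base picture. They can be compared side by side as predicates. All names here are hypothetical, and reading (c) as "obtain the flag when the check succeeds" is an assumption, since the text states only that the check comes first.

```python
def variant_a(slice_is_enhanced_quality: bool, base_used_as_reference: bool) -> bool:
    # (a): not an enhanced-quality slice, and the base picture is a reference.
    return (not slice_is_enhanced_quality) and base_used_as_reference


def variant_b(base_used_as_reference: bool, syntax_restricted: bool) -> bool:
    # (b): the base picture is a reference, and no codec-compatibility restriction.
    return base_used_as_reference and not syntax_restricted


def variant_c(base_used_as_reference: bool) -> bool:
    # (c): only the reference check itself gates the flag.
    return base_used_as_reference


# One input where the variants disagree: an enhanced-quality slice whose
# base picture is a reference, with unrestricted syntax.
outcomes = (variant_a(True, True), variant_b(True, False), variant_c(True))
```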
In the embodiment shown in Fig. 12, the flag information indicating whether to store the current NAL unit in the buffer can be obtained using another piece of flag information.
For example, if the slice type indicates a slice of an enhanced quality level, it can be checked whether the slice corresponds to the first part when the quality layer is divided (S1210).
It can be checked whether the reference base picture is used as a reference picture, or whether the quality level of the current slice indicates the first enhanced quality level (S1230).
The flag information indicating whether to store the current NAL unit in the buffer can then be obtained (S1250).
As described above, the decoder/encoder of the invention can be provided in a transmitter/receiver for multimedia broadcasting, such as DMB (digital multimedia broadcasting), to decode video signals, data signals and the like. The multimedia broadcast transmitter/receiver can include a mobile communication terminal.
A decoding/encoding method to which the invention is applied can be prepared as a program for execution by a computer and stored in a computer-readable recording medium. Multimedia data having the data structure of the invention can also be stored in a computer-readable recording medium. Computer-readable recording media include all kinds of storage devices that store data readable by a computer system. Examples include ROM, RAM, CD-ROM, magnetic tape, floppy disks and optical data storage devices, as well as devices implemented in the form of carrier waves (for example, transmission over the Internet). A bitstream generated by the encoding method can be stored in a computer-readable medium or transmitted over a wired connection.
Industrial Applicability
Although the invention has been described and illustrated with reference to its preferred embodiments, it will be clear to those skilled in the art that various modifications and variations can be made without departing from the spirit or scope of the invention. The invention therefore covers the modifications and variations of the invention that fall within the scope of the appended claims and their equivalents.

Claims (17)

1. A method of decoding a scalable video signal using partial picture reference on a temporal domain and a scalable domain, the method comprising:
obtaining a first partial picture at a first temporal point; and
decoding a full picture with reference to the first partial picture, the full picture being at a second temporal point located after the first temporal point,
wherein the level of the first partial picture on the scalable domain is lower than the level of the full picture on the scalable domain.
2. The method according to claim 1, wherein the full picture is decoded using scalable decoding of a second partial picture, the second partial picture being at the second temporal point and corresponding to the first partial picture at the first temporal point.
3. The method according to claim 1, further comprising obtaining restriction flag information for restricting syntax, the syntax corresponding to partial reference information in the bitstream.
4. A method of decoding a video signal, the method comprising:
obtaining the quality level of a current NAL unit;
determining whether the current NAL unit contains a slice of a reference picture;
obtaining, when the current NAL unit corresponds to the lowest quality level and contains a slice of the reference picture, first flag information indicating whether to store the current NAL unit in a buffer; and
marking a decoded picture as a reference base picture based on the first flag information, wherein the current NAL unit is included in the decoded picture.
5. The method according to claim 4, further comprising decoding the video signal using the marked reference base picture.
6. The method according to claim 4, wherein the reference base picture corresponds to the lowest temporal level.
7. The method according to claim 4, wherein the first flag information is obtained from a slice header.
8. The method according to claim 4, wherein the first flag information is obtained from the RBSP of a NAL unit preceding the current NAL unit.
9. The method according to claim 4, further comprising checking restriction flag information that restricts specific syntax for codec compatibility, wherein the first flag information is obtained based on the restriction flag information.
10. The method according to claim 9, wherein the restriction flag information is obtained from the extension area of a subset sequence parameter set.
11. The method according to claim 9, wherein, if the NAL unit is stored according to the first flag information and the current NAL unit is a non-IDR picture, the decoded picture containing the NAL unit is marked as the reference base picture.
12. The method according to claim 9, further comprising obtaining second flag information indicating whether the reference base picture is used as a reference picture, wherein the decoded picture is marked further based on the second flag information when the reference base picture is a non-IDR picture.
13. The method according to claim 4, further comprising:
obtaining restriction flag information that restricts specific syntax for codec compatibility; and
obtaining, based on the restriction flag information, information required for dividing a quality layer,
wherein the information is scan position information for transform coefficient levels.
14. The method according to claim 4, wherein the video signal is received as a broadcast signal.
15. The method according to claim 4, wherein the video signal is received via a digital medium.
16. A medium on which a program for executing the method of claim 4 is recorded, the medium being configured to be read by a computer.
17. An apparatus for decoding a video signal, the apparatus comprising:
an identification information checking unit for checking the quality level of a current NAL unit and for checking whether the current NAL unit contains a slice of a reference picture; and
a decoded picture buffer unit which, if according to the check result of the identification information checking unit the current NAL unit corresponds to the lowest quality level and contains a slice of the reference picture, marks the decoded picture containing the NAL unit as a reference base picture based on first flag information indicating whether to store the current NAL unit in a buffer.
CN 200780008303 2006-09-07 2007-09-07 Method and apparatus for decoding/encoding of a video signal Pending CN101401433A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US84266106P 2006-09-07 2006-09-07
US60/842,661 2006-09-07
US60/857,802 2006-11-09
US60/858,957 2006-11-15
US60/859,532 2006-11-17

Publications (1)

Publication Number Publication Date
CN101401433A true CN101401433A (en) 2009-04-01

Family

ID=40494908

Family Applications (2)

Application Number Title Priority Date Filing Date
CN200780008161.3A Active CN101395925B (en) 2006-09-07 2007-09-07 Method and apparatus for decoding/encoding of a video signal
CN 200780008303 Pending CN101401433A (en) 2006-09-07 2007-09-07 Method and apparatus for decoding/encoding of a video signal

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN200780008161.3A Active CN101395925B (en) 2006-09-07 2007-09-07 Method and apparatus for decoding/encoding of a video signal

Country Status (1)

Country Link
CN (2) CN101395925B (en)

Also Published As

Publication number Publication date
CN101395925B (en) 2013-01-02
CN101395925A (en) 2009-03-25

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned

Effective date of abandoning: 20090401

C20 Patent right or utility model deemed to be abandoned or is abandoned