CN101395925B

CN101395925B - Method and apparatus for decoding/encoding of a video signal

Info

Publication number: CN101395925B
Application number: CN200780008161.3A
Authority: CN
Inventors: 朴胜煜; 全柄文; 朴志皓
Original assignee: LG Electronics Inc
Current assignee: LG Electronics Inc
Priority date: 2006-09-07
Filing date: 2007-09-07
Publication date: 2013-01-02
Anticipated expiration: 2027-09-07
Also published as: CN101395925A; CN101401433A

Abstract

A method of decoding a video signal is disclosed. The present invention includes determining whether to store a first partial picture when the first partial picture and a first full picture are corresponding to a first temporal point and storing the first partial picture for decoding a second full picture referring to the first partial picture, the second full picture being corresponding to a second temporal point, the second temporal point being located after the first temporal point, wherein a level of the first partial picture on a scalable domain is lower than a level of the second full picture on the scalable domain.

Description

The method and the device that are used for the decoding/encoding vision signal

Technical field

The present invention relates to a kind of vision signal coding/decoding scheme.

Background technology

The compression coding/decoding refers to a series of signal processing technology, is used for perhaps storing this digital information with the form of suitable storage medium by the transmitting digitized information of telecommunication circuit.The object of compression coding/decoding has audio frequency, video, character etc.Especially, the technology of video execution compressed encoding is called as compression of video sequence.Usually, video sequence is characterised in that and comprises spatial redundancy or time redundancy.

Scalable-video-coded bit stream can be by a partial decoding of h optionally.For example, the decoder with low complex degree can be decoded basic layer, and the bit stream of low data rate can be extracted to be transmitted for the network by limited capacity.In order further little by little to generate high-resolution image, need to progressively improve the picture quality of sequence.

Summary of the invention

Technical purpose

The object of the invention is to improve the coding/decoding efficient of vision signal.

Technological means

Therefore, the present invention be directed to a kind of method of coding/decoding vision signal, it has fully avoided one or more problems of causing because of the restriction of correlation technique and shortcoming.

One object of the present invention is to define a kind of grammer of the compatibility for coding decoder (codec), improves thus the compatibility between dissimilar coding decoders.

Another object of the present invention is to define a kind of grammer for rewriteeing scalable-video-coded bit stream, thereby improves the compatibility between coding decoder.

Another object of the present invention is to limit a kind of grammer, and whether this grammer is used for expression in appropriate location stored reference primary image, thereby improves the compatibility between coding decoder.

Another object of the present invention is to define a kind of grammer, and whether in position this grammer is used for expression stored reference primary image, thereby effectively manages decoded picture buffer district (decodedpicture buffer).

Whether in position another object of the present invention is effectively to carry out the decoded picture mark by being used for the expression grammer of stored reference primary image.

Another object of the present invention is to provide a kind of coding/decoding method, thus, minimizes the problem in the vision signal decode procedure that causes because of the mistake that produces in the transmission course.

Another object of the present invention is to provide a kind of method, manages the decoded picture buffer district by the decoding scheme of the problem that produces in the decode procedure that minimizes vision signal, and wherein this problem is caused by the wrong institute that transmission course produces.

Beneficial effect

Therefore, the invention provides following effect or advantage.

At first, during the coding/decoding vision signal, by the grammer of definition for the compatibility of coding decoder, the present invention can improve the compatibility between dissimilar coding decoders.For example, be used for scalable-video-coded bit stream is transformed to AVC (advanced video encoding) thus the coded bit stream of coding decoder strengthen the syntactic structure of the compatibility between coding decoder.

The second, decoded picture buffer district (DPB) managed more effectively by the present invention; Thereby reduce the burden that applies in the decoded picture buffer district.Therefore, can improve coding/decoding speed.

The 3rd, by using the various configuration informations about the telescopic video sequence, the present invention can realize more effectively coding/decoding.

Description of drawings

The included accompanying drawing of the present invention is used for providing a further understanding of the present invention, and they are bonded to this part that has also consisted of this specification, and these accompanying drawings show embodiments of the invention, and is used from explanation principle of the present invention with specification one.

In the accompanying drawing:

Fig. 1 is the schematic block diagram according to telescopic video coding/decoding of the present invention system;

Fig. 2 is the configuration information figure that increases to the scalable sequence of scalable-video-coded bit stream according to an embodiment of the invention;

Fig. 3 is according to an embodiment of the invention for explaining storage and the use figure with reference to the various Collapsible structures of the scalable video of the process of primary image;

Fig. 4 is the flow chart of the storing process with reference to primary image according to an embodiment of the invention;

Fig. 5 be according to an embodiment of the invention for storage and mark with reference to the structure chart of the grammer of primary image;

Fig. 6 be according to an embodiment of the invention for storage and mark with reference to the structure chart of the grammer of primary image; And

Fig. 7 to Figure 12 is respectively acquisition according to an embodiment of the invention and is used for expression whether at the structure chart of the grammer of the flag information of the current NAL of buffer area storage unit.

Optimal mode

Other advantages of the present invention and feature will be illustrated in the following description, and its part can be understood from describe, and maybe can obtain by implementing the present invention.Purpose of the present invention and other advantages can realize by the structure of specifically noting in specification and claim and the accompanying drawing and obtain.

In order to realize these and other advantages, and according to as comprising and broadly described purpose of the present invention, method according to a kind of decoded video signal of the present invention comprises: when first's image and the first complete image were put corresponding to the very first time, determine whether to store this first's image; Store this first's image, being used for reference to this first's image decoding the second complete image, this second complete image is corresponding to the second time point, and this second time point is positioned at after this very first time point; Wherein, this first's image is lower than the rank of this second complete image on scalable territory in the rank on the scalable territory.

Preferably, said method comprises further whether the extraction expression stores the flag information of second portion image, and this second portion image is corresponding to the second complete image of decoding.

In order further to realize these and other advantages, and according to purpose of the present invention, method according to a kind of decoded video signal of the present invention comprises: check restriction sign information, this restriction sign information is used for restriction for the specific syntax of the compatibility of coding decoder; Based on this restriction sign information acquisition the first flag information, whether this first flag information represents stored reference primary image in buffer area; With utilize according to this first flag information is stored should be with reference to primary image this vision signal of decoding.

In order further to realize these and other advantages, and according to purpose of the present invention, device for decoded video signal according to the present invention comprises: the identifying information inspection unit, be used for checking restriction sign information, this restriction sign information is used for restriction for the specific syntax of the compatibility of coding decoder; With unit, decoded picture buffer district, be used for storing primary image based on the first flag information at buffer area, whether this first flag information represents wherein, to obtain this first flag information from the result who checks this restriction sign information by stored reference primary image in this buffer area.

Be understandable that above general description and the following detailed description all be example with indicative, and can provide further specifying claim of the present invention.

The working of an invention mode

Be elaborated referring now to the preferred embodiment of the present invention, its example represents in the accompanying drawings.

At first, consider between spatial redundancy, time redundancy, scalable redundancy, visual angle redundant to the compression coding/decoding of video signal data.The compression coding/decoding of considering scalable redundancy is one embodiment of the present of invention.But technical conceive of the present invention is applicable to redundant etc. between time redundancy, spatial redundancy, visual angle.

" coding/decoding (coding) " of indication comprises coding (encoding) and decoding (decoding) two concepts in this specification, can make an explanation neatly according to technical conceive of the present invention and technical scope.

In the bit sequence configuration of vision signal, exist and be referred to as NAL (NetworkAbstraction Layer, the layer structure of separation network abstraction layer), it is positioned between the lower-level system of the VCL (Video Code Layer, Video coding layer) that carries out the moving image encoding process and transmission and memory encoding information.Cataloged procedure is output as the VCL data, and is mapped by the NAL unit before transmission or the storage.Each NAL unit comprises the video data of compression or corresponding to the data RBSP of header (Raw Byte Sequence Payload, raw byte sequence payload: the result data of moving image compression).

The NAL unit mainly comprises NAL unit header and RBSP two parts.The NAL unit header comprises whether expression comprises the identifier (nal_unit_type) as the type of the flag information (nal_ref_idc) of the band of the reference picture of this NAL unit and expression NAL unit.Storage is through the initial data of overcompression in RBSP.And, for the lengths table that makes RBSP is shown the multiple of 8 bits, add RBSP tail bit (RBSP trailing bit) at the end of RBSP.The type of NAL unit has IDR (Instantaneous Decoding Refresh, instantaneous decoding refresh) image, SPS (SequenceParameter Set, sequence parameter set), PPS (Picture Parameter Set, picture parameter set) SEI (Supplemental Enhancement Information, supplemental enhancement information) etc.

Therefore, show as scalable video coded slice if represent the information (nal_unit_type) of the type of above-mentioned NAL unit, can improve coding/decoding efficient by increasing the various configuration informations relevant with above-mentioned scalable coding/decoding.For instance, can increase whether the current access unit of expression is the flag information of instantaneous decoding refresh (being designated hereinafter simply as IDR) access unit, the dependency identification information of representation space scalability (dependency identificationinformation), quality identification information, whether expression is used as the flag information of reference picture, priority identification information etc. with reference to primary image.In order more effectively to manage the decoded picture buffer district, can use the configuration information of various scalable coding/decodings, be described in detail below with reference to Fig. 2.

In standardization, in order to buy target product with suitable expense, stipulated for various abridged tables and other requirement of level.In this case, decoder must satisfy according to determined requirement in corresponding abridged table and the rank.Therefore, defined " abridged table " and " rank " two conceptions of species and come representative function or parameter, it is used for the size of the scope of the manageable compressed sequence of expression decoder.Profile identifier (profile_idc) can be identified bit stream based on predetermined abridged table.Profile identifier be the expression bit stream based on the sign of abridged table.For instance, H.264/AVC in, profile identifier is 66, this refers to that bit stream is based on baseline profile; Profile identifier is 77, refers to that profile identifier is 88 take main abridged table as the basis, refers to that bit stream is take extended profile as the basis.And above-mentioned profile identifier is included in sequential parameter and concentrates.

Therefore, in order to process scalable sequence, whether be abridged table for scalable sequence, if incoming bit stream is identified as the abridged table for scalable sequence, be necessary to increase grammer so that be used for the more than one additional information of scalable sequence and can be transmitted if needing the identification incoming bit stream.Here the abridged table that is used for scalable sequence, as additional aspects H.264/AVC, expression is for the treatment of the profile mode of telescopic video.

Because for traditional AVC technology, SVC is additional aspects, so, and unconditionally increase grammer and compare, it is more effective as the additional information that is used for the SVC pattern to increase grammer.For instance, when the profile identifier of AVC is expressed as abridged table for scalable sequence, if increase information about scalable sequence, then can improve coding/decoding efficient.

Sequential parameter set representations header, wherein header comprises the information in the coding that is present in whole sequence, such as abridged table, rank etc.Whole compression movement image, namely sequence should start from the sequence head place.Therefore, the sequence parameter set corresponding to header should arrive first decoder before the data that depend on this parameter set arrive.That is, sequence parameter set RBSP bears the role of the header of the result data that compresses for moving image.In case the bit stream that incoming bit stream, profile identifier preferentially identify input is based in a plurality of abridged tables which.

Below explain a plurality of embodiment of effective video signal decoding method.

Fig. 1 is the schematic block diagram according to scalable video of the present invention system.

For the optimization for various communication environments and various terminals is provided, the sequence that is provided to terminal should be diversified.If the optimization of certain terminal is provided to corresponding terminal, then represent to prepare single sequence source for the combined value of various parameters (the transmission frame number that comprises per second, resolution, the bit number of every pixel etc.).Therefore, optimization has applied burden to content supplier.

Therefore, content supplier is encoded to original series the compressed sequence data of high bit rate.When receiving the sequence of requests of being made by terminal, content supplier's decoding original series is encoded to the sequence data of the series processing ability that is suitable for terminal to it, and then this coded data is offered terminal.Because this code transforms and is accompanied by coding-decoding-cataloged procedure, so inevitably generation time delay in the process of sequence is provided.Therefore, need in addition complicated hardware device and algorithm.

Scalable video (SVC) is a kind of encoding scheme with the optimum picture quality encoded video signal, so that the partial sequence of the image sequence that produces can show as sequence by decoding.Here, partial sequence refers to the sequence that consisted of by select frame off and on by from whole sequence.Image sequence for by SVC coding utilizes spatial scalability for low bit rate, can reduce sequence size, but and also the service quality scalability reduce the picture quality of sequence.Here, the image sequence with the small screen and/or low number of pictures per second can be called as basic layer, and the sequence with relative large-screen and/or relatively high number of pictures per second can be called as be enhanced or enhancement layer.

The coded image sequence of above-mentioned scalable scheme is by only receiving and the mode of processing section sequence represents with the sequence of realization low image quality.If bit rate is lowered, then picture quality also reduce ground quite large.

In order to solve the reduction problem of picture quality, the auxiliary picture sequence of the separation of low bit rate can be provided, for example comprise the image sequence of the small screen and/or the less frame number of per second.This auxiliary sequencel can be called as basic layer, and main picture sequence can be called as be enhanced or enhancement layer.

Below explain in detail the scalable video system.

At first, the scalable video system comprises encoder 102 and decoder 110.

Encoder 102 comprises basic layer coding unit 104, enhancement layer coding unit 106 and Multiplexing Unit 108.Decoder 110 can comprise dequantisation unit 112, basic layer decoder unit 114 and enhancement layer decoder unit 116.

By the sequence signal X (n) of compression input, basic layer coding unit 104 can produce elementary bit stream.

Use the sequence signal X (n) of input and the information that basic layer coding unit 104 produces, enhancement layer coding unit 106 can produce enhancement layer bit-stream.

And, using basic layer bit stream and enhancement layer bit-stream, Multiplexing Unit 108 can produce scalable bitstream.

The scalable bitstream that produces is transferred into decoder 110 by allocated channel.By the dequantisation unit 112 of decoder 110, the scalable bitstream of transmission can be separated into enhancement layer bit-stream and basic layer bit stream.

Basic layer decoder unit 114 receives and the basic layer of decoding bit stream is output sequence signal Xb (n).

Enhancement layer decoder unit 116 receives enhancement layer bit-stream, and the signal of rebuilding with reference to basic layer decoder unit 114, and the decoding enhancement layer bit stream is output sequence signal Xe (n).Here, output sequence signal Xb (n) be have than after the low picture quality of output sequence signal Xe (n) or the sequence signal of resolution.

In the scalable video process, when specific image was transmitted by encoding enhancement layer, enhancement layer bit-stream may partly be damaged between corresponding transmission period.In this case, because the enhancement layer bit-stream of decoder 110 service failures decoding respective image, so original series and decoding sequence are different aspect picture quality.Especially, be for having other required reference picture of another image of decoding of minimum time stage if there is the image of this problem, then this problem can be further serious.

Therefore, having other image of minimum time stage needs more effectively to be managed.This will be below by coming to be explained in detail with reference to figure 3 and Fig. 4.

According to one embodiment of present invention, decoded picture buffer district (DPB) is so that can realize scalable storage or the mark of complete image and parts of images in scalable video.In this situation, the image that complete image (full picture) expression has highest quality level, and the image that parts of images (partial picture) expression has minimum quality levels.Perhaps, complete image can be defined as respectively representing relative high and low quality scale with parts of images.

For example, if quality scale is divided into 5 grades (0-4), it is 0 to 3 example that parts of images can belong to quality scale.It is 4 example that complete image can belong to quality scale.Perhaps, only having quality scale is that 0 situation is corresponding to parts of images.

Simultaneously, the very first time point place parts of images as needing to be stored before the reference picture.Therefore, be positioned at part or the complete image of very first time point the second time point afterwards in order to decode, the very first time parts of images at some place can be used as reference picture.And the very first time complete or parts of images at some place can be used as reference picture adaptively.

Fig. 2 is the figure of the configuration information of the scalable sequence that increases to scalable-video-coded bit stream according to an embodiment of the invention.

Fig. 2 shows so that the configuration information on the scalable sequence is increased the topology example of NAL unit thereon.

The NAL unit mainly comprises NAL unit header and RBSP (raw byte sequence payload: the result data of moving image compression).

The NAL unit header can comprise the information (nal_unit_type) whether this NAL unit of expression comprises the identifying information (nal_ref_idc) of the band of reference picture and represent the type of NAL unit.

And, under certain limitation, can comprise the extended area of NAL unit header.

For example, if be used for the information of expression NAL cell type or expression prefix NAL unit relevant with scalable video, then the NAL unit can comprise the extended area of NAL unit header.Especially, if nal_unit_type equal 20 or 14, NAL unit can comprise the extended area of NAL unit header.Whether is the flag information (svc_mvc_flag) of SVC bit stream according to identifying, the configuration information of (multi-view) sequence can be increased to the extended area of NAL unit header from various visual angles.

Another example is the information of expression subset sequence parameter if be used for the information of the type of expression NAL unit, and then RBSP can comprise the information about subset sequence parameter.Especially, if nal_unit_type equals 15, then RBSP can comprise the information about subset sequence parameter.In this situation, according to profile information, subset sequence parameter can comprise the extended area of sequence parameter set.For example, if profile information (profile_idc) is the abridged table relevant with scalable video, subset sequence parameter can comprise the extended area of sequence parameter set.Perhaps, according to profile information, sequence parameter set can comprise the extended area of sequence parameter set.The extended area of sequence parameter set can comprise restriction sign information, limits special grammer with the compatibility that is used to coding decoder.

Below explain in detail various configuration informations about scalable sequence, for example can be contained in the configuration information in the extended area of NAL unit header, perhaps can be contained in the configuration information in the extended area of sequence parameter set.

At first, the identifying information of representation space scalability refers to represent identify the information of the dependence of NAL unit.For example, dependence changes according to spatial resolution.Among Fig. 3, the image of Spa_Layer0 and Spa_Layer1 can have identical resolution.The image of Spa_Layer0 can comprise by the image among the Spa_Layer1 is carried out the image that down-sampling obtains.

Especially, suppose that the information of the dependence of identification NAL unit is named as dependency_id, the image among the Spa_Layer0 has the relation of dependency_id=0, and the image of Spa Layer1 then has the relation of dependency_id=1.

Can adopt various ways definition dependency identification information.Therefore, the information NAL unit that has an identification dependence of identical value can be expressed as dependence and represent (dependencyrepresentation).

Quality identification information represents the information for the quality of identification NAL unit.For example, single image can be encoded as the different image of quality.Among Fig. 3, Spa_Layer0 can be encoded as the each other different image of quality with image among the Spa_Layer1.

Especially, the information of supposing to be used for the quality of identification NAL unit is named as quality_id, image B 1, B2 ..., B10 can be set to quality_id=0.And, image Q1, Q2 ..., Q10 can be set to quality_id=1.That is, image B 1, B2 ..., B10 represents to comprise the image of lowest image quality.These are called as primary image.Image Q1, Q2 ..., Q10 can comprise image B 1, B2 ..., B10, and picture quality be better than image B 1, B2 ..., B10.Can adopt various ways definition quality identification information.For example, quality identification information can be expressed as 16 ranks.

Simultaneously, according to information and the quality identification information of identification dependence, the single layer of definable.In this case, have the information of identification dependence of identical numerical value and the NAL unit of quality identification information and can be expressed as layer expression (layer representation).

The identifying information that is used for expression time scalability refers to for other information of time stage of identifying the NAL unit.Can be in classification B picture structure the interpretation time rank.

For example, the image (B1, Q1) among the Spa_Layer0 and image (B3, Q3) can comprise identical time rank Tem_Layer0.If image (B5, Q5) is referred to image (B1, Q1) and image (B3, Q3), then image (B5, Q5) can have than image (B1, Q1) or the higher time rank Tem_Layer1 of the time rank Tem_Layer0 of image (B3, Q3).Equally, if image (B7, Q7) is referred to image (B1, Q1) and image (B5, Q5), then image (B7, Q7) can have the time rank Tem_Layer2 higher than the time rank Tem_Layer1 of image (B5, Q5).Whole NAL unit in the single access unit can comprise identical time rank.In the example of IDR access unit, time class value can be changed into 0.

Flag information is used for expression and whether is used as reference picture with reference to primary image, and this flag information represents whether whether be used as reference picture as reference picture or decoded picture with reference to primary image in inter prediction (inter-prediction) process.The NAL unit of identical layer, the flag information of NAL unit that namely comprises the information of identical identification dependence can comprise identical value.

Priority identification information represents the information for the priority of identification NAL unit.Use priority identification information that (inter-picture) extensibility between interlayer (inter-layer) extensibility or image can be provided.For example, provide various times and other sequence of space level by using priority identification information to can be the user.Therefore, the user can only just can see sequence or the view in special time and space according to different restrictive conditions.

Precedence information can adopt various ways to form according to its reference conditions.Precedence information can form at random and need not adopt with particular reference to.And precedence information can be determined by decoder.

The configuration information that comprises in the extended area of NAL unit header can comprise be used to representing whether current access unit is the flag information of IDR access unit.

Fig. 3 is according to an embodiment of the invention for explaining storage and the use figure with reference to the various scalable structures of the scalable video of primary image process.

At first, in the time scalability, can determine according to frame rate the layer of video sequence.

Please refer to Fig. 3, along upward direction, refer to that then higher time scalable layer becomes higher with the expression frame per second in every layer.

By the concept of classification B image or classification P image is applied to H.264 Video coding, can realize the time scalable video.For example, belong in the example of the image (B5, Q5) among the time rank Tem_Layer1 in prediction, image (B7, Q7, B9, Q9) belong to and have the time rank Tem_Layer2 larger than the value of time rank Tem_Layer1, so can't be used as reference picture.Yet image (B1, Q1, B3, Q3) belongs to and has other Tem_Layer0 of lower time stage, so can be used as reference picture.

Therefore, no matter whether have the decoding of the image of the layer that belongs to higher than random time layer, the image that belongs to the random time layer can be decoded independently.If the ability according to decoder is judged to be the decodable code rank, the H.264 compatible video of decodable code respective frame speed then.

Spatial scalability in the following key-drawing 3.Spa_Layer0 comprises respectively identical resolution with image among the Spa_Layer1.The image of Spa_Layer0 is by the image among the Spa_Layer1 is carried out the image that down-sampling obtains.For example, be set to dependency_id about the information of the identification dependence of NAL unit, the image among the Spa_Layer0 can be set to dependency_id and equal image among 0, the Spa_Layer1 and can be set to dependency_id and equal 1.

Below explain quality scalability.Every layer image can comprise the each other different image of quality on the spatial axes.For example, the information of supposing to be used for the quality of identification NAL unit is set to quality_id, image B 1, B2 ..., B10 can be set to quality_id and equal 0, image Q1, Q2 ..., Q10 can be set to quality_id=1.Especially, image B 1, B2 ..., the B10 image that represents to have lowest image quality.On the contrary, image Q1, Q2 ..., the picture quality that has of the corresponding image of Q10 be higher than image B 1, B2 ..., B10.Can adopt various ways definition quality identification information.For example, quality identification information can be expressed as 16 ranks.

The process of below explaining stored reference primary image according to an embodiment of the invention and using the reference primary image stored to decode.

With reference to the image shown in the figure 3, decoding order can be set as B1, Q1, B2, Q2, B3, Q3 ..., B10, Q10 (1 → 2 → 3 → 4 → ..., → 9 → 10).If current image to be decoded is B4, image B 1, Q1, B2, Q2, B3 and Q3 are prior decoded image.Image B 4 is corresponding to the image with minimum time rank and minimum quality levels.But image B 4 reference picture B2, wherein image B 2 is primary image.Therefore, image B 2 should be stored in the decoded picture buffer district.

In this situation, when this decoded image B 2 of decoding, need presentation video B2 will be stored in the decoded picture buffer district to be used for the flag information of image (for example, image B 4) later on to be encoded.For example, if current NAL unit corresponding to the reference primary image, then to represent whether to store the flag information of current NAL unit in buffer area be store_ref_base_pic_flag to definable.And may need whether will be used as for presentation video B2 the mark of primary image.Therefore, after image B 2 decodings, but decoded picture buffer district marking image B2 is as the reference primary image.Finish after these steps, when decoded picture B4, according to store_ref_base_pic_flag, image B 4 can use image B 2 conducts with reference to image, and wherein image B 2 is stored in the decoded picture buffer district, and is marked as with reference to primary image.

According to another embodiment of the present invention, the process that obtains flag information is below explained corresponding to minimum quality levels and when comprising the band of reference picture in current NAL unit, and whether this flag information is used for expression in the current NAL of buffer area storage unit.

For example, this flag information can be only for being used for the syntactic element of scalable-video-coded bit stream.Therefore, need to limit another information of this flag information to be used for the compatibility of coding decoder.Perhaps, need to limit another information of this flag information so that bitstream format can be transformed.For example, definable is used for rewriteeing the flag information of scalable-video-coded bit stream, to satisfy the compatibility of coding decoder.

For compatible with last coding decoder, for example, scalable-video-coded bit stream is passed through scalable-video-coded bit stream to be rewritten as in the AVC bit stream in the decoded situation of AVC coding decoder.Like this, restriction sign information only can limit the syntactic information that is applied to scalable-video-coded bit stream.By limiting, use simple conversion process to be the AVC bit stream with regard to convertible scalable-video-coded bit stream.For example, can be expressed as slice header retriction_flag.Restriction sign information can obtain from sequence parameter set or subset sequence parameter.Perhaps, can from the extended area of subset sequence parameter, obtain restriction sign information.

Can limit the syntactic element that only is used for the specific coding decoder.For example, current NAL unit is corresponding to minimum quality levels and comprise in the situation of band of reference picture, can use restriction sign information in slice header place restriction sign information, and wherein whether this flag information is used for expression in the current NAL of buffer area storage unit.Especially, only when slice_header_retriction_flag=0, can obtain store_ref_base_pic_flag information.If slice_header_retriction_flag=1 then can't obtain store_ref_base_pic_flag information.This has played the part of the role that the slice header that makes the scalable video bit is equal to the head of AVC bit stream, thereby can realize decoding by the AVC coding decoder.

Fig. 4 is the flow chart of the process of stored reference primary image according to an embodiment of the invention.

To use Fig. 3 at first, below describing.In the situation of the image B corresponding with present image 4 of attempting to decode, image B 4 is corresponding to the image that comprises minimum time rank and minimum quality levels.That is, image B 4 can be corresponding to by the represented primary image of basic representation (base representation).Therefore, but image B 4 reference picture B2 as primary image.For reference picture B2, need in advance image B 2 to be stored in the decoded picture buffer district.

Explain in below describing image B 2 is stored in the decoded picture buffer district to use image B 2 as the process with reference to image.

During decoded picture B2, can from the extended area of current NAL unit header, obtain quality identification information.When quality identification information represented as the described minimum of Fig. 2, the current NAL unit of image B 2 can be corresponding to primary image.Therefore, need to be according to the current NAL unit of the quality identification information check image B2 that obtains whether corresponding to primary image (S410).

Because image B 2 will be used as reference picture, this can be notified by the NAL unit header.For example, can obtain identifying information (nal_ref_idc), this identifying information (nal_ref_idc) is used for representing whether current NAL unit comprises the band of reference picture.According to this identifying information, need the current NAL unit of check image B2 whether to comprise the band (S420) of reference picture.According to this identifying information, if current NAL unit is primary image and the band that comprises reference picture, then current NAL unit can be corresponding to the reference primary image.

Therefore, if the reference picture marking process corresponding to minimum quality levels and comprise the band of reference picture, is then carried out in current NAL unit.In the image tagged process in decoded picture buffer district, can be marked as with reference to primary image extraly with reference to primary image.In this situation, can obtain whether to store this with reference to the flag information of primary image for expression.In order to obtain this flag information, then should not exist other flag informations to limit this flag information.For example, need to check for the restriction sign information (S430) of restriction for the special grammer of the compatibility of coding decoder.

Flag information can be the information that obtains from the extended area of subset sequence parameter.Especially, to suppose be used for to rewrite scalable-video-coded bit stream be the AVC bit stream take the restriction sign information of the compatibility that is used for coding decoder as slice_header_retriction_flag.Use this restriction sign information, can represent whether the special grammer that relates to sequence parameter set is present in the slice header.Based on this restriction sign information, can obtain for expression whether at the flag information (S440) of the current NAL of buffer area storage unit.

According to being used for expression whether at the flag information of the current NAL of buffer area storage unit, if the current network abstraction layer unit is stored, if and the current network abstraction layer unit is not to be the IDR image, then can carry out the labeling process of the reference picture of the basic layer of decoding.

According to being used for expression whether at the flag information of the current NAL of buffer area storage unit, if current NAL unit is stored, if and the decoded picture that comprises this storage NAL unit is marked as with reference to primary image, then can use this with reference to primary image decoded video signal (S450).For example, if be stored and be marked as with reference to primary image according to flag information image B 2, then in decoding during as the image B 4 of present image, can use image B 2 as with reference to image.

Fig. 5 be according to an embodiment of the invention for storage and mark with reference to the syntactic structure figure of primary image.

The example of the syntactic structure of the technical conceive of the flow chart of explaining in Fig. 5 presentation graphs 4.

At first, need in slice header, check that according to quality identification information whether current NAL unit is corresponding to primary image (S510).

According to being used for representing whether current NAL unit comprises the identifying information nal_ref_idc of the band of reference picture, need to check whether current NAL unit comprises the band (S520) of reference picture.

If current NAL unit is primary image and the band that comprises reference picture, then current NAL unit can be corresponding to the reference primary image.Therefore, if current NAL unit corresponding to this with reference to primary image, then carry out reference picture marking process (S530).In the image tagged process in decoded picture buffer district, this can be labeled as with reference to primary image extraly with reference to primary image.

In this situation, can obtain to represent whether to store this with reference to the flag information of primary image.In order to obtain this flag information, need to check to be used for restriction for the restriction sign information (S540) of the specific syntax of the compatibility of coding decoder.

Flag information can be the information that obtains from the extended area of subset sequence parameter.For example, be used for to rewrite scalable-video-coded bit stream and be the AVC bit stream and can be set to slice_header_retriction_flag with the restriction sign information of the compatibility that is used for coding decoder, by restriction sign information, can represent whether the special grammer that relates to sequence parameter set is present in the slice header.

According to this restriction sign information, can obtain whether to store this flag information with reference to primary image (S550) for expression.

Whether store this with reference to the flag information of primary image according to being used for expression, if this is stored with reference to primary image, if and this is not to be IDR image (S560) with reference to primary image, then can carry out the process (S570) of reference picture of the basic layer of mark decoding.

Perhaps, can check whether expression is used as the flag information whether reference picture in the inter prediction process or decoded picture are used as reference picture with reference to primary image.As the result who checks, if be used as reference picture and this is not to be IDR image (S560) with reference to primary image with reference to primary image, then can carry out the process (S570) for the reference picture of the basic layer of mark decoding.

Fig. 6 is the figure of the syntactic structure of the storage for the reference primary image according to an embodiment of the invention and mark.

During by NAL cell processing signal, another NAL unit before the current NAL unit can be used.This another NAL unit is called as " prefix NAL (prefix NAL) ".

This prefix NAL unit can be used for the information that only can be applied to SVC is sent to basic layer, and keeps the compatibility between basic layer bit stream and the AVC coding decoder.

For example, the expression of explaining among Fig. 5 whether flag information of stored reference primary image can be contained in the prefix NAL unit.Especially, according to representing whether current NAL unit comprises the identifying information nal_ref_idc of the band of reference picture, need to check whether current NAL unit comprises the band of reference picture.According to identifying information, if current NAL unit comprises the band of reference picture, then can obtain be used to whether representing the flag information of stored reference primary image.

According to whether representing the flag information of stored reference primary image, if be stored and reference picture is not to be the IDR image with reference to primary image, then can carry out the process of reference picture of the basic layer of mark decoding.

Perhaps, can check whether expression is used as the flag information whether reference picture in inter prediction (inter-prediction) process or decoded picture are used as reference picture with reference to primary image.As the result who checks, if be used as reference picture with reference to primary image, and if should be not to be the IDR image with reference to primary image, then can carry out the labeling process for basic layer reference picture of decoding.

It is according to an embodiment of the invention for obtaining expression whether at the figure of the syntactic structure of the flag information of the current NAL of buffer area storage unit that Fig. 7 to Figure 12 is respectively.

Among the embodiment shown in Figure 7, the flag information that is used for the stored reference primary image can be defined as store_base_ref_flag.This flag information can obtain (S710) from slice header.

Predetermined condition can be presented to obtain this flag information.For example, type of strip is not the example for PR, that is to say, is not the example that expression strengthens the type of strip of quality scale, and it can be called as the minimum example of quality scale.

Can check that specific syntax that whether expression limit current band is with the information of the compatibility that is used for coding decoder.For example, can check another information that can limit this flag information, thereby so that the form of bit stream can be transformed.

In addition, the definable flag information is used for rewriteeing scalable-video-coded bit stream to satisfy the compatibility of coding decoder.

As previously mentioned, if if type of strip is not by PR and is not limited by flag information, then can obtain the flag information for the stored reference primary image.

Among the embodiment shown in Figure 8, whether another flag information of definable stores the flag information of current NAL unit in buffer area to obtain expression.For example, definable represents whether to be used as with reference to primary image the flag information of reference picture.This flag information is illustrated in the process of inter prediction whether whether be used as reference picture by image for referencial use or decoded picture with reference to primary image.For the NAL unit of the information with same identification dependence, this flag information can have identical value.Can in the extended area of NAL unit header, define this flag information (S810).

Can check whether corresponding basic layer and comprise minimum quality scale (S820) of current band.

Can check whether current band is used as reference picture (S830).

Can check with reference to primary image and whether be used as reference picture (S840).If be used as reference picture with reference to primary image, then can obtain for expression whether at the flag information (S850) of the current NAL of buffer area storage unit.

After current NAL unit was stored in the buffer area, if the type of NAL unit relevant with SVC (S860) can be carried out the labeling process of decoded picture, wherein this decoded picture comprised the current NAL unit (S870) with primary image for referencial use.

Can from slice header, obtain whether in buffer area, to store under the expression certain condition flag information of current NAL unit.For example, if type of strip represents to strengthen the band of quality scale, and the first part when dividing quality layers, and if be used as reference picture (S880) with reference to primary image, can obtain to represent whether to store the flag information (S890) of current NAL unit in buffer area.

Among the embodiment shown in Figure 9, whether expression stores current NAL unit in buffer area flag information (S930) can be defined in different ways.For example, this flag information can together be defined in the extended area of NAL unit header with reference to another flag information (S910) whether primary image is used as reference picture with expression.

Among the embodiment shown in Figure 10, can obtain whether to store at buffer area for expression the flag information of current NAL unit by using another flag information.For example, definable is used for whether expression is used as reference picture with reference to primary image flag information.This flag information definable (S1010) in the extended area of NAL unit header.

Can check that current band is whether corresponding to the quality scale of basic layer and current band whether minimum (S1020).

Can check that whether current band is as reference picture (S1030).

If current band is used as reference picture, then can obtain whether to store current NAL unit in the flag information (S1040) of buffer area for expression.

If current NAL unit is stored in the buffer area and the type of current NAL unit relevant with SVC (S1050), can carry out the mark decoded picture and be the process with reference to primary image, wherein decoded picture comprises current NAL unit (S1060).

Also can from slice header, obtain expression and whether in buffer area, store the flag information of current NAL unit under certain condition.For example, if type of strip is not corresponding to the band that strengthens quality scale, the specific syntax of current band does not limit the compatibility to coding decoder, and, can obtain for expression whether at the flag information (S1080) of the current NAL of buffer area storage unit as reference picture (S1070) with reference to primary image.

Among the embodiment shown in Figure 11, use another flag information can obtain for expression whether at the flag information of the current NAL of buffer area storage unit.For example, can use expression whether to be used as the flag information of reference picture with reference to primary image.

For example, if type of strip not corresponding to the band that strengthens quality scale, and is used as reference picture with reference to primary image, then can obtain for expression whether at the flag information (a) of the current NAL of buffer area storage unit.

If be used as reference picture with reference to primary image, and without limits for the special grammer of the current band of the compatibility of coding decoder, then can obtain for expression whether at the flag information (b) of the current NAL of buffer area storage unit.

Whether whether inspection is used as with reference to primary image after the reference picture, then can obtain for expression at the flag information (c) of the current NAL of buffer area storage unit.

Among the embodiment shown in Figure 12, use another flag information can obtain to represent whether in buffer area, to store the flag information of current NAL unit.

For example, if type of strip represents to strengthen the band of quality scale,, quality layers can check whether corresponding first part (S1210) of type of strip if being divided.

Can check with reference to primary image and whether whether represent that as the quality scale of reference picture or current band first strengthens quality scale (S1230).

Therefore, can obtain to represent whether in buffer area, to store the flag information (S1250) of current NAL unit.

As previously mentioned, decoder/encoder of the present invention is provided to multimedia broadcasting, and broadcast transmitter/receiver of DMB (DMB) for example is to be used for decoded video signal, data-signal etc.The emittor/receiver of multimedia broadcasting can comprise mobile communication terminal.

A kind of application decoding/coding method of the present invention is reserved as the program that computer is carried out, and is stored in the computer readable recording medium storing program for performing.And the multi-medium data with data structure of the present invention can be stored in the computer readable recording medium storing program for performing.Computer readable recording medium storing program for performing comprises the various types of memory devices for storage computer system-readable data.Computer readable recording medium storing program for performing comprises ROM, RAM, CD-ROM, tape, floppy disk, optical disc memory apparatus etc., and comprises the equipment realized by the carrier wave transmission of internet (for example, by).And the bit stream that is generated by coding method is stored in the computer-readable medium or by wired/wireless communication network and is transmitted.

Industrial applicibility

Although the present invention is described and illustrates that clearly those skilled in the art can carry out various modifications and variation to it, and does not break away from spirit of the present invention or category with reference to its preferred embodiment.Therefore, the present invention cover appending claims and the scope that is equal in modification of the present invention and the variation of providing.

Claims

1. the method for a decoded video signal, described method comprises:

Check and be used for restriction for the restriction sign information of the specific syntax of coding decoder compatibility;

Based on described restriction sign information, obtain the primary image storaging identificating information, whether described primary image storaging identificating information represents stored reference primary image in buffer area; With

According to described primary image storaging identificating information, be stored and described when being non-instantaneous decoding refresh image with reference to primary image with reference to primary image when described, the reference picture of the basic layer of decoding is carried out mark.

2. method according to claim 1 wherein, obtains described restriction sign information from the extended area of the sequence parameter set of described vision signal.

3. method according to claim 1 wherein, obtains described primary image storaging identificating information from the slice header of described vision signal.

4. method according to claim 1, wherein, described vision signal is received with broadcast singal.

5. method according to claim 1, wherein, described vision signal is received by Digital Media.

6. device that is used for decoded video signal, described device comprises:

The identifying information inspection unit is used for checking being used for restriction for the restriction sign information of the specific syntax of coding decoder compatibility; With

The decoded picture buffer unit, according to the primary image storaging identificating information, when the reference primary image is stored and described when being non-instantaneous decoding refresh image with reference to primary image, basic layer reference picture of decoding is carried out mark, it is described with reference to primary image whether described primary image storaging identificating information represents to store in described decoded picture buffer unit

Wherein, described primary image storaging identificating information is based on described restriction sign information acquisition.