WO2015139624A1 - Method and apparatus for encoding and decoding video data
- Publication number
- WO2015139624A1 (application PCT/CN2015/074421)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information group
- information
- basic
- redundant
- group
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/65—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/65—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
- H04N19/68—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience involving the insertion of resynchronisation markers into the bitstream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/174—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/184—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/188—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a video data packet, e.g. a network abstraction layer [NAL] unit
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
Definitions
- the present invention relates to the field of coding and decoding, and in particular, to a method and apparatus for encoding and decoding video data.
- RTP Real-time Transport Protocol
- UDP User Datagram Protocol
- the video is composed of a plurality of image frames.
- UDP is an unreliable transmission protocol. During the transmission process, when the network is blocked, packet loss is likely to occur, which affects the quality of the video reconstructed by the decoder.
- each video display picture is referred to as an image
- an image obtained by progressive scanning may also be referred to as a frame, and an image obtained by interlaced scanning may be referred to as a field.
- One frame may contain one or more stripes, and may also include one or more slices, with the difference that the stripes are divided in the order of basic coding units, and the slices are divided by image regions.
- With the frame as the processing unit, the encoding process specifically includes: encoding each image frame of the video data to generate a basic frame, and at the same time generating one or more redundant frames corresponding to the basic frame; the basic frame and the redundant frames are packaged into different packets for transmission.
- The decoding end can decode as follows. It parses the header information of the current frame to obtain the attribute information of the current frame, which indicates whether the current frame is a basic frame or a redundant frame. If the current frame is a redundant frame and the basic frame corresponding to it is a normal frame (that is, no packet was lost and the data of the reconstructed frame is complete), the current redundant frame is discarded. If the current frame is a redundant frame and the basic frame corresponding to it is an abnormal frame (that is, a packet was lost and the frame cannot be decoded normally), the header information of the basic frame is copied to the header information location of the current frame and the remaining information of the current frame is parsed, so that the video is reconstructed using the redundant frame.
- The above decoding method requires the basic frame and its corresponding redundant frame to be adjacent in decoding position; otherwise the correspondence between them cannot be established. This adjacency increases the probability that the basic frame and the redundant frame are lost at the same time. In addition, the basic frame must arrive at the decoder before the redundant frame, which reduces transmission flexibility.
- the method and device for encoding and decoding video data provided by the embodiments of the present invention can reduce the probability of simultaneous loss of basic frames and redundant frames, and enhance the robustness and flexibility of transmission.
- An embodiment of the present invention provides a method for encoding video data, including:
- the basic information group includes first data information, indication information indicating whether the information group to be encoded is a basic information group or a redundant information group, and location information of the basic information group;
- the basic information group and the redundant information group are encoded into a code stream.
- An embodiment of the present invention further provides a method for decoding video data, including:
- the basic information group corresponding to the information group to be decoded is obtained, where the basic information group includes first data information, indication information indicating whether the information group to be encoded is a basic information group or a redundant information group, and location information of the basic information group;
- a redundant information group corresponding to the information group to be decoded is obtained, where the redundant information group includes second data information, location information of the redundant information group, indication information indicating whether the information group to be encoded is a basic information group or a redundant information group, and location relationship information between the redundant information group and the basic information group; the location relationship information is used to derive the location information of the basic information group;
- An embodiment of the present invention further provides an apparatus for encoding video data, including a processor, where the processor is configured to:
- the basic information group includes first data information, indication information indicating whether the information group to be encoded is a basic information group or a redundant information group, and location information of the basic information group;
- the redundant information group includes second data information, location information of the redundant information group, indication information indicating whether the information group to be encoded is a basic information group or a redundant information group, and location relationship information between the redundant information group and the basic information group;
- the basic information group and the redundant information group are encoded into a code stream.
- An embodiment of the present invention further provides an apparatus for decoding video data, including a processor, where the processor is configured to:
- the basic information group corresponding to the information group to be decoded is obtained, where the basic information group includes first data information, indication information indicating whether the information group to be encoded is a basic information group or a redundant information group, and location information of the basic information group;
- a redundant information group corresponding to the information group to be decoded is obtained, where the redundant information group includes second data information, location information of the redundant information group, indication information indicating whether the information group to be encoded is a basic information group or a redundant information group, and location relationship information between the redundant information group and the basic information group; the location relationship information is used to derive the location information of the basic information group;
- The encoding and decoding methods and apparatus of the foregoing solution allow the basic frame and the corresponding redundant frame to be kept a certain interval apart in the code stream, which reduces the probability that both are lost at the same time and enhances the robustness of transmission. In addition, the decoding end no longer imposes a requirement on the decoding order of the basic frame and the redundant frame, which improves the flexibility of transmission.
- FIG. 1 is a flowchart of a method for encoding video data according to Embodiment 1 of the present invention
- FIG. 2 is a flowchart of a method for decoding video data according to Embodiment 2 of the present invention
- FIG. 3 is a flowchart of another method for decoding video data according to Embodiment 3 of the present invention.
- FIG. 4 is a flowchart of another method for decoding video data according to Embodiment 4 of the present invention.
- FIG. 5 is a flowchart of another method for decoding video data according to Embodiment 5 of the present invention.
- FIG. 6 is a flowchart of another method for decoding video data according to Embodiment 6 of the present invention.
- FIG. 7 is a schematic structural diagram of an apparatus for encoding video data according to Embodiment 7 of the present invention.
- FIG. 8 is a schematic structural diagram of an apparatus for decoding video data according to Embodiment 8 of the present invention.
- FIG. 9 is a schematic diagram of a syntax structure of a video encoding method according to Embodiment 1 of the present invention.
- FIG. 10 is a schematic diagram of a frame structure of a video encoding method according to Embodiment 1 of the present invention.
- a method for encoding video data includes:
- Step 102: The frame to be encoded is processed according to the steps specified by a video compression coding standard, which generally include prediction, transform, quantization, entropy coding, and filtering. The video compression standard may be H.264, H.265, SVC (scalable video coding), or the like. This generates the encoded data information of a basic frame, including prediction mode information, block mode information, transform mode information, quantization parameter information, filter parameters, and residual information.
- The POC (picture order count) represents the order in which the encoded frames are displayed in the video sequence. Because the basic frame and the redundant frame correspond to the same encoded frame, their POCs must be kept distinct. For example, if each basic frame has at most one corresponding redundant frame, the POCs of the basic frames can be restricted to consecutive even numbers (0, 2, 4, 6, ...) and the POCs of the redundant frames to odd numbers. Depending on the needs of the application, the odd numbers can be consecutive (1, 3, 5, 7, ...) or non-consecutive; for example, the POC of a redundant frame may be set to the POC of the corresponding basic frame plus one.
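The even/odd POC assignment described above can be illustrated with a minimal sketch. It assumes at most one redundant frame per basic frame, as the text does; the function name `assign_pocs` is hypothetical and not part of the patent.

```python
def assign_pocs(num_frames):
    """Return (basic_poc, redundant_poc) pairs for num_frames source frames.

    Assumes at most one redundant frame per basic frame.
    """
    pairs = []
    for i in range(num_frames):
        basic_poc = 2 * i              # consecutive even numbers: 0, 2, 4, ...
        redundant_poc = basic_poc + 1  # POC of the basic frame plus one
        pairs.append((basic_poc, redundant_poc))
    return pairs


pairs = assign_pocs(4)
```

With this assignment no basic frame and redundant frame ever share a POC, which is the property the text asks for.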
- the network abstraction layer unit is composed of a network abstraction layer unit header and a data part.
- the network abstraction layer unit header contains a network abstraction layer unit type, and the network abstraction layer unit type determines the type of data included in the network abstraction layer unit.
- PREFIX_SEI_NUT (supplemental enhancement information prefix), mentioned later, is one of the network abstraction layer unit types. For example, a flag may be added to PREFIX_SEI_NUT: if the value A is written, the frame is a basic frame; if the value B is written, the frame is a redundant frame. For example,
- the frame is a redundant frame; if the network abstraction layer unit type is a supplemental enhancement information prefix and the information payload type is 136, the frame is a basic frame.
- the frame is a redundant frame; if the network abstraction layer unit type is a supplemental enhancement information prefix, the information payload type is 135, and the frame type is 1, the frame is a basic frame.
- the frame is a redundant frame; if the network abstraction layer unit type is a supplemental enhancement information suffix and the information payload type is 135, the frame is a basic frame.
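The "value A / value B" flag check can be sketched as below. The constants `FLAG_VALUE_A` and `FLAG_VALUE_B` are arbitrary stand-ins chosen for illustration, since the text leaves both values open; `classify_frame` is a hypothetical helper. The value 39 is the `nal_unit_type` of a prefix SEI NAL unit in H.265.

```python
PREFIX_SEI_NUT = 39  # nal_unit_type of a prefix SEI NAL unit in H.265

FLAG_VALUE_A = 0  # hypothetical stand-in for "value A" (basic frame)
FLAG_VALUE_B = 1  # hypothetical stand-in for "value B" (redundant frame)


def classify_frame(nal_unit_type, flag):
    """Return 'basic', 'redundant', or None when no such flag is carried."""
    if nal_unit_type != PREFIX_SEI_NUT:
        return None  # the flag only lives in the prefix SEI in this sketch
    if flag == FLAG_VALUE_A:
        return "basic"
    if flag == FLAG_VALUE_B:
        return "redundant"
    return None  # unknown flag value
```

The same structure would apply to the payload-type variants listed above, with the comparison made on the payload type instead of a dedicated flag.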
- Step 105: Determine, for the frame, the positional relationship between the frame and its corresponding redundant frame.
- the POC of the current basic frame is represented as currPicOrderCntVal
- the POC of the corresponding redundant frame is represented as rdnPicOrderCntVal
- pic_order_cnt_delta can be positive or negative; its sign represents the order of the basic frame relative to the corresponding redundant frame.
- The absolute value of pic_order_cnt_delta can range from 0 to 256. If no redundant frame is generated, the positional relationship may be set according to actual needs, for example to 1.
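As a rough illustration of the POC difference for a basic frame, the sketch below uses the relation pic_order_cnt_delta = rdnPicOrderCntVal - currPicOrderCntVal, which appears in the apparatus description of this document. The function names and the choice to raise an error outside the 0 to 256 range are assumptions made for the example.

```python
MAX_ABS_DELTA = 256  # upper bound on |pic_order_cnt_delta| stated in the text


def basic_frame_delta(curr_pic_order_cnt_val, rdn_pic_order_cnt_val):
    """POC difference signalled with a basic frame (redundant minus current)."""
    delta = rdn_pic_order_cnt_val - curr_pic_order_cnt_val
    if abs(delta) > MAX_ABS_DELTA:
        raise ValueError("pic_order_cnt_delta out of the allowed range")
    return delta


def locate_redundant_frame(curr_pic_order_cnt_val, pic_order_cnt_delta):
    """Recover the redundant frame's POC from the basic frame's POC and delta."""
    return curr_pic_order_cnt_val + pic_order_cnt_delta
```

A positive delta here means the redundant frame has the larger POC, and a negative delta means the smaller one, which matches the statement that the sign carries the ordering.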
- GOP relationship: it is determined whether the basic frame and the corresponding redundant frame belong to the same GOP; if they do not, the number of GOPs separating them is determined.
- Example 1: gop_flag indicates whether the basic frame and the corresponding redundant frame belong to the same GOP; gop_flag is 0 if they do and 1 otherwise. gop_circle is the number of GOPs separating the basic frame and the corresponding redundant frame, and can be positive or negative. The actual position difference between the basic frame and the corresponding redundant frame is then pic_order_cnt_delta + gop_circle × gop_flag.
- Example 2: gop_circle is the number of GOPs separating the basic frame and the corresponding redundant frame, and is an integer. The actual position difference between the basic frame and the corresponding redundant frame is then pic_order_cnt_delta + gop_circle.
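The two position-difference formulas can be written out directly. This sketch takes the expressions literally as stated in Example 1 and Example 2; the function names are hypothetical.

```python
def position_difference_example1(pic_order_cnt_delta, gop_flag, gop_circle):
    """Example 1: gop_flag is 0 for the same GOP and 1 for different GOPs."""
    if gop_flag not in (0, 1):
        raise ValueError("gop_flag must be 0 or 1")
    return pic_order_cnt_delta + gop_circle * gop_flag


def position_difference_example2(pic_order_cnt_delta, gop_circle):
    """Example 2: gop_circle alone carries the GOP separation (0 if none)."""
    return pic_order_cnt_delta + gop_circle
```

In Example 1 the gop_circle term vanishes whenever gop_flag is 0, so the two forms agree when Example 2 uses gop_circle = 0 for frames in the same GOP.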
- The display order of the video frames (i.e., the POC) may be used as the representation of the position.
- The actual coding order, which characterizes the order in which each frame of data is processed, may also be used as the representation of the position.
- The writing order, in which the encoded data is written in binary form into the code stream, may also be used; the representation of the position is not limited.
- The frame to be encoded is processed according to the steps specified by a video compression coding standard, which generally include prediction, transform, quantization, entropy coding, and filtering. The video compression standard may be H.264, H.265, or SVC, but is generally the same standard used to compress the corresponding basic frame. This generates the encoded data information of the redundant frame, including prediction mode information, block mode information, transform mode information, quantization parameter information, filtering parameters, and residual information. This information may be the same as or different from the information generated for the corresponding basic frame.
- Step 107: Determine the POC for the frame; the method for determining the POC is consistent with step 103.
- Generate a flag identifying the frame as a redundant frame. For example, a flag bit may be added to PREFIX_SEI_NUT: if the value A is written, the frame is a basic frame; if the value B is written, the frame is a redundant frame. The method for determining the redundant frame flag is consistent with step 104.
- pic_order_cnt_delta = currPicOrderCntVal - rdnPicOrderCntVal.
- pic_order_cnt_delta can be positive or negative; its sign represents the order of the redundant frame relative to the corresponding basic frame.
- The absolute value of pic_order_cnt_delta can range from 0 to 256. If no basic frame is generated, the positional relationship may be set according to actual needs, for example to 1.
- the GOP relationship between the redundant frame and the corresponding basic frame is also determined.
- prm_poc_reset_flag indicates whether the basic frame is an instantaneous refresh frame, that is, whether the POC is reset when a frame of this type is encountered.
- Step 111: Encode the encoded data information, POC information, and flag information of the basic frame, and the encoded data information, POC information, and flag information of the redundant frame, into the code stream according to the coding order; the relative order of the basic frame and the redundant frame is not limited.
- A typical syntax structure of the first embodiment is included.
- The encoding scheme allows the basic frame and the corresponding redundant frame to be kept a certain interval apart in the code stream, which reduces the probability that both are lost at the same time and enhances the robustness of transmission. In addition, the decoding end no longer imposes a requirement on the order in which basic frames and redundant frames are decoded, which increases the flexibility of transmission.
- the method for decoding video data provided by this embodiment corresponds to the above method for encoding video data. As shown in Figure 2, it includes:
- the decoding order of the basic frame is before the corresponding redundant frame, the basic frame is not missing, and the current frame to be decoded is the basic frame.
- the nal_unit_type may be parsed. If the nal_unit_type is PREFIX_SEI_NUT, the payloadType is parsed. If the payloadType is the value A, the frame belongs to the basic frame, and the specific example corresponds to the step 104.
- Step 203: Optionally, according to the code stream information, parse the positional relationship between the frame and the redundant frame corresponding to the frame. In the special case where the basic frame and the corresponding redundant frame are allowed to be in different GOPs, the GOP relationship between them is also determined in order to describe the positional relationship; this corresponds to step 105. The information obtained in this step can be used as additional information for special decoding requirements.
- Step 205: Parse the encoded data information of the frame according to a video compression coding standard; this corresponds to step 102.
- the method for decoding video data provided in this embodiment corresponds to the method for encoding video data in the first embodiment. As shown in Figure 3, it includes:
- the decoding order of the basic frame is before the corresponding redundant frame, the basic frame is not missing, and the current frame to be decoded is a redundant frame.
- nal_unit_type is parsed. If the nal_unit_type is PREFIX_SEI_NUT, the payloadType is parsed. If the payloadType is the value B, the frame belongs to the redundant frame. The specific example corresponds to step 108.
- Step 303: Parse the positional relationship between the frame and the basic frame corresponding to the frame; this corresponds to step 109.
- Step 304: Check whether a basic frame has already been generated at the position obtained in step 303.
- the method for decoding video data provided in this embodiment corresponds to the method for encoding video data in the first embodiment. As shown in Figure 4, it includes:
- the decoding order of the basic frame is before the corresponding redundant frame, the basic frame is missing, and the current frame to be decoded is a redundant frame.
- the nal_unit_type may be parsed. If the nal_unit_type is PREFIX_SEI_NUT, the payloadType is parsed. If the payloadType is the value B, the frame belongs to the redundant frame, and the specific example corresponds to step 108.
- Step 403: Parse the positional relationship between the frame and the basic frame corresponding to the frame.
- In the special case where the redundant frame and the corresponding basic frame are allowed to be located in different GOPs, the GOP relationship between them is also determined in order to describe the positional relationship; this corresponds to step 109. This step determines the location of the basic frame corresponding to the frame.
- Step 404: Check whether a basic frame has already been generated at the position obtained in step 403.
- The POC is used as the POC of the decoded frame.
- Step 408: Reconstruct the decoded frame according to the information obtained by parsing the code stream of the frame.
- the decoded frame is outputted in the order specified by the POC.
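The handling of a redundant frame whose basic frame precedes it in decoding order can be sketched as a single decision. Two assumptions are made here and are not taken from the steps above: the delta carried with the redundant frame is treated as the redundant frame's POC minus the basic frame's POC, and a redundant frame is discarded when its basic frame was decoded normally (as in the background description). The name `handle_redundant_frame` is hypothetical.

```python
def handle_redundant_frame(redundant_poc, pic_order_cnt_delta, decoded_basic_pocs):
    """Decide what to do with a redundant frame whose basic frame precedes it.

    Assumption: pic_order_cnt_delta is the redundant frame's POC minus the
    POC of its basic frame; the sign convention is illustrative only.
    """
    basic_poc = redundant_poc - pic_order_cnt_delta
    if basic_poc in decoded_basic_pocs:
        # Basic frame was decoded normally: the redundant copy is not needed.
        return "discard"
    # Basic frame is missing: reconstruct the picture from the redundant frame.
    return "decode"
```

Because the basic frame is located through the signalled delta rather than by adjacency, the two frames can sit any allowed distance apart in the code stream.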
- the method for decoding video data provided in this embodiment corresponds to the method for encoding video data in the first embodiment. As shown in Figure 5, it includes:
- the decoding order of the redundant frame is before the corresponding basic frame, and the current frame to be decoded is a redundant frame.
- the nal_unit_type may be parsed. If the nal_unit_type is PREFIX_SEI_NUT, the payloadType is parsed. If the payloadType is the value B, the frame belongs to the redundant frame, and the specific example corresponds to step 108.
- Step 503: Parse the positional relationship between the frame and the basic frame corresponding to the frame.
- In the special case where the redundant frame and the corresponding basic frame are allowed to be located in different GOPs, the GOP relationship between them is also determined in order to describe the positional relationship; this corresponds to step 109. This step determines the location of the basic frame corresponding to the frame.
- Step 504: Check whether a basic frame has already been generated at the position obtained in step 503.
- Step 505: If the basic frame has not been generated, parse the attribute information of the basic frame corresponding to the frame; this corresponds to step 110. This attribute information is used as the attribute information of the decoded frame.
- Step 506: Parse the POC of the frame; this corresponds to step 107.
- The POC is used as the POC of the decoded frame.
- Step 507: Parse the encoded data information of the frame according to the video compression coding standard; this corresponds to step 106.
- Step 508: Reconstruct the decoded frame according to the information obtained by parsing the code stream of the frame.
- the decoded frame is stored in the order specified by the POC.
- the decoded frame is output in the order specified by the POC.
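Storing and outputting decoded frames in POC order, independent of the order in which they were decoded, can be sketched as follows. The `(poc, frame)` pair layout and the function name are assumptions made for the example.

```python
def output_in_poc_order(decoded_frames):
    """Return (poc, frame) pairs sorted by POC.

    decoded_frames is an iterable of (poc, frame) pairs in decoding order,
    which may differ from display order.
    """
    return sorted(decoded_frames, key=lambda item: item[0])


# A frame reconstructed from a redundant frame (POC 1) arrives between
# basic frames but is still emitted at its display position.
ordered = output_in_poc_order([(4, "c"), (1, "r"), (0, "a")])
```

Sorting on the POC alone keeps the sketch independent of how the frame payload is represented.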
- the method for decoding video data provided by this embodiment corresponds to the above method for encoding video data. As shown in Figure 6, it includes:
- the decoding order of the redundant frame is before the corresponding basic frame, and the current frame to be decoded is the basic frame.
- the nal_unit_type may be parsed. If the nal_unit_type is PREFIX_SEI_NUT, the payloadType is parsed. If the payloadType is the value A, the frame belongs to the basic frame, and the specific example corresponds to the step 104.
- Step 603: Optionally, according to the code stream information, parse the positional relationship between the frame and the redundant frame corresponding to the frame. In the special case where the basic frame and the corresponding redundant frame are allowed to be in different GOPs, the GOP relationship between them is also determined in order to describe the positional relationship; this corresponds to step 105. The information obtained in this step can be used as additional information for special decoding requirements.
- Step 605: Parse the encoded data information of the frame according to a video compression coding standard; this corresponds to step 102.
- Step 606: Reconstruct the decoded frame according to the information obtained by parsing the code stream of the frame.
- Step 607: Output the decoded frame in the order specified by the POC.
- Because the code stream carries the positional relationship information between the basic frame and the corresponding redundant frame, the decoding methods provided in Embodiments 2 to 6 allow the basic frame and the corresponding redundant frame to be kept a certain interval apart in the code stream. This reduces the probability that both are lost at the same time and enhances the robustness of transmission. In addition, the decoding end imposes no requirement on the order in which the basic frame and the redundant frame are decoded, which increases the flexibility of transmission.
- a device 1110 for encoding video data may specifically include a general-purpose processor (such as a CPU) or a dedicated processor (such as a GPU).
- the apparatus 1110 includes a processor 1111, a transmitting circuit 1114, a memory 1112, and a bus 1113.
- the transmitting circuit may be configured to send the encoded code stream to the outside, the memory is used to store the data obtained during the processing, and the bus is used to connect the devices to carry the communication.
- the processor 1111 may be configured to:
- process the frame to be encoded according to the steps specified by a video compression coding standard, which generally include prediction, transform, quantization, entropy coding, and filtering; the video compression standard may be H.264, H.265, SVC, or the like.
- the encoded data information of the basic frame is generated, including prediction mode information, block mode information, transform mode information, quantization parameter information, filtering parameters, residual information, and the like.
- The POC represents the order in which the encoded frames are displayed in the video sequence. Because the basic frame and the redundant frame correspond to the same encoded frame, their POCs must be kept distinct. For example, if each basic frame has at most one corresponding redundant frame, the POCs of the basic frames can be restricted to consecutive even numbers and the POCs of the redundant frames to odd numbers. Specifically, the POC of a redundant frame may be set to the POC of the corresponding basic frame plus one.
- a flag bit may be added to PREFIX_SEI_NUT. If the value A is written, the frame is a basic frame, and if the value B is written, the frame is a redundant frame.
- determine the positional relationship between the frame and the redundant frame corresponding to the frame.
- the POC of the current basic frame is represented as currPicOrderCntVal
- the POC of the corresponding redundant frame is represented as rdnPicOrderCntVal
- the POC difference between the two frames is represented as pic_order_cnt_delta.
- pic_order_cnt_delta = rdnPicOrderCntVal - currPicOrderCntVal
- pic_order_cnt_delta can be a positive value or a negative value, representing the order relationship between the basic frame and the corresponding redundant frame.
- its absolute value may range from 0 to 256. If the redundant frame has not yet been generated, the positional relationship may be set to a preset value according to actual needs, for example 1.
- the GOP relationship between the basic frame and the corresponding redundant frame is also determined.
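As a small worked illustration of the difference defined above, the following sketch computes pic_order_cnt_delta from the two POC values and checks the stated magnitude range. The function name `poc_delta`, the `None` convention for a redundant frame that has not been generated, and the decision to raise an error outside the 0 to 256 range are assumptions for illustration; the default of 1 follows the example value given in the text.

```python
MAX_ABS_DELTA = 256  # the text says the absolute value can be 0 to 256


def poc_delta(curr_pic_order_cnt_val, rdn_pic_order_cnt_val=None, default=1):
    """Return pic_order_cnt_delta for a basic frame.

    If the redundant frame has not been generated yet (None), fall back
    to a preset value, which the text suggests may be 1.
    """
    if rdn_pic_order_cnt_val is None:
        return default
    delta = rdn_pic_order_cnt_val - curr_pic_order_cnt_val
    if abs(delta) > MAX_ABS_DELTA:
        raise ValueError("pic_order_cnt_delta magnitude exceeds 256")
    return delta


if __name__ == "__main__":
    print(poc_delta(4, 5))   # redundant frame follows the basic frame
    print(poc_delta(8, 3))   # redundant frame precedes the basic frame
    print(poc_delta(8))      # redundant frame not generated yet
```

The sign of the result carries the order relationship, and its magnitude carries the distance between the two frames.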
- process the to-be-encoded frame further according to the steps specified by the video compression coding standard, which generally include prediction, transform, quantization, entropy coding, filtering, and the like.
- the video compression standard may be H.264, H.265, SVC, or the like, but is generally the same as the standard used to compress the corresponding basic frame.
- generate the encoded data information of the redundant frame, including prediction mode information, block mode information, transform mode information, quantization parameter information, filtering parameters, residual information, and the like, which may be the same as or different from the information generated for the corresponding basic frame.
- determine the POC of the frame; the method of determining the POC is consistent with step 103.
- a flag bit may be added in the PREFIX_SEI_NUT. If the value A is written, the frame is a basic frame; if the value B is written, the frame is a redundant frame. The redundant frame flag is determined in a manner consistent with step 104.
- pic_order_cnt_delta can be a positive value or a negative value, representing the order relationship between the redundant frame and the corresponding basic frame.
- its absolute value may range from 0 to 256. If the basic frame has not yet been generated, the positional relationship may be set to a preset value according to actual needs, for example 1.
- the GOP relationship between the redundant frame and the corresponding basic frame is also determined.
- prm_poc_reset_flag indicates whether the basic frame is an instant refresh frame, that is, a frame at which the POC is reset.
- encode the encoded data information, POC information, and indication information of the basic frame, and the encoded data information, POC information, and indication information of the redundant frame, into the code stream in coding order; the order in which the basic frame and the redundant frame are encoded is not limited.
- the device for encoding video data allows a basic frame and the corresponding redundant frame to be kept a certain interval apart in the code stream, thereby reducing the possibility that the basic frame and the corresponding redundant frame are lost at the same time.
- the robustness of the transmission is enhanced, and the decoding end imposes no requirement on the order in which the basic frame and the redundant frame are decoded, which improves transmission flexibility.
- an apparatus 1000 for decoding video data may specifically include a general-purpose processor (such as a CPU) or a dedicated processor (such as a GPU) for performing the method of decoding video data shown in FIG. 2. The apparatus 1000 includes a processor 1001, a receiving circuit 1004, a memory 1002, and a bus 1003.
- the receiving circuit may be configured to receive the encoded code stream, the memory stores the data obtained during processing, and the bus connects the components and carries communication between them; the device is configured to perform the decoding of the video data shown in FIG.
- the processor 1001 can be configured to:
- nal_unit_type can be parsed. If nal_unit_type is PREFIX_SEI_NUT, payloadType is parsed. If payloadType is the value A, the frame is a basic frame.
- determine, according to the code stream information, the positional relationship between the frame and the redundant frame corresponding to the frame. In the special case where the basic frame and the corresponding redundant frame are allowed to be located in different GOPs, the GOP relationship between the basic frame and the corresponding redundant frame is also determined.
- An apparatus for decoding video data includes a processor, which is configured to perform the method for decoding video data, as shown in FIG.
- nal_unit_type can be parsed. If nal_unit_type is PREFIX_SEI_NUT, payloadType is parsed. If payloadType is the value B, the frame is a redundant frame.
- An apparatus for decoding video data includes a processor, which is configured to perform the method for decoding video data shown in FIG. 4, where the processor is specifically configured to:
- nal_unit_type can be parsed. If nal_unit_type is PREFIX_SEI_NUT, payloadType is parsed. If payloadType is the value B, the frame is a redundant frame.
- if the basic frame has not been generated, determine attribute information of the basic frame corresponding to the frame.
- the second processing unit 1008 is configured to reconstruct the decoded frame according to the information obtained by parsing the frame code stream.
- An apparatus for decoding video data includes a processor for performing the method for decoding video data shown in FIG. 5, where the processor is specifically configured to:
- nal_unit_type can be parsed. If nal_unit_type is PREFIX_SEI_NUT, payloadType is parsed. If payloadType is the value B, the frame is a redundant frame.
- if the basic frame has not been generated, determine attribute information of the basic frame corresponding to the frame.
- An apparatus for decoding video data includes a processor, which is configured to perform the method for decoding video data shown in FIG. 6.
- the processor is specifically configured to:
- nal_unit_type can be parsed. If nal_unit_type is PREFIX_SEI_NUT, payloadType is parsed. If payloadType is the value A, the frame is a basic frame.
- the third determining unit 1203 is configured to determine the positional relationship between the frame and the redundant frame corresponding to the frame. In the special case where the basic frame and the corresponding redundant frame are allowed to be located in different GOPs, the GOP relationship between the basic frame and the corresponding redundant frame is also determined in order to describe the positional relationship.
- the decoding apparatus provided in Embodiments 2 to 6 of the present invention can decode the positional relationship information between the basic frame and the corresponding redundant frame, which allows the basic frame and the corresponding redundant frame to be kept a certain interval apart in the code stream. This interval reduces the possibility of losing the basic frame and the corresponding redundant frame at the same time and enhances the robustness of the transmission; in addition, the decoding end no longer imposes requirements on the order in which the basic frame and the redundant frame are decoded, which improves the flexibility of transmission.
- the step of generating the basic information group corresponding to the to-be-encoded information group and the step of generating the redundant information group corresponding to the to-be-encoded information group are not bound to a particular order; the embodiment of the present invention does not require that the step of generating the basic information group be performed earlier than the step of generating the redundant information group.
- the embodiment of the present invention does not limit the sequence of obtaining the basic information group corresponding to the to-be-decoded information group and parsing the redundant information group corresponding to the to-be-decoded information group.
- the disclosed system, apparatus, and method may be implemented in other manners.
- the device embodiments described above are merely illustrative.
- the division into units is only a division by logical function; in an actual implementation there may be other ways of dividing. For example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
- the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
- each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may be physically included separately, or two or more units may be integrated into one unit.
- the above integrated unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
- the above-described integrated unit implemented in the form of a software functional unit can be stored in a computer readable storage medium.
- the software functional units described above are stored in a storage medium and include instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform portions of the steps of the methods described in various embodiments of the present invention.
- the foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a ROM (Read-Only Memory), a RAM (Random Access Memory), a magnetic disk, or an optical disc.
Abstract
Description
if nal_unit_type is PREFIX_SEI_NUT |
...... |
else if payloadType is 135 |
redundant_pic_info(payloadSize) |
else if payloadType is 136 |
primary_pic_info(payloadSize) |
if nal_unit_type is PREFIX_SEI_NUT |
...... |
else if payloadType is 135 |
if the frame type is 0 |
redundant_pic_info(payloadSize) |
else |
primary_pic_info(payloadSize) |
if nal_unit_type is PREFIX_SEI_NUT |
...... |
else if payloadType is 135 |
redundant_pic_info(payloadSize) |
else if nal_unit_type is SUFFIX_SEI_NUT |
...... |
else if payloadType is 135 |
primary_pic_info(payloadSize) |
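The first dispatch table above (payload type 135 for redundant-frame information, 136 for basic-frame information, both carried in a prefix SEI NAL unit) can be sketched as follows. This is only an illustration: the function name `classify_sei` is hypothetical, the numeric NAL unit type values 39 and 40 are the HEVC values for PREFIX_SEI_NUT and SUFFIX_SEI_NUT and are assumed here because the tables use HEVC naming, and the elided "......" branches are not modelled.

```python
PREFIX_SEI_NUT = 39  # HEVC nal_unit_type for a prefix SEI message (assumed codec)
SUFFIX_SEI_NUT = 40  # HEVC nal_unit_type for a suffix SEI message (unused here)

REDUNDANT_PAYLOAD_TYPE = 135  # from the first table
PRIMARY_PAYLOAD_TYPE = 136    # from the first table


def classify_sei(nal_unit_type, payload_type):
    """Return the name of the syntax structure to parse, or None.

    Only the two branches shown in the first table are handled; every
    other combination returns None.
    """
    if nal_unit_type != PREFIX_SEI_NUT:
        return None
    if payload_type == REDUNDANT_PAYLOAD_TYPE:
        return "redundant_pic_info"
    if payload_type == PRIMARY_PAYLOAD_TYPE:
        return "primary_pic_info"
    return None


if __name__ == "__main__":
    print(classify_sei(PREFIX_SEI_NUT, 135))
    print(classify_sei(PREFIX_SEI_NUT, 136))
```

The second and third tables differ only in how the two cases are told apart: by a frame type field under a single payload type, or by prefix versus suffix SEI.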
primary_pic_info(payloadSize){ |
pic_order_cnt_delta |
} |
redundant_pic_info(payloadSize){ |
pic_order_cnt_delta |
} |
redundant_pic_info(payloadSize){ |
prm_poc_reset_flag |
} |
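A decoder that has parsed one of the syntax structures above could locate the counterpart frame from pic_order_cnt_delta. The sketch below is written under loudly stated assumptions: the class names, the merging of the two redundant_pic_info variants into one class, and the sign convention (counterpart POC equals current POC plus delta for both structures) are illustrative choices, not something the tables specify. How prm_poc_reset_flag affects the derivation is not stated in the source, so the flag is only carried along.

```python
from dataclasses import dataclass


@dataclass
class PrimaryPicInfo:
    """Mirrors primary_pic_info(payloadSize)."""
    pic_order_cnt_delta: int


@dataclass
class RedundantPicInfo:
    """Mirrors redundant_pic_info(payloadSize); both shown fields merged (assumption)."""
    pic_order_cnt_delta: int
    prm_poc_reset_flag: bool = False


def counterpart_poc(current_poc, info):
    """Return the POC of the corresponding frame.

    Assumed convention: delta = counterpart POC - current POC.
    """
    return current_poc + info.pic_order_cnt_delta


if __name__ == "__main__":
    # A basic frame with POC 4 whose redundant frame is one position later.
    print(counterpart_poc(4, PrimaryPicInfo(pic_order_cnt_delta=1)))
    # The redundant frame with POC 5 pointing back at its basic frame.
    print(counterpart_poc(5, RedundantPicInfo(pic_order_cnt_delta=-1)))
```

With this convention a decoder that receives only the redundant frame can still work out where the basic frame would have been.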
Claims (60)
- 一种视频数据的编码方法,其特征在于,包括:确定待编码视频序列中的一待编码信息组;生成与所述待编码信息组对应的基本信息组,基本信息组包括第一数据信息,表示所述待编码信息组是基本信息组还是冗余信息组的指示信息和所述基本信息组的位置信息;生成与所述待编码信息组对应的冗余信息组,冗余信息组包含第二数据信息,所述冗余信息组的位置信息,表示所述待编码信息组是基本信息组还是冗余信息组的指示信息和所述冗余信息组与所述基本信息组的位置关系信息;将所述基本信息组和所述冗余信息组编入码流。
- 根据权利要求1所述的方法,其特征在于,所述基本信息组的位置信息,包括:所述基本信息组在码流中的位置,或者在编码过程中的顺序,或者图像显示顺序;所述冗余信息组的位置信息,包括:所述冗余信息组在码流中的位置,或者在编码过程中的顺序,或者图像显示顺序。
- 根据权利要求2所述的方法,其特征在于,所述位置关系信息包括:所述冗余信息组与所述基本信息组的位置差的绝对值和先后顺序,所述位置差的绝对值和先后顺序包括,所述基本信息组或者所述冗余信息组在码流中的位置差的绝对值和先后顺序,或者在编码过程中的顺序差的绝对值和先后顺序,或者图像显示顺序差的绝对值和先后顺序。
- 根据权利要求3所述的方法,其特征在于,所述位置关系信息还包括:表示所述冗余信息组的位置信息与所述基本信息组的位置信息是否属于同一个图片组的标示信息。
- 根据权利要求4所述的方法,其特征在于,当所述冗余信息组的位置信息与所述基本信息组的位置信息不属于同一个图片组时,所述位置关系信息还包括所述冗余信息组与所述基本信息组间隔的图片组的个数。
- 根据权利要求3所述的方法,其特征在于,所述位置关系信息还包括:表示所述冗余信息组与所述基本信息组间隔的图片组的个数。
- 根据权利要求1-6任一项所述的方法,其特征在于,所述生成与所述待编码信息组对应的冗余信息组,冗余信息组包含第二数据信息,所述冗 余信息组的位置信息,表示所述待编码信息组是基本信息组还是冗余信息组的指示信息和所述冗余信息组与所述基本信息组的位置关系信息,包括:确定所述基本信息组是否已经生成;如果所述基本信息组未生成,则设定一预设位置关系信息作为所述冗余信息组与所述基本信息组的位置关系信息,并生成所述第二数据信息,所述冗余信息组的位置信息;如果所述基本信息组已经生成,则计算所述冗余信息组与所述基本信息组的所述位置关系信息作为所述冗余信息组与所述基本信息组的位置关系信息,并生成所述第二数据信息,所述冗余信息组的位置信息。
- 根据权利要求1-6任一项所述的方法,其特征在于,所述生成与所述待编码信息组对应的基本信息组,基本信息组包括第一数据信息,表示所述待编码信息组是基本信息组还是冗余信息组的指示信息和所述基本信息组的位置信息,包括:确定所述冗余信息组是否已经生成;如果所述冗余信息组未生成,则设定一预设基本信息组与所述冗余信息组所述位置关系信息作为所述冗余信息组与所述基本信息组的位置关系信息,并生成所述第一数据信息,所述基本信息组的位置信息;如果所述冗余信息组已经生成,则计算所述冗余信息组与所述基本信息组的所述位置关系信息作为所述冗余信息组与所述基本信息组的位置关系信息,并生成所述第一数据信息,所述基本信息组的位置信息。
- 根据权利要求2所述的方法,其特征在于,如果所述所述位置信息为图像显示顺序,在所述生成与所述待编码信息组对应的基本信息组和所述生成与所述待编码信息组对应的冗余信息组时,对所述基本信息组与所述冗余信息组的图像显示顺序按照预设规则进行设置,以使得所述基本信息组和所述冗余信息组的图像显示顺序不相同。
- 根据权利要求9所述的方法,其特征在于,所述对所述基本信息组与所述冗余信息组的图像显示顺序按照预设规则进行设置,以使得所述基本信息组和所述冗余信息组的图像显示顺序不相同,包括:将所述基本信息组的图像显示顺序设置为连续偶数,所述冗余信息组的图像显示顺序设置为所对应的基本信息组的图像显示顺序加1。
- 根据权利要求1-10任一项所述的方法,其特征在于,所述基本信息组还包括:所述冗余信息组与所述基本信息组的位置关系信息。
- 根据权利要求1-11任一项所述的方法,其特征在于,所述冗余信息组还包括:所述冗余信息组所对应的所述基本信息组的属性信息。
- 根据权利要求12所述的方法,其特征在于,所述基本信息组的属性信息,包括以下信息中的至少一种:表征所述基本信息组是否属于即时刷新帧的信息,表征所述基本信息组是否可以被参考的信息,表征所述基本信息组是否在解码端显示的信息,表征所述基本信息组是否可以被丢弃的信息。
- 根据权利要求1-13任一项所述的方法,其特征在于,所述待编码信息组,包括:图像、帧、条带、片,基本编码单元的集合的任意一种或多种。
- 根据权利要求1-14任一项所述的方法,其特征在于,所述位置关系信息位于补充增强信息,网络抽象层单元头,条带头,帧头,片头,图像参数集至少一者之中。
- 根据权利要求12或13所述的方法,其特征在于,所述表示所述待编码信息组是基本信息组还是冗余信息组的指示信息和所述冗余信息组所对应的所述基本信息组的属性信息位于补充增强信息,网络抽象层单元头,条带头,帧头,片头,图像参数集至少一者之中。
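- A method for decoding video data, comprising: obtaining a to-be-decoded code stream and determining a to-be-decoded information group; parsing to obtain a basic information group corresponding to the to-be-decoded information group, wherein the basic information group comprises first data information, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and position information of the basic information group; parsing to obtain a redundant information group corresponding to the to-be-decoded information group, wherein the redundant information group comprises second data information, position information of the redundant information group, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and positional relationship information between the redundant information group and the basic information group, the positional relationship information between the redundant information group and the basic information group being used to derive the position information of the basic information group; and obtaining a decoded information group based on the basic information group obtained by parsing or the redundant information group obtained by parsing, the decoded information group being used for reconstruction of the encoded video sequence.
- The decoding method according to claim 17, wherein the position information of the basic information group comprises: a position of the basic information group in the code stream, an order in the encoding process, or a picture display order; and the position information of the redundant information group comprises: a position of the redundant information group in the code stream, an order in the encoding process, or a picture display order.
- The decoding method according to claim 18, wherein the positional relationship information comprises: an absolute value and a sequential order of a position difference between the redundant information group and the basic information group, wherein the absolute value and the sequential order of the position difference comprise: an absolute value and a sequential order of a position difference, in the code stream, of the basic information group or the redundant information group, or an absolute value and a sequential order of an order difference in the encoding process, or an absolute value and a sequential order of a picture display order difference.
- The decoding method according to claim 19, wherein the positional relationship information further comprises: flag information indicating whether the position information of the redundant information group and the position information of the basic information group belong to a same group of pictures.
- The decoding method according to claim 20, wherein when the position information of the redundant information group and the position information of the basic information group do not belong to a same group of pictures, the positional relationship information further comprises a quantity of groups of pictures separating the redundant information group from the basic information group.
- The decoding method according to claim 19, wherein the positional relationship information further comprises: information indicating a quantity of groups of pictures separating the redundant information group from the basic information group.
- The decoding method according to any one of claims 17 to 22, wherein the parsing to obtain a redundant information group corresponding to the to-be-decoded information group, the redundant information group comprising second data information, position information of the redundant information group, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and positional relationship information between the redundant information group and the basic information group, the positional relationship information being used to derive the position information of the basic information group, comprises: determining whether the basic information group has been correctly parsed; if the basic information group has been correctly parsed, stopping parsing of the to-be-decoded information group without further parsing to obtain the redundant information group corresponding to the to-be-decoded information group, and obtaining the decoded information group based on the basic information group obtained by parsing, the decoded information group being used for reconstruction of the encoded video sequence; and if the basic information group has not been correctly parsed, continuing to parse to obtain the redundant information group corresponding to the to-be-decoded information group, the redundant information group comprising second data information, position information of the redundant information group, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and positional relationship information between the redundant information group and the basic information group, the positional relationship information being used to derive the position information of the basic information group, and obtaining the decoded information group based on the redundant information group obtained by parsing, the decoded information group being used for reconstruction of the encoded video sequence.
- The decoding method according to claim 18, wherein if the position information is a picture display order, during the generating of the basic information group corresponding to the to-be-decoded information group and the generating of the redundant information group corresponding to the to-be-decoded information group, the picture display orders of the basic information group and the redundant information group are determined according to the picture display orders of the basic information group and the redundant information group that were set according to a preset rule, the picture display order of the basic information group being different from that of the redundant information group.
- The decoding method according to claim 24, wherein the picture display order of the basic information group is set to consecutive even numbers, and the picture display order of the redundant information group is set to the picture display order of the corresponding basic information group plus 1.
- The decoding method according to any one of claims 17 to 25, wherein the redundant information group further comprises: attribute information of the basic information group corresponding to the redundant information group.
- The decoding method according to claim 26, wherein the attribute information of the basic information group comprises at least one of the following: information indicating whether the basic information group belongs to an instant refresh frame, information indicating whether the basic information group can be referenced, information indicating whether the basic information group is displayed at the decoding end, and information indicating whether the basic information group can be discarded.
- The decoding method according to any one of claims 17 to 27, wherein the to-be-decoded information group comprises any one or more of: a picture, a frame, a slice, a tile, and a set of basic coding units.
- The decoding method according to any one of claims 17 to 28, wherein the positional relationship information is located in at least one of supplemental enhancement information, a network abstraction layer unit header, a slice header, a frame header, a tile header, and a picture parameter set.
- The decoding method according to claim 26 or 27, wherein the indication information indicating whether the to-be-decoded information group is a basic information group or a redundant information group, and the attribute information of the basic information group corresponding to the redundant information group, are located in at least one of supplemental enhancement information, a network abstraction layer unit header, a slice header, a frame header, a tile header, and a picture parameter set.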
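- An apparatus for encoding video data, comprising a processor, wherein the processor is configured to: determine a to-be-encoded information group in a to-be-encoded video sequence; generate a basic information group corresponding to the to-be-encoded information group, wherein the basic information group comprises first data information, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and position information of the basic information group; generate a redundant information group corresponding to the to-be-encoded information group, wherein the redundant information group comprises second data information, position information of the redundant information group, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and positional relationship information between the redundant information group and the basic information group; and encode the basic information group and the redundant information group into a code stream.
- The apparatus according to claim 31, wherein the position information of the basic information group comprises: a position of the basic information group in the code stream, an order in the encoding process, or a picture display order; and the position information of the redundant information group comprises: a position of the redundant information group in the code stream, an order in the encoding process, or a picture display order.
- The apparatus according to claim 32, wherein the positional relationship information comprises: an absolute value and a sequential order of a position difference between the redundant information group and the basic information group, wherein the absolute value and the sequential order of the position difference comprise: an absolute value and a sequential order of a position difference, in the code stream, of the basic information group or the redundant information group, or an absolute value and a sequential order of an order difference in the encoding process, or an absolute value and a sequential order of a picture display order difference.
- The apparatus according to claim 33, wherein the positional relationship information further comprises: flag information indicating whether the position information of the redundant information group and the position information of the basic information group belong to a same group of pictures.
- The apparatus according to claim 34, wherein when the position information of the redundant information group and the position information of the basic information group do not belong to a same group of pictures, the positional relationship information further comprises a quantity of groups of pictures separating the redundant information group from the basic information group.
- The apparatus according to claim 33, wherein the positional relationship information further comprises: information indicating a quantity of groups of pictures separating the redundant information group from the basic information group.
- The apparatus according to any one of claims 31 to 36, wherein the processor is further configured to: determine whether the basic information group has been generated; if the basic information group has not been generated, set preset positional relationship information as the positional relationship information between the redundant information group and the basic information group, and generate the second data information and the position information of the redundant information group; and if the basic information group has been generated, calculate the positional relationship information between the redundant information group and the basic information group and use it as the positional relationship information between the redundant information group and the basic information group, and generate the second data information and the position information of the redundant information group.
- The apparatus according to any one of claims 31 to 36, wherein the processor is further configured to: determine whether the redundant information group has been generated; if the redundant information group has not been generated, set preset positional relationship information between the basic information group and the redundant information group as the positional relationship information between the redundant information group and the basic information group, and generate the first data information and the position information of the basic information group; and if the redundant information group has been generated, calculate the positional relationship information between the redundant information group and the basic information group and use it as the positional relationship information between the redundant information group and the basic information group, and generate the first data information and the position information of the basic information group.
- The apparatus according to claim 32, wherein the processor is further configured to: if the position information is a picture display order, during the generating of the basic information group corresponding to the to-be-encoded information group and the generating of the redundant information group corresponding to the to-be-encoded information group, set the picture display orders of the basic information group and the redundant information group according to a preset rule, so that the picture display order of the basic information group differs from that of the redundant information group.
- The apparatus according to claim 39, wherein the processor is further configured to: set the picture display order of the basic information group to consecutive even numbers, and set the picture display order of the redundant information group to the picture display order of the corresponding basic information group plus 1.
- The apparatus according to any one of claims 31 to 40, wherein the basic information group further comprises: the positional relationship information between the redundant information group and the basic information group.
- The apparatus according to any one of claims 31 to 41, wherein the redundant information group further comprises: attribute information of the basic information group corresponding to the redundant information group.
- The apparatus according to claim 42, wherein the attribute information of the basic information group comprises at least one of the following: information indicating whether the basic information group belongs to an instant refresh frame, information indicating whether the basic information group can be referenced, information indicating whether the basic information group is displayed at the decoding end, and information indicating whether the basic information group can be discarded.
- The apparatus according to any one of claims 31 to 33, wherein the to-be-encoded information group comprises any one or more of: a picture, a frame, a slice, a tile, and a set of basic coding units.
- The apparatus according to any one of claims 31 to 34, wherein the positional relationship information is located in at least one of supplemental enhancement information, a network abstraction layer unit header, a slice header, a frame header, a tile header, and a picture parameter set.
- The apparatus according to claim 42 or 43, wherein the indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and the attribute information of the basic information group corresponding to the redundant information group, are located in at least one of supplemental enhancement information, a network abstraction layer unit header, a slice header, a frame header, a tile header, and a picture parameter set.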
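- An apparatus for decoding video data, comprising a processor, wherein the processor is configured to: obtain a to-be-decoded code stream and determine a to-be-decoded information group; parse to obtain a basic information group corresponding to the to-be-decoded information group, wherein the basic information group comprises first data information, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and position information of the basic information group; parse to obtain a redundant information group corresponding to the to-be-decoded information group, wherein the redundant information group comprises second data information, position information of the redundant information group, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and positional relationship information between the redundant information group and the basic information group, the positional relationship information between the redundant information group and the basic information group being used to derive the position information of the basic information group; and obtain a decoded information group based on the basic information group obtained by parsing or the redundant information group obtained by parsing, the decoded information group being used for reconstruction of the encoded video sequence.
- The apparatus according to claim 47, wherein the position information of the basic information group comprises: a position of the basic information group in the code stream, an order in the encoding process, or a picture display order; and the position information of the redundant information group comprises: a position of the redundant information group in the code stream, an order in the encoding process, or a picture display order.
- The apparatus according to claim 48, wherein the positional relationship information comprises: an absolute value and a sequential order of a position difference between the redundant information group and the basic information group, wherein the absolute value and the sequential order of the position difference comprise: an absolute value and a sequential order of a position difference, in the code stream, of the basic information group or the redundant information group, or an absolute value and a sequential order of an order difference in the encoding process, or an absolute value and a sequential order of a picture display order difference.
- The apparatus according to claim 49, wherein the positional relationship information further comprises: flag information indicating whether the position information of the redundant information group and the position information of the basic information group belong to a same group of pictures.
- The apparatus according to claim 50, wherein when the position information of the redundant information group and the position information of the basic information group do not belong to a same group of pictures, the positional relationship information further comprises a quantity of groups of pictures separating the redundant information group from the basic information group.
- The apparatus according to claim 48, wherein the positional relationship information further comprises: information indicating a quantity of groups of pictures separating the redundant information group from the basic information group.
- The apparatus according to any one of claims 47 to 52, wherein the processor is further configured to: determine whether the basic information group has been correctly parsed; if the basic information group has been correctly parsed, stop parsing the to-be-decoded information group without further parsing to obtain the redundant information group corresponding to the to-be-decoded information group, and obtain the decoded information group based on the basic information group obtained by parsing, the decoded information group being used for reconstruction of the encoded video sequence; and if the basic information group has not been correctly parsed, continue parsing to obtain the redundant information group corresponding to the to-be-decoded information group, the redundant information group comprising second data information, position information of the redundant information group, indication information indicating whether the to-be-encoded information group is a basic information group or a redundant information group, and positional relationship information between the redundant information group and the basic information group, the positional relationship information being used to derive the position information of the basic information group, and obtain the decoded information group based on the redundant information group obtained by parsing, the decoded information group being used for reconstruction of the encoded video sequence.
- The apparatus according to claim 48, wherein the processor is further configured to: if the position information is a picture display order, during the generating of the basic information group corresponding to the to-be-decoded information group and the generating of the redundant information group corresponding to the to-be-decoded information group, determine the picture display orders of the basic information group and the redundant information group according to the picture display orders of the basic information group and the redundant information group that were set according to a preset rule, the picture display order of the basic information group being different from that of the redundant information group.
- The apparatus according to claim 54, wherein the processor is further configured such that the picture display order of the basic information group is set to consecutive even numbers, and the picture display order of the redundant information group is set to the picture display order of the corresponding basic information group plus 1.
- The apparatus according to any one of claims 47 to 55, wherein the redundant information group further comprises: attribute information of the basic information group corresponding to the redundant information group.
- The apparatus according to claim 56, wherein the attribute information of the basic information group comprises at least one of the following: information indicating whether the basic information group belongs to an instant refresh frame, information indicating whether the basic information group can be referenced, information indicating whether the basic information group is displayed at the decoding end, and information indicating whether the basic information group can be discarded.
- The apparatus according to any one of claims 47 to 57, wherein the to-be-decoded information group comprises any one or more of: a picture, a frame, a slice, a tile, and a set of basic coding units.
- The apparatus according to any one of claims 47 to 58, wherein the positional relationship information is located in at least one of supplemental enhancement information, a network abstraction layer unit header, a slice header, a frame header, a tile header, and a picture parameter set.
- The apparatus according to claim 47 or 57, wherein the indication information indicating whether the to-be-decoded information group is a basic information group or a redundant information group, and the attribute information of the basic information group corresponding to the redundant information group, are located in at least one of supplemental enhancement information, a network abstraction layer unit header, a slice header, a frame header, a tile header, and a picture parameter set.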
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020167024055A KR101959319B1 (ko) | 2014-03-18 | 2015-03-17 | 비디오 데이터 인코딩 및 디코딩 방법 및 장치 |
BR112016021321-1A BR112016021321B1 (pt) | 2014-03-18 | 2015-03-17 | Métodos e aparelhos de codificação e decodificação de dados de vídeo |
JP2016556710A JP6268559B2 (ja) | 2014-03-18 | 2015-03-17 | 映像データの符号化および復号方法ならびに装置 |
EP15765911.1A EP3096521A4 (en) | 2014-03-18 | 2015-03-17 | Video data encoding and decoding methods and apparatuses |
US15/258,450 US10313703B2 (en) | 2014-03-18 | 2016-09-07 | Video data encoding and decoding methods and apparatuses |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410100869.3A CN104935944B (zh) | 2014-03-18 | 2014-03-18 | 一种视频数据编码、解码的方法和装置 |
CN201410100869.3 | 2014-03-18 |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/258,450 Continuation US10313703B2 (en) | 2014-03-18 | 2016-09-07 | Video data encoding and decoding methods and apparatuses |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015139624A1 (zh) | 2015-09-24 |
Family
ID=54122858
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2015/074421 WO2015139624A1 (zh) | 2014-03-18 | 2015-03-17 | 一种视频数据编码、解码的方法和装置 |
Country Status (6)
Country | Link |
---|---|
US (1) | US10313703B2 (zh) |
EP (1) | EP3096521A4 (zh) |
KR (1) | KR101959319B1 (zh) |
CN (1) | CN104935944B (zh) |
BR (1) | BR112016021321B1 (zh) |
WO (1) | WO2015139624A1 (zh) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10645404B2 (en) | 2014-03-24 | 2020-05-05 | Qualcomm Incorporated | Generic use of HEVC SEI messages for multi-layer codecs |
CN106713927B (zh) * | 2016-11-14 | 2019-08-13 | 珠海格力电器股份有限公司 | 一种图像输出装置、方法及解码器 |
CA3132582A1 (en) * | 2019-03-07 | 2020-09-10 | Digitalinsights Inc. | Image encoding/decoding method and apparatus |
JPWO2020255771A1 (zh) * | 2019-06-20 | 2020-12-24 | ||
AU2020396705A1 (en) * | 2019-12-06 | 2022-07-28 | Lg Electronics Inc. | Method and apparatus for encoding/decoding image on basis of picture header including information relating to co-located picture, and method for transmitting bitstream |
WO2021252541A1 (en) | 2020-06-09 | 2021-12-16 | Bytedance Inc. | Signaling non-scalable-bested hypothetical reference video decoder information |
WO2021252457A1 (en) | 2020-06-09 | 2021-12-16 | Bytedance Inc. | Subpicture level information signaling in video coding |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1115951A (zh) * | 1992-06-19 | 1996-01-31 | 通用电气公司 | 在噪声通信信道上安排传输已压缩视频数据的方法 |
CN101695134A (zh) * | 2009-10-15 | 2010-04-14 | 中兴通讯股份有限公司 | 一种弱信号环境下提高终端播放性能的终端、系统及方法 |
WO2011023776A1 (fr) * | 2009-08-28 | 2011-03-03 | Commissariat à l'énergie atomique et aux énergies alternatives | Procédé d'égalisation de la taille des paquets de données par blocs d'un flux multimedia |
CN102438152A (zh) * | 2011-12-29 | 2012-05-02 | 中国科学技术大学 | 可伸缩视频编码容错传输方法、编码器、装置和系统 |
CN103501441A (zh) * | 2013-09-11 | 2014-01-08 | 北京交通大学长三角研究院 | 一种基于人类视觉系统的多描述视频编码方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1722570B1 (en) | 2005-05-11 | 2020-04-29 | Beijing Xiaomi Mobile Software Co., Ltd. | Method of transmitting video data |
WO2007081150A1 (en) | 2006-01-09 | 2007-07-19 | Electronics And Telecommunications Research Institute | Method defining nal unit type and system of transmission bitstream and redundant slice coding |
JPWO2007099637A1 (ja) | 2006-03-02 | 2009-07-16 | 富士通マイクロエレクトロニクス株式会社 | 画像送信装置、画像受信装置、および画像送受信システム |
-
2014
- 2014-03-18 CN CN201410100869.3A patent/CN104935944B/zh active Active
-
2015
- 2015-03-17 EP EP15765911.1A patent/EP3096521A4/en not_active Ceased
- 2015-03-17 BR BR112016021321-1A patent/BR112016021321B1/pt active IP Right Grant
- 2015-03-17 KR KR1020167024055A patent/KR101959319B1/ko active IP Right Grant
- 2015-03-17 WO PCT/CN2015/074421 patent/WO2015139624A1/zh active Application Filing
-
2016
- 2016-09-07 US US15/258,450 patent/US10313703B2/en active Active
Non-Patent Citations (1)
Title |
---|
See also references of EP3096521A4 * |
Also Published As
Publication number | Publication date |
---|---|
BR112016021321A2 (pt) | 2018-06-26 |
BR112016021321B1 (pt) | 2023-12-05 |
CN104935944A (zh) | 2015-09-23 |
EP3096521A4 (en) | 2017-03-08 |
CN104935944B (zh) | 2018-03-13 |
JP2017513313A (ja) | 2017-05-25 |
US20160381394A1 (en) | 2016-12-29 |
EP3096521A1 (en) | 2016-11-23 |
US10313703B2 (en) | 2019-06-04 |
KR101959319B1 (ko) | 2019-03-18 |
KR20160114717A (ko) | 2016-10-05 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15765911 Country of ref document: EP Kind code of ref document: A1 |
|
REEP | Request for entry into the european phase |
Ref document number: 2015765911 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2015765911 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 20167024055 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2016556710 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: IDP00201606587 Country of ref document: ID |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112016021321 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 112016021321 Country of ref document: BR Kind code of ref document: A2 Effective date: 20160915 |