CN103237213A - Method for coding videos, method for decoding videos and related devices - Google Patents

Method for coding videos, method for decoding videos and related devices Download PDF

Info

Publication number
CN103237213A
CN103237213A CN2013101196999A CN201310119699A CN103237213A CN 103237213 A CN103237213 A CN 103237213A CN 2013101196999 A CN2013101196999 A CN 2013101196999A CN 201310119699 A CN201310119699 A CN 201310119699A CN 103237213 A CN103237213 A CN 103237213A
Authority
CN
China
Prior art keywords
image
code stream
class
time domain
class image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013101196999A
Other languages
Chinese (zh)
Other versions
CN103237213B (en
Inventor
林永兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201310119699.9A priority Critical patent/CN103237213B/en
Publication of CN103237213A publication Critical patent/CN103237213A/en
Priority to PCT/CN2014/072458 priority patent/WO2014166319A1/en
Application granted granted Critical
Publication of CN103237213B publication Critical patent/CN103237213B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • H04N19/52Processing of motion vectors by encoding by predictive encoding

Abstract

An embodiment of the invention discloses a method for coding videos, a method for decoding the videos and related devices. The method for decoding the videos can include acquiring a code stream; decoding a first type of images in the code stream; and decoding a second type of images in the code stream according to constraints and the decoded first type of images. The code stream carries first indicating information, the first indicating information is used for indicating the prediction constraints for time-domain motion vectors of the second type of images in the code stream, the second type of images are non-basic view images in the code stream, and the first type of images are basic view images in the code stream; or the second type of images are non-basic layer images in the code stream, and the first type of images are basic layer images in the code stream. According to the technical scheme in the embodiment of the invention, the method for coding the videos, the method for decoding the videos and the related devices have the advantage that the storage overhead of coding/decoding equipment is reduced advantageously.

Description

Method for video coding and video encoding/decoding method and relevant apparatus
Technical field
The present invention relates to technical field of image processing, be specifically related to method for video coding and video encoding/decoding method and relevant apparatus.
Background technology
Development and ever-increasing high-definition digital video demand along with the photoelectricity acquisition technique, the video data volume is increasing, the transmission bandwidth of limited isomery, diversified Video Applications have constantly proposed higher demand to the video code efficiency, the formulation work of high-performance video coding (HEVC, High Efficient Video Coding) standard starts because of need.
The basic principle of video coding compression is the correlation of utilizing between spatial domain, time domain and the code word, removes redundant as far as possible.Present popular way is to adopt block-based mixed video coding framework, realizes the compression of video coding by steps such as prediction (comprising infra-frame prediction and inter prediction), conversion, quantification, entropy codings.This coding framework has shown very strong vitality, and HEVC also still continues to use this block-based mixed video coding framework.
Time domain motion vector predictor (TMVP, Temporal Motion Vector prediction) technology is the motion vector (MV applicable to haplopia/layer, Motion Vector) Predicting Technique utilizes motion (motion) information of time domain reference picture to predict the MV (motion vector) of current block.Determine that from a plurality of time domain reference pictures 1 image as the correspondence image (collocated picture) of present image, utilizes the MV of the motion information prediction present image of this correspondence image.
The TMVP technology is applied to multi-video coding and decoding (MVC, multi-view coding) in, for the non-Code And Decode of looking (non-base view) substantially, except the motion information of a plurality of time domain reference pictures that utilize current non-base view, also may be used to predict the MV of current block from the motion information of looking a reference picture of looking (base view) substantially.Usually need to look from these a plurality of time domain reference pictures and this and determine reference picture that 1 image is as current non-base view correspondence image.
Because in the existing TMVP forecasting mechanism, the correspondence image of current non-base view may be any one in these reference pictures, therefore need to preserve the motion information of all these reference pictures.And movable information is piece aspect (block level), and each block has motion information, so the storage overhead of encoding and decoding end is all very big.Be understandable that the problem of bringing that the TMVP technology is applied in multilayer encoding and decoding such as the scalable video (SVC, scalable video coding) is similar with it.
Summary of the invention
The embodiment of the invention provides method for video coding and video encoding/decoding method and relevant apparatus, in the hope of reducing the storage overhead of coding/decoding equipment.
First aspect present invention provides a kind of video encoding/decoding method, can comprise:
Obtain code stream, wherein, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, wherein, the described second class image is the non-basic view picture in the described code stream, and first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream;
First kind image in the described code stream is decoded;
According to described restrictive condition and decoded first kind image, the second class image in the described code stream is decoded.
In conjunction with first aspect, in first kind of possible execution mode, described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain.
In conjunction with first kind of first aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of the described second class image and the POC of described first kind image equate.
In conjunction with second kind of possible execution mode of first kind of first aspect or first aspect possible execution mode or first aspect, in the third possible execution mode, described first kind image is the correspondence image of the described second class image.
In conjunction with the third possible execution mode of first aspect, in the 4th kind of possible execution mode, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of first aspect or the 4th kind of possible execution mode of first aspect in conjunction with first kind of first aspect or first aspect possible execution mode or first aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of first aspect or first aspect possible execution mode or first aspect or first aspect or first aspect or the 5th kind of possible execution mode of first aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Second aspect present invention provides a kind of video encoding/decoding method, can comprise:
Obtain code stream;
First kind image in the described code stream is decoded;
According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the described code stream, prediction obtains the time domain motion vector of the second class image in the described code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the described code stream is decoded, wherein, the described second class image is the non-basic view picture in the described code stream, and described first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream.
In conjunction with second aspect, in first kind of possible execution mode, described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain;
Described restrictive condition and decoded first kind image according to the time domain motion-vector prediction of the second class image in the described code stream, prediction obtains the time domain motion vector of the second class image in the described code stream, comprise: the restrictive condition of the time domain motion-vector prediction of the second class image and decoded first kind image in the described code stream according to a preconcerted arrangement, prediction obtains the time domain motion vector of the second class image in the described code stream.
In conjunction with first kind of second aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of the described second class image and the POC of described first kind image equate.
In conjunction with second kind of possible execution mode of first kind of second aspect or second aspect possible execution mode or second aspect, in the third possible execution mode, described first kind image is the correspondence image of the described second class image.
In conjunction with the third possible execution mode of second aspect, in the 4th kind of possible execution mode, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of second aspect or the 4th kind of possible execution mode of second aspect in conjunction with first kind of second aspect or second aspect possible execution mode or second aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of second aspect or second aspect possible execution mode or second aspect or second aspect or second aspect or the 5th kind of possible execution mode of second aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Third aspect present invention provides a kind of method for video coding, comprising:
Generate code stream, wherein, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, the described second class image is the non-basic view picture in the described code stream, or the described second class image is the non-basic tomographic image in the described code stream;
Store or export described code stream.
In conjunction with the third aspect, in first kind of possible execution mode,
Described restrictive condition is: the time domain motion vector of the second class image described in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, wherein, if the described second class image is the non-basic view picture in the described code stream, then described first kind image is the basic view picture in the described code stream; Perhaps, if the described second class image is the non-basic tomographic image in the described code stream, then described first kind image is the basic tomographic image in the described code stream.
In conjunction with first kind of the third aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of the described second class image and the POC of described first kind image equate.
In conjunction with second kind of possible execution mode of first kind of the third aspect or the third aspect possible execution mode or the third aspect, in the third possible execution mode, described first kind image is the correspondence image of the described second class image.
In conjunction with the third possible execution mode of the third aspect, in the 4th kind of possible execution mode, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of the third aspect or the 4th kind of possible execution mode of the third aspect in conjunction with first kind of the third aspect or the third aspect possible execution mode or the third aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of the third aspect or the third aspect possible execution mode or the third aspect or the third aspect or the third aspect or the 5th kind of possible execution mode of the third aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Fourth aspect present invention provides a kind of method for video coding, can comprise:
Generate code stream, wherein, the time domain motion vector of the second class image in the described code stream predicts based on restrictive condition and obtains that the described second class image is the non-basic view picture in the described code stream, or the described second class image is the non-basic tomographic image in the described code stream;
Store or export described code stream.
In conjunction with fourth aspect, in first kind of possible execution mode, described restrictive condition is: the time domain motion vector of the second class image described in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, wherein, if the described second class image is the non-basic view picture in the described code stream, then described first kind image is the basic view picture in the described code stream; Perhaps, if the described second class image is the non-basic tomographic image in the described code stream, then described first kind image is the basic tomographic image in the described code stream.
In conjunction with first kind of fourth aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of the described second class image and the POC of described first kind image equate.
In conjunction with second kind of possible execution mode of first kind of fourth aspect or fourth aspect possible execution mode or fourth aspect, in the third possible execution mode, described first kind image is the correspondence image of the described second class image.
In conjunction with the third possible execution mode of fourth aspect, in the 4th kind of possible execution mode, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of fourth aspect or the 4th kind of possible execution mode of fourth aspect in conjunction with first kind of fourth aspect or fourth aspect possible execution mode or fourth aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of fourth aspect or fourth aspect possible execution mode or fourth aspect or fourth aspect or fourth aspect or the 5th kind of possible execution mode of fourth aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Fifth aspect present invention provides a kind of video decoder, can comprise:
Obtain the unit, be used for obtaining code stream, wherein, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, wherein, the described second class image is the non-basic view picture in the described code stream, and first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream;
First decoding unit is used for the first kind image of described code stream is decoded, and according to described restrictive condition and decoded first kind image, the second class image in the code stream of described acquisition unit acquisition is decoded.
In conjunction with the 5th aspect, in first kind of possible execution mode, described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain.
In conjunction with the 5th aspect or in conjunction with first kind of the 5th aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of the described second class image and the POC of described first kind image equate.
In conjunction with first kind of possible execution mode of the 5th aspect or the 5th aspect or second kind of possible execution mode of the 5th aspect, in the third possible execution mode, described first kind image is the correspondence image of the described second class image.
In conjunction with the third possible execution mode of the 5th aspect, in the 5th kind of possible execution mode, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of the 5th aspect or the 4th kind of possible execution mode of the 5th aspect in conjunction with first kind of possible execution mode of the 5th aspect or the 5th aspect or the 5th aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of possible execution mode of the 5th aspect or the 5th aspect or the 5th aspect or the 5th aspect or the 5th aspect or the 5th kind of possible execution mode of the 5th aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Sixth aspect present invention provides a kind of video decoder, comprising:
Obtain the unit, be used for obtaining code stream;
Second decoding unit is used for the first kind image of described code stream is decoded; According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the described code stream, prediction obtains the time domain motion vector of the second class image in the described code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the described code stream is decoded, wherein, the described second class image is the non-basic view picture in the described code stream, and described first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream.
In conjunction with the 6th aspect, in first kind of possible execution mode, described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain;
Described second decoding unit specifically is used for, and the first kind image in the described code stream is decoded; Described restrictive condition according to a preconcerted arrangement, prediction obtains the time domain motion vector of the second class image in the described code stream; Utilize the time domain motion vector of predicting the second class image that obtains, the second class image in the described code stream is decoded.
In conjunction with first kind of the 6th aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of described the 6th class image and the POC of described first kind image equate.
In conjunction with first kind of possible execution mode of the 6th aspect or the 6th aspect or second kind of possible execution mode of the 6th aspect, in the third possible execution mode, described first kind image is the correspondence image of described the 6th class image.
In conjunction with the third possible execution mode of the 6th aspect, in the 4th kind of possible execution mode, described first kind image is the correspondence image of the 6th class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of the 6th aspect or the 4th kind of possible execution mode of the 6th aspect in conjunction with first kind of possible execution mode of the 6th aspect or the 6th aspect or the 6th aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of possible execution mode of the 6th aspect or the 6th aspect or the 6th aspect or the 6th aspect or the 6th aspect or the 5th kind of possible execution mode of the 6th aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Seventh aspect present invention provides a kind of video coding apparatus, comprising:
First coding unit, be used for generating code stream, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, wherein, the described second class image is the non-basic view picture in the described code stream, or the described second class image is the non-basic tomographic image in the described code stream;
Processing unit is used for storage or exports described code stream.
In conjunction with the 7th aspect, in first kind of possible execution mode,
Described restrictive condition is: the time domain motion vector of the second class image described in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, wherein, if the described second class image is the non-basic view picture in the described code stream, then described first kind image is the basic view picture in the described code stream; Perhaps, if the described second class image is the non-basic tomographic image in the described code stream, then described first kind image is the basic tomographic image in the described code stream.
In conjunction with first kind of the 7th aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of the described second class image and the POC of described first kind image equate.
In conjunction with first kind of possible execution mode of the 7th aspect or the 7th aspect or second kind of possible execution mode of the 7th aspect, in the third possible execution mode, described first kind image is the correspondence image of the described second class image.
In conjunction with the third possible execution mode of the 7th aspect, in the 4th kind of possible execution mode, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of the 7th aspect or the 4th kind of possible execution mode of the 7th aspect in conjunction with first kind of possible execution mode of the 7th aspect or the 7th aspect or the 7th aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of possible execution mode of the 7th aspect or the 7th aspect or the 7th aspect or the 7th aspect or the 7th aspect or the 5th kind of possible execution mode of the 7th aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Eighth aspect present invention provides a kind of video coding apparatus, comprising:
Second coding unit, be used for generating code stream, wherein, the time domain motion vector of each the second class image in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, described first kind image is the basic view picture in the described code stream, and the described second class image is the non-basic view picture in the described code stream; Perhaps described first kind image is the basic tomographic image in the described code stream, and the described second class image is the non-basic tomographic image in the described code stream;
Described code stream is stored or exported to processing unit.
In conjunction with eight aspect, in first kind of possible execution mode,
Described restrictive condition is: the time domain motion vector of the second class image described in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, wherein, if the described second class image is the non-basic view picture in the described code stream, then described first kind image is the basic view picture in the described code stream; Perhaps, if the described second class image is the non-basic tomographic image in the described code stream, then described first kind image is the basic tomographic image in the described code stream.
In conjunction with first kind of eight aspect possible execution mode, in second kind of possible execution mode, the image sequence POC of the described second class image and the POC of described first kind image equate.
In conjunction with first kind of possible execution mode of eight aspect or eight aspect or second kind of possible execution mode of eight aspect, in the third possible execution mode, described first kind image is the correspondence image of the described second class image.
In conjunction with the third possible execution mode of eight aspect, in the 4th kind of possible execution mode, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
Second kind of possible execution mode or the third possible execution mode of eight aspect or the 4th kind of possible execution mode of eight aspect in conjunction with first kind of possible execution mode of eight aspect or eight aspect or eight aspect, in the 5th kind of possible execution mode, described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
In conjunction with the 4th kind of possible execution mode of the third possible execution mode of second kind of possible execution mode of first kind of possible execution mode of eight aspect or eight aspect or eight aspect or eight aspect or eight aspect or the 5th kind of possible execution mode of eight aspect, in the 6th kind of possible execution mode, described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each non-looking substantially or non-basic layer header or parameter set or auxiliary first indication information that comprises in the informational message that strengthens, be used to indicate this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
As can be seen, in the technical scheme that some embodiments of the invention provide, carry first indication information in the code stream to be decoded, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown; According to restrictive condition the second class image in the code stream is decoded, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, to do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below, apparently, accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the schematic flow sheet of a kind of video encoding/decoding method of providing of the embodiment of the invention;
Fig. 2 is the schematic flow sheet of the another kind of video encoding/decoding method that provides of the embodiment of the invention;
Fig. 3 is the schematic flow sheet of a kind of method for video coding of providing of the embodiment of the invention;
Fig. 4 is the schematic flow sheet of the another kind of method for video coding that provides of the embodiment of the invention;
Fig. 5 is the schematic diagram of a kind of video decoder of providing of the embodiment of the invention;
Fig. 6 is the schematic diagram of the another kind of video decoder that provides of the embodiment of the invention;
Fig. 7 is the schematic diagram of a kind of video coding apparatus of providing of the embodiment of the invention;
Fig. 8 is the schematic diagram of the another kind of video coding apparatus that provides of the embodiment of the invention;
Fig. 9 is the schematic diagram of the another kind of video decoder that provides of the embodiment of the invention;
Figure 10 is the schematic diagram of the another kind of video decoder that provides of the embodiment of the invention;
Figure 11 is the schematic diagram of the another kind of video coding apparatus that provides of the embodiment of the invention;
Figure 12 is the schematic diagram of the another kind of video coding apparatus that provides of the embodiment of the invention;
Figure 13 is the schematic diagram of the another kind of video encoder that provides of the embodiment of the invention.
Embodiment
The embodiment of the invention provides method for video coding and video encoding/decoding method and relevant apparatus, in the hope of reducing the storage overhead of coding/decoding device.
In order to make those skilled in the art person understand the present invention program better, below in conjunction with the accompanying drawing in the embodiment of the invention, technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the embodiment of a part of the present invention, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills should belong to the scope of protection of the invention not making the every other embodiment that obtains under the creative work prerequisite.
Below be elaborated respectively.
Term " first " in specification of the present invention and claims and the above-mentioned accompanying drawing, " second ", " the 3rd " " 4th " etc. (if existence) are for the similar object of difference, and needn't be used for describing specific order or precedence.The data that should be appreciated that such use suitably can exchanged under the situation, so as embodiments of the invention described herein for example can with except diagram here or describe those order enforcement.In addition, term " comprises " and " having " and their any distortion, intention is to cover not exclusive comprising, for example, comprised those steps or unit that process, method, system, product or the equipment of series of steps or unit are not necessarily limited to clearly list, but can comprise clearly do not list or for these processes, method, product or equipment intrinsic other step or unit.
An embodiment of video encoding/decoding method of the present invention, video encoding/decoding method can comprise: obtain code stream, wherein, carry first indication information in the above-mentioned code stream, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the above-mentioned code stream is shown, wherein, the second class image is the non-basic view picture in the above-mentioned code stream, and first kind image is the basic view picture in the above-mentioned code stream; Or second the class image be non-basic tomographic image in the above-mentioned code stream, first kind image is the basic tomographic image in the above-mentioned code stream; First kind image in the above-mentioned code stream is decoded; According to above-mentioned restrictive condition and decoded first kind image, the second class image in the above-mentioned code stream is decoded.
At first see also Fig. 1, Fig. 1 is the schematic flow sheet of a kind of video encoding/decoding method of providing of the embodiment of the invention.As shown in Figure 1, a kind of video encoding/decoding method that one embodiment of the invention provides can comprise following content:
101, obtain code stream;
Wherein, carry first indication information in the above-mentioned code stream, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
Wherein, can from local storage or disk or CD, obtain code stream, also can obtain the coding side transmitted stream.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the image sequence of the second class image number (POC, Picture order count) can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
102, the first kind image in the above-mentioned code stream is decoded, according to above-mentioned restrictive condition and decoded first kind image the second class image in the code stream is decoded.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or auxiliary enhancing informational message (SEI message, Supplemental enhancement information message) can be mainly used in describing the relevant information of video, sequence, image or band (slice).
First indication information for example can be arranged among code stream sequence parameter set (SPS, Sequence parameter set) or the video parameter collection (VPS, video parameter set).First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific class (profile), wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
As can be seen, in the technical scheme that present embodiment provides, carry first indication information in the code stream to be decoded, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown; According to restrictive condition the second class image in the code stream is decoded, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, so decoding end need not to store multiple reference picture, and (this multiple reference picture can comprise a reference picture of looking that belongs to first kind image, with the time domain reference picture that belongs to the second class image), the movable information that only need store first kind image (namely looking a reference picture) can be finished the prediction of the second class image time domain motion vector in the code stream, and need not store the movable information of time domain reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce greatly the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Another embodiment of video encoding/decoding method of the present invention, method can comprise: obtain code stream; First kind image in the above-mentioned code stream is decoded; According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the above-mentioned code stream, prediction obtains the time domain motion vector of the second class image in the above-mentioned code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the above-mentioned code stream is decoded, wherein, the second class image is the non-basic view picture in the above-mentioned code stream, and first kind image is the basic view picture in the above-mentioned code stream; Or second the class image be non-basic tomographic image in the above-mentioned code stream, first kind image is the basic tomographic image in the above-mentioned code stream.
At first see also Fig. 2, Fig. 2 is the schematic flow sheet of a kind of video encoding/decoding method of providing of the embodiment of the invention.As shown in Figure 2, a kind of video encoding/decoding method that another embodiment of the present invention provides can comprise following content:
201, obtain code stream.
Wherein, can from local storage or disk or CD, obtain code stream, also can obtain the coding side transmitted stream.
202, the first kind image in the above-mentioned code stream is decoded; According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the above-mentioned code stream, prediction obtains the time domain motion vector of the second class image in the above-mentioned code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the above-mentioned code stream is decoded.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps, first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
In some embodiments of the invention, restrictive condition according to the time domain motion-vector prediction of the second class image in the above-mentioned code stream, prediction obtains the time domain motion vector of the second class image in the above-mentioned code stream, comprise: the restrictive condition of the time domain motion-vector prediction of the second class image in (shake hands as coding side and decoding end agreement, acquiescence agreement or other stipulated form etc.) above-mentioned code stream according to a preconcerted arrangement, prediction obtains the time domain motion vector of the second class image in the above-mentioned code stream.
In other embodiment of the present invention, go back portability first indication information in the above-mentioned code stream, wherein first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
As can be seen, in the technical scheme of present embodiment, according to restrictive condition the second class image in the code stream is decoded, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, so decoding end need not to store multiple reference picture, and (this multiple reference picture can comprise a reference picture of looking that belongs to first kind image, with the time domain reference picture that belongs to the second class image), the movable information that only need store first kind image (namely looking a reference picture) can be finished the prediction of the second class image time domain motion vector in the code stream, and need not store the movable information of time domain reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce greatly the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
One embodiment of method for video coding of the present invention, method for video coding can comprise: generate code stream, wherein carry first indication information in the above-mentioned code stream, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the above-mentioned code stream is shown, the second class image is the non-basic view picture in the above-mentioned code stream, or the second class image is the non-basic tomographic image in the above-mentioned code stream; Store or export above-mentioned code stream.
At first see also Fig. 3, Fig. 3 is the schematic flow sheet of a kind of video encoding/decoding method of providing of the embodiment of the invention.As shown in Figure 3, a kind of video encoding/decoding method that one embodiment of the invention provides can comprise following content:
301, generate code stream;
Wherein, carry first indication information in the above-mentioned code stream, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
302, store or export above-mentioned code stream.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
As can be seen, in the technical scheme that present embodiment provides, carry first indication information in the code stream of generation, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown; According to restrictive condition the second class image in the code stream is decoded, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that the coding/decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of coding/decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or, first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, therefore coding side and decoding end all need not to store multiple reference picture, the movable information that only need store first kind image can be finished the prediction of the second class image time domain motion vector in the code stream, for the movable information that prior art must be stored multiple reference picture, the present invention program is conducive to reduce greatly the storage overhead of coding/decoding device, be conducive to simplify the coding/decoding process, and the test proof, therefore picture quality too much do not influenced.
Another embodiment of method for video coding of the present invention, method can comprise: generate code stream, wherein, the time domain motion vector of the second class image in the above-mentioned code stream is predicted based on restrictive condition and is obtained, the second class image is the non-basic view picture in the above-mentioned code stream, or the second class image is the non-basic tomographic image in the above-mentioned code stream; Store or export above-mentioned code stream.
At first see also Fig. 4, Fig. 4 is the schematic flow sheet of the another kind of video encoding/decoding method that provides of the embodiment of the invention.As shown in Figure 4, a kind of video encoding/decoding method that another embodiment of the present invention provides can comprise following content:
401, generate code stream;
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps, first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
402, store or export above-mentioned code stream.
In other embodiment of the present invention, go back portability first indication information in the above-mentioned code stream, wherein first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
As can be seen, in the technical scheme of present embodiment, the time domain motion vector of the second class image in the code stream that generates is predicted based on restrictive condition and is obtained, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that the coding/decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of coding/decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, predict based on the movable information of decoded first kind image in the above-mentioned code stream and to obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, therefore the coding/decoding end need not to store multiple reference picture, the movable information that only need store first kind image can be finished the prediction of the second class image time domain motion vector in the code stream, for the movable information that prior art must be stored multiple reference picture, this programme is conducive to reduce greatly the storage overhead of coding/decoding device, be conducive to simplify the decoding device decode procedure, and therefore test proof picture quality do not influenced too much.
Such scheme for ease of the better implement example embodiment of the invention also is provided for implementing the such scheme relevant apparatus below.
Referring to Fig. 5, the embodiment of the invention provides a kind of video decoder 500, can comprise: obtain unit 510 and first decoding unit 520.
Wherein, obtain unit 510, be used for obtaining code stream, wherein, carry first indication information in the above-mentioned code stream, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the above-mentioned code stream is shown, wherein, the second class image is the non-basic view picture in the above-mentioned code stream, and first kind image is the basic view picture in the above-mentioned code stream; Or second the class image be non-basic tomographic image in the above-mentioned code stream, first kind image is the basic tomographic image in the above-mentioned code stream.
Wherein, obtain unit 510 and can from local storage or disk or CD, obtain code stream, also can obtain the coding side transmitted stream.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
First decoding unit 520 is used for the first kind image of above-mentioned code stream is decoded; According to above-mentioned restrictive condition and decoded first kind image, the second class image in the above-mentioned code stream is decoded.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged among the header or parameter set or SEI message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Be understandable that the function of each functional module of the video decoder 500 of present embodiment can be according to the method specific implementation among the method embodiment shown in Figure 1, its specific implementation process can repeat no more with reference to the associated description of said method embodiment herein.
As can be seen, carry first indication information in the present embodiment code stream to be decoded, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown; Video decoder 500 is decoded to the second class image in the code stream according to restrictive condition, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, so decoding end need not to store multiple reference picture, and (this multiple reference picture can comprise a reference picture of looking that belongs to first kind image, with the time domain reference picture that belongs to the second class image), the movable information that only need store first kind image (namely looking a reference picture) can be finished the prediction of the second class image time domain motion vector in the code stream, and need not store the movable information of time domain reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce greatly the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Referring to Fig. 6, the embodiment of the invention provides a kind of video decoder 600, can comprise: obtain unit 610 and first decoding unit 620.
Obtain unit 610, be used for obtaining code stream;
Second decoding unit 620 is used for the first kind image of above-mentioned code stream is decoded; According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the above-mentioned code stream, prediction obtains the time domain motion vector of the second class image in the above-mentioned code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the above-mentioned code stream is decoded.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps, first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
In some embodiments of the invention, second decoding unit 620 can specifically be used for, and the first kind image in the above-mentioned code stream is decoded; Restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in (shake hands as coding side and decoding end agreement, acquiescence agreement or other stipulated form etc.) above-mentioned code stream according to a preconcerted arrangement, prediction obtains the time domain motion vector of the second class image in the above-mentioned code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the above-mentioned code stream is decoded.
In other embodiment of the present invention, go back portability first indication information in the above-mentioned code stream, wherein first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Be understandable that the function of each functional module of the video decoder 600 of present embodiment can be according to the method specific implementation among the method embodiment shown in Figure 2, its specific implementation process can repeat no more with reference to the associated description of said method embodiment herein.
As can be seen, present embodiment video decoder 600 is decoded to the second class image in the code stream according to restrictive condition, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, so decoding end need not to store multiple reference picture, and (this multiple reference picture can comprise a reference picture of looking that belongs to first kind image, with the time domain reference picture that belongs to the second class image), the movable information that only need store first kind image (namely looking a reference picture) can be finished the prediction of the second class image time domain motion vector in the code stream, and need not store the movable information of time domain reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce greatly the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Referring to Fig. 7, the embodiment of the invention provides a kind of video coding apparatus 700, can comprise: first coding unit 710 and processing unit 720.
Wherein, first coding unit 710, be used for generating code stream, wherein, wherein carry first indication information in the above-mentioned code stream, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the above-mentioned code stream is shown, and the second class image is the non-basic view picture in the above-mentioned code stream, or the second class image is the non-basic tomographic image in the above-mentioned code stream.
Processing unit 720 is used for storage or exports above-mentioned code stream.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Be understandable that the function of each functional module of the video coding apparatus 700 of present embodiment can be according to the method specific implementation among the method embodiment shown in Figure 3, its specific implementation process can repeat no more with reference to the associated description of said method embodiment herein.
As can be seen, in the technical scheme that present embodiment provides, carry first indication information in the code stream that video coding apparatus 700 generates, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown; According to restrictive condition the second class image in the code stream is decoded, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that the coding/decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of coding/decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, predict based on the movable information of decoded first kind image in the above-mentioned code stream and to obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, therefore coding side and decoding end all need not to store multiple reference picture, the movable information that only need store first kind image can be finished the prediction of the second class image time domain motion vector in the code stream, for the movable information that prior art must be stored multiple reference picture, the scheme that the embodiment of the invention provides is conducive to greatly reduce the storage overhead of coding/decoding device, be conducive to simplify the coding/decoding process, and therefore test proof picture quality do not influenced too much.
Referring to Fig. 8, the embodiment of the invention provides a kind of video coding apparatus 800, can comprise: second coding unit 810 and processing unit 820.
Second coding unit 810, be used for generating code stream, wherein, the time domain motion vector of the second class image in the above-mentioned code stream is predicted based on restrictive condition and is obtained, the second class image is the non-basic view picture in the above-mentioned code stream, or the second class image is the non-basic tomographic image in the above-mentioned code stream.
Above-mentioned code stream is stored or exported to processing unit 820.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps, first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
In other embodiment of the present invention, go back portability first indication information in the above-mentioned code stream, wherein first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Be understandable that the function of each functional module of the video coding apparatus 800 of present embodiment can be according to the method specific implementation among the method embodiment shown in Figure 4, its specific implementation process can repeat no more with reference to the associated description of said method embodiment herein.
As can be seen, in the technical scheme of present embodiment, the time domain motion vector of the second class image in the code stream that video coding apparatus 800 generates is predicted based on restrictive condition and is obtained, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that the coding/decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of coding/decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, predict based on the movable information of decoded first kind image in the above-mentioned code stream and to obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, therefore the coding/decoding end need not to store multiple reference picture, the movable information that only need store first kind image can be finished the prediction of the second class image time domain motion vector in the code stream, for the movable information that prior art must be stored multiple reference picture, this programme is conducive to reduce greatly the storage overhead of coding/decoding device, be conducive to simplify the decoding device decode procedure, and therefore test proof picture quality do not influenced too much.
Fig. 9 is the structural representation of a kind of video decoder provided by the invention, as shown in Figure 9, the video decoder of present embodiment comprises at least one bus 901, at least one processor 902 that links to each other with bus 901 and at least one memory 903 that links to each other with bus 901.
Wherein, processor 902 is by bus 901, call the code of storage in the memory 903 to be used for obtaining code stream, wherein, carry first indication information in the above-mentioned code stream, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the above-mentioned code stream is shown, wherein, the second class image is the non-basic view picture in the above-mentioned code stream, and first kind image is the basic view picture in the above-mentioned code stream; Or second the class image be non-basic tomographic image in the above-mentioned code stream, first kind image is the basic tomographic image in the above-mentioned code stream; First kind image in the above-mentioned code stream is decoded; According to above-mentioned restrictive condition and decoded first kind image, the second class image in the above-mentioned code stream is decoded.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
Wherein, can from local storage or disk or CD, obtain code stream, also can obtain the coding side transmitted stream.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged among the header or parameter set or SEI message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or band (slice).
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
The video decoder 900 that present embodiment provides can be for the corresponding part of carrying out of the technical scheme video decoder of carrying out method embodiment shown in Figure 1, and its realization principle and technique effect are similar with it, repeat no more herein.Fig. 9 only is a kind of schematic diagram of the structure of video decoder provided by the invention, and concrete structure can be adjusted according to actual.
As can be seen, in the technical scheme that present embodiment provides, carry first indication information in the code stream to be decoded, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown; Video decoder 900 is decoded to the second class image in the code stream according to restrictive condition, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, so decoding end need not to store multiple reference picture, and (this multiple reference picture can comprise a reference picture of looking that belongs to first kind image, with the time domain reference picture that belongs to the second class image), the movable information that only need store first kind image (namely looking a reference picture) can be finished the prediction of the second class image time domain motion vector in the code stream, and need not store the movable information of time domain reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce greatly the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Figure 10 is the structural representation of a kind of video decoder provided by the invention, as shown in figure 10, the video decoder of present embodiment comprises at least one bus 1001, at least one processor 1002 that links to each other with bus 1001 and at least one memory 1003 that links to each other with bus 1001.
Wherein, processor 1002 calls the code of storage in the memory 1003 to be used for obtaining code stream by bus 1001; First kind image in the above-mentioned code stream is decoded; According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the above-mentioned code stream, prediction obtains the time domain motion vector of the second class image in the above-mentioned code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the above-mentioned code stream is decoded, wherein, the second class image is the non-basic view picture in the above-mentioned code stream, and first kind image is the basic view picture in the above-mentioned code stream; Or second the class image be non-basic tomographic image in the above-mentioned code stream, first kind image is the basic tomographic image in the above-mentioned code stream.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps, first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
In some embodiments of the invention, processor 1002 is according to the restrictive condition of the time domain motion-vector prediction of the second class image in the above-mentioned code stream, the time domain motion vector that prediction obtains the second class image in the above-mentioned code stream can comprise: the restrictive condition of the time domain motion-vector prediction of the second class image in processor 1002 (as coding side and decoding end shake hands agreement, acquiescence agreement or other stipulated form etc.) above-mentioned code stream according to a preconcerted arrangement, predict the time domain motion vector that obtains the second class image in the above-mentioned code stream.
In other embodiment of the present invention, go back portability first indication information in the above-mentioned code stream, wherein first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
As can be seen, processor 1002 in the present embodiment in the video decoder is decoded to the second class image in the code stream according to restrictive condition, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, so decoding end need not to store multiple reference picture, and (this multiple reference picture can comprise a reference picture of looking that belongs to first kind image, with the time domain reference picture that belongs to the second class image), the movable information that only need store first kind image (namely looking a reference picture) can be finished the prediction of the second class image time domain motion vector in the code stream, and need not store the movable information of time domain reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce greatly the storage overhead of decoding device, and therefore test proof picture quality too much do not influenced.
The video decoder 1000 that present embodiment provides can be for the corresponding part of carrying out of the technical scheme video decoder of carrying out method embodiment shown in Figure 2, and its realization principle and technique effect are similar with it, repeat no more herein.Figure 10 only is a kind of schematic diagram of the structure of video decoder provided by the invention, and concrete structure can be adjusted according to actual.
Figure 11 is the structural representation of a kind of video coding apparatus provided by the invention, as shown in figure 11, the video coding apparatus of present embodiment comprises at least one bus 1101, at least one processor 1102 that links to each other with bus 1101 and at least one memory 1103 that links to each other with bus 1101.
Wherein, processor 1102 is by bus 1101, call the code of storage in the memory 1103 to be used for generating code stream, wherein carry first indication information in the above-mentioned code stream, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the above-mentioned code stream is shown, the second class image is the non-basic view picture in the above-mentioned code stream, or the second class image is the non-basic tomographic image in the above-mentioned code stream; Store or export above-mentioned code stream.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
The video coding apparatus 1100 that present embodiment provides can be for the corresponding part of carrying out of the technical scheme video coding apparatus of carrying out method embodiment shown in Figure 3, and its realization principle and technique effect are similar with it, repeat no more herein.Figure 11 only is a kind of schematic diagram of the structure of video coding apparatus provided by the invention, and concrete structure can be adjusted according to actual.
As can be seen, in the technical scheme that present embodiment provides, carry first indication information in the code stream that video coding apparatus 1100 generates, wherein, first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown; According to restrictive condition the second class image in the code stream is decoded, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that the coding/decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of coding/decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, can predict based on the movable information of decoded first kind image in the above-mentioned code stream and obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or, first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, therefore coding side and decoding end all need not to store multiple reference picture, the movable information that only need store first kind image can be finished the prediction of the second class image time domain motion vector in the code stream, for the movable information that prior art must be stored multiple reference picture, the present invention program is conducive to reduce greatly the storage overhead of coding/decoding device, be conducive to simplify the coding/decoding process, and the test proof, therefore picture quality too much do not influenced.
Figure 12 is the structural representation of a kind of video coding apparatus provided by the invention, as shown in figure 12, the video coding apparatus of present embodiment comprises at least one bus 1201, at least one processor 1202 that links to each other with bus 1201 and at least one memory 1203 that links to each other with bus 1201.
Wherein, processor 1202 is by bus 1201, call the code of storage in the memory 1203 to be used for generating code stream, wherein, the time domain motion vector of the second class image in the above-mentioned code stream is predicted based on restrictive condition and is obtained, the second class image is the non-basic view picture in the above-mentioned code stream, or the second class image is the non-basic tomographic image in the above-mentioned code stream; Store or export above-mentioned code stream.
For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
Wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Perhaps, first kind image is the basic tomographic image in the above-mentioned code stream, and the second class image is the non-basic tomographic image in the above-mentioned code stream.
In other embodiment of the present invention, go back portability first indication information in the above-mentioned code stream, wherein first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the code stream is shown.For example, restrictive condition can be the time domain motion vector of the second class image in the code stream, predicts based on the movable information of decoded first kind image in the code stream to obtain, and certainly, also can select other to satisfy the restrictive condition of actual needs.
In some embodiments of the invention, the movable information of first kind image can comprise: each image block of first kind image when other of the motion vector of forward sight/layer and/or first kind image look/layer motion vector etc.The time domain motion vector of the second class image is that each image block of the second class image is when the time domain motion vector of forward sight/layer.
In some embodiments of the invention, the time domain motion vector of the second class image, predict based on the movable information of decoded first kind image in the code stream and to obtain, specifically may be to carry out convergent-divergent by the motion vector with the image block of first kind image, obtain the time domain motion vector of the correspondence image piece of the second class image, wherein, scaling can or equal 1 or less than 1 greater than 1.Certainly, also can pass through other existing mode, based on the movable information of decoded first kind image in the code stream, predict the time domain motion vector that obtains the second class image.
Wherein, the POC of the second class image can equate with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with this second class image POC movable information that equate and decoded first kind image in the code stream.Certainly, the POC of the second class image also may be unequal with the POC of first kind image, that is to say, be that restrictive condition specifically can be: the time domain motion vector of each second class image in the code stream obtains based on predicting with the movable information of the unequal and decoded first kind image of this second class image POC in the code stream.
In some embodiments of the invention, first kind image can be the correspondence image of the second class image, for example first kind image can be the correspondence image of the second class image that POC is equal with it, and the time domain motion vector of the second class image obtains based on the movable information prediction of its correspondence image.Certainly, first kind image also may not be the correspondence image of the second class image that POC is equal with it.
Wherein, if first kind image is the basic view picture in the above-mentioned code stream, the second class image is the non-basic view picture in the above-mentioned code stream, and then first kind image is that a kind of of the second class image looks a reference picture, if wherein the POC of reference picture equates with the POC of current non-base view image.Apparent time under looking under the reference picture is different from present image can think also that then this reference picture is a reference picture of looking of current non-base view image.
In some embodiments of the invention, go back the station location marker of the correspondence image of each second class image of portability in the code stream, so that format compatible.
In some embodiments of the invention, first indication information can be arranged in header or parameter set or auxiliary the enhancing among the informational message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.
First indication information for example can be arranged among code stream SPS or the VPS.First indication information can be a flag bit or other form.
In some embodiments of the invention, above-mentioned code stream for example can be the code stream that meets specific profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing comprise the first corresponding with it indication information in the informational message substantially, wherein, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
The video coding apparatus 1200 that present embodiment provides can be for the corresponding part of carrying out of the technical scheme video coding apparatus of carrying out method embodiment shown in Figure 3, and its realization principle and technique effect are similar with it, repeat no more herein.Figure 12 only is a kind of schematic diagram of the structure of video coding apparatus provided by the invention, and concrete structure can be adjusted according to actual.
As can be seen, in the technical scheme of present embodiment, the time domain motion vector of the second class image in the code stream that video coding apparatus 1200 generates is predicted based on restrictive condition and is obtained, owing to come the reference picture of the second class image is locked by restrictive condition, therefore be conducive to reduce the movable information that the coding/decoding end is stored useless reference picture, for the movable information that prior art must be stored multiple possibility reference picture, the present invention program is conducive to reduce the storage overhead of coding/decoding device, and therefore test proof picture quality too much do not influenced.
Further, if restrictive condition is the time domain motion vector of the second class image in the code stream, predict based on the movable information of decoded first kind image in the code stream and to obtain, because the reference picture to the second class image locks targetedly, the time domain motion vector of the second class image of each in the code stream, predict based on the movable information of decoded first kind image in the above-mentioned code stream and to obtain, wherein, first kind image is the basic view picture in the above-mentioned code stream, and the second class image is the non-basic view picture in the above-mentioned code stream; Or first kind image is the basic tomographic image in the above-mentioned code stream, the second class image is the non-basic tomographic image in the above-mentioned code stream, therefore the coding/decoding end need not to store multiple reference picture, the movable information that only need store first kind image can be finished the prediction of the second class image time domain motion vector in the code stream, for the movable information that prior art must be stored multiple reference picture, this programme is conducive to reduce greatly the storage overhead of coding/decoding device, be conducive to simplify the decoding device decode procedure, and therefore test proof picture quality do not influenced too much.
Be understandable that, video coding/decoding device of the present invention can be the video coder/decoder or has disposed the equipment of video coder/decoder, for example, video coding apparatus can be deployed in digital camera, mobile phone, television set, computer or other can carry out among the equipment of video record, perhaps, video coding apparatus can be that digital camera, mobile phone, television set, computer or other can carry out the equipment of video record.In like manner, video decoder of the present invention for example can be deployed in digital camera, mobile phone, television set, computer or other can carry out among the equipment of video playback.Perhaps video decoder is that digital camera, mobile phone, television set, computer or other can carry out the equipment of video playback.
The embodiment of the invention also provides a kind of computer-readable storage medium, and wherein, this computer-readable storage medium can have program stored therein, and this program comprises the part or all of step of the method for video coding of putting down in writing among the said method embodiment when carrying out.
The embodiment of the invention also provides a kind of computer-readable storage medium, and wherein, this computer-readable storage medium can have program stored therein, and this program comprises the part or all of step of the video encoding/decoding method of putting down in writing among the said method embodiment when carrying out.
Figure 13 is the structural representation of another kind of Video Decoder provided by the invention.
As shown in figure 13, the Video Decoder 1300 of present embodiment can comprise: entropy coding unit 1301, coefficient scanning unit 1302, inverse quantization unit 1303, inverse transformation block 1304, predicting unit 1305, adder 1306 and frame buffer unit 1307.
Wherein, entropy coding unit 1301, coefficient scanning unit 1302, inverse quantization unit 1303 and inverse transformation block 1304 are used for the video code flow of input is carried out entropy coding, re-quantization and inversion process, obtain residual block.Predicting unit 1305 is used for obtaining the prediction piece.Adder 1306 is used for predicting that piece and residual block carry out add operation and obtain reconstructed blocks.Frame buffer unit 1307 is used for the storage reconstructed blocks, shows in order to reconstructed blocks is outputed to display.
Wherein, predicting unit 1305 can obtain the prediction piece according to first indication information in the code stream.
In some embodiments of the invention, first indication information can be arranged among the header or parameter set or SEI message of above-mentioned code stream.Wherein, this header or parameter set or SEI message can be mainly used in describing the relevant information of video, sequence, image or slice.First indication information for example can be arranged among code stream SPS or the VPS, and perhaps, first indication information also can be positioned among the SEI message.Wherein, first indication information can be a flag bit or other form.
In some embodiments of the invention, video code flow can be the code stream that meets specific profile, wherein all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-look substantially or the header of non-basic layer or parameter set or SEI message in comprise the first corresponding with it indication information, wherein, each is non-look substantially or the header of non-basic layer or parameter set or SEI message in first indication information that comprises, can be used for indicating this non-looking substantially or the time domain motion vector of each image of non-basic layer, look substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
Be understandable that, the function of each functional module of the Video Decoder 1300 of present embodiment can be according to the method specific implementation among Fig. 1 or the method embodiment shown in Figure 2, its specific implementation process can repeat no more with reference to the associated description of said method embodiment herein.
Video Decoder 1300 for example can be deployed in digital camera, mobile phone, television set, computer or other can carry out among the equipment of video playback.
Need to prove, for aforesaid each method embodiment, for simple description, so it all is expressed as a series of combination of actions, but those skilled in the art should know, the present invention is not subjected to the restriction of described sequence of movement, because according to the present invention, some step can adopt other orders or carry out simultaneously.Secondly, those skilled in the art also should know, the embodiment described in the specification all belongs to preferred embodiment, and related action and module might not be that the present invention is necessary.
In the above-described embodiments, the description of each embodiment is all emphasized particularly on different fields, do not have the part that describes in detail among certain embodiment, can be referring to the associated description of other embodiment.
In several embodiment that the application provides, should be understood that disclosed device can be realized by other mode.For example, device embodiment described above only is schematic, the for example division of said units, only be that a kind of logic function is divided, during actual the realization other dividing mode can be arranged, for example a plurality of unit or assembly can in conjunction with or can be integrated into another system, or some features can ignore, or do not carry out.Another point, the shown or coupling each other discussed or directly to be coupled or to communicate to connect can be by some interfaces, the indirect coupling of device or unit or communicate to connect can be electrically or other form.
Above-mentioned unit as separating component explanation can or can not be physically to separate also, and the parts that show as the unit can be or can not be physical locations also, namely can be positioned at a place, perhaps also can be distributed on a plurality of network element.Can select some or all of unit wherein to realize present embodiment scheme purpose according to the actual needs.
In addition, each functional unit in each embodiment of the present invention can be integrated in the processing unit, also can be that the independent physics in each unit exists, and also can be integrated in the unit two or more unit.Above-mentioned integrated unit both can adopt the form of hardware to realize, also can adopt the form of SFU software functional unit to realize.
If above-mentioned integrated unit is realized with the form of SFU software functional unit and during as independently production marketing or use, can be stored in the computer read/write memory medium.Based on such understanding, the part that technical scheme of the present invention contributes to prior art in essence in other words or this technical scheme is all or part of can embody with the form of software product, this computer software product is stored in the storage medium, comprises that some instructions are used so that a computer equipment (can be personal computer, server or the network equipment etc.) is carried out all or part of step of each embodiment said method of the present invention.And aforesaid storage medium comprises: various media that can be program code stored such as USB flash disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), portable hard drive, magnetic disc or CD.
More than above-mentioned, above embodiment only in order to technical scheme of the present invention to be described, is not intended to limit; Although with reference to previous embodiment the present invention is had been described in detail, those of ordinary skill in the art is to be understood that: it still can be made amendment to the technical scheme that aforementioned each embodiment puts down in writing, and perhaps part technical characterictic wherein is equal to replacement; And these modifications or replacement do not make appropriate technical solution essence break away from various embodiments of the present invention technical scheme spirit and scope.

Claims (20)

1. a video encoding/decoding method is characterized in that, comprising:
Obtain code stream, wherein, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, wherein, the described second class image is the non-basic view picture in the described code stream, and first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream;
First kind image in the described code stream is decoded;
According to described restrictive condition and decoded first kind image, the second class image in the described code stream is decoded.
2. method according to claim 1 is characterized in that, described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain.
3. method according to claim 2 is characterized in that,
The image sequence POC of the described second class image and the POC of described first kind image equate.
4. according to each described method of claim 1 to 3, it is characterized in that described first kind image is the correspondence image of the described second class image.
5. method according to claim 4 is characterized in that, described first kind image is the correspondence image of the second class image that equates with the POC of described first kind image.
6. according to each described method of claim 1 to 5, it is characterized in that described first indication information is arranged in header or parameter set or auxiliary the enhancing among the informational message of described code stream.
7. according to each described method of claim 1 to 5, it is characterized in that,
Described code stream is the code stream that meets specific class profile, wherein, all code streams comprise at least one non-looking substantially or non-basic layer, wherein, each is non-looks or the header of non-basic layer or parameter set or auxiliary the enhancing in the informational message comprise first indication information substantially, each is non-looks or the header of non-basic layer or parameter set or the auxiliary indicated restrictive condition of first indication information that comprises in the informational message that strengthens are substantially: this is non-looks or the time domain motion vector of each image of non-basic layer substantially, looks substantially or the movable information of basic tomographic image is predicted and obtained based on decoded.
8. a video encoding/decoding method is characterized in that, comprising:
Obtain code stream;
First kind image in the described code stream is decoded;
According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the described code stream, prediction obtains the time domain motion vector of the second class image in the described code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the described code stream is decoded, wherein, the described second class image is the non-basic view picture in the described code stream, and described first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream.
9. method according to claim 8 is characterized in that,
Described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain;
Described restrictive condition and decoded first kind image according to the time domain motion-vector prediction of the second class image in the described code stream, prediction obtains the time domain motion vector of the second class image in the described code stream, comprise: the restrictive condition of the time domain motion-vector prediction of the second class image and decoded first kind image in the described code stream according to a preconcerted arrangement, prediction obtains the time domain motion vector of the second class image in the described code stream.
10. a method for video coding is characterized in that, comprising:
Generate code stream, wherein, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, the described second class image is the non-basic view picture in the described code stream, or the described second class image is the non-basic tomographic image in the described code stream;
Store or export described code stream.
11. method according to claim 10 is characterized in that,
Described restrictive condition is: the time domain motion vector of the second class image described in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, wherein, if the described second class image is the non-basic view picture in the described code stream, then described first kind image is the basic view picture in the described code stream; Perhaps, if the described second class image is the non-basic tomographic image in the described code stream, then described first kind image is the basic tomographic image in the described code stream.
12. a method for video coding is characterized in that, comprising:
Generate code stream, wherein, the time domain motion vector of the second class image in the described code stream predicts based on restrictive condition and obtains that the described second class image is the non-basic view picture in the described code stream, or the described second class image is the non-basic tomographic image in the described code stream;
Store or export described code stream.
13. method according to claim 12 is characterized in that,
Described restrictive condition is: the time domain motion vector of the second class image described in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, wherein, if the described second class image is the non-basic view picture in the described code stream, then described first kind image is the basic view picture in the described code stream; Perhaps, if the described second class image is the non-basic tomographic image in the described code stream, then described first kind image is the basic tomographic image in the described code stream.
14. a video decoder is characterized in that, comprising:
Obtain the unit, be used for obtaining code stream, wherein, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, wherein, the described second class image is the non-basic view picture in the described code stream, and first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream;
First decoding unit is used for the first kind image of described code stream is decoded, and according to described restrictive condition and decoded first kind image, the second class image in the code stream of described acquisition unit acquisition is decoded.
15. video decoder according to claim 14 is characterized in that, described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain.
16., it is characterized in that the image sequence POC of the described second class image and the POC of described first kind image equate according to claim 14 or 15 described video decoders.
17. a video decoder is characterized in that, comprising:
Obtain the unit, be used for obtaining code stream;
Second decoding unit is used for the first kind image of described code stream is decoded; According to restrictive condition and the decoded first kind image of the time domain motion-vector prediction of the second class image in the described code stream, prediction obtains the time domain motion vector of the second class image in the described code stream; Utilize the time domain motion vector of predicting the second class image that obtains that the second class image in the described code stream is decoded, wherein, the described second class image is the non-basic view picture in the described code stream, and described first kind image is the basic view picture in the described code stream; Or the described second class image is the non-basic tomographic image in the described code stream, and described first kind image is the basic tomographic image in the described code stream.
18. video decoder according to claim 17 is characterized in that, described restrictive condition is the time domain motion vector of the second class image described in the described code stream, predicts based on the movable information of decoded first kind image in the described code stream to obtain;
Described second decoding unit specifically is used for, and the first kind image in the described code stream is decoded; Described restrictive condition according to a preconcerted arrangement and decoded first kind image, prediction obtains the time domain motion vector of the second class image in the described code stream; Utilize the time domain motion vector of predicting the second class image that obtains, the second class image in the described code stream is decoded.
19. a video coding apparatus is characterized in that, comprising:
First coding unit, be used for generating code stream, carry first indication information in the described code stream, described first indication information is used in reference to the restrictive condition that the time domain motion-vector prediction of the second class image in the described code stream is shown, wherein, the described second class image is the non-basic view picture in the described code stream, or the described second class image is the non-basic tomographic image in the described code stream;
Processing unit is used for storage or exports described code stream.
20. a video coding apparatus is characterized in that, comprising:
Second coding unit, be used for generating code stream, wherein, the time domain motion vector of each the second class image in the described code stream, predict based on the movable information of decoded first kind image in the described code stream and to obtain, described first kind image is the basic view picture in the described code stream, and the described second class image is the non-basic view picture in the described code stream; Perhaps described first kind image is the basic tomographic image in the described code stream, and the described second class image is the non-basic tomographic image in the described code stream;
Described code stream is stored or exported to processing unit.
CN201310119699.9A 2013-04-08 2013-04-08 Method for video coding and video encoding/decoding method and relevant apparatus Active CN103237213B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201310119699.9A CN103237213B (en) 2013-04-08 2013-04-08 Method for video coding and video encoding/decoding method and relevant apparatus
PCT/CN2014/072458 WO2014166319A1 (en) 2013-04-08 2014-02-24 Video coding method, video decoding method, and related apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310119699.9A CN103237213B (en) 2013-04-08 2013-04-08 Method for video coding and video encoding/decoding method and relevant apparatus

Publications (2)

Publication Number Publication Date
CN103237213A true CN103237213A (en) 2013-08-07
CN103237213B CN103237213B (en) 2016-03-30

Family

ID=48885226

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310119699.9A Active CN103237213B (en) 2013-04-08 2013-04-08 Method for video coding and video encoding/decoding method and relevant apparatus

Country Status (2)

Country Link
CN (1) CN103237213B (en)
WO (1) WO2014166319A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103747264A (en) * 2014-01-03 2014-04-23 华为技术有限公司 Motion vector prediction method, coding equipment and decoding equipment
WO2014166319A1 (en) * 2013-04-08 2014-10-16 华为技术有限公司 Video coding method, video decoding method, and related apparatus
CN106416250A (en) * 2013-12-02 2017-02-15 诺基亚技术有限公司 Video encoding and decoding
CN106464911A (en) * 2014-06-25 2017-02-22 高通股份有限公司 Recovery point SEI message in multi-layer video codecs
CN109905703A (en) * 2013-10-11 2019-06-18 Vid拓展公司 The high level syntax of HEVC extension

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101873484A (en) * 2009-08-13 2010-10-27 杭州海康威视软件有限公司 Method and device for selecting coding mode in layered video coding
US20120189061A1 (en) * 2004-10-21 2012-07-26 Samsung Electronics Co., Ltd. Method and apparatus for effectively compressing motion vectors in video coder based on multi-layer
US20130070854A1 (en) * 2011-09-17 2013-03-21 Qualcomm Incorporated Motion vector determination for video coding
CN103024397A (en) * 2013-01-07 2013-04-03 华为技术有限公司 Method and device for determining time domain motion vector predictor

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102857764B (en) * 2011-07-01 2016-03-09 华为技术有限公司 The method and apparatus of intra prediction mode process
CN103237213B (en) * 2013-04-08 2016-03-30 华为技术有限公司 Method for video coding and video encoding/decoding method and relevant apparatus

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120189061A1 (en) * 2004-10-21 2012-07-26 Samsung Electronics Co., Ltd. Method and apparatus for effectively compressing motion vectors in video coder based on multi-layer
CN101873484A (en) * 2009-08-13 2010-10-27 杭州海康威视软件有限公司 Method and device for selecting coding mode in layered video coding
US20130070854A1 (en) * 2011-09-17 2013-03-21 Qualcomm Incorporated Motion vector determination for video coding
CN103024397A (en) * 2013-01-07 2013-04-03 华为技术有限公司 Method and device for determining time domain motion vector predictor

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014166319A1 (en) * 2013-04-08 2014-10-16 华为技术有限公司 Video coding method, video decoding method, and related apparatus
CN109905703A (en) * 2013-10-11 2019-06-18 Vid拓展公司 The high level syntax of HEVC extension
CN109905703B (en) * 2013-10-11 2023-11-17 Vid拓展公司 High level syntax for HEVC extensions
CN106416250A (en) * 2013-12-02 2017-02-15 诺基亚技术有限公司 Video encoding and decoding
US10230965B2 (en) 2013-12-02 2019-03-12 Nokia Technologies Oy Video encoding and decoding
US10652559B2 (en) 2013-12-02 2020-05-12 Nokia Technologies Oy Video encoding and decoding
CN106416250B (en) * 2013-12-02 2020-12-04 诺基亚技术有限公司 Video encoding and decoding
CN103747264A (en) * 2014-01-03 2014-04-23 华为技术有限公司 Motion vector prediction method, coding equipment and decoding equipment
CN103747264B (en) * 2014-01-03 2017-10-17 华为技术有限公司 Method, encoding device and the decoding device of predicted motion vector
CN106464911A (en) * 2014-06-25 2017-02-22 高通股份有限公司 Recovery point SEI message in multi-layer video codecs

Also Published As

Publication number Publication date
CN103237213B (en) 2016-03-30
WO2014166319A1 (en) 2014-10-16

Similar Documents

Publication Publication Date Title
JP6550633B2 (en) Predictive Parameter Inheritance for 3D Video Coding
KR100888962B1 (en) Method for encoding and decoding video signal
CN112823518A (en) Apparatus and method for inter prediction of triangularly partitioned blocks of coded blocks
CN111448800B (en) Affine motion prediction based image decoding method and apparatus using affine MVP candidate list in image coding system
JP2016514378A (en) Content-adaptive interactive or functional predictive multi-pass pictures for highly efficient next-generation video coding
US20230353768A1 (en) Method and apparatus for processing video signal using affine prediction
JP7314300B2 (en) Method and apparatus for intra prediction
JP2023157942A (en) Method and device for affine-based inter prediction of chroma subblock
CN111630859A (en) Method and apparatus for image decoding according to inter prediction in image coding system
JP2023143935A (en) Encoder, decoder and corresponding method for sub-block partitioning mode
CN113660497B (en) Encoder, decoder and corresponding methods using IBC merge lists
CN103237213B (en) Method for video coding and video encoding/decoding method and relevant apparatus
JP2022521757A (en) Methods and equipment for intra-prediction using linear models
JP2014527782A (en) Multi-view video coding method
CN112673633A (en) Encoder, decoder and corresponding methods for merging modes
CN103079067A (en) Motion vector predicted value list construction method and video encoding and decoding method and device
JP7257524B2 (en) Side motion refinement in video encoding/decoding systems
CN115428448A (en) Image encoding/decoding method and apparatus based on inter prediction and recording medium storing bitstream
CN114982244A (en) Image encoding device and method
US20140192880A1 (en) Inter layer motion data inheritance
CN112204980A (en) Method and apparatus for inter prediction in video coding system
CN110679151A (en) Video coding using parametric motion models
KR20060069227A (en) Method and apparatus for deriving motion vectors of macro blocks from motion vectors of pictures of base layer when encoding/decoding video signal
KR20080055686A (en) A method and apparatus for decoding/encoding a video signal
CN114946191A (en) Image encoding apparatus and method based on signaling of information for filtering

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant