WO2009065325A1 - A video encoding/decoding method and a video encoder/decoder - Google Patents
A video encoding/decoding method and a video encoder/decoder
- Publication number
- WO2009065325A1 (application PCT/CN2008/072675)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- view
- layer
- information
- prediction information
- coding
- Prior art date
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/106—Processing image signals
- H04N13/161—Encoding, multiplexing or demultiplexing different image signal components
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
- H04N21/234327—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/106—Processing image signals
- H04N13/167—Synchronising or controlling image signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/40—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/587—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/59—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/2365—Multiplexing of several video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4347—Demultiplexing of several video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2213/00—Details of stereoscopic systems
- H04N2213/007—Aspects relating to detection of stereoscopic image format, e.g. for adaptation to the display format
Definitions
- The present invention relates to the field of video technologies, and in particular to a video encoding and decoding method, a video encoder, and a video decoder.
Background
- Traditional two-dimensional video is a planar information carrier: it can express the content of a scene but not its depth. When people observe the world around them, they perceive not only the width and height of an object but also its depth, and can judge the distance between objects or between the viewer and an object. This three-dimensional visual ability arises because people view objects with both eyes at the same time. Because of the spacing between the eyes, the left and right eyes receive slightly different images of an object at a given distance, and the brain combines the two images to create a sense of depth. As video technology has developed, viewers are no longer satisfied with the experience of two-dimensional video and seek a better, more immersive experience. Stereoscopic video technology has become one of the key technologies for achieving this goal.
- Stereoscopic video technology is based on the principle of human binocular parallax. Cameras capture two images of the same scene from different angles, and the two images are displayed on the screen simultaneously or sequentially so that each eye views one of them, producing a stereoscopic impression. Compared with traditional two-dimensional video, stereoscopic video therefore has two video streams.
- The data traffic required for stereoscopic video is consequently twice that of two-dimensional video, and this increase poses great challenges for storage and transmission. Simply increasing storage capacity and network bandwidth is not enough to solve the problem; an efficient coding method is needed to compress stereoscopic video data.
- Many kinds of stereoscopic display devices exist, such as head-mounted displays, stereoscopic glasses, holographic display devices, and auto-stereoscopic displays of various resolutions.
- Different stereoscopic displays require different levels of stereoscopic video content.
- The bandwidth of the networks to which these displays are connected also differs, so the same stereoscopic display connected through different networks may likewise require different levels of stereoscopic video content.
- For example, a stereoscopic display device connected to a high-speed network can request rich stereoscopic information according to its own resolution and thus display high-quality stereoscopic video; in other cases, a stereoscopic display is limited by its own capabilities or by network bandwidth.
- Existing stereoscopic video encoding and decoding methods support only a two-dimensional display and a three-dimensional display: one of the two binocular views is used as the reference view and encoded with a standard coding method, and the other view is encoded with reference to the reference view. The display side can then decode the reference view alone for two-dimensional display, or decode all the content for three-dimensional display, but this cannot satisfy the requirements of stereoscopic display devices connected through different networks for stereoscopic views of different display levels.
- The embodiments of the present invention provide a video encoding and decoding method, a video encoder, and a video decoder that implement hierarchical coding of a stereoscopic view and meet the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected through different networks.
- the embodiment of the invention provides a video encoding method, including:
- An embodiment of the present invention provides a video encoder, including:
- a base layer coding module, configured to perform base layer coding using the first view as the reference view;
- at least one prediction information extraction module, configured to extract at least one layer of prediction information by combining the locally decoded first view with the second view;
- an enhancement layer coding module, configured to perform enhancement layer coding separately on the at least one layer of prediction information;
- a multiplexing module, configured to multiplex the enhancement layer coding with the base layer coding of the first view to obtain coded information.
- the embodiment of the invention provides a video decoding method, including:
- An embodiment of the present invention provides a video decoder, including:
- a demultiplexing module, configured to demultiplex the received coded information to obtain a base layer coding and an enhancement layer coding;
- a base layer decoding module, configured to decode the base layer coding to obtain a first view as the reference view;
- an enhancement layer decoding module, configured to decode the enhancement layer coding to obtain at least one layer of prediction information;
- a prediction module, configured to predict the second view according to the at least one layer of prediction information and the first view.
- the embodiment of the invention further provides a video encoding method, including:
- The current-layer prediction information increment is extracted as follows: the increment is extracted according to the locally decoded first view, the second view, and the previous layer's prediction information, and enhancement layer coding is performed on that increment; and so on, until enhancement layer coding has been performed on the last layer's prediction information increment. The base layer coding and the enhancement layer coding are then multiplexed to obtain the coded information.
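- As an illustration of the increment scheme described above, the following is a minimal Python sketch. It assumes, which the source does not state, that each layer's prediction information is a disparity map on a common grid and that an increment is the element-wise difference from the previous layer. The names `DisparityMap` and `to_increments` are hypothetical.

```python
from typing import List

# Assumption: one disparity value per block, with every layer on the same grid.
DisparityMap = List[int]


def to_increments(layers: List[DisparityMap]) -> List[DisparityMap]:
    """Return [layer_1, layer_2 - layer_1, layer_3 - layer_2, ...].

    The first entry is the full first-layer prediction information; every
    later entry is the increment relative to the previous layer.
    """
    if not layers:
        return []
    out = [list(layers[0])]
    for prev, cur in zip(layers, layers[1:]):
        out.append([c - p for c, p in zip(cur, prev)])
    return out


# Toy data: three progressively refined disparity maps.
sparse = [4, 4, 4, 4]
dense = [4, 5, 4, 3]
fine = [4, 5, 5, 3]
increments = to_increments([sparse, dense, fine])
```

- Under this assumption the increments are mostly small values, which is what would make coding them cheaper than coding each layer in full.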
- An embodiment of the present invention further provides a video encoder, including:
- a base layer coding module, configured to perform base layer coding using the first view as the reference view;
- at least two layers of prediction information extraction modules, wherein the first-layer prediction information extraction module is connected to the base layer coding module and is configured to extract the first layer of prediction information by combining the locally decoded first view with the second view;
- each prediction information extraction module other than the first-layer module is connected to the previous layer's prediction information extraction module and is configured to extract the current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer's prediction information;
- an enhancement layer coding module, configured to perform enhancement layer coding on the first layer of prediction information and on the prediction information increments of the subsequent layers;
- a multiplexing module, configured to multiplex the base layer coding and the enhancement layer coding to obtain coded information.
- the embodiment of the invention further provides a video decoding method, including:
- the second view is predicted based on the prediction information of the at least two layers and the first view.
- An embodiment of the present invention further provides a video decoder, including:
- a demultiplexing module, configured to demultiplex the received coded information to obtain a base layer coding and an enhancement layer coding;
- a base layer decoding module, configured to decode the base layer coding to obtain a first view as the reference view;
- an enhancement layer decoding module, configured to decode the enhancement layer coding to obtain the first layer of prediction information and the prediction information increments of several further layers;
- a calculation module, configured to calculate at least two layers of prediction information from the first layer of prediction information and the prediction information increments;
- a prediction module, configured to predict the second view according to the at least two layers of prediction information and the first view.
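- The calculation module's role can be sketched as a running sum. This is a hedged illustration only: it assumes that increments are additive, element-wise differences between consecutive layers, which the source does not specify, and `rebuild_layers` is a hypothetical name.

```python
from typing import List


def rebuild_layers(first_layer: List[int],
                   increments: List[List[int]]) -> List[List[int]]:
    """Recover every layer of prediction information from the first layer
    plus the per-layer increments (assumed to be additive differences)."""
    layers = [list(first_layer)]
    for inc in increments:
        # Each new layer is the previous layer plus its increment.
        layers.append([a + b for a, b in zip(layers[-1], inc)])
    return layers


layers = rebuild_layers([4, 4, 4, 4], [[0, 1, 0, -1], [0, 0, 1, 0]])
```

- A decoder that only needs a lower display level could stop after the first increments, since each layer depends only on the layers before it.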
- The video encoding and decoding methods, the video encoder, and the video decoder in the embodiments of the present invention implement hierarchical coding of the stereoscopic view by extracting at least one layer of prediction information and performing enhancement layer coding on each layer, which satisfies the requirements of stereoscopic display devices connected through different networks for stereoscopic views of different display levels.
- FIG. 1 is a flowchart of Embodiment 1 of a video encoding method according to the present invention
- FIG. 2 is a flowchart of Embodiment 2 of a video encoding method according to the present invention
- FIG. 3 is a flowchart of Embodiment 3 of a video encoding method according to the present invention.
- FIG. 4 is a flowchart of Embodiment 4 of a video encoding method according to the present invention.
- FIG. 5 is a schematic structural diagram of Embodiment 1 of a video encoder according to the present invention.
- FIG. 6 is a schematic structural diagram of Embodiment 2 of a video encoder according to the present invention.
- FIG. 7 is a flowchart of Embodiment 1 of a video decoding method according to the present invention.
- FIG. 8 is a flowchart of Embodiment 2 of a video decoding method according to the present invention.
- FIG. 9 is a flowchart of Embodiment 3 of a video decoding method according to the present invention.
- FIG. 10 is a flowchart of Embodiment 4 of a video decoding method according to the present invention.
- FIG. 11 is a schematic structural diagram of Embodiment 1 of a video decoder according to the present invention
- FIG. 12 is a flowchart of Embodiment 1 of another video encoding method according to the present invention
- FIG. 14 is a flowchart of Embodiment 3 of another video encoding method according to the present invention
- FIG. 15 is a flowchart of Embodiment 4 of another video encoding method according to the present invention
- FIG. 16 is a schematic structural diagram of Embodiment 1 of another video encoder according to the present invention
- FIG. 17 is a schematic structural diagram of Embodiment 2 of another video encoder according to the present invention
- FIG. 18 is a flowchart of Embodiment 1 of another video decoding method according to the present invention
- FIG. 19 is a flowchart of Embodiment 2 of another video decoding method according to the present invention
- FIG. 21 is a flowchart of Embodiment 4 of another video decoding method provided by the present invention
- FIG. 22 is a schematic structural diagram of Embodiment 1 of another video decoder according to the present invention.
Detailed description
- Embodiment 1 of a video encoding method provided by the present invention:
- FIG. 1 is a flowchart of Embodiment 1 of a video encoding method according to the present invention, which includes the following steps:
- Step 101: Perform base layer coding on the first view as the reference view, and extract at least one layer of prediction information by combining the locally decoded first view with the second view. The first view and the second view may be a left-eye view and a right-eye view, respectively, and the prediction information may be motion vector information and/or depth/disparity information.
- Step 102: Perform enhancement layer coding separately on the at least one layer of prediction information.
- Step 103: Multiplex the enhancement layer coding with the base layer coding of the first view to obtain coded information.
- In this embodiment, hierarchical coding of the stereoscopic view is implemented, which satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected through different networks.
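- Steps 101 to 103 can be sketched end to end as follows. This is a toy illustration, not the patent's codec: coarse quantisation stands in for base layer coding, a single global disparity stands in for one layer of prediction information, and the length-prefixed framing used for multiplexing is an assumption. All function names and `QSTEP` are hypothetical.

```python
import json
import struct
from typing import List

QSTEP = 4  # hypothetical quantisation step standing in for a real base-layer codec


def encode_base(view: List[int]) -> bytes:
    """Toy base layer coding: quantise 8-bit samples."""
    return bytes(p // QSTEP for p in view)


def decode_base(code: bytes) -> List[int]:
    """Local decoding of the toy base layer."""
    return [q * QSTEP for q in code]


def extract_prediction(ref: List[int], second: List[int], max_shift: int = 3) -> int:
    """One layer of prediction information: the global shift d that minimises
    the sum of absolute differences between second[i] and ref[i + d]."""
    def cost(d: int) -> int:
        last = len(ref) - 1
        return sum(abs(second[i] - ref[min(max(i + d, 0), last)])
                   for i in range(len(second)))
    return min(range(-max_shift, max_shift + 1), key=cost)


def encode_enhancement(info: int) -> bytes:
    """Toy enhancement layer coding: serialise the prediction information."""
    return json.dumps(info).encode()


def multiplex(base: bytes, enh_layers: List[bytes]) -> bytes:
    """Assumed framing: each chunk is preceded by its 4-byte big-endian length."""
    return b"".join(struct.pack(">I", len(c)) + c for c in [base] + enh_layers)


left = [0, 0, 40, 80, 120, 80, 40, 0]     # first (reference) view, one row
right = [0, 40, 80, 120, 80, 40, 0, 0]    # second view, shifted by one sample

base = encode_base(left)                          # step 101: base layer coding
local = decode_base(base)                         # step 101: local decoding
info = [extract_prediction(local, right)]         # step 101: prediction information
enh = [encode_enhancement(x) for x in info]       # step 102: enhancement layer coding
stream = multiplex(base, enh)                     # step 103: multiplexing
```

- Extracting the prediction information from the locally decoded view, rather than from the original first view, keeps the encoder working from the same reference that the decoder will reconstruct.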
- FIG. 2 is a flowchart of Embodiment 2 of a video encoding method according to the present invention.
- In this embodiment, depth/disparity information is used as the prediction information and one layer of depth/disparity information is extracted. Assuming that sparse depth/disparity information is to be extracted, this embodiment includes the following steps:
- Step 201: Use two or more cameras to shoot the same scene from different angles and obtain two views, a left-eye view and a right-eye view.
- Step 202: Select either the left-eye view or the right-eye view as the reference view and perform base layer coding on it. In this embodiment, the left-eye view is selected as the reference view.
- Step 203: Locally decode the base-layer-coded left-eye view and, in combination with the right-eye view, extract sparse depth/disparity information; the sparse depth/disparity information corresponds to a stereoscopic view display level acquired in advance.
- Step 204: Perform enhancement layer coding on the sparse depth/disparity information.
- Step 205: Multiplex the base layer coding of the left-eye view and the enhancement layer coding to obtain coded information.
- The stereoscopic view display level acquired in advance may be determined from a preset number of layers and levels of depth/disparity information to be extracted, or by adding the following step before step 203:
- Step 2021: Analyze the request information of the display device and/or the network transmission information. For example, when the network is congested, only a small amount of content can be transmitted, so the required stereoscopic view display level is low and extraction of sparse depth/disparity information may be requested.
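- The analysis in step 2021 could look like the following minimal sketch. The three level names mirror the sparse, dense, and fine layers used elsewhere in this document, but the bandwidth thresholds, the `congested` flag, and the rule of taking the lower of the device request and the network limit are all assumptions made for illustration.

```python
from enum import IntEnum


class DisplayLevel(IntEnum):
    SPARSE = 1
    DENSE = 2
    FINE = 3


def required_level(requested: DisplayLevel,
                   bandwidth_kbps: float,
                   congested: bool) -> DisplayLevel:
    """Combine the display device's request with the network condition.

    Thresholds are hypothetical; a congested network caps the level at SPARSE.
    """
    if congested or bandwidth_kbps < 1000:
        network_cap = DisplayLevel.SPARSE
    elif bandwidth_kbps < 4000:
        network_cap = DisplayLevel.DENSE
    else:
        network_cap = DisplayLevel.FINE
    # The stream cannot exceed what either the device or the network supports.
    return min(requested, network_cap)


level_when_congested = required_level(DisplayLevel.FINE, 8000.0, congested=True)
level_on_fast_network = required_level(DisplayLevel.FINE, 8000.0, congested=False)
```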
- The prediction information in this embodiment may also be motion vector information or a combination of depth/disparity information and motion vector information; base layer coding and enhancement layer coding may use discrete cosine transform coding with motion compensation.
- The single layer of prediction information in this embodiment may also be dense prediction information or fine prediction information.
- In this embodiment, hierarchical coding of the stereoscopic view is implemented, which satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected through different networks.
- In addition, depth/disparity information of the corresponding level can be extracted according to the specific requirements of the display device and the network conditions, which improves coding efficiency, reduces coding complexity, and further improves network transmission efficiency.
- This embodiment also multiplexes the base layer coding, from which a two-dimensional view can be displayed, so it remains compatible with two-dimensional display.
- Embodiment 3 of a video encoding method provided by the present invention:
- FIG. 3 is a flowchart of Embodiment 3 of a video encoding method according to the present invention.
- In this embodiment, depth/disparity information is used as the prediction information, and the number of layers and the levels of the depth/disparity information to be extracted may be preset.
- Taking the extraction of three layers of depth/disparity information as an example, the technical solution of this embodiment is described using, from coarse to fine, sparse depth/disparity information, dense depth/disparity information, and fine depth/disparity information.
- The video encoding method of this embodiment performs the following steps:
- Step 301: Use two or more cameras to shoot the same scene from different angles and obtain two views, a left-eye view and a right-eye view.
- Step 302: Select either the left-eye view or the right-eye view as the reference view and perform base layer coding on it. In this embodiment, the left-eye view is selected as the reference view.
- Step 303: Locally decode the base-layer-coded left-eye view and, in combination with the right-eye view, perform sparse depth/disparity information extraction, dense depth/disparity information extraction, and fine depth/disparity information extraction.
- Step 304: Perform enhancement layer coding separately on the sparse depth/disparity information, the dense depth/disparity information, and the fine depth/disparity information.
- Step 305: Multiplex the base layer coding of the left-eye view and the enhancement layer coding to obtain coded information.
- The prediction information in the video encoding method of this embodiment may also be motion vector information, or a combination of depth/disparity information and motion vector information; base layer coding and enhancement layer coding may use discrete cosine transform coding with motion compensation.
- In this embodiment, hierarchical coding of the stereoscopic view is implemented, which satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected through different networks. The base layer coding is also multiplexed, and a two-dimensional view can be displayed from it, so the embodiment remains compatible with two-dimensional display.
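- One way to picture the coarse-to-fine extraction in step 303 is block matching at decreasing block sizes. The sketch below is an assumption-laden illustration: the source does not define how sparse, dense, and fine information differ, so block granularity is used here as a stand-in, on a single image row. `block_disparity` and its parameters are hypothetical names.

```python
from typing import Dict, List


def block_disparity(ref: List[int], target: List[int],
                    block: int, max_d: int = 3) -> List[int]:
    """For each block of `target`, find the shift d in [0, max_d] that
    minimises the sum of absolute differences against `ref`.

    Both rows must have the same length; indices are clamped at the border.
    """
    n = len(target)
    out = []
    for start in range(0, n, block):
        idx = range(start, min(start + block, n))

        def sad(d: int) -> int:
            return sum(abs(target[i] - ref[min(max(i + d, 0), n - 1)])
                       for i in idx)

        out.append(min(range(max_d + 1), key=sad))
    return out


left = [10, 20, 30, 40, 50, 60, 70, 80]    # locally decoded reference row
right = [20, 30, 40, 50, 50, 60, 70, 80]   # left half shifted by one sample

# Assumed mapping of layer name to block size: larger blocks give coarser maps.
layers: Dict[str, List[int]] = {
    name: block_disparity(left, right, size)
    for name, size in (("sparse", 8), ("dense", 4), ("fine", 2))
}
```

- With this toy data the coarsest layer holds a single value for the whole row, while the finer layers distinguish the shifted left half from the unshifted right half.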
- Embodiment 4 of a video encoding method provided by the present invention:
- FIG. 4 is a flowchart of Embodiment 4 of a video encoding method according to the present invention.
- This embodiment differs from Embodiment 3 in that the number of layers and the levels of the depth/disparity information to be extracted need not be preset before step 301; instead, the following step is added before step 303:
- Step 3021: Analyze the request information of the display device and/or the network transmission information to obtain the stereoscopic view display level required by the display device and/or the network; for example, a display device with a higher resolution requires a relatively high stereoscopic view display level.
- Step 303 then becomes: locally decode the base-layer-coded left-eye view and, in combination with the right-eye view, extract at least one layer of depth/disparity information corresponding to the stereoscopic view display level required by the display device and/or the network.
- This embodiment thus extracts depth/disparity information of the corresponding level according to the specific requirements of the display device and the network conditions, which improves coding efficiency, reduces coding complexity, and improves network transmission efficiency.
- Embodiment 1 of a video encoder provided by the present invention:
- FIG. 5 is a schematic structural diagram of Embodiment 1 of a video encoder according to the present invention, which includes:
- the base layer coding module 10, configured to perform base layer coding on the first view as the reference view; at least one prediction information extraction module, such as the prediction information extraction modules 11, 12, and 13 in FIG. 5, configured to extract at least one layer of prediction information by combining the locally decoded first view with the second view; the enhancement layer coding module 14, configured to perform enhancement layer coding separately on the at least one layer of prediction information; and the multiplexing module 15, configured to multiplex the enhancement layer coding with the base layer coding of the first view to obtain coded information.
- The encoder provided in this embodiment can be applied to Embodiments 1 to 4 of the video encoding method provided above.
- In this embodiment, the at least one prediction information extraction module extracts at least one layer of prediction information, on which enhancement layer coding is performed separately. This implements hierarchical coding of the stereoscopic view and satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected through different networks.
- FIG. 6 is a schematic structural diagram of Embodiment 2 of a video encoder according to the present invention, which specifically includes:
- the base layer coding module 20, configured to perform base layer coding on the left-eye view as the reference view (the right-eye view may instead be used as the reference view, in which case base layer coding is performed on the right-eye view);
- the sparse prediction information extraction module 21, configured to extract sparse prediction information by combining the right-eye view with the locally decoded left-eye view;
- the dense prediction information extraction module 22, configured to extract dense prediction information by combining the right-eye view with the locally decoded left-eye view;
- the fine prediction information extraction module 23, configured to extract fine prediction information by combining the right-eye view with the locally decoded left-eye view;
- the enhancement layer coding module 24, configured to perform enhancement layer coding separately on the sparse prediction information, the dense prediction information, and the fine prediction information;
- the multiplexing module 25, configured to multiplex the base layer coding of the left-eye view and the enhancement layer coding to obtain coded information.
- This embodiment may further include an analysis module 26, configured to analyze the request information of the display device and/or the network transmission information to obtain at least one stereoscopic view display level required by the display device and/or the network.
- The video encoder of the embodiments of the present invention is not limited to the three prediction information extraction modules described above; at least one prediction information extraction module may be provided according to actual needs, such as those of the display device and/or the network, to meet the demands of different display devices and/or networks.
- In this embodiment, the sparse prediction information extraction module 21, the dense prediction information extraction module 22, and the fine prediction information extraction module 23 extract three layers of prediction information, on which enhancement layer coding is performed separately. This implements hierarchical coding of the stereoscopic view and satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected through different networks. In addition, the analysis module 26 can obtain the specific requirements of the display device and the network conditions so that prediction information of the corresponding level is extracted, which improves coding efficiency, reduces coding complexity, and further improves network transmission efficiency.
- FIG. 7 is a flowchart of Embodiment 1 of a video decoding method according to the present invention.
- The decoding method in this embodiment corresponds to Embodiment 1 of the video encoding method provided by the present invention and includes the following steps:
- Step 401: Demultiplex the received coded information to obtain a base layer coding and an enhancement layer coding.
- Step 402: Decode the base layer coding to obtain a first view as the reference view.
- Step 403: Decode the enhancement layer coding to obtain at least one layer of prediction information.
- Step 404: Predict the second view according to the at least one layer of prediction information and the first view.
- The first view and the second view may be a left-eye view and a right-eye view, respectively, and the prediction information may be motion vector information and/or depth/disparity information.
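- Steps 401 and 404 can be sketched as follows. This is a hedged illustration: the 4-byte length-prefixed framing is an assumed container format, and the prediction step assumes the prediction information is one disparity value per fixed-size block of a single image row. `demultiplex` and `predict_second_view` are hypothetical names.

```python
import struct
from typing import List, Tuple


def demultiplex(stream: bytes) -> Tuple[bytes, List[bytes]]:
    """Split a stream of length-prefixed chunks (assumed framing) into the
    base layer coding and the list of enhancement layer codings."""
    chunks, pos = [], 0
    while pos < len(stream):
        (size,) = struct.unpack_from(">I", stream, pos)
        pos += 4
        chunks.append(stream[pos:pos + size])
        pos += size
    return chunks[0], chunks[1:]


def predict_second_view(first: List[int], disparity: List[int],
                        block: int) -> List[int]:
    """Disparity-compensated prediction: sample i of the second view is taken
    from the first view at i + d, where d is the disparity of i's block."""
    n = len(first)
    return [first[min(max(i + disparity[i // block], 0), n - 1)]
            for i in range(n)]


# Step 401 on a toy stream: one base chunk and one enhancement chunk.
toy_stream = struct.pack(">I", 2) + b"ab" + struct.pack(">I", 1) + b"c"
base_code, enh_codes = demultiplex(toy_stream)

# Step 404 on a toy row: the first block is shifted by one sample, the second is not.
left = [10, 20, 30, 40, 50, 60, 70, 80]
predicted_right = predict_second_view(left, [1, 0], block=4)
```

- A decoder that received only the base layer coding would skip the prediction step and display the first view alone as a two-dimensional picture.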
- Embodiment 2 of the video decoding method is as follows:
- FIG. 8 is a flowchart of Embodiment 2 of a video decoding method according to the present invention.
- The decoding method in this embodiment corresponds to Embodiment 2 of the video encoding method provided by the present invention, and specifically includes the following steps:
- Step 501: Demultiplex the received coded information to obtain the base layer coding and the enhancement layer coding;
- Step 502: Decode the base layer coding to obtain a left-eye view as a reference view;
- Step 503: Decode the enhancement layer coding to obtain sparse depth/disparity information;
- Step 504: Predict the right-eye view according to the sparse depth/disparity information and the left-eye view.
- In this embodiment, sparse depth/disparity information is obtained, and this information corresponds to the stereoscopic view display level acquired in advance at encoding time. Hierarchical decoding of the stereoscopic view is thereby realized, and the second view is predicted from the first view, so that the stereoscopic view can be displayed according to the first view and the predicted second view, which satisfies the stereoscopic display requirements of different display levels of various stereoscopic display devices.
- Embodiment 3 of the video decoding method provided by the present invention is as follows:
- FIG. 9 is a flowchart of Embodiment 3 of a video decoding method according to the present invention.
- The decoding method in this embodiment corresponds to the fourth embodiment of the video encoding method provided by the present invention, and specifically includes the following steps:
- Step 601: Demultiplex the received coded information to obtain the base layer coding and the enhancement layer coding.
- Step 602: Decode the base layer coding to obtain a left-eye view as a reference view.
- Step 603: Decode the enhancement layer coding to obtain sparse depth/disparity information, dense depth/disparity information, and fine depth/disparity information.
- Step 604: Predict the right-eye view according to the sparse depth/disparity information, the dense depth/disparity information, the fine depth/disparity information, and the left-eye view.
- In this embodiment, the three-layer prediction information structure of sparse depth/disparity information, dense depth/disparity information, and fine depth/disparity information corresponds to the display level; during decoding, the enhancement layer coding is decoded directly to obtain these three layers of depth/disparity information.
- The prediction information in the video decoding method of this embodiment may also be motion vector information, or a combination of depth/disparity information and motion vector information.
- In the video decoding method of this embodiment, at least one layer of depth/disparity information is obtained, which realizes hierarchical decoding of the stereoscopic view, and the right-eye view is predicted in combination with the left-eye view, so that the stereoscopic view can be displayed according to the left-eye view and the predicted right-eye view. This satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices. In this embodiment, the base layer coding is also decoded, and a two-dimensional view can be displayed according to the decoded base layer information, which provides compatibility with two-dimensional display.
- FIG. 10 is a flowchart of Embodiment 4 of a video decoding method according to the present invention.
- The decoding method in this embodiment corresponds to the third embodiment of the video encoding method provided by the present invention.
- The difference from Embodiment 3 of the decoding method is as follows:
- In Embodiment 3, the three-layer prediction information structure is determined by presetting the number and levels of the prediction information to be extracted. In this embodiment, before step 603 of the decoding process, the method may further include Step 6021: Analyze the request information of the display device to obtain at least one stereoscopic view display level required by the display device.
- Step 603 is then specifically: decode the enhancement layer coding corresponding to the at least one stereoscopic view display level to obtain at least one layer of depth/disparity information, which may be sparse depth/disparity information, dense depth/disparity information, fine depth/disparity information, or any combination of the three.
- This embodiment thus decodes the enhancement layer coding of the corresponding level according to the specific requirements of the display device and obtains the depth/disparity information of the corresponding level, which improves decoding efficiency and reduces decoding complexity.
- Embodiment 1 of a video decoder provided by the present invention is as follows:
- As shown in its schematic structural diagram, the video decoder of Embodiment 1 of the present invention includes:
- a demultiplexing module 30, configured to demultiplex the received coded information to obtain the base layer coding and the enhancement layer coding;
- a base layer decoding module 31, configured to decode the base layer coding to obtain a first view as a reference view;
- an enhancement layer decoding module 32, configured to decode the enhancement layer coding to obtain at least one layer of prediction information;
- a prediction module 33, configured to predict the second view according to the at least one layer of prediction information and the first view.
- This embodiment may further include an analysis module 34, configured to analyze the request information of the display device and obtain at least one stereoscopic view display level required by the display device.
- The enhancement layer decoding module 32 then acquires at least one layer of prediction information corresponding to the at least one stereoscopic view display level.
- The decoder provided in this embodiment may be applied to the embodiments of the video decoding method described above.
- In this embodiment, the enhancement layer decoding module 32 obtains at least one layer of prediction information, which realizes hierarchical decoding of the stereoscopic view and satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices. In addition, the analysis module 34 can obtain the specific requirements of the display device, so that only the prediction information of the corresponding level is decoded, which improves decoding efficiency and reduces decoding complexity.
- Another embodiment of the video encoding method provided by the present invention is as follows:
- The flowchart of Embodiment 1 of another video coding method provided by the present invention specifically includes the following steps:
- Step 701: Perform base layer coding using the first view as the reference view, and extract the first layer prediction information by combining the locally decoded first view and the second view.
- Step 702: Perform enhancement layer coding on the first layer prediction information.
- Step 703: Starting from the second layer, extract each current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer of prediction information, and perform enhancement layer coding on the increment.
- Step 704: Multiplex the base layer coding and the enhancement layer coding to obtain coded information.
- The video coding method in this embodiment extracts one layer of prediction information and at least one layer of prediction information increment, and performs enhancement layer coding on each, thereby realizing hierarchical coding of the stereoscopic view and satisfying the display requirements of different display levels of various stereoscopic display devices connected over different networks. Because enhancement layer coding is performed on prediction information increments, the amount of information transmitted in the network is reduced, the requirement for network transmission bandwidth is reduced, and transmission efficiency is improved.
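- The multiplexing of the base layer coding with the enhancement layer coding, and the matching demultiplexing at the decoder, can be sketched as follows. The text does not define a container format, so the length-prefixed framing and the names `multiplex` and `demultiplex` are assumptions made purely for illustration.

```python
import struct
from typing import List, Tuple


def multiplex(base_layer: bytes, enhancement_layers: List[bytes]) -> bytes:
    """Pack the base layer and the enhancement layers into one byte stream.

    Assumed framing: a 2-byte chunk count, then each chunk preceded by
    its 4-byte big-endian length. The base layer is always chunk 0.
    """
    chunks = [base_layer] + list(enhancement_layers)
    stream = struct.pack(">H", len(chunks))
    for chunk in chunks:
        stream += struct.pack(">I", len(chunk)) + chunk
    return stream


def demultiplex(stream: bytes) -> Tuple[bytes, List[bytes]]:
    """Split a stream produced by multiplex() back into its layers."""
    (count,) = struct.unpack_from(">H", stream, 0)
    offset = 2
    chunks = []
    for _ in range(count):
        (size,) = struct.unpack_from(">I", stream, offset)
        offset += 4
        chunks.append(stream[offset:offset + size])
        offset += size
    return chunks[0], chunks[1:]


stream = multiplex(b"base", [b"sparse", b"dense-inc"])
base, enhancements = demultiplex(stream)
```

- Because the base layer is kept as a separate chunk, a two-dimensional display could read chunk 0 alone and ignore the enhancement layers.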
- FIG. 13 is a flowchart of Embodiment 2 of another video coding method according to the present disclosure.
- In this embodiment, depth/disparity information is used as the prediction information, and one layer of depth/disparity information and one layer of depth/disparity information increment are extracted, namely sparse depth/disparity information and a dense depth/disparity information increment. This embodiment specifically includes the following steps:
- Step 801: Shoot the same scene from different angles using two or more cameras to obtain two views, a left-eye view and a right-eye view;
- Step 802: Select one of the left-eye view and the right-eye view as the reference view and perform base layer coding on it. Either view may be selected; in this embodiment, the left-eye view is selected as the reference view;
- Step 803: Locally decode the base-layer-coded left-eye view, extract sparse depth/disparity information in combination with the right-eye view, and perform enhancement layer coding on the sparse depth/disparity information;
- Step 804: Combine the locally decoded left-eye view, the right-eye view, and the sparse depth/disparity information to extract the dense depth/disparity information increment, and perform enhancement layer coding on the dense depth/disparity information increment;
- This step may specifically be: extract the dense depth/disparity information according to the locally decoded left-eye view and the right-eye view, and then calculate the increment of the dense depth/disparity information relative to the sparse depth/disparity information, that is, the dense depth/disparity information increment.
- Step 805: Multiplex the base layer coding and the enhancement layer coding to obtain coded information.
- The sparse depth/disparity information and the dense depth/disparity information correspond to two stereoscopic view display levels acquired in advance. These two display levels may be determined from a preset number of layers and preset levels of the depth/disparity information to be extracted, or they may be determined by adding the following step before step 803:
- Step 8021: Analyze the request information of the display device and/or the network transmission information. A display device with a higher resolution requires a higher stereoscopic view display level and may request extraction of dense depth/disparity information; when the network is congested, less content can be transmitted, so the required display level is lower and only sparse depth/disparity information may need to be extracted. Combining these two factors gives the display level of the stereoscopic view required by the display device and/or the network, and the total number of layers and the levels of the depth/disparity information to be extracted are determined according to that display level. For example, two layers of depth/disparity information, at the sparse and dense levels, are extracted according to the display level.
- The prediction information in the video coding method of this embodiment may also be motion vector information, or a combination of depth/disparity information and motion vector information; the base layer coding and the enhancement layer coding may be discrete cosine transform coding with motion compensation.
- The two layers of prediction information in this embodiment may be any combination of sparse, dense, and fine prediction information.
- In the video coding method of this embodiment, one layer of depth/disparity information and one layer of depth/disparity information increment are extracted, and enhancement layer coding is performed on each. This realizes hierarchical coding of the stereoscopic view and satisfies the display requirements of different display levels of stereoscopic display devices connected over different networks. Because enhancement layer coding is performed on a depth/disparity information increment, the amount of information transmitted in the network is reduced, the requirement for network transmission bandwidth is reduced, and transmission efficiency is improved. The base layer coding is also multiplexed in this embodiment, so a two-dimensional view can be displayed from the base layer coding, which provides compatibility with two-dimensional display.
- Embodiment 3 of another video coding method provided by the present invention uses depth/disparity information as the prediction information.
- The number and levels of the depth/disparity information to be extracted may be preset.
- The technical solution of this embodiment is described below using the extraction of three layers of depth/disparity information as an example, namely, from coarse to fine, sparse depth/disparity information, dense depth/disparity information, and fine depth/disparity information.
- This embodiment of the video encoding method performs the following steps:
- Step 901: Shoot the same scene from different angles using two or more cameras to obtain two views, a left-eye view and a right-eye view;
- Step 902: Select one of the left-eye view and the right-eye view as the reference view and perform base layer coding on it. Either view may be selected; in this embodiment, the left-eye view is selected as the reference view;
- Step 903: Locally decode the base-layer-coded left-eye view, extract sparse depth/disparity information in combination with the right-eye view, and perform enhancement layer coding on the sparse depth/disparity information;
- Step 904: Combine the locally decoded left-eye view, the right-eye view, and the sparse depth/disparity information to extract the dense depth/disparity information increment, and perform enhancement layer coding on the dense depth/disparity information increment;
- Step 905: Combine the locally decoded left-eye view, the right-eye view, and the dense depth/disparity information to extract the fine depth/disparity information increment, and perform enhancement layer coding on the fine depth/disparity information increment;
- Step 906: Multiplex the base layer coding and the enhancement layer coding to obtain coded information.
- Step 904 may specifically be: extract the dense depth/disparity information according to the locally decoded left-eye view and the right-eye view, and then calculate the increment of the dense depth/disparity information relative to the sparse depth/disparity information, that is, the dense depth/disparity information increment.
- Step 905 can be implemented in the same way.
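- The increment calculation described for steps 904 and 905 can be sketched as a residual between a finer map and the coarser map below it. The text does not say how the layers differ, so this sketch assumes that a coarser layer has lower spatial resolution and is compared with the finer layer after nearest-neighbour upsampling; `upsample`, `layer_increment`, and the scale factor are hypothetical.

```python
from typing import List

Map = List[List[int]]  # rows of depth/disparity values (assumed representation)


def upsample(coarse: Map, factor: int) -> Map:
    """Nearest-neighbour upsampling: repeat each value factor x factor times."""
    height, width = len(coarse) * factor, len(coarse[0]) * factor
    return [[coarse[y // factor][x // factor] for x in range(width)]
            for y in range(height)]


def layer_increment(finer: Map, coarser: Map, factor: int) -> Map:
    """Increment of the finer layer relative to the upsampled coarser layer."""
    predicted = upsample(coarser, factor)
    return [[f - p for f, p in zip(f_row, p_row)]
            for f_row, p_row in zip(finer, predicted)]


# One sparse value covers a 2x2 block of the dense map.
sparse = [[4]]
dense = [[4, 5], [3, 4]]
dense_increment = layer_increment(dense, sparse, 2)
```

- When the coarser layer already approximates the finer one, the increment values cluster around zero, which is what makes them cheaper to code than the full finer layer.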
- The prediction information in the video coding method of this embodiment may also be motion vector information, or a combination of depth/disparity information and motion vector information; the base layer coding and the enhancement layer coding may be discrete cosine transform coding with motion compensation.
- The encoding method of this embodiment is not limited to extracting three layers of prediction information; one layer of prediction information and at least one layer of prediction information increment may be extracted according to the determined total number and levels of the prediction information to be extracted.
- The video coding method in this embodiment extracts one layer of depth/disparity information and several layers of depth/disparity information increments, and performs enhancement layer coding on each, thereby realizing hierarchical coding of the stereoscopic view and satisfying the display requirements of different display levels of stereoscopic display devices connected over different networks. Because enhancement layer coding is performed on several layers of depth/disparity information increments, the amount of information transmitted in the network is reduced, the requirement for network transmission bandwidth is reduced, and transmission efficiency is improved. This embodiment also multiplexes the base layer coding, so a two-dimensional view can be displayed from the base layer coding, which provides compatibility with two-dimensional display.
- Another embodiment of the video encoding method provided by the present invention is as follows:
- Embodiment 4 of the video encoding method provided by the present invention, shown in its flowchart, differs from Embodiment 3 above as follows.
- Instead of presetting the number of layers and the levels of the depth/disparity information to be extracted, the following step may be added before step 903:
- Step 9021: Analyze the request information of the display device and/or the network transmission information. A display device with a higher resolution requires a higher display level and may request extraction of fine depth/disparity information; when the network is congested, less content can be transmitted, so the required stereoscopic view display level is lower and only sparse depth/disparity information may need to be extracted. Combining these two factors gives the display level of the stereoscopic view required by the display device and/or the network, and the total number of layers and the levels of the depth/disparity information to be extracted are determined according to that display level. For example, three layers of depth/disparity information, at the sparse, dense, and fine levels, are extracted according to the display level. After step 9021 is performed, steps 903 to 906 are carried out.
- In this embodiment, the depth/disparity information of the corresponding layers and levels is extracted according to the specific requirements of the display device and the network condition, which improves coding efficiency, reduces coding complexity, and improves network transmission efficiency.
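- The analysis in step 9021 can be sketched as a small decision function. The text only states that higher-resolution displays call for finer levels and that a congested network calls for sparse information; the resolution thresholds, the boolean congestion flag, and the name `select_levels` below are hypothetical values chosen for illustration.

```python
from typing import List

LEVELS = ["sparse", "dense", "fine"]  # coarse to fine, as in the text


def select_levels(display_height: int, network_congested: bool) -> List[str]:
    """Choose which depth/disparity levels to extract.

    Assumed policy: the display resolution sets the highest useful level,
    and a congested network caps the result at the sparse level.
    """
    if display_height >= 1080:      # hypothetical threshold
        wanted = 3
    elif display_height >= 720:     # hypothetical threshold
        wanted = 2
    else:
        wanted = 1
    if network_congested:
        wanted = 1
    return LEVELS[:wanted]


levels_hd = select_levels(1080, network_congested=False)
levels_busy = select_levels(1080, network_congested=True)
```

- Returning a prefix of the level list matches the incremental structure: each finer level is coded relative to the one before it, so a level cannot be used without the coarser levels beneath it.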
- FIG. 16 is a schematic structural diagram of another embodiment of a video encoder according to the present invention, which specifically includes:
- the base layer encoding module 40, configured to perform base layer coding using the first view as the reference view;
- at least two layers of prediction information extraction modules, in which the first layer prediction information extraction module 41 is connected to the base layer encoding module 40 and configured to extract the first layer prediction information by combining the locally decoded first view and the second view, and the other layer prediction information extraction modules 42, 43, and so on are each connected to the previous layer prediction information extraction module and configured to extract the current layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer of prediction information;
- the enhancement layer coding module 44, configured to perform enhancement layer coding on the first layer prediction information and on the prediction information increments;
- the multiplexing module 45, configured to multiplex the base layer coding and the enhancement layer coding to obtain coded information.
- The encoder provided in this embodiment can be applied to another embodiment of the video encoding method provided by the present invention, namely the fourth embodiment.
- In this embodiment, the first layer prediction information and the prediction information increments are extracted and enhancement layer coding is performed on each, which realizes hierarchical coding of the stereoscopic view and satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected over different networks. Because enhancement layer coding is performed on increments, the amount of information transmitted in the network is reduced, the requirement for network transmission bandwidth is reduced, and transmission efficiency is improved.
- Another video encoder embodiment provided by the present invention is as follows:
- FIG. 17 is a schematic structural diagram of another embodiment of a video encoder according to the present invention, which specifically includes:
- the base layer encoding module 50, configured to perform base layer coding on the left-eye view;
- the sparse prediction information extraction module 51, connected to the base layer encoding module 50 and configured to extract sparse prediction information by combining the right-eye view and the locally decoded left-eye view;
- the dense prediction information extraction module 52, connected to the sparse prediction information extraction module 51 and configured to receive the sparse prediction information sent by the sparse prediction information extraction module 51 and extract the dense prediction information increment in combination with the right-eye view and the locally decoded left-eye view;
- the fine prediction information extraction module 53, connected to the dense prediction information extraction module 52 and configured to receive the dense prediction information sent by the dense prediction information extraction module 52 and extract the fine prediction information increment in combination with the right-eye view and the locally decoded left-eye view;
- the enhancement layer coding module 54, configured to perform enhancement layer coding on the sparse prediction information, the dense prediction information increment, and the fine prediction information increment;
- the multiplexing module 55, configured to multiplex the base layer coding and the enhancement layer coding to obtain coded information.
- This embodiment may further include an analysis module 56, configured to analyze the request information of the display device and/or the network transmission information, obtain the display level of the stereoscopic view required by the display device and/or the network, and determine the prediction information to be extracted according to the display level.
- The video encoder of this embodiment is not limited to the three prediction information extraction modules described above; at least two layers of prediction information extraction modules may be provided according to actual needs, such as those of the display device and/or the network, to meet the demands of different display devices and/or networks.
- In this embodiment, the sparse prediction information extraction module 51, the dense prediction information extraction module 52, and the fine prediction information extraction module 53 extract the sparse prediction information, the dense prediction information increment, and the fine prediction information increment, and enhancement layer coding is performed on each. This realizes hierarchical coding of the stereoscopic view and satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices connected over different networks. Because enhancement layer coding is performed on the dense prediction information increment and the fine prediction information increment, the amount of information transmitted in the network is reduced, the requirement for network transmission bandwidth is reduced, and transmission efficiency is improved. In addition, the analysis module 56 can obtain the specific requirements of the display device and the network status, so that only the prediction information of the corresponding layers and levels is extracted, which improves coding efficiency, reduces coding complexity, and further improves network transmission efficiency.
- Another embodiment of the video decoding method provided by the present invention is as follows:
- FIG. 18 is a flowchart of another embodiment of a video decoding method according to the present invention.
- The decoding method in this embodiment corresponds to the first embodiment of the other video encoding method, and specifically includes the following steps:
- Step 1001: Demultiplex the received coded information to obtain the base layer coding and the enhancement layer coding.
- Step 1002: Decode the base layer coding to obtain a first view as a reference view.
- Step 1003: Decode the enhancement layer coding to obtain the first layer prediction information and several layers of prediction information increments;
- Step 1004: Calculate at least two layers of prediction information according to the first layer prediction information and the several layers of prediction information increments;
- Step 1005: Predict the second view according to the at least two layers of prediction information and the first view.
- In this embodiment, at least two layers of prediction information are calculated from the obtained first layer prediction information and the several layers of prediction information increments, which realizes hierarchical decoding of the stereoscopic view and satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices. Because enhancement layer decoding is performed on several layers of prediction information increments, the amount of information transmitted in the network is reduced, the requirement for network transmission bandwidth is reduced, and transmission efficiency is improved.
- This embodiment also decodes the base layer coding, and a two-dimensional view can be displayed according to the decoded base layer information, which provides compatibility with two-dimensional display.
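- Step 1004 amounts to accumulating the decoded increments on top of the first layer. The sketch below assumes, for simplicity, that all layers are flat lists of equal length and that an increment is added element by element; the text does not fix the layer representation, and the name `reconstruct_layers` is illustrative.

```python
from typing import List


def reconstruct_layers(first_layer: List[int],
                       increments: List[List[int]]) -> List[List[int]]:
    """Rebuild every prediction layer from the first layer and its increments.

    Layer k is obtained as layer k-1 plus increment k (element-wise),
    which mirrors how the encoder formed each increment.
    """
    layers = [list(first_layer)]
    for increment in increments:
        previous = layers[-1]
        layers.append([p + d for p, d in zip(previous, increment)])
    return layers


sparse = [1, 2, 3]
dense_inc = [1, 0, -1]
fine_inc = [0, 1, 0]
sparse_out, dense_out, fine_out = reconstruct_layers(sparse, [dense_inc, fine_inc])
```

- The decoder can stop after any layer: a device that needs only the dense level would pass a single increment and ignore the rest.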
- Another embodiment of the video decoding method provided by the present invention is as follows:
- FIG. 19 is a flowchart of the second embodiment of the video decoding method provided by the present invention.
- The decoding method in this embodiment corresponds to the second embodiment of the video encoding method, and specifically includes the following steps:
- Step 1101: Demultiplex the received coded information to obtain the base layer coding and the enhancement layer coding;
- Step 1102: Decode the base layer coding to obtain a left-eye view as a reference view;
- Step 1103: Decode the enhancement layer coding to obtain sparse depth/disparity information and a dense depth/disparity information increment;
- Step 1104: Calculate the dense depth/disparity information according to the sparse depth/disparity information and the dense depth/disparity information increment;
- Step 1105: Predict the right-eye view according to the sparse depth/disparity information, the dense depth/disparity information, and the left-eye view.
- In this embodiment, two layers of prediction information are calculated from the obtained sparse prediction information and the dense prediction information increment, which realizes hierarchical decoding of the stereoscopic view and satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices. The base layer coding is also decoded, and a two-dimensional view can be displayed according to the decoded information, which provides compatibility with two-dimensional display.
- FIG. 20 is a flowchart of Embodiment 3 of another video decoding method provided by the present invention.
- The decoding method in this embodiment corresponds to the fourth embodiment of the other video coding method, and specifically includes the following steps:
- Step 1201: Demultiplex the received coded information to obtain the base layer coding and the enhancement layer coding.
- Step 1202: Decode the base layer coding to obtain a left-eye view as a reference view.
- Step 1203: Decode the enhancement layer coding to obtain sparse depth/disparity information, a dense depth/disparity information increment, and a fine depth/disparity information increment;
- Step 1204: Calculate the dense depth/disparity information according to the sparse depth/disparity information and the dense depth/disparity information increment; calculate the fine depth/disparity information according to the dense depth/disparity information and the fine depth/disparity information increment;
- Step 1205: Predict the right-eye view according to the sparse depth/disparity information, the dense depth/disparity information, the fine depth/disparity information, and the left-eye view.
- In this embodiment, the three-layer prediction information structure of sparse depth/disparity information, dense depth/disparity information, and fine depth/disparity information corresponds to the display level; during decoding, the enhancement layer coding is decoded directly to obtain the above three layers of depth/disparity information.
- The prediction information in the video decoding method of this embodiment may also be motion vector information, or a combination of depth/disparity information and motion vector information.
- In this embodiment, at least two layers of depth/disparity information are calculated from the obtained first layer of depth/disparity information and the several layers of depth/disparity information increments, which realizes hierarchical decoding of the stereoscopic view. The right-eye view is predicted in combination with the left-eye view, so that the stereoscopic view can be displayed according to the left-eye view and the predicted right-eye view, which satisfies the stereoscopic view display requirements of different display levels of various stereoscopic display devices. Because enhancement layer decoding is performed on increments, the amount of information transmitted in the network is reduced, the requirement for network transmission bandwidth is reduced, and transmission efficiency is improved. The base layer coding is also decoded, and a two-dimensional view can be displayed according to the decoded base layer information, which provides compatibility with two-dimensional display.
- Another embodiment of the video decoding method provided by the present invention is as follows:
- FIG. 21 is a flowchart of Embodiment 4 of another video decoding method provided by the present invention.
- The decoding method in this embodiment corresponds to Embodiment 3 of the other video encoding method. The difference from Embodiment 3 of the other video decoding method is as follows:
- In Embodiment 3, the three-layer prediction information structure is determined by presetting the number and levels of the prediction information to be extracted. In this embodiment, before step 1203 of the decoding process, the method may further include Step 12021: Analyze the request information of the display device, obtain at least one stereoscopic view display level required by the display device, and determine the total number of layers and the levels of enhancement layer decoding according to the display level.
- Step 1203 is then specifically: decode the enhancement layer coding according to the determined total number of layers and levels of enhancement layer decoding to obtain sparse depth/disparity information and at least one layer of depth/disparity information increment. The at least one layer of depth/disparity information increment may be a dense depth/disparity information increment, or a dense depth/disparity information increment and a fine depth/disparity information increment.
- This embodiment thus decodes the enhancement layer coding of the corresponding layers and levels according to the specific requirements of the display device and obtains the depth/disparity information of the corresponding level, which improves decoding efficiency and reduces decoding complexity.
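- The choice made in this embodiment, decoding only the enhancement layers needed for the requested display level, can be sketched as a mapping from a display level to the list of coded items to decode. The level names follow the text, while the function name `layers_to_decode` and the string labels are illustrative assumptions.

```python
from typing import List

LEVELS = ["sparse", "dense", "fine"]  # coarse to fine, as in the text


def layers_to_decode(display_level: str) -> List[str]:
    """List the enhancement-layer items needed to reach a display level.

    The sparse layer is coded in full; every finer level is coded as an
    increment, so all increments up to the requested level are required.
    """
    top = LEVELS.index(display_level)
    return ["sparse"] + [f"{name} increment" for name in LEVELS[1:top + 1]]


needed_for_dense = layers_to_decode("dense")
needed_for_fine = layers_to_decode("fine")
```

- Skipping the increments above the requested level is where the saving in decoding work described above would come from.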
- Another video decoder embodiment provided by the present invention is as follows:
- FIG. 11 is a schematic structural diagram of another embodiment of a video decoder according to the present invention, which specifically includes:
- a demultiplexing module 60, configured to demultiplex the received coded information to obtain the base layer coding and the enhancement layer coding;
- a base layer decoding module 61, configured to decode the base layer coding to obtain a first view as a reference view;
- an enhancement layer decoding module 62, configured to decode the enhancement layer coding to obtain the first layer prediction information and several layers of prediction information increments;
- a calculation module 63, configured to calculate at least two layers of prediction information according to the first layer prediction information and the several layers of prediction information increments;
- a prediction module 64, configured to predict the second view according to the at least two layers of prediction information and the first view.
- This embodiment may further include an analysis module 65, configured to analyze the request information of the display device, obtain the display level of the stereoscopic view required by the display device, and determine the total number of layers of enhancement layer decoding according to the display level.
- The decoder provided in this embodiment is applicable to Embodiments 1 to 4 of the other video decoding method provided by the present invention described above.
- By providing the enhancement layer decoding module 62 and the calculation module 63, this embodiment obtains at least two layers of prediction information and realizes hierarchical decoding of stereoscopic views, meeting the stereoscopic view display requirements of various stereoscopic display devices at different display levels; because enhancement layer decoding operates on several layers of prediction information increments, less information is transmitted over the network, the demand on network transmission bandwidth is lower, and transmission efficiency improves; and the analysis module 65 can obtain the specific requirements of the display device so that only prediction information of the corresponding layers and levels is decoded, which improves decoding efficiency and reduces decoding complexity.
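The role of the analysis module 65 can be illustrated with a minimal Python sketch. The function name `layers_to_decode` and the integer encoding of display levels are assumptions made for illustration; the text does not define how a display level is represented.

```python
def layers_to_decode(enhancement_payloads, display_level):
    """Keep only the enhancement payloads that a display level needs.

    enhancement_payloads is ordered coarse to fine: the first-layer
    prediction information, then one increment per finer level.
    display_level is a hypothetical integer: 0 for a 2-D display (no
    enhancement layer), 1 for sparse, 2 for dense, 3 for fine.
    """
    if display_level < 0:
        raise ValueError("display_level must be non-negative")
    # Slicing past the end simply returns every available payload.
    return enhancement_payloads[:display_level]
```

With this policy a display that only needs the dense level never decodes the fine increment, which is where the reduction in decoding complexity would come from.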
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
Description
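Video Encoding and Decoding Methods, Video Encoder and Video Decoder
Technical Field
The present invention relates to the field of video technology, and in particular to a video encoding method, a video decoding method, a video encoder, and a video decoder.
Background
Conventional two-dimensional video is a carrier of planar information: it can present the content of a scene but not its depth. When people look at the world around them, they see not only the width and height of objects but also perceive their depth, and can judge the distance between objects or between the viewer and an object. This three-dimensional visual ability arises because people view objects with both eyes at once; owing to the spacing between the eyes, the left eye and the right eye receive different images of an object at a given distance, and the brain combines the two images into a sense of depth. As video technology has developed, viewers are no longer satisfied with the experience of two-dimensional video and seek a better, more immersive experience; stereoscopic video is one of the key technologies for achieving this.
Stereoscopic video technology is based on the principle of human binocular parallax. Cameras capture two images of the same scene from different angles, and the two images are shown on a screen simultaneously or in sequence so that each eye views one of them, producing a stereoscopic impression. Compared with conventional two-dimensional video, stereoscopic video therefore has two video streams. At the same image resolution and without compression coding, stereoscopic video requires twice the data traffic of two-dimensional video. This increase poses a great challenge for storage and transmission that cannot be solved merely by raising storage capacity and network bandwidth; an efficient coding method must be designed to compress stereoscopic video data.
Stereoscopic display devices of many kinds are currently on the market, such as head-mounted displays, stereoscopic glasses, holographic display devices, and autostereoscopic displays of various resolutions. Different stereoscopic displays require different levels of stereoscopic video content, and the networks to which they are connected differ in bandwidth, so that even identical displays connected to different networks need different levels of stereoscopic video content. For example, a stereoscopic display device on a high-speed network can, according to its own resolving capability, request rich stereoscopic information and display high-quality stereoscopic video. In other cases, a display constrained by its own capabilities or by network bandwidth needs only simple stereoscopic information and displays video with a simple stereoscopic effect. Some devices, such as conventional two-dimensional displays, need no stereoscopic information at all and only display two-dimensional images. This coexistence of different display devices and different network transmission capabilities calls for a stereoscopic video coding method that can meet the different levels of stereoscopic display required by the various stereoscopic display devices connected to different networks.
In the course of making the present invention, the inventors found the following problem in the prior art. Existing stereoscopic video coding methods only encode two-dimensional display and three-dimensional display separately: one view of the binocular video serves as the reference view and is encoded in a standard manner, and the other view is encoded with reference to the reference view. The display end can thus decode the reference view for two-dimensional display, or decode all of the content for three-dimensional display, but the differing levels of stereoscopic display required by the various stereoscopic display devices connected to different networks cannot be met.
Summary of the Invention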
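Embodiments of the present invention provide video encoding and decoding methods and a video encoder and decoder that realize hierarchical coding of stereoscopic views and meet the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks.
An embodiment of the present invention provides a video encoding method, including:
performing base layer coding on a first view serving as a reference view, and extracting at least one layer of prediction information by combining the locally decoded first view with a second view;
performing enhancement layer coding on each of the at least one layer of prediction information; and
multiplexing the enhancement layer code with the base layer code of the first view to obtain encoded information.
An embodiment of the present invention provides a video encoder, including:
a base layer encoding module, configured to perform base layer coding on a first view serving as a reference view;
at least one prediction information extraction module, configured to extract at least one layer of prediction information by combining the locally decoded first view with a second view;
an enhancement layer encoding module, configured to perform enhancement layer coding on each of the at least one layer of prediction information; and
a multiplexing module, configured to multiplex the enhancement layer code with the base layer code of the first view to obtain encoded information.
An embodiment of the present invention provides a video decoding method, including:
demultiplexing received encoded information to obtain a base layer code and an enhancement layer code;
decoding the base layer code to obtain a first view serving as a reference view;
decoding the enhancement layer code to obtain at least one layer of prediction information; and
predicting a second view according to the at least one layer of prediction information and the first view.
An embodiment of the present invention provides a video decoder, including:
a demultiplexing module, configured to demultiplex received encoded information to obtain a base layer code and an enhancement layer code;
a base layer decoding module, configured to decode the base layer code to obtain a first view serving as a reference view;
an enhancement layer decoding module, configured to decode the enhancement layer code to obtain at least one layer of prediction information; and
a prediction module, configured to predict a second view according to the at least one layer of prediction information and the first view.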
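An embodiment of the present invention further provides a video encoding method, including:
performing base layer coding on a first view serving as a reference view, and extracting first-layer prediction information by combining the locally decoded first view with a second view;
performing enhancement layer coding on the first-layer prediction information;
starting from extraction of the second-layer prediction information increment, extracting each current-layer prediction information increment as follows: extracting the current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer's prediction information, and performing enhancement layer coding on the current-layer prediction information increment; and so on, until enhancement layer coding has been performed on the last-layer prediction information increment; and
multiplexing the base layer code and the enhancement layer code to obtain encoded information.
An embodiment of the present invention further provides a video encoder, including:
a base layer encoding module, configured to perform base layer coding on a first view serving as a reference view;
at least two layers of prediction information extraction modules, in which the first-layer prediction information extraction module is connected to the base layer encoding module and is configured to extract first-layer prediction information by combining the locally decoded first view with a second view, and each prediction information extraction module other than the first-layer one is connected to the extraction module of the previous layer and is configured to extract the current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer's prediction information;
an enhancement layer encoding module, configured to perform enhancement layer coding on the first-layer prediction information and the several layers of prediction information increments; and
a multiplexing module, configured to multiplex the base layer code and the enhancement layer code to obtain encoded information.
An embodiment of the present invention further provides a video decoding method, including:
demultiplexing received encoded information to obtain a base layer code and an enhancement layer code;
decoding the base layer code to obtain a first view serving as a reference view;
decoding the enhancement layer code to obtain first-layer prediction information and several layers of prediction information increments;
calculating at least two layers of prediction information according to the first-layer prediction information and the several layers of prediction information increments; and
predicting a second view according to the at least two layers of prediction information and the first view.
An embodiment of the present invention further provides a video decoder, including:
a demultiplexing module, configured to demultiplex received encoded information to obtain a base layer code and an enhancement layer code;
a base layer decoding module, configured to decode the base layer code to obtain a first view serving as a reference view;
an enhancement layer decoding module, configured to decode the enhancement layer code to obtain first-layer prediction information and several layers of prediction information increments;
a calculation module, configured to calculate at least two layers of prediction information according to the first-layer prediction information and the several layers of prediction information increments; and
a prediction module, configured to predict a second view according to the at least two layers of prediction information and the first view.
By extracting at least one layer of prediction information and performing enhancement layer coding on each layer, the video encoding and decoding methods and the video encoder and decoder of the embodiments of the present invention realize hierarchical coding of stereoscopic views and meet the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks.
Brief Description of the Drawings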
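FIG. 1 is a flowchart of Embodiment 1 of a video encoding method provided by the present invention;
FIG. 2 is a flowchart of Embodiment 2 of a video encoding method provided by the present invention;
FIG. 3 is a flowchart of Embodiment 3 of a video encoding method provided by the present invention;
FIG. 4 is a flowchart of Embodiment 4 of a video encoding method provided by the present invention;
FIG. 5 is a schematic structural diagram of Embodiment 1 of a video encoder provided by the present invention;
FIG. 6 is a schematic structural diagram of Embodiment 2 of a video encoder provided by the present invention;
FIG. 7 is a flowchart of Embodiment 1 of a video decoding method provided by the present invention;
FIG. 8 is a flowchart of Embodiment 2 of a video decoding method provided by the present invention;
FIG. 9 is a flowchart of Embodiment 3 of a video decoding method provided by the present invention;
FIG. 10 is a flowchart of Embodiment 4 of a video decoding method provided by the present invention;
FIG. 11 is a schematic structural diagram of Embodiment 1 of a video decoder provided by the present invention;
FIG. 12 is a flowchart of Embodiment 1 of another video encoding method provided by the present invention;
FIG. 13 is a flowchart of Embodiment 2 of another video encoding method provided by the present invention;
FIG. 14 is a flowchart of Embodiment 3 of another video encoding method provided by the present invention;
FIG. 15 is a flowchart of Embodiment 4 of another video encoding method provided by the present invention;
FIG. 16 is a schematic structural diagram of Embodiment 1 of another video encoder provided by the present invention;
FIG. 17 is a schematic structural diagram of Embodiment 2 of another video encoder provided by the present invention;
FIG. 18 is a flowchart of Embodiment 1 of another video decoding method provided by the present invention;
FIG. 19 is a flowchart of Embodiment 2 of another video decoding method provided by the present invention;
FIG. 20 is a flowchart of Embodiment 3 of another video decoding method provided by the present invention;
FIG. 21 is a flowchart of Embodiment 4 of another video decoding method provided by the present invention;
FIG. 22 is a schematic structural diagram of Embodiment 1 of another video decoder provided by the present invention.
Detailed Description of the Embodiments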
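The technical solutions of the present invention are described in further detail below with reference to the drawings and embodiments.
Embodiment 1 of a video encoding method provided by the present invention:
FIG. 1 is a flowchart of Embodiment 1 of a video encoding method provided by the present invention, which specifically includes the following steps:
Step 101: perform base layer coding on a first view serving as a reference view, and extract at least one layer of prediction information by combining the locally decoded first view with a second view. The first view and the second view may be a left-eye view and a right-eye view respectively, and the prediction information may be motion vector information and/or depth or disparity information.
Step 102: perform enhancement layer coding on each of the at least one layer of prediction information.
Step 103: multiplex the enhancement layer code with the base layer code of the first view to obtain encoded information.
By extracting at least one layer of prediction information and performing enhancement layer coding on each layer, this embodiment realizes hierarchical coding of stereoscopic views and meets the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks.
Embodiment 2 of a video encoding method provided by the present invention: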
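FIG. 2 is a flowchart of Embodiment 2 of a video encoding method provided by the present invention. This embodiment uses depth/disparity information as the prediction information and extracts one layer of it, here assumed to be sparse depth/disparity information. The embodiment specifically includes the following steps:
Step 201: capture the same scene from different angles with two or more cameras to obtain two views, a left-eye view and a right-eye view.
Step 202: select either the left-eye view or the right-eye view as the reference view for base layer coding; this embodiment selects the left-eye view.
Step 203: locally decode the base-layer-coded left-eye view and combine it with the right-eye view to extract sparse depth/disparity information, which corresponds to one stereoscopic view display level obtained in advance.
Step 204: perform enhancement layer coding on the sparse depth/disparity information.
Step 205: multiplex the base layer code of the left-eye view with the enhancement layer code to obtain encoded information.
In step 203, the stereoscopic view display level obtained in advance may be determined from a preset number and level of depth/disparity information layers to be extracted, or by adding the following step before step 203:
Step 2021: analyze the request information of the display device and/or the network transmission information. If the analysis shows that the network is congested and can carry only a small amount of content, the required stereoscopic view display level is low, and extraction of sparse depth/disparity information may be requested.
In this embodiment the prediction information may also be motion vector information, or a combination of depth/disparity information and motion vector information; the base layer coding and the enhancement layer coding may be discrete cosine transform coding with motion compensation. When the stereoscopic view display level obtained in advance is high, the single layer of prediction information in this embodiment may instead be dense prediction information or fine prediction information.
By extracting one layer of sparse depth/disparity information and performing enhancement layer coding on it, this embodiment realizes hierarchical coding of stereoscopic views and meets the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. Depth/disparity information of the corresponding level can also be extracted according to the specific requirements of the display device and the network conditions, which improves coding efficiency, reduces coding complexity, and further improves network transmission efficiency. The base layer code is also multiplexed, so a two-dimensional view can be displayed from the base layer code alone, keeping compatibility with two-dimensional display.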
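Embodiment 3 of a video encoding method provided by the present invention:
FIG. 3 is a flowchart of Embodiment 3 of a video encoding method provided by the present invention. This embodiment uses depth/disparity information as the prediction information. Before the steps shown in FIG. 1 are performed, the number and levels of depth/disparity information layers to be extracted may be preset. This embodiment takes three layers as an example, which are, from coarse to fine, sparse depth/disparity information, dense depth/disparity information, and fine depth/disparity information. The video encoding method of this embodiment performs the following steps:
Step 301: capture the same scene from different angles with two or more cameras to obtain two views, a left-eye view and a right-eye view.
Step 302: select either the left-eye view or the right-eye view as the reference view for base layer coding; this embodiment selects the left-eye view.
Step 303: locally decode the base-layer-coded left-eye view and combine it with the right-eye view to extract sparse, dense, and fine depth/disparity information separately.
Step 304: perform enhancement layer coding on the sparse, dense, and fine depth/disparity information separately.
Step 305: multiplex the base layer code of the left-eye view with the enhancement layer code to obtain encoded information.
In the video encoding method of this embodiment, the prediction information may also be motion vector information, or a combination of depth/disparity information and motion vector information; the base layer coding and the enhancement layer coding may be discrete cosine transform coding with motion compensation.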
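The multiplexing in step 305 can be pictured as packing the base layer code and each enhancement layer code into one stream. The Python sketch below is only an illustration: the length-prefixed container layout and the names `multiplex` and `demultiplex` are assumptions, since the text does not specify a bitstream format.

```python
import struct
from typing import List, Tuple

def multiplex(base_layer: bytes, enhancement_layers: List[bytes]) -> bytes:
    """Pack one base layer code and N enhancement layer codes into one stream.

    Hypothetical layout: a 1-byte payload count, then each payload preceded
    by a 4-byte big-endian length. The base layer always comes first.
    """
    payloads = [base_layer] + list(enhancement_layers)
    out = bytearray(struct.pack(">B", len(payloads)))
    for payload in payloads:
        out += struct.pack(">I", len(payload)) + payload
    return bytes(out)

def demultiplex(stream: bytes) -> Tuple[bytes, List[bytes]]:
    """Inverse of multiplex: return (base layer code, enhancement layer codes)."""
    (count,) = struct.unpack_from(">B", stream, 0)
    offset = 1
    payloads = []
    for _ in range(count):
        (length,) = struct.unpack_from(">I", stream, offset)
        offset += 4
        payloads.append(stream[offset:offset + length])
        offset += length
    return payloads[0], payloads[1:]
```

A two-dimensional display would use only the first value returned by `demultiplex` and ignore the enhancement layer codes, which matches the two-dimensional compatibility described for the base layer.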
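By extracting at least one layer of depth/disparity information and performing enhancement layer coding on each layer, the video encoding method of this embodiment realizes hierarchical coding of stereoscopic views and meets the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. The base layer code is also multiplexed, so a two-dimensional view can be displayed from the base layer code alone, keeping compatibility with two-dimensional display.
Embodiment 4 of a video encoding method provided by the present invention:
FIG. 4 is a flowchart of Embodiment 4 of a video encoding method provided by the present invention. This embodiment differs from Embodiment 3 above in that the number and levels of depth/disparity information layers to be extracted need not be preset before step 301; instead, the following step is added before step 303:
Step 3021: analyze the request information of the display device and/or the network transmission information. A display device with a higher resolution requires a higher stereoscopic view display level, and extraction of fine depth/disparity information may be requested; a congested network can carry only a small amount of content, so the required display level is lower, and extraction of sparse depth/disparity information may be requested. Combining these two factors yields at least one stereoscopic view display level required by the various display devices on different networks.
Further, step 303 is specifically: locally decode the base-layer-coded left-eye view and combine it with the right-eye view to extract at least one layer of depth/disparity information corresponding to the stereoscopic view display level required by the display device and/or the network.
Building on Embodiment 3 above, this embodiment further extracts depth/disparity information of the corresponding level according to the specific requirements of the display device and the network conditions, which improves coding efficiency, reduces coding complexity, and improves network transmission efficiency.
Embodiment 1 of a video encoder provided by the present invention: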
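FIG. 5 is a schematic structural diagram of Embodiment 1 of a video encoder provided by the present invention, which specifically includes:
a base layer encoding module 10, configured to perform base layer coding on a first view serving as a reference view; at least one prediction information extraction module, such as the prediction information extraction modules 11, 12, 13, and so on in FIG. 5, configured to extract at least one layer of prediction information by combining the locally decoded first view with a second view; an enhancement layer encoding module 14, configured to perform enhancement layer coding on each of the at least one layer of prediction information; and a multiplexing module 15, configured to multiplex the enhancement layer code with the base layer code of the first view to obtain encoded information.
The encoder provided in this embodiment is applicable to Embodiments 1 to 4 of the video encoding method provided by the present invention described above.
Through the at least one prediction information extraction module, this embodiment extracts at least one layer of prediction information and performs enhancement layer coding on each layer, realizing hierarchical coding of stereoscopic views and meeting the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks.
Embodiment 2 of a video encoder provided by the present invention: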
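FIG. 6 is a schematic structural diagram of Embodiment 2 of a video encoder provided by the present invention, which specifically includes:
a base layer encoding module 20, configured to perform base layer coding on the left-eye view serving as the reference view (the right-eye view may instead serve as the reference view and be base-layer coded); a sparse prediction information extraction module 21, configured to extract sparse prediction information by combining the right-eye view with the locally decoded left-eye view; a dense prediction information extraction module 22, configured to extract dense prediction information by combining the right-eye view with the locally decoded left-eye view; a fine prediction information extraction module 23, configured to extract fine prediction information by combining the right-eye view with the locally decoded left-eye view; an enhancement layer encoding module 24, configured to perform enhancement layer coding on the sparse, dense, and fine prediction information separately; and a multiplexing module 25, configured to multiplex the base layer code of the left-eye view with the enhancement layer code to obtain encoded information.
This embodiment may further include an analysis module 26, configured to analyze the request information of the display device and/or the network transmission information and obtain at least one stereoscopic view display level required by the display device and/or the network.
The video encoder of the embodiments of the present invention is not limited to the three prediction information extraction modules above; at least one prediction information extraction module may be provided as actually needed, for example according to the needs of the display device and/or the network, to meet the requirements of different display devices and/or networks.
By providing the sparse prediction information extraction module 21, the dense prediction information extraction module 22, and the fine prediction information extraction module 23, this embodiment extracts three layers of prediction information and performs enhancement layer coding on each, realizing hierarchical coding of stereoscopic views and meeting the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. The analysis module 26 can also obtain the specific requirements of the display device and the network conditions so that prediction information of the corresponding level is extracted, which improves coding efficiency, reduces coding complexity, and further improves network transmission efficiency.
Embodiment 1 of a video decoding method provided by the present invention: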
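FIG. 7 is a flowchart of Embodiment 1 of a video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 1 of the video encoding method provided by the present invention and specifically includes the following steps:
Step 401: demultiplex the received encoded information to obtain a base layer code and an enhancement layer code.
Step 402: decode the base layer code to obtain a first view serving as a reference view.
Step 403: decode the enhancement layer code to obtain at least one layer of prediction information.
Step 404: predict a second view according to the at least one layer of prediction information and the first view.
The first view and the second view may be a left-eye view and a right-eye view respectively, and the prediction information may be motion vector information and/or depth or disparity information.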
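Step 404 can be illustrated for the disparity case with a minimal Python sketch. It assumes rectified views and one integer horizontal disparity per pixel; the function name `predict_second_view` and the hole handling are illustrative assumptions, not the prediction procedure defined by the text.

```python
def predict_second_view(first_view, disparity, hole_value=0):
    """Predict the second view by shifting each pixel of the first view.

    first_view and disparity are lists of rows with the same shape.
    disparity[y][x] is an integer horizontal offset (assumption: rectified
    cameras), so pixel (x, y) of the first view lands at (x - d, y) in the
    second view. Positions that receive no pixel keep hole_value, and a
    later pixel overwrites an earlier one that maps to the same position.
    """
    height, width = len(first_view), len(first_view[0])
    second = [[hole_value] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            target = x - disparity[y][x]
            if 0 <= target < width:
                second[y][target] = first_view[y][x]
    return second
```

A sparse disparity layer would give a coarse prediction of this kind, and finer layers would refine the offsets used for each pixel.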
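By obtaining at least one layer of prediction information, this embodiment realizes hierarchical decoding of stereoscopic views. The second view is predicted from the first view, so a stereoscopic view can be displayed from the first view and the predicted second view, meeting the stereoscopic view display requirements of various stereoscopic display devices at different display levels.
Embodiment 2 of a video decoding method provided by the present invention:
FIG. 8 is a flowchart of Embodiment 2 of a video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 2 of the video encoding method provided by the present invention and specifically includes the following steps:
Step 501: demultiplex the received encoded information to obtain a base layer code and an enhancement layer code.
Step 502: decode the base layer code to obtain the left-eye view serving as the reference view.
Step 503: decode the enhancement layer code to obtain sparse depth/disparity information.
Step 504: predict the right-eye view according to the sparse depth/disparity information and the left-eye view.
This embodiment obtains sparse depth/disparity information, which corresponds to the stereoscopic view display level obtained in advance during encoding, and thereby realizes hierarchical decoding of stereoscopic views. The second view is predicted from the first view, so a stereoscopic view can be displayed from the first view and the predicted second view, meeting the stereoscopic view display requirements of various stereoscopic display devices at different display levels.
Embodiment 3 of a video decoding method provided by the present invention: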
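FIG. 9 is a flowchart of Embodiment 3 of a video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 4 of the video encoding method provided by the present invention and specifically includes the following steps:
Step 601: demultiplex the received encoded information to obtain a base layer code and an enhancement layer code.
Step 602: decode the base layer code to obtain the left-eye view serving as the reference view.
Step 603: decode the enhancement layer code to obtain sparse, dense, and fine depth/disparity information.
Step 604: predict the right-eye view according to the sparse, dense, and fine depth/disparity information and the left-eye view.
During encoding, at least one stereoscopic view display level was obtained by analyzing the display device and/or the network transmission information, and the corresponding three-layer prediction information structure of sparse, dense, and fine depth/disparity information was obtained according to that display level. During decoding it is therefore sufficient to decode the enhancement layer code directly to obtain the three layers of depth/disparity information.
In the video decoding method of this embodiment, the prediction information may also be motion vector information, or a combination of depth/disparity information and motion vector information.
By obtaining at least one layer of depth/disparity information, the video decoding method of this embodiment realizes hierarchical decoding of stereoscopic views. The right-eye view is predicted from the left-eye view, so a stereoscopic view can be displayed from the left-eye view and the predicted right-eye view, meeting the stereoscopic view display requirements of various stereoscopic display devices at different display levels. The base layer code is also decoded, and a two-dimensional view can be displayed from the decoded base layer information, keeping compatibility with two-dimensional display.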
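Embodiment 4 of a video decoding method provided by the present invention:
FIG. 10 is a flowchart of Embodiment 4 of a video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 3 of the video encoding method provided by the present invention and differs from Embodiment 3 of the decoding method above as follows:
Because the three-layer prediction information structure was determined during encoding by presetting the number and levels of prediction information layers to be extracted, the decoding process may further include, before step 603:
Step 6021: analyze the request information of the display device to obtain at least one stereoscopic view display level required by the various display devices.
Further, step 603 is specifically: decode the enhancement layer code corresponding to the at least one stereoscopic view display level to obtain at least one layer of depth/disparity information, which may be sparse, dense, or fine depth/disparity information, or any combination of the three.
Building on Embodiment 3 of the decoding method above, this embodiment further decodes the enhancement layer code of the corresponding level according to the specific requirements of the display device and obtains depth/disparity information of the corresponding level, which improves decoding efficiency and reduces decoding complexity.
Embodiment 1 of a video decoder provided by the present invention: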
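FIG. 11 is a schematic structural diagram of Embodiment 1 of a video decoder provided by the present invention, which specifically includes:
a demultiplexing module 30, configured to demultiplex the received encoded information to obtain a base layer code and an enhancement layer code; a base layer decoding module 31, configured to decode the base layer code to obtain a first view serving as a reference view; an enhancement layer decoding module 32, configured to decode the enhancement layer code to obtain at least one layer of prediction information; and a prediction module 33, configured to predict the right-eye view according to the at least one layer of prediction information and the first view.
This embodiment may further include an analysis module 34, configured to analyze the request information of the display device and obtain at least one stereoscopic view display level required by the display device. The enhancement layer decoding module 32 then obtains at least one layer of prediction information corresponding to the at least one stereoscopic view display level.
The decoder provided in this embodiment is applicable to Embodiments 1 to 4 of the video decoding method provided by the present invention described above.
By providing the enhancement layer decoding module 32, this embodiment obtains at least one layer of prediction information and realizes hierarchical decoding of stereoscopic views, meeting the stereoscopic view display requirements of various stereoscopic display devices at different display levels. The analysis module 34 can also obtain the specific requirements of the display device so that prediction information of the corresponding level is decoded, which improves decoding efficiency and reduces decoding complexity.
Embodiment 1 of another video encoding method provided by the present invention: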
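FIG. 12 is a flowchart of Embodiment 1 of another video encoding method provided by the present invention, which specifically includes the following steps:
Step 701: perform base layer coding on a first view serving as a reference view, and extract first-layer prediction information by combining the locally decoded first view with a second view.
Step 702: perform enhancement layer coding on the first-layer prediction information.
Step 703: starting from extraction of the second-layer prediction information increment, extract each current-layer prediction information increment as follows: combine the locally decoded first view, the second view, and the previous layer's prediction information to extract the current-layer prediction information increment, and perform enhancement layer coding on it; continue in this way until enhancement layer coding has been performed on the last-layer prediction information increment.
Step 704: multiplex the base layer code and the enhancement layer code to obtain encoded information.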
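The layer-by-layer loop of step 703 can be sketched in Python as follows. The sketch assumes that every layer is an equally sized grid of numbers and that an increment is a plain element-wise difference to the previous layer; the text does not fix the increment operation, and `to_increments` is a hypothetical name.

```python
def to_increments(layers):
    """Turn full prediction layers (coarse to fine) into what gets coded.

    layers: list of equally sized 2-D lists, e.g. [sparse, dense, fine].
    Returns (first_layer, increments), where increments[i] is the
    element-wise difference between layers[i + 1] and layers[i].
    Assumption: a plain difference stands in for the unspecified
    increment computation.
    """
    first_layer = layers[0]
    increments = []
    for previous, current in zip(layers, layers[1:]):
        increments.append([
            [c - p for c, p in zip(cur_row, prev_row)]
            for cur_row, prev_row in zip(current, previous)
        ])
    return first_layer, increments
```

When consecutive layers are similar, most increment values are small or zero, which is the property the embodiment relies on to reduce the amount of transmitted information.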
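By extracting one layer of prediction information and at least one layer of prediction information increments and performing enhancement layer coding on each, the video encoding method of this embodiment realizes hierarchical coding of stereoscopic views and meets the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. Because enhancement layer coding is performed on at least one layer of depth/disparity information increments rather than directly on the prediction information, less information is transmitted over the network, which lowers the required network transmission bandwidth and improves transmission efficiency.
Embodiment 2 of another video encoding method provided by the present invention:
FIG. 13 is a flowchart of Embodiment 2 of another video encoding method provided by the present invention. This embodiment uses depth/disparity information as the prediction information and extracts one layer of depth/disparity information and one layer of depth/disparity information increment, namely sparse depth/disparity information and a dense depth/disparity information increment. The embodiment specifically includes the following steps:
Step 801: capture the same scene from different angles with two or more cameras to obtain two views, a left-eye view and a right-eye view.
Step 802: select either the left-eye view or the right-eye view as the reference view for base layer coding; this embodiment selects the left-eye view.
Step 803: locally decode the base-layer-coded left-eye view, combine it with the right-eye view to extract sparse depth/disparity information, and perform enhancement layer coding on the sparse depth/disparity information.
Step 804: combine the locally decoded left-eye view, the right-eye view, and the sparse depth/disparity information to extract a dense depth/disparity information increment, and perform enhancement layer coding on the increment.
This step may specifically be: extract dense depth/disparity information by combining the locally decoded left-eye view with the right-eye view, then calculate the increment of the dense depth/disparity information relative to the sparse depth/disparity information, which is the dense depth/disparity information increment.
Step 805: multiplex the base layer code and the enhancement layer code to obtain encoded information.
In this embodiment the sparse and dense depth/disparity information correspond to two stereoscopic view display levels obtained in advance. These two levels may be determined from a preset number and level of depth/disparity information layers to be extracted, or by adding the following step before step 803:
Step 8021: analyze the request information of the display device and/or the network transmission information. A display device with a higher resolution requires a higher stereoscopic view display level, and extraction of dense depth/disparity information may be requested; a congested network can carry only a small amount of content, so the required display level is lower, and extraction of sparse depth/disparity information may be requested. Combining these two factors yields the stereoscopic view display level required by the display device and/or the network, and the total number and levels of depth/disparity information layers to extract are determined according to that display level; for example, the display level may call for two layers of depth/disparity information, at the sparse and dense levels.
In the video encoding method of this embodiment, the prediction information may also be motion vector information, or a combination of depth/disparity information and motion vector information; the base layer coding and the enhancement layer coding may be discrete cosine transform coding with motion compensation. The two layers of prediction information in this embodiment may be any two of sparse, dense, and fine prediction information.
By extracting one layer of depth/disparity information and one layer of depth/disparity information increment and performing enhancement layer coding on each, the video encoding method of this embodiment realizes hierarchical coding of stereoscopic views and meets the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. Because enhancement layer coding is performed on a depth/disparity information increment, less information is transmitted over the network, which lowers the required network transmission bandwidth and improves transmission efficiency. Depth/disparity information of the corresponding number of layers and levels can also be extracted according to the specific requirements of the display device and the network conditions, which improves coding efficiency, reduces coding complexity, and further improves network transmission efficiency. The base layer code is also multiplexed, so a two-dimensional view can be displayed from the base layer code alone, keeping compatibility with two-dimensional display.
Embodiment 3 of another video encoding method provided by the present invention: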
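FIG. 14 is a flowchart of Embodiment 3 of another video encoding method provided by the present invention. This embodiment uses depth/disparity information as the prediction information. Before the steps shown in FIG. 14 are performed, the number and levels of depth/disparity information layers to be extracted may be preset. This embodiment takes three layers as an example, which are, from coarse to fine, sparse depth/disparity information, dense depth/disparity information, and fine depth/disparity information. The video encoding method of this embodiment performs the following steps:
Step 901: capture the same scene from different angles with two or more cameras to obtain two views, a left-eye view and a right-eye view.
Step 902: select either the left-eye view or the right-eye view as the reference view for base layer coding; this embodiment selects the left-eye view.
Step 903: locally decode the base-layer-coded left-eye view, combine it with the right-eye view to extract sparse depth/disparity information, and perform enhancement layer coding on the sparse depth/disparity information.
Step 904: combine the locally decoded left-eye view, the right-eye view, and the sparse depth/disparity information to extract a dense depth/disparity information increment, and perform enhancement layer coding on the increment.
Step 905: combine the locally decoded left-eye view, the right-eye view, and the dense depth/disparity information to extract a fine depth/disparity information increment, and perform enhancement layer coding on the increment.
Step 906: multiplex the base layer code and the enhancement layer code to obtain encoded information.
Specifically, step 904 may be: extract dense depth/disparity information by combining the locally decoded left-eye view with the right-eye view, then calculate the increment of the dense depth/disparity information relative to the sparse depth/disparity information, which is the dense depth/disparity information increment. Step 905 can be implemented in the same way.
In the video encoding method of this embodiment, the prediction information may also be motion vector information, or a combination of depth/disparity information and motion vector information; the base layer coding and the enhancement layer coding may be discrete cosine transform coding with motion compensation.
The encoding method of the embodiments of the present invention is not limited to extracting three layers of prediction information; one layer of prediction information and at least one layer of prediction information increments may be extracted according to the determined total number and levels of prediction information layers.
By extracting one layer of depth/disparity information and several layers of depth/disparity information increments and performing enhancement layer coding on each, the video encoding method of this embodiment realizes hierarchical coding of stereoscopic views and meets the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. Because enhancement layer coding is performed on several layers of depth/disparity information increments, less information is transmitted over the network, which lowers the required network transmission bandwidth and improves transmission efficiency. The base layer code is also multiplexed, so a two-dimensional view can be displayed from the base layer code alone, keeping compatibility with two-dimensional display.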
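Embodiment 4 of another video encoding method provided by the present invention:
FIG. 15 is a flowchart of Embodiment 4 of another video encoding method provided by the present invention. This embodiment differs from Embodiment 3 of the other video encoding method above in that the number and levels of depth/disparity information layers to be extracted need not be preset before step 901; instead, the following step may be added before step 903:
Step 9021: analyze the request information of the display device and/or the network transmission information. A display device with a higher resolution requires a higher stereoscopic view display level, and extraction of fine depth/disparity information may be requested; a congested network can carry only a small amount of content, so the required display level is lower, and extraction of sparse depth/disparity information may be requested. Combining these two factors yields the stereoscopic view display level required by the display device and/or the network, and the total number and levels of depth/disparity information layers to extract are determined according to that display level. For example, if the display level calls for three layers of depth/disparity information at the sparse, dense, and fine levels, steps 903 to 906 are performed after step 9021.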
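The decision made in step 9021 can be illustrated with a small Python sketch. The resolution thresholds and the rule that congestion caps the result at the sparse level are invented for illustration; the text only states that higher-resolution displays call for finer levels and congested networks for coarser ones. `select_levels` is a hypothetical name.

```python
LEVELS = ["sparse", "dense", "fine"]  # coarse to fine, as in the text

def select_levels(display_rows, network_congested):
    """Return the depth/disparity levels to extract, coarse to fine.

    Hypothetical policy: display_rows is the vertical resolution of the
    display, the thresholds below are illustrative only, and a congested
    network caps the result at the sparse level.
    """
    if display_rows >= 1080:
        wanted = 3
    elif display_rows >= 720:
        wanted = 2
    else:
        wanted = 1
    if network_congested:
        wanted = 1
    return LEVELS[:wanted]
```

The length of the returned list plays the role of the total number of layers, and its entries play the role of the levels; the first entry is coded in full and the later entries as increments.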
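Building on Embodiment 3 of the other video encoding method above, this embodiment further extracts depth/disparity information of the corresponding number of layers and levels according to the specific requirements of the display device and the network conditions, which improves coding efficiency, reduces coding complexity, and improves network transmission efficiency.
Embodiment 1 of another video encoder provided by the present invention:
FIG. 16 is a schematic structural diagram of Embodiment 1 of another video encoder provided by the present invention, which specifically includes:
a base layer encoding module 40, configured to perform base layer coding on a first view serving as a reference view; at least two layers of prediction information extraction modules, in which a first-layer prediction information extraction module 41 is connected to the base layer encoding module 40 and is configured to extract first-layer prediction information by combining the locally decoded first view with a second view, and the other prediction information extraction modules 42, 43, and so on are each connected to the extraction module of the previous layer and are configured to extract the current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer's prediction information; an enhancement layer encoding module 44, configured to perform enhancement layer coding on the first-layer prediction information and the several layers of prediction information increments; and a multiplexing module 45, configured to multiplex the base layer code and the enhancement layer code to obtain encoded information.
The encoder provided in this embodiment is applicable to Embodiments 1 to 4 of the other video encoding method provided by the present invention described above.
By providing the first-layer prediction information extraction module 41 and the other prediction information extraction modules 42, 43, and so on, this embodiment extracts one layer of prediction information and at least one layer of prediction information increments and performs enhancement layer coding on each, realizing hierarchical coding of stereoscopic views and meeting the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. Because enhancement layer coding is performed on increments, less information is transmitted over the network, the demand on network transmission bandwidth is lower, and transmission efficiency improves.
Embodiment 2 of another video encoder provided by the present invention: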
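FIG. 17 is a schematic structural diagram of Embodiment 2 of another video encoder provided by the present invention, which specifically includes:
a base layer encoding module 50, configured to perform base layer coding on the left-eye view; a sparse prediction information extraction module 51, connected to the base layer encoding module 50 and configured to extract sparse prediction information by combining the right-eye view with the locally decoded left-eye view; a dense prediction information extraction module 52, connected to the sparse prediction information extraction module 51 and configured to receive the sparse prediction information sent by the module 51 and to extract a dense prediction information increment by combining the right-eye view with the locally decoded left-eye view; a fine prediction information extraction module 53, connected to the dense prediction information extraction module 52 and configured to receive the dense prediction information sent by the module 52 and to extract a fine prediction information increment by combining the right-eye view with the locally decoded left-eye view; an enhancement layer encoding module 54, configured to perform enhancement layer coding on the sparse prediction information, the dense prediction information increment, and the fine prediction information increment; and a multiplexing module 55, configured to multiplex the base layer code and the enhancement layer code to obtain encoded information.
This embodiment may further include an analysis module 56, configured to analyze the request information of the display device and/or the network transmission information, obtain the stereoscopic view display level required by the display device and/or the network, and determine the total number and levels of prediction information increments to extract according to the display level.
The video encoder of the embodiments of the present invention is not limited to the three layers of prediction information extraction modules above; at least two layers of prediction information extraction modules may be provided as actually needed, for example according to the needs of the display device and/or the network, to meet the requirements of different display devices and/or networks.
By providing the sparse prediction information extraction module 51, the dense prediction information extraction module 52, and the fine prediction information extraction module 53, this embodiment extracts sparse prediction information, a dense prediction information increment, and a fine prediction information increment and performs enhancement layer coding on each, realizing hierarchical coding of stereoscopic views and meeting the stereoscopic view display requirements, at different display levels, of the various stereoscopic display devices connected to different networks. Because enhancement layer coding is performed on the dense and fine prediction information increments, less information is transmitted over the network, the demand on network transmission bandwidth is lower, and transmission efficiency improves. The analysis module 56 can also obtain the specific requirements of the display device and the network conditions so that prediction information of the corresponding number of layers and levels is extracted, which improves coding efficiency, reduces coding complexity, and further improves network transmission efficiency.
Embodiment 1 of another video decoding method provided by the present invention: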
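FIG. 18 is a flowchart of Embodiment 1 of another video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 1 of the other video encoding method and specifically includes the following steps:
Step 1001: demultiplex the received encoded information to obtain a base layer code and an enhancement layer code.
Step 1002: decode the base layer code to obtain a first view serving as a reference view.
Step 1003: decode the enhancement layer code to obtain first-layer prediction information and several layers of prediction information increments.
Step 1004: calculate at least two layers of prediction information according to the first-layer prediction information and the several layers of prediction information increments.
Step 1005: predict a second view according to the at least two layers of prediction information and the first view.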
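The calculation in step 1004 can be sketched in Python under the same assumption used for illustration on the encoder side, namely that each increment is an element-wise difference to the previous layer. The name `reconstruct_layers` is hypothetical.

```python
def reconstruct_layers(first_layer, increments):
    """Rebuild every prediction layer from the first layer and the increments.

    Assumes each increment is the element-wise difference to the previous
    layer, so layer k equals layer k-1 plus increment k. Returns the list
    of layers from coarse to fine, starting with first_layer itself.
    """
    layers = [first_layer]
    for increment in increments:
        previous = layers[-1]
        layers.append([
            [p + d for p, d in zip(prev_row, inc_row)]
            for prev_row, inc_row in zip(previous, increment)
        ])
    return layers
```

Passing only the first few increments yields only the coarser layers, so a decoder serving a lower display level can stop early without touching the remaining increments.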
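By calculating at least two layers of prediction information from the obtained first-layer prediction information and several layers of prediction information increments, the video decoding method of this embodiment realizes hierarchical decoding of stereoscopic views and meets the stereoscopic view display requirements of various stereoscopic display devices at different display levels. Because enhancement layer decoding operates on several layers of prediction information increments, less information is transmitted over the network, the demand on network transmission bandwidth is lower, and transmission efficiency improves. The base layer code is also decoded, and a two-dimensional view can be displayed from the decoded base layer information, keeping compatibility with two-dimensional display.
Embodiment 2 of another video decoding method provided by the present invention:
FIG. 19 is a flowchart of Embodiment 2 of another video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 2 of the other video encoding method and specifically includes the following steps:
Step 1101: demultiplex the received encoded information to obtain a base layer code and an enhancement layer code.
Step 1102: decode the base layer code to obtain the left-eye view serving as the reference view.
Step 1103: decode the enhancement layer code to obtain sparse depth/disparity information and a dense depth/disparity information increment.
Step 1104: calculate dense depth/disparity information according to the sparse depth/disparity information and the dense depth/disparity information increment.
Step 1105: predict the right-eye view according to the sparse depth/disparity information, the dense depth/disparity information, and the left-eye view.
By calculating two layers of prediction information from the obtained sparse prediction information and dense prediction information increment, the video decoding method of this embodiment realizes hierarchical decoding of stereoscopic views and meets the stereoscopic view display requirements of various stereoscopic display devices at different display levels. Because enhancement layer decoding operates on the dense prediction information increment, less information is transmitted over the network, the demand on network transmission bandwidth is lower, and transmission efficiency improves. The base layer code is also decoded, and a two-dimensional view can be displayed from the decoded base layer information, keeping compatibility with two-dimensional display.
Embodiment 3 of another video decoding method provided by the present invention: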
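FIG. 20 is a flowchart of Embodiment 3 of another video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 4 of the other video encoding method and specifically includes the following steps:
Step 1201: demultiplex the received encoded information to obtain a base layer code and an enhancement layer code.
Step 1202: decode the base layer code to obtain the left-eye view serving as the reference view.
Step 1203: decode the enhancement layer code to obtain sparse depth/disparity information, a dense depth/disparity information increment, and a fine depth/disparity information increment.
Step 1204: calculate dense depth/disparity information according to the sparse depth/disparity information and the dense depth/disparity information increment; calculate fine depth/disparity information according to the dense depth/disparity information and the fine depth/disparity information increment.
Step 1205: predict the right-eye view according to the sparse, dense, and fine depth/disparity information and the left-eye view.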
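Step 1204 can be written compactly. The relation below assumes that an increment is the element-wise difference between consecutive layers, which the text does not state explicitly; the symbols are introduced here for illustration, with $D_s$, $D_d$, $D_f$ denoting the sparse, dense, and fine depth/disparity information and $\Delta_d$, $\Delta_f$ the dense and fine increments.

```latex
D_d = D_s + \Delta_d, \qquad D_f = D_d + \Delta_f = D_s + \Delta_d + \Delta_f
```

Under this assumption the fine layer can be recovered only after the dense layer, which is why the two calculations in step 1204 are chained in that order.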
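During encoding, at least one stereoscopic view display level was obtained by analyzing the display device and/or the network transmission information, and the corresponding three-layer prediction information structure of sparse, dense, and fine depth/disparity information was obtained according to that display level. During decoding it is therefore sufficient to decode the enhancement layer code directly to obtain the three layers of depth/disparity information.
In the video decoding method of this embodiment, the prediction information may also be motion vector information, or a combination of depth/disparity information and motion vector information.
By calculating at least two layers of depth/disparity information from the obtained first-layer depth/disparity information and several layers of depth/disparity information increments, the video decoding method of this embodiment realizes hierarchical decoding of stereoscopic views. The right-eye view is predicted from the left-eye view, so a stereoscopic view can be displayed from the left-eye view and the predicted right-eye view, meeting the stereoscopic view display requirements of various stereoscopic display devices at different display levels. Because enhancement layer decoding operates on several layers of depth/disparity information increments, less information is transmitted over the network, the demand on network transmission bandwidth is lower, and transmission efficiency improves. The base layer code is also decoded, and a two-dimensional view can be displayed from the decoded base layer information, keeping compatibility with two-dimensional display.
Embodiment 4 of another video decoding method provided by the present invention:
FIG. 21 is a flowchart of Embodiment 4 of another video decoding method provided by the present invention. The decoding method of this embodiment corresponds to Embodiment 3 of the other video encoding method and differs from Embodiment 3 of the other video decoding method above as follows:
Because the three-layer prediction information structure was determined during encoding by presetting the number and levels of prediction information layers to be extracted, the decoding process may further include, before step 1203:
Step 12021: analyze the request information of the display device, obtain at least one stereoscopic view display level required by the various display devices, and determine the total number and levels of enhancement layers to decode according to the display level.
Further, step 1203 is specifically: decode the enhancement layer code according to the determined total number and levels of enhancement layers to obtain sparse depth/disparity information and at least one layer of depth/disparity information increments; the at least one layer of increments may be a dense depth/disparity information increment alone, or a dense increment together with a fine depth/disparity information increment.
Building on Embodiment 3 of the other video decoding method above, this embodiment further decodes the enhancement layer code of the corresponding number of layers and levels according to the specific requirements of the display device and obtains depth/disparity information of the corresponding level, which improves decoding efficiency and reduces decoding complexity.
Embodiment 1 of another video decoder provided by the present invention: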
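FIG. 22 is a schematic structural diagram of Embodiment 1 of another video decoder provided by the present invention, which specifically includes:
a demultiplexing module 60, configured to demultiplex the received encoded information to obtain a base layer code and an enhancement layer code; a base layer decoding module 61, configured to decode the base layer code to obtain a first view serving as a reference view; an enhancement layer decoding module 62, configured to decode the enhancement layer code to obtain first-layer prediction information and several layers of prediction information increments; a calculation module 63, configured to calculate at least two layers of prediction information according to the first-layer prediction information and the several layers of prediction information increments; and a prediction module 64, configured to predict a second view according to the at least two layers of prediction information and the first view.
This embodiment may further include an analysis module 65, configured to analyze the request information of the display device, obtain the stereoscopic view display level required by the display device, and determine the total number of enhancement layers to decode according to the display level.
The decoder provided in this embodiment is applicable to Embodiments 1 to 4 of the other video decoding method provided by the present invention described above.
By providing the enhancement layer decoding module 62 and the calculation module 63, this embodiment obtains at least two layers of prediction information and realizes hierarchical decoding of stereoscopic views, meeting the stereoscopic view display requirements of various stereoscopic display devices at different display levels. Because enhancement layer decoding operates on several layers of prediction information increments, less information is transmitted over the network, the demand on network transmission bandwidth is lower, and transmission efficiency improves. The analysis module 65 can also obtain the specific requirements of the display device so that prediction information of the corresponding number of layers and levels is decoded, which improves decoding efficiency and reduces decoding complexity.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in those embodiments may still be modified, or some of their technical features replaced with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.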
Claims
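1. A video encoding method, characterized by comprising:
performing base layer coding on a first view serving as a reference view, and extracting at least one layer of prediction information by combining the locally decoded first view with a second view;
performing enhancement layer coding on each of the at least one layer of prediction information; and
multiplexing the enhancement layer code with the base layer code of the first view to obtain encoded information.
2. The video encoding method according to claim 1, characterized in that the prediction information is specifically motion vector information and/or depth/disparity information.
3. The video encoding method according to claim 1, characterized in that the base layer coding and the enhancement layer coding are discrete cosine transform coding with motion compensation.
4. The video encoding method according to any one of claims 1 to 3, characterized in that, before the performing base layer coding on the first view serving as a reference view, the method further comprises: analyzing request information of a display device and/or network transmission information to obtain at least one stereoscopic view display level required by the display device and/or the network;
and the extracting at least one layer of prediction information by combining the locally decoded first view with the second view is specifically: extracting, by combining the locally decoded first view with the second view, at least one layer of prediction information corresponding to the stereoscopic view display level required by the display device and/or the network.
5. A video encoder, characterized by comprising:
a base layer encoding module, configured to perform base layer coding on a first view serving as a reference view;
at least one prediction information extraction module, configured to extract at least one layer of prediction information by combining the locally decoded first view with a second view;
an enhancement layer encoding module, configured to perform enhancement layer coding on each of the at least one layer of prediction information; and
a multiplexing module, configured to multiplex the enhancement layer code with the base layer code of the first view to obtain encoded information.
6. The video encoder according to claim 5, characterized by further comprising: an analysis module, configured to analyze request information of a display device and/or network transmission information and obtain at least one stereoscopic view display level required by the display device and/or the network.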
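7. A video decoding method, characterized by comprising:
demultiplexing received encoded information to obtain a base layer code and an enhancement layer code;
decoding the base layer code to obtain a first view serving as a reference view;
decoding the enhancement layer code to obtain at least one layer of prediction information; and
predicting a second view according to the at least one layer of prediction information and the first view.
8. The video decoding method according to claim 7, characterized in that the prediction information is specifically motion vector information and/or depth/disparity information.
9. The video decoding method according to claim 7 or 8, characterized in that, before the decoding the enhancement layer code, the method further comprises: analyzing request information of a display device to obtain at least one stereoscopic view display level required by the display device;
and the decoding the enhancement layer code to obtain at least one layer of prediction information is specifically: decoding the enhancement layer code to obtain at least one layer of prediction information corresponding to the stereoscopic display level required by the display device.
10. A video decoder, characterized by comprising:
a demultiplexing module, configured to demultiplex received encoded information to obtain a base layer code and an enhancement layer code;
a base layer decoding module, configured to decode the base layer code to obtain a first view serving as a reference view;
an enhancement layer decoding module, configured to decode the enhancement layer code to obtain at least one layer of prediction information; and
a prediction module, configured to predict a second view according to the at least one layer of prediction information and the first view.
11. The video decoder according to claim 10, characterized by further comprising: an analysis module, configured to analyze request information of a display device and obtain at least one stereoscopic view display level required by the display device.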
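12. A video encoding method, characterized by comprising:
performing base layer coding on a first view serving as a reference view, and extracting first-layer prediction information by combining the locally decoded first view with a second view;
performing enhancement layer coding on the first-layer prediction information;
starting from extraction of the second-layer prediction information increment, extracting each current-layer prediction information increment as follows: extracting the current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer's prediction information, and performing enhancement layer coding on the current-layer prediction information increment; and so on, until enhancement layer coding has been performed on the last-layer prediction information increment; and
multiplexing the base layer code and the enhancement layer code to obtain encoded information.
13. The video encoding method according to claim 12, characterized in that the extracting the current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer's prediction information is specifically:
extracting current-layer prediction information by combining the locally decoded first view with the second view; and calculating the current-layer prediction information increment according to the current-layer prediction information and the previous layer's prediction information.
14. The video encoding method according to claim 13, characterized in that the extracting the current-layer prediction information increment is specifically: extracting a current-layer motion vector information increment and/or a current-layer depth/disparity information increment.
15. The video encoding method according to claim 14, characterized in that the base layer coding and the enhancement layer coding are discrete cosine transform coding with motion compensation.
16. The video encoding method according to any one of claims 12 to 15, characterized in that, before the performing enhancement layer coding on the first-layer prediction information, the method further comprises: analyzing request information of a display device and/or network transmission information, obtaining the stereoscopic view display level required by the display device and/or the network, and determining the levels and total number of layers of prediction information increments to extract according to the display level.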
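17. A video encoder, characterized by comprising:
a base layer encoding module, configured to perform base layer coding on a first view serving as a reference view;
at least two layers of prediction information extraction modules, in which the first-layer prediction information extraction module is connected to the base layer encoding module and is configured to extract first-layer prediction information by combining the locally decoded first view with a second view, and each prediction information extraction module other than the first-layer one is connected to the extraction module of the previous layer and is configured to extract the current-layer prediction information increment by combining the locally decoded first view, the second view, and the previous layer's prediction information;
an enhancement layer encoding module, configured to perform enhancement layer coding on the first-layer prediction information and the several layers of prediction information increments; and
a multiplexing module, configured to multiplex the base layer code and the enhancement layer code to obtain encoded information.
18. The video encoder according to claim 17, characterized by further comprising: an analysis module, configured to analyze request information of a display device and/or network transmission information, obtain the stereoscopic view display level required by the display device and/or the network, and determine the levels and total number of layers of prediction information increments to extract according to the display level.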
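19. A video decoding method, characterized by comprising:
demultiplexing received encoded information to obtain a base layer code and an enhancement layer code;
decoding the base layer code to obtain a first view serving as a reference view;
decoding the enhancement layer code to obtain first-layer prediction information and several layers of prediction information increments;
calculating at least two layers of prediction information according to the first-layer prediction information and the several layers of prediction information increments; and
predicting a second view according to the at least two layers of prediction information and the first view.
20. The video decoding method according to claim 19, characterized in that the obtaining first-layer prediction information and several layers of prediction information increments is specifically: obtaining first-layer motion vector information and/or first-layer depth/disparity information, and several layers of motion vector information increments and/or several layers of depth/disparity information increments.
21. The video decoding method according to claim 19 or 20, characterized in that, before the decoding the enhancement layer code, the method further comprises: analyzing request information of a display device, obtaining the stereoscopic view display level required by the display device, and determining the levels and total number of enhancement layers to decode according to the display level.
22. A video decoder, characterized by comprising:
a demultiplexing module, configured to demultiplex received encoded information to obtain a base layer code and an enhancement layer code;
a base layer decoding module, configured to decode the base layer code to obtain a first view serving as a reference view;
an enhancement layer decoding module, configured to decode the enhancement layer code to obtain first-layer prediction information and several layers of prediction information increments;
a calculation module, configured to calculate at least two layers of prediction information according to the first-layer prediction information and the several layers of prediction information increments; and
a prediction module, configured to predict a second view according to the at least two layers of prediction information and the first view.
23. The video decoder according to claim 22, characterized by further comprising: an analysis module, configured to analyze request information of a display device, obtain the stereoscopic view display level required by the display device, and determine the levels and total number of enhancement layers to decode according to the display level.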
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP20080853028 EP2207352A4 (en) | 2007-10-24 | 2008-10-14 | VIDEO CODING AND DECODING METHOD AND VIDEO CODING AND DECODING DEVICE |
JP2010530253A JP5232866B2 (ja) | 2007-10-24 | 2008-10-14 | ビデオ符号化方法、ビデオ復号方法、ビデオコーダおよびビデオデコーダ |
US12/766,384 US20100202540A1 (en) | 2007-10-24 | 2010-04-23 | Video coding method, video decoding method, video coder, and video decorder |
US14/323,503 US20140313291A1 (en) | 2007-10-24 | 2014-07-03 | Video coding method, video decoding method, video coder, and video decoder |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200710176288.8 | 2007-10-24 | ||
CN2007101762888A CN101420609B (zh) | 2007-10-24 | 2007-10-24 | 视频编码、解码方法及视频编码器、解码器 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/766,384 Continuation US20100202540A1 (en) | 2007-10-24 | 2010-04-23 | Video coding method, video decoding method, video coder, and video decorder |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2009065325A1 true WO2009065325A1 (en) | 2009-05-28 |
Family
ID=40631169
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2008/072675 WO2009065325A1 (en) | 2007-10-24 | 2008-10-14 | A video encoding/decoding method and a video encoder/decoder |
Country Status (5)
Country | Link |
---|---|
US (2) | US20100202540A1 (zh) |
EP (1) | EP2207352A4 (zh) |
JP (1) | JP5232866B2 (zh) |
CN (1) | CN101420609B (zh) |
WO (1) | WO2009065325A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2012533925A (ja) * | 2009-07-17 | 2012-12-27 | サムスン エレクトロニクス カンパニー リミテッド | 多視点映像符号化及び復号化のための方法及び装置 |
Families Citing this family (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2596637B1 (en) | 2010-07-21 | 2021-08-04 | Dolby Laboratories Licensing Corporation | Systems and methods for multi-layered frame-compatible video delivery |
US20120044321A1 (en) * | 2010-08-18 | 2012-02-23 | Electronics And Telecommunications Research Institute | Apparatus and method for monitoring broadcasting service in digital broadcasting system |
CN102055984B (zh) * | 2011-01-27 | 2012-10-03 | 山东大学 | 2d和3d播放模式平滑切换的立体视频解码结构及工作方法 |
CN102281446B (zh) * | 2011-09-20 | 2013-07-03 | 西南交通大学 | 一种分布式视频编码中基于视觉感知特性的量化方法 |
US10764604B2 (en) * | 2011-09-22 | 2020-09-01 | Sun Patent Trust | Moving picture encoding method, moving picture encoding apparatus, moving picture decoding method, and moving picture decoding apparatus |
JP5735181B2 (ja) | 2011-09-29 | 2015-06-17 | ドルビー ラボラトリーズ ライセンシング コーポレイション | デュアルレイヤフレームコンパチブルフル解像度立体3dビデオ配信 |
TWI595770B (zh) | 2011-09-29 | 2017-08-11 | 杜比實驗室特許公司 | 具有對稱圖像解析度與品質之圖框相容全解析度立體三維視訊傳達技術 |
CN107241606B (zh) | 2011-12-17 | 2020-02-21 | 杜比实验室特许公司 | 解码系统、方法和设备以及计算机可读介质 |
CN102710949B (zh) * | 2012-05-11 | 2014-06-04 | 宁波大学 | 一种基于视觉感知的立体视频编码方法 |
US9998764B2 (en) | 2012-07-09 | 2018-06-12 | Vid Scale, Inc. | Codec architecture for multiple layer video coding |
WO2014034480A1 (ja) * | 2012-09-03 | 2014-03-06 | ソニー株式会社 | 画像処理装置および方法 |
US9900609B2 (en) | 2013-01-04 | 2018-02-20 | Nokia Technologies Oy | Apparatus, a method and a computer program for video coding and decoding |
KR101806949B1 (ko) * | 2013-03-13 | 2017-12-08 | 후아웨이 테크놀러지 컴퍼니 리미티드 | 깊이 룩업 테이블을 코딩하는 방법 |
US10469866B2 (en) | 2013-04-05 | 2019-11-05 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding and decoding video with respect to position of integer pixel |
CN105103562B (zh) * | 2013-04-05 | 2019-07-12 | 夏普株式会社 | 随机接入点画面 |
US11184599B2 (en) * | 2017-03-15 | 2021-11-23 | Pcms Holdings, Inc. | Enabling motion parallax with multilayer 360-degree video |
US10939086B2 (en) * | 2018-01-17 | 2021-03-02 | Mediatek Singapore Pte. Ltd. | Methods and apparatus for encoding and decoding virtual reality content |
CN113574568A (zh) * | 2019-03-26 | 2021-10-29 | 松下电器(美国)知识产权公司 | 三维数据编码方法、三维数据解码方法、三维数据编码装置、以及三维数据解码装置 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5619256A (en) * | 1995-05-26 | 1997-04-08 | Lucent Technologies Inc. | Digital 3D/stereoscopic video compression technique utilizing disparity and motion compensated predictions |
CN1175859A (zh) * | 1996-07-03 | 1998-03-11 | 德来怀通用仪器公司 | 立体数字式视频编码的速率控制 |
US6144701A (en) * | 1996-10-11 | 2000-11-07 | Sarnoff Corporation | Stereoscopic video coding and decoding apparatus and method |
CN1450816A (zh) * | 2003-04-22 | 2003-10-22 | 上海大学 | 一种立体视频流编码/解码器及其立体视频编解码系统 |
CN1742488A (zh) * | 2002-12-27 | 2006-03-01 | 韩国电子通信研究院 | 用于编码和解码立体视频的方法和装置 |
CN1954606A (zh) * | 2004-05-21 | 2007-04-25 | 韩国电子通信研究院 | 通过使用三维立体视频附加数据来发送/接收三维立体数字广播信号的装置和方法 |
WO2007047736A2 (en) * | 2005-10-19 | 2007-04-26 | Thomson Licensing | Multi-view video coding using scalable video coding |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6055012A (en) * | 1995-12-29 | 2000-04-25 | Lucent Technologies Inc. | Digital multi-view video compression with complexity and compatibility constraints |
US6057884A (en) * | 1997-06-05 | 2000-05-02 | General Instrument Corporation | Temporal and spatial scaleable coding for video object planes |
US6496980B1 (en) * | 1998-12-07 | 2002-12-17 | Intel Corporation | Method of providing replay on demand for streaming digital multimedia |
JP2001142166A (ja) * | 1999-09-15 | 2001-05-25 | Sharp Corp | 3dカメラ |
FI120125B (fi) * | 2000-08-21 | 2009-06-30 | Nokia Corp | Kuvankoodaus |
CN100394800C (zh) * | 2003-01-14 | 2008-06-11 | 日本电信电话株式会社 | 编码方法、解码方法、编码装置及解码装置 |
US7227894B2 (en) * | 2004-02-24 | 2007-06-05 | Industrial Technology Research Institute | Method and apparatus for MPEG-4 FGS performance enhancement |
US8773494B2 (en) * | 2006-08-29 | 2014-07-08 | Microsoft Corporation | Techniques for managing visual compositions for a multimedia conference call |
- 2007-10-24 CN CN2007101762888A patent/CN101420609B/zh not_active Expired - Fee Related
- 2008-10-14 EP EP20080853028 patent/EP2207352A4/en not_active Withdrawn
- 2008-10-14 JP JP2010530253A patent/JP5232866B2/ja active Active
- 2008-10-14 WO PCT/CN2008/072675 patent/WO2009065325A1/zh active Application Filing
- 2010-04-23 US US12/766,384 patent/US20100202540A1/en not_active Abandoned
- 2014-07-03 US US14/323,503 patent/US20140313291A1/en not_active Abandoned
Non-Patent Citations (1)
Title |
---|
See also references of EP2207352A4 * |
Also Published As
Publication number | Publication date |
---|---|
US20140313291A1 (en) | 2014-10-23 |
US20100202540A1 (en) | 2010-08-12 |
EP2207352A1 (en) | 2010-07-14 |
JP5232866B2 (ja) | 2013-07-10 |
EP2207352A4 (en) | 2011-06-08 |
CN101420609B (zh) | 2010-08-25 |
CN101420609A (zh) | 2009-04-29 |
JP2011501581A (ja) | 2011-01-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2009065325A1 (en) | A video encoding/decoding method and a video encoder/decoder | |
CN109218734B (zh) | 用于提供媒体内容的方法和装置 | |
EP3334164B1 (en) | A method and an apparatus and a computer program product for video encoding and decoding | |
US20180167634A1 (en) | Method and an apparatus and a computer program product for video encoding and decoding | |
Smolic et al. | An overview of available and emerging 3D video formats and depth enhanced stereo as efficient generic solution | |
Chen et al. | Overview of the MVC+ D 3D video coding standard | |
CN1204757C (zh) | 一种立体视频流编码/解码器及其立体视频编解码系统 | |
Fehn et al. | Asymmetric coding of stereoscopic video for transmission over T-DMB | |
US9473788B2 (en) | Frame-compatible full resolution stereoscopic 3D compression and decompression | |
WO2009052730A1 (en) | Video encoding decoding method and device and video codec | |
KR101653319B1 (ko) | 3d 영상을 위한 영상 컴포넌트 송수신 처리 방법 및 장치 | |
CN105103545B (zh) | 用于多视图自动立体显示器的深度图传送格式 | |
EP2334092A1 (en) | Methods and apparatuses for encoding, decoding, and displaying a stereoscopic 3D image | |
TW201041402A (en) | Image signal decoding device, image signal decoding method, image signal encoding device, image signal encoding method, and program | |
EP2654305A2 (en) | Signaling method for a stereoscopic video service and apparatus using the method | |
WO2012100537A1 (zh) | 辅助视频补充信息承载方法、处理方法、装置与系统 | |
CN102065296A (zh) | 一种三维立体视频编码方法 | |
CN109451293B (zh) | 一种自适应立体视频传输系统和方法 | |
KR101386651B1 (ko) | 다시점 비디오 인코딩 및 디코딩 방법 및 이를 이용한 인코딩 및 디코딩 장치 | |
TW201138425A (en) | Method and system for rendering 3D graphics based on 3D display capabilities | |
Kimata et al. | Interactive panorama video distribution system | |
Jung et al. | Disparity-map-based rendering for mobile 3D TVs | |
KR20070046356A (ko) | 3차원 동영상을 다중화 방송 서비스로 송수신하는 시스템및 방법 | |
Kumar et al. | A Comparative Analysis of Advance Three Dimensional Video Coding for Mobile Three Dimensional TV | |
KR101536747B1 (ko) | 자유 시점 영상 전송을 위한 패킷구조형 송신장치 및 송신방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 08853028; Country of ref document: EP; Kind code of ref document: A1 |
REEP | Request for entry into the european phase | Ref document number: 2008853028; Country of ref document: EP |
WWE | Wipo information: entry into national phase | Ref document number: 2010530253; Country of ref document: JP. Ref document number: 2008853028; Country of ref document: EP |
NENP | Non-entry into the national phase | Ref country code: DE |