WO2007081117A1 - Method and apparatus for inter-viewing reference in multi-viewpoint video coding - Google Patents
Method and apparatus for inter-viewing reference in multi-viewpoint video coding Download PDFInfo
- Publication number
- WO2007081117A1 WO2007081117A1 PCT/KR2007/000083 KR2007000083W WO2007081117A1 WO 2007081117 A1 WO2007081117 A1 WO 2007081117A1 KR 2007000083 W KR2007000083 W KR 2007000083W WO 2007081117 A1 WO2007081117 A1 WO 2007081117A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- picture
- view
- time
- inter
- point
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/114—Adapting the group of pictures [GOP] structure, e.g. number of B-frames between two anchor frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
Definitions
- the present invention relates to an inter-view frame reference method and apparatus for coding multi-view video data captured by a plurality of cameras having different viewpoints.
- the H.264 standard for video encoding refers to various frames that temporally precede or follow a current frame at the same viewpoint as the view point of the current frame.
- FIG. 1 is a block diagram of an inter-view frame reference apparatus for encoding multi-view video data according to an exemplary embodiment of the present invention.
- FIG. 2 illustrates a coding structure of a multi-view Group Of Pictures (GOP) suggested for inter-view referencing of multi-view video data according to an exemplary embodiment of the present invention.
- GOP Group Of Pictures
- FIG. 3 illustrates an inter-view frame reference scheme for encoding a Predictive (P)-picture in a camera type having a primary parallel and arch structure.
- FIG. 4 illustrates an inter-view frame reference scheme for encoding a Bidirectional (B)-picture in a camera type having a primary parallel and arch structure.
- FIG. 5 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure.
- FIG. 6 illustrates another inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure.
- FIG. 7 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary parallel structure (3x5).
- FIG. 8 is a flowchart of an inter-view frame reference method for encoding multi- view video data according to an exemplary embodiment of the present invention.
- FIGS. 9A and 9B illustrate test environments for testing the effect of an inter-view frame reference method according to an exemplary embodiment of the present invention.
- FIGS. 1OA through 1 OG illustrate rate-distortion (RD) curves based on the test environments illustrated in FIGS. 9A and 9B.
- RD rate-distortion
- the present invention suggests a technique for referring to frames at different viewpoints from the viewpoint of a current frame to be encoded and at the same point of time as the point of time of the current frame or frames at different viewpoints and at different points of time from the viewpoint and the point of time of the current frame in multi-view video coding, thereby reducing a residual signal and thus improving a compression rate when compared to a technique for referring to a temporally past or future frame at the same viewpoint as the viewpoint of the current frame.
- a compression rate can be improved. Therefore, frames at different viewpoints from the viewpoint of a current frame to be encoded and at the same point of time as the point of time of the current frame or frames of different viewpoints and at different points of time from the viewpoint and the point of time of the current frame are referred to, reducing a residual signal and thus improving a compression rate when compared to a technique for referring to a temporally past or future frame at the same viewpoint as the viewpoint of the current frame.
- an inter-view frame reference apparatus for encoding multi-view video data
- the inter-view frame reference apparatus comprising: a multi-view Group Of Pictures (GOP) arrangement unit arranging a multi-view GOP so that the multi-view GOP includes GOPs, wherein each of GOPs corresponds to each of at least one viewpoint and coding structures of each GOP are the same as one another; and a cross-view reference unit referring to pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs.
- GEP Group Of Pictures
- the inter-view frame reference apparatus may further include a same-view reference unit referring to previously encoded pictures of same viewpoint.
- a basic GOP may be an H.264/AVC GOP.
- Encoding may be performed in the order of an Intra-predictive (I) picture, a Predictive (P) picture, a reference Bidirectional (rB) picture, and then a Bidirectional (B) picture in the GOP of each viewpoint.
- I Intra-predictive
- P Predictive
- rB reference Bidirectional
- B Bidirectional
- the cross-view reference unit may refer to at least one of b(i-1 , t), b(i-1 , t-n), and b(i+1 , t- n) so as to encode the current P picture b(i, t) and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t.
- the cross-view reference unit may refer to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current rB picture and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture or a P picture is encoded after the point of time t.
- the cross-view reference unit may refer to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current B picture and (t-n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded after the point of time t.
- an interview frame reference method for encoding multi-view video data comprising a multi-view Group Of Pictures (GOP) arrangement operation of arranging a multi-view GOP so that the multi-view GOP includes GOPs 1 wherein each of GOPs corresponds to each of at least one viewpoint and coding structures of each GOP are the same as one another; and a cross-view reference operation of referring to pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs.
- GEP Group Of Pictures
- FIG. 1 is a block diagram of an inter-view frame reference apparatus 100 for encoding multi-view video data according to an exemplary embodiment of the present invention.
- the inter-view frame reference apparatus 100 includes a multi-view Group Of Pictures (GOP) arrangement unit 110, a cross-view reference unit 120, and a same- view reference unit 130.
- GOP Group Of Pictures
- the multi-view GOP arrangement unit 110 includes GOPs, wherein each of GOPs corresponds to each of at least one viewpoint.
- Each of the GOPs includes an Instantaneous Decoder Refresh (IDR) picture at the initial point of time of encoding and each of the GOPs in the multi-view GOP has the same coding structure (see FIG. 2).
- IDR Instantaneous Decoder Refresh
- a multi-view GOP arranged by the multi-view GOP arrangement unit 110 includes not only a H.264/AVC GOP but also other viewpoints along a viewpoint axis and each of the GOPs includes an IDR picture at the initial point of time of encoding along a time axis.
- encoding is performed in the order of an Intra-predictive (I) picture, a Predictive (P) picture, a reference Bidirectional (rB) picture, and then a Bidirectional (B) picture.
- the cross-view reference unit 120 improves a compression rate by referring to frames at most spatially and temporally adjacent viewpoints. More specifically, the cross-view reference unit 120 refers to frames (or pictures) of different viewpoints at the same point of time and frames (or pictures) of different viewpoints at different point of time in a multi-view GOP, except for a basic GOP that only makes a temporal frame reference at its viewpoint, an IDR picture, and pictures at the last viewpoint.
- pictures referred to by the cross-view reference unit 120 in order to encode a P picture, a reference B (rB) picture, and a B picture are as follows.
- the cross-view reference unit 120 refers to at least one of b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) in order to encode the current P picture b(i, t).
- (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t, which will be described with reference to FIG. 3.
- the cross-view reference unit 120 refers to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) to encode the current rB picture.
- (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture or a P picture is encoded after the point of time t, which will be described later with reference to FIG. 4.
- the cross-view reference unit 120 refers to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t- n), and b(i+1 , t-n) in order to encode the current B picture.
- (t-n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded after the point of time t, which will be described later with reference to FIG. 4.
- the cross-view reference unit 120 applies inter-view frame reference schemes differently according to camera types.
- FIGS. 3 and 4 illustrate an inter-view frame reference scheme in a camera type having a primary parallel and arch structure
- FIGS. 5 and 6 illustrate an inter-view frame reference scheme in a camera type having a secondary cross structure
- FIG. 7 illustrates an inter-view frame reference scheme in a camera type having a secondary parallel structure (3x5).
- the same-view reference unit 130 refers to previously encoded pictures at the same viewpoint in order to encode current picture, i.e., temporally preceding and following pictures at the same viewpoint.
- FIG. 2 illustrates the coding structure of a multi-view GOP suggested for interview frame referencing of multi-view video data according to an exemplary embodiment of the present invention.
- a multi-view GOP 200 includes at least one H.264/AVC GOP and each of GOPs js ⁇ rresponding Jo-each-of -viewpoints. — The coding-struetures -of-the G ⁇ Ps-are- the same as one another.
- the H.264/AVC GOP makes only a temporal frame reference at its viewpoint and does not make an inter-view frame avireference. Any viewpoint in multi-view video data can be selected as the H.264/AVC GOR
- the coding structure of a GOP at each viewpoint may be I B rB B P B r ⁇ B P.
- the coding structure of a GOP may also be I rB rB rB P rB rB rB P.
- rB is a B picture that can be used as a reference picture.
- FIG. 3 illustrates an inter-view frame reference scheme for encoding a P picture in a camera type having a primary parallel and arch structure.
- P pictures of a multi- view GOP refer to a P picture at its left viewpoint and at the same point of time, an I picture or a P picture at its left viewpoint, which has been encoded immediately prior to the point of time t, and an I picture or a P picture at its right viewpoint, which has been encoded immediately prior to the point of time t.
- the last camera refers to a P picture from the left camera and at the same point of time and an I picture or a P picture from the left camera, which has been encoded immediately prior to the point of time t.
- FIG. 4 illustrates an inter-view frame reference scheme for encoding a B picture in a camera type having a primary parallel and arch structure.
- multi-view GOPs For example, for inter-view frame referencing of rB pictures that are located temporally between I pictures and P pictures, multi-view GOPs), except for an H.264/ AVC GOP and pictures at the last viewpoint, refer to an rB picture from the left camera, which has been encoded at the same point of time, a P picture from the left camera, which is to be encoded immediately after the point of time t, and a P picture from the right camera, which is to be encoded immediately after the point of time t.
- the last camera refers to a B picture from the left camera, which has been encoded at the same point of time, and a P picture from the left camera, which is to be encoded immediately after the point of time t.
- an rB picture indicated by an index 1 can refer to three pictures such as (O, 20, 22), (0, 15, 17), or the like out of pictures (0, 20, 22, 15, 17).
- multi-view GOPs For inter-view frame referencing of B pictures that are located temporally between rB pictures and P or I pictures, multi-view GOPs, except for an H.264/ AVC GOP and pictures at the last viewpoint, refer to an rB picture from the left camera, which has been encoded immediately prior to the point of time t, a P picture or an rB picture from the left camera, which is to be encoded immediately after the point of time t, and a P picture or an rB picture from the right camera, which is to be encoded immediately after the point of time t.
- the last camera refers to an rB picture from the left camera, which has been encoded immediately prior to the point of time t, and a P picture or an rB picture from the left camera, which is to be encoded immediately after the point of time t.
- a B picture indicated by an index 6 can refer to three pictures such as (0, 2, 15), (0, 15, 17), or the like out of pictures (0, 2, 15, 17).
- FIG. 5 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure.
- a plurality of cameras while arranged in the secondary cross structure, capture video data.
- Indices 0 through 4 indicate the encoding orders of pictures and arrows indicate inter-view frames to be referred to.
- Multi-view GOPs refer to inter-view frames as follows.
- the central viewpoint has no previously encoded viewpoint at the same point of time and thus does not make an inter-view frame reference.
- the left viewpoint refers to a picture at the central viewpoint, which has been encoded at the same point of time.
- the top viewpoint refers to the picture at the central viewpoint encoded at the same point of time and a picture at the left viewpoint.
- the right viewpoint refers to the picture at the central viewpoint encoded at the same point of time and the picture at the top viewpoint.
- the bottom viewpoint refers to the picture at the central viewpoint encoded at the same point of time and the pictures at the left and right viewpoints.
- FIG. 6 illustrates another inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure.
- an inter-view frame reference is made like in FIG. 6 and not like in FIG. 5.
- multi-view GOPs except for an H.264/AVC GOP, refer to interview frames as follows.
- the central viewpoint has no previously encoded viewpoint at the same point of time and thus does not make an inter-view frame reference.
- the left viewpoint refers to a picture at the central viewpoint, which has been encoded at the same point of time.
- the right viewpoint refers to the picture at the central viewpoint encoded at the same point of time and a picture at the left viewpoint.
- the top viewpoint refers to the picture at the central viewpoint encoded at the same point of time and the pictures at the left and right viewpoints.
- the bottom viewpoint reters to tne picture at the central viewpoint encoded at the same point of time and the pictures at the left and right viewpoints.
- FIG. 7 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary parallel structure (3x5).
- Indices 0 through 14 indicate the encoding orders of pictures and arrows indicate inter-view frames to be referred to.
- Multi-view GOPs except for an H.264/AVC GOP 1 refer to inter-view frames as follows.
- the central viewpoint indicated by an index 0 has no previously encoded viewpoint at the same point of time and thus does not make an inter-view frame reference.
- a viewpoint indicated by the index 1 refers to a picture (0) at the central viewpoint encoded at the same point of time.
- a viewpoint indicated by the index 2 refers to the picture at the central viewpoint (0) and a picture at its left viewpoint (1).
- a viewpoint indicated by the index 3 refers to the picture at the central viewpoint (0) and pictures at its top (2) and left (1 ) viewpoints.
- a viewpoint indicated by the index 4 refers to the picture at the central viewpoint (0) and pictures at its upper left (1 ) and right (3) viewpoints.
- a left viewpoint indicated by an index 9 among viewpoints in the middle row, except for the central viewpoint 0 and its neighboring viewpoints 1 and 3, refers to pictures at its upper right viewpoint (5), its right viewpoint (1 ), and its lower right viewpoint (7).
- a right viewpoint indicated by 10 refers to pictures at its upper left viewpoint (6), its left viewpoint (3), and its lower left viewpoint (8).
- Each of upper left viewpoints 5 and 11 with respect to the central viewpoint (0) refers to pictures at its right viewpoint (2 or 5), its lower right viewpoint (0 or 1 ), and its lower viewpoint (1 or 9).
- Each of upper right viewpoints 6 and 12 with respect to the central viewpoint 0 refers to pictures at its left viewpoint, its lower left viewpoint, and its lower viewpoint.
- Each of lower left viewpoints 7 and 13 with respect to the central viewpoint 0 refers to pictures at its right viewpoint, its upper right viewpoint, and its upper viewpoint.
- Each of lower right viewpoints 8 and 14 with respect to the central viewpoint 0 refers to pictures at its left viewpoint, its lower left viewpoint, and its lower viewpoint.
- FIG. 8 is a flowchart of an inter-view frame reference method for encoding multi- view video data according to an exemplary embodiment of the present invention.
- the inter-view frame reference method arranges a multi-view GOP for same-view referencing and cross-view referencing and then refers to a picture at a spatially and temporally adjacent viewpoint among previously encoded pictures at viewpoints.
- a multi-view GOP used for cross-view reference is arranged in operation S810 so that the multi-view GOP includes GOPs, wherein each of the GOPs corresponds to each of at least one viewpoint and each of the GOPs includes an IDR picture at the initial point of time of encoding and coding structures of each GOP are the same as one another in the multi-view GOP.
- pictures are referred in spatially closer viewpoints sequence among previously encoded pictures of different viewpoints, except for a basic GOP that makes only a temporal frame reference at its viewpoint, the IDR picture, and pictures at the last point of time.
- a picture at the same point of time is referred to for multi-view video encoding.
- a temporally past or future frame at the same viewpoint is referred to.
- Same-view referencing and cross-view referencing may be performed simultaneously or in a different order.
- FIGS. 9A and 9B illustrate test environments for testing the effect of an inter-view frame reference method according to an exemplary embodiment of the present invention.
- FIGS. 9A and 9B For a test data set of FIG. 9A, spatial resolutions, temporal resolutions, camera arrangement, and bit-rates for multi-view sequences are given as shown in FIGS. 9A and 9B.
- FIGS. 1OA through 1OG illustrate rate-distortion (RD) curves based on the test environments illustrated in FIGS. 9A and 9B
- FIGS. 1OA through 1 OG RD curves with respect to multi-view test sequences set in the environments of FIGS. 9A and 9B are shown. It can be seen from FIGS. 1OA through 1OG that a Peak Signal to Noise Ratio (PSNR) of 1.2 to 2dB can be obtained for all bitrates of all the multi-view test sequences.
- PSNR Peak Signal to Noise Ratio
- the present invention can also be embodied as a computer-readable code on a computer-readable recording medium.
- the computer-readable recording medium is any data storage device that can store data which can be thereafter read by a computer system.
- Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (transmission over the Internet).
- ROM read-only memory
- RAM random-access memory
- CD-ROMs compact disc-read only memory
- magnetic tapes magnetic tapes
- floppy disks magnetic tapes
- optical data storage devices optical data storage devices
- carrier waves transmission over the Internet
- carrier waves transmission over the Internet
- the computer-readable recording medium can also be distributed over network coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion.
- function programs, codes, and code segments for implementing the present invention can be easily construed by those skilled in the art.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Provided is an inter-view frame reference method and apparatus used in multi-view video encoding. In the present invention, pictures at the sequence of spatially closer viewpoints are preferentially referred to from among pictures at previously encoded viewpoints in each Group Of Pictures (GOP), except for a basic GOP that makes only a temporal frame reference at its viewpoint, an Instantaneous Decoder Refresh (IDR) picture, and pictures at the last point of time of encoding.
Description
METHOD AND APPARATUS FOR INTER-VIEWING REFERENCE IN MULTI- VIEWPOINT VIDEO CODING
TECHNICAL FIELD
The present invention relates to an inter-view frame reference method and apparatus for coding multi-view video data captured by a plurality of cameras having different viewpoints.
BACKGROUND ART
At present, the H.264 standard for video encoding refers to various frames that temporally precede or follow a current frame at the same viewpoint as the view point of the current frame.
DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram of an inter-view frame reference apparatus for encoding multi-view video data according to an exemplary embodiment of the present invention.
FIG. 2 illustrates a coding structure of a multi-view Group Of Pictures (GOP) suggested for inter-view referencing of multi-view video data according to an exemplary embodiment of the present invention.
FIG. 3 illustrates an inter-view frame reference scheme for encoding a Predictive (P)-picture in a camera type having a primary parallel and arch structure.
FIG. 4 illustrates an inter-view frame reference scheme for encoding a Bidirectional (B)-picture in a camera type having a primary parallel and arch structure.
FIG. 5 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure.
FIG. 6 illustrates another inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure.
FIG. 7 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary parallel structure (3x5).
FIG. 8 is a flowchart of an inter-view frame reference method for encoding multi- view video data according to an exemplary embodiment of the present invention.
FIGS. 9A and 9B illustrate test environments for testing the effect of an inter-view frame reference method according to an exemplary embodiment of the present
invention.
FIGS. 1OA through 1 OG illustrate rate-distortion (RD) curves based on the test environments illustrated in FIGS. 9A and 9B.
DETAILED DESCRIPTION OF THE INVENTION
TECHNICAL PROBLEM So far, reference has been made to frames at the same viewpoint.
TECHNICAL SOLUTION
The present invention suggests a technique for referring to frames at different viewpoints from the viewpoint of a current frame to be encoded and at the same point of time as the point of time of the current frame or frames at different viewpoints and at different points of time from the viewpoint and the point of time of the current frame in multi-view video coding, thereby reducing a residual signal and thus improving a compression rate when compared to a technique for referring to a temporally past or future frame at the same viewpoint as the viewpoint of the current frame.
ADVANTAGEOUS EFFECTS
By referring to frames at spatially and temporally adjacent viewpoints in multi- view video data, a compression rate can be improved. Therefore, frames at different viewpoints from the viewpoint of a current frame to be encoded and at the same point of time as the point of time of the current frame or frames of different viewpoints and at different points of time from the viewpoint and the point of time of the current frame are referred to, reducing a residual signal and thus improving a compression rate when compared to a technique for referring to a temporally past or future frame at the same viewpoint as the viewpoint of the current frame.
BEST MODE
According to an aspect of the present invention, there is provided an inter-view frame reference apparatus for encoding multi-view video data, the inter-view frame reference apparatus comprising: a multi-view Group Of Pictures (GOP) arrangement unit arranging a multi-view GOP so that the multi-view GOP includes GOPs, wherein each of GOPs corresponds to each of at least one viewpoint and coding structures of each GOP are the same as one another; and a cross-view reference unit referring to
pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs.
The inter-view frame reference apparatus may further include a same-view reference unit referring to previously encoded pictures of same viewpoint.
A basic GOP may be an H.264/AVC GOP.
Encoding may be performed in the order of an Intra-predictive (I) picture, a Predictive (P) picture, a reference Bidirectional (rB) picture, and then a Bidirectional (B) picture in the GOP of each viewpoint.
When a P picture to be encoded at an ith viewpoint and at a point of time t is b(i, t), the cross-view reference unit may refer to at least one of b(i-1 , t), b(i-1 , t-n), and b(i+1 , t- n) so as to encode the current P picture b(i, t) and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t.
When an rB picture to be encoded at the ith viewpoint and at the point of time t is b(i, t), the cross-view reference unit may refer to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current rB picture and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture or a P picture is encoded after the point of time t.
When a B picture to be encoded at the ith viewpoint and the point of time t is b(i, t), the cross-view reference unit may refer to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current B picture and (t-n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded after the point of time t.
According to another aspect of the present invention, there is provided an interview frame reference method for encoding multi-view video data, the inter-view frame reference method comprising a multi-view Group Of Pictures (GOP) arrangement operation of arranging a multi-view GOP so that the multi-view GOP includes GOPs1 wherein each of GOPs corresponds to each of at least one viewpoint and coding structures of each GOP are the same as one another; and a cross-view reference operation of referring to pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs.
Hereinafter, an exemplary embodiment of the present invention will be described in detail with reference to the annexed drawings. It should be noted that like
reference numerals refer to like elements throughout the specification. In the following description, a detailed description of known functions and configurations incorporated herein has been omitted for conciseness.
MODE OF THE INVENTION
FIG. 1 is a block diagram of an inter-view frame reference apparatus 100 for encoding multi-view video data according to an exemplary embodiment of the present invention.
The inter-view frame reference apparatus 100 includes a multi-view Group Of Pictures (GOP) arrangement unit 110, a cross-view reference unit 120, and a same- view reference unit 130.
The multi-view GOP arrangement unit 110 includes GOPs, wherein each of GOPs corresponds to each of at least one viewpoint. Each of the GOPs includes an Instantaneous Decoder Refresh (IDR) picture at the initial point of time of encoding and each of the GOPs in the multi-view GOP has the same coding structure (see FIG. 2).
A multi-view GOP arranged by the multi-view GOP arrangement unit 110 includes not only a H.264/AVC GOP but also other viewpoints along a viewpoint axis and each of the GOPs includes an IDR picture at the initial point of time of encoding along a time axis. In a GOP corresponding to each viewpoint, encoding is performed in the order of an Intra-predictive (I) picture, a Predictive (P) picture, a reference Bidirectional (rB) picture, and then a Bidirectional (B) picture.
The coding structure of a multi-view GOP suggested by an embodiment of the present invention for inter-view frame reference of multi-view video data will be described in more detail with reference to FIG. 2.
The cross-view reference unit 120 improves a compression rate by referring to frames at most spatially and temporally adjacent viewpoints. More specifically, the cross-view reference unit 120 refers to frames (or pictures) of different viewpoints at the same point of time and frames (or pictures) of different viewpoints at different point of time in a multi-view GOP, except for a basic GOP that only makes a temporal frame reference at its viewpoint, an IDR picture, and pictures at the last viewpoint.
More specifically, pictures referred to by the cross-view reference unit 120 in order to encode a P picture, a reference B (rB) picture, and a B picture are as follows.
When a P picture to be encoded at an ith viewpoint and at a point of time t is b(i, t), the cross-view reference unit 120 refers to at least one of b(i-1 , t), b(i-1 , t-n), and b(i+1 ,
t-n) in order to encode the current P picture b(i, t). In this case, (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t, which will be described with reference to FIG. 3.
When an rB picture that is to be encoded at the ith viewpoint and at the point of time t is b(i, t), the cross-view reference unit 120 refers to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) to encode the current rB picture. In this case, (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture or a P picture is encoded after the point of time t, which will be described later with reference to FIG. 4.
When a B picture to be encoded at the ith viewpoint and at the point of time t is b(i, t), the cross-view reference unit 120 refers to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t- n), and b(i+1 , t-n) in order to encode the current B picture. In this case, (t-n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded after the point of time t, which will be described later with reference to FIG. 4.
The cross-view reference unit 120 applies inter-view frame reference schemes differently according to camera types. FIGS. 3 and 4 illustrate an inter-view frame reference scheme in a camera type having a primary parallel and arch structure, FIGS. 5 and 6 illustrate an inter-view frame reference scheme in a camera type having a secondary cross structure, and FIG. 7 illustrates an inter-view frame reference scheme in a camera type having a secondary parallel structure (3x5).
The same-view reference unit 130 refers to previously encoded pictures at the same viewpoint in order to encode current picture, i.e., temporally preceding and following pictures at the same viewpoint.
FIG. 2 illustrates the coding structure of a multi-view GOP suggested for interview frame referencing of multi-view video data according to an exemplary embodiment of the present invention.
In FIG. 2, an x axis indicates a viewpoint and a y axis indicates a point of time. A multi-view GOP 200 includes at least one H.264/AVC GOP and each of GOPs jsαrresponding Jo-each-of -viewpoints. — The coding-struetures -of-the GΘPs-are- the same as one another.
The H.264/AVC GOP makes only a temporal frame reference at its viewpoint and
does not make an inter-view frame avireference. Any viewpoint in multi-view video data can be selected as the H.264/AVC GOR
The coding structure of a GOP at each viewpoint may be I B rB B P B rβ B P. Alternatively, the coding structure of a GOP may also be I rB rB rB P rB rB rB P. In this case, rB is a B picture that can be used as a reference picture.
FIG. 3 illustrates an inter-view frame reference scheme for encoding a P picture in a camera type having a primary parallel and arch structure.
In this case, a plurality of cameras, while arranged in the primary parallel and arch structure, capture video data. Indices O through 9 indicate the encoding orders of P pictures and arrows indicate inter-view frames that are to be referred to.
For example, for inter-view frame referencing of P pictures, P pictures of a multi- view GOP, except for an H.264/AVC GOP and pictures at the last viewpoint, refer to a P picture at its left viewpoint and at the same point of time, an I picture or a P picture at its left viewpoint, which has been encoded immediately prior to the point of time t, and an I picture or a P picture at its right viewpoint, which has been encoded immediately prior to the point of time t. The last camera refers to a P picture from the left camera and at the same point of time and an I picture or a P picture from the left camera, which has been encoded immediately prior to the point of time t.
FIG. 4 illustrates an inter-view frame reference scheme for encoding a B picture in a camera type having a primary parallel and arch structure.
For example, for inter-view frame referencing of rB pictures that are located temporally between I pictures and P pictures, multi-view GOPs), except for an H.264/ AVC GOP and pictures at the last viewpoint, refer to an rB picture from the left camera, which has been encoded at the same point of time, a P picture from the left camera, which is to be encoded immediately after the point of time t, and a P picture from the right camera, which is to be encoded immediately after the point of time t. The last camera refers to a B picture from the left camera, which has been encoded at the same point of time, and a P picture from the left camera, which is to be encoded immediately after the point of time t.
More specifically, for example, an rB picture indicated by an index 1 can refer to three pictures such as (O, 20, 22), (0, 15, 17), or the like out of pictures (0, 20, 22, 15, 17).
For inter-view frame referencing of B pictures that are located temporally between rB pictures and P or I pictures, multi-view GOPs, except for an H.264/ AVC
GOP and pictures at the last viewpoint, refer to an rB picture from the left camera, which has been encoded immediately prior to the point of time t, a P picture or an rB picture from the left camera, which is to be encoded immediately after the point of time t, and a P picture or an rB picture from the right camera, which is to be encoded immediately after the point of time t. The last camera refers to an rB picture from the left camera, which has been encoded immediately prior to the point of time t, and a P picture or an rB picture from the left camera, which is to be encoded immediately after the point of time t.
More specifically, for example, a B picture indicated by an index 6 can refer to three pictures such as (0, 2, 15), (0, 15, 17), or the like out of pictures (0, 2, 15, 17).
FIG. 5 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure.
In this case, a plurality of cameras, while arranged in the secondary cross structure, capture video data. Indices 0 through 4 indicate the encoding orders of pictures and arrows indicate inter-view frames to be referred to.
Multi-view GOPs, except for an H.264/AVC GOP, refer to inter-view frames as follows. The central viewpoint has no previously encoded viewpoint at the same point of time and thus does not make an inter-view frame reference. The left viewpoint refers to a picture at the central viewpoint, which has been encoded at the same point of time. The top viewpoint refers to the picture at the central viewpoint encoded at the same point of time and a picture at the left viewpoint. The right viewpoint refers to the picture at the central viewpoint encoded at the same point of time and the picture at the top viewpoint. The bottom viewpoint refers to the picture at the central viewpoint encoded at the same point of time and the pictures at the left and right viewpoints.
FIG. 6 illustrates another inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary cross structure. In this case, an inter-view frame reference is made like in FIG. 6 and not like in FIG. 5.
In other words, multi-view GOPs, except for an H.264/AVC GOP, refer to interview frames as follows. The central viewpoint has no previously encoded viewpoint at the same point of time and thus does not make an inter-view frame reference. The left viewpoint refers to a picture at the central viewpoint, which has been encoded at the same point of time. The right viewpoint refers to the picture at the central viewpoint encoded at the same point of time and a picture at the left viewpoint. The top viewpoint refers to the picture at the central viewpoint encoded at the same point of time
and the pictures at the left and right viewpoints. The bottom viewpoint reters to tne picture at the central viewpoint encoded at the same point of time and the pictures at the left and right viewpoints.
FIG. 7 illustrates an inter-view frame reference scheme for encoding a P picture and a B picture in a camera type having a secondary parallel structure (3x5).
In this case, a plurality of cameras, while arranged in the secondary parallel structure, capture video data. Indices 0 through 14 indicate the encoding orders of pictures and arrows indicate inter-view frames to be referred to.
Multi-view GOPs, except for an H.264/AVC GOP1 refer to inter-view frames as follows. The central viewpoint indicated by an index 0 has no previously encoded viewpoint at the same point of time and thus does not make an inter-view frame reference.
A viewpoint indicated by the index 1 refers to a picture (0) at the central viewpoint encoded at the same point of time. A viewpoint indicated by the index 2 refers to the picture at the central viewpoint (0) and a picture at its left viewpoint (1). A viewpoint indicated by the index 3 refers to the picture at the central viewpoint (0) and pictures at its top (2) and left (1 ) viewpoints. A viewpoint indicated by the index 4 refers to the picture at the central viewpoint (0) and pictures at its upper left (1 ) and right (3) viewpoints. A left viewpoint indicated by an index 9 among viewpoints in the middle row, except for the central viewpoint 0 and its neighboring viewpoints 1 and 3, refers to pictures at its upper right viewpoint (5), its right viewpoint (1 ), and its lower right viewpoint (7). A right viewpoint indicated by 10 refers to pictures at its upper left viewpoint (6), its left viewpoint (3), and its lower left viewpoint (8).
Each of upper left viewpoints 5 and 11 with respect to the central viewpoint (0) refers to pictures at its right viewpoint (2 or 5), its lower right viewpoint (0 or 1 ), and its lower viewpoint (1 or 9). Each of upper right viewpoints 6 and 12 with respect to the central viewpoint 0 refers to pictures at its left viewpoint, its lower left viewpoint, and its lower viewpoint. Each of lower left viewpoints 7 and 13 with respect to the central viewpoint 0 refers to pictures at its right viewpoint, its upper right viewpoint, and its upper viewpoint. Each of lower right viewpoints 8 and 14 with respect to the central viewpoint 0 refers to pictures at its left viewpoint, its lower left viewpoint, and its lower viewpoint.
FIG. 8 is a flowchart of an inter-view frame reference method for encoding multi- view video data according to an exemplary embodiment of the present invention.
The inter-view frame reference method arranges a multi-view GOP for same-view referencing and cross-view referencing and then refers to a picture at a spatially and temporally adjacent viewpoint among previously encoded pictures at viewpoints.
A multi-view GOP used for cross-view reference is arranged in operation S810 so that the multi-view GOP includes GOPs, wherein each of the GOPs corresponds to each of at least one viewpoint and each of the GOPs includes an IDR picture at the initial point of time of encoding and coding structures of each GOP are the same as one another in the multi-view GOP. In operation S820, in each of the GOPs, pictures are referred in spatially closer viewpoints sequence among previously encoded pictures of different viewpoints, except for a basic GOP that makes only a temporal frame reference at its viewpoint, the IDR picture, and pictures at the last point of time.
In operation S830, a picture at the same point of time is referred to for multi-view video encoding. In other words, a temporally past or future frame at the same viewpoint is referred to. Same-view referencing and cross-view referencing may be performed simultaneously or in a different order.
FIGS. 9A and 9B illustrate test environments for testing the effect of an inter-view frame reference method according to an exemplary embodiment of the present invention.
For a test data set of FIG. 9A, spatial resolutions, temporal resolutions, camera arrangement, and bit-rates for multi-view sequences are given as shown in FIGS. 9A and 9B.
FIGS. 1OA through 1OG illustrate rate-distortion (RD) curves based on the test environments illustrated in FIGS. 9A and 9B
In FIGS. 1OA through 1 OG, RD curves with respect to multi-view test sequences set in the environments of FIGS. 9A and 9B are shown. It can be seen from FIGS. 1OA through 1OG that a Peak Signal to Noise Ratio (PSNR) of 1.2 to 2dB can be obtained for all bitrates of all the multi-view test sequences.
The present invention can also be embodied as a computer-readable code on a computer-readable recording medium. The computer-readable recording medium is any data storage device that can store data which can be thereafter read by a computer system.
Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (transmission over the Internet). The
computer-readable recording medium can also be distributed over network coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion. Also, function programs, codes, and code segments for implementing the present invention can be easily construed by those skilled in the art.
The present invention has been particularly shown and described with reference to an exemplary embodiment thereof. Terms used herein are only intended to describe the present invention and are not intended to limit any meaning or the scope of the present invention claimed in the claims.
Therefore, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present invention as defined by the following claims. Accordingly, the disclosed embodiments should be considered in a description sense not in a restrictive sense. The scope of the present invention will be defined by the appended claims, and differences within the scope should be construed to be included in the present invention.
Claims
1. An inter-view frame reference apparatus for encoding multi-view video data, the inter-view frame reference apparatus comprising: a multi-view Group Of Pictures (GOP) arrangement unit arranging a multi-view GOP so that the multi-view GOP includes GOPs, wherein each of GOPs corresponds to each of at least one viewpoint and coding structures of each GOP are the same as one another; and a cross-view reference unit referring to pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs.
2. The inter-view frame reference apparatus of claim 1 , further comprising: a same-view reference unit referring to previously encoded pictures of same viewpoint.
3. The inter-view frame reference apparatus of claim 1 , wherein the multi- view GOP includes a basic GOP and the basic GOP is an H.264/AVC GOP.
4. The inter-view frame reference apparatus of claim 1 , wherein encoding is performed in the order of an Intra-predictive (I) picture, a Predictive (P) picture, a reference Bidirectional (rB) picture, and then a Bidirectional (B) picture in the GOP of each viewpoint.
5. The inter-view frame reference apparatus of claim 1 , wherein when a P picture to be encoded at an ith viewpoint and at a point of time t is b(i, t), the cross-view reference unit refers to at least one of b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current P picture b(i, t) and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t.
6. The inter-view frame reference apparatus of claim 1 , wherein when an rB picture to be encoded at the ith viewpoint and at the point of time t is b(i, t), the cross- view reference unit refers to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current rB picture and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture or a P picture is encoded after the point of time t.
7. The inter-view frame reference apparatus of claim 1 , wherein when a B picture to be encoded at the ith viewpoint and the point of time t is b(i, t), the cross-view reference unit refers to three out of b(i+1, t+n), b(i-1 , t+n), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current B picture and (t-n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded after the point of time t.
8. The inter-view frame reference apparatus of claim 1 , wherein when viewpoints and points of time form coordinate axes in a camera type having a secondary cross structure and a camera type having a secondary parallel structure, the cross-view reference unit preferentially encodes a viewpoint having the smallest radius from the closest point to the central point of the points of time and the viewpoints of the multi-view GOP.
9. The inter-view frame reference apparatus of claim 1 , wherein the cross- view reference unit refers to a maximum of three pictures.
10. The inter-view frame reference apparatus of claim 1 , wherein the coding structure of each of the GOPs at each of the viewpoints is I B rB B P B rB B P and rB is a B picture that can be used as a reference picture.
11. The inter-view frame reference apparatus of claim 1 , wherein the coding structure of each of the GOPs at each of the viewpoints is I rB rB rB P B rB rB P and rB is a B picture that can be used as a reference picture.
12. The inter-view frame reference apparatus of claim 1 , wherein each of the GOPs includes an Instantaneous Decoder Refresh (IDR) picture at the initial point of time of coding.
13. The inter-view frame reference apparatus of claim 1 , wherein the cross- view reference unit refers to pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs, except for a basic GOP that makes only a temporal frame reference at its viewpoint, an Instantaneous Decoder Refresh (IDR) picture, and pictures at the last point of time of coding.
14. An inter-view frame reference method for encoding multi-view video data, the inter-view frame reference method comprising: a multi-view Group Of Pictures (GOP) arrangement operation of arranging a multi-view GOP so that the multi-view GOP includes GOPs, wherein each of GOPs corresponds to each of at least one viewpoint and coding structures of each GOP are the same as one another; and a cross-view reference operation of referring to pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs.
15. The inter-view frame reference method of claim 14, further comprising a same-view reference operation of referring to previously encoded pictures of same viewpoint.
16. The inter-view frame reference method of claim 14, wherein the multi- view GOP includes a basic GOP and the basic GOP is an H.264/AVC GOP.
17. The inter-view frame reference method of claim 14, wherein encoding is performed in the order of an Intra-predictive (I) picture, a Predictive (P) picture, a reference Bidirectional (rB) picture, and then a Bidirectional (B) picture in the GOP of each viewpoint.
18. The inter-view frame reference method of claim 14, wherein when a P picture to be encoded at an ith viewpoint and at a point of time t is b(i, t), the cross-view reference operation comprises referring to at least one of b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current P picture b(i, t) and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t.
19. The inter-view frame reference method of claim 14, wherein when an rB picture to be encoded at the ith viewpoint and at the point of time t is b(i, t), the cross- view reference operation comprises referring to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current rB picture and (t-n) indicates the closest point of time at which an I picture or a P picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture or a P picture is encoded after the point of time t.
20. The inter-view frame reference method of claim 14, wherein when a B picture to be encoded at the ith viewpoint and at the point of time t is b(i, t), the cross- view reference operation comprises referring to three out of b(i+1 , t+n), b(i-1 , t+n), b(i-1 , t-n), and b(i+1 , t-n) so as to encode the current B picture and (t-n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded prior to the point of time t and (t+n) indicates the closest point of time at which an I picture, a P picture, or an rB picture is encoded after the point of time t.
21. The inter-view frame reference method of claim 14, wherein when viewpoints and points of time form coordinate axes in a camera type having a secondary cross structure and a camera type having a secondary parallel structure, the cross-view reference operation comprises preferentially encoding a viewpoint having the smallest radius from the closest point to the central point of the points of time and the viewpoints of the multi-view GOP.
22. The inter-view frame reference method of claim 14, wherein the cross- view reference operation comprises referring to a maximum of three pictures.
23. The inter-view frame reference method of claim 14, wherein the coding structure of each of the GOPs at each of the viewpoints is I B rB B P B rB B P and rB is a B picture that can be used as a reference picture.
24. The inter-view frame reference method of claim 14, wherein the coding structure of each of the GOPs at each of the viewpoints is I rB rB rB P B rB rB P and rB is a B picture that can be used as a reference picture.
25. The inter-view frame reference method of claim 14, wherein each of the GOPs includes an Instantaneous Decoder Refresh (IDR) picture at the initial point of time of coding.
26. The inter-view frame reference method of claim 14, wherein the cross- view reference operation comprises referring to pictures in the sequence of spatially closer viewpoints among previously encoded pictures of different viewpoints, in each of the GOPs, except for a basic GOP that makes only a temporal frame reference at its viewpoint, an Instantaneous Decoder Refresh (IDR) picture, and pictures at the last point of time of coding.
27. A computer-readable medium having recorded thereon a program for implementing the inter-view frame reference method in any one of claims 14 through 26.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP07708435.8A EP1972141A4 (en) | 2006-01-07 | 2007-01-05 | Method and apparatus for inter-viewing reference in multi-viewpoint video coding |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20060002041 | 2006-01-07 | ||
KR10-2006-0002041 | 2006-01-07 | ||
KR1020070001621A KR20070074495A (en) | 2006-01-07 | 2007-01-05 | Method and apparatus for inter-viewing reference in multi-viewpoint video coding |
KR10-2007-0001621 | 2007-01-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007081117A1 true WO2007081117A1 (en) | 2007-07-19 |
Family
ID=38508619
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2007/000083 WO2007081117A1 (en) | 2006-01-07 | 2007-01-05 | Method and apparatus for inter-viewing reference in multi-viewpoint video coding |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP1972141A4 (en) |
KR (1) | KR20070074495A (en) |
WO (1) | WO2007081117A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014047781A1 (en) * | 2012-09-25 | 2014-04-03 | Mediatek Singapore Pte. Ltd. | Methods for inter-view residual prediction |
WO2014198222A1 (en) * | 2013-06-11 | 2014-12-18 | Mediatek Inc. | Method of inter-view residual prediction with reduced complexity in three-dimensional video coding |
GB2556319A (en) * | 2016-07-14 | 2018-05-30 | Nokia Technologies Oy | Method for temporal inter-view prediction and technical equipment for the same |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10477247B2 (en) | 2015-11-18 | 2019-11-12 | Electronics And Telecommunications Research Institute | Method for decoding video using in-loop filter, and device therefor |
US10701396B2 (en) | 2015-11-23 | 2020-06-30 | Electronics And Telecommunications Research Institute | Multi-viewpoint video encoding/decoding method |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030202592A1 (en) * | 2002-04-20 | 2003-10-30 | Sohn Kwang Hoon | Apparatus for encoding a multi-view moving picture |
WO2006062377A1 (en) * | 2004-12-10 | 2006-06-15 | Electronics And Telecommunications Research Institute | Apparatus for universal coding for multi-view video |
WO2006075844A1 (en) * | 2005-01-14 | 2006-07-20 | Humax Co., Ltd. | Reference frame ordering for multi-view coding |
WO2006110007A1 (en) * | 2005-04-13 | 2006-10-19 | Industry-Academic Cooperation Foundation, Yonsei University | Method for coding in multiview video coding/decoding system |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1273180B1 (en) * | 2000-03-24 | 2006-02-22 | Reality Commerce Corporation | Method and apparatus for parallel multi-viewpoint video capturing and compression |
KR100987775B1 (en) * | 2004-01-20 | 2010-10-13 | 삼성전자주식회사 | 3 Dimensional coding method of video |
KR100679740B1 (en) * | 2004-06-25 | 2007-02-07 | 학교법인연세대학교 | Method for Coding/Decoding for Multiview Sequence where View Selection is Possible |
-
2007
- 2007-01-05 EP EP07708435.8A patent/EP1972141A4/en not_active Withdrawn
- 2007-01-05 WO PCT/KR2007/000083 patent/WO2007081117A1/en active Application Filing
- 2007-01-05 KR KR1020070001621A patent/KR20070074495A/en not_active Application Discontinuation
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030202592A1 (en) * | 2002-04-20 | 2003-10-30 | Sohn Kwang Hoon | Apparatus for encoding a multi-view moving picture |
WO2006062377A1 (en) * | 2004-12-10 | 2006-06-15 | Electronics And Telecommunications Research Institute | Apparatus for universal coding for multi-view video |
WO2006075844A1 (en) * | 2005-01-14 | 2006-07-20 | Humax Co., Ltd. | Reference frame ordering for multi-view coding |
WO2006110007A1 (en) * | 2005-04-13 | 2006-10-19 | Industry-Academic Cooperation Foundation, Yonsei University | Method for coding in multiview video coding/decoding system |
Non-Patent Citations (1)
Title |
---|
See also references of EP1972141A4 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014047781A1 (en) * | 2012-09-25 | 2014-04-03 | Mediatek Singapore Pte. Ltd. | Methods for inter-view residual prediction |
WO2014198222A1 (en) * | 2013-06-11 | 2014-12-18 | Mediatek Inc. | Method of inter-view residual prediction with reduced complexity in three-dimensional video coding |
US9883205B2 (en) | 2013-06-11 | 2018-01-30 | Hfi Innovation Inc. | Method of inter-view residual prediction with reduced complexity in three-dimensional video coding |
GB2556319A (en) * | 2016-07-14 | 2018-05-30 | Nokia Technologies Oy | Method for temporal inter-view prediction and technical equipment for the same |
Also Published As
Publication number | Publication date |
---|---|
EP1972141A1 (en) | 2008-09-24 |
KR20070074495A (en) | 2007-07-12 |
EP1972141A4 (en) | 2017-04-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8520961B2 (en) | Method and apparatus for encoding and decoding multi-view image | |
US8175396B2 (en) | Method and apparatus for encoding and decoding multi-view images based on global disparity vector | |
KR100779875B1 (en) | Method for setting reference frame order for multi-view coding and computer readable medium storing thereof | |
JP4663792B2 (en) | Apparatus and method for encoding and decoding multi-view video | |
US8155461B2 (en) | Methods and apparatuses for encoding and decoding multi-view image | |
US20060013490A1 (en) | 3D video coding using sup-sequences | |
KR20060104671A (en) | Apparatus for encoding or generation of multi-view video by using a camera parameter, and a method thereof, and a recording medium having a program to implement thereof | |
EP2923491A1 (en) | Method and apparatus for bi-prediction of illumination compensation | |
WO2007081117A1 (en) | Method and apparatus for inter-viewing reference in multi-viewpoint video coding | |
US8848794B2 (en) | Method and device for video coding and decoding | |
KR100972222B1 (en) | Video encoding method and device, video decoding method and device, and recording medium containing the programs thereof | |
KR101528269B1 (en) | A method for playing a moving picture | |
KR102271118B1 (en) | Apparatus and method for recording video data | |
US20100290525A1 (en) | Method and apparatus for encoding and decoding at least one image frame that is artificially inserted into image sequence | |
WO2008156244A1 (en) | Method and apparatus for encoding/decoding image by partitioning image | |
KR101357755B1 (en) | Apparatus for encoding or generation of multi-view video by using a camera parameter, and a method thereof, and a recording medium having a program to implement thereof | |
KR101313223B1 (en) | Apparatus for encoding or generation of multi-view video by using a camera parameter, and a method thereof, and a recording medium having a program to implement thereof | |
KR101407719B1 (en) | Multi-view image coding method and apparatus using variable GOP prediction structure, multi-view image decoding apparatus and recording medium storing program for performing the method thereof | |
US20070064790A1 (en) | Apparatus and method for video encoding/decoding and recording medium having recorded thereon program for the method | |
KR101313224B1 (en) | Apparatus for encoding or generation of multi-view video by using a camera parameter, and a method thereof, and a recording medium having a program to implement thereof | |
JP2008177834A (en) | Coding device, and program | |
KR20070033239A (en) | Apparatus and method for encoding and decoding multiview video | |
KR20120031494A (en) | Apparatus for encoding or generation of multi-view video by using a camera parameter, and a method thereof, and a recording medium having a program to implement thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007708435 Country of ref document: EP |