WO2007010690A1 - 画像符号化装置、画像復号装置、および画像符号化方法、画像復号方法、画像符号化プログラム、画像復号プログラム、ならびに画像符号化プログラムを記録したコンピュータ読み取り可能な記録媒体、画像復号プログラムを記録したコンピュータ読み取り可能な記録媒体 - Google Patents
画像符号化装置、画像復号装置、および画像符号化方法、画像復号方法、画像符号化プログラム、画像復号プログラム、ならびに画像符号化プログラムを記録したコンピュータ読み取り可能な記録媒体、画像復号プログラムを記録したコンピュータ読み取り可能な記録媒体 Download PDFInfo
- Publication number
- WO2007010690A1 WO2007010690A1 PCT/JP2006/312159 JP2006312159W WO2007010690A1 WO 2007010690 A1 WO2007010690 A1 WO 2007010690A1 JP 2006312159 W JP2006312159 W JP 2006312159W WO 2007010690 A1 WO2007010690 A1 WO 2007010690A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- prediction
- image
- prediction mode
- decoding
- information
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 381
- 239000013598 vector Substances 0.000 claims description 323
- 238000012545 processing Methods 0.000 claims description 190
- 230000008569 process Effects 0.000 claims description 160
- 239000000284 extract Substances 0.000 claims description 10
- 230000006835 compression Effects 0.000 claims description 9
- 238000007906 compression Methods 0.000 claims description 9
- 238000011156 evaluation Methods 0.000 claims description 6
- 238000001454 recorded image Methods 0.000 claims 1
- 238000006243 chemical reaction Methods 0.000 description 163
- 238000013139 quantization Methods 0.000 description 68
- 238000010586 diagram Methods 0.000 description 58
- 239000000872 buffer Substances 0.000 description 53
- 230000015654 memory Effects 0.000 description 51
- 230000005540 biological transmission Effects 0.000 description 36
- 230000000875 corresponding effect Effects 0.000 description 28
- 230000000694 effects Effects 0.000 description 15
- 230000003044 adaptive effect Effects 0.000 description 14
- 230000006870 function Effects 0.000 description 10
- 238000009826 distribution Methods 0.000 description 9
- 230000011664 signaling Effects 0.000 description 9
- 230000009466 transformation Effects 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 7
- 238000004364 calculation method Methods 0.000 description 7
- 238000009825 accumulation Methods 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 3
- 238000006731 degradation reaction Methods 0.000 description 3
- 238000006073 displacement reaction Methods 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000003672 processing method Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 102100037812 Medium-wave-sensitive opsin 1 Human genes 0.000 description 1
- 241000282376 Panthera tigris Species 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/004—Predictors, e.g. intraframe, interframe coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/109—Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/11—Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/186—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
- H04N19/19—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding using optimisation based on Lagrange multipliers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/573—Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
Definitions
- Image encoding device image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer-readable recording medium storing image encoding program, and image decoding program
- the present invention relates to a digital image signal encoding device, a digital image signal decoding device, a digital image signal encoding method, and a digital image signal decoding method used for image compression encoding technology, compressed image data transmission technology, etc. About.
- 0 means that color moving image signals such as RGB are converted into a luminance component (Y) and two color difference components (Cb, Cr), and the number of samples of the color difference component is reduced to half of the luminance component both horizontally and vertically. Reduced format. Since the color difference component is less visible than the luminance component, the conventional international standard video coding system performs code sampling by down-sampling the color difference component before coding in this way. It was premised on reducing the amount of original information.
- a format with exactly the same number of samples for the luminance component and the color difference component is called a 4: 4: 4 format.
- AVC MPEG-4 AVC
- 4,4: 4: “4 Profile” has been formulated.
- the conventional 4: 2: 0 format is limited to the color space definition of Y, Cb, and Cr because it assumes the downsampling of color difference components, whereas the 4: 4: 4 format.
- Non-Patent Document 1 MPEG-4 AVC (ISO / IEC 14496-10) / ITU-T H.264 standard
- the corresponding color difference components are Cb and Cr in the macroblock region with a luminance component of 16x16 pixels. It becomes 8x8 pixel block.
- the intra macro block code ⁇ uses spatial prediction (intra prediction) using neighboring sample values in the same picture. The mode is supposed to be used.
- the luminance component has the highest prediction efficiency among the nine types shown in Fig. 3, and the chrominance component is the same for Cb and Cr, and the four types shown in Fig. 9 with the highest prediction efficiency are selected as the intra prediction modes ( Cb and Cr cannot use different prediction modes).
- one component is regarded as a luminance component, and only one component of information is multiplexed and common to all three components.
- Motion compensation prediction is performed using the inter prediction mode, reference image information, and motion vector information.
- the optimal prediction method is not necessarily.
- the present invention provides optimality in encoding a moving image signal without distinguishing the sample ratio between color components such as the 4: 4: 4 format. It is an object of the present invention to provide an improved encoding device, decoding device, encoding method, decoding method, a program for executing these, and a recording medium on which these programs are recorded.
- An image coding apparatus generates a prediction image corresponding to a plurality of prediction modes indicating a prediction image generation method, and outputs the prediction image generation unit power.
- a prediction mode determination unit that evaluates the prediction efficiency of the prediction image and determines a predetermined prediction mode; and a code mode unit that outputs a variable length code as an output of the prediction mode determination unit, and the prediction mode determination The unit determines whether to use a common prediction mode for each color component constituting the input image signal or to use a separate prediction mode for each color component based on a predetermined control signal, When the common prediction mode is used, the common prediction mode information is multiplexed into the bit stream, and when the common prediction mode is not used, the prediction mode for each color component is multiplexed. Affection The information is multiplexed into a bitstream.
- the image decoding device According to the image encoding device, the image decoding device, the image encoding method, the image decoding method of the present invention, the program for executing these, and the recording medium on which these programs are recorded, Y, Cb, Cr, etc.
- intra prediction mode information and inter prediction mode information used for each color component can be flexibly selected. It is possible to perform optimum code processing even when the color space definition is various.
- FIG. 1 is an explanatory diagram showing the configuration of a video encoding device according to Embodiment 1.
- FIG. 2 is an explanatory diagram showing the configuration of the video decoding apparatus in the first embodiment
- FIG. 3 Prediction image generation method in intra 4 ⁇ 4 prediction mode evaluated by the spatial prediction unit 2 in FIG. Explanatory drawing explaining
- FIG. 4 is an explanatory diagram for explaining a prediction image generation method in the intra 16 ⁇ 16 prediction mode evaluated by the spatial prediction unit 2 in FIG.
- FIG. 13 is an explanatory diagram for explaining a prediction image generation method in the intra 8 ⁇ 8 prediction mode evaluated by the spatial prediction unit 2 in FIG. 11.
- FIG. 15 is an explanatory diagram showing a data arrangement of a video bit stream output from the video encoding device in Embodiment 2.
- FIG. 16 is an explanatory diagram showing another data arrangement of the video bit stream output from the video encoding device in the second embodiment
- FIG. 23 is a flowchart showing the flow of intra prediction mode decoding processing in the third embodiment.
- FIG. 28 is an explanatory diagram showing a binary sequence configuration of CurrlntraPredMode in Embodiment 6.
- FIG. 29 is an explanatory diagram showing another binary sequence configuration of CurrlntraPredMode in the sixth embodiment.
- ⁇ 30] is an explanatory diagram showing the configuration of the video encoding device in the seventh embodiment.
- FIG. 33 is a flowchart showing the flow of inter prediction mode determination processing in the seventh embodiment.
- FIG. 34 is an explanatory diagram showing a data arrangement of a video stream output from the video encoding device in Embodiment 7.
- FIG. 35 is a flowchart showing the flow of processing performed by the variable length decoding unit 25 in the seventh embodiment.
- the video code in the seventh embodiment is also different from the output video stream.
- FIG. 38 is a flowchart showing the flow of inter prediction mode determination processing in the eighth embodiment.
- FIG. 39 is an explanatory diagram showing the bit stream data arrangement at the macroblock level in the eighth embodiment.
- FIG. 40 is a flowchart showing the flow of inter prediction image generation processing in the eighth embodiment.
- FIG. 41 is an explanatory diagram showing another data arrangement of the bit stream at the macroblock level in the eighth embodiment.
- FIG. 42 is an explanatory diagram showing another data arrangement of the bit stream at the macroblock level in the eighth embodiment.
- FIG. 43 is a flowchart showing the flow of inter prediction mode determination processing in the ninth embodiment.
- FIG. 44 is a flowchart showing the flow of inter prediction image generation processing in the ninth embodiment.
- FIG. 45 is an explanatory diagram showing the configuration of the motion vector encoding unit
- FIG. 47 is an explanatory diagram showing the configuration of the motion vector decoding unit
- FIG. 48 is an explanatory diagram showing the state of the bitstream syntax.
- FIG. 49 is an explanatory diagram showing the configuration of the macroblock encoded data in the eleventh embodiment.
- FIG. 50 is an explanatory diagram showing the detailed configuration of the code key data of the Cn component header information in FIG. 49 in the eleventh embodiment.
- FIG. 56 is an explanatory diagram showing a detailed flow of the process of step S 162 in FIG. 55 in Embodiment 12.
- FIG. 58 is an explanatory diagram showing an example of a context model relating to a motion vector of a macroblock.
- FIG. 60 is a flowchart showing the flow of arithmetic decoding processing in the variable-length decoding unit 25 in the twelfth embodiment.
- FIG. 61 is an explanatory diagram showing a context model l lf in Embodiment 12.
- FIG. 62 An explanatory diagram showing the difference in mode of the current macroblock in the embodiment 12.
- ⁇ 63 An explanatory diagram showing the configuration of the decoding device in the thirteenth embodiment.
- ⁇ 64 An image in the thirteenth embodiment. Explanatory drawing which shows the structure of a code
- FIG. 70 is an explanatory diagram showing a bit stream configuration of slice data in each of common code key processing and independent code key processing
- FIG. 75 is an explanatory diagram showing a schematic configuration of the decoding apparatus according to the fourteenth embodiment.
- FIG. 76 is an explanatory diagram showing the internal configuration of the first picture decoding unit
- FIG. 77 is an explanatory diagram showing the internal configuration of the second picture decoding unit
- FIG. 78 is an explanatory diagram showing the internal configuration of the first picture code part that has undergone color space conversion processing.
- FIG. 79 shows the internal configuration of the first picture code part that has undergone color space conversion processing.
- Explanatory diagram [FIG. 80] An explanatory diagram showing the internal configuration of the first picture code key unit subjected to the reverse color space conversion process.
- FIG. 81 The first picture code key unit subjected to the reverse color space conversion process.
- Explanatory diagram showing internal structure [FIG. 82] Explanatory diagram showing the structure of code block data of macroblock header information included in a conventional YUV4: 2: 0 format bit stream
- FIG. 83 is an explanatory diagram showing the internal configuration of the prediction unit 461 of the first picture decoding unit that ensures compatibility with a conventional YUV4: 2: 0 format bitstream.
- FIG. 84 is an explanatory diagram showing the structure of the bit stream of encoded data to be multiplexed in the fifteenth embodiment
- FIG. 85 is an explanatory diagram showing picture code type information when picture data in an access unit starting with an AUD NAL unit is encoded.
- FIG. 86 is an explanatory diagram showing the structure of the bit stream of code key data to be multiplexed in the fifteenth embodiment
- an encoding device that performs code coding that is closed in a frame in units of equally dividing a video frame input in a 4: 4: 4 format into a rectangular area (macroblock) of 16 ⁇ 16 pixels And a corresponding decoding apparatus will be described.
- this encoding device and decoding device are based on the coding method adopted in the MPEG-4 AVC (ISO / IEC 14496-10) / ITU-T H.264 standard, which is Non-Patent Document 1.
- FIG. 1 shows the configuration of the video coding apparatus according to the first embodiment
- FIG. 2 shows the first embodiment.
- the input video signal 1 is input in individual video frames in a 4: 4: 4 format.
- the input video frame is input to the encoding unit in units of macroblocks in which the three color components are divided into 16 pixels X 16 pixels blocks of the same size. .
- Intra prediction processing is performed for each color component in units of the macroblock using the locally decoded image 15 stored in the memory 16.
- Three memories are prepared for each color component (this embodiment will be described as three faces, but may be changed as appropriate depending on the design).
- Intra prediction modes include intra 4x4 prediction mode in which spatial prediction is performed using neighboring pixels in units of 4 pixels x 4 lines shown in Fig. 3, and 16 pixels x 16 lines macroblock shown in Fig. 4. There is an intra 16x16 prediction mode that performs spatial prediction using neighboring pixels in units of.
- Luminance signal in a macro block A 16 X 16 pixel block is divided into 6 blocks consisting of 4 X 4 pixel blocks, and the 9 modes shown in Fig. 3 are selected in units of 4 X 4 pixel blocks. To do.
- the pixels in the surrounding blocks (upper left, upper, upper right, left) that have already been encoded and have been locally decoded and stored in the memory 16 are used for predictive image generation.
- Intra4x4_pred_mode 0: Adjacent upper pixels are used as they are as predicted images.
- Intra4x4_pred_mode 4: Predicted image by calculating the weighted average every 2 to 3 pixels from adjacent pixels Use as an image (corresponds to 45 ° left edge).
- Intra4x4_pred_mode 5: A weighted average is obtained every 2 to 3 pixels from adjacent pixels and used as the predicted image (corresponding to the left 22.5 degree edge).
- Intra4x4_pred_mode 6: A weighted average is calculated every 2 to 3 pixels from adjacent pixels and used as a predicted image (corresponding to the left 67.5 degree edge).
- Intra4x4_pred_mode 7: A weighted average is obtained every 2 to 3 pixels from adjacent pixels and used as the predicted image (corresponding to the right 22.5 degree edge).
- Intra4x4_pred_mode 8: A weighted average is obtained every 2 to 3 pixels from adjacent pixels and used as the predicted image (corresponding to the left 112.5 degree edge).
- This mode predicts a 16 ⁇ 16 pixel block corresponding to the macroblock size at a time, and selects one of the four modes shown in FIG. 4 for each macroblock. Similar to the intra 4x4 prediction mode, the surrounding macroblock (upper left, upper, left) pixels that have already been encoded and have been locally decoded and stored in the memory 16 are used for prediction image generation.
- Intral6xl6_pred_mode 0: Use the bottom 16 pixels of the upper macroblock as the predicted image.
- Intral6xl6_pred_mode 1: The 16 pixels on the rightmost side of the left macroblock are used as the prediction image.
- Intral6xl6_pred_mode 2: The average value of the total of 32 pixels, 16 pixels at the bottom of the upper macroblock (A part in Fig. 4) and 16 pixels at the left side of the left macroblock (B part in Fig. 4) is used as the predicted image. use.
- Intral6xl6— pred— mode 3: Lower right corner pixel of the upper left macroblock, lowermost 15 pixels of the upper macroblock (excluding white pixels), rightmost 15 pixels of the left macroblock (white pixels) Predetermined arithmetic processing using the total of 31 pixels (excluding A predicted image is obtained by weighted addition processing according to the pixel position to be predicted).
- the video coding apparatus is characterized in that the intra prediction processing method for the three color components is switched based on the intra prediction mode common identification flag 23. This point is described in detail in 2 below.
- the prediction differential signal 4 is evaluated for prediction efficiency by the encoding mode determination unit 5 and, from the prediction processing executed by the spatial prediction unit 2, a prediction mode for obtaining the optimal prediction efficiency for the macroblock to be predicted is selected.
- the code key mode 6 is used per prediction unit region together with information for determining whether to use the intra 4x4 prediction mode or the intra 16x16 prediction mode (equivalent to the intra code key mode in FIG. 6).
- Individual prediction mode information (Intra 4x4_pred_mode or Intral6xl6_pred_mode) is also included.
- the prediction unit area corresponds to a 4x4 pixel block in the intra 4x4 prediction mode and a 16x16 pixel block in the intra 16x16 prediction mode.
- a weighting factor 20 for each code key mode determined by the determination of the coding control unit 19 may be added.
- the optimal prediction difference signal 4 obtained by using the code key mode 6 in the code key mode determination unit 5 is output to the orthogonal transform unit 8.
- the orthogonal transform unit 8 transforms the input prediction difference signal 4 and outputs it to the quantization unit 9 as an orthogonal transform coefficient.
- the quantization unit 9 quantizes the input orthogonal transform coefficient based on the quantization parameter 21 determined by the code control unit 19 and outputs the quantized transform coefficient 10 to the variable length code unit 11. .
- the quantized transform coefficient 10 is entropy-encoded by a variable-length code key unit 11 by means such as a Huffman code or an arithmetic code key. Also, the quantized transform coefficient 10 is restored to the local decoded prediction difference signal 14 via the inverse quantization unit 12 and the inverse orthogonal transform unit 13, and the predicted image 7 and the adder 18 generated based on the encoding mode 6.
- the locally decoded image 15 is generated by adding at.
- the locally decoded image 15 is stored in the memory 16 for use in subsequent intra prediction processing.
- the variable length coding unit 11 is also input with a deblocking filter control flag 24 indicating whether or not to apply a deblocking filter to the macroblock (in the prediction process performed by the spatial prediction unit 2). Is the pixel data before the deblocking filter is applied Is stored in the memory 16 and used, so the deblocking filter process itself is not necessary for the encoding process, but the decoding device side uses the instruction of the deblocking filter control flag 24
- the intra-prediction mode common identification flag 23, the quantized transform coefficient 10, the coding mode 6, and the quantization parameter 21 input to the variable-length code unit 11 are bitstreams according to a predetermined rule (syntax). As an array and is output to the transmission buffer 17.
- the bit stream is smoothed according to the bandwidth of the transmission path to which the encoding device is connected and the reading speed of the recording medium, and output as a video stream 22.
- feedback information is output to the encoding control unit 19 in accordance with the bit stream accumulation state in the transmission buffer 17, and the generated code amount in the subsequent video frame code is controlled.
- the intra-prediction mode determination process that is a feature of the coding apparatus according to the first embodiment will be described in detail.
- This process is performed in units of macroblocks that combine the above three color components, and is mainly performed by the spatial prediction unit 2 and the coding mode determination unit 5 in the coding apparatus of FIG. Fig. 5 shows a flowchart showing the flow of this process.
- the image data of the three color components that make up the block are C0, Cl, and C2.
- the coding mode determination unit 5 receives the intra prediction mode common identification flag 23, and determines whether or not to use a common intra prediction mode for C0, Cl, and C2 based on the value thereof. (Step Sl in Figure 5). If you want to share, go to step S2 and after. If not, go to step S5 and later.
- the encoding mode determination unit 5 When the intra prediction mode is shared by C0, Cl, and C2, the encoding mode determination unit 5 notifies the spatial prediction unit 2 of all the intra 4x4 prediction modes that can be selected. Spatial prediction unit 2 evaluates all the prediction efficiencies and selects an optimal intra 4x4 prediction mode common to C0, Cl, and C2 (step S2). Next, the coding mode determination unit 5 notifies the spatial prediction unit 2 of all the intra 16x16 prediction modes that can be selected, and the spatial prediction unit 2 evaluates all the prediction efficiencies, and C The optimal intra 16x16 prediction mode common to C2 is selected (step S3). The sign key mode determination unit 5 finally selects a mode most suitable for prediction efficiency from the modes obtained in steps S2 and S3 (step S4), and ends the process.
- Jm Dm + Rm (: positive number) as a standard for predicting the prediction efficiency of the prediction mode performed in the spatial prediction unit 2
- Dm is a sign distortion or a prediction error amount when the intra prediction mode m is applied.
- Code distortion is a method in which an intra prediction mode m is applied to obtain a prediction error, a prediction error is converted and quantized, the resultant force image is decoded, and an error with respect to a signal before coding is measured.
- the amount of prediction error is obtained by obtaining the difference between the predicted image when intra prediction mode m is applied and the signal before sign ⁇ and quantifying the magnitude of the difference. For example, the sum of absolute differences (Sum of Absolute Distance: SAD) is used.
- Rm is the amount of generated code when intra prediction mode m is applied.
- Jm is a value that defines the trade-off between the amount of code and the degree of degradation when intra prediction mode m is applied, and intra prediction mode m that gives the minimum Jm gives the optimal solution.
- Intra code mode 28 is information that determines whether intra 4x4 power intra 16x16 or basic intra
- intra prediction mode common identification flag 23 is “common to C0, Cl, C2”, it indicates common intra prediction mode information and not “common to C0, Cl, C2”. This indicates the intra prediction mode information for CO.
- the extended intra prediction mode 30 is multiplexed only when the intra prediction mode common identification flag 23 indicates that it is not “common to C0, Cl, and C2,” and indicates intra prediction mode information for Cl and C2. Subsequently, the quantization parameter 21 and the quantized transform coefficient 10 are multiplexed.
- FIG. 1 is a general term for the intra code key mode 28 and the intra prediction mode (basic 'extension) (FIG. 6 shows the variable length code key unit 11 in FIG.
- the input deblocking filter control flag 24 is included, but it is not a necessary component for explaining the characteristics of the first embodiment, so it is omitted.
- the color space definition is fixed to Y, Cb, Cr.
- Y, Cb, Cr Various color spaces can be used without being limited to the above.
- the intra prediction mode information as shown in FIG. 6, the optimum encoding process can be performed even when the color space of the input video signal 1 has various definitions. For example, when the color space is defined in RGB, the video texture structure remains evenly in each of the R, G, and B components, so the intra prediction mode information itself can be obtained by using the common intra prediction mode information. Therefore, it is possible to increase the code efficiency.
- the optimal code efficiency can be obtained by adaptively using the extended intra prediction mode 30.
- the decoding device of FIG. 2 receives the video stream 22 according to the arrangement of FIG. 6 output from the encoding device of FIG. 1, and the three color components are the same size (4: 4: 4 format) macro. Decoding processing is performed in units of blocks to restore individual video frames.
- variable length decoding unit 25 receives the stream 22 and decodes the stream 22 according to a predetermined rule (syntax), so that the intra prediction mode common identification flag 23, the quantized transformation coefficient 10. Extract information such as coding mode 6, quantization parameter 21, and so on. Quantized Only the transform coefficient 10 is input to the inverse quantization unit 12 together with the quantization parameter 21, and the inverse quantization process is performed. Next, the output is input to the inverse orthogonal transform unit 13 and restored to the local decoded prediction difference signal 14. On the other hand, the code prediction mode 6 and the intra prediction mode common identification flag 23 are input to the spatial prediction unit 2, and a predicted image 7 is obtained according to these pieces of information. A specific procedure for obtaining the predicted image 7 will be described later.
- Local decoded prediction difference signal 14 and predicted image 7 are added by adder 18 to obtain provisional decoded image 15 (this is exactly the same signal as local decoded image 15 in the encoder).
- the provisional decoded image 15 is written back to the memory 16 for use in the subsequent intra prediction of the macroblock.
- Three memories are prepared for each color component (this embodiment will be described as three faces, but may be changed as appropriate depending on the design).
- the deblocking filter 26 is acted on the provisional decoded image 15 based on the instruction of the deblocking filter control flag 24 decoded by the variable length code key unit 25 to obtain the final decoded image 27.
- the intra-predicted image generation process that is a feature of the decoding apparatus according to the first embodiment will be described in detail.
- This processing is performed in units of macroblocks in which the above three color components are combined, and is mainly performed by the variable length decoding unit 25 and the spatial prediction unit 2 in the decoding device of FIG. Fig. 7 shows a flowchart showing the flow of this process.
- step S 10 to S 14 are performed by the variable length decoding unit 25.
- the video stream 22 that is input to the variable length decoding unit 25 is assumed to follow the data arrangement shown in FIG.
- step S10 the intra code mode 28 in the data of FIG. 6 is first decoded, and then the intra prediction mode common identification flag 23 is decoded (step Sll). Further, the basic intra prediction mode 29 is decoded (step S12).
- step S13 the result of the intra prediction mode common identification flag 23 is used to determine whether or not to share the intra prediction mode between C0, Cl, and C2. In the case of common use, all of C0, Cl, and C2 are determined.
- the basic intra prediction mode 29 is used, if it is not shared, the basic intra prediction mode 29 is used as the CO mode, and the extended intra prediction mode 30 is decoded (step S14).
- Get C2 mode information Since the sign key mode 6 of each color component is determined through the above processing steps, this is output to the spatial prediction unit 2, and the input of each color component is performed according to steps S15 to S17. A tiger prediction image is obtained.
- the process for obtaining an intra-predicted image follows the procedure shown in FIGS.
- FIG. 8 shows variations of the bit stream data array of FIG.
- the intra prediction mode common identification flag 23 is multiplexed as a flag located in an upper data layer such as a slice, picture, sequence, etc., as a macroblock level flag, and the extended intra prediction mode 30
- An extended intra prediction mode table indication flag 31 is provided so that one of a plurality of code tables defining a code word can be selected.
- the extended intra prediction mode table indication flag 31 is provided to define the prediction mode specialized for Cl and C2 components with the same definition as the basic intra prediction mode 29. It becomes possible to select, and encoding processing adapted to the definition of the color space can be performed.
- an intra prediction mode set different from the luminance (Y) is defined for the color difference components (Cb, Cr).
- the color difference signal in a macroblock is 8 pixels x 8 lines, and the decoding process is performed by selecting one of the four modes shown in Fig. 9 for each macroblock.
- the 8 X 8 block is divided into four 4 X 4 blocks.
- the processing is performed by changing the position of the pixel for which the average value is obtained.
- the block “a + x, a or x” indicates that 8 pixels a and X are available when both pixel a and pixel X are available, and if only a is available. If only 4 pixels of a and only X are available, the average value is obtained using only 4 pixels of X and used as predicted image 7. If neither a nor X is available, the value 128 is used as the predicted image 7.
- the average value is obtained by using 4 pixels of b when the image b is available and using 4 pixels of X when only the pixel X is available.
- a video frame input in 4: 4: 4 format is divided into 16 ⁇ 16 pixel rectangular areas (macroblocks), and the code is closed within the frame.
- An encoding apparatus and a corresponding decoding apparatus will be described.
- the present encoding device and decoding device are the same as those in the MPEG-4 AVC (ISO / IEC 14496-10) / ITU-T H.264 standard, which is Non-Patent Document 1. Based on the method, the features unique to the present invention are given.
- FIG. 11 shows the configuration of the video encoding device in the second embodiment
- FIG. 12 shows the configuration of the video decoding device in the second embodiment
- elements having the same numbers as those of the coding apparatus in FIG. 1 are the same elements.
- elements having the same numbers as those of the components of the reference sign apparatus in FIG. 11 are the same elements.
- 32 is a transform block size identification flag
- 33 is an intra code key mode common identification flag.
- the input video signal 1 has an individual video frame in a 4: 4: 4 format and three color components in the same size macroblock as shown in FIG. It is assumed that the code is input to the code unit in units divided and collected.
- the intra prediction mode includes an intra 4x4 prediction mode in which spatial prediction is performed using neighboring pixels in units of 4 pixel x 4 line blocks shown in Fig. 3, and an 8 pixel x 8 line block shown in Fig. 13.
- Intra 8x8 prediction mode that performs spatial prediction using neighboring pixels in units of 16 pixels
- Intra 1 6x16 prediction mode that performs spatial prediction using neighboring pixels in macroblock units of 16 pixels x 16 lines as shown in Fig. 4 .
- the intra 4x4 prediction mode and the intra 8x8 prediction mode are set according to the state of the transform block size identification flag 32.
- the intra NxN prediction coding mode (N is 4 or 4), in which coding is performed using either the intra 4x4 prediction mode or the intra 8x8 prediction mode as the intra code mode.
- Luminance signal in a macro block A 16x16 pixel block consists of 4x4 pixel blocks 1 Intra 4x4 prediction mode in which a prediction mode is selected individually for each 4x4 pixel block, and in a macro block The luminance signal of 16x16 pixel block is divided into 4 blocks composed of 8x8 pixel blocks, and the 8x8 pixel block is selected individually for each 8x8 pixel block. This is the mode to perform the trap. Switching between intra 4x4 prediction mode and intra 8x8 prediction mode is linked to the state of the conversion block size identification flag 32. This point will be described later.
- any of the nine modes shown in Fig. 3 is selected in units of 4x4 pixel blocks.
- the pixels of the surrounding blocks (upper left, upper, upper right, left) that have already been encoded and have been locally decoded and stored in the memory 16 are used for prediction image generation.
- the intra 8x8 prediction mode the! /, Shift among the nine modes shown in Fig. 13 is selected in units of 8x8 pixel blocks.
- the prediction method of the intra 4x4 prediction mode has been changed to be compatible with the 8x8 pixel block.
- Intra8x8_pred_mode 4: Predicted image by calculating the weighted average every 2 to 3 pixels from adjacent pixels Use as an image (corresponds to 45 ° left edge).
- Intra8x8_pred_mode 5: A weighted average is obtained every 2 to 3 pixels from adjacent pixels and used as the predicted image (corresponding to the left 22.5 degree edge).
- Intra8x8_pred_mode 6: A weighted average is obtained every 2 to 3 pixels from adjacent pixels and used as the predicted image (corresponding to the left 67.5 degree edge).
- Intra8x8_pred_mode 7: A weighted average is obtained every 2 to 3 pixels from adjacent pixels and used as the predicted image (corresponding to the right 22.5 degree edge).
- Intra8x8_pred_mode 8: A weighted average is obtained every 2 to 3 pixels from adjacent pixels and used as the predicted image (corresponding to the left 112.5 degree edge).
- the intra 4x4 prediction mode When the intra 4x4 prediction mode is selected, 16 pieces of mode information per macroblock are required. Therefore, in order to reduce the amount of code of the mode information itself, predictive coding is performed from the mode information of adjacent blocks using the fact that the mode information has a high correlation with adjacent blocks. Similarly, when the intra 8x8 prediction mode is selected, a prediction code is obtained from the mode information of the adjacent block by utilizing the fact that the correlation of the intra prediction mode is high between the adjacent blocks.
- This mode predicts a 16x16 pixel block corresponding to the macroblock size at a time, and selects one of the four modes shown in Fig. 4 for each macroblock. Similar to the intra 4x4 prediction mode, the surrounding macroblock (upper left, upper, left) pixels that have already been encoded and have been locally decoded and stored in the memory 16 are used to generate a predicted image.
- the mode type is as described in FIG. 4 in the first embodiment.
- the transform block size is always 4x4.
- 16 DCs (direct current component, average value) of 4x4 block units are collected first, 4x4 block conversion is performed in that unit, and the remaining AC components excluding DC components are converted for each 4x4 block. Apply conversion.
- the video coding apparatus is characterized in that it switches the intra prediction / conversion 'coding method for three color components based on the intra-code mode common identification flag 33. And This point is described in detail in 2 below.
- an intra code mode is applied to the input three color component signals. Based on the instruction of the common identification flag 33, the intra prediction mode is evaluated.
- the intra code key mode common identification flag 33 indicates whether to assign the intra code key mode to each of the three input color components individually or to assign the same intra code key mode to all three components. This is due to the following background.
- RGB can be directly used in addition to the Y, Cb, and Cr color spaces conventionally used for encoding.
- the Y, Cb, and Cr color spaces components that depend on the texture structure of the video are removed from the Cb and Cr signals.
- the optimal intra coding method changes between the Y component and the Cb and Cr2 components.
- the encoding method targeting the 4: 2: 0 format such as the high 4: 2: 0 profile, uses the Y, Cb, and Cr components for the intra prediction mode.
- the texture structure between the color components is not removed as in the Y, Cb, and Cr color spaces, and the signal in the same space is used. Since the components are highly correlated, it is possible to increase the code efficiency by configuring the intra code mode so that it can be selected in common. Even if this color space is used, it depends on the nature of the video, and it is desirable that the coding method itself can adapt to the nature of the video signal.
- the common mode identification flag 33 is provided, A 4: 4: 4 format video coding system was constructed so that flexible coding was possible.
- the intra-code key mode common identification flag set as described above
- prediction processing for each color component is executed for all intra prediction modes shown in FIG. 3, FIG. 4, and FIG.
- the prediction signal 4 is evaluated for prediction efficiency by the sign key mode determination unit 5, and intra prediction that provides the optimal prediction efficiency for the target macroblock from the prediction processing executed by the spatial prediction unit 2.
- Select a mode when intra NxN prediction is selected, intra NxN prediction code key mode is output as code key mode 6, and when the prediction mode is intra 4x4 prediction, transform block size identification flag 32 is set. Set to "Conversion with 4x4 block size". If the prediction mode is intra 8x8 prediction, set the conversion block size identification flag 32 to "Conversion with 8x8 block size”. Set.
- Various methods are conceivable for determining the transform block size identification flag 32.
- the method is determined according to the N value. For example, when the intra 4x4 prediction mode is used and the conversion block size is 8x8 pixel block, the spatial continuity of the prediction signal may be disrupted in units of 4x4 blocks in the prediction difference signal 4 obtained as a result of prediction. This increases the performance and generates unnecessary high-frequency components, reducing the effect of signal power concentration by conversion. If the conversion block size is set to 4x4 pixel block according to the prediction mode, this problem does not occur.
- the intra 16x16 prediction code key mode is output as the coding mode 6. It should be noted that in selecting the code mode 6, the weighting factor 20 force S for each coding mode determined by the coding control unit 19 may be used.
- the prediction difference signal 4 obtained by the code key mode 6 is output to the orthogonal transform unit 8.
- the orthogonal transform unit 8 transforms the input prediction difference signal and outputs it to the quantization unit 9 as an orthogonal transform coefficient.
- the quantization unit 9 quantizes the input orthogonal transform coefficient based on the quantization parameter 21 determined by the sign key control unit 19, and outputs the quantized transform coefficient 10 to the variable length coding unit 11. Output.
- the prediction difference signal 4 input to the orthogonal transform unit 8 is divided into 4 ⁇ 4 block units and orthogonally transformed, and the quantization unit 9 performs quantization.
- the transform block size is 8x8 block units
- the prediction difference signal 4 input to the orthogonal transform unit 8 is divided into 8x8 block units and orthogonally transformed, and the quantization unit 9 performs quantization.
- the quantized transform coefficient 10 is entropy-encoded by the variable-length encoding unit 11 by means such as Huffman encoding or arithmetic encoding. Also, the quantized transform coefficient 10 is restored to the local decoded prediction differential signal 14 through the inverse quantization unit 12 and the inverse orthogonal transform unit 13 with a block size based on the transform block size identification flag 32 and the like, and the code key mode 6 On the basis of the The locally decoded image 15 is generated by adding the generated predicted image 7 and the adder 18. The locally decoded image 15 is stored in the memory 16 for use in subsequent intra prediction processing.
- the variable length coding unit 11 is also input with a deblocking filter control flag 24 indicating whether or not to apply a deblocking filter to the macroblock (in the prediction process performed by the spatial prediction unit 2). Since the pixel data before being subjected to the deblocking filter is stored in the memory 16 and used, the deblocking filter processing itself is not necessary for the encoding process, but on the decoding device side, the deblocking filter control flag 24 The final decoded image is obtained by performing a debucking filter according to the instructions in (1).
- Intra coding mode common identification flag 33, quantized transform coefficient 10, coding mode 6, and quantization parameter 21 input to variable length coding unit 11 are bitstreams according to a predetermined rule (syntax). As an array and is output to the transmission buffer 17.
- the transmission buffer 17 smoothes the bit stream in accordance with the bandwidth of the transmission path to which the encoding device is connected and the reading speed of the recording medium, and outputs it as a video stream 22. Also, feedback information is output to the encoding control unit 19 in accordance with the bit stream accumulation state in the transmission buffer 17, and the generated code amount in the code frame of subsequent video frames is controlled.
- the determination process of the intra code key mode and the intra prediction mode which is a feature of the code key device of the second embodiment, will be described in detail.
- This processing is performed in units of macro blocks in which the above three color components are combined, and is mainly performed by the spatial prediction unit 2 and the code key mode determination unit 5 in the code key device of FIG. Fig. 14 shows a flowchart showing the flow of this process.
- the image data of the three color components that make up the block are C0, Cl, and C2.
- the coding mode determination unit 5 receives the intra code key mode common identification flag 33 and, based on the value, determines whether or not the common intra code key mode is used in C0, Cl, and C2. (Step S20 in FIG. 14). If you want to share, go to step S21 and after. If not, go to step S22 and later.
- the coding mode determination unit 5 instructs the spatial prediction unit 2 to select all intra prediction modes (intra NxN prediction). , (Intra 16 ⁇ 16 prediction), the spatial prediction unit 2 evaluates all the prediction efficiencies, and selects the intra code mode and intra prediction mode that are optimal for all components (step S21).
- the conversion block size identification flag 32 is set to "conversion with 4 x 4 block size".
- the conversion block size identification flag 32 is set to "conversion with 8x8 block size”.
- Jm Dm + Rm (: positive number) as a standard for predicting the prediction efficiency of the prediction mode performed in the spatial prediction unit 2
- the rate 'distortion cost given by can be used.
- Dm is a sign distortion or a prediction error amount when the intra prediction mode m is applied.
- Code distortion is obtained by applying the intra prediction mode m to obtain a prediction error, decoding the image from the result of converting and quantizing the prediction error, and measuring the error with respect to the signal before encoding.
- the prediction error amount is obtained by obtaining the difference between the prediction image when the intra prediction mode m is applied and the signal before the sign, and quantifying the magnitude of the difference. For example, the sum of absolute differences (Sum of Absolute Distance: SAD) is used.
- Rm is the amount of generated code when intra prediction mode m is applied.
- Jm is a value that regulates the trade-off between the amount of code and the degree of degradation when intra prediction mode m is applied, and intra prediction mode m that gives the smallest Jm gives the optimal solution.
- intra coding modes 0 (34a), l (34b), and 2 (34c) multiplexed in the bitstream at the macroblock level are coding modes 6 for C0, Cl, and C2, respectively.
- the intra code mode is the intra NxN prediction code mode
- the conversion block size identification flag 32 and the information of the intra prediction mode are multiplexed into the bit stream.
- the intra code mode is the intra 16x16 prediction code mode
- the intra prediction mode information is encoded as part of the intra code mode information, and the transform block size identification flag 32, intra prediction Mode information is not multiplexed into the bitstream.
- the intra code key mode common identification flag 33 indicates “common to C0, Cl, C2”
- the intra code key mode l (34b) '2 (34c), transform block size identification flag l (32b) '2 (32c) and intra prediction mode l (35b)' 2 (35c) are not multiplexed into the bitstream (the dotted circle in Fig. 15 indicates the branch).
- the intra coding mode 0 (34a), the transform block size identification flag 0 (32a), and the intra prediction mode 0 (35a) each function as coding information common to all color components.
- the intra code key mode common identification flag 33 shows an example of being multiplexed as bit stream data at a higher level than a macro block such as a slice, a picture, and a sequence.
- a macro block such as a slice, a picture, and a sequence.
- the intra code key mode common identification flag 33 is multiplexed at the sequence level. The purpose can be achieved.
- the intra coding mode common identification flag 33 is used to mean "whether all components have common power". For example, this can be done according to the color space definition of the input video signal 1. It may be used to mean “whether it is common to two specific components such as C1 and C2” (in the case of Y, Cb, Cr, etc., it is highly possible that Cb and Cr can be shared).
- the common range of the intra code mode common identification flag 33 is limited to the intra code mode only, and the intra code mode common identification flag 33 is limited to the intra code mode.
- the transform block size and the NxN prediction mode may be selected independently for each color component (FIG. 16). With the syntax configuration shown in Fig. 16, it is possible to change the prediction method for each color component while sharing the encoding mode information for complex picture images that require NxN prediction. It is possible to improve the prediction efficiency.
- the encoding device may be configured to perform the encoding by fixing the intra encoding mode common identification flag 33 to any value V, and what is a video bitstream? It can be transmitted separately.
- the decoding device in FIG. 12 receives the video stream 22 according to the arrangement in FIG. 15 output from the encoding device in FIG. 11, and the macro color of the three color components of the same size (4: 4: 4 format) is received. It is assumed that individual video frames are restored by performing decoding processing in units.
- variable length decoding unit 25 receives the stream 22 and decodes the stream 22 in accordance with a predetermined rule (syntax), so that an intra code key mode common identification flag 33, a quantized transform Extract information such as coefficient 10, encoding mode 6, quantization parameter 21, and so on.
- the quantized transform coefficient 10 is input to the inverse quantization unit 12 together with the quantization parameter 21, and the inverse quantization process is performed.
- the output is input to the inverse orthogonal transform unit 13 and restored to the local decoded prediction difference signal 14.
- code key mode 6 and intra code key mode common identification flag 33 are input to spatial prediction unit 2, and predicted image 7 is obtained according to these pieces of information. A specific procedure for obtaining the predicted image 7 will be described later.
- the intra-predicted image generation process that is a feature of the decoding apparatus according to the second embodiment will be described in detail.
- This processing is performed in units of macroblocks in which the above three color components are combined, and is mainly performed by the variable length decoding unit 25 and the spatial prediction unit 2 in the decoding device of FIG.
- a flowchart showing the flow of this process is shown in FIG.
- step S 25 to S 38 are performed by the variable length decoding unit 25.
- the video stream 22 that is the input to the variable length decoding unit 25 is assumed to follow the data arrangement shown in FIG.
- step S25 the intra coding mode 0 (34a) (C0 component correspondence) is first decoded from the data in FIG.
- the intra code mode 0 (34a) is “intra NxN prediction”
- the transform block size identification flag 0 (32a) and the intra prediction mode 0 (35a) are decoded (steps S26 and S27).
- Fig. 17 illustrates processing in units of macroblocks.
- the intra coding mode common identification flag 33 used for the determination in step S29 is variable at the layer level above the slice before entering the START process in Fig. 17. It is assumed that data is read from the bitstream 22 by the long decoding unit 25.
- step S29 in Fig. 17 If it is determined in step S29 in Fig. 17 that the intra-coding 'prediction mode information is coded for each color component, in the subsequent steps S31 to S38, intra-coding for the C1 and C2 components 'Decode prediction mode information.
- the encoding mode 6 of each color component is determined, and this is output to the spatial prediction unit 2, and an intra prediction image of each color component is obtained according to steps S39 to S41.
- the process for obtaining an intra-predicted image follows the procedure shown in FIGS. 3, 4, and 13 and is the same as the process performed by the encoder apparatus shown in FIG.
- the decoding device may be configured to perform decoding with a fixed value in advance so that the value is analyzed from the video bitstream, and transmitted separately from the video bitstream. May be.
- the color space definition is fixed to Y, Cb, Cr.
- Y, Cb Various color spaces can be used without being limited to Cr.
- optimal code processing can be performed according to the definition of the color space of the input video signal 1 and the nature of the video signal.
- Video decoding / playback processing can be performed by uniquely interpreting the bit stream obtained as a result of such code processing.
- Embodiment 3 shows another configuration example of the encoding device in FIG. 11 and the decoding device in FIG.
- the present encoding device and decoding device are based on the encoding method employed in the MPEG-4 AV C0SO / IEC 14496-10) / ITU-T H.264 standard, which is Non-Patent Document 1.
- the features unique to the present invention are given.
- the video encoding device in the third embodiment is different from the encoding device in the second embodiment described in FIG. 11 only in the variable length encoding unit 11.
- the video decoding apparatus according to the third embodiment is different from the decoding apparatus according to the second embodiment described in FIG. 12 only in the variable length decoding unit 25.
- the other operations are the same as those in the second embodiment, and only the differences will be described here.
- variable length encoding unit 11 uses the data shown on the bitstream for the information of the intra NxN prediction mode, in particular, the power that does not indicate the encoding procedure. I got it.
- a specific method of the sign key sequence is shown.
- the value correlation between the color components is used for the intra NxN prediction mode obtained for each color component. It is characterized in that the entropy code is performed.
- the bit stream array has the format shown in FIG.
- the value of the intra code key common identification flag 33 is set to share the intra code mode with C0, Cl, and C2, and the intra code key mode is set.
- Is the intra NxN prediction mode and the transform block size 0-2 is 4x4 blocks.
- all intra prediction modes 0 to 2 (35a to 35c) are set to the intra 4x4 prediction mode.
- Fig. 18 to Fig. 20 Let X be the current macroblock to be encoded. The macroblock on the left is macroblock A, and the macroblock just above is macroblock B.
- FIG. 18 to FIG. 20 are used as explanatory diagrams of the sign key sequence of each color component of C0, Cl, and C2.
- the flowchart of the procedure is shown in Figs.
- FIG. 18 shows the state of the CO component of macroblock X.
- the 4x4 block to be encoded is called block X
- the left and upper 4x4 blocks of block X are called block A and block B, respectively.
- Case 2 is a case where the 4x4 block on the left and the top of the 4x4 block to be encoded belongs to the inside of the current macroblock X, that is, the macroblock X.
- one intra 4x4 prediction mode is assigned to each 4x4 block X in macroblock X, which is called CurrlntraPredMod e.
- the intra 4x4 prediction mode for block A is IntraPredModeA
- the intra 4x4 prediction mode for block B is IntraPredModeB.
- Both IntraPredModeA and IntraPredModeB are already coded information at the time when the block X is coded.
- these parameters are assigned first (step S50 in Fig. 21).
- a prediction value predCurrlntraPredMode for the CurrlntraPredMode of the block X is determined by the following equation (step S51).
- predCurrlntraPredMode iin, IntraPredModeA, IntraPredModeB)
- CurrlntraPredMode predCurrlntraPredMode
- CurrlntraPredMode! Pred and urrlntraPredMode
- CurrlntraPredMode compare CurrlntraPredMode and predCurrlntraPredMode _b and if CurrlntraPredMode is smaller, encode CurrlntraPredMode as it is. If CurrlntraPredMode is larger! /, CurrlntraPredMode-l is encoded (step S52).
- FIG. 19 shows a CI component encoding procedure.
- neighboring encoding parameters such as IntraPredModeA and IntraPredModeB are set according to the position of the block X (step S53).
- a prediction value candidate 1 predCurrlntraPredModel for CurrlntraPredMode of block X is determined by the following equation (step S54).
- predCurrlntraPredMode 1 Min (IntraPredModeA, IntraPredModeB)
- prev_intra_pred_mode_flag 1 for the CO component
- this predCurrlntraPred Model is directly adopted as predCurrlntraPredMode in the CI component block X.
- the C1 component can also have a high correlation between neighboring image areas as in the C0 component. There is sex.
- the predicted value of the C1 component does not depend on the intra 4x4 prediction mode of the C0 component.
- the CO component [prev-intra-pred-mode-flag 0]
- ie, rem-intra-pred-mode is encoded (step S55)
- the CO component CurrlntraPredMode Is a predicted value candidate 2 (step S56). That is,
- predCurrIntraPredMode2 then urrlntraPredMode— CO
- Coding rem_intra_pred_mode with the C0 component means that the intra prediction correlation between neighboring image regions is low in the C0 component. In that case, the correlation between neighboring image regions is also expected to be low in the C1 component as well, and the intra prediction mode at the same block position in different color components may give a better prediction value.
- the predicted value of CurrlntraPredMode in block X of the C1 component is finally determined as one of predCurrl ntraPredModel force and predCurrIntraPredMode2 (step S57).
- Which value to use is an additional sign with a 1-bit flag (pred jkg).
- pred_flag is encoded only when CurrlntraPredMode matches the predicted value, and predCurrlntraPredModel is used for the predicted value when V ⁇ does not match (when rem_intra_pred_mode is encoded).
- predCurrIntraPredMode2 CurrlntraPredMode— CO;
- prev-intra-pred-mode-flag pred-flag
- rem-intra-pred-mode force code data step S58
- FIG. 20 shows a procedure for coding the C2 component.
- neighboring encoding parameters such as IntraPredModeA and IntraPredModeB are set according to the position of block X (step S59).
- a prediction value candidate 1 predCurrlntraPredModel for CurrlntraPredMode of block X is determined by the following equation (step S60).
- the C2 component is the same as the C0 and C1 components. Correlation between image regions may be high. Therefore, it is judged that the predicted value of the C2 component does not depend on the intra 4x4 prediction mode of the C0 and C1 components.
- pre dCurrIntraPredMode2 CurrlntraPredMode— Cl;
- predCurrIntraPredMode2 CurrlntraPredMode— Cl;
- the background of setting this as a predicted value candidate is as follows. If rem_intra_predjnode is encoded with C0 or C1 component, this means that the correlation of intra prediction between neighboring image regions is low with respect to C0 or C1 component. In that case, the correlation between neighboring image regions is also expected to be low in the C2 component, and the intra prediction mode at the same block position in different color components may give a better prediction value. Also, according to this idea, when remjntra_pred_mode is encoded for both C0 and Cl components, the current intra prediction modes for both C0 and Cl can be candidates for prediction values. The mode is adopted as a predicted value.
- C0 is likely to be treated as luminance
- C1 / C2 is likely to be treated as color difference.
- C1 is considered to be closer to C2's prediction mode than C0. Because it is. RGB color space
- the choice of CO or C1 is not a big factor, and it is generally considered appropriate to use the C1 component for the predicted value (by design, the C2 component is used for the predicted value). You may).
- CurrlntraPredMode in block C of the C2 component is finally determined as one of predCurrl ntraPredModel force and predCurrIntraPredMode2 (step S63). Which value is used is additionally encoded with a 1-bit flag (predjlag).
- predCurrIntraPredMode2 CurrlntraPredMode— CO;
- predCurrIntraPredMode2 CurrlntraPredMode— CI;
- prev— intra— pred— mode— flag 0;
- prev-intra-pred-mode-flag, pred-flag, and rem-intra-pred-mode force are encoded as S code data (step S64).
- the above encoding procedure can be similarly defined for the intra 8x8 prediction mode.
- the correlation with the prediction mode selected in the other color components can be used, and the code amount of the prediction mode itself can be reduced. Therefore, it is possible to improve the code efficiency.
- Fig. 21 The difference between Fig. 21 and Fig. 22 is whether the encoding process in the intra prediction mode per MB is performed separately for each color component or collectively.
- the sign of each color component is performed in units of 4 ⁇ 4 blocks, and a collection of 16 patterns of them is arranged in a bit stream (step S65).
- 16 4x4 blocks of each color component are encoded together and arranged in a bit stream for each color component (steps S66, S67, S68).
- predjag may be determined to include the case of force prev_intra_pred_mode_flag force that is valid information only when prev_intra_pred_mode_flag is used. That is, for example, taking the example of the C1 component,
- predCurrlntraPredMode Min (IntraPredModeA, IntraPredModeB);
- predCurrlntraPredModel Min (IntraPredModeA, IntraPredModeB);
- predCurrIntraPredMode2 CurrlntraPredMode— CO;
- predCurrlntraPredMode predCurrlntraPredModel
- Encode rem— intra— pred— mode You may comprise so that it may carry out a code
- it may be configured to encode pred_flag without depending on whether or not rem_intra_pred_mode is encoded in the intra prediction mode in the block at the same position of the CO component. In this case, the CO component intra prediction mode is always used as a candidate for the predicted value.
- predCurrlntraPredMode 1 Min (IntraPredModeA, IntraPredModeB);
- predCurrIntraPredMode2 CurrlntraPredMode— CO;
- predCurrlntraPredMode predCurrIntraPredMode2; il CurrlntraPredMode ⁇ predCurrlntraPredMode)
- pred_flag may be set in units of macroblocks or sequences in units of 4x4 blocks.
- all 4x4 blocks in the macroblock are shared! /, And prediction value candidate 1 or prediction value candidate 2 is used in common.
- overhead information to be transmitted can be further reduced.
- it in order to determine which one of predicted value candidate 1 or predicted value candidate 2 is used according to the input color space definition, it can be determined in units of sequences. In this case, it is not necessary to transmit pred Jkg for each macro block, and overhead information can be further reduced.
- variable length decoding unit 25 has the data arrangement on the bitstream shown for the intra NxN prediction mode information, in particular, the decoding procedure is not shown.
- the third embodiment a specific method of the decoding procedure is shown.
- the third embodiment considers the case where the intra NxN prediction mode value is high and correlates between color components, and the intra NxN prediction mode obtained for each color component correlates the values between color components. It is characterized in that it decodes a bitstream that has been subjected to entropy coding using.
- the bit stream array has the format shown in FIG.
- the value of the intra code mode common identification flag 33 in the bitstream is set so that the intra code mode is shared by C0, Cl, and C2. It is assumed that “Yes” is set. It is also assumed that the intra code mode is the intra NxN prediction mode, and the conversion block sizes 0 to 2 are 4x4 blocks. At this time, intra prediction modes 0 to 2 (35a to 35c) are all set to the intra 4x4 prediction mode. Similar to the encoding device, the relationship shown in FIGS. 18 to 20 is also used in the decoding device.
- FIG. 23 shows the flowchart of the decryption procedure.
- steps given the same numbers as in FIGS. 21 and 22 indicate that the same processing as that of the encoding device is executed.
- FIG. 18 shows the state of the CO component of macroblock X.
- macroblock X There are two cases of macroblock X depending on the position of the 4x4 block to be decoded. Casel is the case where the 4x4 block on the left and top of the 4x4 block to be decoded is outside the current macroblock X, that is, the macroblock A or macroblock B. Case 2 is the case where the 4x4 block on the left and upper side of the 4x4 block to be decoded belongs to the inside of the current macroblock X, that is, the macroblock X.
- the 4x4 block to be decoded is called block X
- the left and upper 4x4 blocks of block X are called block A and block B, respectively.
- one intra 4x4 prediction mode is assigned to each 4 x 4 block X in macroblock X, and this is called CurrlntraPredMode.
- the intra 4x4 prediction mode of block A is IntraPredModeA
- the intra 4x4 prediction mode of block B is IntraPredModeB. Both IntraPredModeA and IntraPredModeB are already decoded information when the block X is encoded.
- these parameters are assigned first (step S5 0).
- step S51 the prediction value predCurrlntraPredMode for the CurrlntraPredMode of the block X is determined by the following equation (step S51).
- predCurrlntraPredMode Kiin, IntraPredModeA, IntraPredMoaeB)
- CurrlntraPredMode rem_intra_pred_mode + 1 is set (step S65).
- predCurrlntraPredMode Kiin, IntraPredModeA, IntraPredMoaeB);
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- FIG. 19 shows a CI component decoding procedure.
- neighboring encoding parameters such as IntraPredModeA and IntraPredModeB are set (step S53).
- a prediction value candidate 1 predCurrlntraPredModel for CurrlntraPredMode of block X is determined by the following equation (step S54).
- predCurrlntraPredMode 1 Min (IntraPredModeA, IntraPredModeB)
- Model is used as it is for predCurrlntraPredMode in CI block X. This is the same as the reason described in the sign key device.
- predCurrIntraPredMode2 then urrlntraPredMode— CO
- the background of setting this as a predicted value candidate is the same as the reason explained in the sign key device.
- the predicted value of CurrlntraPredMode in block X of the C1 component is finally determined as one of predCurrl ntraPredModel force and predCurrIntraPredMode2 (step S57). Which value to use is determined by decoding a 1-bit flag (precLflag). However, precLflag is decoded only when CurrlntraPredMode matches the predicted value, and when it does not match (when rem_intra_pred_mode is decoded), the predicted value uses predCurrlntraPredModel.
- Prediction value on the sleeve prediction value item 2, prev-intra-pred-mode-flag, pred-flag, rem-intra-pred-mode are given, and according to the following procedure, CurrlntraPredMode Is decrypted (step S66).
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- predCurrlntraPredMode predCurrlntraPredModel
- predCurrlntraPredMode predCurrIntraPredMode2;
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- FIG. 20 shows a procedure for decoding the C2 component.
- neighboring encoding parameters such as IntraPredModeA and IntraPredModeB are set according to the position of the block X (step S59).
- a prediction value candidate 1 predCurrlntraPredModel for CurrlntraPredMode of block X is determined by the following equation (step S60).
- predCurrlntraPredMode 1 Min (IntraPredModeA, IntraPredModeB)
- predCurrlntraPredModel is used as it is in predCurrlntraPredMode in CI component block X. This is the same as the reason described in the sign key device.
- prev_intra_pred_mode_flag 0 that is, when remj ntra_pred_mode is decoded in C0 or C1 component (step S61), C0 or CurlntraPre dMode of C1 component is set as prediction value candidate 2 (step S62). .
- pre dCurrIntraPredMode2 CurrlntraPredMode— Cl;
- predCurrIntraPredMode2 CurrlntraPredMode— Cl;
- the reason for setting this as a predicted value candidate is also the same as the reason described in the sign key device.
- the predicted value of CurrlntraPredMode in block X of the C2 component is finally determined as one of predCurrl ntraPredModel force and predCurrIntraPredMode2 (step S63). Which value to use is determined by decoding a 1-bit flag (predjlag). However, predjag is decoded only when CurrlntraPredMode matches the predicted value. When it does not match (when rem_intra_pred_mode is decoded), predCurrlntraPredModel is used as the predicted value.
- Predicted value candidate sleeve B predicted value candidate B2, prev-intra-pred-mode-flag, pred-flag, rem-intra-pred-mode are given, and the following procedure is used to set CurrlntraPredMode.
- Decrypt Step S71
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- predCurrlntraPredModel Min (IntraPredModeA, IntraPredModeB);
- predCurrIntraPredMode2 CurrlntraPredMode— CO;
- predCurrIntraPredMode2 CurrlntraPredMode— CI;
- predCurrlntraPredMode predCurrlntraPredModel
- predCurrlntraPredMode predCurrIntraPredMode2;
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1; [0116]
- the above decoding procedure can be similarly defined for the intra 8x8 prediction mode.
- the coding amount of the prediction mode itself is reduced by using the correlation with the prediction mode selected in other color components, and the coding is performed. A bitstream with improved efficiency can be decoded.
- predjag may be decoded as including information even when the force prev_intra_pred_mode_flag is 0, which is information that is decoded only when prev_intra_pred_mode_flag power is ⁇ ! ⁇ .
- predCurrlntraPredMode Min (IntraPredModeA, IntraPredModeB);
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- predCurrlntraPredMode predCurrlntraPredModel
- predCurrlntraPredMode predCurrIntraPredMode2;
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- the effect of this method is as described in the description of the encoding procedure on the corresponding encoding device side. Further, it may be configured to decode predjkg without depending on whether or not rem_intra_pred_mode is decoded in the intra prediction mode in the block at the same position of the C0 component. In this case, the C0 component intra prediction mode is always used as a prediction value candidate.
- predCurrlntraPredModel Min (IntraPredModeA, IntraPredModeB);
- predCurrIntraPredMode2 CurrlntraPredMode— CO; Decode prev— intra— pred— mode— flag;
- predCurrlntraPredMode predCurrlntraPredModel
- predCurrlntraPredMode predCurrIntraPredMode2;
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- pred_flag may be included in the bitstream not in units of 4x4 blocks but in units of macroblocks or sequences.
- Embodiment 4 In Embodiment 2, the bit stream in the format of FIG. 16 has been described.
- each color component of C0, Cl, and C2 depends on the value of transform block size identification flags 0 to 2 (32a to 32c).
- Intra prediction mode of force It is described that it is recognized as S intra 4x4 prediction mode or intra 8x8 prediction mode.
- this bit stream arrangement is changed, and as shown in FIG. 24, intra prediction mode instruction flags 1 and 2 (36a, 36b) are transmitted at the sequence level for Cl and C2 components. Constitute.
- the intra prediction mode instruction flag is used when the intra NxN prediction mode is selected in the intra code key mode, and when the transform block size identification flag indicates 4x4 conversion, that is, in the case of the intra 4x4 prediction mode. It is valid, and it is possible to switch between the following two states according to this value.
- a 4x4 block corresponds to a very small image area.
- the Cb and Cr components and the texture structure of the two images are retained, and the prediction mode information itself is fixed to one rather than giving room for selecting nine prediction modes for the components.
- the prediction mode may be used. State 2 may also be determined to use the same intra 4x4 prediction mode as CO for C1 or C2 components. In this case as well, overhead bits can be reduced because there is no need to code the intra 4x4 prediction mode for the C1 or C2 components.
- the encoding device and decoding device according to the fifth embodiment conform to the MPEG-4 AVC (ISO / IEC 14496-10) / ITU-T H.264 standard, which is Non-Patent Document 1. It is assumed that features unique to the present invention are given based on the code method employed.
- the video encoding device in the fifth embodiment is different from the configuration of the encoding device in FIG. 11 described in the second and third embodiments only in the operation of the variable-length encoding unit 11.
- the video decoding apparatus according to the fifth embodiment is different from the decoding apparatus of FIG. 12 described in the second and third embodiments only in the operation of the variable length decoding unit 25.
- the other operations are the same as those in the second and third embodiments, and only the differences will be described here.
- the variable length coding unit 11 has shown a specific coding method for intra NxN prediction mode information in the bitstream in the format of FIG.
- another specific method of the sign key procedure is shown.
- the fifth embodiment pays attention to the fact that the value of the intra NxN prediction mode reflects the structure of the texture as an image pattern, and provides a method for performing adaptive prediction within the neighboring pixel region in the same color component. There is a feature in the point. The following description assumes a bitstream arrangement of the format shown in Figure 16.
- the CO code encoding method is performed on the assumption that the code N of the intra NxN prediction mode information of each component of C0, Cl, and C2 is encoded independently for each color component.
- the value of the intra code key common identification flag 33 is set to share the intra code key mode with C0, Cl, and C2, and the intra code key mode is set to INT.
- La NxN prediction mode, transform block size identification flags 0 to 2 (32a to 32c) are assumed to be 4x4 blocks. At this time, all intra prediction modes 0 to 2 (35a to 35c) are set to the intra 4x4 prediction mode.
- FIG. 18 is used as an explanatory diagram of the coding procedure for the intra NxN prediction mode information of the CO component.
- X is the current macroblock that is the target of the sign ⁇ .
- the macroblock A on the left next to the macroblock A and the macroblock directly above are the macroblock B.
- FIG. 25 shows a flowchart of the sign key sequence.
- the prediction value predCurrlntraPredMode for the intra 4x4 prediction mode CurrlntraPredMode assigned to each 4x4 block X in Fig. 18 is set to the smaller one of IntraPredModeA and IntraPredModeB. Assigned uniquely.
- This method is also used in the current AVC / H.264 standard, and the larger the value of the intra NxN prediction mode, the more complicated the predicted image generation method is with pixel interpolation that takes into account the directionality of the image pattern. This is because it is a mode and has a high conformity to general image patterns, a small mode, and a value assigned to it.
- the code amount increment in the prediction mode has a greater effect on the mode selection than the distortion increment, so this method is also beneficial to the overall code efficiency.
- the distortion increment has a greater effect on the mode selection than the prediction mode code amount increment, so the smaller of IntraPredModeA and IntraPredModeB is not necessarily optimal. Disappear.
- the accuracy of the prediction value is improved by adapting the prediction value setting according to the states of IntraPredModeA and IntraPredModeB as described below.
- predCurrlntraPredMode is determined based on the state of IntraPredModeA and IntraPredModeB as the value that can be estimated to best estimate CurrlntraPredMode when viewed as an image pattern (steps S73, S74, S75).
- MIN IntraP redModeA, IntraPredModeB
- IntraPredModeA and IntraPredModeB are 3 or more and the prediction direction is the same (Example: 0 when IntraPredModeA is ⁇ and IntraPredModeB is 7, prediction is from the top right)
- the prediction mode to be used (7 in the above example) is predCurrlntraPredMode.
- Step S50, S53, and S59 preparation processing for encoding such as IntraPredModeA and IntraPredModeB is performed in advance (steps S50, S53, and S59).
- predCurrlntraPredMode is uniquely derived from the values of IntraPredModeA and IntraPredModeB.
- Figure 26 shows the rules for setting the predicted values. The shaded portion in FIG. 26 is a case where the conventional rule of MIN (IntraPredModeA, IntraPredModeB) is not followed! /, And a case where a predicted value with a better continuity of the image pattern is judged.
- MIN IntraPredModeA, IntraPredModeB
- step (1) a class 0 table is used.
- a class 1 table is used.
- predCurrlntraPredMode is determined as a result of the above, the sign code is completed by executing the remaining sign code procedure for the C0 component described in the third embodiment (steps S52, S58, S64).
- the above encoding procedure can be similarly defined for the intra 8x8 prediction mode.
- the correlation of the prediction mode in the neighboring pixel region of the same color component can be used more effectively, and the code amount of the prediction mode itself can be reduced. Therefore, it is possible to improve the code efficiency.
- one specific decoding procedure of the information of the intra NxN prediction mode in the variable length decoding unit 25 is shown for the bit stream in the format of FIG.
- another specific method of the decoding procedure is shown.
- adaptive prediction is performed within the neighboring pixel region in the same color component, and the sign This is characterized in that the bitstream subjected to the above is decoded.
- the bit stream array has the format shown in FIG.
- the value of the intra code key mode common identification flag 33 in the bitstream is set to share the intra code key mode with C0, Cl, and C2.
- the intra code mode is the intra NxN prediction mode
- the transform block size identification flags 0 to 2 32a to 32c) are 4x4 blocks.
- all intra prediction modes 0 to 2 35a to 35c are set to the intra 4x4 prediction mode.
- the decoding device will explain only the CO component using the relationship shown in FIG. 18 (Cl and C2 are decoded independently of C0 in the same procedure).
- X be the current macroblock to be decoded.
- the left macroblock is macroblock A
- the macroblock just above is macroblock B.
- predCurrlntraPredMode is determined using the table in FIG. 26 in exactly the same procedure as shown in the encoding procedure. Since IntraPredModeA and IntraPredModeB have already been decoded and are already known, it is possible to perform exactly the same processing as the encoding procedure.
- CurrlntraPredMode rem— intra— pred— mode
- CurrlntraPredMode rem— intra— pred— mode + 1;
- the above decoding procedure can be similarly defined for the intra 8x8 prediction mode.
- the stream can be decoded.
- the table of FIG. 26 is used in a fixed manner and predCurrlntraPredMode is determined and the encoding is performed.
- the intra prediction mode may be predCurrrl ntraPredMode, and the encoding / decoding may be performed while sequentially updating.
- IntraPredModeB 0, predCurrlntraPredMode is always 0.
- the sixth embodiment another configuration example of the encoding device in FIG. 11 and the decoding device in FIG. 12 is shown.
- the encoding device and decoding device according to the sixth embodiment are based on the MPEG-4 AVC (ISO / IEC 14496-10) / ITU-T H.264 standard, which is Non-Patent Document 1, as in the other embodiments described above. It is assumed that features unique to the present invention are given based on the code method employed.
- the video coding apparatus according to the sixth embodiment differs from the coding apparatus of FIG. 11 described in the second, third, and fifth embodiments only in the operation of the variable-length coding unit 11.
- the video decoding apparatus according to the sixth embodiment differs from the decoding apparatus of FIG. 12 described in the second, third, and fifth embodiments only in the operation of the variable length decoding unit 25.
- the other operations are the same as those in the second, third, and fifth embodiments, and only the differences will be described here.
- a specific coding method for intra NxN prediction mode information has been shown for the bit stream in the format of FIG.
- another specific method of the encoding procedure is shown.
- adaptive arithmetic coding is performed in the neighboring pixel region in the same color component. It is characterized in that it gives a method.
- the following explanation assumes a bitstream array of the format shown in Figure 16.
- Embodiment 6 intra Nx of each component of C0, Cl, C2
- the sign of the N prediction mode information is coded independently for each color component, and the CO component coding method is applied to Cl and C2 as well. Only will be described.
- the value of the intra code key mode common identification flag 33 is set to share the intra code key mode with C0, Cl, and C2, and the intra code key mode is set to the intra NxN prediction mode and transform block size.
- the identification flags 0 to 2 (32a to 32c) are assumed to be 4x4 blocks. At this time, the intra prediction modes 0 to 2 (35a to 35c) are all set to the intra 4x4 prediction mode.
- FIG. 18 is used as an explanatory diagram of the coding procedure of the intra NxN prediction mode information of the CO component.
- X is the current macroblock that is the target of the sign ⁇ .
- the left macroblock is macroblock A, and the macroblock directly above is macroblock B.
- Figure 27 shows the flowchart of the sign key sequence.
- the prediction value predCurrlntra aPredMode for the intra 4x4 prediction mode CurrlntraPredMode assigned to each 4x4 block X in Fig. 18 is the smaller of IntraPredModeA and IntraPredModeB.
- prevjntra_pred_mode_flag is set to 1
- the encoding of intra 4x4 prediction mode for block X is discontinued, and if different, the code is transmitted by rem_intra_pred_mode.
- CurrlntraPredMode is directly arithmetically encoded using the states of IntraPredModeA and IntraPredModeB.
- a code key procedure according to the context adaptive binary arithmetic code key adopted in the AVC / H.264 standard is used.
- the CurrlntraPredMode to be encoded is binarized according to the format shown in Fig. 28 (step S76).
- the second bin gives the Terminate bit to the prediction mode value that is considered to have the highest appearance frequency in both the vertical and horizontal directions.
- the code is constructed so that the remaining prediction mode values are terminated in descending order of appearance frequency.
- the second and subsequent bins in the binary sequence structure in FIG. 28 are preferably set according to the symbol occurrence probability in the actual image data code process.
- the arithmetic sign ⁇ is executed while selecting the (0, 1) occurrence probability table to be used in sequence for each bin of the binary sequence.
- the context used for the arithmetic sign ⁇ is determined as follows (step S78).
- Context A binary table indicating whether the intra prediction mode is vertical prediction or horizontal prediction.
- the present flag intra_pred_direction_flag is defined for IntraPredModeA and IntraPredModeB, and the following four states are used as context values.
- C IntraPredModeA
- IntraPredMod IntraPredMod
- the conditional probability of CurrlntraPredMode assuming the state of eB is obtained, and an initial occurrence probability table (0, 1) determined based on it is assigned.
- an initial occurrence probability table (0, 1) determined based on it is assigned.
- the occurrence probability table is updated with the sign key value (step S79).
- an initial occurrence probability table (0, 1) determined according to the occurrence probability of each prediction mode value is assigned in advance (step S80).
- binary arithmetic coding and occurrence probability table update are performed (step S81).
- the above encoding procedure can be similarly defined for the intra 8x8 prediction mode.
- the adaptive arithmetic code ⁇ is used to encode the prediction mode information using the correlation of the prediction mode in the neighboring pixel region of the same color component. Since it can be applied, the code efficiency can be improved.
- one specific decoding procedure of intra NxN prediction mode information in the variable-length coding unit 25 is shown for the bit stream in the format of FIG.
- another specific method of the decoding procedure is shown.
- the state 6 uses the adaptive arithmetic code ⁇ within the neighboring pixel region in the same color component to This is characterized in that the bitstream subjected to the above is decoded.
- the bit stream array has the format shown in FIG.
- the value of the intra code key mode common identification flag 33 in the bitstream is set to share the intra code key mode with C0, Cl, and C2.
- the intra code mode is the intra NxN prediction mode
- the transform block size identification flags 0 to 2 are 4x4 blocks.
- all intra prediction modes 0 to 2 are set to the intra 4x4 prediction mode.
- the decoding device will explain only the CO component using the relationship of FIG. 18 (Cl and C2 are decoded independently of CO in the same procedure).
- X be the current macroblock to be decoded.
- the left macroblock is macroblock A
- the macroblock just above is macroblock B.
- PredCurrlntraPredMode for CurrlntraPre dMode is uniquely assigned to the smaller one of IntraPredModeA and IntraPredMod eB. Is configured to restore block X's intra 4x4 prediction mode by decoding rem_intra_pred_mode.
- CurrlntraPredMode is directly arithmetically decoded using the states of IntraPredModeA and IntraPredModeB. At this time, the decoding procedure according to the context adaptive binary arithmetic decoding adopted in the AVC / H.264 standard is used.
- the CurrlntraPredMode to be decoded is encoded as a binary sequence according to the format shown in Fig. 28, and this sequence is also subjected to binary arithmetic decoding sequentially with the leftmost force.
- the first bin of the binary sequence is a code that classifies CurrlntraPredMode for vertical prediction or horizontal prediction power (see Fig. 3). After 2 bins, the appearance frequency is high in the prediction mode value, and it is coded so that it is sequentially terminated from the first. It is configured. The reason for this code configuration is as described in the code ⁇ procedure.
- the decoding process first, when decoding the first bin, the same C as the content used in the encoding process is determined. Select the occurrence probability table of the 1st bin according to the value of C and perform arithmetic recovery
- the occurrence probability table is updated with the decoded value.
- an initial occurrence probability table (0, 1) determined according to the occurrence probability of each prediction mode value is assigned.
- binary arithmetic decoding and occurrence probability table update are performed as in the first bin.
- the binary sequence in FIG. 28 is configured so that each prediction mode value can be uniquely identified. Therefore, when a predetermined number of bins are restored, the Currln traPredMode is sequentially decoded.
- the above decoding procedure can be similarly defined for the intra 8x8 prediction mode.
- the stream can be decoded.
- Context B Intra prediction mode power 3 ⁇ 4C prediction power, binary table of DC prediction
- the present flag intra_dc_pred_flag is defined for IntraPredModeA and IntraPredModeB, and the following four states are used as context values.
- intra_dc_pred_flag is set to 1 in Fig. 3 if intra4x4_pred_mode takes value 2, and 0 if it takes other values.
- a video signal input in 4: 4: 4 format is encoded by using inter-frame prediction in units obtained by equally dividing a 16 ⁇ 16 pixel rectangular area (macroblock).
- a dredge device and a corresponding decoding device will be described.
- the present encoding device and decoding device are based on the encoding method employed in the MPEG-4 AVC (ISO / IEC 14496-10) / ITU-TH H.264 standard (hereinafter referred to as AVC), and are included in the present invention. It shall be given unique features.
- FIG. 30 shows the configuration of the video encoding apparatus in the seventh embodiment
- FIG. 31 shows the configuration of the video decoding apparatus in the seventh embodiment.
- FIG. 31 it is shown that elements having the same numbers as the constituent elements of the sign key device of FIG. 30 are the same elements.
- the input video signal 1 is an individual video frame having a 4: 4: 4 format, and the three color components are divided into macro blocks of the same size. It is assumed that the unit is input to the encoding device.
- one frame of reference image is selected from one or more frames of motion compensation prediction reference image data stored in the memory 16, and each color component is selected in units of the macroblock. Then, motion compensation prediction processing is performed. Three memories are prepared for each color component (in this embodiment, it is described as three faces, but it may be changed as appropriate depending on the design).
- the selected size information is the macro block size information. As lock type, size information of 8x8 block unit is output as sub macro block type. In addition, the identification number and motion vector information of the reference image selected for each block are output.
- the video coding apparatus is characterized in that the motion compensation prediction processing method for the three color components is switched based on the inter prediction mode common identification flag 123. This point is described in detail in 2 below.
- the motion compensated prediction unit 102 has all block sizes or sub-block sizes shown in FIG. 32, all motion vectors 137 in a predetermined search range, and one or more selectable reference images. Then, a motion compensation prediction process is performed on the image, and a prediction difference signal 4 is obtained by the motion vector 137, one reference image, and the subtractor 3. The prediction differential signal 4 is evaluated for the prediction efficiency by the coding mode determination unit 5, and the macro block that can obtain the optimal prediction efficiency for the macro block to be predicted from the prediction processing executed by the motion compensation prediction unit 102.
- Type Z sub-macroblock type 106, motion vector 137 and reference image identification number are output.
- the weighting factor 20 for each type determined by the coding control unit 19 may be considered. Further, the prediction difference signal 4 obtained by motion compensation prediction based on the selected type, the motion vector 137 and the reference image is output to the orthogonal transform unit 8.
- the orthogonal transform unit 8 transforms the input prediction difference signal 4 and outputs it to the quantization unit 9 as an orthogonal transform coefficient.
- the quantizing unit 9 quantizes the input orthogonal transform coefficient based on the quantization parameter 21 determined by the code key control unit 19, and outputs the quantized transform coefficient 10 to the variable length code key unit 11. Output.
- the quantized transform coefficient 10 is entropy-encoded by a variable-length code key unit 11 by means such as a Huffman code or an arithmetic code key. Also, the quantized transform coefficient 10 is restored to the local decoded prediction differential signal 14 via the inverse quantization unit 12 and the inverse orthogonal transform unit 13, and is referred to the selected macroblock type Z sub macroblock type 106 and motion vector 137.
- the local decoded image 15 is generated by adding the predicted image 7 generated based on the image and the adder 18.
- the locally decoded image 15 is stored in the memory 16 for use in subsequent motion compensation prediction processing.
- the variable length coding unit 11 also receives a deblocking filter control flag 24 indicating whether or not to apply a deblocking filter to the macroblock.
- the pixel data before being subjected to the deblocking filter is stored in the memory 16 and used, so the deblocking filter process itself is not necessary for the encoding process. However, on the decoding device side, a deblocking filter is performed according to the instruction of the deblocking filter control flag 24 to obtain a final decoded image).
- the quantization parameter 21 is arranged and shaped as a bit stream according to a predetermined rule (syntax), and is output to the transmission buffer 17.
- the bit stream is smoothed according to the bandwidth of the transmission path to which the encoding device is connected and the reading speed of the recording medium, and is output as the video stream 22. Further, feedback is applied to the code key control unit 19 in accordance with the bit stream accumulation state in the transmission buffer 17, and the generated code amount in the code key of the subsequent video frame is controlled.
- the inter prediction mode refers to the block size that is the unit of motion compensation prediction described above, that is, the macroblock type Z sub-macroblock type
- the inter prediction mode determination process refers to the macroblock type Z sub-macroblock. This is the process of selecting the type, motion vector, and reference image. This processing is performed in units of macroblocks in which the above three color components are combined, and is mainly performed by the motion compensation prediction unit 102 and the code key mode determination unit 5 in the encoding device in FIG. Fig. 33 shows a flowchart showing the flow of this process.
- the image data of the three color components that make up the block are C0, Cl, and C2.
- the coding mode determination unit 5 receives the inter prediction mode common identification flag 123, and based on the value, C0, Cl, C2 common inter prediction modes and common motion vectors 137 and It is determined whether or not the common reference image is used (step S100 in FIG. 33). When sharing, go to step S101 and after, otherwise go to step S102 and after.
- the code mode determination unit 5 determines all of the motion compensation prediction units 102 that can be selected.
- the inter prediction mode, the motion vector search range, and the reference image are notified, and the motion compensated prediction unit 102 evaluates all the prediction efficiencies and determines the optimal inter prediction mode common to C0, Cl, and C2.
- a motion vector 137 and a reference image are selected (step S101).
- the coding mode determination unit 5 performs the motion.
- Dm, v, r is a code distortion or a prediction error amount when the inter prediction mode m, the motion vector V in a predetermined range, and the reference image r are applied.
- Code distortion is obtained by applying inter prediction mode m, motion vector V, and reference image r to obtain a prediction error, and converting the prediction error. The error with respect to is measured.
- the amount of prediction error is obtained by obtaining the difference between the prediction image when the inter prediction mode m, the motion vector V, and the reference image r are applied, and the signal before the sign, and quantifying the magnitude of the difference. For example, the sum of absolute distance (SAD) is used.
- Rm, v, r is the amount of generated code when the inter prediction mode m, the motion vector V, and the reference image r are applied.
- Jm, v, r is a value that specifies the trade-off between the code amount and the degree of degradation when the inter prediction mode m, the motion vector V, and the reference image r are applied, and gives the minimum Jm, v, r Inter prediction mode m, motion vector V, and reference image r give the optimal solution.
- the inter prediction mode, the motion vector 137, and the information of the reference image are allocated to a macroblock including three color components.
- the processing after step S102 when the processing after step S102 is performed, the inter prediction mode information, the motion vector 137, and the reference image are assigned to each color component. The Therefore, since the inter prediction mode assigned to the macroblock, the motion vector 137, and the information of the reference image are different, the coding device performs the processing process after S101 and the processing process after S102. It is necessary to multiplex the inter prediction mode common identification flag 123 into the bit stream so that the decoding apparatus can recognize it.
- Figure 34 shows the data arrangement of such a bitstream.
- Fig. 34 shows the bit stream data arrangement at the macroblock level.
- the macroblock type indicates intra-inter, and information indicating the block size that is a unit of motion compensation in inter mode. Including.
- the sub macroblock type is multiplexed only when the 8x8 block size is selected for the macroblock type, and includes block size information for each 8x8 block.
- the basic macroblock type 128 and basic submacroblock type 129 indicate that the common macroblock type and common submacroblock type are used when the inter prediction mode common identification flag 123 indicates "common in C0, Cl, C2". Otherwise, indicate the macroblock type and sub-macroblock type for CO.
- Extended macroblock type 130 and extended sub-macroblock type 131 are multiplexed for Cl and C2 only when the inter prediction mode common identification flag 123 indicates that it is not “common to C0, Cl and C2,” The macro block type and sub macro block type for Cl and C2 are shown.
- the reference image identification number is information for specifying a reference image to be selected for each block of 8x8 block size or more which is a motion compensation unit.
- a reference image that can be selected is one frame, one reference image identification number is multiplexed for each block.
- a set of motion vector information is multiplexed for each block as a motion compensation unit.
- the reference image identification number and the motion vector information need to be multiplexed by the number of blocks that are units of motion compensation included in the macroblock.
- the common reference image identification number and the common motion vector Information are “common to C0, Cl, and C2,” the common reference image identification number and the common motion vector Information, otherwise reference image identification number and motion vector information for CO.
- the inter prediction mode common identification flag 123 is “C0, Cl, C2 Only when it is not “common”, it is multiplexed for Cl and C2, respectively, and indicates the reference image identification number and motion vector information for Cl and C2.
- FIG. 34 includes a deblocking filter control flag 24 that is input to the variable-length encoding unit 11 in FIG. 30, and is a configuration necessary for explaining the features of the seventh embodiment. I have omitted it because it is not an element)
- the color space definition is fixed to Y, Cb, Cr.
- Y, Cb Various color spaces can be used without being limited to Cr.
- the inter prediction mode information as shown in FIG. 34, the optimum encoding process can be performed even when the color space of the input video signal 1 has various definitions. For example, when the color space is defined in RGB, the video texture structure remains uniformly in each of the R, G, and B components, and the common inter prediction mode information and the common motion vector information are used in the region where the color space is defined.
- the redundancy of the inter prediction mode information and the motion vector information itself can be reduced, and the code efficiency can be increased.
- the inter prediction mode and motion vector information optimal for the R component and the inter prediction mode and motion vector information optimal for the G and B components must be different. It is. Therefore, optimal code efficiency can be obtained by adaptively using the extended inter prediction mode, the extended reference image identification information, and the extended motion vector information.
- the decoding device in FIG. 31 receives the video stream 22 according to the arrangement in FIG. 34 output from the encoding device in FIG. 30, and the three color components have the same size (4: 4: 4 format). It is assumed that individual video frames are restored by performing decoding processing in units.
- variable-length decoding unit 25 receives the stream 22 and decodes the video stream 22 according to a predetermined rule (syntax) to obtain an inter prediction mode common identification flag 123, a quantized transform coefficient. 10. Extract information such as macroblock type / sub-macroblock type 106, reference image identification number, motion vector information, quantization parameter 21, and the like. The quantized transform coefficient 10 is input to the inverse quantization unit 12 together with the quantization parameter 21, and the inverse quantization process is performed. Then, the output is input to the inverse orthogonal transform unit 13 and the local decoded prediction difference Restored to minute signal 14.
- a predetermined rule syntax
- the macro block type Z sub-macroblock type 106, the inter prediction mode common identification flag 123, the motion vector 137, and the reference image identification number are input to the motion compensated prediction unit 102. Get. A specific procedure for obtaining the predicted image 7 will be described later. Local decoded prediction difference signal 14 and predicted image 7 are added by adder 18 to obtain provisional decoded image 15 (this is exactly the same signal as local decoded image 15 in the coding device). The provisional decoded image 15 is written back to the memory 16 to be used for motion compensation prediction of the subsequent macroblock. Three memories are prepared for each color component (this embodiment will be described as three faces, but may be changed as appropriate depending on the design). Further, based on the instruction of the deblocking filter control flag 24 decoded by the variable length decoding unit 25, the deblocking filter 26 is made to act on the provisional decoded image 15 to obtain the final decoded image 27.
- the decoding device in FIG. 31 receives the video stream 22 according to the arrangement in FIG. 34 output from the encoding device in FIG. 30, and the three color components have the same size (4: 4: 4 format). It is assumed that individual video frames are restored by performing decoding processing in units.
- FIG. 35 shows a flow chart showing the flow of processing performed by the variable length decoding unit 25 in this processing.
- step S110 the inter prediction mode common identification flag 123 in the data of FIG. 34 is decoded (step S110). Furthermore, basic macroblock type 128 and basic sub-macroblock type 129 are decoded (step Slll).
- step S112 it is determined whether the inter prediction mode is shared by C0, C1, and C2 using the result of the inter prediction mode common identification flag 123, and in the case of common (Yes in step S112), Basic macroblock type 128 and basic submacroblock type 129 shall be used for all of Cl and C2, otherwise (No in step S112) basic macroblock type 128
- the basic sub macroblock type 129 is used as a CO mode, and the extended macro block type 130 and the extended sub macro block type 131 are decoded for each of Cl and C2 (step S113). Get prediction mode information.
- step S114 when the basic reference image identification number 132 and basic motion vector information 133 are decoded (step S114), and the inter prediction mode common identification flag 123 indicates that “common with C0, Cl, C2” is indicated. (Yes in step S115), the basic reference image identification number 132 and basic motion vector information 133 are used for all of C0, Cl, and C2, otherwise the basic reference image is determined (No in step S115).
- the identification number 132 and the basic motion vector information 133 are used as CO information, and the extended reference image identification number 134 and the extended motion vector information 135 are decoded for each of Cl and C2 (step S116).
- the macroblock type Z sub-macroblock type 106, reference image identification number, and motion vector information for each color component are determined. Get a statue.
- FIG. 36 shows a variation of the bit stream data array in FIG.
- the inter prediction mode common identification flag 123 is multiplexed as a flag located in an upper data layer such as a slice, a picture, and a sequence, not as a macroblock level flag.
- the power inter prediction mode common identification flag 123 is multiplexed by multiplexing the inter prediction mode common identification flag 123 for each macroblock or upper data layer such as slice, picture, sequence, etc.
- FIG. 37 shows the bit stream data arrangement in that case.
- the inter prediction mode common identification flag 123 does not exist, and the profile information 136 that instructs the upper data layer such as a sequence to handle the input image in the 4: 4: 4 format is multiplexed.
- extended macroblock type 130, extended submacroblock type 131, extended reference image identification number 134, extended motion vector Information 135 is multiplexed.
- the macro block type Z sub-macro block type, the motion vector, and the reference image have different power for each color component.
- the macro block type Z sub-macro A video encoding device and a video decoding device are described in which the block type and the reference image are common to each component, and only the motion vector can be different for each component.
- the configuration of the video encoding device and the video decoding device in the eighth embodiment is the same as that in FIGS. 30 and 31 in the seventh embodiment.
- a motion vector common identification flag 123b The point that is using is different.
- the inter prediction mode determination process which is a feature of the coding apparatus according to the eighth embodiment, will be described in detail focusing on processes different from those in the seventh embodiment.
- This process is performed in units of macroblocks in which the above three color components are combined, and is mainly performed by the motion compensation prediction unit 102 and the code key mode determination unit 5 in the code key device of FIG.
- FIG. 38 is a flowchart showing the flow of this process.
- the image data of the three color components that make up the block are C0, Cl, and C2.
- the encoding mode determination unit 5 receives the motion vector common identification flag 123b, and determines whether or not to use the common motion vector 137 for C0, Cl, and C2 based on the value ( Step S120 in Fig. 37). In the case of sharing, go to Step S121 and after, otherwise go to Step S122 and after.
- the sign mode determination unit 5 instructs the motion compensation prediction unit 102 to select all inter prediction modes and motion vector searches.
- the motion compensation prediction unit 102 evaluates all prediction efficiencies and selects the optimal inter prediction mode, motion vector 137, and reference image common to C0, Cl, and C2. (Step S121).
- the sign key mode determination unit 5 performs motion compensation prediction. For part 102 Notify all possible inter prediction modes, motion vector search range and reference image
- the motion vector common identification flag 123b needs to be multiplexed with the bit stream so that the decoding apparatus can recognize it.
- Figure 39 shows the data arrangement of such a bitstream.
- FIG. 39 shows the data arrangement of the bit stream at the macroblock level.
- the macro block type 128b, the sub macro block type 129b, and the reference image identification number 132b are “common to C0, Cl, and C2.”
- the basic motion vector information 133 indicates common motion vector information when the motion vector common identification flag 123b indicates “common to C0, Cl, and C2,” otherwise indicates motion vector information for CO. .
- the extended motion vector information 135 is multiplexed for Cl and C2 only when the motion vector common identification flag 123b indicates “not common to C0, Cl, and C2,” and motion for Cl and C2 is performed. Shows vector information.
- the macroblock type / sub-macroblock type 106 in FIGS. 30 and 31 is a generic term for the macroblock type 128b and the sub-macroblock type 129b in FIG.
- the decoding apparatus receives the video stream 22 according to the arrangement in FIG. 39 output as the coding power of the eighth embodiment, and the three color components have the same size (4: 4: 4 It is assumed that individual video frames are restored by performing decoding processing in units of macro blocks in a format.
- FIG. 40 is a flowchart showing the flow of processing performed by the variable length decoding unit 25 in this processing.
- step S126 the macroblock type 128b and the sub macroblock type 129b that are common to C0, Cl, and C2 are decoded.
- the block size that is the unit of motion compensation is determined by the decoded macroblock type 128b or sub-macroblock type 129b, so the reference image identification number common to C0, Cl, and C2 for each block that will be the next unit of motion compensation
- the number 132b is decoded (step S127).
- step S128 the motion vector common identification flag 123b is decoded.
- step S129 the basic motion vector information 133 is decoded for each block serving as a motion compensation unit
- step S130 it is determined whether or not the motion vector 137 is shared by C0, Cl, and C2 using the result of the motion vector sharing identification flag 1 23b. , C2 is used for the basic motion vector information, otherwise (No in step S130), the basic motion vector information 133 is used as the CO mode, and the extended motion vector for each of Cl and C2. Information 135 is decrypted (step S131).
- the macroblock type Z sub-macroblock type 106, reference image identification number, and motion vector information for each color component are determined, and these are output to the motion compensation prediction unit 102 for motion compensation prediction of each color component. Obtain an image.
- FIG. 41 shows variations of the bit stream data array of FIG. 39, the motion vector common identification flag 123b is multiplexed as a flag located in an upper data layer such as a slice, a picture, or a sequence that is not as a macroblock level flag.
- an upper data layer such as a slice, a picture, or a sequence that is not as a macroblock level flag.
- the force vector common identification flag 123b is multiplexed without multiplexing the force vector common identification flag 123b for each macroblock or upper data layer such as slice, picture, sequence, etc. 4:
- Figure 42 shows the bitstream data arrangement in this case.
- the motion vector common identification flag 123b does not exist, and the profile information 136 instructing the upper data layer such as a sequence to handle the input image in the 4: 4: 4 format is multiplexed.
- the extended motion vector information 135 is multiplexed according to the decoding result.
- the macroblock type Z sub-macroblock type 106 and the reference image are made common to each color component, and only the motion vector 137 can be made different for each color component. did.
- the macroblock type Z sub-macroblock type 106 and the reference image identification number are multiplexed for each color component. Overhead bits can be reduced without any problems.
- the motion vector 137, and the reference image common to three components by the inter prediction mode common identification flag 123 or the profile information 136 For each color component that makes the macroblock type / sub-macroblock type 106, the motion vector 137, and the reference image common to three components by the inter prediction mode common identification flag 123 or the profile information 136, In this Embodiment 9, assuming a 4: 4: 4 format image such as Y, Cb, Cr format, etc., luminance component (Y) and color difference component (Cb , Cr) can be switched between different ones (in this case, the common mode is used for the two color difference components). In other words, it is possible to switch between the three components that are common to each component or different for each component, and that some can be different for the luminance component and the color difference component.
- a video encoding device and a video decoding device will be described. The configurations of the video encoding device and the video decoding device in the ninth embodiment are the same as those in FIGS. 30 and 31 in the seventh embodiment.
- the inter prediction mode determination process that is a feature of the coding apparatus according to the ninth embodiment will be described in detail with a focus on processes different from those in the seventh embodiment.
- FIG. 43 shows a flowchart showing the flow of this process.
- the image data of the three color components that make up the block are C0, Cl, and C2.
- the encoding mode determination unit 5 receives the inter prediction mode common identification flag 123, and based on the value, C0, Cl, and C2 share a common motion prediction mode with a common inter prediction mode. It is determined whether or not it is possible to use the reference 137 and the common reference image (step S132 in FIG. 43). In the case of sharing, go to Step S133 and after, otherwise go to Step S134 and after or Step S137 and after.
- the code mode determination unit 5 determines all of the motion compensation prediction units 102 that can be selected.
- the inter prediction mode, the motion vector search range, and the reference image are notified, and the motion compensated prediction unit 102 evaluates all the prediction efficiencies and determines the optimal inter prediction mode common to C0, Cl, and C2.
- a motion vector 137 and a reference image are selected (step S133).
- the data arrangement of the bitstream output by the coding apparatus according to the ninth embodiment is the same as in Fig. 34, but the inter prediction mode common identification flag 123 is "common to C1 and C2".
- the extended macroblock type 130, the extended submacroblock type 131, the extended reference identification number 134, and the extended motion vector information 135 are information common to Cl and C2. [0203] 2. Inter prediction duplication processing in duplication equipment.
- the decoding apparatus receives the video stream 22 according to the arrangement of FIG. 34 output by the coding apparatus power according to the ninth embodiment, and the three color components have the same size (4: 4: 4 It is assumed that individual video frames are restored by performing decoding processing in units of macro blocks in a format.
- FIG. 44 shows a flowchart showing the flow of processing performed by the variable length decoding unit 25 in this processing.
- step S140 the inter prediction mode common identification flag 123 is decoded from the data in FIG. 34 (step S140). Further, basic macroblock type 128 and basic sub-macroblock type 129 are decoded (step S141). In step S142, it is determined whether or not the inter prediction mode is shared by C0, C1, and C2 using the result of the inter prediction mode common identification flag 123. In the case of common use, all of C0, Cl, and C2 are determined.
- the basic macro block type 128 and the basic sub macro block type 129 are used, and the basic macro block type 128 and the basic sub macro block type 129 are used as the CO mode.
- the extended macroblock type 130 and the extended sub-macroblock type 131 common to the Cl and C2 components are decoded (step S143).
- extended macroblock type 130 and extended submacroblock type 131 are decoded for Cl and C2, respectively (steps S144, S145, and S146) Get mode information for Cl, C2.
- step S147 when the basic reference image identification number 132 and the basic motion vector information 133 are decoded (step S147), and the inter prediction mode common identification flag 123 indicates that “common with C0, Cl, C2” is indicated. , C 0, Cl, C2 all use basic reference image identification number 132 and basic motion vector information 133, otherwise basic reference image identification number 132 and basic motion vector information 133 are CO information.
- Cl and C2 are used in common, Cl and C2 Extended reference image identification number 134 and extended motion vector information 135 common to the components are decoded (step S149).
- the extended reference image identification number 134 and the extended motion vector information 135 for each of the C2 to decrypt step S150, S151, S152
- the macroblock type Z sub-macroblock type 106 of each color component, the reference image identification number, and the motion vector information are determined. These are output to the motion compensation prediction unit 102, and the motion compensated prediction image of each color component is output. Get.
- the extended macroblock type 130 when the inter prediction mode common identification flag 123 indicates “common to C1 and C2,” the extended macroblock type 130, The extended sub macroblock type 131, the extended reference identification number 134, and the extended motion vector information 135 are information common to Cl and C2, and the video code input / output is a video stream according to the data arrangement shown in FIG.
- the operation of the dredge device and the video decoding device is the same as in FIG.
- the macro block type Z sub macro block type 106, the motion vector 137, and the reference image can be made different for each color component.
- the macro block type 106 and the reference image are common to each component, and only the motion vector 137 is the force common to the three components, is different for each component, or is shared by Cl and C2, and the CO and C1, C2 You may switch between choosing the best one for each.
- the bit stream data arrangement in this case follows FIG. 39 or FIG. 41, and in this case as well, when the inter prediction mode common identification flag 123 indicates “common to C1 and C2,” the extended motion vector information 135 Is information common to Cl and C2.
- variable length coding unit 11 of the coding apparatus described in the seventh embodiment corresponds to a method of encoding the input motion vector 137 and multiplexing it into a bitstream.
- a method for decoding the bitstream force motion vector 137 in the variable length decoding unit 25 of the decoding device will be described.
- FIG. 45 is a part of the variable length code key unit 11 of the code key device shown in FIG. 30, and shows the configuration of the motion vector code key unit that codes the motion vector 137. [0210] A method of multiplexing the motion vector 137 of the three color components (C0, Cl, C2) into a bitstream in the order of C0, Cl, C2 is described.
- CO motion vector 137 be mvO.
- the motion vector prediction unit 111 obtains a prediction vector (mvpO) of the CO motion vector 137.
- the motion vectors (mvA 0, mvBO, mvCO) of blocks (A, B, C in Fig. 46) adjacent to the block where the motion vector (mvO) to be encoded is located are stored in memory. get. It is assumed that motion vectors 137 for A, B, and C have already been multiplexed into the bit stream. The median value of mvA0, mvB0, and mvCO is calculated as mvpO.
- the calculated prediction vector mvpO and the motion vector mvO to be encoded are input to the differential motion vector calculation unit 112.
- the difference motion vector calculation unit 112 calculates a difference vector (mvdO) between mvO and mvpO.
- the calculated mvdO is input to the differential motion vector variable length code input unit 113 and is entropy encoded by means such as a Huffman code or an arithmetic code.
- the motion vector prediction unit 111 obtains a prediction vector (mvpl) of the motion vector 137 of C1.
- the motion vector (mvAl, mvBl, m vCl) of the block adjacent to the block where the motion vector (mvl) to be encoded is located and the CO motion vector ( mvO) is obtained from memory 16.
- motion vectors 137 for A, B, and C are already multiplexed in the bitstream. Calculate the median of mvAl, mvBl, mvCl, mvO as mvpl.
- the calculated mvdl is input to the differential motion vector variable length coding unit 113 and entropy-coded by means such as Huffman coding or arithmetic coding.
- the motion vector prediction unit 111 obtains a prediction vector (mvp2) of the motion vector 137 of C2.
- the motion vectors (mvA2, mvB2, mVC2) of the block adjacent to the block where the motion vector (mv2) to be encoded is located and the CO and CI motion at the same position as the block where mv2 is located
- the median value of mvA2, mvB2, mvC2, mvO, mvl is calculated as mvp2.
- the calculated mvd2 is input to the differential motion vector variable length code key unit 113, and entropy-coded by means such as Huffman coding or arithmetic code y.
- FIG. 47 shows a configuration of motion vector decoding section 250 that decodes motion vector 137 in a part of variable length decoding section 25 of the decoding apparatus shown in FIG.
- the motion vector decoding unit 250 decodes the motion vectors 137 of the three color components multiplexed in the video stream 22 in the order of C0, Cl, and C2.
- the differential motion vector variable length decoding unit 251 extracts the differential motion vectors (mvd0, mvdl, mvd2) of the three color components (C0, Cl, C2) multiplexed in the video stream 22 and variable length. Decrypt.
- the motion vector prediction unit 252 calculates prediction vectors (mvpO, mvpl, mvp2) of the motion vectors 137 of C0, Cl, and C2.
- the calculation method of the prediction vector is the same as that of the motion vector prediction unit 111 of the encoding device.
- the calculated motion vector 137 is Stored in memory 16 for use as a prediction vector candidate.
- the motion vector of the same color component block adjacent to the block in which the motion vector to be encoded is located, Since the motion vector of a different color component block at the same position as the block where the target motion vector is located is used as a prediction vector candidate, the motion of adjacent blocks in the same color component in the boundary region of the object, etc.
- using the motion vector of the block in the same position with different color components as the prediction vector candidate can improve the efficiency of motion vector prediction and reduce the amount of motion vector code
- Embodiment 11 an example of another encoding apparatus and decoding apparatus derived from the encoding apparatus and decoding apparatus described in Embodiment 7 will be described.
- the encoding apparatus' decoding apparatus in the eleventh embodiment uses C0, Cl, and C2 components in the macroblock as individual header information. Whether or not to perform coding according to the information is determined according to a predetermined control signal, and the information of the control signal is multiplexed into the video stream 22.
- the header information necessary for decoding the C0, Cl, and C2 components is multiplexed into the video stream 22 according to the control signal, and there is no motion vector or conversion coefficient information to be transmitted according to the control signal. It is characterized by providing a means to efficiently code skip (or not coded) macroblocks.
- a special signal is given to the case where there is no code information to be transmitted for the macroblock to be coded, so that the macroblock A high-efficiency code with a minimum code amount is realized.
- the image data at the exact same position on the reference image used for motion compensated prediction is used as the predicted image (that is, the motion vector is zero), and the obtained prediction is used.
- the error signal is transformed and quantized, if all the transform coefficients after quantization in the macroblock become zero, the prediction error signal obtained by performing inverse quantization on the decoding side has an amplitude of zero. There is no conversion coefficient data to be transmitted to the device.
- Such macroblocks are conventionally called skip macroblocks or not coded macroblocks, and they are designed to prevent extra information from being transmitted by performing special signaling.
- motion vector assumptions are as follows: ⁇ When performing 16x16 prediction in Fig. 32 (a) and predicting values used for coding motion vectors (predicted vectors mvp0, mvpl, mvp2 are applicable) are actual motion vectors. If it is equal, the condition is met, and if there is no transform coefficient data to be transmitted, it is regarded as a skip macroblock.
- this skip macroblock is encoded, either of the following two methods is selected according to the variable-length encoding method to be used.
- Method 1 Count the number of skip macroblocks (RUN length) that are consecutive in the slice, and variable-length encode the RUN length.
- Method 2 For each macro block, encode a flag indicating whether or not it is a skip macro block.
- Figure 48 shows the bitstream syntax for each method.
- Figure 48 (a) shows variable-length coding.
- Fig. 48 (b) shows the case when adaptive arithmetic coding is used (method 2).
- method 1 skip macroblock signaling is performed by mb_skip_run, and in method 2, mb_skip_flag.
- MB (n) refers to the encoded data of the nth (not skipped) macroblock. Note that mb_skip_run and mb_skip_flag are assigned in units of macroblocks that combine C0, Cl, and C2 components.
- the decoding apparatus depends on the state of the control signal, that is, the signal corresponding to the inter prediction mode common identification flag 123 described in the seventh embodiment.
- the header information including the motion vector is changed for each component of C0, Cl, and C2, and a method for performing skip macroblock signaling for each component of C0, Cl, and C2 is provided. Examples of specific bitstream syntax are shown in FIG. 49 and FIG.
- FIG. 49 shows the configuration of the macroblock code data that is output from the encoding device of the eleventh embodiment and becomes the input of the decoding device of the eleventh embodiment.
- FIG. 50 shows the Cn in FIG. The detailed structure of the code key data of the component header information is shown.
- the operation on the decoding device side that receives the bit stream and restores the video signal will be mainly described. Refer to FIG. 31 for explanation of the operation of the decoding apparatus.
- the inter prediction mode common identification flag 123 in Embodiment 7 is expanded and defined as a macroblock header common identification flag 123c.
- Macro block header common identification flag 123c regards CO component header information 139a as basic macro block header information and multiplexes only CO component header information 139a as header information that is commonly used for Cl and C2 components, or C1 component It is a flag that indicates whether header information 139b and C2 component header information 139c are individually multiplexed as extension header information.
- the macroblock header common identification flag 123c is extracted and decoded from the video stream 22 by the variable length decoding unit 25.
- the macro block header common identification flag 123c indicates that only CO component header information 139a is multiplexed as header information that is also used in common with C1 and C2 components, all components of C0, C1, and C2
- the macroblock is decoded based on the various macroblock header information included in the CO component header information 139a.
- the CO component skip instruction information 138a and the C0 component header information 139a are also applied to the C1 and C2 components in common, the skip instruction information for the Cl and C2 components (138b and 138c)
- the header information (139b, 139c) is not multiplexed in the bit stream.
- the variable length decoding unit 25 first decodes and evaluates the C0 component skip instruction information 138a. If the C 0 component skip instruction information 138a indicates “skip”, it is assumed that the C0 component header information 139a is not coded, and the conversion coefficient in the C0 component header information 139a is valid / invalid.
- the indication information 142 is assumed to be zero (no encoded transform coefficient at all).
- the C0 to C2 component transform coefficient data 140a to 140c
- the motion vector 137 of all components C0, Cl and C2 is set to the same value and output.
- C0 component skip instruction information 138a indicates that it is not "skip"
- macro block type 128b indicates intra coding in C0 component header information 139a
- intra prediction mode 141 indicates intra prediction mode 141
- transform coefficient valid / invalid instruction information 142 (if transform coefficient valid / invalid instruction information 142 is not 0) Decode the quantization parameter. If the transform coefficient valid / invalid instruction information 142 is not zero, the C0 to C2 component transform coefficient data (140a to 140c) is decoded and output in the form of a quantized transform coefficient 10.
- C0 to C2 component transform coefficient data (140a to 140c) are all zero, and all quantized transform coefficients 10 in the macroblock are output as zero. If the macro block type 128b indicates inter coding, the sub macro block type 129b is decoded as necessary, Further, reference image identification number 132b, motion vector information 133b, transform coefficient valid / invalid instruction information 142, and quantization parameter 21 are decoded (if transform coefficient valid / invalid instruction information 142 is not 0). If the transform coefficient valid / invalid instruction information 142 is not zero, the C0 to C2 component transform coefficient data (140a to 140c) is decoded and output in the form of a quantized transform coefficient 10.
- C0 to C2 component transform coefficient data 140a to 140c are all zero, and all quantized transform coefficients 10 in the macroblock are output as zero.
- the macroblock decoding is performed in accordance with a predetermined processing procedure using the output from the variable length decoding unit 25 by the above operation as in the seventh embodiment.
- the macro block header common identification flag 123c indicates that the header information 139b of the C1 component header and the header information 139c of the C2 component are individually multiplexed as extension header information separately from the CO component header information 139a , C0, Cl, and C2 are decoded based on various macroblock header information included in the corresponding header information (139a to 139c).
- skip instruction information (138b, 138c) and header blue (139b, 139c) force S bitstreams are multiplexed on the Cl and C2 components.
- the variable length decoding unit 25 first decodes and evaluates the CO component skip instruction information 138a.
- the CO component header information 139a is regarded as not coded, and the conversion coefficient in the CO component header information 139a is valid / invalid.
- the indication information 142 is assumed to be zero (no encoded transform coefficient at all).
- the C0 component transform coefficient data 140a is regarded as unsigned, and all the quantized transform coefficients in the C0 component are set to zero (that is, C0 component transform coefficient data 140a is determined by the value of the macroblock header common identification flag 123c.
- the relationship between the component skip instruction information 138a and the conversion coefficient valid / invalid instruction information 142 changes).
- the motion vector 137 of the C0 component is set and output according to the definition in the case of skipping the C0 component.
- the C0 component skip instruction information 138a indicates that it is not "skip"
- the C0 component header information 139a exists and is decoded.
- the intra prediction mode 141 Spatial pixel prediction mode in which the neighboring pixel of the pixel to be predicted in the frame is used as the prediction value
- transform coefficient valid / invalid instruction information 142 and (quantum if transform coefficient valid / invalid instruction information 1 42 is not 0) Decryption parameter 21. If the transform coefficient valid / invalid instruction information 142 is not zero, the CO component transform coefficient data is decoded and output in the form of a quantized transform coefficient of 10.
- the conversion coefficient valid / invalid instruction information is zero, all CO component conversion coefficient data are assumed to be zero. If the macro block type indicates inter coding, the sub macro block type is decoded as necessary, and the reference image identification number, motion vector information, transform coefficient valid / invalid instruction information, (transform coefficient valid / invalid instruction) Decode the quantization parameter (if not info power). If the transform coefficient valid / invalid instruction information is not zero, the CO component transform coefficient data is decoded and output in the form of a quantized transform coefficient 10. If the conversion coefficient valid / invalid instruction information is zero, the CO component conversion coefficient data is all zero. Repeat the above procedure for Cl and C2.
- the signal component power equivalent to the luminance signal that conveys the content of the image signal S When it is equivalent to the three color components, the input video signal to each component Variations in signal characteristics may occur due to the way the noise is ridden, etc. It may not be optimal to code all components C0 to C2 together.
- the encoding apparatus uses the macro block header common identification flag 123c according to the signal characteristics for each component of C0 to C2.
- the optimal code key mode (macroblock type including intra and inter code key types) and motion vector can be selected to increase the code key efficiency. .
- the presence / absence of encoded information is determined for each component. Since the skip instruction information 138 can be discriminated, only one component is force skipped.1S When another component is not skipped, it is not necessary to skip all components. Allocation can be performed.
- the value of the skip instruction information 138 is converted into the quantized conversion coefficient data 10 and the motion vector 137, the reference image identification number 132b, the macro block type Z sub-macro in the variable length code section 11. Based on block type 106,
- the configuration of the bitstream handled by the decoding apparatus may be as shown in FIG.
- skip instruction information (138), header information (139 & 1390), and conversion coefficient data (140 & 1400) for each component of C0, Cl, and C2 are arranged together.
- the skip instruction information 138 may be arranged such that the C0, Cl, and C2 states are arranged in 1-bit code symbols, and the 8 states are combined into one code symbol. . If there is a high correlation between the color components in the skip state, the coding efficiency of the skip instruction information 138 itself is determined by combining the code symbols and appropriately defining a context model of arithmetic code ⁇ (described later in Embodiment 12). You can increase the
- the macroblock header common identification flag 123c may be multiplexed into the bitstream in units of an arbitrary data layer such as a macroblock, a slice, a picture, or a sequence. If there is a steady difference in signal characteristics between color components in the input signal, it is efficient with less overhead information if the macroblock header common identification flag 123c is multiplexed in sequence units. Can be performed. If the macro block header common identification flag 123c is configured to be multiplexed in units of pictures, the macro block type has few NORMALizations, and the I picture has a common header, and the macro block type has many variations. P, B picture Therefore, using individual headers for each color component is expected to improve the balance between code efficiency and computation load. it can.
- switching at the picture layer is also desirable from the viewpoint of video signal coding control that changes the signal characteristics for each picture, such as scene changes.
- the macroblock header common identification flag 123c is multiplexed in units of macroblocks, the amount of code per macroblock increases, but the header information is shared based on the signal status of each color component in units of macroblocks. It is possible to control whether or not to perform the encoding, and it is possible to configure a coding apparatus that improves the compression efficiency by following the local signal fluctuation of the image.
- the macroblock header common identification flag 123c is multiplexed for each slice, and the flag is “common to C0, Cl, C2”.
- the bit stream is configured to include all the sign information of the three color components, and the flag is not “common to C0, Cl and C2”.
- a possible method is to configure the bitstream so that one slice contains information of one color component. This is shown in FIG. In FIG. 52, the macro block header common identification flag 123c indicates that “the current slice includes all the encoding information of the three color components” or “the current slice is the code of a specific color component. It should be given meaning as slice configuration identification information.
- Such slice configuration identification information may be prepared separately from the macroblock header common identification flag 123c! /. If it is identified as “the current slice contains the sign information of a specific color component,” it shall contain the identification “C0, Cl, C2”.
- one macroblock header is commonly used for C0, Cl, and C2 components in units of slices (C0, Cl, and C2 mixed slices), or a macroblock header is added for each C0, Cl, and C2 component.
- the encoder apparatus can mix C0, Cl, and C2 slices, CO slices, C1 slices, and C2 slices according to the nature of the local signal in the picture.
- variable length decoding unit 25 decodes the slice configuration identification information from the bit stream power every time slice data is input. This also identifies which slice in FIG. 52 is the slice to be decoded.
- the inter prediction mode common identification flag 123 or macro block header common If the state of the identification flag 123c) is set to “use individual inter prediction mode or (macroblock header) for C0, Cl, C2”, the decoding operation should be performed. Since it is guaranteed that the value of first_mb_in_slice for each slice is equal to the number of macroblocks in the slice, C0, Cl, and C2 mixed slices can be decoded based on this without causing overlap or gaps on the picture. Processing is possible.
- different slice configuration identification information in the picture It may be configured to add identification information that enables selection at the picture level or sequence level whether or not to allow mixing of slices having a value of.
- the decoding device is a symbol used for the arithmetic code ⁇ when the C0, Cl, and C2 components in the macroblock are encoded using the adaptive arithmetic coding method. It is characterized by adaptively switching whether the occurrence probability and its learning process are shared by all components or separated for each component according to instruction information multiplexed in the bitstream.
- Embodiment 12 differs from Embodiment 11 only in the processing in variable-length encoding unit 11 in Fig. 30 in the encoding device, and in processing in variable-length decoding unit 25 in Fig. 31 in the decoding device. Action Corresponds to the eleventh embodiment.
- the arithmetic code and decoding processing which are the points of the twelfth embodiment, will be described in detail.
- FIG. 54 shows an internal configuration related to arithmetic code key processing in the variable length code key unit 11, and FIGS. 55 and 56 show an operation flow thereof.
- the variable length code key unit 11 in the present embodiment 12 includes a motion vector 137, reference image identification number 132b, macro block type Z sub-macro block type 106, and intra prediction mode, which are code target data. 141, a context model determination unit 1 la that defines a context model (described later) defined for each data type, such as a quantized transform coefficient 10, and the binary key rule defined for each target data type Binary part l lb to convert multi-value data into binary data, occurrence probability generation part l lc that gives the probability of occurrence of each bin value (0 or 1) after binary ⁇ , generated It consists of a code part l ld for executing an arithmetic sign ⁇ based on the occurrence probability and a memory l lg for storing occurrence probability information.
- the input to the context model determination unit 11a is a variable length code such as motion vector 137, reference image identification number 132b, macro block type / sub macro block type 106, intra prediction mode 141, quantized transform coefficient 10, etc. It is various data input as encoding target data to the frame 11, and the output from the encoding unit id corresponds to information related to the macroblock of the video stream 22.
- the context model is a model of the dependency relationship with other information that causes fluctuations in the occurrence probability of the source symbol.By switching the state of the occurrence probability according to this dependency relationship, the symbol It is possible to perform a sign ⁇ adapted to the actual occurrence probability.
- Figure 57 shows the concept of the context model (ctx).
- the information source symbol may be a multi-valued power multi-value.
- the ctx options 0 to 2 in Fig. 57 are defined assuming that the state of the occurrence probability of the information source symbol using this ctx will change depending on the situation.
- the value of ctx is switched according to the dependency between the encoded data in a certain macroblock and the encoded data of the surrounding macroblocks.
- D. Marpe et al. “Video An example of a context model related to motion vectorization of a macroblock disclosed in Compression Using Context-Based Adaptive Arithmetic Coding J, International Conference on Image Processing 2001 is shown.
- the motion vector of block C is the encoding target (more precisely, the prediction difference value mvd (C
- Motion vector prediction difference value, mvd (B) is the motion vector prediction difference in block B
- the evaluation value e (C) indicates the degree of variation in nearby motion vectors.
- Mvd (C) is small when the variation of the value is small, and mvd (C) is large when e (C) is large.
- This variation set of occurrence probabilities is a context model, and in this case, there are three types of occurrence probability variations.
- a context model is defined in advance for each of the encoding target data such as the macroblock type Z sub-macroblock type 106, the intra prediction mode 141, and the quantized transform coefficient 10, and the encoding device and the decoding are performed. Shared on the device.
- the context model determination unit 1 la performs a process of selecting a model determined in advance based on the type of data to be encoded (which occurrence probability variation in the context model is selected). Whether to select it corresponds to the occurrence probability generation process of (3) below).
- the context model is defined according to each bin (binary position) of the binary sequence by converting the target data of the code into binary sequences at the binary part l ib.
- conversion to a binary sequence of variable length is performed in accordance with a rough distribution of values that can be taken by each encoded data.
- Binary key can reduce the number of probability linear divisions by simplifying the calculation by signing in bin units rather than arithmetic code as it is as it is. There are advantages such as the slimness of the content model.
- step S162 in FIG. 55 (details of step S162 are shown in FIG. 56))
- binarization of multi-value encoding target data is performed.
- the setting of the context model to be applied to each bin is completed, and the sign is ready.
- occurrence probability generator 11c The generation process of the occurrence probability state used for the arithmetic code ⁇ is performed at. Since each context model includes a notional occurrence probability for each value of 0/1, processing is performed with reference to the context model 1 If determined in step S160 as shown in FIG. . Establish an evaluation value for selecting the occurrence probability as shown in e (C) of Fig. 58, and
- variable length code key unit 11 in the twelfth embodiment includes the occurrence probability information storage memory l lg and stores the occurrence probability states l lh that are sequentially updated in the encoding process for each color component. Provide mechanism. Depending on the value of the occurrence probability state parameter common identification flag 143, the occurrence probability generation unit 11c selects the occurrence probability state l lh used for the current encoding from those held for each color component of C0 to C2. Then, it is selected whether to share the CO component for Cl and C2, and the occurrence probability state l lh actually used for encoding is determined (S162b to S162d in FIG. 56).
- the occurrence probability state parameter common identification flag 143 needs to be multiplexed into a bit stream so that the decoding device can perform the same selection.
- This configuration has the following effects. For example, referring to FIG. 58, if the macroblock header common identification flag 123c indicates that the CO component header information 139a is also used for other components, the macroblock type 128b indicates the 16x16 prediction mode. For example, e (C) in Figure 58 is
- the occurrence probability state prepared for the CO component is always used.
- the macroblock header common identification flag 123c indicates that the header information (139a to 139c) corresponding to each component is used
- the macroblock type 128b force C0, Cl, C2 is set to 16x16 prediction mode. If shown, e (C) in Figure 58 can have three variations per macroblock. The encoding part of the latter stage l id is k
- Two options are possible: In the former, when the C0, Cl, and C2 components have almost the same motion vector distribution, the occurrence probability state l lh is used in common. There is a possibility that the probability can be learned. On the contrary, if the C0, Cl, and C2 components have different motion vector distributions, the latter is The occurrence probability state l lh can be used and updated individually to reduce mismatches due to learning, and there is a possibility that the occurrence probability of motion vectors can be better learned. Since the video signal is non-stationary, the efficiency of the arithmetic code can be improved by enabling such adaptive control.
- Step S163 in FIG. 55 The actual encoded value (0 or l) l le is fed back to the occurrence probability generation unit 11c, and 0/1 occurrence frequency is counted to update the occurrence probability state l lh used (step S164). For example, when 100 bins are processed using a certain occurrence probability state l lh, the occurrence probability power of 0/1 in the occurrence probability variation is 25, 0.75.
- the code key value l ie is output from the variable length code key unit 11 and is output as a video stream 22 from the code key device.
- FIG. 59 shows an internal configuration related to arithmetic decoding processing in the variable-length decoding unit 25, and FIG. 60 shows its operation flow.
- variable length decoding unit 25 includes a motion vector 137, a reference image identification number 132b, a macroblock type Z sub-macroblock type 106, an intra prediction mode 141, a quantized transform coefficient 10, etc.
- Context model determination unit l la which determines the type of individual decoding target data and defines a context model that is defined in common with the encoding device, and is defined based on the type of decoding target data.
- a binarization part l lb that generates a rule an occurrence probability generator l lc that gives the occurrence probability of each bin (0 or 1) according to the binarization rule and the context model, arithmetic decoding based on the generated occurrence probability From the binary sequence obtained as a result and the above binary rule, the motion vector 137 ⁇ Reference image identification number 132b, macroblock type Z sub-macroblock type 106 , Intra prediction mode 141, decoding unit 25a for decoding data such as quantized transform coefficient 10, occurrence probability It consists of 1 lg of memory for storing information.
- 1 la ⁇ : L lc and 1 lg are the same as the internal components of the variable-length code section 11 in FIG.
- the decoding unit 25a restores the value of bin according to a predetermined arithmetic decoding process (step S166 in FIG. 60).
- the bin restoration value 25b is fed back to the occurrence probability generation unit 11c, and the occurrence frequency 0/1 is counted to update the occurrence probability state l lh used (step S164).
- the decryption unit 25a confirms a match with the binary sequence pattern defined by the binarization rule, and outputs the data value indicated by the matched pattern as a decoded data value. (Step S167). Unless the decrypted data is confirmed, the process returns to step S166 and the decryption process is continued.
- the decoding apparatus adaptively computes coding information for each color component according to the macroblock header common identification flag 123c. In the case of encoding, more efficient encoding is possible.
- the unit for multiplexing the occurrence probability state parameter common identification flag 143 may be any of a macroblock unit, a slice unit, a picture unit, and a sequence unit, as specifically illustrated. If sufficient code efficiency is ensured by switching at the upper layer above the slice by multiplexing as a flag located in the upper data layer such as slice, picture, sequence, etc., the macro block It is possible to reduce overhead bits without multiplexing the occurrence probability state parameter common identification flag 143 at each level.
- the occurrence probability state parameter common identification flag 143 may be information determined inside the decoding apparatus based on related information included in a bit stream different from itself.
- the macroblock header common identification flag 123c is set to When arithmetic codes are entered in lock units, the model shown in Fig. 61 is used for the context model 1 If. In FIG. 61, the value of the macroblock header common identification flag 123c in macroblock X is IDC. Common macroblock header in macroblock C
- the code for the identification flag 123c When the code for the identification flag 123c is used, the value IDC of the macroblock header common identification flag 123c for macroblock A and the macroblock header common identification for macroblock B are used.
- the header information of FIG. 50 included in the macroblock header (macroblock type, sub macroblock type, intra prediction mode, reference image identification number, motion vector, transform coefficient valid / invalid instruction information,
- the arithmetic code is used in the context model defined for each information type, and as shown in FIG. , B is defined with reference to the corresponding information.
- the macro block C is in the “use common macro block header in C0, Cl, C2” mode
- the macro block B is in the “individual C0, Cl, C2”
- C0, Cl, and C2 are used as reference information in the context model definition. Information on a specific color component is used.
- macroblock C is in the “use separate macroblock headers for C0, Cl, C2” mode
- macroblock B is “ If there is a ⁇ Use common macroblock header for C0, Cl, C2 '' mode, the header information of the three color components must be encoded and decoded in the macroblock.
- the header information common to the three components is used as the same value for the three components.
- the macroblock header common identification flag 123c shows the same value for all of the obvious 1S macroblocks A, B, and C, the corresponding reference information always exists, so use them. .
- arithmetic decoding is performed by defining a context model in the same procedure on both the encoding side and the decoding side.
- the occurrence probability state associated with the context model is updated based on the state of the occurrence probability state parameter common identification flag 143. And execute.
- arithmetic coding according to the occurrence probability distribution of the data to be encoded is also performed on the transform coefficient data of the C0, Cl, and C2 components. Regardless of whether the macroblock header is shared or not, these data always contain 3 component code data in the bitstream.
- intra prediction and inter prediction are performed on the color space of the encoded input signal to obtain a prediction difference signal, so that the distribution of transform coefficient data obtained by integer conversion of the prediction difference signal is performed.
- the occurrence probability distribution is considered to be the same regardless of the surrounding state where the macro block header is not shared as shown in Fig. 62. Therefore, in Embodiment 12, for each component of C0, Cl, and C2, a ma Regardless of whether the black block header is shared or not, a common context model is defined and used for encoding and decoding.
- the encoding apparatus according to Embodiment 13 is a video decoding apparatus that performs color space conversion processing at the input stage of the encoding apparatus described in Embodiments 7 to 12 and that is input to the encoding apparatus after imaging.
- An encoding device that multiplexes information specifying reverse conversion processing for converting the signal color space into an arbitrary color space suitable for encoding and returning it to the color space at the time of imaging on the decoding side in a bit stream
- the information specifying the inverse conversion process is also extracted with the bitstream power, and the decoded image is obtained by the decoding device described in Embodiments 7 to 12, and then the inverse color space is based on the information specifying the inverse conversion process. It is characterized by a configuration that performs conversion.
- FIG. 63 shows the configuration of the decoding apparatus according to the thirteenth embodiment. With reference to FIG. 63, the coding apparatus according to the thirteenth embodiment will be described.
- the encoding device of the thirteenth embodiment is provided with a color space conversion unit 301 in front of the encoding device 303 of the seventh to twelfth embodiments.
- the color space conversion unit 301 includes one or a plurality of color space conversion processes, and selects a color space conversion process to be used according to the characteristics of the input video signal or system settings, and performs color space conversion processing on the input video signal
- the converted video signal 302 obtained as a result is sent to the encoding device 303.
- information for identifying the color space conversion processing used at the same time is output to the encoding device 303 as color space conversion method identification information 304.
- the encoding device 303 multiplexes the color space conversion method identification information 304 on the bit stream 305 compressed and encoded by the method shown in Embodiments 7 to 12 using the converted video signal 302 as the encoding target signal. Send to the transmission line or record to the recording media. Output to the recording device.
- the prepared color space conversion method is, for example, a conversion from RGB to YUV used in the conventional standard.
- the input to the color space conversion unit 301 is not necessarily limited to RGB, and the conversion process is not limited to the above three types.
- the decoding apparatus includes an inverse color space conversion unit 308 in the subsequent stage in addition to the decoding apparatuses 306 according to the seventh to twelfth embodiments.
- Decoding device 306 receives bit stream 305 as input, extracts color space conversion method identification information 304 from bit stream 305 and outputs it, and also outputs decoded image 307 obtained by the operation of the decoding device described in Embodiments 7 to 12. Output.
- the inverse color space conversion unit 308 includes an inverse conversion process corresponding to each of the color space conversion methods that can be selected by the color space conversion unit 301, and color space conversion method identification information 304 output from the decoding device 306. Based on the above, the conversion executed by the color space conversion unit 301 is specified, the inverse conversion process is performed on the decoded image 307, and the image is returned to the color space of the input video signal for the coding apparatus of the thirteenth embodiment. Process.
- the encoding apparatus and decoding apparatus as in the thirteenth embodiment, optimal color space conversion processing is performed on the video signal encoded in the preceding stage of encoding and the subsequent stage of decoding processing. Therefore, the correlation included in the image signal composed of the three color component forces is removed before the sign, It is possible to perform signing in a state where redundancy is reduced, and it is possible to increase compression efficiency.
- the color space of the signal to be encoded is limited to one type of YUV.
- the color space conversion unit 301 and the inverse color space conversion unit 308 are provided, and the color space conversion is performed.
- the method identification information 304 By including the method identification information 304 in the bitstream 305, it is possible to remove restrictions on the color space of the input video signal, and to perform the optimal conversion from multiple types of means for removing the correlation between the color components. It is possible to sign by using.
- FIG. 64 shows an encoding apparatus configured as described above
- FIG. 65 shows a decoding apparatus.
- 64 includes a transform unit 310 instead of the orthogonal transform unit 8 and an inverse transform unit 312 instead of the inverse orthogonal transform unit 13.
- Inverter 312 is provided.
- the conversion unit 310 applies a plurality of C0, Cl, and C2 component prediction difference signals 4 output from the sign key mode determination unit 5 as shown in the processing of the color space conversion unit 301.
- Color space conversion is first executed by selecting an optimal conversion process from among the color space conversion processes. After that, the transformation corresponding to the orthogonal transformation unit 8 is performed on the result of the color space transformation.
- the color space conversion method identification information 311 indicating which conversion is selected is sent to the variable length encoding unit 11, multiplexed into a bit stream, and output as a video stream 22.
- the inverse conversion unit 312 first performs inverse conversion equivalent to the inverse orthogonal conversion unit 13 and then executes the inverse color space conversion process using the color space conversion process specified by the color space conversion method identification information 311. .
- variable length decoding unit 25 extracts the color space conversion method identification information 311 from the bitstream, and sends the result to the inverse conversion unit 312 so that the inverse conversion unit in the encoding device described above The same processing as 312 is performed. With this configuration, it remains between the color components. If the correlation is sufficiently removed in the prediction difference region, it can be executed as part of the encoding process, which has the effect of increasing the efficiency of the code. However, when individual macroblock headers are used for the C0, Cl, and C2 components, the prediction method can change for each component, such as intra prediction for the CO component and inter prediction for the C1 component. Correlation in the region of the measurement difference signal 4 may be difficult to maintain.
- the conversion unit 310 and the inverse conversion unit 312 may be operated so as not to perform color space conversion, and the prediction difference It may be configured to multiplex in the bit stream as identification information whether or not it is capable of performing color space conversion in the region of signal 4.
- the color space conversion method identification information 311 may be switched in units of deviation of sequence, picture, slice, and macroblock.
- the conversion coefficient data of the C0, Cl, and C2 components is the signal definition domain of the encoding target signal according to the color space conversion method identification information 311. The inn will be different. Therefore, it is generally considered that the distribution of the conversion coefficient data becomes a different occurrence probability distribution according to the color space conversion method identification information 311. Therefore, when configuring the coding device and the decoding device as shown in FIGS. 64 and 65, there is an individual occurrence probability state for each state of the color space conversion method identification information 311 for each component of C0, Cl, and C2. Encoding / decoding is performed using the associated context model.
- arithmetic decoding is performed by defining a context model in the same procedure on both the decoding side and the decoding side.
- the occurrence probability state associated with the context model is updated based on the state of the occurrence probability state parameter common identification flag 143. And execute.
- FIG. 1, FIG. 2, FIG. 30, FIG. an input video signal consisting of three color components is input to the encoding device at once, and the three color components are shared in the device.
- Encoding is performed while selecting whether to encode based on the prediction mode or macroblock header of each, or encoding based on the individual prediction mode or macroblock header, and the resulting bit stream is A bit stream that is input to the decoding device and indicates whether the three color components are encoded based on the prediction mode or macroblock header or the individual prediction mode or macroblock header.
- the above flag has already stated that it may be encoded and decoded in units of arbitrary data layers such as macroblocks, slices, pictures, sequences, etc.
- three color component signals are shared.
- the macroblock header according to the fourteenth embodiment includes a transform block size identification flag as shown in Fig. 15, a code block such as a macroblock type ⁇ sub-macroblock type 'intra prediction mode as shown in Fig. 50 ⁇ a prediction mode It includes macroblock overhead information other than conversion coefficient data, such as information, motion prediction information such as reference image identification number / motion vector, transform coefficient valid / invalid instruction information, and quantization parameter for the transform coefficient.
- the process of encoding the three color component signals of one frame with a common macroblock header is “common encoding processing”, and the three color component signals of one frame are encoded with individual independent macroblock headers.
- the encoding process is referred to as “independent encoding process”.
- the process of decoding the bitstream power frame image data in which the three color component signals of one frame are encoded by a common macroblock header is called “common decoding processing”, and the three color component signals of one frame
- the process of decoding the frame image data with the bitstream power encoded by the individual independent macroblock header is referred to as “independent decoding process”.
- common code processing in the fourteenth embodiment as shown in FIG.
- an input video signal for one frame is divided into macroblocks in which three color components are combined.
- the independent encoding process is shown in FIG. In this way, the input video signal for one frame is divided into three color components, and these are divided into macro blocks having a single color component.
- the macroblock that is subject to the common code processing is a force-independent coding process that includes samples of the three color components C0, Cl, and C2. Any of these forces include only one component sample.
- FIG. 68 is an explanatory diagram showing the temporal motion prediction reference relationship between pictures in the encoding device * decoding device according to the fourteenth embodiment.
- the data unit indicated by the thick vertical bar is a picture
- the relationship between the picture and the access unit is indicated by a surrounding dotted line.
- one picture is data representing a video signal for one frame in which three color components are mixed.
- one picture is one The video signal for one frame of the color component.
- An access unit is the smallest data unit that gives a time stamp for the purpose of synchronizing audio and other information to the video signal.
- one access unit has one picture. Minute data is included (427a in Figure 68).
- three pictures are included in one access unit (427b in FIG. 68). This is because, in the case of independent encoding and decoding processing, a reproduced video signal for one frame is obtained starting with all the pictures of the same display time for all three color components. Note that the number given to the top of each picture indicates the temporal encoding / decoding order of the picture (frame_num of AVC). In FIG. 68, an arrow between pictures indicates a reference direction for motion prediction.
- an IDR (instantaneous decoder refresh) picture is defined that performs intra coding and resets the contents of the reference image memory used for motion compensation prediction. Because IDR pictures can be decoded without depending on other pictures Used as a random access point.
- the identification information indicating whether the encoding by the common encoding process or the encoding by the independent encoding process has been performed is referred to as the common encoding and the independent encoding identification signal in the fourteenth embodiment. Call.
- FIG. 69 is an explanatory diagram showing an example of the structure of a bitstream that is generated by the encoding device according to the fourteenth embodiment and is the target of input ′ decoding processing by the decoding device according to the fourteenth embodiment. .
- FIG. 69 shows the bit stream configuration from the sequence to the frame level.
- the common encoding 'independent encoding identification signal 423 is multiplexed on the sequence level upper header (sequence parameter set in the case of AVC). Keep it.
- Individual frames are encoded in units of access units.
- AUD is an Access Unit Delimiter NAL unit that is a unique NAL unit for identifying breaks in access units in AVC.
- the access unit includes coded data for one picture.
- the picture at this time is assumed to be data representing a video signal for one frame in which three color components are mixed as described above.
- the encoded data of the i-th access unit is configured as a set of slice data Slice (U). j is the index of slice data in one picture.
- one picture is a video signal for one frame of any one color component.
- the encoded data of the pth access unit is configured as a set of slice data Slice (p, q, r) of the qth picture in the access unit.
- r is the index of slice data in one picture.
- the number of values that q can take is three.
- additional data such as transparency information for alpha blending is also available.
- q takes Set the number of values to be obtained to be 4 or more. Since the encoding device and the decoding device according to the fourteenth embodiment select the independent encoding process, each color component constituting the video signal is encoded completely independently. Therefore, the encoding / decoding process is performed in principle. Changing The number of color components can be changed freely. In the future, even if the signal format for color representation of the video signal is changed, there is an effect that can be handled by the independent code processing in the fourteenth embodiment.
- the common coding 'independent coding identification signal 423 is included in one access unit, and each does not perform motion prediction reference to each other. It is expressed in the form of “the number of independently encoded pictures”.
- the decoding device decodes and refers to num_pictures_in_au, so that it can be distinguished in one access unit as much as possible by distinguishing between code data by common code process and code data by independent code process. It is possible to know how many single-color component pictures exist at the same time, and to support the expansion of color representation of future video signals, while seamlessly performing common encoding processing and independent encoding processing in the bitstream. It is possible to handle.
- Fig. 70 is an explanatory diagram showing a bit stream configuration of slice data in each of the common encoding process and the independent code encoding process.
- the slice data received by the decoding device can be identified as to which color component picture in the access unit the slice belongs to.
- the color component identification flag (color_channel_idc) is assigned to the header area at the beginning of the slice data.
- color_channel_idc groups slices with the same value. That is, between slices with different values of color.chann eljdc, any encoding'decoding dependency (e.g. motion estimation Reference, CABAC context modeling, occurrence probability learning, etc.).
- frame_num encoding of picture to which a slice belongs'decoding processing order
- multiplexed in each slice header is set to the same value in all color component pictures in one access unit.
- FIG. 71 is an explanatory diagram showing a schematic configuration of the coding apparatus according to the fourteenth embodiment.
- common code key processing is executed in the first picture code key unit 503a, and independent coding processing is performed in the second picture coding units 503bO, 503b 1, and 503b2 (for three color components). It is executed in preparation.
- the input video signal 1 is supplied by the switch (SW) 501 to the first picture code section 503a, or one of the color component separation section 502 and the second picture code sections 503bO to 503b2.
- the switch 501 is driven by the common encoding / independent encoding identification signal 423 and supplies the input video signal 1 to the designated path.
- the key identification signal 423 needs to be multiplexed in the bit stream as information specifying it. Therefore, the common encoding / independent encoding identification signal 423 is input to the multiplexing unit 504.
- the multiplexing unit of this common coding 'independent coding identification signal 423 can be any higher layer than a picture, such as a GOP (group' of 'picture) unit consisting of several picture groups in a sequence. Even a unit like this! /.
- the first picture code section 503a divides the input video signal 1 into macroblocks in which three color component samples are combined as shown in FIG. Then, the sign key process is advanced in that unit.
- the encoding process in the first picture code unit 503a will be described later.
- Input video signal 1 when independent encoding processing is selected Is separated into data for one frame of C0, Cl, and C2 by the color component separation unit 502, and supplied to the corresponding second picture code unit 503b0 to 503b2.
- the second picture coding units 503bO to 503b2 the signal for one frame separated for each color component is divided into macro blocks of the format shown in FIG. The encoding process in the second picture encoding unit will be described later.
- the video signal for one picture having three color component powers is input to the first picture code part 503a, and the code data is output as a video stream 422a.
- a video signal for one picture having a single color component power is input to the second picture code sections 503bO to 503b2, and the encoded data is output as video streams 422b0 to 422b2.
- These video streams are multiplexed in the format of the video stream 422c by the multiplexing unit 504 based on the state of the common encoding / independent encoding identification signal 423, and output.
- the multiplexing order and transmission order in the bit stream of the slice data are determined according to the picture (each color in the access unit).
- the components can be interleaved (Fig. 72).
- a color component identification flag multiplexed as shown in FIG. 70 is used in the header area at the beginning of the slice data.
- the encoder apparatus can convert the pictures of the three color components into independent second picture encoding units 503bO to 503b2 as in the encoder apparatus of FIG.
- the code data is immediately Can be sent out.
- AVC one picture can be divided into multiple slice data and encoded, and the slice data length and the number of macroblocks included in the slice can be flexibly changed according to the code condition. Can do.
- the encoding apparatus can reduce the transmission buffer size required for transmission, that is, the processing delay on the encoding apparatus side. This is shown in FIG. If multiplexing of slice data is not allowed for a picture, the encoding device buffers the code data of another picture until the code of the picture of a specific color component is completed. It is necessary to let This means that a delay at the picture level will occur. On the other hand, as shown in the lowermost part of FIG. 72, if interleaving is possible at the slice level, the picture code part of a specific color component can output the code data to the multiplexing part in units of slice data. And delay can be suppressed.
- the slice data included in the picture may be transmitted in the raster scan order of the macroblock, or interleaved transmission is also possible within one picture. You can configure it.
- FIG. 73 shows an internal configuration of the first picture code key unit 503a.
- input video signal 1 is input in a 4: 4: 4 format and in units of macroblocks in which three color components in the format of FIG. 66 are combined.
- the prediction unit 461 selects a reference image from the motion compensated prediction reference image data stored in the memory 16a, and the motion compensation prediction process is performed in units of the macroblock.
- the memory 16a stores a plurality of reference image data composed of three color components over a plurality of times, and the prediction unit 461 selects an optimal reference image in units of these medium power macroblocks.
- the arrangement of the reference image data in the memory 16a may be stored in a frame sequential manner for each color component, or a sample of each color component may be stored in a dot sequential manner.
- Prediction unit 461 performs macroblocks for all or some of the block sizes and subblock sizes shown in Fig. 32, the motion vector of a predetermined search range, and one or more available reference images.
- a motion compensation prediction process is executed, and a prediction difference signal 4 for each block serving as a motion compensation prediction unit is obtained from the motion vector information, the reference image identification number 463 used for prediction, and the subtractor 3.
- the prediction difference signal 4 is evaluated for its prediction efficiency by the sign key mode determination unit 5, and from among the prediction processes executed by the prediction unit 461, a macroblock that can obtain the optimal prediction efficiency for the macroblock to be predicted is obtained.
- Type Z sub-macroblock type 106 and motion vector information ⁇ Reference image identification number 463 is output.
- Macroblock header information such as macroblock type, submacroblock type, reference image index, motion vector, etc. are all determined as common header information for the three color components, used for sign, bit Multiplexed into a stream.
- evaluating the optimality of the prediction efficiency only the prediction error amount for a given color component (for example, the G component in RGB, the Y component in YU V, etc.) is evaluated for the purpose of suppressing the amount of computation. Alternatively, although the calculation amount is large, the prediction error amount for all the color components may be comprehensively evaluated in order to obtain optimum prediction performance.
- a weighting factor 20 for each type determined by the judgment of the sign key control unit 19 may be added.
- the prediction unit 461 also performs intra prediction.
- the intra prediction mode information is output to the output signal 463.
- the output signal 463 collectively refers to intra prediction mode information, motion vector information, and reference image identification number as prediction overhead information.
- the prediction error amount for only a predetermined color component may be evaluated, or the prediction error amount for all color components may be comprehensively evaluated.
- the coding mode determination unit 5 predicts whether the macroblock type is the intra prediction or the inter prediction. Select by evaluating measurement efficiency or coding efficiency.
- the selected macroblock type Z sub-macroblock type 106 and the prediction difference signal 4 obtained by the intra prediction 'motion compensation prediction based on the prediction overhead information 463 are output to the conversion unit 310.
- the conversion unit 310 converts the input prediction difference signal 4 and outputs it to the quantization unit 9 as a conversion coefficient.
- the block size as a unit for conversion may be selected from 4x4 or 8x8 based on the displacement force!
- the quantization unit 9 quantizes the input conversion coefficient based on the quantization parameter 21 determined by the code control unit 19, and outputs the quantized conversion coefficient 10 to the variable length code unit 11. .
- the quantized transform coefficient 10 includes information for three color components, and is entropy-encoded by means such as a Huffman code or an arithmetic code in the variable length code section 11. Also, the quantized transform coefficient 10 is restored to the local decoded prediction difference signal 14 via the inverse quantization unit 12 and the inverse transform unit 312, and the selected macroblock type Z sub-macroblock type 106 and the prediction overhead information By adding the predicted image 7 generated based on 463 and the adder 18, a locally decoded image 15 is generated.
- the locally decoded image 15 is stored in the memory 16a for use in subsequent motion compensation prediction processing after the block distortion removal processing is performed by the deblocking filter 462.
- the variable length coding unit 11 also receives a deblocking filter control flag 24 indicating whether or not to apply a deblocking filter to the macroblock.
- Quantized transform coefficient 10, macroblock type Z sub-macroblock type 106, prediction overhead information 463, quantization parameter 21 input to variable length coding unit 11 are bits according to a predetermined rule (syntax).
- the data is arranged as a stream, and is output to the transmission buffer 17 as NAL-cut encoded data in which one or a plurality of macro blocks of the format shown in FIG.
- the bit stream is smoothed in accordance with the bandwidth of the transmission path to which the encoding device is connected and the reading speed of the recording medium, and output as a video stream 422a.
- all slice data in the sequence is CO, Cl, C2 mixed slices (that is, the common coding 'independent coding identification signal 423 (that is, Therefore, the color component identification flag is not multiplexed in the slice header.
- FIG. 74 shows the inner structure of the second picture code key 503bO (503bl, 503b2).
- the input video signal 1 is input in units of macroblocks that also have the sampling power of a single color component in the format of FIG.
- the prediction unit 461 selects a reference image from the motion compensated prediction reference image data stored in the memory 16b, and performs motion compensation prediction processing in units of the macroblock.
- the memory 16b can store a plurality of pieces of reference image data composed of a single color component over a plurality of times, and the prediction unit 461 selects an optimum reference image in units of macroblocks.
- the memory 16b may be shared with the memory 16a in units of three color components.
- the prediction unit 461 performs macroblocks for all or part of the block size / subblock size in FIG. 32, the motion vector of a predetermined search range, and one or more available reference images.
- a motion compensation prediction process is executed, and a prediction difference signal 4 for each block serving as a motion compensation prediction unit is obtained from the motion vector information, the reference image identification number 463 used for prediction, and the subtractor 3.
- the prediction difference signal 4 is evaluated for its prediction efficiency by the sign key mode determination unit 5, and from among the prediction processes executed by the prediction unit 461, a macroblock that can obtain the optimal prediction efficiency for the macroblock to be predicted is obtained.
- Type Z sub macroblock type 1 06 and motion vector information ⁇ Reference image identification number 463 is output.
- Macro block header information such as macro block type, sub macro block type, reference image index, and motion vector are all determined as header information for the single color component signal of input video signal 1 and used for encoding. Is multiplexed into the bitstream. In evaluating the optimality of the prediction efficiency, only the prediction error amount for the single color component to be encoded is evaluated. In selecting the final macroblock type Z sub-macroblock type 106, the weighting factor 20 for each type determined by the judgment of the sign key control unit 19 may be taken into account.
- the prediction unit 461 also executes intra prediction.
- the intra prediction mode information is output to the output signal 463.
- the output signal 463 collectively refers to intra prediction mode information, motion vector information, and reference image identification number as prediction overhead information. Even with intra prediction, only the prediction error amount for a single color component to be encoded is evaluated. Finally, the macro block type is selected based on the prediction efficiency or coding efficiency to determine whether it is intra prediction or inter prediction.
- the selected macroblock type Z sub-macroblock type 106 and the prediction difference signal 4 obtained from the prediction overhead information 463 are output to the conversion unit 310.
- the conversion unit 310 converts the input prediction difference signal 4 for a single color component, and outputs it to the quantization unit 9 as a conversion coefficient.
- the block size as a unit for conversion may be selected from 4x4 force or 8x8.
- the block size selected at the time of encoding is reflected in the value of the transform block size designation flag 464, and the flag is multiplexed into a bitstream.
- the quantization unit 9 quantizes the input transform coefficient based on the quantization parameter 21 determined by the sign key control unit 19, and the quantized transform coefficient 10 is variable length coding unit 11.
- the quantized transform coefficient 10 includes information for a single color component, and is encoded by the entangle py code by means such as a Huffman code or an arithmetic code in the variable length code section 11.
- the quantized transform coefficient 10 is restored to the local decoded prediction differential signal 14 via the inverse quantization unit 12 and the inverse transform unit 31 2, and the selected macroblock type Z sub-macroblock type 106 and the prediction overhead information 463 Forecast generated based on
- a locally decoded image 15 is generated.
- the local decoded image 15 is stored in the memory 16b for use in the subsequent motion compensation prediction process after the block distortion removal process is performed by the deblocking filter 462.
- the variable length coding unit 11 also receives a deblocking filter control flag 24 indicating whether or not to apply a deblocking filter to the macroblock.
- Quantized transform coefficient 10 input to variable length coding unit 11, macroblock type Z sub-macroblock type 106, prediction overhead information 463, quantization parameter 21 are bits according to a predetermined rule (syntax).
- the data is arranged as a stream, and is output to the transmission buffer 17 as encoded data that is NAL-cut in units of slice data in which one or a plurality of macroblocks in the format of FIG.
- the transmission buffer 17 smoothes the bit stream in accordance with the bandwidth of the transmission path to which the encoding device is connected and the reading speed of the recording medium, and outputs it as a video stream 422b0 (422bl, 422b2).
- a feed knock is applied to the encoding control unit 19 in accordance with the bit stream accumulation state in the transmission buffer 17 to control the amount of generated code in the code frame of subsequent video frames.
- the output of the second picture encoding units 503bO to 503b2 is a slice that has only the power of single color component data, and it is necessary to control the code amount in units in which access units are combined.
- the multiplexing unit 504 a common transmission buffer in a unit in which slices of all color components are multiplexed is provided, and the encoding control unit 19 for each color component is fed back based on the occupied amount of the same buffer. You may comprise. Also, at this time, only the amount of generated information of all color components may be used to perform the encoding control, and the encoding control may be performed in consideration of the state of the transmission buffer 17 of each color component. Again! ⁇ .
- the transmission buffer 17 is omitted by realizing the function equivalent to the transmission buffer 17 with the common transmission buffer in the multiplexing unit 504. It can also be taken.
- each of the second picture code key units 503bO to 503b2 can transmit the data from one transmission buffer 17 when one slice of data is accumulated without accumulating the output from the transmission buffer 17 for one picture.
- the common coding 'independently code I ⁇ another signal (num_pi C tur es _in_ a u ) distinguishes code I spoon data by independent code I spoon processing the code I spoon data by the common code I spoon treatment Information (common coding identification information) and information (number of color components) indicating how many single color component pictures exist in one access unit can be expressed simultaneously. It may be coded as independent information.
- the first picture code part 503a and the second picture code parts 503bO to 503b2 handle the macroblock header information as information common to the three components, or information on a single color component. The only difference is whether the bit stream structure of slice data is handled. Many of the basic processing blocks such as the prediction unit, transform unit, inverse transform unit, quantization unit, inverse quantization unit, and deblocking filter in Fig. 73 and Fig. 74 are processed together with the information of the three color components.
- the first picture coding unit 5003a and the second picture coding unit 503bO to 503b2 can be realized by a common functional block only by the difference of handling only information of a single color component. it can.
- the encoding apparatus uses a virtual stream buffer (encoded picture buffer) that buffers the video stream 422c according to the arrangement shown in Figs. ) And a virtual frame memory (decoded picture buffer) for buffering the decoded images 427a and 427b, it is assumed that there is an underflow or a failure of the decoded picture buffer.
- Video stream 422c is generated so that there is no such thing. This control is mainly performed by the sign key control unit 19.
- the decoding apparatus when the video stream 422c is decoded in accordance with the operation of the coded picture buffer and the decoded picture buffer (virtual buffer model), the decoding apparatus is not broken. Guarantee that it will not occur.
- the virtual buffer model is specified below.
- the operation of the coded picture buffer is performed in units of access units.
- one access unit when performing common decoding processing, one access unit includes code data for one picture, and when performing independent decoding processing, one access unit includes as many pictures as the number of color components (3 It contains 3 pictures of encoded data.
- the behavior specified for the sign-picture-noffer is that the time when the first and last bits of the access unit are input to the sign-picture picture and the bits of the access unit are also read. It's time. It should be noted that the reading of the sign key picture buffer power is performed instantaneously, and all bits of the access unit are read from the sign key picture buffer at the same time.
- the decoding process is performed in the first picture decoding unit or the second picture decoding unit, It is output as a color video frame bundled in units of access units.
- the process from reading the bit of the code buffer power to outputting it as a color video frame in units of access units is performed instantaneously in accordance with the virtual buffer model.
- Color video frames configured in units of access units are input to the decoded picture buffer, and the output time of the decoded picture buffer is calculated.
- the output time of the decoded picture buffer is a value obtained by adding a predetermined delay time to the read time of the encoded picture buffer.
- This delay time can be multiplexed with the bit stream to control the decoding apparatus.
- the delay time is 0, that is, when the output time from the decoded picture buffer is equal to the read time from the coded picture buffer, a color video frame is input to the decoded picture buffer and output from the decoded picture buffer at the same time. Is done. In other cases, that is, when the output time from the decoded picture buffer is later than the read time of the code-picture buffer, the color video frame is stored in the decoded picture buffer until the output time from the decoded picture buffer is reached. .
- the operation from the decoding picture buffer is defined for each access unit.
- FIG. 75 is an explanatory diagram showing a schematic configuration of the decoding apparatus according to the fourteenth embodiment.
- the common decoding process is executed in the first picture decoding unit 603a, and the independent decoding process is performed. Is executed by the color component determination unit 602 and the second picture decoding units 603b0, 603b1, and 603b2 (preparing three color components).
- the video stream 422c is divided into NAL units by the upper header analysis unit 610, and the upper header information such as the sequence parameter set and the picture parameter set is decoded as it is, and the first picture decoding unit in the decoding apparatus.
- the data is stored in a predetermined memory area that can be referred to by 603a, the color component determination unit 602, and the second picture decoding units 603b0 to 603b2.
- the common encoding 'independent code identification signal 423 (num_pictures_in_au) multiplexed in sequence units is held as a part of the higher-order header information'.
- the color component determination unit 602 Based on the value of the color component identification flag shown in FIG. 70, the color component determination unit 602 identifies which color component picture in the current access unit corresponds to the slice NAL unit, Distribution is supplied to the second picture decoding units 603b0 to 603b2. With such a decoding apparatus configuration, even if a bitstream in which slices are interleaved in the access unit and received as shown in FIG. 72 is received, it is easy to determine which slice belongs to which color component picture. This has the effect of discriminating and correctly decoding.
- FIG. 76 shows the internal configuration of the first picture decoding unit 603a.
- the first picture decoding unit 603a divides the video stream 442c output from the encoding device in FIG. 71 according to the arrangement in FIG. 69 and FIG. 70 into NAL units by the upper header analysis unit 610, and then C0, Cl and C2 mixed slices are received in units of slices, and decoding processing is performed in units of macroblocks consisting of the three color component samples shown in Fig. 66 to restore output video frames.
- the variable length decoding unit 25 receives the video stream 442c divided into NAL units as input, The video stream 442c is decoded according to a predetermined rule (syntax), and the quantized transform coefficient 10 for the three components and the macro block header information used in common for the three components (macro block type / sub macro block type) 106, prediction overhead information 463, transform block size designation flag 464, and quantization parameter 21) are extracted.
- the quantized transform coefficient 10 is input together with the quantization parameter 21 to the inverse quantization unit 12 that performs the same process as the first picture code unit 503a, and the inverse quantization process is performed.
- the output is input to the inverse transform unit 312 that performs the same processing as the first picture code key unit 503a, and is restored to the local decoded prediction difference signal 14 (the transform block size designation flag 464 is included in the video stream 422c). If it exists, it is referred to in the process of inverse quantization and inverse transform).
- the prediction unit 461 includes only the process of generating the predicted image 7 with reference to the prediction overhead information 463 out of the prediction units 461 in the first picture code encoding unit 503a.
- a macroblock type / sub-macroblock type 106 and prediction overhead information 463 are input to 461, and a predicted image 7 for three components is obtained.
- the prediction image 7 for three components is obtained from the prediction overhead information 463 according to the intra prediction mode information, and when indicating that it is macroblock type force inter prediction, the prediction overhead is indicated. From information 463, a predicted image 7 for three components is obtained according to the motion vector and the reference image index. The local decoded prediction difference signal 14 and the predicted image 7 are added by an adder 18 to obtain a provisional decoded image (local decoded image) 15 for three components. Since the provisional decoded image 15 is used for motion compensation prediction of the subsequent macroblock, the deblocking filter 462 that performs the same processing as the first picture code part 503a is applied to the provisional decoded image sample for three components.
- the decoded image 427a is output and stored in the memory 16a.
- the deblocking filter process is applied to the temporary decoded image 15 based on the instruction of the deblocking filter control flag 24 decoded by the variable length decoding unit 25.
- the memory 16a stores a plurality of pieces of reference image data composed of three color components over a plurality of times, and the prediction unit 461 extracts a reference image obtained by extracting the bitstream force in units of macroblocks from these. A reference image indicated by the index is selected to generate a predicted image.
- the arrangement of the reference image data in the memory 16a may be stored in a frame sequential manner for each color component, or each color component sample is dot-sequentially stored. May be stored.
- the decoded image 427a includes three color components and becomes a color video frame constituting the access unit 427a0 in the common decoding process as it is.
- FIG. 77 shows an internal configuration of second picture decoding sections 603b0 to 603b2.
- the second picture decoding units 603b0 to 603b2 are divided into units of NAL units by the high-order header analysis unit 610 in the video stream 442c force according to the arrangement of FIGS. 69 and 70 output from the encoding device of FIG.
- the CO or C1 or C2 slice NAL units distributed by the unit 602 are received in units of NAL units, and are decoded in units of macroblocks composed of single color component samples shown in FIG. 67 to restore output video frames.
- the variable length decoding unit 25 receives the video stream 422c, decodes the video stream 422c according to a predetermined rule (syntax), and converts it to a quantized transform coefficient 10 of a single color component and a single color component. Extract applicable macroblock header information (macroblock type Z sub-macroblock type 106, prediction overhead information 463, transform block size designation flag 464, quantization parameter 21). The quantized transform coefficient 10 is input together with the quantization parameter 21 to the inverse quantization unit 12 that performs the same processing as the second picture code encoding unit 503b0 (503bl, 503b2), and the inverse quantization process is performed.
- a predetermined rule syntax
- the output is input to the inverse transform unit 312 that performs the same processing as the second picture code unit 503b0 (503bl, 503b2), and is restored to the local decoded prediction difference signal 14 (the transform block size designation flag 464 is If it exists in the video stream 422c, it is referred to in the process of inverse quantization and inverse orthogonal transform).
- the prediction unit 461 includes only the process of generating the predicted image 7 with reference to the prediction overhead information 463 among the prediction units 461 in the second picture code key units 503b0 (503bl, 503b2), A macroblock type Z sub-macroblock type 106 and prediction overhead information 463 are input to the prediction unit 461, and a prediction image 7 having a single color component is obtained.
- the prediction image 7 of a single color component is obtained from the prediction overhead information 463 according to the intra prediction mode information, and when indicating that it is macro block type cauter prediction
- a prediction image 7 having a single color component is obtained from the prediction overhead information 463 according to the motion beta and the reference image index.
- the difference signal 14 and the predicted image 7 are added by an adder 18, and a single color component macroblock is added.
- Provisional decoded image 15 is obtained. Since the provisional decoded image 15 is used for subsequent motion compensation prediction of the macroblock, the deblocking filter 26 that performs the same processing as the second picture encoding unit 503bO (503bl, 503b2) converts the provisional decoded image sample into a single color component provisional decoded image sample. After block distortion removal processing is performed on the image, it is output as a decoded image 427b and stored in the memory 16b.
- the deblocking filter process is applied to the provisional decoded image 15 based on the instruction of the deblocking filter control flag 24 decoded by the variable length decoding unit 25.
- the decoded image 427b includes only a sample of a single color component, and the decoded image 427b that is the output of each of the second picture decoding units 603b0 to 603b2 to be processed in parallel in FIG. 75 is used as the unit of the access unit 427b0. By bundling, it is configured as a color video frame.
- the first picture decoding unit 603a and the second picture decoding unit 6 03b0 to 603b2 handle macroblock header information as information common to three components, or a single
- Many of the basic decoding processing blocks such as motion compensation prediction processing, inverse transform, and inverse quantization in Fig. 73 and Fig. 74 differ only in the difference in whether it is handled as color component information and the bit stream configuration of slice data.
- the configuration of the memory 16a and the memory 16b is changed to the first picture decoding unit 603a and the second picture decoding units 603b0 to 603b2. And can be shared.
- the decoding apparatus of FIG. 75 always fixes the common encoding 'independent code identification signal 423 to “independent code encoding process” as another form of the encoding apparatus of FIG.
- the switch 601 and the first coding are used as another form of the decoding apparatus of FIG. 75.
- the switch 601 and the first coding are used as a decoding device that only performs independent decoding without the picture decoding unit 603a Make up.
- the common encoding and independent code I ⁇ another signal (num_pi C tur es _in_ a u )
- the code I spoon distinguishing data information by independent code I spoon processing the code I spoon data by the common encoding processing (Common encoding identification information) and information (number of color components) indicating how many single color component pictures exist in one access unit, but the above two pieces of information are independent information It's a sign.
- the first picture decoding unit 603a is provided with a decoding function of an AVC high profile compliant bitstream that is encoded together for the conventional YUV4: 2: 0 format.
- the header analysis unit 610 refers to the profile identifier decoded from the video stream 422c to determine in which format the bit stream is encoded, and the determination result is the common encoding 'independent encoding identification signal 423. If it is configured to transmit to the switch 601 and the first picture decoding unit 603a as part of the signal line information, a decoding device that ensures compatibility with the conventional YUV4: 2: 0 format bit stream is configured. You can
- the prediction error signal may be configured to be subjected to color space conversion processing as described in the thirteenth embodiment.
- FIG. 78 shows an example in which the color space conversion process is performed at the pixel level before the conversion process.
- the color space conversion unit 465 is set before the conversion unit 310, and the reverse color space conversion unit 466 is set at the reverse conversion unit 312.
- FIG. 79 shows an example in which the color space conversion process is performed while appropriately selecting the frequency component to be processed for the coefficient data obtained after the conversion process.
- the color space conversion unit 465 is converted into the conversion unit 310.
- the inverse color space conversion unit 466 is arranged before the inverse conversion unit 312.
- a plurality of conversion methods as described in Embodiment 13 may be used by switching in units of macroblocks according to the property of the image signal to be encoded. You may comprise so that the presence or absence of conversion may be determined by the unit of a macroblock.
- the type of conversion method that can be selected can be specified at the sequence level, and it can be configured to specify which one of them is selected in units of pictures, slices, and macroblocks. Further, it may be configured to select whether to perform before or after orthogonal transformation.
- the code key mode determination unit 5 evaluates the code key efficiency for all selectable options and selects the one with the highest code key efficiency. Can be configured.
- signaling information 467 for determining selection at the time of encoding on the decoding side is multiplexed into the bit stream.
- Such signaling may be specified at a different level than macroblocks, such as slices, pictures, GOPs, and sequences.
- FIG. 80 is a decoding device that decodes a bitstream that has been encoded by color space conversion before the conversion processing by the encoding device of FIG.
- the variable length decoding unit 25 is information indicating whether or not to perform conversion in the inverse color space conversion unit 466 from the bit stream, and information for selecting a conversion method executable in the inverse color space conversion unit 466.
- the signal information 467 is decoded and supplied to the inverse color space conversion unit 466.
- the inverse color space conversion unit 466 performs color space conversion processing on the prediction error signal after inverse conversion based on these pieces of information.
- variable-length decoding unit uses the bit stream to determine whether or not to perform the conversion in the inverse color space conversion unit 466, information for selecting the conversion method to be executed in the inverse color space conversion unit, Signaling information 467, which is identification information including information specifying a frequency component to be subjected to color space conversion, is decoded and supplied to the inverse color space conversion unit 466.
- the decoding device in FIG. 81 uses inverse color space conversion. In section 466, color space conversion processing is performed on the conversion coefficient data after inverse quantization based on such information.
- the decoding devices in Fig. 80 and Fig. 81 have the AVC encoded by the first picture decoding unit 603a in which the three components are encoded for the conventional YUV4: 2: 0 format. It has a bitstream decoding function compliant with the noise profile, and an upper header analysis unit
- the profile identifier to be decoded from the video stream 422c is referred to, it is determined whether it is a bit stream encoded in a different format, and the determination result is determined as the signal line information of the common code ⁇ 'independent code ⁇ identification signal 423. If it is configured to transmit to the switch 601 and the first picture decoding unit 603a as part of the above, it is possible to configure a decoding apparatus that ensures compatibility with a conventional YUV4: 2: 0 format bitstream.
- Fig. 82 shows the structure of the encoded data of the macroblock header information included in the conventional YUV4: 2: 0 format bit stream.
- the only difference from the Cn component header information shown in FIG. 50 is that the encoded data of the intra color difference prediction mode 144 is included in the macroblock type power S intra prediction.
- the structure of the code key data of the macroblock header information is the same as the Cn component header information shown in FIG. 50, but the reference image included in the macroblock header information is Using the identification number and the motion vector information, a motion vector of the color difference component is generated in a method different from the luminance component.
- variable length decoding unit 25 of the first picture decoding unit equipped with the conventional YUV4: 2: 0 format bitstream decoding function will be described.
- the color difference format instruction flag is decoded.
- the color difference format indication flag is included in the sequence parameter header of the video stream 422c, and the input video format is 4: 4: 4 force, 4: 2: 2 force, 4: 2: 0 force, 4: 0: 0 force. !, A flag indicating the format of either.
- Decoding process of macroblock header information of video stream 422c is color Switching is performed according to the value of the difference format instruction flag.
- the intra color difference prediction mode 144 is also decoded by the bitstream monitor.
- the color difference format indication flag power indicates 4: 4
- decoding of the intra color difference prediction mode 144 is skipped.
- the input video signal is a format composed only of luminance signals (4: 0: 0 format), so decoding of the intra color difference prediction mode 144 is skipped.
- the decoding process of macro block header information other than intra color difference prediction mode 144 is the same as the variable length decoding unit of the first picture decoding unit 603a that does not have the conventional YUV4: 2: 0 format bitstream decoding function.
- the prediction unit 461 receives a color difference instruction format instruction flag (not shown) and prediction overhead information 463 to obtain a predicted image 7 for three components.
- FIG. 83 shows the internal configuration of the prediction unit 461 of the first picture decoding unit that ensures compatibility with the conventional YUV4: 2: 0 format bitstream, and its operation will be described.
- the switching unit 461 la determines the macroblock type, and when the macroblock type indicates intra prediction, the switching unit 461 lb determines the value of the color difference format instruction flag. If the value of the color difference format indication flag is 4: 2: 0 or 4: 2: 2 !, it indicates a deviation from the prediction overhead information 463 according to the intra prediction mode information and the intra color difference prediction mode information. Get a predicted image of 7 minutes.
- the luminance signal prediction image is generated by the luminance signal intra prediction unit 4612 according to the intra prediction mode information.
- the color difference signal two-component prediction image is generated by the color difference signal intra prediction unit 4613 that performs processing different from the luminance component in accordance with the intra color difference prediction mode information.
- the luminance signal intra prediction unit 4612 When the value of the color difference format instruction flag indicates 4: 4: 4, the luminance signal intra prediction unit 4612 generates the information according to the prediction image power intra prediction mode information of all three components. If the value of the color difference format indication flag indicates 4: 0: 0, the 4: 0: 0 format is the luminance signal (1 component). Therefore, only the predicted image of the luminance signal is generated by the luminance signal intra prediction unit 4612 according to the power S intra prediction mode information.
- the switching unit 4611c determines the value of the color difference format instruction flag. If the value of the color difference format instruction flag indicates 4: 2: 0 or 4: 2: 2 !, it indicates a deviation, the luminance signal inter prediction unit 4614 predicts the overhead information. From the motion vector and the reference image index, a predicted image is generated according to the predicted image generation method of the luminance signal defined by the AVC standard. For the color difference signal two-component prediction image, the color difference signal inter prediction unit 4615 generates a color difference motion vector by scaling the motion vector obtained from the prediction overhead information 463 based on the color difference format, and performs the prediction overhead.
- a predicted image is generated from the reference image indicated by the reference image index obtained from the reference information 463 according to the method defined by the AVC standard.
- the 4: 0: 0 format is composed only of luminance signals (one component), so only the predicted image of the luminance signal is a motion vector, It is generated by the luminance signal inter prediction unit 4614 according to the reference image index.
- a conventional YUV4: 2: 0 format color difference signal prediction image generation unit is provided, and bitstream power is predicted according to the value of the decoded color difference format instruction flag. Since the means used to generate the image are switched, it is possible to configure a decoding device that ensures compatibility with the conventional YUV4: 2: 0 format bitstream.
- the video stream 422c supplied to the decoding device in FIGS. 80 and 81 does not support color space conversion processing as in the decoding device in FIG. 75, and is a bit stream that can also be decoded by the decoding device. If information indicating whether or not is given in units such as a sequence parameter set, any of the decoding devices in FIG. 80, FIG. 81 and FIG. 75 can decode the bit stream according to the decoding performance of each bit stream. This has the effect of ensuring compatibility.
- the encoding device in the fifteenth embodiment multiplexes encoding data with the bit stream configuration shown in FIG.
- the AUD NAL unit includes information called primary_pic_type as its element. This shows the picture code type information when picture data in an access unit starting with an AUD NAL unit is encoded.
- Embodiment 15 of the present invention when performing independent coding of each color component picture, the remaining two color component pictures are added in the AUD NAL queue of FIG. 69 according to the value of num_pictures_in_au.
- the power to insert primary_pic_type as shown in the bitstream configuration in Fig. 84, the encoded data of each color component picture is configured to start from the NAL unit (Color Channel Delimiter) indicating the start of the color component picture.
- the CCD NAL unit is configured to include primary_pi C _type information of the corresponding picture.
- the color component identification flag (color_channel_idc) described in the above embodiment 14 is included in the CCD NAL unit, not in the slice header. To do so. As a result, the information of the color component identification flag that has been required to be multiplexed to each slice can be aggregated into data in units of pictures, so that the overhead information can be reduced.
- the CCD NAL unit configured as a byte sequence is detected and the color_channel_idc is verified only once per color component picture, the head of the color component picture can be found quickly without performing variable length decoding. On the decoding device side, each color component is restored. It is not necessary to verify the color_channel_idc in the slice header one by one in order to separate the NAL units that are subject to issue, and the data can be supplied smoothly to the second picture decoding unit.
- the effect of reducing the coding size and processing delay of the coding apparatus as described in FIG. 72 of Embodiment 14 is reduced, so that the color component identification flag is sliced. It may be configured to signal at a higher level (sequence or GOP) whether to multiplex in units or color component picture units.
- the encoding device can be flexibly implemented according to its usage.
- multiplexing of code data may be performed with the bit stream configuration shown in FIG. In FIG. 86, color_channel_idc and primary_pic_type included in the CCD NAL unit in FIG. 84 are included in each AUD.
- the bit stream configuration in the fifteenth embodiment is configured such that one (color component) picture is included in one access unit even in the case of independent encoding processing. Even with such a configuration, the overhead information can be reduced by collecting the information of the color component identification flag into data in units of pictures, and the AUD NAL unit configured as a byte string is detected and the color_channel dc is only set once per picture.
- the bit stream configuration in FIG. 86 can also be configured so that sequence numbers (coding in the time direction, decoding order, etc.) of each picture are added to the AUD.
- the decoding device can verify the decoding order of each picture (display order, color component attributes, IDR, etc.) without decoding any slice data, and perform bitstream level editing and special playback. It becomes possible to carry out efficiently.
- the conversion process and the inverse conversion process may be conversions that guarantee direct orthogonality such as DCT, or strictly like DCT. Quantization without orthogonal transformation / transformation that approximates orthogonality in combination with inverse quantization processing. Further, it may be configured such that the prediction error signal is encoded as pixel level information without performing conversion.
- the present invention can be applied to a digital image signal encoding apparatus and a digital image signal decoding apparatus used for an image compression encoding technique, a compressed image data transmission technique, and the like.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Color Television Systems (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (18)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2006800251408A CN101218830B (zh) | 2005-07-22 | 2006-06-16 | 图像编码装置和方法、以及图像解码装置和方法 |
BRPI0611672-8A BRPI0611672A2 (pt) | 2005-07-22 | 2006-06-16 | codificador e decodificador de imagem, mÉtodo de codificaÇço de imagem, programa de codificaÇço de imagem, meio de gravaÇço legÍvel por computador, mÉtodo de decodificaÇço de imagem, programa de decodificaÇço de imagem, e, corrente de bits codificada por imagem |
CA 2610276 CA2610276C (en) | 2005-07-22 | 2006-06-16 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
JP2006551668A JP4542107B2 (ja) | 2005-07-22 | 2006-06-16 | 画像復号装置及び画像復号方法 |
US11/912,680 US20090034856A1 (en) | 2005-07-22 | 2006-06-16 | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein |
EP06757388A EP1909508A4 (en) | 2005-07-22 | 2006-06-16 | Image Coding Device, Image Decoding Device, Image Coding Method, Image Decoding Method, Image Coding Program, Image Decoding Program, Computer Readable Recording Medium With Image Recoding Program Recorded Thereon, and Computer Readable Recording Medium With Image Image Coding Program Recorded Thereon |
KR1020117021863A KR101217400B1 (ko) | 2005-07-22 | 2006-06-16 | 화상 부호화 장치, 화상 복호 장치, 화상 부호화 방법 및 화상 복호 방법 |
US11/980,363 US20080130990A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/980,460 US8509551B2 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recording with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,445 US20080130988A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,714 US20080123947A1 (en) | 2005-07-22 | 2007-10-31 | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein |
US11/932,236 US20080130989A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/932,759 US20080123977A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,557 US20090123066A1 (en) | 2005-07-22 | 2007-10-31 | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein, |
US11/931,636 US20090034857A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,503 US8488889B2 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/932,465 US20080137744A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,756 US20080165849A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
Applications Claiming Priority (10)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2005-212601 | 2005-07-22 | ||
JP2005212601 | 2005-07-22 | ||
JP2005-294767 | 2005-10-07 | ||
JP2005294768 | 2005-10-07 | ||
JP2005-294768 | 2005-10-07 | ||
JP2005294767 | 2005-10-07 | ||
JP2005377638 | 2005-12-28 | ||
JP2005-377638 | 2005-12-28 | ||
JP2006-085210 | 2006-03-27 | ||
JP2006085210 | 2006-03-27 |
Related Child Applications (12)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/912,680 A-371-Of-International US20090034856A1 (en) | 2005-07-22 | 2006-06-16 | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein |
US11/931,714 Division US20080123947A1 (en) | 2005-07-22 | 2007-10-31 | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein |
US11/980,363 Division US20080130990A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,636 Division US20090034857A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,756 Division US20080165849A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/932,465 Division US20080137744A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/932,236 Division US20080130989A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,503 Division US8488889B2 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/980,460 Division US8509551B2 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recording with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,445 Division US20080130988A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/932,759 Division US20080123977A1 (en) | 2005-07-22 | 2007-10-31 | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program |
US11/931,557 Division US20090123066A1 (en) | 2005-07-22 | 2007-10-31 | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein, |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007010690A1 true WO2007010690A1 (ja) | 2007-01-25 |
Family
ID=37668575
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2006/312159 WO2007010690A1 (ja) | 2005-07-22 | 2006-06-16 | 画像符号化装置、画像復号装置、および画像符号化方法、画像復号方法、画像符号化プログラム、画像復号プログラム、ならびに画像符号化プログラムを記録したコンピュータ読み取り可能な記録媒体、画像復号プログラムを記録したコンピュータ読み取り可能な記録媒体 |
Country Status (10)
Country | Link |
---|---|
US (1) | US20090034856A1 (ja) |
EP (1) | EP1909508A4 (ja) |
JP (2) | JP4542107B2 (ja) |
KR (5) | KR100995226B1 (ja) |
CN (5) | CN102176754B (ja) |
BR (1) | BRPI0611672A2 (ja) |
CA (2) | CA2610276C (ja) |
HK (2) | HK1159913A1 (ja) |
RU (1) | RU2506714C1 (ja) |
WO (1) | WO2007010690A1 (ja) |
Cited By (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008126139A1 (ja) * | 2007-03-30 | 2008-10-23 | Fujitsu Limited | 画像データ圧縮装置及び復号装置 |
WO2008132791A1 (ja) * | 2007-04-13 | 2008-11-06 | Panasonic Corporation | 画像処理装置、集積回路及び画像処理方法 |
WO2009001864A1 (ja) * | 2007-06-28 | 2008-12-31 | Mitsubishi Electric Corporation | 画像符号化装置および画像復号装置 |
WO2009051010A1 (ja) * | 2007-10-15 | 2009-04-23 | Mitsubishi Electric Corporation | 画像符号化装置、画像復号装置、画像符号化方法、および画像復号方法 |
JP2009094828A (ja) * | 2007-10-10 | 2009-04-30 | Hitachi Ltd | 画像符号化装置及び画像符号化方法、画像復号化装置及び画像復号化方法 |
JP2009177787A (ja) * | 2008-01-25 | 2009-08-06 | Samsung Electronics Co Ltd | 映像の符号化、復号化の方法及びその装置 |
EP2063644A3 (en) * | 2007-10-30 | 2009-09-30 | Hitachi Ltd. | Image encoding device and encoding method, and image decoding device and decoding method |
EP2034742A3 (en) * | 2007-07-25 | 2009-10-14 | Hitachi Ltd. | Video coding method and device |
WO2010004726A1 (ja) * | 2008-07-08 | 2010-01-14 | パナソニック株式会社 | 画像符号化方法、画像復号方法、画像符号化装置、画像復号装置、プログラム、及び集積回路 |
JP2011501533A (ja) * | 2007-10-12 | 2011-01-06 | クゥアルコム・インコーポレイテッド | ビデオブロックヘッダ情報の適応可能なコーディング |
RU2454823C2 (ru) * | 2007-10-30 | 2012-06-27 | Ниппон Телеграф Энд Телефон Корпорейшн | Способ кодирования и способ декодирования видео, аппараты для этого, программы для этого, а также носители данных, которые сохраняют программы |
US8369404B2 (en) | 2007-01-12 | 2013-02-05 | Mitsubishi Electric Corporation | Moving image decoding device and moving image decoding method |
US8428133B2 (en) | 2007-06-15 | 2013-04-23 | Qualcomm Incorporated | Adaptive coding of video block prediction mode |
JP5289440B2 (ja) * | 2008-07-10 | 2013-09-11 | 三菱電機株式会社 | 画像符号化装置、画像復号装置、画像符号化方法及び画像復号方法 |
US8571104B2 (en) | 2007-06-15 | 2013-10-29 | Qualcomm, Incorporated | Adaptive coefficient scanning in video coding |
JP2013232843A (ja) * | 2012-05-01 | 2013-11-14 | Canon Inc | 動画像符号化装置及び動画像符号化方法 |
KR101330630B1 (ko) | 2006-03-13 | 2013-11-22 | 삼성전자주식회사 | 최적인 예측 모드를 적응적으로 적용하여 동영상을부호화하는 방법 및 장치, 동영상을 복호화하는 방법 및장치 |
US8655087B2 (en) | 2007-01-12 | 2014-02-18 | Mitsubishi Electric Corporation | Image decoding device and image decoding method for decoding compressed data |
US8718139B2 (en) | 2007-01-12 | 2014-05-06 | Mitsubishi Electric Corporation | Image decoding device and image decoding method |
RU2518417C2 (ru) * | 2007-02-21 | 2014-06-10 | Майкрософт Корпорейшн | Управление вычислительной сложностью и точностью в мультимедийном кодеке, основанном на преобразовании |
US8938009B2 (en) | 2007-10-12 | 2015-01-20 | Qualcomm Incorporated | Layered encoded bitstream structure |
WO2015008417A1 (ja) * | 2013-07-19 | 2015-01-22 | 日本電気株式会社 | 映像符号化装置、映像復号装置、映像符号化方法、映像復号方法及びプログラム |
JP2015015772A (ja) * | 2010-07-15 | 2015-01-22 | 三菱電機株式会社 | 動画像符号化装置及び動画像符号化方法 |
US8971405B2 (en) | 2001-09-18 | 2015-03-03 | Microsoft Technology Licensing, Llc | Block transform and quantization for image and video coding |
JP2015103969A (ja) * | 2013-11-25 | 2015-06-04 | キヤノン株式会社 | 画像符号化装置及び画像符号化方法 |
US9445113B2 (en) | 2006-01-10 | 2016-09-13 | Thomson Licensing | Methods and apparatus for parallel implementations of 4:4:4 coding |
JP2017028728A (ja) * | 2011-09-28 | 2017-02-02 | サン パテント トラスト | 画像復号化方法及び画像復号化装置 |
US10021384B2 (en) | 2010-12-23 | 2018-07-10 | Samsung Electronics Co., Ltd. | Method and device for encoding intra prediction mode for image prediction unit, and method and device for decoding intra prediction mode for image prediction unit |
US10264254B2 (en) | 2011-01-14 | 2019-04-16 | Huawei Technologies Co., Ltd. | Image coding and decoding method, image data processing method, and devices thereof |
US10306229B2 (en) | 2015-01-26 | 2019-05-28 | Qualcomm Incorporated | Enhanced multiple transforms for prediction residual |
US10623774B2 (en) | 2016-03-22 | 2020-04-14 | Qualcomm Incorporated | Constrained block-level optimization and signaling for video coding tools |
US11323748B2 (en) | 2018-12-19 | 2022-05-03 | Qualcomm Incorporated | Tree-based transform unit (TU) partition for video coding |
US11330270B2 (en) * | 2020-03-31 | 2022-05-10 | University Of Electronic Science And Technology Of China | Temporal domain rate distortion optimization considering coding-mode adaptive distortion propagation |
Families Citing this family (102)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102176754B (zh) * | 2005-07-22 | 2013-02-06 | 三菱电机株式会社 | 图像编码装置和方法、以及图像解码装置和方法 |
US8300694B2 (en) | 2005-09-20 | 2012-10-30 | Mitsubishi Electric Corporation | Image encoding method and image decoding method, image encoder and image decoder, and image encoded bit stream and recording medium |
CA2623297C (en) * | 2005-09-20 | 2011-11-01 | Mitsubishi Electric Corporation | Image encoding method and image decoding method, image encoder and image decoder, and image encoded bit stream and recording medium |
US8300700B2 (en) | 2005-09-20 | 2012-10-30 | Mitsubishi Electric Corporation | Image encoding method and image decoding method, image encoder and image decoder, and image encoded bit stream and recording medium |
US8306112B2 (en) | 2005-09-20 | 2012-11-06 | Mitsubishi Electric Corporation | Image encoding method and image decoding method, image encoder and image decoder, and image encoded bit stream and recording medium |
US8250618B2 (en) * | 2006-09-18 | 2012-08-21 | Elemental Technologies, Inc. | Real-time network adaptive digital video encoding/decoding |
WO2008084817A1 (ja) * | 2007-01-09 | 2008-07-17 | Kabushiki Kaisha Toshiba | 画像符号化と復号化の方法及び装置 |
US8565519B2 (en) * | 2007-02-09 | 2013-10-22 | Qualcomm Incorporated | Programmable pattern-based unpacking and packing of data channel information |
RU2496252C2 (ru) * | 2007-06-29 | 2013-10-20 | Шарп Кабусики Кайся | Устройство кодирования изображения, способ кодирования изображения, устройство декодирования изображения, способ декодирования изображения, программа и запоминающий носитель |
US9648325B2 (en) * | 2007-06-30 | 2017-05-09 | Microsoft Technology Licensing, Llc | Video decoding implementations for a graphics processing unit |
US8184715B1 (en) * | 2007-08-09 | 2012-05-22 | Elemental Technologies, Inc. | Method for efficiently executing video encoding operations on stream processor architectures |
TW200910971A (en) * | 2007-08-22 | 2009-03-01 | Univ Nat Cheng Kung | Direction detection algorithms for H.264 intra prediction |
US8121197B2 (en) | 2007-11-13 | 2012-02-21 | Elemental Technologies, Inc. | Video encoding and decoding using parallel processors |
US8761253B2 (en) * | 2008-05-28 | 2014-06-24 | Nvidia Corporation | Intra prediction mode search scheme |
US8325801B2 (en) | 2008-08-15 | 2012-12-04 | Mediatek Inc. | Adaptive restoration for video coding |
WO2010021700A1 (en) | 2008-08-19 | 2010-02-25 | Thomson Licensing | A propagation map |
KR101611375B1 (ko) | 2008-08-19 | 2016-04-11 | 톰슨 라이센싱 | 압축된 비디오에서 구문 요소의 cabac/avc 준수 워터마킹 |
WO2010021691A1 (en) | 2008-08-19 | 2010-02-25 | Thomson Licensing | Luminance evaluation |
CN102187583B (zh) * | 2008-08-19 | 2013-09-11 | 汤姆森特许公司 | 基于上下文的自适应二进制算术编码(cabac)的视频流兼容性 |
US8824727B2 (en) | 2008-08-20 | 2014-09-02 | Thomson Licensing | Selection of watermarks for the watermarking of compressed video |
KR101306834B1 (ko) | 2008-09-22 | 2013-09-10 | 에스케이텔레콤 주식회사 | 인트라 예측 모드의 예측 가능성을 이용한 영상 부호화/복호화 장치 및 방법 |
US8831099B2 (en) * | 2008-12-17 | 2014-09-09 | Nvidia Corporation | Selecting a macroblock encoding mode by using raw data to compute intra cost |
JP2012516626A (ja) | 2009-01-27 | 2012-07-19 | トムソン ライセンシング | ビデオ符号化およびビデオ復号における変換の選択のための方法および装置 |
US9432674B2 (en) * | 2009-02-02 | 2016-08-30 | Nvidia Corporation | Dual stage intra-prediction video encoding system and method |
EP2230849A1 (en) * | 2009-03-20 | 2010-09-22 | Mitsubishi Electric R&D Centre Europe B.V. | Encoding and decoding video data using motion vectors |
KR101631278B1 (ko) * | 2009-07-28 | 2016-06-16 | 삼성전자주식회사 | 모드 정보를 부호화, 복호화하는 방법 및 장치 |
KR101631280B1 (ko) * | 2009-07-28 | 2016-06-16 | 삼성전자주식회사 | 스킵 모드에 기초한 영상을 복호화하는 방법 및 장치 |
KR101710622B1 (ko) * | 2009-07-28 | 2017-02-28 | 삼성전자주식회사 | 스킵 모드에 따라 영상을 부호화, 복호화하는 방법 및 장치 |
KR101633459B1 (ko) | 2009-08-10 | 2016-06-24 | 삼성전자주식회사 | 컬러 간의 상관 관계를 이용한 영상 데이터 인코딩 장치 및 방법, 그리고 영상 데이터 디코딩 장치 및 방법 |
US8600179B2 (en) | 2009-09-17 | 2013-12-03 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding and decoding image based on skip mode |
WO2011040796A2 (ko) * | 2009-10-01 | 2011-04-07 | 에스케이텔레콤 주식회사 | 가변 크기의 매크로블록을 이용한 영상 부호화/복호화 방법 및 장치 |
US9549190B2 (en) | 2009-10-01 | 2017-01-17 | Sk Telecom Co., Ltd. | Method and apparatus for encoding/decoding image using variable-size macroblocks |
US8355057B2 (en) * | 2009-10-14 | 2013-01-15 | Sony Corporation | Joint scalar embedded graphics coding for color images |
KR101487687B1 (ko) | 2010-01-14 | 2015-01-29 | 삼성전자주식회사 | 큰 크기의 변환 단위를 이용한 영상 부호화, 복호화 방법 및 장치 |
CN105472394B (zh) | 2010-01-15 | 2018-11-30 | 三星电子株式会社 | 用于预测编码的使用可变分区的视频编码的方法和设备以及用于预测编码的使用可变分区的视频解码的方法和设备 |
CN106454380B (zh) * | 2010-01-15 | 2019-04-05 | 三星电子株式会社 | 对视频进行解码的方法 |
JP5547301B2 (ja) * | 2010-01-25 | 2014-07-09 | トムソン ライセンシング | 各色平面について別個のビデオ・エンコーダ、ビデオ・デコーダ、ビデオ・エンコード方法およびビデオ・デコード方法 |
CA2787495A1 (en) * | 2010-01-26 | 2011-08-04 | Vidyo, Inc. | Low complexity, high frame rate video encoder |
BR112012019745B1 (pt) | 2010-02-09 | 2020-11-10 | Contentarmor | Método de detecção de marca dágua utilizando um mapa de propagação |
KR20180028430A (ko) * | 2010-02-17 | 2018-03-16 | 한국전자통신연구원 | 초고해상도 영상을 부호화하는 장치 및 방법, 그리고 복호화 장치 및 방법 |
MX2012009474A (es) * | 2010-02-24 | 2012-10-09 | Sharp Kk | Dispositivo de codificacion de imagen y dispositivo de decodificacion de imagen. |
KR101997462B1 (ko) | 2010-04-09 | 2019-07-08 | 엘지전자 주식회사 | 비디오 데이터 처리 방법 및 장치 |
SG184528A1 (en) * | 2010-04-09 | 2012-11-29 | Mitsubishi Electric Corp | Moving image encoding device and moving image decoding device |
ES2549734T3 (es) | 2010-04-13 | 2015-11-02 | Ge Video Compression, Llc | Codificación de vídeo que usa subdivisiones multi-árbol de imágenes |
KR101584480B1 (ko) | 2010-04-13 | 2016-01-14 | 지이 비디오 컴프레션, 엘엘씨 | 평면 간 예측 |
KR102166520B1 (ko) | 2010-04-13 | 2020-10-16 | 지이 비디오 컴프레션, 엘엘씨 | 샘플 영역 병합 |
CN106454371B (zh) | 2010-04-13 | 2020-03-20 | Ge视频压缩有限责任公司 | 解码器、数组重建方法、编码器、编码方法及存储介质 |
KR101813189B1 (ko) | 2010-04-16 | 2018-01-31 | 에스케이 텔레콤주식회사 | 영상 부호화/복호화 장치 및 방법 |
KR101791242B1 (ko) | 2010-04-16 | 2017-10-30 | 에스케이텔레콤 주식회사 | 영상 부호화/복호화 장치 및 방법 |
KR101791078B1 (ko) * | 2010-04-16 | 2017-10-30 | 에스케이텔레콤 주식회사 | 영상 부호화/복호화 장치 및 방법 |
KR20110123651A (ko) * | 2010-05-07 | 2011-11-15 | 한국전자통신연구원 | 생략 부호화를 이용한 영상 부호화 및 복호화 장치 및 그 방법 |
US9510009B2 (en) | 2010-05-20 | 2016-11-29 | Thomson Licensing | Methods and apparatus for adaptive motion vector candidate ordering for video encoding and decoding |
US20130182768A1 (en) * | 2010-09-30 | 2013-07-18 | Korea Advanced Institute Of Science And Technology | Method and apparatus for encoding / decoding video using error compensation |
CN106210737B (zh) | 2010-10-06 | 2019-05-21 | 株式会社Ntt都科摩 | 图像预测解码装置、图像预测解码方法 |
CN106851271B (zh) * | 2011-03-08 | 2019-10-18 | Jvc 建伍株式会社 | 动图像编码装置以及动图像编码方法 |
JP5748553B2 (ja) * | 2011-05-13 | 2015-07-15 | キヤノン株式会社 | 撮像装置 |
JP6004375B2 (ja) * | 2011-06-03 | 2016-10-05 | サン パテント トラスト | 画像符号化方法および画像復号化方法 |
KR102008030B1 (ko) | 2011-06-23 | 2019-08-06 | 선 페이턴트 트러스트 | 화상 복호 방법, 화상 부호화 방법, 화상 복호 장치, 화상 부호화 장치 및 화상 부호화 복호 장치 |
CN105791835A (zh) * | 2011-06-23 | 2016-07-20 | Jvc建伍株式会社 | 图像编码装置和图像编码方法 |
USRE47366E1 (en) | 2011-06-23 | 2019-04-23 | Sun Patent Trust | Image decoding method and apparatus based on a signal type of the control parameter of the current block |
RU2603552C2 (ru) | 2011-06-24 | 2016-11-27 | Сан Пэтент Траст | Способ декодирования изображения, способ кодирования изображения, устройство декодирования изображения, устройство кодирования изображения и устройство кодирования и декодирования изображения |
WO2012176464A1 (ja) | 2011-06-24 | 2012-12-27 | パナソニック株式会社 | 画像復号方法、画像符号化方法、画像復号装置、画像符号化装置及び画像符号化復号装置 |
BR112013030347B1 (pt) | 2011-06-27 | 2022-06-28 | Sun Patent Trust | Método de decodificação de imagem, método de codificação de imagem, aparelho de decodificação de imagem, aparelho de codificação de imagem e aparelho de codificação e de decodificação de imagem |
MY165469A (en) | 2011-06-28 | 2018-03-23 | Sun Patent Trust | Image decoding method, image coding method, image decoding apparatus, image coding apparatus, and image coding and decoding apparatus |
MX2013010892A (es) | 2011-06-29 | 2013-12-06 | Panasonic Corp | Metodo de decodificacion de imagenes, metodo de codificacion de imagenes, aparato de decodificacion de imagenes, aparato de codificacion de imagenes y aparato de codificacion y decodificacion de imagenes. |
CN103583048B (zh) | 2011-06-30 | 2017-05-17 | 太阳专利托管公司 | 图像解码方法、图像编码方法、图像解码装置、图像编码装置及图像编码解码装置 |
KR102060619B1 (ko) | 2011-06-30 | 2019-12-30 | 선 페이턴트 트러스트 | 화상 복호 방법, 화상 부호화 방법, 화상 복호 장치, 화상 부호화 장치 및 화상 부호화 복호 장치 |
CN103765885B (zh) | 2011-07-11 | 2017-04-12 | 太阳专利托管公司 | 图像解码方法、图像编码方法、图像解码装置、图像编码装置及图像编解码装置 |
EP2736255A4 (en) * | 2011-07-18 | 2014-12-17 | Panasonic Ip Corp America | IMAGE ENCODING METHOD, IMAGE DECODING METHOD, IMAGE ENCODING APPARATUS, IMAGE DECODING APPARATUS, AND IMAGE ENCODING / DECODING APPARATUS |
WO2013014693A1 (ja) * | 2011-07-22 | 2013-01-31 | 株式会社日立製作所 | 動画像復号化方法及び画像符号化方法 |
EP3139617B1 (en) * | 2011-11-07 | 2018-01-17 | Tagivan Ii Llc | Arithmetic coding of the position of the last non-zero coefficient |
JP2013110517A (ja) | 2011-11-18 | 2013-06-06 | Canon Inc | 動きベクトル符号化装置、動きベクトル符号化方法及びプログラム、動きベクトル復号装置、動きベクトル復号方法及びプログラム |
CN107277552B (zh) * | 2011-12-21 | 2019-10-29 | 太阳专利托管公司 | 图像编码方法及装置、图像解码方法及装置 |
CN107707912B (zh) * | 2011-12-28 | 2020-05-22 | Jvc 建伍株式会社 | 动图像编码装置以及动图像编码方法 |
WO2013104210A1 (en) * | 2012-01-12 | 2013-07-18 | Mediatek Inc. | Method and apparatus for unification of significance map context selection |
ES2728146T3 (es) | 2012-01-20 | 2019-10-22 | Sun Patent Trust | Procedimientos y aparato de codificación y decodificación de vídeo utilizando predicción temporal de vector de movimiento |
CN103220508B (zh) | 2012-01-20 | 2014-06-11 | 华为技术有限公司 | 编解码方法和装置 |
WO2013114860A1 (ja) | 2012-02-03 | 2013-08-08 | パナソニック株式会社 | 画像符号化方法、画像復号方法、画像符号化装置、画像復号装置及び画像符号化復号装置 |
WO2013132792A1 (ja) | 2012-03-06 | 2013-09-12 | パナソニック株式会社 | 動画像符号化方法、動画像復号方法、動画像符号化装置、動画像復号装置、及び動画像符号化復号装置 |
GB2501535A (en) | 2012-04-26 | 2013-10-30 | Sony Corp | Chrominance Processing in High Efficiency Video Codecs |
AU2013272989B2 (en) | 2012-06-08 | 2017-05-25 | Sun Patent Trust | Image coding method, image decoding method, image coding apparatus, image decoding apparatus, and image coding and decoding apparatus |
WO2013187060A1 (ja) * | 2012-06-12 | 2013-12-19 | パナソニック株式会社 | 動画像符号化方法、動画像復号化方法、動画像符号化装置および動画像復号化装置 |
CN108235014B (zh) | 2012-06-27 | 2020-08-14 | 太阳专利托管公司 | 图像编码方法和图像编码装置 |
JP6137817B2 (ja) * | 2012-11-30 | 2017-05-31 | キヤノン株式会社 | 画像符号化装置、画像符号化方法及びプログラム |
JP6151909B2 (ja) * | 2012-12-12 | 2017-06-21 | キヤノン株式会社 | 動画像符号化装置、方法およびプログラム |
US9049442B2 (en) | 2013-03-15 | 2015-06-02 | Canon Kabushiki Kaisha | Moving image encoding apparatus and method for controlling the same |
KR101432769B1 (ko) * | 2013-07-19 | 2014-08-26 | 에스케이텔레콤 주식회사 | 인트라 예측 모드의 예측 가능성을 이용한 영상 부호화/복호화 장치 및 방법 |
JP6362370B2 (ja) * | 2014-03-14 | 2018-07-25 | 三菱電機株式会社 | 画像符号化装置、画像復号装置、画像符号化方法及び画像復号方法 |
AU2015343932A1 (en) * | 2014-11-04 | 2017-06-01 | Samsung Electronics Co., Ltd. | Probability updating method for binary arithmetic coding/decoding, and entropy coding/decoding apparatus using same |
EP3041233A1 (en) * | 2014-12-31 | 2016-07-06 | Thomson Licensing | High frame rate-low frame rate transmission technique |
CN107409208B (zh) * | 2015-03-27 | 2021-04-20 | 索尼公司 | 图像处理装置、图像处理方法以及计算机可读存储介质 |
WO2017086738A1 (ko) * | 2015-11-19 | 2017-05-26 | 한국전자통신연구원 | 영상 부호화/복호화 방법 및 장치 |
KR20170058838A (ko) * | 2015-11-19 | 2017-05-29 | 한국전자통신연구원 | 화면간 예측 향상을 위한 부호화/복호화 방법 및 장치 |
CN105407352A (zh) * | 2015-11-23 | 2016-03-16 | 小米科技有限责任公司 | 图像压缩方法、装置及服务器 |
KR101640572B1 (ko) * | 2015-11-26 | 2016-07-18 | 이노뎁 주식회사 | 효율적인 코딩 유닛 설정을 수행하는 영상 처리 장치 및 영상 처리 방법 |
EP3429204B1 (en) * | 2016-03-07 | 2020-04-15 | Sony Corporation | Encoding device and encoding method |
RU169308U1 (ru) * | 2016-11-07 | 2017-03-14 | Федеральное государственное бюджетное образовательное учреждение высшего образования "Юго-Западный государственный университет" (ЮЗГУ) | Устройство для оперативного восстановления видеосигнала RGB-модели |
TWI627521B (zh) * | 2017-06-07 | 2018-06-21 | 財團法人工業技術研究院 | 時序估算方法與模擬裝置 |
CN112205022B (zh) * | 2018-05-28 | 2024-04-12 | 三菱电机株式会社 | 无线接入网络的管理装置 |
KR102666666B1 (ko) | 2018-06-01 | 2024-05-20 | 삼성전자주식회사 | 이미지 부호화 장치 및 이미지 복호화 장치 |
US11475601B2 (en) * | 2019-10-21 | 2022-10-18 | Google Llc | Image decoding during bitstream interruptions |
KR20220152299A (ko) * | 2020-03-12 | 2022-11-15 | 인터디지털 브이씨 홀딩스 프랑스 | 비디오 인코딩 및 디코딩을 위한 방법 및 장치 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005039743A (ja) * | 2003-07-18 | 2005-02-10 | Sony Corp | 画像情報符号化装置及び方法、並びに画像情報復号装置及び方法 |
JP2005034975A (ja) * | 2003-07-18 | 2005-02-10 | Kr Kogyo Kk | ラチェットレンチ及びその組立て方法 |
JP2006203909A (ja) * | 2005-01-21 | 2006-08-03 | Seiko Epson Corp | マクロブロックをイントラ符号化するための予測モードを選択する方法、ビデオデータのマクロブロックに対して少なくとも一つの予測モードを選択するための方法、予測モードの選択を可能にするためにコンピュータ可読媒体に実装されたコンピュータプログラム製品、および複数の予測モードでデータを符号化するためのエンコーダ |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07109990B2 (ja) * | 1989-04-27 | 1995-11-22 | 日本ビクター株式会社 | 適応型フレーム間予測符号化方法及び復号方法 |
JP2683181B2 (ja) * | 1992-05-12 | 1997-11-26 | 三菱電機株式会社 | カラー画像処理装置 |
JP4326758B2 (ja) * | 1995-11-02 | 2009-09-09 | 三菱電機株式会社 | 動画像符号化装置及び動画像復号化装置 |
JPH1013859A (ja) * | 1996-06-26 | 1998-01-16 | Mitsubishi Electric Corp | 画像用高能率符号化器及び画像用高能率復号化器及び画像用高能率符号化復号化システム |
CN100518319C (zh) * | 1996-12-18 | 2009-07-22 | 汤姆森消费电子有限公司 | 将数据压缩成固定长度数据块及解压的方法 |
WO1998036576A1 (en) * | 1997-02-13 | 1998-08-20 | Mitsubishi Denki Kabushiki Kaisha | Moving picture prediction system |
JP3901287B2 (ja) * | 1997-02-27 | 2007-04-04 | 松下電器産業株式会社 | 映像信号変換装置、映像信号変換方法及び映像提供システム |
WO1998042134A1 (en) * | 1997-03-17 | 1998-09-24 | Mitsubishi Denki Kabushiki Kaisha | Image encoder, image decoder, image encoding method, image decoding method and image encoding/decoding system |
KR100511693B1 (ko) * | 1997-10-23 | 2005-09-02 | 미쓰비시덴키 가부시키가이샤 | 화상 복호화 장치 |
US6493385B1 (en) * | 1997-10-23 | 2002-12-10 | Mitsubishi Denki Kabushiki Kaisha | Image encoding method, image encoder, image decoding method, and image decoder |
JP2001285863A (ja) * | 2000-03-30 | 2001-10-12 | Sony Corp | 画像情報変換装置及び方法 |
US6757429B2 (en) * | 2001-02-21 | 2004-06-29 | Boly Media Communications Inc. | Method of compressing digital images |
JP4193406B2 (ja) * | 2002-04-16 | 2008-12-10 | 三菱電機株式会社 | 映像データ変換装置および映像データ変換方法 |
EP2860978B1 (en) * | 2002-05-28 | 2020-04-08 | Dolby International AB | Method and systems for image intra-prediction mode estimation, communication, and organization |
JP4724351B2 (ja) * | 2002-07-15 | 2011-07-13 | 三菱電機株式会社 | 画像符号化装置、画像符号化方法、画像復号装置、画像復号方法、および通信装置 |
CN100553339C (zh) * | 2002-07-15 | 2009-10-21 | 株式会社日立制作所 | 动态图像解码方法 |
US7469069B2 (en) * | 2003-05-16 | 2008-12-23 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding/decoding image using image residue prediction |
JP4815107B2 (ja) * | 2003-07-16 | 2011-11-16 | 三星電子株式会社 | カラー平面間予測を利用した無損失映像符号化/復号化方法及び装置 |
CN101616330B (zh) * | 2003-07-16 | 2012-07-04 | 三星电子株式会社 | 用于色彩图像的视频编码/解码装置和方法 |
EP1515561B1 (en) * | 2003-09-09 | 2007-11-21 | Mitsubishi Electric Information Technology Centre Europe B.V. | Method and apparatus for 3-D sub-band video coding |
KR100964401B1 (ko) * | 2003-10-23 | 2010-06-17 | 삼성전자주식회사 | 칼라 영상을 위한 인트라 부호화/복호화 방법 및 장치 |
KR20050061762A (ko) * | 2003-12-18 | 2005-06-23 | 학교법인 대양학원 | 부호화 모드 결정방법, 움직임 추정방법 및 부호화 장치 |
JP2005349775A (ja) | 2004-06-14 | 2005-12-22 | Noritsu Koki Co Ltd | 写真プリントシステム |
KR101246915B1 (ko) * | 2005-04-18 | 2013-03-25 | 삼성전자주식회사 | 동영상 부호화 또는 복호화 방법 및 장치 |
JP2006304102A (ja) * | 2005-04-22 | 2006-11-02 | Renesas Technology Corp | 画像符号化ユニットと画像符号化方法 |
CN102176754B (zh) * | 2005-07-22 | 2013-02-06 | 三菱电机株式会社 | 图像编码装置和方法、以及图像解码装置和方法 |
-
2006
- 2006-06-16 CN CN201110094143XA patent/CN102176754B/zh active Active
- 2006-06-16 JP JP2006551668A patent/JP4542107B2/ja active Active
- 2006-06-16 KR KR1020097027351A patent/KR100995226B1/ko active IP Right Grant
- 2006-06-16 CA CA 2610276 patent/CA2610276C/en active Active
- 2006-06-16 EP EP06757388A patent/EP1909508A4/en not_active Ceased
- 2006-06-16 BR BRPI0611672-8A patent/BRPI0611672A2/pt not_active Application Discontinuation
- 2006-06-16 CN CN2006800251408A patent/CN101218830B/zh active Active
- 2006-06-16 KR KR20087001610A patent/KR100957754B1/ko active IP Right Grant
- 2006-06-16 CA CA 2732532 patent/CA2732532C/en active Active
- 2006-06-16 KR KR1020107013666A patent/KR101037855B1/ko active IP Right Grant
- 2006-06-16 KR KR1020117021863A patent/KR101217400B1/ko active IP Right Grant
- 2006-06-16 CN CN201010114053A patent/CN101815224A/zh active Pending
- 2006-06-16 US US11/912,680 patent/US20090034856A1/en not_active Abandoned
- 2006-06-16 KR KR1020107028571A patent/KR101213904B1/ko active IP Right Grant
- 2006-06-16 CN CN2011102357344A patent/CN102231835B/zh active Active
- 2006-06-16 WO PCT/JP2006/312159 patent/WO2007010690A1/ja active Application Filing
- 2006-06-16 CN CN201110235078.8A patent/CN102281448B/zh active Active
-
2010
- 2010-01-08 JP JP2010003109A patent/JP5138709B2/ja active Active
-
2012
- 2012-01-05 HK HK12100070A patent/HK1159913A1/en unknown
- 2012-03-21 HK HK12102830A patent/HK1162791A1/xx unknown
- 2012-07-10 RU RU2012129160/08A patent/RU2506714C1/ru active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005039743A (ja) * | 2003-07-18 | 2005-02-10 | Sony Corp | 画像情報符号化装置及び方法、並びに画像情報復号装置及び方法 |
JP2005034975A (ja) * | 2003-07-18 | 2005-02-10 | Kr Kogyo Kk | ラチェットレンチ及びその組立て方法 |
JP2006203909A (ja) * | 2005-01-21 | 2006-08-03 | Seiko Epson Corp | マクロブロックをイントラ符号化するための予測モードを選択する方法、ビデオデータのマクロブロックに対して少なくとも一つの予測モードを選択するための方法、予測モードの選択を可能にするためにコンピュータ可読媒体に実装されたコンピュータプログラム製品、および複数の予測モードでデータを符号化するためのエンコーダ |
Non-Patent Citations (1)
Title |
---|
See also references of EP1909508A4 * |
Cited By (78)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8971405B2 (en) | 2001-09-18 | 2015-03-03 | Microsoft Technology Licensing, Llc | Block transform and quantization for image and video coding |
US9445113B2 (en) | 2006-01-10 | 2016-09-13 | Thomson Licensing | Methods and apparatus for parallel implementations of 4:4:4 coding |
US10034000B2 (en) | 2006-03-13 | 2018-07-24 | Samsung Electronics Co., Ltd. | Method, medium, and system encoding and/or decoding moving pictures by adaptively applying optimal prediction modes |
US9654779B2 (en) | 2006-03-13 | 2017-05-16 | Samsung Electronics Co., Ltd. | Method, medium, and system encoding and/or decoding moving pictures by adaptively applying optimal predication modes |
KR101330630B1 (ko) | 2006-03-13 | 2013-11-22 | 삼성전자주식회사 | 최적인 예측 모드를 적응적으로 적용하여 동영상을부호화하는 방법 및 장치, 동영상을 복호화하는 방법 및장치 |
JP2014197865A (ja) * | 2006-03-13 | 2014-10-16 | サムスン エレクトロニクス カンパニー リミテッド | 予測映像の生成方法、装置及び記録媒体 |
JP2014158305A (ja) * | 2006-03-13 | 2014-08-28 | Samsung Electronics Co Ltd | 予測映像の生成方法、装置及び記録媒体 |
US8718139B2 (en) | 2007-01-12 | 2014-05-06 | Mitsubishi Electric Corporation | Image decoding device and image decoding method |
US8369404B2 (en) | 2007-01-12 | 2013-02-05 | Mitsubishi Electric Corporation | Moving image decoding device and moving image decoding method |
US8655087B2 (en) | 2007-01-12 | 2014-02-18 | Mitsubishi Electric Corporation | Image decoding device and image decoding method for decoding compressed data |
RU2518417C2 (ru) * | 2007-02-21 | 2014-06-10 | Майкрософт Корпорейшн | Управление вычислительной сложностью и точностью в мультимедийном кодеке, основанном на преобразовании |
US8942289B2 (en) | 2007-02-21 | 2015-01-27 | Microsoft Corporation | Computational complexity and precision control in transform-based digital media codec |
US8306341B2 (en) | 2007-03-30 | 2012-11-06 | Fujitsu Limited | Image data compression apparatus and decoding apparatus |
WO2008126139A1 (ja) * | 2007-03-30 | 2008-10-23 | Fujitsu Limited | 画像データ圧縮装置及び復号装置 |
WO2008132791A1 (ja) * | 2007-04-13 | 2008-11-06 | Panasonic Corporation | 画像処理装置、集積回路及び画像処理方法 |
US8428133B2 (en) | 2007-06-15 | 2013-04-23 | Qualcomm Incorporated | Adaptive coding of video block prediction mode |
US8619853B2 (en) | 2007-06-15 | 2013-12-31 | Qualcomm Incorporated | Separable directional transforms |
US9578331B2 (en) | 2007-06-15 | 2017-02-21 | Qualcomm Incorporated | Separable directional transforms |
US8571104B2 (en) | 2007-06-15 | 2013-10-29 | Qualcomm, Incorporated | Adaptive coefficient scanning in video coding |
US8520732B2 (en) | 2007-06-15 | 2013-08-27 | Qualcomm Incorporated | Adaptive coding of video block prediction mode |
US8488668B2 (en) | 2007-06-15 | 2013-07-16 | Qualcomm Incorporated | Adaptive coefficient scanning for video coding |
RU2470480C1 (ru) * | 2007-06-28 | 2012-12-20 | Мицубиси Электроник Корпорейшн | Устройство кодирования изображения и устройство декодирования изображения |
US8145002B2 (en) | 2007-06-28 | 2012-03-27 | Mitsubishi Electric Corporation | Image encoding device and image encoding method |
CN101889449B (zh) * | 2007-06-28 | 2013-06-19 | 三菱电机株式会社 | 图像编码装置以及图像解码装置 |
US8345968B2 (en) | 2007-06-28 | 2013-01-01 | Mitsubishi Electric Corporation | Image encoding device, image decoding device, image encoding method and image decoding method |
WO2009001864A1 (ja) * | 2007-06-28 | 2008-12-31 | Mitsubishi Electric Corporation | 画像符号化装置および画像復号装置 |
US8422803B2 (en) | 2007-06-28 | 2013-04-16 | Mitsubishi Electric Corporation | Image encoding device, image decoding device, image encoding method and image decoding method |
JP2013176168A (ja) * | 2007-06-28 | 2013-09-05 | Mitsubishi Electric Corp | 画像符号化装置および画像復号装置 |
CN101889449A (zh) * | 2007-06-28 | 2010-11-17 | 三菱电机株式会社 | 图像编码装置以及图像解码装置 |
JP5296679B2 (ja) * | 2007-06-28 | 2013-09-25 | 三菱電機株式会社 | 画像復号装置および画像復号方法 |
CN103327328A (zh) * | 2007-06-28 | 2013-09-25 | 三菱电机株式会社 | 图像编码装置以及图像解码装置 |
KR101379087B1 (ko) | 2007-06-28 | 2014-04-11 | 미쓰비시덴키 가부시키가이샤 | 화상 부호화 장치 및 방법, 그리고 화상 복호 장치 및 방법 |
JPWO2009001864A1 (ja) * | 2007-06-28 | 2010-08-26 | 三菱電機株式会社 | 画像符号化装置および画像復号装置 |
US8139875B2 (en) | 2007-06-28 | 2012-03-20 | Mitsubishi Electric Corporation | Image encoding device, image decoding device, image encoding method and image decoding method |
KR101088972B1 (ko) | 2007-06-28 | 2011-12-01 | 미쓰비시덴키 가부시키가이샤 | 화상 부호화 장치 및 방법 |
EP2034742A3 (en) * | 2007-07-25 | 2009-10-14 | Hitachi Ltd. | Video coding method and device |
CN104052996B (zh) * | 2007-10-10 | 2018-04-10 | 日立麦克赛尔株式会社 | 图像编码装置及方法,和图像解码装置及方法 |
CN104038763B (zh) * | 2007-10-10 | 2018-04-24 | 麦克赛尔株式会社 | 图像编码装置及方法,和图像解码装置及方法 |
US9706202B2 (en) | 2007-10-10 | 2017-07-11 | Hitachi Maxell, Ltd. | Image encoding apparatus, image encoding method, image decoding apparatus, and image decoding method |
US9699459B2 (en) | 2007-10-10 | 2017-07-04 | Hitachi Maxell, Ltd. | Image encoding apparatus, image encoding method, image decoding apparatus, and image decoding method |
CN104038763A (zh) * | 2007-10-10 | 2014-09-10 | 株式会社日立制作所 | 图像编码装置及方法,和图像解码装置及方法 |
CN104052996A (zh) * | 2007-10-10 | 2014-09-17 | 株式会社日立制作所 | 图像编码装置及方法,和图像解码装置及方法 |
CN104052995A (zh) * | 2007-10-10 | 2014-09-17 | 株式会社日立制作所 | 图像编码装置及方法,和图像解码装置及方法 |
EP2059048A3 (en) * | 2007-10-10 | 2009-10-21 | Hitachi Ltd. | Image encoding apparatus, image encoding method, image decoding apparatus, and image decoding method |
US8867626B2 (en) | 2007-10-10 | 2014-10-21 | Hitachi, Ltd. | Image encoding apparatus, image encoding method, image decoding apparatus, and image decoding method |
US9699458B2 (en) | 2007-10-10 | 2017-07-04 | Hitachi Maxell, Ltd. | Image encoding apparatus, image encoding method, image decoding apparatus, and image decoding method |
US9609322B2 (en) | 2007-10-10 | 2017-03-28 | Hitachi Maxell, Ltd. | Image encoding apparatus, image encoding method, image decoding apparatus, and image decoding method |
JP2009094828A (ja) * | 2007-10-10 | 2009-04-30 | Hitachi Ltd | 画像符号化装置及び画像符号化方法、画像復号化装置及び画像復号化方法 |
CN104052995B (zh) * | 2007-10-10 | 2017-12-29 | 日立麦克赛尔株式会社 | 图像解码方法 |
US9451255B2 (en) | 2007-10-10 | 2016-09-20 | Hitachi Maxell, Ltd. | Image encoding apparatus, image encoding method, image decoding apparatus, and image decoding method |
CN104052997B (zh) * | 2007-10-10 | 2018-06-15 | 日立麦克赛尔株式会社 | 图像编码装置及方法,和图像解码装置及方法 |
US9386316B2 (en) | 2007-10-12 | 2016-07-05 | Qualcomm Incorporated | Adaptive coding of video block header information |
JP2011501533A (ja) * | 2007-10-12 | 2011-01-06 | クゥアルコム・インコーポレイテッド | ビデオブロックヘッダ情報の適応可能なコーディング |
US8938009B2 (en) | 2007-10-12 | 2015-01-20 | Qualcomm Incorporated | Layered encoded bitstream structure |
WO2009051010A1 (ja) * | 2007-10-15 | 2009-04-23 | Mitsubishi Electric Corporation | 画像符号化装置、画像復号装置、画像符号化方法、および画像復号方法 |
EP2063644A3 (en) * | 2007-10-30 | 2009-09-30 | Hitachi Ltd. | Image encoding device and encoding method, and image decoding device and decoding method |
US8520727B2 (en) | 2007-10-30 | 2013-08-27 | Nippon Telegraph And Telephone Corporation | Video encoding method and decoding method, apparatuses therefor, programs therefor, and storage media which store the programs |
RU2454823C2 (ru) * | 2007-10-30 | 2012-06-27 | Ниппон Телеграф Энд Телефон Корпорейшн | Способ кодирования и способ декодирования видео, аппараты для этого, программы для этого, а также носители данных, которые сохраняют программы |
JP2009177787A (ja) * | 2008-01-25 | 2009-08-06 | Samsung Electronics Co Ltd | 映像の符号化、復号化の方法及びその装置 |
WO2010004726A1 (ja) * | 2008-07-08 | 2010-01-14 | パナソニック株式会社 | 画像符号化方法、画像復号方法、画像符号化装置、画像復号装置、プログラム、及び集積回路 |
JP5289440B2 (ja) * | 2008-07-10 | 2013-09-11 | 三菱電機株式会社 | 画像符号化装置、画像復号装置、画像符号化方法及び画像復号方法 |
JP2015222973A (ja) * | 2010-07-15 | 2015-12-10 | 三菱電機株式会社 | 動画像符号化装置、動画像符号化方法、動画像復号装置、動画像復号方法及びビットストリーム |
JP2015015772A (ja) * | 2010-07-15 | 2015-01-22 | 三菱電機株式会社 | 動画像符号化装置及び動画像符号化方法 |
US10021384B2 (en) | 2010-12-23 | 2018-07-10 | Samsung Electronics Co., Ltd. | Method and device for encoding intra prediction mode for image prediction unit, and method and device for decoding intra prediction mode for image prediction unit |
US10630986B2 (en) | 2010-12-23 | 2020-04-21 | Samsung Electronics Co., Ltd. | Method and device for encoding intra prediction mode for image prediction unit, and method and device for decoding intra prediction mode for image prediction unit |
US11509899B2 (en) | 2010-12-23 | 2022-11-22 | Samsung Electronics Co., Ltd. | Method and device for encoding intra prediction mode for image prediction unit, and method and device for decoding intra prediction mode for image prediction unit |
US11070811B2 (en) | 2010-12-23 | 2021-07-20 | Samsung Electronics Co., Ltd. | Method and device for encoding intra prediction mode for image prediction unit, and method and device for decoding intra prediction mode for image prediction unit |
US10264254B2 (en) | 2011-01-14 | 2019-04-16 | Huawei Technologies Co., Ltd. | Image coding and decoding method, image data processing method, and devices thereof |
JP2017028728A (ja) * | 2011-09-28 | 2017-02-02 | サン パテント トラスト | 画像復号化方法及び画像復号化装置 |
JP2013232843A (ja) * | 2012-05-01 | 2013-11-14 | Canon Inc | 動画像符号化装置及び動画像符号化方法 |
US10178408B2 (en) | 2013-07-19 | 2019-01-08 | Nec Corporation | Video coding device, video decoding device, video coding method, video decoding method, and program |
JPWO2015008417A1 (ja) * | 2013-07-19 | 2017-03-02 | 日本電気株式会社 | 映像符号化装置、映像復号装置、映像符号化方法、映像復号方法及びプログラム |
WO2015008417A1 (ja) * | 2013-07-19 | 2015-01-22 | 日本電気株式会社 | 映像符号化装置、映像復号装置、映像符号化方法、映像復号方法及びプログラム |
JP2015103969A (ja) * | 2013-11-25 | 2015-06-04 | キヤノン株式会社 | 画像符号化装置及び画像符号化方法 |
US10306229B2 (en) | 2015-01-26 | 2019-05-28 | Qualcomm Incorporated | Enhanced multiple transforms for prediction residual |
US10623774B2 (en) | 2016-03-22 | 2020-04-14 | Qualcomm Incorporated | Constrained block-level optimization and signaling for video coding tools |
US11323748B2 (en) | 2018-12-19 | 2022-05-03 | Qualcomm Incorporated | Tree-based transform unit (TU) partition for video coding |
US11330270B2 (en) * | 2020-03-31 | 2022-05-10 | University Of Electronic Science And Technology Of China | Temporal domain rate distortion optimization considering coding-mode adaptive distortion propagation |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007010690A1 (ja) | 画像符号化装置、画像復号装置、および画像符号化方法、画像復号方法、画像符号化プログラム、画像復号プログラム、ならびに画像符号化プログラムを記録したコンピュータ読み取り可能な記録媒体、画像復号プログラムを記録したコンピュータ読み取り可能な記録媒体 | |
JP5296679B2 (ja) | 画像復号装置および画像復号方法 | |
US8488889B2 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
US8509551B2 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recording with image encoding program and computer readable recording medium recorded with image decoding program | |
RU2502216C2 (ru) | Кодер изображения и декодер изображения, способ кодирования изображения и способ декодирования изображения | |
US20090123066A1 (en) | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein, | |
US20090034857A1 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
US20080123947A1 (en) | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, image decoding program, computer readable recording medium having image encoding program recorded therein | |
US20080165849A1 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
US20080123977A1 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
US20080137744A1 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
US20080130989A1 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
US20080130990A1 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
US20080130988A1 (en) | Image encoder and image decoder, image encoding method and image decoding method, image encoding program and image decoding program, and computer readable recording medium recorded with image encoding program and computer readable recording medium recorded with image decoding program | |
WO2011125313A1 (ja) | 動画像符号化装置および動画像復号装置 | |
WO2011125314A1 (ja) | 動画像符号化装置および動画像復号装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200680025140.8 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006551668 Country of ref document: JP |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2006757388 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2610276 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 5700/CHENP/2007 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020087001610 Country of ref document: KR |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008106777 Country of ref document: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11912680 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: PI0611672 Country of ref document: BR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020097027351 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020107013666 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020107028571 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020117021863 Country of ref document: KR |