WO2006035584A1 - 符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体 - Google Patents
符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体 Download PDFInfo
- Publication number
- WO2006035584A1 WO2006035584A1 PCT/JP2005/016571 JP2005016571W WO2006035584A1 WO 2006035584 A1 WO2006035584 A1 WO 2006035584A1 JP 2005016571 W JP2005016571 W JP 2005016571W WO 2006035584 A1 WO2006035584 A1 WO 2006035584A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- mode
- cost value
- inter
- value
- motion compensation
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/573—Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/109—Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/11—Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/86—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
Definitions
- the present invention relates to an encoding device, an encoding method, an encoding method program, and a recording medium on which an encoding method program is recorded, and is applied to a video camera, an electronic still camera, a monitoring device, etc. be able to.
- the present invention detects the optimal mode of the inter prediction mode based on the cost value based on the accuracy of the integer pixel, calculates the cost value with an accuracy of 1 pixel or less for this optimal mode, and calculates the intra prediction mode and the inter prediction mode.
- the optimum mode can be detected by a simple process when the optimum mode is detected by the cost function and the image data is encoded. '' Background technology
- image data is compressed by orthogonal transform such as discrete cosine transform and motion compensation.
- MPEG 2 (ISO / IEC 13818-2), which is one of these methods, is a method defined as a general-purpose image coding method, and supports both interlaced scanning and sequential scanning methods. It is defined to support both standard resolution images and high-definition images, and is now widely used in a wide range of professional and consumer applications. Specifically, M P E G
- image data of a standard resolution and interlaced scanning method with 7 2 0 X 4 8 0 pixels is compressed to a bit rate of 4 to 8 [M bps], and 1 9 2
- interlaced image data 1 8 to 2 2 Data can be compressed to a bit rate of [Mbps] to ensure high image quality and a high compression rate.
- MP EG 2 is a high image quality encoding method suitable for broadcasting, and does not support a high compression rate encoding method with a smaller code amount than MP EG 1.
- the spread of mobile terminals in recent years is expected to increase the need for such a high-compression coding method with a smaller code amount than MP EG 1. Therefore, the MPEG4 encoding standard is IS. / IECC (International Organization for Standardization / international Electrotechnical Commission) 14496—2 1 Approved as an international standard in December 998.
- H26 L ITU-T Q6 / 16 VCEG
- MPEG2 and MPEG4 higher coding efficiency can be secured, and as part of MP EG 4 activities, this H26 L is used as a base.
- Standardization of coding methods that incorporate various functions and ensure even higher coding efficiency has been promoted as Joint Model of Enhanced- Compression Video Coding. In these methods, H264 and It was set as an international standard under the name of MPEG-4 Part 10 (AVC: Advanced Video Coding).
- FIG. 1 is a block diagram showing an encoding device based on this AVC.
- the encoding device 1 selects an optimal prediction mode from a plurality of intra prediction modes and a plurality of inter prediction modes, and subtracts a prediction value of the selected prediction mode from the image data to generate difference data.
- the difference data is subjected to orthogonal transform processing, quantization processing, and variable length coding processing, whereby the image data is coded by intra coding and inter coding.
- the analog-digital conversion circuit (AZD) 2 performs analog-digital conversion processing on the video signal SV and outputs image data D1.
- the screen rearrangement buffer 3 receives the image data D1 output from the analog-digital conversion circuit 2, and in accordance with the GO P (Group of Pictures) structure related to the encoding process of the encoding device 1, Rearrange the frames of image data D 1 Output.
- GO P Group of Pictures
- the subtraction circuit 4 receives the image data D 1 output from the screen rearrangement buffer 3, and generates and outputs difference data D 2 from the prediction value generated by the intra prediction circuit 5 in intra coding. On the other hand, in inter coding, difference data D 2 from the prediction value generated by the motion prediction / compensation circuit 6 is generated and output.
- the orthogonal transformation circuit 7 inputs the output data D 2 of the subtraction circuit 4, performs orthogonal transformation processing such as discrete cosine transformation and force Lunen 'label transformation, and outputs transformation coefficient data D 3 based on the processing result .
- the quantization circuit 8 quantizes the transform coefficient data D3 by the quantization scale by the rate control of the rate control circuit 9 and outputs it.
- the lossless encoding circuit 10 performs a lossless encoding process on the output data of the quantization circuit 8 by variable length encoding, arithmetic encoding, or the like, and outputs the result.
- the lossless encoding circuit 10 obtains information about the intra prediction mode related to intra encoding, information about the motion vector related to inter encoding, and the like from the intra prediction circuit 5, the motion prediction / compensation circuit 6, and the like. Set the information in the header information of output data D4 and output.
- the accumulation buffer 11 1 accumulates the output data D 4 of the lossless encoding circuit 10 and outputs it at the transmission rate of the subsequent transmission line.
- the rate control circuit 9 monitors the amount of code generated by the encoding process by monitoring the free capacity of the storage buffer 11, and switches the quantization scale in the quantization circuit 8 based on the monitoring result. Controls the amount of code generated by device 1.
- the inverse quantization circuit 13 performs inverse quantization processing on the output data of the quantization circuit 8, thereby reproducing the input data of the quantization circuit 8.
- the inverse orthogonal transform circuit 14 performs inverse orthogonal transform processing on the output data of the inverse quantization circuit 13, thereby reproducing the input data of the orthogonal transform circuit 7.
- the deblocking filter 15 removes block distortion from the output data of the inverse orthogonal transform circuit 14 and outputs the result.
- the frame memory 16 appropriately adds the prediction value generated by the intra prediction circuit 5 or the motion prediction / compensation circuit 6 to the output data of the deblock filter 15 and records it as reference image information.
- the intra prediction circuit 5 detects the optimum mode of the intra prediction mode based on the reference image information stored in the frame memory 16 in the intra coding. When encoding is performed by intra prediction, a prediction value of the predicted image information is generated from the reference image information in this optimal mode and is output to the subtraction circuit 4.
- multiple reference frames are used to select one of a plurality of reference frames R ef for the processing target frame Org as shown in FIG. If the part corresponding to the motion compensation block is hidden in the previous frame, or if the entire pixel value temporarily changes in the previous frame due to flash etc. Even in this case, motion compensation is performed with high accuracy to improve data compression efficiency.
- the motion compensation block is designed to compensate for motion based on a block consisting of 16 pixels x 16 pixels, but variable MC Block Size Therefore, as shown in Fig. 4 (A 2) to (A 4), the macro block consisting of 16 pixels x 16 pixels is set to 2 in the horizontal and / or vertical directions. Split 16 pixels x 8 pixels, 8 pixels XI 6 pixels, and 8 pixels x 8 pixels are set so that motion compensation can be performed by setting a motion vector and a reference frame independently. In addition, as shown in Fig.
- each tap input of the 6-tap FIR filter is weighted by the values 1, 1, 20, 20, 1, 5, 1, and
- the pixel value c is calculated with 1 Z 2 pixel accuracy between consecutive pixels in the horizontal and vertical directions.
- the block related to the motion vector mv divides one macroblock into two in the horizontal direction.
- the reference frame refldxE related to the detection of the prediction motion vector mv the reference frame refldxA of the submacroblock A adjacent to the left side, and If they are equal, the motion vector mvA detected by the adjacent submacroblock A on the left side is lost to the predicted motion vector pmv as shown in the following equation. Loss ⁇ 9 -Q o
- the chrominance signal is motion-compensated using a motion compensation block, a motion vector, and a reference frame based on the luminance signal.
- AVC detects the optimal mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes, and processes the image data in this optimal mode to efficiently encode the image data. .
- a coarse motion prediction circuit that calculates the cost value of a device, an inter mode determination circuit that detects an optimal mode in the plurality of inter prediction modes based on a cost value based on the coarse motion compensation position, and the inter mode For the optimal mode detected by the judgment circuit, a fine motion that calculates the cost value with motion compensation with an accuracy of 1 pixel or less.
- a prediction circuit; and an intra-intermediate mode determination circuit that detects an optimal mode to be used for the encoding process by comparing the cost value by the fine motion prediction circuit and the cost value by the plurality of intra prediction modes.
- the cost value is calculated with an accuracy of 1 pixel or less only for the optimal mode of the inter prediction mode, without calculating the cost value by performing motion compensation with an accuracy of 1 pixel or less.
- the optimal mode is detected by the cost function. When the image data is encoded and processed, the optimum mode can be detected by a simple process.
- the inter-mode determination step for detecting the optimum mode in the mode and in the inter-mode determination step A fine cost value calculating step for calculating the cost value by performing motion compensation with an accuracy of 1 pixel or less for the optimum mode detected by the method, and the fine cost value calculating step A step of intra / inter mode determination for detecting an optimum mode to be used for the encoding process by comparing the cost value by the step and the cost value by the plurality of intra prediction modes.
- FIG. 1 is a block diagram showing an AVC encoding apparatus.
- FIG. 2 is a block diagram showing an AVC decoding apparatus.
- FIG. 7 is a schematic diagram used for explaining the predicted value of the motion vector related to the sub macroblock. .
- FIG. 8 is a schematic diagram for explaining the predicted value of the motion vector according to another example.
- FIG. 9 is a schematic diagram for explaining the temporal direct mode.
- FIG. 10 is a block diagram showing an encoding apparatus according to Embodiment 1 of the present invention.
- Figure 11 is a plan view used to explain the calculation of the expected value.
- Fig. 12 is a schematic diagram used to calculate the expected value using a linear function.
- FIG. 10 is a block diagram showing an encoding apparatus according to an embodiment of the present invention.
- this encoding device 40 the same components as those of the encoding device 1 described above with reference to FIG. 1 are indicated by the corresponding reference numerals, and redundant description is omitted.
- the intra prediction circuit 5 performs the cost value C ost (M ode (M ode ) Is calculated.
- Intra prediction modes calculated in this way are all modes of 4 X 4 prediction mode and 16 X 16 prediction mode.
- the intra-prediction circuit 5 notifies the intra-inter mode determination circuit 45 of the cost values of all the calculated prediction modes.
- the intra / inter mode judgment circuit 4 5 When the intra coding is instructed and the optimum mode is instructed, a predicted value of the luminance signal is generated by this optimum mode and output to the subtracting circuit 4.
- the optimum mode is detected by calculating and comparing the cost value, and the predicted value of the chrominance signal is output to the subtraction circuit 4 in this optimum mode.
- the coarse motion prediction circuit 42 detects the motion vector with integer pixel accuracy in all the inter prediction modes, and the cost value of the coarse motion compensation position by this motion vector.
- the coarse motion prediction circuit 42 detects motion vectors by correcting motion with accuracy of one pixel for all inter prediction modes of the luminance signal.
- all inter prediction modes are modes other than the direct mode, which allows the coarse motion prediction circuit 42 to perform all motion compensation blocks, forward prediction, and backward prediction using all reference frames, macroblocks, and sub-macroblocks. For all combinations of prediction and bi-directional prediction directions, motion vector with integer pixel accuracy is detected.
- MV intBit (Mode) is the predicted motion vector described above with reference to Figs. This is the amount of code used to transmit motion vector information using the tor pmv. To calculate this amount of code, it is necessary to generate the predicted motion vector pmv from the motion vector values of the surrounding blocks. In this case, if adjacent blocks A to C in Fig. 8 are macroblocks, the predicted motion vector pmv is set by detecting the optimal mode in these adjacent blocks A to C and having already performed the encoding process.
- the motion vector value mv intX detected with integer pixel accuracy by the coarse motion prediction circuit 42 is set to the predicted motion vector pmv int, thereby making a pseudo prediction
- the motion vector pmv is generated and the cost value C ost (M ode) is calculated.
- the coarse motion prediction circuit 42 has a rough cost value based on the coarse motion compensation position, a surrounding compensation position based on the coarse motion compensation position, and a surrounding compensation position based on the motion compensation accuracy related to the coarse motion compensation position. Similarly, the cost value is detected and output. Specifically, in this embodiment, since the motion compensation accuracy is 1 pixel, the coarse motion prediction circuit 42 can determine the X direction and Y direction components of the motion vector related to the coarse motion compensation position. Change the value by ⁇ 1 pixel, perform motion compensation in the same way, and calculate the cost value C ost (M ode).
- the inter mode determination circuit 43 Based on the cost value of the coarse motion compensation position detected by the coarse motion prediction circuit 42, the inter mode determination circuit 43 has a cost at the motion compensation position where the cost value is the minimum value with an accuracy of 1 pixel or less. The expected value of the value is calculated, and the optimum mode in the multiple inter prediction modes is detected by comparing the estimated value in the multiple inter prediction modes. That is, as shown in Fig.
- the inter-mode determination circuit 43 has a minimum point in the direction connecting the coarse motion compensation position E and the two surrounding compensation positions facing each other with the coarse motion compensation position E in between.
- the change in cost is expressed by a linear function whose sign changes, and the minimum value of the cost value is detected, and the expected value is detected based on the minimum value of the cost value. That is, in this case, the rough motion compensation position E and the rough motion compensation position E are opposed to each other in the four directions of the horizontal direction, the vertical direction, and the oblique direction passing through the rough motion compensation position E. 2
- the direction connecting two surrounding compensation positions can be set.
- the point with the smallest cost value is detected by linear approximation using a linear function in each of these four directions.
- the minimum cost value C ostint (BH) in the vertical direction is detected by the intersection of the straight line L 1 passing through ostint (H) and the cost value C ostint (E).
- various local minimum values are detected, for example, when detecting by approximation using a quadratic function instead of the method using linear approximation using a linear function.
- the method can be widely applied.
- x be the continuous direction of these three points B, E, and H
- y be the cost value of these three points B, E, and H. Is obtained by substituting This can be done by solving the cubic system of equations, obtaining coefficients a, b, and c to obtain a quadratic curve equation, and detecting the minimum value from this quadratic curve equation.
- the inter-mode decision circuit 43 can obtain each prediction mode obtained in this way. By comparing the estimated cost value C ostest (Mode) of the cost value, the prediction mode with the smallest possible value C ostest (Mode) is determined, and the optimum mode for intra prediction is detected. In the detection of the optimum mode, the reference frame described above with reference to FIG. 3, all motion compensation blocks having different sizes described with reference to FIG. 4, and the optimum mode for each macroblock based on combinations of prediction directions are used. is there . The inter mode determination circuit 43 notifies the fine motion prediction circuit 44 of the determination result.
- the intra-inter mode decision circuit 4 5 performs the following processing to obtain a cost value that is the smallest of the various cost values output from the intra prediction circuit 5 and the motion prediction / compensation circuit 4 1.
- the optimal mode is detected by selecting and the encoding by intra prediction and inter prediction is determined. Also, when encoding is performed by inter prediction based on the judgment result, the corresponding optimal mode is notified to the fine motion prediction circuit 4 4 of the motion prediction / compensation circuit 4 1 and detected by this fine motion prediction circuit 4 4. Instructs the output of the predicted value in the optimal mode or the output of the predicted value in the direct mode. On the other hand, when encoding is performed by intra prediction, the optimum mode related to the intra prediction is notified to the intra prediction circuit 5 to instruct the output of the predicted value.
- Fig. 1 3 shows a series of processing of the encoding device 40 related to detection of these optimum modes It is a flowchart which shows a procedure.
- the encoding device 40 performs this processing procedure for each macroblock, moves from step SP 1 to step SP 2, performs motion prediction with integer pixel accuracy for all of the inter prediction modes that become candidates, In the following step SP3, the cost value is calculated using the motion vector with integer pixel accuracy based on the motion prediction in step SP1.
- step SP 4 the expected value of the cost value using this cost value is calculated as described above, and in the subsequent step SP 5, the minimum value is detected from the calculated value in this step SP 4.
- the optimum inter prediction mode is detected.
- the cost value is calculated by the trap calculation process using the actual FIR filter in this optimal inter prediction mode.
- step SP 7 cost values are calculated in all modes for the direct mode, the intra 4 X 4 prediction mode, and the intra 16 X 1 6 prediction mode, and in step SP 8, the step SP 6, After selecting the optimal mode by comparing the cost values calculated by SP 7, the process proceeds to step SP 9 to end the series of processing.
- the video signal SV sequentially input is converted into image data D 1 by the analog-digital conversion circuit 2, and this image data D 1 is arranged on the screen.
- the data is rearranged in the processing order by the replacement buffer 3 and input to the subtraction circuit 4.
- the image data D 1 is subtracted from the prediction values obtained by intra prediction and inter prediction to generate subtraction data D 2
- this subtraction data D 2 is converted into an orthogonal transformation circuit 7, a quantization circuit 8, a lossless code.
- the encoded data D4 is recorded on a recording medium by a recording system, for example.
- the output data of the quantization circuit 8 is decoded into image data and recorded as a reference image in the frame memory 16, and the motion prediction / compensation circuit 4 1 and the intra prediction circuit 5 perform inter prediction and intra prediction based on the reference image.
- the predicted value of is generated.
- the image data D 1 is converted into a cost by a cost function indicating the coding efficiency in each of the inter prediction and intra prediction modes in the motion prediction / compensation circuit 4 1 and the intra prediction circuit 5.
- the optimum mode most suitable for the encoding process is detected by comparing the cost values in each prediction mode.
- the optimum mode is detected by comparing the cost values, and the optimum prediction mode is finally detected by the intra mode determination circuit 45 from these optimum modes.
- the encoding device 40 determines whether to perform the encoding process using the intra prediction or the inter prediction using the optimum prediction mode. In this case, the prediction value in the optimum mode is generated by the intra prediction circuit 5 and output to the subtraction circuit 4.
- a prediction value in the optimal mode is generated by the motion prediction / compensation circuit 41 and output to the subtraction circuit 4.
- the encoding device 40 detects the optimum mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes by comparing the cost values using the cost function indicating the coding efficiency.
- the image data D 1 is sequentially encoded in the optimum mode.
- the cost value of each prediction mode is calculated with integer pixel accuracy that does not require arithmetic processing using such an FIR filter, and the optimum mode of the inter prediction mode is calculated based on this cost value. Is detected.
- a cost value with an accuracy of 1 pixel or less is calculated, and the optimal mode between the intra prediction mode and the inter prediction mode is detected.
- only the optimal mode of inter prediction can be detected by detecting the optimal mode of multiple inter prediction modes and multiple inter prediction modes only by executing the calculation process using the FIR filter. Therefore, when the optimal mode is detected by the cost function and the image data is encoded, the optimal mode can be detected by a simple process.
- the cost value based on this inter prediction and the cost value based on multiple intra prediction modes are the intra-inter mode decision circuit 4
- the optimal mode is detected by judging 5 and the optimal mode of the inter prediction mode is detected based on the cost value with the accuracy of 1 pixel or more, and the cost value with the accuracy of 1 pixel or less by this optimal mode. By means of this, the optimum mode of the intra prediction mode and the inter prediction mode is detected.
- the optimal inter prediction mode is determined from the encoding process using a combination of multiple block sizes of different sizes, different reference frames, forward prediction, backward prediction, and bidirectional prediction.
- the optimal mode of the intra prediction mode and the inter prediction mode is detected from the cost value of the optimal inter prediction mode.
- the cost value is obtained with an accuracy of 1 pixel or less for each inter prediction mode based on the cost value based on the coarse motion compensation position.
- the estimated value of the cost value at the motion compensation position that is the minimum value is calculated, and the optimum mode in the multiple inter prediction modes is detected by comparing the estimated values.
- this expected value is calculated so that a practically sufficient accuracy can be secured. It is possible to detect the optimum mode of the inter prediction mode with simple processing by setting various arithmetic processing to be provided, and accordingly, the arithmetic processing as a whole can be simplified.
- the coarse motion prediction circuit 4 2 uses the coarse motion compensation position. Along with the cost value, a cost value is calculated at the compensation position around this coarse motion compensation position, and the intermode decision circuit 43 uses the cost value of this coarse motion compensation position and the cost value of the surrounding compensation position.
- the expected value is calculated by reflecting the change of the cost value due to the motion compensation position, and the inter prediction mode is detected as the optimum mode based on the original cost value. Can be accurately detected.
- the inter-mode determination circuit 43 it is minimal in the direction connecting the coarse motion compensation position and the two surrounding compensation positions that are opposed to each other with the coarse motion compensation position in between.
- the local minimum value of the cost value By detecting the local minimum value of the cost value by expressing the change of the cost value by a linear function in which the sign of the slope changes at the point, and by detecting the expected value based on the local minimum value of the cost value, it is possible to perform simple processing.
- the optimum mode can be detected with sufficient accuracy for practical use.
- the expected value is detected using a quadratic function instead of the expected value using such a linear function, the optimum mode can be detected with even higher accuracy.
- the accuracy can be further improved.
- the optimal mode can be detected.
- the predicted motion vector is set by using the motion vector of the adjacent motion compensation block that has already been encoded, and the motion vector information is transmitted based on the predicted motion vector. Then, it is necessary to calculate the cost value by setting the predicted motion vector for all motion vectors that are predicted to be set in this adjacent motion compensation block. However, this will increase the number of cost value calculations exponentially. As a result, in this case, the motion vector associated with the coarse motion compensation position is set as the predicted motion vector, and the cost value is calculated, thereby ensuring sufficient practical accuracy and further simplifying the processing. be able to.
- the inter prediction mode is based on the cost value with integer pixel accuracy.
- the optimal mode is detected by the cost function by detecting the optimal mode of the inter prediction mode and the inter prediction mode by calculating the cost value with an accuracy of 1 pixel or less for this optimal mode.
- the optimum mode can be detected by simple processing.
- inter prediction modes are inter prediction encoding processing in which motion compensation is performed using a plurality of block sizes having different sizes
- inter prediction encoding processing in which motion compensation is performed using different reference frames In the case of inter prediction encoding processing by forward prediction, backward prediction and bidirectional prediction, the optimum mode can be detected by a simple process.
- the estimated cost value at the motion compensation position where the cost value is the minimum value with an accuracy of 1 pixel or less is calculated.
- a cost value is calculated at a compensation position around the coarse motion compensation position, along with a cost value based on the coarse motion compensation position.
- the expected value is detected by reflecting the change in the cost value due to the motion compensation position, and the optimal mode is selected based on the original cost value.
- the inter prediction mode detected as can be accurately detected.
- the sign of the slope is switched at the minimum point in the direction connecting the coarse motion compensation position and the two surrounding compensation positions facing each other with the coarse motion compensation position in between.
- the cost value is calculated based on the coarse motion compensation position with 1 pixel accuracy and the optimum mode of the inter prediction mode is detected is described.
- the present invention is not limited to this, and is practically sufficient. If a high level of accuracy can be ensured, a rough cost value may be calculated with an integer pixel accuracy of two or more pixels.
- the present invention is not limited to this, and by comparing cost values using cost functions indicating encoding efficiency, The present invention can be widely applied when image data is encoded by detecting the optimum mode for each macroblock from a plurality of intra prediction modes and a plurality of inter prediction modes.
- the present invention is not limited to this, and can also be applied to the case where image data is processed by software.
- the encoding processing and decoding processing programs related to such software for example, when provided by a network such as the Internet, when provided by various recording media such as an optical disk, a magnetic disk, and a memory card, Can be widely applied.
- the present invention relates to an encoding device, an encoding method, an encoding method program, and a recording medium on which an encoding method program is recorded, and is applied to a video camera, an electronic still camera, a monitoring device, etc. be able to.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP20050778334 EP1802125A1 (en) | 2004-09-28 | 2005-09-02 | Encoder, encoding method, program of encoding method and recording medium wherein program of encoding method is recorded |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004280843A JP2006100871A (ja) | 2004-09-28 | 2004-09-28 | 符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体 |
JP2004-280843 | 2004-09-28 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2006035584A1 true WO2006035584A1 (ja) | 2006-04-06 |
Family
ID=36118737
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/016571 WO2006035584A1 (ja) | 2004-09-28 | 2005-09-02 | 符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体 |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1802125A1 (ja) |
JP (1) | JP2006100871A (ja) |
KR (1) | KR20070057212A (ja) |
CN (1) | CN101032173A (ja) |
WO (1) | WO2006035584A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020073937A1 (en) * | 2018-10-11 | 2020-04-16 | Mediatek Inc. | Intra prediction for multi-hypothesis |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4908180B2 (ja) | 2006-12-20 | 2012-04-04 | 株式会社東芝 | 動画像符号化装置 |
JP4650461B2 (ja) * | 2007-07-13 | 2011-03-16 | ソニー株式会社 | 符号化装置、符号化方法、プログラム、及び記録媒体 |
US8228991B2 (en) | 2007-09-20 | 2012-07-24 | Harmonic Inc. | System and method for adaptive video compression motion compensation |
CN101867818B (zh) * | 2008-06-06 | 2012-08-29 | 浙江大学 | 宏块模式的选择方法和装置 |
JP2009027762A (ja) * | 2008-11-07 | 2009-02-05 | Renesas Technology Corp | 動き補償画像符号化装置及び符号化方法 |
JP5686499B2 (ja) * | 2009-01-22 | 2015-03-18 | 株式会社Nttドコモ | 画像予測符号化装置、方法及びプログラム、画像予測復号装置、方法及びプログラム、並びに、符号化・復号システム及び方法 |
KR101377530B1 (ko) | 2009-08-21 | 2014-03-27 | 에스케이텔레콤 주식회사 | 적응적 움직임 벡터 해상도를 이용한 영상 부호화/복호화 방법 및 장치 |
KR101441863B1 (ko) * | 2009-11-25 | 2014-09-23 | 에스케이텔레콤 주식회사 | 움직임 추정/보상을 위한 적응적 필터 생성 장치, 방법 및 기록 매체 |
US20120250767A1 (en) * | 2009-12-15 | 2012-10-04 | Ferran Valldosera | Method and apparatus for bi-directional prediction within p-slices |
CN103081471B (zh) * | 2010-07-15 | 2017-06-06 | 夏普株式会社 | 帧内预测模式估计装置、编码装置、以及解码装置 |
KR20120012385A (ko) * | 2010-07-31 | 2012-02-09 | 오수미 | 인트라 예측 부호화 장치 |
CN105812829B (zh) * | 2016-03-10 | 2018-11-02 | 福州瑞芯微电子股份有限公司 | 一种向量场外插的错误隐藏方法和装置 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05110927A (ja) * | 1991-10-11 | 1993-04-30 | Sony Corp | 画像の動きベクトル検出装置 |
JPH06326980A (ja) * | 1993-04-08 | 1994-11-25 | Sony United Kingdom Ltd | 動き補償映像信号処理方式 |
JPH09187013A (ja) * | 1995-12-28 | 1997-07-15 | Sony Corp | 動きベクトル検出装置および方法 |
JPH11112871A (ja) * | 1997-09-30 | 1999-04-23 | Sony Corp | 画像抜き出し装置および画像抜き出し方法、画像符号化装置および画像符号化方法、画像復号装置および画像復号方法、画像記録装置および画像記録方法、画像再生装置および画像再生方法、並びに記録媒体 |
JP2003284091A (ja) * | 2002-03-25 | 2003-10-03 | Toshiba Corp | 動画像符号化方法及び動画像符号化装置 |
JP2004241957A (ja) * | 2003-02-05 | 2004-08-26 | Sony Corp | 画像処理装置および符号化装置とそれらの方法 |
-
2004
- 2004-09-28 JP JP2004280843A patent/JP2006100871A/ja active Pending
-
2005
- 2005-09-02 WO PCT/JP2005/016571 patent/WO2006035584A1/ja not_active Application Discontinuation
- 2005-09-02 KR KR20077007028A patent/KR20070057212A/ko not_active Application Discontinuation
- 2005-09-02 CN CNA2005800328956A patent/CN101032173A/zh active Pending
- 2005-09-02 EP EP20050778334 patent/EP1802125A1/en not_active Withdrawn
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05110927A (ja) * | 1991-10-11 | 1993-04-30 | Sony Corp | 画像の動きベクトル検出装置 |
JPH06326980A (ja) * | 1993-04-08 | 1994-11-25 | Sony United Kingdom Ltd | 動き補償映像信号処理方式 |
JPH09187013A (ja) * | 1995-12-28 | 1997-07-15 | Sony Corp | 動きベクトル検出装置および方法 |
JPH11112871A (ja) * | 1997-09-30 | 1999-04-23 | Sony Corp | 画像抜き出し装置および画像抜き出し方法、画像符号化装置および画像符号化方法、画像復号装置および画像復号方法、画像記録装置および画像記録方法、画像再生装置および画像再生方法、並びに記録媒体 |
JP2003284091A (ja) * | 2002-03-25 | 2003-10-03 | Toshiba Corp | 動画像符号化方法及び動画像符号化装置 |
JP2004241957A (ja) * | 2003-02-05 | 2004-08-26 | Sony Corp | 画像処理装置および符号化装置とそれらの方法 |
Non-Patent Citations (1)
Title |
---|
"Joint Model Number 1, Revision 1(JM-1rl).", 3 December 2001 (2001-12-03) - 7 December 2001 (2001-12-07), pages 54 - 61, XP002993312 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020073937A1 (en) * | 2018-10-11 | 2020-04-16 | Mediatek Inc. | Intra prediction for multi-hypothesis |
Also Published As
Publication number | Publication date |
---|---|
JP2006100871A (ja) | 2006-04-13 |
KR20070057212A (ko) | 2007-06-04 |
CN101032173A (zh) | 2007-09-05 |
EP1802125A1 (en) | 2007-06-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2006035584A1 (ja) | 符号化装置、符号化方法、符号化方法のプログラム及び符号化方法のプログラムを記録した記録媒体 | |
JP3879741B2 (ja) | 画像情報符号化装置および画像情報符号化方法 | |
KR100950743B1 (ko) | 화상 정보 부호화 장치 및 방법, 및 화상 정보 복호 장치및 방법 | |
RU2665877C2 (ru) | Способ и устройство обработки изображения | |
JP5018085B2 (ja) | 動き予測補償方法及び動き予測補償装置 | |
US20110176614A1 (en) | Image processing device and method, and program | |
US11758179B2 (en) | Image encoding method, image encoding device, image decoding method, and image decoding device | |
US20090092189A1 (en) | Movement prediction method and movement prediction apparatus | |
KR20050074286A (ko) | 화상 부호화 장치, 화상 부호화 방법 및 화상 부호화프로그램 | |
JP2010028220A (ja) | 動きベクトル検出装置、動きベクトル検出方法、画像符号化装置及びプログラム | |
JP2007251815A (ja) | 再符号化装置及び再符号化用プログラム | |
Sugimoto et al. | Description of video coding technology proposal by Mitsubishi Electric | |
JP2003348595A (ja) | 画像処理装置および方法、記録媒体、並びにプログラム | |
JP4281667B2 (ja) | 画像符号化装置 | |
JP6181242B2 (ja) | 画像復号化方法 | |
JP5951915B2 (ja) | 画像復号化方法 | |
JP5946980B1 (ja) | 画像復号化方法 | |
JP5911982B2 (ja) | 画像復号化方法 | |
EP4441999A1 (en) | Motion compensation considering out-of-boundary conditions in video coding | |
JP5750191B2 (ja) | 画像復号化方法 | |
WO2023141177A1 (en) | Motion compensation considering out-of-boundary conditions in video coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 200580032895.6 Country of ref document: CN Ref document number: 1020077007028 Country of ref document: KR |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2005778334 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2005778334 Country of ref document: EP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2005778334 Country of ref document: EP |