US20090110077A1 - Image coding device, image coding method, and image coding integrated circuit - Google Patents
Image coding device, image coding method, and image coding integrated circuit Download PDFInfo
- Publication number
- US20090110077A1 US20090110077A1 US12/301,630 US30163007A US2009110077A1 US 20090110077 A1 US20090110077 A1 US 20090110077A1 US 30163007 A US30163007 A US 30163007A US 2009110077 A1 US2009110077 A1 US 2009110077A1
- Authority
- US
- United States
- Prior art keywords
- image
- block
- designation information
- prediction
- blocks
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
- H04N19/139—Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/109—Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
- H04N19/43—Hardware specially adapted for motion estimation or compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
- H04N19/436—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- the present invention relates to an image encoding apparatus which compression encodes images, and in particular to speeding-up of compression encoding processing.
- MPEG4AVC Motion Pictures Experts Group 4 Advanced Video Coding
- MPEG4AVC Motion Pictures Experts Group 4 Advanced Video Coding
- both of these encoding modes calculate the motion vector of the macroblock being encoded (hereinafter, referred to as “encoding target block”) and the reference image number (hereinafter, a set of a motion vector and a reference image number are referred to as “motion information”) identifying the reference image, based on the motion information of a plurality of macroblocks (hereinafter, referred to as “adjacent blocks”) adjacent to the encoding target block, the reference image being equivalent in size with the encoding target block and used to predict an image of the encoding target block.
- these encoding modes encode the difference information which is information showing the difference between the prediction image, of the encoding target block, predicted based on the calculated motion information, and the original image of the encoding target block.
- the motion information of the macroblock encoded in the skip mode or the spatial direct mode (hereinafter, also referred to as “skip mode or the like”) can be calculated using the motion information of the adjacent blocks when the macroblock is decoded, it is not necessary to encode the motion information of the encoding target block.
- the difference information indicating the difference between the prediction image of the encoding target block and the original image of the encoding target block. Accordingly, code amounts can be reduced by performing encoding in the skip mode or the like.
- One conventionally known technique processes macroblocks in parallel using a pipeline structure in order to speed up encoding processing of an image.
- the motion information of the adjacent blocks are required.
- the first stage sequentially calculating the motion information of each macroblock encoded in the skip mode or the like, and the second stage, with respect to a macroblock which has been processed by the first stage, judging whether or not to perform the encoding in the skip mode or the like based on the code amount in the skip mode.
- the processing by the first stage starts, if the processing by the second stage has not been completed on all the adjacent blocks of the encoding target block, the encoding mode has not been determined for all of the adjacent blocks.
- the motion information has not been determined for all of the adjacent blocks, and thus, in the first stage, the motion information of the encoding target block cannot be calculated in the skip mode or the like.
- Patent Document 1 An example of a technique for solving this problem is adopted by an image encoding apparatus disclosed by Patent Document 1. In the following, the image encoding apparatus according to Patent Document 1 is described.
- the image encoding apparatus encodes the encoding target image in units of macroblocks which are processed in parallel using a pipeline structure. Starting from the upper left macroblock of the encoding target image, the image encoding apparatus processes one row of macroblocks in the horizontal direction, moves on to process the next row, and sequentially processes the macroblocks up to the bottom right macroblock.
- the image encoding apparatus of Patent Document 1 uses a block (hereinafter, referred to as “vicinity block”) on the left of the previous block, for which the motion vector has been calculated, instead of using the previous block, to generate the motion information of the encoding target block in the skip mode.
- the image encoding apparatus confirms whether the calculated motion information of the previous block and the motion information of the vicinity block match each other, and, if these two match each other, is able to encode the encoding target block in the skip mode or the like.
- the image encoding apparatus of Patent Document 1 calculates the motion vector of the encoding target block using the motion information of the vicinity block, thereby realizing speeding-up of the encoding processing without hindering the pipelined parallel processing.
- Patent Document 1 Japanese Patent Publication No.
- Patent Document 1 calculates the motion vector of the encoding target block using the motion information of the vicinity block, it is often the case that the calculated motion information of the previous block and the motion information of the vicinity block do not match each other. In this case, the encoding target block cannot be encoded in the skip mode or the like.
- the present invention is conceived in view of the above problem and aims to provide an image encoding apparatus that is able to realize speeding-up of the encoding processing with use of a method different from the conventional method.
- the image encoding apparatus of the present invention is an image encoding apparatus which compression encodes an image in units of blocks of a predetermined size and comprises: a first processing unit operable to sequentially generate candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing unit operable to, in parallel with generation of the candidate designation information corresponding to the block by the first processing unit, (i) select, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputs a difference signal showing a difference which is between the selected prediction image and an original image of the one of the blocks, and
- the image encoding method used by the image encoding apparatus of the present invention is an image encoding method which compression encodes an image in units of blocks of a predetermined size and comprises: a first processing step of sequentially generating candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing method of, in parallel with generation of the candidate designation information corresponding to the block in the first processing step, (i) selecting, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputting a difference signal showing a difference which is between the selected prediction image and an original image of
- the computer program used in the image encoding apparatus of the present invention is a computer program for causing a computer to perform compression encoding processing which compression encodes an image in units of blocks of a predetermined size and comprises: a first processing step of sequentially generating candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing method of, in parallel with generation of the candidate designation information corresponding to the block in the first processing step, (i) selecting, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputting a difference signal showing a difference which is between the
- the integrated circuit used by the image encoding apparatus of the present invention is an integrated circuit used for image encoding, the integrated circuit compression encoding an image in units of blocks of a predetermined size and comprising: a first processing unit operable to sequentially generate candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing unit operable to, in parallel with generation of the candidate designation information corresponding to the block by the first processing unit, (i) select, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputs a difference signal showing a difference which is between
- the blocks of the predetermined size are macroblocks of n*n pixels, for example, of 16*16 pixels, and the prediction images are of the same size as the blocks.
- the first processing unit generates candidate designation information for each of the blocks based on candidate designation information of a prior block and, (ii) the second processing unit, in parallel with the processing by the first processing unit, for one of the blocks which has been processed by the first processing unit, selects a prediction image from among the plurality of prediction images indicated by the candidate designation information of the one of the blocks, based on the prediction image designation information designating a prediction image for a prior block processed prior to the one of the blocks. Consequently, for each of the blocks, compression encoding processing can be started even when a prediction image has not been determined for the prior block processed prior to the block at the start of the processing by the first processing unit.
- candidate designation information of each block is composed of motion information candidates generated based on motion information candidates of a prior block. This enables encoding in the skip mode or spatial direct mode according to MPEG4AVC, thereby suppressing code amounts.
- the above-described image encoding apparatus may compression encode a plurality of images which are input sequentially, and each candidate designation information may include designation information designating, as one of the candidates for the prediction image of the corresponding block, one of the plurality of prediction images which is generated based on a motion vector detected by searching one of the plurality of images that is other than the image being compression encoded.
- the candidate designation information includes designation information designating a prediction image according to the inter prediction (inter mode).
- the image encoding apparatus realizes compression encoding with suppressed code amounts by selecting for each block, for example, a prediction image which minimizes the code amount, from among the multiple prediction image candidates indicated by the prediction image designation information.
- Each candidate designation information of the above-described image encoding apparatus may include designation information designating, as a plurality of candidates among the candidates for the prediction image of the corresponding block, out of the plurality of prediction images, two or more prediction images generated based on motion vectors of a plurality of adjacent blocks including the prior block of the corresponding block, the plurality of adjacent blocks being located in predetermined positions with respect to the corresponding block.
- the prediction image candidates indicated by candidate designation information include prediction images encoded in the skip mode or the spatial direct mode. Accordingly, the code amount can be suppressed when these prediction images are selected.
- the above-described image encoding apparatus may further comprise a third processing unit operable to cause each candidate designation information to include designation information designating, as one of the candidates for the prediction image of the corresponding block, one of the plurality of prediction images which is generated based on one or more pixel values of one or more pixels located in predetermined positions with respect to the corresponding block, and the second processing unit may select the prediction image for the prior block of the corresponding block, from among the plurality of prediction images indicated by the candidate designation information caused by the third processing unit to have the designation information.
- the candidate designation information includes designation information designating a prediction image candidate in the intra prediction (intra mode).
- the image encoding apparatus realizes compression encoding with suppressed code amounts by selecting for each block, for example, a prediction image which minimizes the code amount, from among the multiple prediction image candidates indicated by the candidate designation information.
- Each candidate designation information of the above-described image encoding apparatus may be generated also based on prediction image designation information of one of the blocks which has been processed by the second processing unit.
- the image encoding apparatus generates the candidate designation information of MBn based on the candidate designation information of MBn ⁇ 1 which is generated based on the candidate designation information of MBn ⁇ 2. In this way, that is to say, by generating the candidate designation information of MBn based on the prediction image designation information of MBn ⁇ 2 which has been processed by the second processing unit, the image encoding apparatus is able to suppress the number of the prediction image candidates indicated by the candidate designation information of MBn.
- Each candidate designation information of the above-described image encoding apparatus may be generated based on information designating only some of the plurality of prediction images which are candidates for the prediction image of the prior block of the corresponding block.
- the image encoding apparatus generates the candidate designation information of the encoding target block based on only some of the prediction image candidates of the previous block. This suppresses the number of the prediction image candidates indicated by the candidate designation information, thereby enabling a reduction in the processing load of the image encoding apparatus.
- the above-described image encoding apparatus may perform processing in a pipeline structure composed of two stages that are a first stage and a second stage.
- the first processing unit processes the first stage and includes: a motion detection unit operable to generate, for each of the blocks, the candidate designation information including designation information designating, as one of the candidates for the prediction image of the block, one of the plurality of prediction images which is generated based on a motion vector detected by searching an image other than the image being compression encoded; and a motion vector speculative calculation unit operable to cause the candidate designation information of the block to include designation information designating, as a plurality of candidates among the candidates for the prediction image of the block, out of the plurality of prediction images, two or more prediction images generated based on motion vectors of a plurality of adjacent blocks including the prior block, the plurality of adjacent blocks being located in predetermined positions with respect to the block, and the second processing unit processes the second stage and includes: a motion vector determination unit operable to, based on the prediction image designation information of the block prior to the one
- the candidate designation information generated for each block by the first processing unit includes designation information designating multiple prediction image candidates of the block based on the candidate designation information, generated by the motion vector speculative calculation unit, of the prior block
- the motion vector determination unit in the second processing unit selects one of the multiple prediction image candidates indicated by the designation information generated by the motion vector speculative calculation unit based on the prediction image designation information of the block prior to the one of the blocks.
- the candidate designation information corresponding to each of the blocks includes designation information generated by the motion detection unit, and the mode judgement unit in the second processing unit selects the prediction image from among (a) the prediction image candidate selected by the motion vector determination unit and (b) the prediction image candidate, among the prediction image candidates indicated by the candidate designation information, indicated by the designation information generated by the motion detection unit. Consequently, by selecting for each block, for example, a prediction image which minimizes the code amount, the image encoding apparatus realizes the compression encoding which suppresses the code amount.
- FIG. 1 is a diagram for explaining a process flow of pipeline processing performed by an image encoding apparatus 100 of a first embodiment of the present invention
- FIG. 2 is a diagram showing a functional structure of the image encoding apparatus 100 of the first embodiment
- FIG. 3 is a flowchart showing operations of the image encoding apparatus 100 of the first embodiment
- FIG. 4 is a diagram showing a functional structure of an image encoding apparatus 200 of a second embodiment
- FIG. 5 is a flowchart showing operations of the image encoding apparatus 200 of the second embodiment
- FIG. 6 is a diagram for explaining relationships between an encoding target block and adjacent pixels.
- FIG. 7 is a diagram for explaining relationship between an encoding target block and adjacent blocks.
- An image encoding apparatus of a first embodiment is a modification of an image encoding apparatus that conforms to the MPEG4AVC standard and encodes an encoding target image in units of macroblocks of a predetermined size (for example, 16*16 pixels) using pipeline processing.
- the image encoding apparatus of the first embodiment selects, for each macroblock, an encoding mode out of three encoding modes, which are the intra mode, inter mode, and skip mode, so as to minimize the code amount of the macroblock, and encodes the macroblock using the selected encoding mode.
- the intra mode is an encoding mode that (i) generates a prediction image of an encoding target block based on pixel values of pixels (hereinafter, referred to as “adjacent pixels”) adjacent to the encoding target block which is the macroblock targeted for encoding, and (ii) encodes the difference information indicating the difference between the generated prediction image and the original image.
- FIG. 6 is a diagram for explaining relationships between the encoding target block and the adjacent pixels.
- the adjacent pixels refer to, with respect to the encoding target block shown in FIG. 6 , 16 pixels on the left (pixels A in the figure) and 16 pixels on the top (pixels B in the figure).
- Generation of the prediction image is performed, upon selection of a prediction mode out of multiple prediction modes (vertical prediction, horizontal prediction, and the like), using the pixel values of the adjacent pixels which correspond with the selected prediction mode.
- pixel values of different adjacent pixels are used in accordance with each prediction mode.
- the prediction image is generated on the assumption that the pixel value of each column of pixels in the encoding target block is equivalent to the pixel value of the pixel in the same column in B in FIG. 6 .
- selection methods of the prediction mode are prior arts and thus not described in detail here, it is possible to apply a method which makes a selection based on prediction modes of macroblocks in the vicinity.
- the inter mode is an encoding mode which (i) detects a motion vector indicating a relative position with respect to a block resembling to the encoding target block, by searching an image (hereinafter, referred to as “search area image”) other than the encoding target image containing the encoding target block, (ii) generates a prediction image of the encoding target block based on motion information composed of the detected motion vector and a reference image number indicating the search area image, and (iii) encodes (a) the difference information indicating the difference between the generated prediction image and the original image of the encoding target block and (b) the motion information.
- search area image an image
- the skip mode is an encoding mode which (i) calculates the motion information of the encoding target block using the motion information of the adjacent blocks which are multiple macroblocks adjacent to the encoding target block, (ii) generates a prediction image of the encoding target block based on the calculated motion information, and (iii) encodes the difference information indicating the difference between the generated prediction image and the original image of the encoding target block.
- the skip mode only the difference information is encoded, and it is not necessary to encode motion information. Accordingly, the code amount can be reduced.
- FIG. 7 is a diagram for explaining relationship between an encoding target block and adjacent blocks.
- macroblocks A to F shown in the figure if the block E is the encoding target block in the skip mode, then the adjacent blocks refer to the blocks B to D.
- the block A replaces the block C as one of the adjacent blocks of the block E.
- the block D becomes the only adjacent block of the block E. Note that, hereinafter, the block D on the left of the encoding target block E is called “the previous block”.
- the motion vector of the block E is determined by finding out the medians of the horizontal components and vertical components of the motion vectors of the blocks B to D. It should be noted that when the adjacent blocks include a macroblock encoded in the intra mode, the motion vector of the block E is calculated assuming that the motion vector of the macroblock is “0”.
- the reference image of the block E in the skip mode is an image displayed immediately preceding the encoding target image which contains the block E, and the reference image number is “0”.
- the reference image number is a number indicating the position (relative position) of the reference image, in terms of display order, with reference to the encoding target image containing the encoding target block. The further the reference image is from the encoding target image in terms of the display order, the larger the reference image number assigned to the reference image is.
- FIG. 1 is a diagram for explaining a process flow of pipeline processing performed by the image encoding apparatus of the first embodiment.
- the image encoding apparatus of the first embodiment (i) selects an encoding mode for each macroblock in the encoding target image such that the code amount of the macroblock is minimized and (ii) encodes the macroblock in the selected encoding mode. Starting from the upper left macroblock, the image encoding apparatus processes a row of macroblocks in the horizontal direction, moves on to the next row to continue processing, and in a similar manner, sequentially processes the macroblocks up to the bottom right macroblock.
- processing by the image encoding apparatus of the first embodiment is a 4-stage pipeline structure composed of the first to fourth stages, and in each time slot (TS 1 , TS 2 , TS 3 , TS 4 , . . . ) processing of the pipeline stages are performed in parallel.
- MBn is encoded in the encoding mode determined in the second stage as the encoding mode for MBn, and an encoded stream is generated.
- the motion information of MBn in the skip mode in the first stage is required.
- the encoding mode of MBn ⁇ 1 is determined upon completion of the processing in the second stage.
- the second-stage processing for MBn ⁇ 1 is about to start, and thus, the encoding mode of MBn ⁇ 1 has not been determined. In other words, the motion information of MBn ⁇ 1 has not been determined.
- the image encoding apparatus of the first embodiment calculates pieces of motion information (hereinafter, referred to as “motion information candidates”) for MBn in the skip mode, the pieces of motion information corresponding to the cases where the encoding mode of MBn ⁇ 1 is assumed to be one of the above-described encoding modes (intra mode, inter mode, and skip mode), respectively.
- motion information candidates pieces of motion information
- intra mode, inter mode, and skip mode the pieces of motion information candidates
- the motion information candidate corresponding to the determined encoding mode for MBn ⁇ 1 is selected as the motion information of MBn in the skip mode.
- the prediction image of MBn in the skip mode is generated based on the selected motion information, and the difference information indicating the difference between the generated prediction image and the original image of MBn is calculated.
- the prediction images of MBn in the intra mode and inter mode are generated, and the difference information each indicating difference between each prediction image and the original image of MBn is calculated. It should be noted that the prediction image of MBn in the inter mode is generated based on the motion information detected in the first stage.
- the code amount in each encoding mode is calculated using the difference information of the encoding mode, and the encoding mode which yields the least code amount is determined as the encoding mode for MBn.
- the image encoding apparatus of the first embodiment allows the encoding of the encoding target block in the skip mode, thereby suppressing the coding amount while maintaining the high-speed encoding processing using the pipeline structure.
- FIG. 2 is a diagram showing the functional structure of the image encoding apparatus 100 .
- the image encoding apparatus 100 includes a motion detection unit 101 , a motion vector speculative calculation unit 102 , a first motion compensation unit 103 , a motion vector determination unit 104 , a second motion compensation unit 105 , an intra prediction unit 106 , a mode judgement unit 107 , an orthogonal transform unit 108 , a quantization unit 109 , an inverse quantization unit 110 , an inverse orthogonal transform unit 111 , an addition unit 112 , a deblocking filter unit 113 , a variable length encoding unit 114 , a DMA controller 115 , and an external memory 116 .
- the image encoding apparatus 100 is equipped with a CPU (Central Processing Unit) and an internal memory, and each function of the above-mentioned motion detection unit 101 , motion vector speculative calculation unit 102 , first motion compensation unit 103 , motion vector determination unit 104 , second motion compensation unit 105 , intra prediction unit 106 , mode judgement unit 107 , orthogonal transform unit 108 , quantization unit 109 , inverse quantization unit 110 , inverse orthogonal transform unit 111 , addition unit 112 , deblocking filter unit 113 , and variable length encoding unit 114 is realized by causing the CPU to execute programs stored in the internal memory.
- a CPU Central Processing Unit
- pipeline processing is configured such that (i) processing by the motion detection unit 101 and processing by the motion vector speculative calculation unit 102 correspond to the first stage, (ii) processing by the first motion compensation unit 103 , processing by the motion vector determination unit 104 , processing by the second motion compensation unit 105 , processing by the intra prediction unit 106 , and processing by the mode judgement unit 107 correspond to the second stage, (iii) processing by the orthogonal transform unit 108 , processing by the quantization unit 109 , processing by the inverse quantization unit 110 , processing by the inverse orthogonal transform unit 111 , and processing by the addition unit 112 correspond to the third stage, and (iv) processing by the deblocking filter unit 113 and processing by the variable length encoding unit 114 correspond to the fourth stage.
- the first-stage processing corresponds to the first processing unit of the present invention
- the second-stage processing corresponds to the second processing unit and the third processing unit of the present invention.
- the motion detection unit 101 (i) detects the motion vector of the encoding target block in the inter mode, and (ii) transmits, to the motion vector speculative calculation unit 102 and the first motion compensation unit 103 , the motion information composed of the detected motion vector and the reference image number indicating the search area image used to detect the motion vector.
- the motion detection unit 101 (i) reads, from the external memory 116 to the internal memory (not depicted) via the DMA controller 115 , the original image of the encoding target block (for example, a macroblock of 16* 16 pixels) and the search area image targeted for searching for the motion vector, (ii) performs block-matching between the read original image of the encoding target block and the read search area image, and (c) detects, by finding a macroblock which is most similar to the original image of the encoding target block, the motion vector indicating the relative position to the found macroblock.
- the original image of the encoding target block for example, a macroblock of 16* 16 pixels
- the search area image targeted for searching for the motion vector performs block-matching between the read original image of the encoding target block and the read search area image
- the search area image is a decrypted image that has been deblocked and stored in the external memory 116 by the deblocking filter unit 113 , which is described later.
- the search area image is described as an image immediately preceding, such as in the display order, the encoding target image containing the encoding target block.
- the motion detection unit 101 Upon detection of the motion vector, the motion detection unit 101 transmits the detected motion vector and the reference image number indicating the search area image, that is to say, the motion information, to the motion vector speculative calculation unit 102 and the first motion compensation unit 103 .
- the motion vector speculative calculation unit 102 calculates motion information candidates in the skip mode for the encoding target block, and transmits the calculated motion information candidates of the encoding target block to the vector determination unit 104 .
- the motion vector speculative calculation unit 102 calculates the motion information candidates of the encoding target block based on (a) the motion information of the adjacent blocks of the encoding target block except for the previous block, the motion information being stored in the internal memory by the mode judgement unit 107 which is described later, and (b) the motion information each of which corresponds to one of the encoding modes selectable by the previous block whose encoding mode has not been determined.
- the motion vector of the encoding target block is calculated based on the motion vectors of the adjacent blocks including the previous block on the assumption that the motion vector of the previous block is “0”.
- the motion information calculated under the assumption that the previous block is the intra mode is referred to as “m 1 ”.
- the motion vector of the encoding target block is calculated based on (a) the motion vector of the previous block in the inter mode, which has been received from the motion detection unit 101 and (b) the motion vectors of the other adjacent blocks.
- the motion information calculated under the assumption that the encoding mode of the previous block is the inter mode is referred to as “m 2 ”.
- the previous block When it is assumed that the encoding mode of the previous block is the skip mode, the previous block also has three possibilities for its motion information in the skip mode.
- the motion vector speculative calculation unit 102 with use of mode information which indicates the encoding mode determined for the macroblock on the left of the previous block and has been received from the mode judgement unit 107 , selects, out of the three motion information candidates for the motion information of the previous block in the skip mode, a motion vector corresponding to the encoding mode determined for the macroblock on the left of the previous block as the motion vector of the previous block in the skip mode.
- the motion vector speculative calculation unit 102 then calculates the motion vector of the encoding target block based on (a) the motion vector in the motion information of the previous block in the skip mode and (b) the motion vectors of the other adjacent blocks.
- m 3 the motion information calculated under the assumption that the encoding mode of the previous block is the skip mode
- the motion vector speculative calculation unit 102 stores, in the internal memory, the motion information candidates (m 1 to m 3 ) calculated for the encoding target block.
- the stored motion information candidates (m 1 to m 3 ) are used when calculating the motion information “m 3 ” for the next macroblock to be processed.
- the motion vector speculative calculation unit 102 reads the reference images indicated by the calculated motion information candidates (m 1 to m 3 ) from the external memory 116 to the internal memory via the DMA controller 115 . These reference images are used when the second motion compensation unit 105 generates the prediction image of the encoding target block in the skip mode, as described later.
- the first motion compensation unit 103 (i) generates the prediction image based on the motion information, received from the motion detection unit 101 , of the encoding target block in the inter mode, (ii) calculates the difference information indicating the difference between the prediction image and the original image of the encoding target block stored in the internal memory, and (iii) transmits the prediction image, difference information, and motion information to the mode judgement unit 107 .
- the prediction image is a block indicated by the motion vector, the block being within the search area image indicated by the reference mode number received from the motion detection unit 101 .
- the motion vector determination unit 104 (i) selects, from among the three motion information candidates (m 1 to m 3 ), received from the motion vector speculative calculation unit 102 , of the encoding target block in the skip mode, a piece of motion information corresponding to the encoding mode of the previous block indicated by the mode information received from the mode judgement unit 107 , and (ii) transmits the selected motion information candidate as the motion information of the encoding target block in the skip mode to the second motion compensation unit 105 .
- the mode judgment unit 107 is described below.
- m 2 is transmitted to the second motion compensation unit 105 as the motion information of the encoding target block in the skip mode.
- the second motion compensation unit 105 (i) generates the prediction image based on the motion information of the encoding target block in the skip mode received from the motion vector determination unit 104 , (ii) calculates the difference information indicating the difference between the prediction image and the original image of the encoding target block stored in the internal memory, and (iii) transmits, to the mode judgement unit 107 , the prediction image, difference information, and motion information in the skip mode which has been received from the motion vector determination unit 104 .
- the prediction image is the reference image indicated by the motion information received from the motion vector determination unit 104 .
- the intra prediction unit 106 (i) reads the images of the adjacent pixels of the encoding target block from the external memory 116 to the internal memory via the DMA controller 115 , (ii) generates the prediction image of the encoding target block in the intra mode based on the pixel values of the adjacent pixels, (iii) calculates the difference information indicating the difference between the prediction image and the original image of the encoding target block stored in the internal memory, and (iv) transmits the prediction image, difference information, and the prediction mode information which indicates the prediction mode used to generate the prediction image, to the mode judgement unit 107 .
- the images of the adjacent pixels read from the external memory 116 by the intra prediction unit 106 is a decrypted image stored into the external memory 116 by the addition unit 112 , which is described later.
- the mode judgement unit 107 selects, as the encoding mode of the encoding target block, an encoding mode which minimizes the code amount, based on the difference information in the inter mode received from the first motion compensation unit 103 , the difference information in the skip mode received from the second motion compensation unit 105 , and the difference information in the intra mode received from the intra prediction unit 106 . It should be noted that when the encoding is performed in the inter mode, the code amount includes the code amount of the motion information, in addition to the code amount of the above-mentioned difference information.
- the mode judgement unit 107 (i) transmits the mode information indicating the determined encoding mode to the vector speculative calculation unit 102 and the motion vector determination unit 104 , (ii) transmits the difference information corresponding to the determined encoding mode to the orthogonal transform unit 108 , (iii) transmits the prediction image corresponding to the determined encoding mode to the addition unit 112 , and (iv) transmits the mode information and information (the motion information when the inter mode is selected, the prediction mode information when the intra mode is selected, and the like) in accordance with the determined encoding mode, to the variable length encoding unit 114 .
- the mode judgement unit 107 stores the motion information (the motion information in the inter mode or the motion information in the skip mode) in accordance with the determined encoding mode into the internal memory.
- the orthogonal transform unit 108 performs orthogonal transform processing such as discrete cosine transform with respect to the difference information received from the mode judgement unit 107 , and transmits coefficient information, which is the result of the processing, to the quantization unit 109 .
- the quantization unit 109 performs quantization processing with respect to the coefficient information received from the orthogonal transform unit 108 , and transmits the quantized coefficient information to the inverse quantization unit 110 and the variable length encoding unit 114 .
- the inverse quantization unit 110 performs inverse quantization processing with respect to the quantized coefficient information received from the quantization unit 109 and transmits coefficient information, which is the result of the processing, to the inverse orthogonal transform unit 111 .
- the inverse orthogonal transform unit 111 performs inverse orthogonal transform processing with respect to the coefficient information received from the inverse quantization unit 110 , and transmits difference information, which is the result of the processing, to the addition unit 112 .
- the addition unit 112 (i) generates a decrypted image of the encoding target block by adding the prediction image received from the mode judgement unit 107 and the difference information received from the inverse orthogonal transform unit 111 , and (ii) transmits the generated decrypted image to the deblocking filter unit 113 , as well as storing the generated decrypted image into the external memory 116 via the DMA controller 115 .
- the deblocking filter unit 113 with respect to the decrypted image received from the addition unit 112 , performs block noise removal processing (hereinafter, referred to as “deblocking processing”), and stores the deblocking-processed decrypted image into the external memory 116 via the DMA controller 115 .
- deblocking processing block noise removal processing
- variable length encoding unit 114 (i) performs variable length encoding processing, arithmetic encoding processing, or the like with respect to the quantitized coefficient information received from the quantization unit 109 and (ii) stores the processed encoded stream into the external memory 116 via the DMA controller 115 . It should be noted that the mode information received from the mode judgement unit 107 and the information in accordance with the determined encoding mode are used for generating the header of the encoded stream.
- the DMA controller 115 is a general DMA controller which arbitrates access requests from the respective units to the external memory 116 and transmits data between the external memory 116 and the internal memory.
- the external memory 116 is a memory composed of a DRAM and the like storing each encoding target block, the decrypted image stored by the addition unit 112 , the deblocking-processed decrypted image stored by the deblocking filter unit 113 , and the encoded stream stored by the variable length encoding unit 114 .
- FIG. 3 is a flowchart showing the operations of the image encoding apparatus 100 .
- processing by the image encoding apparatus 100 is described in pipeline stage units. It should be noted that the pipeline stages from the first stage to the fourth stage are processed in parallel with one another, each of the pipeline stage targeting a different macroblock for processing.
- the motion detection unit 101 reads the original image of the encoding target block and the search area image which is the search target of the motion vector, from the external memory 116 to the internal memory via the DMA controller 115 (steps S 01 and S 02 ).
- the motion detection unit 101 (i) detects the motion vector by searching the search area image for a macroblock which is most similar to the original image of the encoding target block (step S 03 ) and (ii) transmits the detected motion vector and the reference image number indicating the search area image, that is to say, the motion information, to the first motion compensation unit 103 .
- the motion vector speculative calculation unit 102 calculates the motion information candidates (m 1 to m 3 ) assuming the encoding mode of the previous block of the encoding target block to be the intra mode, inter mode, and skip mode, respectively (step S 04 ), and transmits the calculated motion information candidates (m 1 to m 3 ) to the motion vector determination unit 104 .
- the motion vector speculative calculation unit 102 reads, from the external memory 116 to the internal memory via the DMA controller 115 (step S 05 ), the reference images indicated by the motion information candidates (m 1 to m 3 ).
- step S 06 and S 07 the processing by the first motion compensation unit 103 (steps S 06 and S 07 ), (b) the processing by the motion vector determination unit 104 and the second motion compensation unit 105 (steps S 08 to S 10 ), and (c) the processing by the intra prediction unit 106 (steps S 11 and S 12 ) are executed in parallel with one another.
- the processing by each unit is described below.
- the first motion compensation unit 103 generates a prediction image of the encoding target block in the inter mode based on the motion information received from the motion detection unit 101 (step S 06 ).
- the first motion compensation unit 103 calculates the difference information indicating the difference between the generated prediction image and the original image of the encoding target block stored in the internal memory (step S 07 ) and (ii) transmits the calculated difference information, generated prediction image, and motion information received from the motion detection unit 101 , to the mode judgement unit 107 .
- the motion vector determination unit 104 selects, from among the three motion information candidates (m 1 to m 3 ) of the encoding target block in the skip mode received from the motion vector speculative calculation unit 102 , as the motion information of the encoding target block in the skip mode, the motion information candidate corresponding to the encoding mode of the previous block indicated by the mode information received from the mode judgement unit 107 (step S 08 ), and (ii) transmits the selected motion information to the second motion compensation unit 105 .
- the second motion compensation unit 105 generates a prediction image of the encoding target block in the skip mode based on the motion information received from the motion vector determination unit 104 (step S 09 ).
- the second motion compensation unit 105 (i) calculates the difference information indicating the difference between the generated prediction image and the original image of the encoding target block stored in the internal memory (step S 10 ), and (ii) transmits the calculated difference information, generated prediction image, and the motion information in the skip mode received from the motion vector determination unit 104 , to the mode judgement unit 107 .
- the intra prediction unit 106 reads the images of the adjacent pixels of the encoding target block from the external memory 116 into the internal memory via the DMA controller 115 , and generates a prediction image of the encoding target block in the intra mode based on the pixel values of the adjacent pixels (step S 11 ).
- the intra prediction unit 106 calculates the difference information indicating the difference between the generated prediction image and the original image of the encoding target block stored in the internal memory (step S 12 ), and transmits the calculated difference information, generated prediction image, and the prediction mode information, to the mode judgement unit 107 .
- the mode judgement unit 107 selects an encoding mode which minimizes the code amount, from among the encoding modes (the inter mode, skip mode, and intra mode) as the encoding mode of the encoding target block, based on the difference information received from the first motion compensation unit 103 , second motion compensation unit 105 , and intra prediction unit 106 , respectively (step S 13 ).
- the mode judgement unit 107 (i) transmits the mode information indicating the determined encoding mode to the vector speculative calculation unit 102 and the motion vector determination unit 104 , (ii) transmits the difference information corresponding to the determined encoding mode to the orthogonal transform unit 108 , (iii) transmits the prediction image corresponding to the determined encoding mode to the addition unit 112 , and (iv) transmits the mode information and information (the motion information when the inter mode is selected, the prediction mode information when the intra mode is selected, and the like) in accordance with the determined encoding mode, to the variable length encoding unit 114 .
- the orthogonal transform unit 108 performs orthogonal transform processing such as discrete cosine transform with respect to the difference information received from the mode judgement unit 107 , and transmits coefficient information, which is the result of the processing, to the quantization unit 109 (step S 14 ).
- the quantization unit 109 performs quantization processing with respect to the coefficient information received from the orthogonal transform unit 108 , and transmits the quantized coefficient information to the inverse quantization unit 110 and the variable length encoding unit 114 (step S 15 ).
- the inverse quantization unit 110 performs inverse quantization processing with respect to the quantized coefficient information received from the quantization unit 109 , and transmits coefficient information, which is the result of the processing, to the inverse orthogonal transform unit 111 (step S 16 ).
- the inverse orthogonal transform unit 111 performs inverse orthogonal transform processing with respect to the coefficient information received from the inverse quantization unit 110 , and transmits difference information, which is the result of the processing, to the addition unit 112 (step S 17 ).
- step S 18 to S 20 the processing by the addition unit 112 and deblocking filter unit 113 (steps S 18 to S 20 ) and the processing by the variable length encoding unit 114 (steps S 21 and S 22 ) are executed in parallel.
- the processing by these units are described below.
- the addition unit 112 (i) generates the decrypted image of the encoding target block by adding the prediction image received from the mode judgement unit 107 and the difference information received from the inverse orthogonal transform unit 111 (step S 18 ), and (ii) transmits the generated decrypted image to the deblocking filter unit 113 , as well as storing the generated decrypted image into the external memory 116 via the DMA controller 115 .
- the deblocking filter unit 113 performs the deblocking processing with respect to the decrypted image received from the addition unit 112 (step S 19 ), and stores the deblocking-processed decrypted image into the external memory 116 via the DMA controller 115 (step S 20 ).
- variable length encoding unit 114 (i) performs variable length encoding processing, arithmetic encoding processing, or the like with respect to the quantitized coefficient information received from the quantization unit 109 (step S 21 ) and (ii) stores the processed encoded stream into the external memory 116 via the DMA controller 115 (step S 22 ).
- the image encoding apparatus generates the motion information of the encoding target block in the skip mode using motion information of a vicinity block instead of using the motion information of the previous block. Accordingly, although the high-speed encoding processing using the pipeline structure can be maintained, it is not possible to perform encoding in the skip mode in a case where the motion information of the previous block, which is determined subsequently, differs from the motion information of the vicinity block.
- the image encoding apparatus 100 calculates the motion information candidates of the encoding target block in the skip mode based on all the motion information selectable by the previous block of the encoding target block. Following that, upon determination of the motion information of the previous block, the image encoding apparatus 100 selects, out of the calculated motion information candidates, a motion information candidate which corresponds with the determined motion information of the previous block as the motion information of the encoding target block in the skip mode.
- each macroblock can be encoded in the skip mode reliably without exception, thereby enabling a reduction in the code amount, while the high-speed encoding processing with the pipeline structure is maintained.
- the motion vector speculative calculation unit 102 of the first embodiment (i) calculates all the motion information candidates of the encoding target block in the skip mode based on all the encoding modes (intra mode, inter mode, and skip mode) selectable by the previous block, and (ii) acquires all the reference images indicated by the above-mentioned respective motion information candidates.
- processing time of the motion vector speculative calculation unit 102 since processing time for transferring the reference images, requires a large amount of time. Moreover, at the same time, a high-capacity internal memory is required to store all the reference images.
- the motion vector speculative calculation unit of the second embodiment (i) calculates motion information candidates of the encoding target block in the skip mode based on motion information candidates corresponding to, among all the encoding modes selectable by the previous block, the encoding modes which are effective in improving image quality and encoding efficiency, namely, the inter mode and skip mode, and (ii) acquires the reference images indicated by these motion information candidates.
- processing time of the motion vector speculative calculation unit especially the processing time required for transferring the reference images, can be suppressed, and the capacity of the internal memory can be reduced.
- the image encoding apparatus 200 of the second embodiment is described with reference to FIG. 4 .
- FIG. 4 is a diagram showing a functional structure of the image encoding apparatus 200 .
- the image encoding apparatus 200 includes the motion detection unit 101 , first motion compensation unit 103 , second motion compensation unit 105 , intra prediction unit 106 , orthogonal transform unit 108 , quantization unit 109 , inverse quantization unit 110 , inverse orthogonal transform unit 111 , addition unit 112 , deblocking filter unit 113 , variable length encoding unit 114 , DMA controller 115 , external memory 116 , a motion vector speculative calculation unit 201 , a confirmation unit 202 , a motion vector determination unit 203 , and a mode judgement unit 204 .
- the above-mentioned components are the same as those of the image encoding apparatus 100 of the first embodiment except for the motion vector speculative calculation unit 201 , confirmation unit 202 , motion vector determination unit 203 , and mode judgement unit 204 , and therefore, their description is omitted here.
- the image encoding apparatus 200 is equipped with the CPU and the internal memory, and each function of the motion vector speculative calculation unit 201 , confirmation unit 202 , motion vector determination unit 203 , and mode judgement unit 204 is realized by causing the CPU to execute programs stored in the internal memory.
- the pipeline processing is configured such that (i) processing by the motion vector speculative calculation unit 201 corresponds to the first stage, and (ii) processing by the confirmation unit 202 , processing by the motion vector determination unit 203 , and processing by the mode judgement unit 204 correspond to the second stage.
- the motion vector speculative calculation unit 201 basically has the same functions as the motion vector speculative calculation unit 102 of the image encoding apparatus 100 . However, the motion vector speculative calculation unit 201 is different from the motion vector speculative calculation unit 102 in the following respects: the motion vector speculative calculation unit 201 (i) calculates the motion information candidates (m 2 and m 3 ) based on the motion information in the inter mode and in the skip mode, respectively, among the encoding modes selectable by the previous block whose motion information has not been determined, and (ii) transmits the motion information candidates (m 2 and m 3 ) to the motion vector determination unit 203 while reading the reference images indicated by the calculated motion information candidates (m 2 and m 3 ) from the external memory 116 to the internal memory via the DMA controller 115 as well.
- the confirmation unit 202 (i) confirms whether the encoding mode indicated by the mode information of the previous block received from the mode judgement unit 204 is either the inter mode or the skip mode, and (ii) transmits confirmation result information indicating a confirmation result to the motion vector determination unit 203 and mode judgement unit 204 .
- the motion vector determination unit 203 basically has the same functions as the motion vector determination unit 104 of the image encoding unit 100 . However, the motion vector determination unit 203 is different from the motion vector determination unit 104 in the following respects: when the confirmation result indicated by the confirmation result information received from the confirmation unit 202 indicates that the encoding mode of the previous block is neither the inter mode nor the skip mode, that is, the encoding mode of the previous block is indicated to be the intra mode, the motion vector determination unit 203 does not select the motion information of the encoding target block in the skip mode.
- the mode judgement unit 204 basically has the same functions as the mode judgement unit 107 of the image encoding apparatus 100 . However, the mode judgement unit 204 is different from the mode judgement unit 107 in the following respects: when the confirmation result indicated by the confirmation result information received from the confirmation unit 202 indicates that the encoding mode of the previous block is neither the inter mode nor the skip mode, that is, when the encoding mode of the previous block is indicated to be the intra mode, the mode judgement unit 204 determines an encoding mode other than the skip mode as the encoding mode of the encoding target block.
- the mode judgement unit 204 determines, as the encoding mode of the encoding target block, one of the inter mode and intra mode so as to minimize the code amount of the encoding target block, based on the difference information in the inter mode received from the first motion compensation unit 103 and the difference information in the intra mode received from the intra prediction unit 106 .
- FIG. 5 is a flowchart showing the operations of the image encoding apparatus 200 . Since the processing in the steps S 01 to S 22 are the same as the processing in the corresponding steps of the image encoding apparatus 100 , their description is omitted. In the following, only differences between the image encoding apparatus 200 and the image encoding apparatus 100 are described.
- the motion vector speculative calculation unit 201 (i) assumes the encoding mode of the previous block of the encoding target block to be either the inter mode or the skip mode, (ii) calculates the motion information candidates (m 2 and m 3 ) for these modes, respectively (step S 31 ), and (iii) transmits the motion information candidates (m 2 and m 3 ) to the motion vector determination unit 203 .
- the motion vector speculative calculation unit 201 reads the reference images respectively indicated by the motion information candidates (m 2 and m 3 ), from the external memory 116 into the internal memory via the DMA controller 115 (step S 32 ).
- the confirmation unit 202 (i) confirms whether the encoding mode indicated by the mode information of the previous block received from the mode judgement unit 204 is either the inter mode or the skip mode (step S 33 ) and (ii) transmits the confirmation result information to the motion vector determination unit 203 and the mode judgement unit 204 .
- the motion vector determination unit 203 selects, from among the motion information candidates (m 2 and m 3 ) received from the motion vector speculative calculation unit 201 , one motion information candidate corresponding to the encoding mode indicated by the mode information of the previous block received from the mode judgement unit 204 , as the motion information of the encoding target block in the skip mode (step S 34 ) and (ii) transmits the selected motion information to the second motion compensation unit 105 .
- the second motion compensation unit 105 (i) generates the prediction image of the encoding target block in the skip mode (step S 09 ), (ii) calculates the difference information indicating the difference between the prediction image of the encoding target block and the original image (step S 10 ), and (iii) transmits the calculated difference information, generated prediction image, and the motion information in the skip mode received from the motion vector determination unit 203 , to the mode judgement unit 204 .
- step S 33 If the confirmation result information indicates that the encoding mode of the previous block is neither the inter mode nor the skip mode, that is to say, the encoding mode of the previous block is indicated to be the intra mode (step S 33 :N), the processing proceeds to the step S 35 , which is described later, without performing the processing in the step S 34 , S 09 , and S 10 .
- the mode judgement unit 204 determines the encoding mode which minimizes the code amount of the encoding target block, as the encoding mode of the encoding target block (step S 35 )
- the mode judgement unit 204 determines the encoding mode of the encoding target block based on the three pieces of difference information.
- the mode judgement unit 204 determines the encoding mode of the encoding target block based on the two pieces of difference information received from the first motion compensation unit 103 and the intra prediction unit 106 .
- the research area image used by the motion detection unit 101 to detect the motion vector is the decrypted image of the previous image of the encoding target image which includes the encoding target block.
- the decrypted image of the previous image it is not limited to the decrypted image of the previous image, and can be the decrypted image of an image which precedes the encoding target image by more than one image.
- the search area image is also described to be the entire decrypted image of the previous image of the encoding target image. However, it can be part of the encoding target image.
- the search area image can be a block and 15 pixels in its vicinity in the decrypted image of the previous image of the encoding target image, the position of the block in the previous image being the same as the position of the encoding target block in the encoding target image.
- processing is performed in the four-stage pipeline structure.
- the third stage and the fourth stage in the first embodiment can be merged into one stage so that the processing is performed in a three-stage pipeline structure.
- the motion vector speculative calculation unit 102 calculates the motion information candidates of the encoding target block in the skip mode. However, it may calculate motion information candidates in the spatial direct mode.
- the reference image number is “0” irrespective of the encoding mode of the previous block.
- the spatial direct mode it is different from the skip mode in that, with respect to each direction L 0 and L 1 , the smallest image number among the reference image numbers of the adjacent blocks is selected to be the reference image number of the encoding target block.
- the reference image number of the adjacent blocks is “ ⁇ 1”, that is to say, the encoding mode of the adjacent block is the intra mode
- the reference image number of the encoding target block in the spatial direct mode is “0”. It should be noted that the motion vector speculative calculation unit 201 of the second embodiment can be similarly modified.
- the motion vector speculative calculation unit 201 calculates the motion information candidates (m 2 and m 3 ) based on the motion information in the inter mode and in the skip mode, respectively, among the encoding modes selectable by the previous block whose motion information has not been determined.
- the motion vector speculative calculation unit 201 may calculate motion information candidates (m 1 and m 3 ) of the encoding target block based on the motion information in the intra mode and the skip mode, or may calculate a motion information candidate (m 3 ) of the encoding target block based only on the motion information in the skip mode.
- the number of motion information candidates calculated by the motion vector speculative calculation unit 201 is not limited, and it can be determined in accordance with the processing ability and the like of the CPU.
- the processing by the intra prediction unit 106 is included in the processing in the second stage.
- the processing by the intra prediction unit 106 can be included in the processing in the first stage. That is to say, there is no particular restriction as to in which stage the processing by the intra prediction unit 106 should be included, as long as it is completed before the mode judgement unit 107 (or 204 ) determines the encoding mode of the encoding target block.
- each function of the motion detection unit 101 , motion vector speculative calculation unit 102 , first motion compensation unit 103 , motion vector determination unit 104 , second motion compensation unit 105 , intra prediction unit 106 , mode judgement unit 107 , orthogonal transform unit 108 , quantization unit 109 , inverse quantization unit 110 , inverse orthogonal transform unit 111 , addition unit 112 , deblocking filter unit 113 , and variable length encoding unit 114 is realized by causing the CPU to execute programs stored in the internal memory. However, part or an entirety of the processing may be realized by hardware.
- each function is typically realized by an LSI, an integrated circuit. These functions may be integrated into a single chip individually, or part or all of these functions may be integrated into a single chip.
- the recording media can be an IC card, an optical disc, a hard disk, a flexible disc, a ROM or the like.
- the circulated/distributed program is made available for use by being stored in a memory or the like that can be read by the CPU of an apparatus, and each function of the image encoding apparatuses described in the embodiments is realized by execution of the program by the CPU.
- the image encoding apparatuses conform to MPEG4AVC standards. However, they may conform to another compression encoding method such as MPEG4 or the like.
- the image encoding apparatus of the present invention is utilized to speed up encoding processing.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
- The present invention relates to an image encoding apparatus which compression encodes images, and in particular to speeding-up of compression encoding processing.
- MPEG4AVC (Motion Pictures Experts Group 4 Advanced Video Coding) standard, which is a standard for compression encoding of images, defines a skip mode and a spatial direct mode as encoding modes.
- When compression encoding (hereinafter, also referred to as simply “encoding”) a target image (hereinafter, referred to as “encoding target image”) in units of macroblocks of a predetermined size, for example, of 16*16 pixels, both of these encoding modes calculate the motion vector of the macroblock being encoded (hereinafter, referred to as “encoding target block”) and the reference image number (hereinafter, a set of a motion vector and a reference image number are referred to as “motion information”) identifying the reference image, based on the motion information of a plurality of macroblocks (hereinafter, referred to as “adjacent blocks”) adjacent to the encoding target block, the reference image being equivalent in size with the encoding target block and used to predict an image of the encoding target block. After that, these encoding modes encode the difference information which is information showing the difference between the prediction image, of the encoding target block, predicted based on the calculated motion information, and the original image of the encoding target block.
- In other words, because the motion information of the macroblock encoded in the skip mode or the spatial direct mode (hereinafter, also referred to as “skip mode or the like”) can be calculated using the motion information of the adjacent blocks when the macroblock is decoded, it is not necessary to encode the motion information of the encoding target block. Thus, it is necessary only to encode the difference information indicating the difference between the prediction image of the encoding target block and the original image of the encoding target block. Accordingly, code amounts can be reduced by performing encoding in the skip mode or the like.
- One conventionally known technique processes macroblocks in parallel using a pipeline structure in order to speed up encoding processing of an image.
- As described above, in order to encode the encoding target block in the skip mode or the like, the motion information of the adjacent blocks are required.
- For example, assume a case where the pipeline is divided into a first stage and a second stage, the first stage sequentially calculating the motion information of each macroblock encoded in the skip mode or the like, and the second stage, with respect to a macroblock which has been processed by the first stage, judging whether or not to perform the encoding in the skip mode or the like based on the code amount in the skip mode. In this case, when the processing by the first stage starts, if the processing by the second stage has not been completed on all the adjacent blocks of the encoding target block, the encoding mode has not been determined for all of the adjacent blocks.
- In other words, the motion information has not been determined for all of the adjacent blocks, and thus, in the first stage, the motion information of the encoding target block cannot be calculated in the skip mode or the like.
- An example of a technique for solving this problem is adopted by an image encoding apparatus disclosed by
Patent Document 1. In the following, the image encoding apparatus according toPatent Document 1 is described. - The image encoding apparatus according to
Patent Document 1 encodes the encoding target image in units of macroblocks which are processed in parallel using a pipeline structure. Starting from the upper left macroblock of the encoding target image, the image encoding apparatus processes one row of macroblocks in the horizontal direction, moves on to process the next row, and sequentially processes the macroblocks up to the bottom right macroblock. - In a case where the motion information of the macroblock (hereinafter, referred to as “previous block”) on the left of the encoding target block has not been calculated when beginning to process the encoding target block, the image encoding apparatus of
Patent Document 1 uses a block (hereinafter, referred to as “vicinity block”) on the left of the previous block, for which the motion vector has been calculated, instead of using the previous block, to generate the motion information of the encoding target block in the skip mode. - After that, upon completion of calculating the motion information of the previous block, the image encoding apparatus confirms whether the calculated motion information of the previous block and the motion information of the vicinity block match each other, and, if these two match each other, is able to encode the encoding target block in the skip mode or the like.
- Thus, even when the calculation of the motion information of the previous block has not been completed, the image encoding apparatus of
Patent Document 1 calculates the motion vector of the encoding target block using the motion information of the vicinity block, thereby realizing speeding-up of the encoding processing without hindering the pipelined parallel processing. - Patent Document 1: Japanese Patent Publication No.
- However, because the image encoding apparatus according to
Patent Document 1 calculates the motion vector of the encoding target block using the motion information of the vicinity block, it is often the case that the calculated motion information of the previous block and the motion information of the vicinity block do not match each other. In this case, the encoding target block cannot be encoded in the skip mode or the like. - The present invention is conceived in view of the above problem and aims to provide an image encoding apparatus that is able to realize speeding-up of the encoding processing with use of a method different from the conventional method.
- In order to solve the stated problem, the image encoding apparatus of the present invention is an image encoding apparatus which compression encodes an image in units of blocks of a predetermined size and comprises: a first processing unit operable to sequentially generate candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing unit operable to, in parallel with generation of the candidate designation information corresponding to the block by the first processing unit, (i) select, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputs a difference signal showing a difference which is between the selected prediction image and an original image of the one of the blocks, and is to be reflected in a result of compression encoding of the one of the blocks, and (iii) generates prediction image designation information designating the selected prediction image.
- Also, in order to solve the stated problem, the image encoding method used by the image encoding apparatus of the present invention is an image encoding method which compression encodes an image in units of blocks of a predetermined size and comprises: a first processing step of sequentially generating candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing method of, in parallel with generation of the candidate designation information corresponding to the block in the first processing step, (i) selecting, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputting a difference signal showing a difference which is between the selected prediction image and an original image of the one of the blocks, and is to be reflected in a result of compression encoding of the one of the blocks, and (iii) generating prediction image designation information designating the selected prediction image.
- Also, in order to solve the stated problem, the computer program used in the image encoding apparatus of the present invention is a computer program for causing a computer to perform compression encoding processing which compression encodes an image in units of blocks of a predetermined size and comprises: a first processing step of sequentially generating candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing method of, in parallel with generation of the candidate designation information corresponding to the block in the first processing step, (i) selecting, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputting a difference signal showing a difference which is between the selected prediction image and an original image of the one of the blocks, and is to be reflected in a result of compression encoding of the one of the blocks, and (iii) generating prediction image designation information designating the selected prediction image.
- Further more, in order to solve the stated problem, the integrated circuit used by the image encoding apparatus of the present invention is an integrated circuit used for image encoding, the integrated circuit compression encoding an image in units of blocks of a predetermined size and comprising: a first processing unit operable to sequentially generate candidate designation information corresponding to each of the blocks, based on candidate designation information corresponding to a prior block which has been processed prior to the block, each candidate designation information designating a plurality of prediction images as candidates for a prediction image of the corresponding block; and a second processing unit operable to, in parallel with generation of the candidate designation information corresponding to the block by the first processing unit, (i) select, as a prediction image of one of the blocks which has been processed by the first processing unit, a prediction image from among the plurality of prediction images indicated by the candidate designation information corresponding to the one of the blocks, based on prediction image designation information designating a prediction image of a block processed prior to the one of the blocks, (ii) outputs a difference signal showing a difference which is between the selected prediction image and an original image of the one of the blocks, and is to be reflected in a result of compression encoding of the one of the blocks, and (iii) generates prediction image designation information designating the selected prediction image.
- Here, the blocks of the predetermined size are macroblocks of n*n pixels, for example, of 16*16 pixels, and the prediction images are of the same size as the blocks.
- According to the image encoding apparatus of the present invention having the stated structure, (i) the first processing unit generates candidate designation information for each of the blocks based on candidate designation information of a prior block and, (ii) the second processing unit, in parallel with the processing by the first processing unit, for one of the blocks which has been processed by the first processing unit, selects a prediction image from among the plurality of prediction images indicated by the candidate designation information of the one of the blocks, based on the prediction image designation information designating a prediction image for a prior block processed prior to the one of the blocks. Consequently, for each of the blocks, compression encoding processing can be started even when a prediction image has not been determined for the prior block processed prior to the block at the start of the processing by the first processing unit.
- In other words, because encoding processing of each block can be started without waiting for the prediction image of the prior block of the block to be determined, the encoding processing can be speeded up.
- In addition, by using, for example, motion information (a motion vector and a reference image) candidates as the candidate designation information, candidate designation information of each block is composed of motion information candidates generated based on motion information candidates of a prior block. This enables encoding in the skip mode or spatial direct mode according to MPEG4AVC, thereby suppressing code amounts.
- The above-described image encoding apparatus may compression encode a plurality of images which are input sequentially, and each candidate designation information may include designation information designating, as one of the candidates for the prediction image of the corresponding block, one of the plurality of prediction images which is generated based on a motion vector detected by searching one of the plurality of images that is other than the image being compression encoded.
- According to the stated structure, when the image encoding apparatus uses, for example, a motion vector defined by the inter prediction (inter mode) according to MPEG4AVC as the motion vector detected by searching another image, the candidate designation information includes designation information designating a prediction image according to the inter prediction (inter mode).
- Accordingly, the image encoding apparatus realizes compression encoding with suppressed code amounts by selecting for each block, for example, a prediction image which minimizes the code amount, from among the multiple prediction image candidates indicated by the prediction image designation information.
- Each candidate designation information of the above-described image encoding apparatus may include designation information designating, as a plurality of candidates among the candidates for the prediction image of the corresponding block, out of the plurality of prediction images, two or more prediction images generated based on motion vectors of a plurality of adjacent blocks including the prior block of the corresponding block, the plurality of adjacent blocks being located in predetermined positions with respect to the corresponding block.
- According to the stated structure, when the image encoding apparatus uses, for example, predetermined macroblocks used in the skip mode according to MPEG4AVC standard as the adjacent blocks, the prediction image candidates indicated by candidate designation information include prediction images encoded in the skip mode or the spatial direct mode. Accordingly, the code amount can be suppressed when these prediction images are selected.
- The above-described image encoding apparatus may further comprise a third processing unit operable to cause each candidate designation information to include designation information designating, as one of the candidates for the prediction image of the corresponding block, one of the plurality of prediction images which is generated based on one or more pixel values of one or more pixels located in predetermined positions with respect to the corresponding block, and the second processing unit may select the prediction image for the prior block of the corresponding block, from among the plurality of prediction images indicated by the candidate designation information caused by the third processing unit to have the designation information.
- According to the stated structure, when the image encoding apparatus uses, for example, pixel values of predetermined pixels defined by the intra prediction (intra mode) according to MPEG4AVC, the candidate designation information includes designation information designating a prediction image candidate in the intra prediction (intra mode).
- Accordingly, the image encoding apparatus realizes compression encoding with suppressed code amounts by selecting for each block, for example, a prediction image which minimizes the code amount, from among the multiple prediction image candidates indicated by the candidate designation information.
- Each candidate designation information of the above-described image encoding apparatus may be generated also based on prediction image designation information of one of the blocks which has been processed by the second processing unit.
- According to the stated structure, for example, given that the processing target block of the first processing unit is MBn, the processing target block of the second processing unit is MBn−1, and the block which has been processed by the second processing unit is MBn−2, the image encoding apparatus generates the candidate designation information of MBn based on the candidate designation information of MBn−1 which is generated based on the candidate designation information of MBn−2. In this way, that is to say, by generating the candidate designation information of MBn based on the prediction image designation information of MBn−2 which has been processed by the second processing unit, the image encoding apparatus is able to suppress the number of the prediction image candidates indicated by the candidate designation information of MBn.
- Each candidate designation information of the above-described image encoding apparatus may be generated based on information designating only some of the plurality of prediction images which are candidates for the prediction image of the prior block of the corresponding block.
- According to the stated structure, the image encoding apparatus generates the candidate designation information of the encoding target block based on only some of the prediction image candidates of the previous block. This suppresses the number of the prediction image candidates indicated by the candidate designation information, thereby enabling a reduction in the processing load of the image encoding apparatus.
- The above-described image encoding apparatus may perform processing in a pipeline structure composed of two stages that are a first stage and a second stage. Here, the first processing unit processes the first stage and includes: a motion detection unit operable to generate, for each of the blocks, the candidate designation information including designation information designating, as one of the candidates for the prediction image of the block, one of the plurality of prediction images which is generated based on a motion vector detected by searching an image other than the image being compression encoded; and a motion vector speculative calculation unit operable to cause the candidate designation information of the block to include designation information designating, as a plurality of candidates among the candidates for the prediction image of the block, out of the plurality of prediction images, two or more prediction images generated based on motion vectors of a plurality of adjacent blocks including the prior block, the plurality of adjacent blocks being located in predetermined positions with respect to the block, and the second processing unit processes the second stage and includes: a motion vector determination unit operable to, based on the prediction image designation information of the block prior to the one of the blocks, select a prediction image out of the one or more prediction images designated by the designation information caused by the motion vector speculative calculation unit to be included in the candidate designation information of the one of the blocks; and a mode judgement unit operable to (i) select the prediction image for the one of the blocks, from among the prediction image selected by the motion vector determination unit and the prediction image indicated by the designation information generated by the motion detection unit, (ii) output the difference signal, and (iii) generate the prediction image designation information designating the selected prediction image.
- According to the stated structure, (i) the candidate designation information generated for each block by the first processing unit includes designation information designating multiple prediction image candidates of the block based on the candidate designation information, generated by the motion vector speculative calculation unit, of the prior block, and (ii) the motion vector determination unit in the second processing unit selects one of the multiple prediction image candidates indicated by the designation information generated by the motion vector speculative calculation unit based on the prediction image designation information of the block prior to the one of the blocks. Thus, even when the prediction image of the prior block has not been determined when the first processing unit starts processing, the compression encoding of the block can be started.
- Additionally, the candidate designation information corresponding to each of the blocks includes designation information generated by the motion detection unit, and the mode judgement unit in the second processing unit selects the prediction image from among (a) the prediction image candidate selected by the motion vector determination unit and (b) the prediction image candidate, among the prediction image candidates indicated by the candidate designation information, indicated by the designation information generated by the motion detection unit. Consequently, by selecting for each block, for example, a prediction image which minimizes the code amount, the image encoding apparatus realizes the compression encoding which suppresses the code amount.
-
FIG. 1 is a diagram for explaining a process flow of pipeline processing performed by an image encodingapparatus 100 of a first embodiment of the present invention; -
FIG. 2 is a diagram showing a functional structure of the image encodingapparatus 100 of the first embodiment; -
FIG. 3 is a flowchart showing operations of the image encodingapparatus 100 of the first embodiment; -
FIG. 4 is a diagram showing a functional structure of an image encoding apparatus 200 of a second embodiment; -
FIG. 5 is a flowchart showing operations of the image encoding apparatus 200 of the second embodiment; -
FIG. 6 is a diagram for explaining relationships between an encoding target block and adjacent pixels; and -
FIG. 7 is a diagram for explaining relationship between an encoding target block and adjacent blocks. -
-
- 100, 200 Image encoding apparatus
- 101 Motion detection unit
- 102, 201 Motion vector speculative calculation unit
- 103 First motion compensation unit
- 104, 203 Motion vector determination unit
- 105 Second motion compensation unit
- 106 Intra prediction unit
- 107, 204 Mode judgement unit
- 108 Orthogonal transform unit
- 109 Quantization unit
- 110 Inverse quantization unit
- 111 Inverse orthogonal transform unit
- 112 Addition unit
- 113 Deblocking filter unit
- 114 Variable length encoding unit
- 115 DMA controller
- 116 External memory
- 202 Confirmation unit
- Embodiments of the present invention are described below with reference to the drawings.
- <Outline>
- An image encoding apparatus of a first embodiment is a modification of an image encoding apparatus that conforms to the MPEG4AVC standard and encodes an encoding target image in units of macroblocks of a predetermined size (for example, 16*16 pixels) using pipeline processing.
- The image encoding apparatus of the first embodiment selects, for each macroblock, an encoding mode out of three encoding modes, which are the intra mode, inter mode, and skip mode, so as to minimize the code amount of the macroblock, and encodes the macroblock using the selected encoding mode.
- In the following, a brief description is given on each encoding mode.
- The intra mode is an encoding mode that (i) generates a prediction image of an encoding target block based on pixel values of pixels (hereinafter, referred to as “adjacent pixels”) adjacent to the encoding target block which is the macroblock targeted for encoding, and (ii) encodes the difference information indicating the difference between the generated prediction image and the original image.
- In the following, the adjacent pixels are described with reference to
FIG. 6 . -
FIG. 6 is a diagram for explaining relationships between the encoding target block and the adjacent pixels. - The adjacent pixels refer to, with respect to the encoding target block shown in
FIG. 6 , 16 pixels on the left (pixels A in the figure) and 16 pixels on the top (pixels B in the figure). - Generation of the prediction image is performed, upon selection of a prediction mode out of multiple prediction modes (vertical prediction, horizontal prediction, and the like), using the pixel values of the adjacent pixels which correspond with the selected prediction mode. In other words, pixel values of different adjacent pixels are used in accordance with each prediction mode.
- For example, when the vertical prediction is selected as the prediction mode, the prediction image is generated on the assumption that the pixel value of each column of pixels in the encoding target block is equivalent to the pixel value of the pixel in the same column in B in
FIG. 6 . It should be noted that while selection methods of the prediction mode are prior arts and thus not described in detail here, it is possible to apply a method which makes a selection based on prediction modes of macroblocks in the vicinity. - The inter mode is an encoding mode which (i) detects a motion vector indicating a relative position with respect to a block resembling to the encoding target block, by searching an image (hereinafter, referred to as “search area image”) other than the encoding target image containing the encoding target block, (ii) generates a prediction image of the encoding target block based on motion information composed of the detected motion vector and a reference image number indicating the search area image, and (iii) encodes (a) the difference information indicating the difference between the generated prediction image and the original image of the encoding target block and (b) the motion information.
- The skip mode is an encoding mode which (i) calculates the motion information of the encoding target block using the motion information of the adjacent blocks which are multiple macroblocks adjacent to the encoding target block, (ii) generates a prediction image of the encoding target block based on the calculated motion information, and (iii) encodes the difference information indicating the difference between the generated prediction image and the original image of the encoding target block. Thus, according to the skip mode, only the difference information is encoded, and it is not necessary to encode motion information. Accordingly, the code amount can be reduced.
- Here, the adjacent blocks are described with reference to
FIG. 7 . -
FIG. 7 is a diagram for explaining relationship between an encoding target block and adjacent blocks. Among macroblocks A to F shown in the figure, if the block E is the encoding target block in the skip mode, then the adjacent blocks refer to the blocks B to D. - However, if the motion information of the block C cannot be used because, for example, the block C does not exist, the block A replaces the block C as one of the adjacent blocks of the block E. In addition, if all the motion information of the blocks A to C cannot be used because, for example, none of the blocks A to C exists in the encoding target image, the block D becomes the only adjacent block of the block E. Note that, hereinafter, the block D on the left of the encoding target block E is called “the previous block”.
- When the blocks B to D are the adjacent blocks of the block E, the motion vector of the block E is determined by finding out the medians of the horizontal components and vertical components of the motion vectors of the blocks B to D. It should be noted that when the adjacent blocks include a macroblock encoded in the intra mode, the motion vector of the block E is calculated assuming that the motion vector of the macroblock is “0”.
- Additionally, the reference image of the block E in the skip mode is an image displayed immediately preceding the encoding target image which contains the block E, and the reference image number is “0”.
- It should be noted that the reference image number is a number indicating the position (relative position) of the reference image, in terms of display order, with reference to the encoding target image containing the encoding target block. The further the reference image is from the encoding target image in terms of the display order, the larger the reference image number assigned to the reference image is.
- In the following, the image encoding apparatus of the first embodiment is specifically described with reference to
FIG. 1 . -
FIG. 1 is a diagram for explaining a process flow of pipeline processing performed by the image encoding apparatus of the first embodiment. - The image encoding apparatus of the first embodiment (i) selects an encoding mode for each macroblock in the encoding target image such that the code amount of the macroblock is minimized and (ii) encodes the macroblock in the selected encoding mode. Starting from the upper left macroblock, the image encoding apparatus processes a row of macroblocks in the horizontal direction, moves on to the next row to continue processing, and in a similar manner, sequentially processes the macroblocks up to the bottom right macroblock.
- As shown in
FIG. 1 , processing by the image encoding apparatus of the first embodiment is a 4-stage pipeline structure composed of the first to fourth stages, and in each time slot (TS1, TS2, TS3, TS4, . . . ) processing of the pipeline stages are performed in parallel. - In the following, the outline of the processing in each pipeline stage is described with a focus on a macroblock (MBn) which is processed n-th.
- First, in the first stage, (a) detection of the motion information in the case of encoding MBn in the inter mode and (b) calculation of the motion information in the case of encoding MBn in the skip mode are performed in parallel.
- Next, in the second stage, (a) generation of prediction images assuming that MBn is encoded in the respective encoding modes and (b) calculation of difference information each indicating difference between each prediction image and the original image are performed in parallel so as to determine, for MBn, the encoding mode which minimizes the code amount thereof.
- Next, in the third and fourth stages, MBn is encoded in the encoding mode determined in the second stage as the encoding mode for MBn, and an encoded stream is generated.
- As is clear from above, to calculate the motion information of MBn in the skip mode in the first stage, the motion information of MBn−1, which is the previous block of MBn, is required. However, the encoding mode of MBn−1 is determined upon completion of the processing in the second stage.
- Accordingly, when the first-stage processing of MBn starts (when TS1 starts), the second-stage processing for MBn−1 is about to start, and thus, the encoding mode of MBn−1 has not been determined. In other words, the motion information of MBn−1 has not been determined.
- Consequently, in the first stage, the image encoding apparatus of the first embodiment calculates pieces of motion information (hereinafter, referred to as “motion information candidates”) for MBn in the skip mode, the pieces of motion information corresponding to the cases where the encoding mode of MBn−1 is assumed to be one of the above-described encoding modes (intra mode, inter mode, and skip mode), respectively. In other words, three pieces of motion information are calculated as candidates for the motion information of MBn in the skip mode.
- Also, since the second-stage processing of MBn−1 has been completed when the second-stage processing of MBn starts (when TS2 starts), the encoding mode of MBn−1 has been determined.
- Accordingly, in the second stage, from among the three motion information candidates for the motion information of MBn in the skip mode, which have been calculated in the first stage, the motion information candidate corresponding to the determined encoding mode for MBn−1 is selected as the motion information of MBn in the skip mode. Following that, the prediction image of MBn in the skip mode is generated based on the selected motion information, and the difference information indicating the difference between the generated prediction image and the original image of MBn is calculated.
- Additionally, in the second stage, in parallel with the above processing, the prediction images of MBn in the intra mode and inter mode are generated, and the difference information each indicating difference between each prediction image and the original image of MBn is calculated. It should be noted that the prediction image of MBn in the inter mode is generated based on the motion information detected in the first stage.
- Furthermore, in the second stage, upon completion of calculating the difference information for each of the encoding modes (intra mode, inter mode, and skip mode) of MBn, the code amount in each encoding mode is calculated using the difference information of the encoding mode, and the encoding mode which yields the least code amount is determined as the encoding mode for MBn.
- Thus, being able to start processing the encoding target block without waiting for the determination of the motion information of the previous block, the image encoding apparatus of the first embodiment allows the encoding of the encoding target block in the skip mode, thereby suppressing the coding amount while maintaining the high-speed encoding processing using the pipeline structure.
- <Structure>
- First, the structure of the
image encoding apparatus 100 of the first embodiment is described with reference toFIG. 2 . -
FIG. 2 is a diagram showing the functional structure of theimage encoding apparatus 100. - As shown in
FIG. 2 , theimage encoding apparatus 100 includes amotion detection unit 101, a motion vectorspeculative calculation unit 102, a firstmotion compensation unit 103, a motionvector determination unit 104, a secondmotion compensation unit 105, anintra prediction unit 106, amode judgement unit 107, anorthogonal transform unit 108, aquantization unit 109, aninverse quantization unit 110, an inverseorthogonal transform unit 111, anaddition unit 112, adeblocking filter unit 113, a variablelength encoding unit 114, aDMA controller 115, and anexternal memory 116. - Also, although not depicted, the
image encoding apparatus 100 is equipped with a CPU (Central Processing Unit) and an internal memory, and each function of the above-mentionedmotion detection unit 101, motion vectorspeculative calculation unit 102, firstmotion compensation unit 103, motionvector determination unit 104, secondmotion compensation unit 105,intra prediction unit 106,mode judgement unit 107,orthogonal transform unit 108,quantization unit 109,inverse quantization unit 110, inverseorthogonal transform unit 111,addition unit 112,deblocking filter unit 113, and variablelength encoding unit 114 is realized by causing the CPU to execute programs stored in the internal memory. - Additionally, in the present embodiment, pipeline processing is configured such that (i) processing by the
motion detection unit 101 and processing by the motion vectorspeculative calculation unit 102 correspond to the first stage, (ii) processing by the firstmotion compensation unit 103, processing by the motionvector determination unit 104, processing by the secondmotion compensation unit 105, processing by theintra prediction unit 106, and processing by themode judgement unit 107 correspond to the second stage, (iii) processing by theorthogonal transform unit 108, processing by thequantization unit 109, processing by theinverse quantization unit 110, processing by the inverseorthogonal transform unit 111, and processing by theaddition unit 112 correspond to the third stage, and (iv) processing by thedeblocking filter unit 113 and processing by the variablelength encoding unit 114 correspond to the fourth stage. Note that the first-stage processing corresponds to the first processing unit of the present invention, and the second-stage processing corresponds to the second processing unit and the third processing unit of the present invention. - The motion detection unit 101 (i) detects the motion vector of the encoding target block in the inter mode, and (ii) transmits, to the motion vector
speculative calculation unit 102 and the firstmotion compensation unit 103, the motion information composed of the detected motion vector and the reference image number indicating the search area image used to detect the motion vector. - Specifically, the motion detection unit 101 (i) reads, from the
external memory 116 to the internal memory (not depicted) via theDMA controller 115, the original image of the encoding target block (for example, a macroblock of 16* 16 pixels) and the search area image targeted for searching for the motion vector, (ii) performs block-matching between the read original image of the encoding target block and the read search area image, and (c) detects, by finding a macroblock which is most similar to the original image of the encoding target block, the motion vector indicating the relative position to the found macroblock. - It should be noted that regarding the detection of a motion vector, it is possible to detect a motion vector of fractional pixel accuracy (½ pixel accuracy and ¼ pixel accuracy). However, for the sake of description, description is given on the assumption that a motion vector of integer pixel accuracy is detected.
- Additionally, the search area image is a decrypted image that has been deblocked and stored in the
external memory 116 by thedeblocking filter unit 113, which is described later. In the following, the search area image is described as an image immediately preceding, such as in the display order, the encoding target image containing the encoding target block. - Upon detection of the motion vector, the
motion detection unit 101 transmits the detected motion vector and the reference image number indicating the search area image, that is to say, the motion information, to the motion vectorspeculative calculation unit 102 and the firstmotion compensation unit 103. - The motion vector
speculative calculation unit 102 calculates motion information candidates in the skip mode for the encoding target block, and transmits the calculated motion information candidates of the encoding target block to thevector determination unit 104. - That is to say, the motion vector
speculative calculation unit 102 calculates the motion information candidates of the encoding target block based on (a) the motion information of the adjacent blocks of the encoding target block except for the previous block, the motion information being stored in the internal memory by themode judgement unit 107 which is described later, and (b) the motion information each of which corresponds to one of the encoding modes selectable by the previous block whose encoding mode has not been determined. - Specific description is given below on the calculation of the motion information candidates. It should be noted that irrespective of the encoding mode of the previous block, the reference image number is “0”.
- When it is assumed that the encoding mode of the previous block is the intra mode, the motion vector of the encoding target block is calculated based on the motion vectors of the adjacent blocks including the previous block on the assumption that the motion vector of the previous block is “0”. Hereinafter, the motion information calculated under the assumption that the previous block is the intra mode is referred to as “m1”.
- When it is assumed that the encoding mode of the previous block is the inter mode, the motion vector of the encoding target block is calculated based on (a) the motion vector of the previous block in the inter mode, which has been received from the
motion detection unit 101 and (b) the motion vectors of the other adjacent blocks. Hereinafter, the motion information calculated under the assumption that the encoding mode of the previous block is the inter mode is referred to as “m2”. - When it is assumed that the encoding mode of the previous block is the skip mode, the previous block also has three possibilities for its motion information in the skip mode.
- Thus, the motion vector
speculative calculation unit 102, with use of mode information which indicates the encoding mode determined for the macroblock on the left of the previous block and has been received from themode judgement unit 107, selects, out of the three motion information candidates for the motion information of the previous block in the skip mode, a motion vector corresponding to the encoding mode determined for the macroblock on the left of the previous block as the motion vector of the previous block in the skip mode. The motion vectorspeculative calculation unit 102 then calculates the motion vector of the encoding target block based on (a) the motion vector in the motion information of the previous block in the skip mode and (b) the motion vectors of the other adjacent blocks. Hereinafter, the motion information calculated under the assumption that the encoding mode of the previous block is the skip mode is referred to as “m3”. - It should be noted that the motion vector
speculative calculation unit 102 stores, in the internal memory, the motion information candidates (m1 to m3) calculated for the encoding target block. The stored motion information candidates (m1 to m3) are used when calculating the motion information “m3” for the next macroblock to be processed. - In addition, the motion vector
speculative calculation unit 102 reads the reference images indicated by the calculated motion information candidates (m1 to m3) from theexternal memory 116 to the internal memory via theDMA controller 115. These reference images are used when the secondmotion compensation unit 105 generates the prediction image of the encoding target block in the skip mode, as described later. - The first motion compensation unit 103 (i) generates the prediction image based on the motion information, received from the
motion detection unit 101, of the encoding target block in the inter mode, (ii) calculates the difference information indicating the difference between the prediction image and the original image of the encoding target block stored in the internal memory, and (iii) transmits the prediction image, difference information, and motion information to themode judgement unit 107. - As to the generation of the prediction image, more specifically, the prediction image is a block indicated by the motion vector, the block being within the search area image indicated by the reference mode number received from the
motion detection unit 101. - The motion vector determination unit 104 (i) selects, from among the three motion information candidates (m1 to m3), received from the motion vector
speculative calculation unit 102, of the encoding target block in the skip mode, a piece of motion information corresponding to the encoding mode of the previous block indicated by the mode information received from themode judgement unit 107, and (ii) transmits the selected motion information candidate as the motion information of the encoding target block in the skip mode to the secondmotion compensation unit 105. Themode judgment unit 107 is described below. - For example, when the encoding mode of the previous block is determined to be the inter mode, “m2” is transmitted to the second
motion compensation unit 105 as the motion information of the encoding target block in the skip mode. - The second motion compensation unit 105 (i) generates the prediction image based on the motion information of the encoding target block in the skip mode received from the motion
vector determination unit 104, (ii) calculates the difference information indicating the difference between the prediction image and the original image of the encoding target block stored in the internal memory, and (iii) transmits, to themode judgement unit 107, the prediction image, difference information, and motion information in the skip mode which has been received from the motionvector determination unit 104. - As to the generation of the prediction image, specifically, the prediction image is the reference image indicated by the motion information received from the motion
vector determination unit 104. - The intra prediction unit 106 (i) reads the images of the adjacent pixels of the encoding target block from the
external memory 116 to the internal memory via theDMA controller 115, (ii) generates the prediction image of the encoding target block in the intra mode based on the pixel values of the adjacent pixels, (iii) calculates the difference information indicating the difference between the prediction image and the original image of the encoding target block stored in the internal memory, and (iv) transmits the prediction image, difference information, and the prediction mode information which indicates the prediction mode used to generate the prediction image, to themode judgement unit 107. - It should be noted that the images of the adjacent pixels read from the
external memory 116 by theintra prediction unit 106 is a decrypted image stored into theexternal memory 116 by theaddition unit 112, which is described later. - The
mode judgement unit 107 selects, as the encoding mode of the encoding target block, an encoding mode which minimizes the code amount, based on the difference information in the inter mode received from the firstmotion compensation unit 103, the difference information in the skip mode received from the secondmotion compensation unit 105, and the difference information in the intra mode received from theintra prediction unit 106. It should be noted that when the encoding is performed in the inter mode, the code amount includes the code amount of the motion information, in addition to the code amount of the above-mentioned difference information. - Furthermore, the mode judgement unit 107 (i) transmits the mode information indicating the determined encoding mode to the vector
speculative calculation unit 102 and the motionvector determination unit 104, (ii) transmits the difference information corresponding to the determined encoding mode to theorthogonal transform unit 108, (iii) transmits the prediction image corresponding to the determined encoding mode to theaddition unit 112, and (iv) transmits the mode information and information (the motion information when the inter mode is selected, the prediction mode information when the intra mode is selected, and the like) in accordance with the determined encoding mode, to the variablelength encoding unit 114. - It should be noted that the
mode judgement unit 107 stores the motion information (the motion information in the inter mode or the motion information in the skip mode) in accordance with the determined encoding mode into the internal memory. - The
orthogonal transform unit 108 performs orthogonal transform processing such as discrete cosine transform with respect to the difference information received from themode judgement unit 107, and transmits coefficient information, which is the result of the processing, to thequantization unit 109. - The
quantization unit 109 performs quantization processing with respect to the coefficient information received from theorthogonal transform unit 108, and transmits the quantized coefficient information to theinverse quantization unit 110 and the variablelength encoding unit 114. - The
inverse quantization unit 110 performs inverse quantization processing with respect to the quantized coefficient information received from thequantization unit 109 and transmits coefficient information, which is the result of the processing, to the inverseorthogonal transform unit 111. - The inverse
orthogonal transform unit 111 performs inverse orthogonal transform processing with respect to the coefficient information received from theinverse quantization unit 110, and transmits difference information, which is the result of the processing, to theaddition unit 112. - The addition unit 112 (i) generates a decrypted image of the encoding target block by adding the prediction image received from the
mode judgement unit 107 and the difference information received from the inverseorthogonal transform unit 111, and (ii) transmits the generated decrypted image to thedeblocking filter unit 113, as well as storing the generated decrypted image into theexternal memory 116 via theDMA controller 115. - The
deblocking filter unit 113, with respect to the decrypted image received from theaddition unit 112, performs block noise removal processing (hereinafter, referred to as “deblocking processing”), and stores the deblocking-processed decrypted image into theexternal memory 116 via theDMA controller 115. - The variable length encoding unit 114 (i) performs variable length encoding processing, arithmetic encoding processing, or the like with respect to the quantitized coefficient information received from the
quantization unit 109 and (ii) stores the processed encoded stream into theexternal memory 116 via theDMA controller 115. It should be noted that the mode information received from themode judgement unit 107 and the information in accordance with the determined encoding mode are used for generating the header of the encoded stream. - The
DMA controller 115 is a general DMA controller which arbitrates access requests from the respective units to theexternal memory 116 and transmits data between theexternal memory 116 and the internal memory. - The
external memory 116 is a memory composed of a DRAM and the like storing each encoding target block, the decrypted image stored by theaddition unit 112, the deblocking-processed decrypted image stored by thedeblocking filter unit 113, and the encoded stream stored by the variablelength encoding unit 114. - <Operations>
- Next, operations of the
image encoding apparatus 100 having the stated structure are described with reference toFIG. 3 . -
FIG. 3 is a flowchart showing the operations of theimage encoding apparatus 100. - In the following, processing by the
image encoding apparatus 100 is described in pipeline stage units. It should be noted that the pipeline stages from the first stage to the fourth stage are processed in parallel with one another, each of the pipeline stage targeting a different macroblock for processing. - <Operations in First Stage>
- First, operations by the
image encoding apparatus 100 in the first stage are described. In the first stage, the processing by the motion detection unit 101 (steps S01 to S03) and the processing by the motion vector speculative calculation unit 102 (steps S04 and S05), which are described below, are executed in parallel. - The
motion detection unit 101 reads the original image of the encoding target block and the search area image which is the search target of the motion vector, from theexternal memory 116 to the internal memory via the DMA controller 115 (steps S01 and S02). - The motion detection unit 101 (i) detects the motion vector by searching the search area image for a macroblock which is most similar to the original image of the encoding target block (step S03) and (ii) transmits the detected motion vector and the reference image number indicating the search area image, that is to say, the motion information, to the first
motion compensation unit 103. - On the other hand, the motion vector
speculative calculation unit 102 calculates the motion information candidates (m1 to m3) assuming the encoding mode of the previous block of the encoding target block to be the intra mode, inter mode, and skip mode, respectively (step S04), and transmits the calculated motion information candidates (m1 to m3) to the motionvector determination unit 104. - Furthermore, the motion vector
speculative calculation unit 102 reads, from theexternal memory 116 to the internal memory via the DMA controller 115 (step S05), the reference images indicated by the motion information candidates (m1 to m3). - <Operations in Second Stage>
- Next, operations by the
image encoding apparatus 100 in the second stage are described. - In the second stage, (a) the processing by the first motion compensation unit 103 (steps S06 and S07), (b) the processing by the motion
vector determination unit 104 and the second motion compensation unit 105 (steps S08 to S10), and (c) the processing by the intra prediction unit 106 (steps S11 and S12) are executed in parallel with one another. The processing by each unit is described below. - The first
motion compensation unit 103 generates a prediction image of the encoding target block in the inter mode based on the motion information received from the motion detection unit 101 (step S06). - The first motion compensation unit 103 (i) calculates the difference information indicating the difference between the generated prediction image and the original image of the encoding target block stored in the internal memory (step S07) and (ii) transmits the calculated difference information, generated prediction image, and motion information received from the
motion detection unit 101, to themode judgement unit 107. - The motion vector determination unit 104 (i) selects, from among the three motion information candidates (m1 to m3) of the encoding target block in the skip mode received from the motion vector
speculative calculation unit 102, as the motion information of the encoding target block in the skip mode, the motion information candidate corresponding to the encoding mode of the previous block indicated by the mode information received from the mode judgement unit 107 (step S08), and (ii) transmits the selected motion information to the secondmotion compensation unit 105. - The second
motion compensation unit 105 generates a prediction image of the encoding target block in the skip mode based on the motion information received from the motion vector determination unit 104 (step S09). - The second motion compensation unit 105 (i) calculates the difference information indicating the difference between the generated prediction image and the original image of the encoding target block stored in the internal memory (step S10), and (ii) transmits the calculated difference information, generated prediction image, and the motion information in the skip mode received from the motion
vector determination unit 104, to themode judgement unit 107. - The
intra prediction unit 106 reads the images of the adjacent pixels of the encoding target block from theexternal memory 116 into the internal memory via theDMA controller 115, and generates a prediction image of the encoding target block in the intra mode based on the pixel values of the adjacent pixels (step S11). - The
intra prediction unit 106 calculates the difference information indicating the difference between the generated prediction image and the original image of the encoding target block stored in the internal memory (step S12), and transmits the calculated difference information, generated prediction image, and the prediction mode information, to themode judgement unit 107. - The
mode judgement unit 107 selects an encoding mode which minimizes the code amount, from among the encoding modes (the inter mode, skip mode, and intra mode) as the encoding mode of the encoding target block, based on the difference information received from the firstmotion compensation unit 103, secondmotion compensation unit 105, andintra prediction unit 106, respectively (step S13). - The mode judgement unit 107 (i) transmits the mode information indicating the determined encoding mode to the vector
speculative calculation unit 102 and the motionvector determination unit 104, (ii) transmits the difference information corresponding to the determined encoding mode to theorthogonal transform unit 108, (iii) transmits the prediction image corresponding to the determined encoding mode to theaddition unit 112, and (iv) transmits the mode information and information (the motion information when the inter mode is selected, the prediction mode information when the intra mode is selected, and the like) in accordance with the determined encoding mode, to the variablelength encoding unit 114. - <Operations in Third Stage>
- Next, operations of the
image encoding apparatus 100 in the third stage are described. - The
orthogonal transform unit 108 performs orthogonal transform processing such as discrete cosine transform with respect to the difference information received from themode judgement unit 107, and transmits coefficient information, which is the result of the processing, to the quantization unit 109 (step S14). Thequantization unit 109 performs quantization processing with respect to the coefficient information received from theorthogonal transform unit 108, and transmits the quantized coefficient information to theinverse quantization unit 110 and the variable length encoding unit 114 (step S15). - The
inverse quantization unit 110 performs inverse quantization processing with respect to the quantized coefficient information received from thequantization unit 109, and transmits coefficient information, which is the result of the processing, to the inverse orthogonal transform unit 111 (step S16). The inverseorthogonal transform unit 111 performs inverse orthogonal transform processing with respect to the coefficient information received from theinverse quantization unit 110, and transmits difference information, which is the result of the processing, to the addition unit 112 (step S17). - <Operations in Fourth Stage>
- Next, operations of the
image encoding apparatus 100 in the fourth stage are described. - In the fourth stage, the processing by the
addition unit 112 and deblocking filter unit 113 (steps S18 to S20) and the processing by the variable length encoding unit 114 (steps S21 and S22) are executed in parallel. The processing by these units are described below. - The addition unit 112 (i) generates the decrypted image of the encoding target block by adding the prediction image received from the
mode judgement unit 107 and the difference information received from the inverse orthogonal transform unit 111 (step S18), and (ii) transmits the generated decrypted image to thedeblocking filter unit 113, as well as storing the generated decrypted image into theexternal memory 116 via theDMA controller 115. - The
deblocking filter unit 113 performs the deblocking processing with respect to the decrypted image received from the addition unit 112 (step S19), and stores the deblocking-processed decrypted image into theexternal memory 116 via the DMA controller 115 (step S20). - The variable length encoding unit 114 (i) performs variable length encoding processing, arithmetic encoding processing, or the like with respect to the quantitized coefficient information received from the quantization unit 109 (step S21) and (ii) stores the processed encoded stream into the
external memory 116 via the DMA controller 115 (step S22). - <Observations>
- In the following, effects achieved by the
image encoding apparatus 100 of the present embodiment are described in comparison with the image encoding apparatus according toPatent Document 1. - The image encoding apparatus according to
Patent Document 1 generates the motion information of the encoding target block in the skip mode using motion information of a vicinity block instead of using the motion information of the previous block. Accordingly, although the high-speed encoding processing using the pipeline structure can be maintained, it is not possible to perform encoding in the skip mode in a case where the motion information of the previous block, which is determined subsequently, differs from the motion information of the vicinity block. - On the other hand, the
image encoding apparatus 100 calculates the motion information candidates of the encoding target block in the skip mode based on all the motion information selectable by the previous block of the encoding target block. Following that, upon determination of the motion information of the previous block, theimage encoding apparatus 100 selects, out of the calculated motion information candidates, a motion information candidate which corresponds with the determined motion information of the previous block as the motion information of the encoding target block in the skip mode. - Consequently, each macroblock can be encoded in the skip mode reliably without exception, thereby enabling a reduction in the code amount, while the high-speed encoding processing with the pipeline structure is maintained.
- <Outline>
- The motion vector
speculative calculation unit 102 of the first embodiment (i) calculates all the motion information candidates of the encoding target block in the skip mode based on all the encoding modes (intra mode, inter mode, and skip mode) selectable by the previous block, and (ii) acquires all the reference images indicated by the above-mentioned respective motion information candidates. - However, if it is necessary to calculate all the motion information candidates of the encoding target block in the skip mode and acquire all the reference images indicated by those motion information candidates, processing time of the motion vector
speculative calculation unit 102, especially processing time for transferring the reference images, requires a large amount of time. Moreover, at the same time, a high-capacity internal memory is required to store all the reference images. - Thus, the motion vector speculative calculation unit of the second embodiment (i) calculates motion information candidates of the encoding target block in the skip mode based on motion information candidates corresponding to, among all the encoding modes selectable by the previous block, the encoding modes which are effective in improving image quality and encoding efficiency, namely, the inter mode and skip mode, and (ii) acquires the reference images indicated by these motion information candidates.
- As a result, processing time of the motion vector speculative calculation unit, especially the processing time required for transferring the reference images, can be suppressed, and the capacity of the internal memory can be reduced.
- <Structure>
- First, the image encoding apparatus 200 of the second embodiment is described with reference to
FIG. 4 . -
FIG. 4 is a diagram showing a functional structure of the image encoding apparatus 200. - As shown in
FIG. 4 , the image encoding apparatus 200 includes themotion detection unit 101, firstmotion compensation unit 103, secondmotion compensation unit 105,intra prediction unit 106,orthogonal transform unit 108,quantization unit 109,inverse quantization unit 110, inverseorthogonal transform unit 111,addition unit 112,deblocking filter unit 113, variablelength encoding unit 114,DMA controller 115,external memory 116, a motion vectorspeculative calculation unit 201, aconfirmation unit 202, a motionvector determination unit 203, and amode judgement unit 204. - The above-mentioned components are the same as those of the
image encoding apparatus 100 of the first embodiment except for the motion vectorspeculative calculation unit 201,confirmation unit 202, motionvector determination unit 203, andmode judgement unit 204, and therefore, their description is omitted here. - Although not depicted, the image encoding apparatus 200 is equipped with the CPU and the internal memory, and each function of the motion vector
speculative calculation unit 201,confirmation unit 202, motionvector determination unit 203, andmode judgement unit 204 is realized by causing the CPU to execute programs stored in the internal memory. - Additionally, in the present embodiment, the pipeline processing is configured such that (i) processing by the motion vector
speculative calculation unit 201 corresponds to the first stage, and (ii) processing by theconfirmation unit 202, processing by the motionvector determination unit 203, and processing by themode judgement unit 204 correspond to the second stage. - The motion vector
speculative calculation unit 201 basically has the same functions as the motion vectorspeculative calculation unit 102 of theimage encoding apparatus 100. However, the motion vectorspeculative calculation unit 201 is different from the motion vectorspeculative calculation unit 102 in the following respects: the motion vector speculative calculation unit 201 (i) calculates the motion information candidates (m2 and m3) based on the motion information in the inter mode and in the skip mode, respectively, among the encoding modes selectable by the previous block whose motion information has not been determined, and (ii) transmits the motion information candidates (m2 and m3) to the motionvector determination unit 203 while reading the reference images indicated by the calculated motion information candidates (m2 and m3) from theexternal memory 116 to the internal memory via theDMA controller 115 as well. - The confirmation unit 202 (i) confirms whether the encoding mode indicated by the mode information of the previous block received from the
mode judgement unit 204 is either the inter mode or the skip mode, and (ii) transmits confirmation result information indicating a confirmation result to the motionvector determination unit 203 andmode judgement unit 204. - The motion
vector determination unit 203 basically has the same functions as the motionvector determination unit 104 of theimage encoding unit 100. However, the motionvector determination unit 203 is different from the motionvector determination unit 104 in the following respects: when the confirmation result indicated by the confirmation result information received from theconfirmation unit 202 indicates that the encoding mode of the previous block is neither the inter mode nor the skip mode, that is, the encoding mode of the previous block is indicated to be the intra mode, the motionvector determination unit 203 does not select the motion information of the encoding target block in the skip mode. - The
mode judgement unit 204 basically has the same functions as themode judgement unit 107 of theimage encoding apparatus 100. However, themode judgement unit 204 is different from themode judgement unit 107 in the following respects: when the confirmation result indicated by the confirmation result information received from theconfirmation unit 202 indicates that the encoding mode of the previous block is neither the inter mode nor the skip mode, that is, when the encoding mode of the previous block is indicated to be the intra mode, themode judgement unit 204 determines an encoding mode other than the skip mode as the encoding mode of the encoding target block. - In other words, when the encoding mode of the previous block is the intra mode, the
mode judgement unit 204 determines, as the encoding mode of the encoding target block, one of the inter mode and intra mode so as to minimize the code amount of the encoding target block, based on the difference information in the inter mode received from the firstmotion compensation unit 103 and the difference information in the intra mode received from theintra prediction unit 106. - <Operations>
- Next, operations of the image encoding apparatus 200 having the stated structure are described with reference to
FIG. 5 . -
FIG. 5 is a flowchart showing the operations of the image encoding apparatus 200. Since the processing in the steps S01 to S22 are the same as the processing in the corresponding steps of theimage encoding apparatus 100, their description is omitted. In the following, only differences between the image encoding apparatus 200 and theimage encoding apparatus 100 are described. - In the first stage, in parallel with the processing by the motion detection unit 101 (steps S01 to S03), the motion vector speculative calculation unit 201 (i) assumes the encoding mode of the previous block of the encoding target block to be either the inter mode or the skip mode, (ii) calculates the motion information candidates (m2 and m3) for these modes, respectively (step S31), and (iii) transmits the motion information candidates (m2 and m3) to the motion
vector determination unit 203. - Also, the motion vector
speculative calculation unit 201 reads the reference images respectively indicated by the motion information candidates (m2 and m3), from theexternal memory 116 into the internal memory via the DMA controller 115 (step S32). - In the second stage, in parallel with the processing by the first motion compensation unit 103 (steps S06 and S07) and the processing by the intra prediction unit 106 (steps S11 and S12), the confirmation unit 202 (i) confirms whether the encoding mode indicated by the mode information of the previous block received from the
mode judgement unit 204 is either the inter mode or the skip mode (step S33) and (ii) transmits the confirmation result information to the motionvector determination unit 203 and themode judgement unit 204. - If the confirmation result information indicates that the encoding mode of the previous block is either the inter mode or the skip mode (step S33:Y), the motion vector determination unit 203 (i) selects, from among the motion information candidates (m2 and m3) received from the motion vector
speculative calculation unit 201, one motion information candidate corresponding to the encoding mode indicated by the mode information of the previous block received from themode judgement unit 204, as the motion information of the encoding target block in the skip mode (step S34) and (ii) transmits the selected motion information to the secondmotion compensation unit 105. - As described in the first embodiment, the second motion compensation unit 105 (i) generates the prediction image of the encoding target block in the skip mode (step S09), (ii) calculates the difference information indicating the difference between the prediction image of the encoding target block and the original image (step S10), and (iii) transmits the calculated difference information, generated prediction image, and the motion information in the skip mode received from the motion
vector determination unit 203, to themode judgement unit 204. - [0]
- If the confirmation result information indicates that the encoding mode of the previous block is neither the inter mode nor the skip mode, that is to say, the encoding mode of the previous block is indicated to be the intra mode (step S33:N), the processing proceeds to the step S35, which is described later, without performing the processing in the step S34, S09, and S10.
- The
mode judgement unit 204 determines the encoding mode which minimizes the code amount of the encoding target block, as the encoding mode of the encoding target block (step S35) - In other words, in the case where the
mode judgement unit 204 receives, from theconfirmation unit 202, the confirmation result information indicating that the encoding mode of the previous block is either the inter mode or the skip mode, themode judgement unit 204 has received a piece of difference information from each of the firstmotion compensation unit 103, the secondmotion compensation unit 105, and theintra prediction unit 106. Thus, in this case, themode judgement unit 204 determines the encoding mode of the encoding target block based on the three pieces of difference information. - On the other hand, in the case where the encoding mode of the previous block is indicated to be neither the inter mode nor the skip mode, that is to say, the encoding mode of the previous block is indicated to be the intra mode by the confirmation result information received from the
confirmation unit 202, themode judgement unit 204 has not received the difference information from the secondmotion compensation unit 105. Thus, in this case, themode judgement unit 204 determines the encoding mode of the encoding target block based on the two pieces of difference information received from the firstmotion compensation unit 103 and theintra prediction unit 106. - <Additional Particulars>
- Although the image encoding apparatus of the present invention has been described by way of the embodiments above, it is to be noted that the following modifications are possible, and naturally, the present invention is not limited to the image encoding apparatuses described in the embodiments above.
- (1) According to the embodiments, the research area image used by the
motion detection unit 101 to detect the motion vector is the decrypted image of the previous image of the encoding target image which includes the encoding target block. However, it is not limited to the decrypted image of the previous image, and can be the decrypted image of an image which precedes the encoding target image by more than one image. - The search area image is also described to be the entire decrypted image of the previous image of the encoding target image. However, it can be part of the encoding target image. For example, the search area image can be a block and 15 pixels in its vicinity in the decrypted image of the previous image of the encoding target image, the position of the block in the previous image being the same as the position of the encoding target block in the encoding target image.
- (2) According to the embodiments, processing is performed in the four-stage pipeline structure. However, in accordance with the processing capacity of the CPU, for example, the third stage and the fourth stage in the first embodiment can be merged into one stage so that the processing is performed in a three-stage pipeline structure.
- (3) According to the first embodiment, the motion vector
speculative calculation unit 102 calculates the motion information candidates of the encoding target block in the skip mode. However, it may calculate motion information candidates in the spatial direct mode. - It should be noted that, according to the embodiments, in the skip mode, the reference image number is “0” irrespective of the encoding mode of the previous block. However, when using the spatial direct mode, it is different from the skip mode in that, with respect to each direction L0 and L1, the smallest image number among the reference image numbers of the adjacent blocks is selected to be the reference image number of the encoding target block.
- However, when the reference image number of the adjacent blocks is “−1”, that is to say, the encoding mode of the adjacent block is the intra mode, the reference image number of the encoding target block in the spatial direct mode is “0”. It should be noted that the motion vector
speculative calculation unit 201 of the second embodiment can be similarly modified. - (4) According to the second embodiment, the motion vector
speculative calculation unit 201 calculates the motion information candidates (m2 and m3) based on the motion information in the inter mode and in the skip mode, respectively, among the encoding modes selectable by the previous block whose motion information has not been determined. However, it is not limited to this, and the motion vectorspeculative calculation unit 201 may calculate motion information candidates (m1 and m3) of the encoding target block based on the motion information in the intra mode and the skip mode, or may calculate a motion information candidate (m3) of the encoding target block based only on the motion information in the skip mode. - In addition, the number of motion information candidates calculated by the motion vector
speculative calculation unit 201 is not limited, and it can be determined in accordance with the processing ability and the like of the CPU. - (5) According to the embodiments, the processing by the
intra prediction unit 106 is included in the processing in the second stage. However, the processing by theintra prediction unit 106 can be included in the processing in the first stage. That is to say, there is no particular restriction as to in which stage the processing by theintra prediction unit 106 should be included, as long as it is completed before the mode judgement unit 107 (or 204) determines the encoding mode of the encoding target block. - (6) According to the first embodiment, each function of the
motion detection unit 101, motion vectorspeculative calculation unit 102, firstmotion compensation unit 103, motionvector determination unit 104, secondmotion compensation unit 105,intra prediction unit 106,mode judgement unit 107,orthogonal transform unit 108,quantization unit 109,inverse quantization unit 110, inverseorthogonal transform unit 111,addition unit 112,deblocking filter unit 113, and variablelength encoding unit 114 is realized by causing the CPU to execute programs stored in the internal memory. However, part or an entirety of the processing may be realized by hardware. - In this case, each function is typically realized by an LSI, an integrated circuit. These functions may be integrated into a single chip individually, or part or all of these functions may be integrated into a single chip.
- It should be noted that it is the same with each function of the motion vector
speculative calculation unit 201,confirmation unit 202, motionvector determination unit 203, andmode judgement unit 204 in the second embodiment. - (7) It is also possible to circulate/distribute a program for CPU to realize each function of the image encoding apparatus by recording the program on a recording medium or using various communication channels. The recording media can be an IC card, an optical disc, a hard disk, a flexible disc, a ROM or the like. The circulated/distributed program is made available for use by being stored in a memory or the like that can be read by the CPU of an apparatus, and each function of the image encoding apparatuses described in the embodiments is realized by execution of the program by the CPU.
- (8) According to the embodiments, the image encoding apparatuses conform to MPEG4AVC standards. However, they may conform to another compression encoding method such as MPEG4 or the like.
- The image encoding apparatus of the present invention is utilized to speed up encoding processing.
Claims (10)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006-143840 | 2006-05-24 | ||
JP2006143840 | 2006-05-24 | ||
PCT/JP2007/060516 WO2007136088A1 (en) | 2006-05-24 | 2007-05-23 | Image coding device, image coding method, and image coding integrated circuit |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2007/060516 A-371-Of-International WO2007136088A1 (en) | 2006-05-24 | 2007-05-23 | Image coding device, image coding method, and image coding integrated circuit |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/555,825 Continuation US9667972B2 (en) | 2006-05-24 | 2014-11-28 | Image coding device, image coding method, and image coding integrated circuit |
Publications (1)
Publication Number | Publication Date |
---|---|
US20090110077A1 true US20090110077A1 (en) | 2009-04-30 |
Family
ID=38723398
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/301,630 Abandoned US20090110077A1 (en) | 2006-05-24 | 2007-05-23 | Image coding device, image coding method, and image coding integrated circuit |
US14/555,825 Active US9667972B2 (en) | 2006-05-24 | 2014-11-28 | Image coding device, image coding method, and image coding integrated circuit |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/555,825 Active US9667972B2 (en) | 2006-05-24 | 2014-11-28 | Image coding device, image coding method, and image coding integrated circuit |
Country Status (5)
Country | Link |
---|---|
US (2) | US20090110077A1 (en) |
EP (1) | EP2026585A4 (en) |
JP (2) | JPWO2007136088A1 (en) |
CN (2) | CN101455087B (en) |
WO (1) | WO2007136088A1 (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090297053A1 (en) * | 2008-05-29 | 2009-12-03 | Renesas Technology Corp. | Image encoding device and image encoding method |
US20100195922A1 (en) * | 2008-05-23 | 2010-08-05 | Hiroshi Amano | Image decoding apparatus, image decoding method, image coding apparatus, and image coding method |
US20110243234A1 (en) * | 2008-10-02 | 2011-10-06 | Sony Corporation | Image processing apparatus and method |
US20120014451A1 (en) * | 2009-01-15 | 2012-01-19 | Wei Siong Lee | Image Encoding Methods, Image Decoding Methods, Image Encoding Apparatuses, and Image Decoding Apparatuses |
WO2012074344A2 (en) * | 2010-12-03 | 2012-06-07 | 엘지전자 주식회사 | Method for indexing motion information lists and apparatus using same |
WO2013119569A1 (en) * | 2012-02-09 | 2013-08-15 | Google Inc. | Encoding motion vectors for video compression |
US8768081B2 (en) | 2009-07-24 | 2014-07-01 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding images and method and apparatus for decoding images |
US8908767B1 (en) | 2012-02-09 | 2014-12-09 | Google Inc. | Temporal motion vector prediction |
CN104601988A (en) * | 2014-06-10 | 2015-05-06 | 腾讯科技(北京)有限公司 | Video coder, method and device and inter-frame mode selection method and device thereof |
US20150181222A1 (en) * | 2009-02-19 | 2015-06-25 | Sony Corporation | Image processing device and method |
US9094689B2 (en) | 2011-07-01 | 2015-07-28 | Google Technology Holdings LLC | Motion vector prediction design simplification |
US9172970B1 (en) | 2012-05-29 | 2015-10-27 | Google Inc. | Inter frame candidate selection for a video encoder |
US9185428B2 (en) | 2011-11-04 | 2015-11-10 | Google Technology Holdings LLC | Motion vector scaling for non-uniform motion vector grid |
US20150350688A1 (en) * | 2014-05-30 | 2015-12-03 | Apple Inc. | I-frame flashing fix in video encoding and decoding |
US20160080766A1 (en) * | 2008-03-07 | 2016-03-17 | Sk Planet Co., Ltd. | Encoding system using motion estimation and encoding method using motion estimation |
US9313493B1 (en) | 2013-06-27 | 2016-04-12 | Google Inc. | Advanced motion estimation |
US9485515B2 (en) | 2013-08-23 | 2016-11-01 | Google Inc. | Video coding using reference motion vectors |
US9503746B2 (en) | 2012-10-08 | 2016-11-22 | Google Inc. | Determine reference motion vectors |
CN106713931A (en) * | 2010-09-30 | 2017-05-24 | 三菱电机株式会社 | Dynamic image encoding device, dynamic image decoding device, dynamic image encoding method, and dynamic image decoding method |
US10015508B2 (en) | 2015-10-30 | 2018-07-03 | Ntt Electronics Corporation | Video encoding device and video encoding method |
US20190098334A1 (en) * | 2010-04-08 | 2019-03-28 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
RU2685389C1 (en) * | 2010-07-20 | 2019-04-17 | Нтт Докомо, Инк. | Prediction image coding device, prediction image coding method, prediction image coding program, prediction image decoding device, prediction image decoding method, prediction image decoding program |
US11317101B2 (en) | 2012-06-12 | 2022-04-26 | Google Inc. | Inter frame candidate selection for a video encoder |
US20220329808A1 (en) * | 2010-07-08 | 2022-10-13 | Texas Instruments Incorporated | Method and apparatus for sub-picture based raster scanning coding order |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101377527B1 (en) * | 2008-10-14 | 2014-03-25 | 에스케이 텔레콤주식회사 | Method and Apparatus for Encoding and Decoding Motion Vector in Plural Number of Reference Pictures and Video Encoding/Decoding Method and Apparatus Using Same |
KR20140043242A (en) * | 2011-06-30 | 2014-04-08 | 가부시키가이샤 제이브이씨 켄우드 | Image encoding device, image encoding method, image encoding program, image decoding device, image decoding method, and image decoding program |
KR20130050407A (en) | 2011-11-07 | 2013-05-16 | 오수미 | Method for generating motion information in inter prediction mode |
JP5938935B2 (en) * | 2012-02-21 | 2016-06-22 | 富士通株式会社 | Moving picture coding apparatus and moving picture coding method |
JPWO2013136678A1 (en) * | 2012-03-16 | 2015-08-03 | パナソニックIpマネジメント株式会社 | Image decoding apparatus and image decoding method |
US9854261B2 (en) * | 2015-01-06 | 2017-12-26 | Microsoft Technology Licensing, Llc. | Detecting markers in an encoded video signal |
JP6501532B2 (en) * | 2015-01-23 | 2019-04-17 | キヤノン株式会社 | Image coding apparatus, image coding method and program |
JP6329521B2 (en) * | 2015-04-09 | 2018-05-23 | 日本電信電話株式会社 | Reference image buffer |
KR20180111378A (en) * | 2017-03-31 | 2018-10-11 | 주식회사 칩스앤미디어 | A method of video processing providing independent properties between coding tree units and coding units, a method and appratus for decoding and encoding video using the processing. |
CN111050164B (en) * | 2018-10-15 | 2022-05-17 | 华为技术有限公司 | Method and device for encoding and decoding |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030048847A1 (en) * | 1999-04-06 | 2003-03-13 | Broadcom Corporation | Video encoding and video/audio/data multiplexing device |
US20030128762A1 (en) * | 1998-10-29 | 2003-07-10 | Fujitsu Limited | Motion vector encoding device and decoding device |
US20030215014A1 (en) * | 2002-04-10 | 2003-11-20 | Shinichiro Koto | Video encoding method and apparatus and video decoding method and apparatus |
US20040057523A1 (en) * | 2002-01-18 | 2004-03-25 | Shinichiro Koto | Video encoding method and apparatus and video decoding method and apparatus |
US20040146108A1 (en) * | 2003-01-23 | 2004-07-29 | Shih-Chang Hsia | MPEG-II video encoder chip design |
US20060222074A1 (en) * | 2005-04-01 | 2006-10-05 | Bo Zhang | Method and system for motion estimation in a video encoder |
US20070206675A1 (en) * | 2004-08-04 | 2007-09-06 | Takeshi Tanaka | Image Decoding Device |
US20070286281A1 (en) * | 2004-02-25 | 2007-12-13 | Toshiharu Tsuchiya | Picture Information Encoding Apparatus and Picture Information Encoding Method |
US7320522B2 (en) * | 2003-12-17 | 2008-01-22 | Sony Corporatin | Data processing apparatus and method and encoding device of same |
US20110293007A1 (en) * | 2002-08-08 | 2011-12-01 | Kiyofumi Abe | Moving picture coding method and moving picture decoding method |
US20110293016A1 (en) * | 2002-07-15 | 2011-12-01 | Yoshinori Suzuki | Moving Picture Encoding Method And Decoding Method |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000244925A (en) * | 1999-02-24 | 2000-09-08 | Fuji Xerox Co Ltd | Picture processor and its processing method |
US6934335B2 (en) * | 2000-12-11 | 2005-08-23 | Sony Corporation | Video encoder with embedded scene change and 3:2 pull-down detections |
JP4193406B2 (en) * | 2002-04-16 | 2008-12-10 | 三菱電機株式会社 | Video data conversion apparatus and video data conversion method |
AU2003284958A1 (en) * | 2003-01-10 | 2004-08-10 | Thomson Licensing S.A. | Fast mode decision making for interframe encoding |
JP2005005844A (en) * | 2003-06-10 | 2005-01-06 | Hitachi Ltd | Computation apparatus and coding processing program |
CN1225127C (en) * | 2003-09-12 | 2005-10-26 | 中国科学院计算技术研究所 | A coding/decoding end bothway prediction method for video coding |
JP4577048B2 (en) * | 2004-03-11 | 2010-11-10 | パナソニック株式会社 | Image coding method, image coding apparatus, and image coding program |
US8111752B2 (en) * | 2004-06-27 | 2012-02-07 | Apple Inc. | Encoding mode pruning during video encoding |
JP4501631B2 (en) * | 2004-10-26 | 2010-07-14 | 日本電気株式会社 | Image coding apparatus and method, computer program for image coding apparatus, and portable terminal |
-
2007
- 2007-05-23 CN CN2007800189789A patent/CN101455087B/en not_active Expired - Fee Related
- 2007-05-23 JP JP2008516716A patent/JPWO2007136088A1/en active Pending
- 2007-05-23 CN CN201010548570.6A patent/CN101998120B/en not_active Expired - Fee Related
- 2007-05-23 US US12/301,630 patent/US20090110077A1/en not_active Abandoned
- 2007-05-23 WO PCT/JP2007/060516 patent/WO2007136088A1/en active Application Filing
- 2007-05-23 EP EP07743950.3A patent/EP2026585A4/en not_active Withdrawn
-
2013
- 2013-07-25 JP JP2013154349A patent/JP5617014B2/en not_active Expired - Fee Related
-
2014
- 2014-11-28 US US14/555,825 patent/US9667972B2/en active Active
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030128762A1 (en) * | 1998-10-29 | 2003-07-10 | Fujitsu Limited | Motion vector encoding device and decoding device |
US20030048847A1 (en) * | 1999-04-06 | 2003-03-13 | Broadcom Corporation | Video encoding and video/audio/data multiplexing device |
US20040057523A1 (en) * | 2002-01-18 | 2004-03-25 | Shinichiro Koto | Video encoding method and apparatus and video decoding method and apparatus |
US20030215014A1 (en) * | 2002-04-10 | 2003-11-20 | Shinichiro Koto | Video encoding method and apparatus and video decoding method and apparatus |
US20110293016A1 (en) * | 2002-07-15 | 2011-12-01 | Yoshinori Suzuki | Moving Picture Encoding Method And Decoding Method |
US20110293007A1 (en) * | 2002-08-08 | 2011-12-01 | Kiyofumi Abe | Moving picture coding method and moving picture decoding method |
US20040146108A1 (en) * | 2003-01-23 | 2004-07-29 | Shih-Chang Hsia | MPEG-II video encoder chip design |
US7320522B2 (en) * | 2003-12-17 | 2008-01-22 | Sony Corporatin | Data processing apparatus and method and encoding device of same |
US20070286281A1 (en) * | 2004-02-25 | 2007-12-13 | Toshiharu Tsuchiya | Picture Information Encoding Apparatus and Picture Information Encoding Method |
US20070206675A1 (en) * | 2004-08-04 | 2007-09-06 | Takeshi Tanaka | Image Decoding Device |
US20060222074A1 (en) * | 2005-04-01 | 2006-10-05 | Bo Zhang | Method and system for motion estimation in a video encoder |
Cited By (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160080766A1 (en) * | 2008-03-07 | 2016-03-17 | Sk Planet Co., Ltd. | Encoding system using motion estimation and encoding method using motion estimation |
US20100195922A1 (en) * | 2008-05-23 | 2010-08-05 | Hiroshi Amano | Image decoding apparatus, image decoding method, image coding apparatus, and image coding method |
US9319698B2 (en) | 2008-05-23 | 2016-04-19 | Panasonic Intellectual Property Management Co., Ltd. | Image decoding apparatus for decoding a target block by referencing information of an already decoded block in a neighborhood of the target block |
US8897583B2 (en) * | 2008-05-23 | 2014-11-25 | Panasonic Corporation | Image decoding apparatus for decoding a target block by referencing information of an already decoded block in a neighborhood of the target block |
US8315467B2 (en) * | 2008-05-29 | 2012-11-20 | Renesas Electronics Corporation | Image encoding device and image encoding method |
US20090297053A1 (en) * | 2008-05-29 | 2009-12-03 | Renesas Technology Corp. | Image encoding device and image encoding method |
US8831103B2 (en) * | 2008-10-02 | 2014-09-09 | Sony Corporation | Image processing apparatus and method |
US20110243234A1 (en) * | 2008-10-02 | 2011-10-06 | Sony Corporation | Image processing apparatus and method |
US20120014451A1 (en) * | 2009-01-15 | 2012-01-19 | Wei Siong Lee | Image Encoding Methods, Image Decoding Methods, Image Encoding Apparatuses, and Image Decoding Apparatuses |
US10334244B2 (en) | 2009-02-19 | 2019-06-25 | Sony Corporation | Image processing device and method for generation of prediction image |
US9872020B2 (en) | 2009-02-19 | 2018-01-16 | Sony Corporation | Image processing device and method for generating prediction image |
US9462294B2 (en) * | 2009-02-19 | 2016-10-04 | Sony Corporation | Image processing device and method to enable generation of a prediction image |
US20150181222A1 (en) * | 2009-02-19 | 2015-06-25 | Sony Corporation | Image processing device and method |
US10931944B2 (en) | 2009-02-19 | 2021-02-23 | Sony Corporation | Decoding device and method to generate a prediction image |
US8768081B2 (en) | 2009-07-24 | 2014-07-01 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding images and method and apparatus for decoding images |
US8885958B2 (en) | 2009-07-24 | 2014-11-11 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding images and method and apparatus for decoding images |
US9516317B2 (en) | 2009-07-24 | 2016-12-06 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding images and method and apparatus for decoding images |
US9131231B2 (en) | 2009-07-24 | 2015-09-08 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding images and method and apparatus for decoding images |
US9131232B2 (en) | 2009-07-24 | 2015-09-08 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding images and method and apparatus for decoding images |
US9137534B2 (en) | 2009-07-24 | 2015-09-15 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding images and method and apparatus for decoding images |
US20190149840A1 (en) * | 2010-04-08 | 2019-05-16 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US10560717B2 (en) | 2010-04-08 | 2020-02-11 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US20190098334A1 (en) * | 2010-04-08 | 2019-03-28 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US11889107B2 (en) * | 2010-04-08 | 2024-01-30 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US10542281B2 (en) * | 2010-04-08 | 2020-01-21 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US10715828B2 (en) | 2010-04-08 | 2020-07-14 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US20220132161A1 (en) * | 2010-04-08 | 2022-04-28 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US11265574B2 (en) | 2010-04-08 | 2022-03-01 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US10999597B2 (en) | 2010-04-08 | 2021-05-04 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US10779001B2 (en) * | 2010-04-08 | 2020-09-15 | Kabushiki Kaisha Toshiba | Image encoding method and image decoding method |
US20220329808A1 (en) * | 2010-07-08 | 2022-10-13 | Texas Instruments Incorporated | Method and apparatus for sub-picture based raster scanning coding order |
US11800109B2 (en) * | 2010-07-08 | 2023-10-24 | Texas Instruments Incorporated | Method and apparatus for sub-picture based raster scanning coding order |
US20240048707A1 (en) * | 2010-07-08 | 2024-02-08 | Texas Instruments Incorporated | Sub-picture based raster scanning coding order |
RU2685389C1 (en) * | 2010-07-20 | 2019-04-17 | Нтт Докомо, Инк. | Prediction image coding device, prediction image coding method, prediction image coding program, prediction image decoding device, prediction image decoding method, prediction image decoding program |
RU2685388C1 (en) * | 2010-07-20 | 2019-04-17 | Нтт Докомо, Инк. | Prediction image coding apparatus, prediction image encoding method, predictive image coding program, prediction image decoding device, prediction image decoding method and image predictive decoding program |
RU2685393C1 (en) * | 2010-07-20 | 2019-04-17 | Нтт Докомо, Инк. | Prediction image coding device, prediction image coding method, prediction image coding program, prediction image decoding device, method for decoding images with prediction, prediction image decoding program |
RU2685390C1 (en) * | 2010-07-20 | 2019-04-17 | Нтт Докомо, Инк. | Prediction image coding device, prediction image coding method, prediction image coding program, prediction image decoding device, prediction image decoding method, prediction image decoding program |
CN106713931A (en) * | 2010-09-30 | 2017-05-24 | 三菱电机株式会社 | Dynamic image encoding device, dynamic image decoding device, dynamic image encoding method, and dynamic image decoding method |
CN106713930A (en) * | 2010-09-30 | 2017-05-24 | 三菱电机株式会社 | Dynamic image encoding device, dynamic image decoding device, dynamic image encoding method, and dynamic image decoding method |
WO2012074344A2 (en) * | 2010-12-03 | 2012-06-07 | 엘지전자 주식회사 | Method for indexing motion information lists and apparatus using same |
WO2012074344A3 (en) * | 2010-12-03 | 2012-07-26 | 엘지전자 주식회사 | Method for indexing motion information lists and apparatus using same |
US9094689B2 (en) | 2011-07-01 | 2015-07-28 | Google Technology Holdings LLC | Motion vector prediction design simplification |
US9185428B2 (en) | 2011-11-04 | 2015-11-10 | Google Technology Holdings LLC | Motion vector scaling for non-uniform motion vector grid |
WO2013119569A1 (en) * | 2012-02-09 | 2013-08-15 | Google Inc. | Encoding motion vectors for video compression |
US20130208795A1 (en) * | 2012-02-09 | 2013-08-15 | Google Inc. | Encoding motion vectors for video compression |
US8908767B1 (en) | 2012-02-09 | 2014-12-09 | Google Inc. | Temporal motion vector prediction |
US9172970B1 (en) | 2012-05-29 | 2015-10-27 | Google Inc. | Inter frame candidate selection for a video encoder |
US11317101B2 (en) | 2012-06-12 | 2022-04-26 | Google Inc. | Inter frame candidate selection for a video encoder |
US9503746B2 (en) | 2012-10-08 | 2016-11-22 | Google Inc. | Determine reference motion vectors |
US9313493B1 (en) | 2013-06-27 | 2016-04-12 | Google Inc. | Advanced motion estimation |
US9485515B2 (en) | 2013-08-23 | 2016-11-01 | Google Inc. | Video coding using reference motion vectors |
US10986361B2 (en) | 2013-08-23 | 2021-04-20 | Google Llc | Video coding using reference motion vectors |
US20150350688A1 (en) * | 2014-05-30 | 2015-12-03 | Apple Inc. | I-frame flashing fix in video encoding and decoding |
CN104601988A (en) * | 2014-06-10 | 2015-05-06 | 腾讯科技(北京)有限公司 | Video coder, method and device and inter-frame mode selection method and device thereof |
US10015508B2 (en) | 2015-10-30 | 2018-07-03 | Ntt Electronics Corporation | Video encoding device and video encoding method |
Also Published As
Publication number | Publication date |
---|---|
CN101998120B (en) | 2014-07-30 |
CN101455087B (en) | 2011-01-12 |
JPWO2007136088A1 (en) | 2009-10-01 |
WO2007136088A1 (en) | 2007-11-29 |
CN101998120A (en) | 2011-03-30 |
US9667972B2 (en) | 2017-05-30 |
US20150085918A1 (en) | 2015-03-26 |
EP2026585A4 (en) | 2016-08-31 |
JP5617014B2 (en) | 2014-10-29 |
EP2026585A1 (en) | 2009-02-18 |
JP2013258732A (en) | 2013-12-26 |
CN101455087A (en) | 2009-06-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9667972B2 (en) | Image coding device, image coding method, and image coding integrated circuit | |
US8218635B2 (en) | Systolic-array based systems and methods for performing block matching in motion compensation | |
US8488678B2 (en) | Moving image encoding apparatus and moving image encoding method | |
US8731044B2 (en) | Moving-picture processing apparatus | |
US20060002470A1 (en) | Motion vector detection circuit, image encoding circuit, motion vector detection method and image encoding method | |
US9167260B2 (en) | Apparatus and method for video processing | |
GB2378345A (en) | Method for scanning a reference macroblock window in a search area | |
US20060002472A1 (en) | Various methods and apparatuses for motion estimation | |
KR20080112568A (en) | Method and apparatus for motion estimation | |
WO2007057985A1 (en) | Image encoding apparatus and image encoding method | |
US20080031335A1 (en) | Motion Detection Device | |
CN101185339B (en) | Image decoding apparatus and method for decoding image data, device and method for encoding image | |
CN103517071B (en) | Image encoding apparatus and method for encoding images | |
JP4377693B2 (en) | Image data search | |
JP4597103B2 (en) | Motion vector search method and apparatus | |
US6665340B1 (en) | Moving picture encoding/decoding system, moving picture encoding/decoding apparatus, moving picture encoding/decoding method, and recording medium | |
JP5182285B2 (en) | Decoding method and decoding apparatus | |
JP5906993B2 (en) | Encoding apparatus, encoding method, and program | |
JP2006279330A (en) | Motion compensation processing method | |
US8908777B2 (en) | Memory request ordering for a motion compensation process, implemented by a picture processing apparatus, a picture processing method, and a picture processing program | |
JPH1196138A (en) | Inverse cosine transform method and inverse cosine transformer | |
US8948263B2 (en) | Read/write separation in video request manager | |
JP4032049B2 (en) | Motion vector detection method and apparatus | |
JP2006333100A (en) | Image coding unit | |
CN107431821A (en) | High-efficient low-complexity video compress |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: PANASONIC CORPORATION, JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:AMANO, HIROSHI;TANAKA, TAKESHI;MAEDA, MASAKI;AND OTHERS;REEL/FRAME:021926/0263;SIGNING DATES FROM 20081021 TO 20081029 |
|
AS | Assignment |
Owner name: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD., JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PANASONIC CORPORATION;REEL/FRAME:034194/0143 Effective date: 20141110 Owner name: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LT Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PANASONIC CORPORATION;REEL/FRAME:034194/0143 Effective date: 20141110 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |
|
AS | Assignment |
Owner name: SOVEREIGN PEAK VENTURES, LLC, TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.;REEL/FRAME:048268/0916 Effective date: 20181012 |
|
AS | Assignment |
Owner name: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD., JAPAN Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE ERRONEOUSLY FILED APPLICATION NUMBERS 13/384239, 13/498734, 14/116681 AND 14/301144 PREVIOUSLY RECORDED ON REEL 034194 FRAME 0143. ASSIGNOR(S) HEREBY CONFIRMS THE ASSIGNMENT;ASSIGNOR:PANASONIC CORPORATION;REEL/FRAME:056788/0362 Effective date: 20141110 |