WO2007004678A1 - Moving picture encoding device, moving picture encoding method, moving picture encoding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program - Google Patents
Moving picture encoding device, moving picture encoding method, moving picture encoding program, moving picture decoding device, moving picture decoding method, and moving picture decoding program
- Publication number
- WO2007004678A1 (PCT/JP2006/313416)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- block
- decoding
- encoding
- decoded
- signal
- Prior art date
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/573—Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/537—Motion estimation other than block-based
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/587—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/593—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- Moving picture encoding apparatus, moving picture encoding method, moving picture encoding program, moving picture decoding apparatus, moving picture decoding method, and moving picture decoding program
- The present invention relates to a video encoding device, a video encoding method, a video encoding program, a video decoding device, a video decoding method, and a video decoding program.
- For example, the H.264 moving image encoding method, which is an international standard recommended by the ITU-T (International Telecommunication Union Telecommunication Standardization Sector), is used.
- The technology related to the H.264 moving picture coding system is disclosed in, for example, Non-Patent Document 1 below.
- Patent Document 1: Japanese Patent Laid-Open No. 2-62180
- Non-Patent Document 1: Kakuno et al., "H.264/AVC Textbook", Impress Standard Textbook Series
- Disclosure of the Invention
- The present invention has been made in view of the above circumstances, and an object thereof is to provide a moving image encoding device, a moving image encoding method, a moving image encoding program, a moving image decoding device, a moving image decoding method, and a moving image decoding program that enable more efficient encoding than when inter-frame prediction is performed using a motion vector.
- A moving image encoding apparatus according to the present invention is a moving image encoding apparatus that encodes moving image data in units of blocks, and comprises: dividing means for dividing a frame image constituting the moving image data into a plurality of encoding target blocks; encoding means for encoding the encoding target block; reproduced image generating means for generating a decoded block that is a reproduction signal of the encoding target block; storage means for storing reproduced moving image data generated from the reproduction signal; and prediction signal generating means for generating a prediction block, which is a prediction signal of the encoding target block, using a template generated from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored by the storage means. The encoding means subtracts the prediction block from the encoding target block in units of pixels to generate a difference block that is a difference signal of the encoding target block, and encodes the difference block; the reproduced image generating means generates a decoded difference block that is a reproduction signal of the difference block encoded by the encoding means, and adds the decoded difference block and the prediction block in units of pixels to generate a decoded block.
- According to this configuration, a prediction block, which is a prediction signal of the encoding target block, is generated using a template that is generated from a reproduction signal adjacent to the encoding target block in a predetermined positional relationship and belonging to the reproduced moving picture data.
- Encoding is then performed using this prediction block. That is, according to the moving picture encoding apparatus of the present invention, a prediction block serving as a prediction signal can be generated without using a motion vector, thereby enabling efficient encoding.
- A moving picture encoding apparatus according to the present invention is a moving picture encoding apparatus that encodes moving picture data in units of blocks, and comprises: dividing means for dividing a frame image constituting the moving picture data into a plurality of encoding target blocks; encoding means for encoding the encoding target block; reproduced image generating means for generating a decoded block that is a reproduction signal of the encoding target block; storage means for storing reproduced moving picture data generated from the reproduction signal; search means for searching the reproduced moving picture data stored by the storage means for a pixel group having a high correlation with a template generated from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving picture data stored by the storage means; and prediction signal determining means for determining a prediction block, which is a prediction signal of the encoding target block, from the reproduced moving picture data stored by the storage means based on the pixel group searched by the search means and the predetermined positional relationship. The encoding means subtracts the prediction block from the encoding target block in units of pixels to generate a difference block that is a difference signal of the encoding target block, and encodes the difference block; the reproduced image generating means generates a decoded difference block that is a reproduction signal of the difference block encoded by the encoding means, and adds the decoded difference block and the prediction block in units of pixels to generate a decoded block. According to this configuration, a pixel group having a high correlation with the template is searched for in the reproduced moving picture data, and a prediction block is determined based on the searched pixel group and the predetermined positional relationship. A prediction block can therefore be determined with certainty, and the present invention can be implemented reliably.
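- For a concrete picture of this template-matching prediction (an illustration only, not the normative definition in the patent), the following Python sketch builds an inverse-L template from already-reproduced pixels adjacent to the target block, searches the reproduced frame for the position whose template gives the smallest SAD, and takes the block in the same positional relationship to the matched template as the prediction block. The array layout, block and template sizes, the SAD cost, and the restriction of the search to rows above the current block are assumptions made for the sketch.

```python
import numpy as np

def template_pixels(recon, top, left, block, tpl):
    """Inverse-L template: `tpl` rows above and `tpl` columns to the left of a
    block whose top-left corner is at (top, left)."""
    above = recon[top - tpl:top, left - tpl:left + block]   # rows above the block
    side = recon[top:top + block, left - tpl:left]          # columns to the left
    return np.concatenate([above.ravel(), side.ravel()])

def predict_block_by_template_matching(recon, top, left, block=8, tpl=4):
    """Predict the block at (top, left) using already-reproduced pixels only."""
    target_tpl = template_pixels(recon, top, left, block, tpl)
    best_cost, best_pos = None, None
    # Candidate blocks are restricted to rows fully above the current block so
    # that both their templates and their pixels are already reproduced.
    for y in range(tpl, top - block + 1):
        for x in range(tpl, recon.shape[1] - block + 1):
            cand_tpl = template_pixels(recon, y, x, block, tpl)
            cost = np.abs(target_tpl - cand_tpl).sum()       # SAD matching cost
            if best_cost is None or cost < best_cost:
                best_cost, best_pos = cost, (y, x)
    if best_pos is None:                                      # no candidate available
        return np.full((block, block), target_tpl.mean())
    by, bx = best_pos
    # The prediction block keeps the same positional relationship to the matched
    # template that the target block keeps to its own template.
    return recon[by:by + block, bx:bx + block].copy()
```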
- Preferably, the moving image encoding apparatus further comprises: estimation means for comparing the template with the reproduced moving image data stored by the storage means and estimating the spatial continuity of the image of the encoding target block based on the comparison result; and setting means for further dividing the encoding target block based on the spatial continuity of the image estimated by the estimation means, setting the divided encoding target blocks as new encoding target blocks, and setting templates for the new encoding target blocks. According to this configuration, the size of the prediction block can be appropriately selected based on the spatial continuity of the reproduced image data, so that encoding efficiency is improved even for moving image data in which the amount of motion changes drastically. In addition, the prediction performance of the prediction signal is improved by changing the shape and size of the template region and the prediction region according to the characteristics of the signal.
- A moving image decoding apparatus according to the present invention is a moving image decoding apparatus that reproduces encoded data of moving image data into reproduced moving image data in units of blocks, and comprises: decoding means for decoding encoded data required for reproduction of a decoding target block to be decoded; reproduced image generating means for generating, from the encoded data decoded by the decoding means, a decoded block that is a reproduction signal of the decoding target block; storage means for storing reproduced moving image data generated from the reproduction signal; and prediction signal generating means for generating a prediction block, which is a prediction signal of the decoding target block, using a template generated from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored by the storage means. The decoding means generates a decoded difference block that is a difference signal of the decoding target block, and the reproduced image generating means adds the decoded difference block and the prediction block in units of pixels to generate a decoded block.
- With the video decoding device according to the present invention, a prediction block can be generated and the video decoded in the same manner as in the above-described video encoding device. That is, according to the moving picture decoding apparatus of the present invention, moving picture data that has been efficiently encoded by the above moving picture encoding device can be correctly decoded.
- A moving image decoding apparatus according to the present invention is a moving image decoding apparatus that reproduces encoded data of moving image data into reproduced moving image data in units of blocks, and comprises: decoding means for decoding encoded data required for reproduction of a decoding target block to be decoded; reproduced image generating means for generating, from the encoded data decoded by the decoding means, a decoded block that is a reproduction signal of the decoding target block; storage means for storing reproduced moving image data generated from the reproduction signal; search means for searching the reproduced moving image data stored by the storage means for a pixel group having a high correlation with a template generated from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored by the storage means; and prediction signal determining means for determining a prediction block, which is a prediction signal of the decoding target block, from the reproduced moving image data stored by the storage means based on the pixel group searched by the search means and the predetermined positional relationship. The decoding means generates a decoded difference block that is a difference signal of the decoding target block, and the reproduced image generating means adds the decoded difference block and the prediction block in units of pixels to generate a decoded block.
- Preferably, the moving image decoding apparatus further comprises: estimation means for comparing the template with the reproduced moving image data stored by the storage means and estimating the spatial continuity of the image of the decoding target block based on the comparison result; and setting means for further dividing the decoding target block based on the spatial continuity of the image estimated by the estimation means, setting the divided decoding target blocks as new decoding target blocks, and setting templates for the new decoding target blocks. According to this configuration, it is possible to correctly decode the moving image data encoded by the above moving image encoding device.
- In the moving image encoding apparatus, it is preferable that the encoding means generates, from the difference block, a reduced difference block having a smaller number of pixels than the difference block by a reduction process that reduces the number of pixels by a predetermined method, and encodes the reduced difference block, and that the reproduced image generation means generates a decoded reduced difference block that is a reproduction signal of the reduced difference block and generates a decoded block from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method.
- According to this configuration, the reduced difference block to be encoded has a reduced number of pixels, so that the code amount of the prediction signal can be efficiently reduced, without lowering image quality, both in areas with strong characteristics where the prediction performance is low and in flat areas.
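- As an illustration of the reduction and enlargement processes (the patent leaves the concrete subsampling and interpolation methods open, so the 2x2 averaging and pixel repetition below are assumptions), the following sketch shrinks the difference block before encoding and enlarges the decoded reduced difference block on the reproduction side:

```python
import numpy as np

def reduce_block(block):
    """Assumed reduction: halve each dimension by averaging 2x2 pixel groups."""
    h, w = block.shape
    return block.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def enlarge_block(block):
    """Assumed enlargement: repeat each pixel 2x2 to restore the original size."""
    return np.repeat(np.repeat(block, 2, axis=0), 2, axis=1)

# Encoder side (sketch): fewer pixels in the reduced difference block mean fewer
# transform coefficients to encode.
difference_block = np.random.randn(8, 8)
reduced_difference = reduce_block(difference_block)              # 4x4

# Reproduction side (sketch): stand-in for decoding the encoded reduced block,
# then enlarging it back to the block size before adding the prediction.
decoded_reduced_difference = reduced_difference
decoded_difference = enlarge_block(decoded_reduced_difference)   # 8x8
```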
- In the moving image decoding apparatus, it is preferable that the decoding means generates, by decoding the encoded data, a decoded reduced difference block having a smaller number of pixels than the difference block, and that the reproduced image generation means generates a decoded block from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method. According to this configuration, it is possible to correctly decode the moving image data encoded by the above moving image encoding device.
- A moving image encoding apparatus according to the present invention is a moving image encoding apparatus that encodes moving image data in units of blocks, and comprises: a dividing unit that divides a frame image constituting the moving image data into a plurality of encoding target blocks; an encoding unit that encodes the encoding target block; a reproduced image generation unit that generates a decoded block that is a reproduction signal of the encoding target block; a storage unit that stores reproduced moving image data generated from the reproduction signal; and a prediction signal generation unit that generates, by a predetermined method, a prediction block that is a prediction signal of the encoding target block. The encoding unit reduces the difference block, which is the difference signal obtained by subtracting the prediction block from the encoding target block in units of pixels, by a reduction process that reduces the number of pixels by a predetermined method, thereby generating a reduced difference block having a smaller number of pixels than the difference block, and encodes the reduced difference block; the reproduced image generation unit generates a decoded reduced difference block that is a reproduction signal of the reduced difference block, and generates a decoded block from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method.
- According to this configuration, a prediction block, which is a prediction signal of the encoding target block, is generated from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving image data.
- Then, a reduced difference block to be encoded, which has a smaller number of pixels than the above-described difference block, is generated from the prediction block. That is, according to the moving picture encoding apparatus of the present invention, the reduced difference block to be encoded has a smaller number of pixels, so that the code amount of the prediction signal can be efficiently reduced without degrading image quality.
- A moving picture decoding apparatus according to the present invention is a moving picture decoding apparatus that reproduces encoded data of moving picture data into reproduced moving picture data in units of blocks, and comprises: decoding means for decoding encoded data required for reproduction of a decoding target block to be decoded; reproduced image generation means for generating, from the encoded data decoded by the decoding means, a decoded block that is a reproduction signal of the decoding target block; storage means for storing reproduced moving picture data generated from the reproduction signal; and a prediction signal generation unit that generates, by a predetermined method, a prediction block that is a prediction signal of the decoding target block. The decoding means decodes the encoded data to generate a decoded reduced difference block having a smaller number of pixels than the difference block that is the difference signal of the decoding target block, and the reproduced image generation means generates a decoded block from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method. According to this configuration, it is possible to correctly decode the moving picture data encoded by the above moving picture encoding device.
- In the moving picture encoding apparatus, it is preferable that the encoding means applies a reduction process to the encoding target block and the prediction block to obtain a reduced block and a reduced prediction block, and subtracts the reduced prediction block from the reduced block in units of pixels to generate a reduced difference block, and that the reproduced image generating means generates a decoded reduced difference block that is a reproduction signal of the reduced difference block encoded by the encoding means, adds the decoded reduced difference block and the reduced prediction block in units of pixels to generate a decoded reduced block, and applies an enlargement process to the decoded reduced block to generate a decoded block. According to this configuration, the reduced difference block to be encoded can be generated with certainty, so that the present invention can be implemented reliably.
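- The variant described above, in which the reduction is applied to the encoding target block and the prediction block before the subtraction, can be sketched as follows (illustrative only; reduce_block and enlarge_block stand for whatever predetermined reduction and enlargement methods are used, for example the ones sketched earlier):

```python
def encode_in_reduced_domain(target_block, prediction_block, reduce_block):
    """Encoder side: reduce both signals, then subtract in the reduced domain."""
    reduced_block = reduce_block(target_block)
    reduced_prediction = reduce_block(prediction_block)
    reduced_difference = reduced_block - reduced_prediction
    # ... transform, quantize and entropy-encode reduced_difference here ...
    return reduced_difference

def reproduce_from_reduced_domain(decoded_reduced_difference, prediction_block,
                                  reduce_block, enlarge_block):
    """Reproduction side: add in the reduced domain, then enlarge the result."""
    reduced_prediction = reduce_block(prediction_block)
    decoded_reduced_block = decoded_reduced_difference + reduced_prediction
    return enlarge_block(decoded_reduced_block)   # decoded block at original size
```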
- It is also preferable that the encoding means generates a reduced difference block by applying a reduction process to the difference block, and that the reproduced image generation means generates a decoded reduced difference block that is a reproduction signal of the reduced difference block encoded by the encoding means, generates a decoded difference block by applying an enlargement process to the decoded reduced difference block, and adds the decoded difference block and the prediction block in units of pixels to generate a decoded block. According to this configuration, the reduced difference block to be encoded can be generated reliably, so that the present invention can be implemented reliably.
- In the moving picture decoding apparatus, it is preferable that the reproduced image generation means applies a reduction process to the prediction block to obtain a reduced prediction block, adds the decoded reduced difference block and the reduced prediction block in units of pixels to generate a decoded reduced block, and generates a decoded block by applying an enlargement process to the decoded reduced block. According to this configuration, the moving image data encoded by the above-described moving image encoding device can be correctly decoded.
- It is also preferable that the reproduced image generation means generates a decoded difference block by applying an enlargement process to the decoded reduced difference block, and adds the decoded difference block and the prediction block in units of pixels to generate a decoded block. According to this configuration, the moving image data encoded by the above moving image encoding device can be correctly decoded.
- In the moving picture encoding apparatus, it is preferable that the prediction signal determining means selects one template from a plurality of templates having different shapes. According to this configuration, a prediction block can be generated efficiently and the efficiency of the encoding process can be improved.
- It is also preferable that the prediction signal determining means selects one template with reference to the reproduction signal of the reproduced moving image data stored by the storage means or to information related to the reproduction signal. According to this configuration, a template can be selected appropriately.
- It is further preferable that the encoding means encodes information specifying the template selected by the prediction signal determining means. According to this configuration, selection of a template in the moving picture decoding apparatus is facilitated, enabling more efficient decoding.
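- One way to picture the selection of one template from several shapes (an illustration only; the candidate shapes and the cost function below are assumptions, not values fixed by the patent): evaluate each candidate shape against the reproduced signal and keep the one with the lowest matching cost. The index of the chosen shape is then what the encoding means can encode so that the decoder can reproduce the same choice.

```python
# Candidate template shapes as lists of (dy, dx) offsets relative to the top-left
# corner of the target block; negative offsets point into reproduced pixels.
TEMPLATE_SHAPES = {
    0: [(-1, dx) for dx in range(8)],                                   # row above
    1: [(dy, -1) for dy in range(8)],                                   # column to the left
    2: [(-1, dx) for dx in range(8)] + [(dy, -1) for dy in range(8)],   # inverse-L
}

def choose_template_shape(recon, top, left, match_cost):
    """Return the shape id and offsets minimizing an externally supplied cost,
    e.g. the best SAD found when searching the reproduced data with that shape."""
    best = None
    for shape_id, offsets in TEMPLATE_SHAPES.items():
        pixels = [recon[top + dy][left + dx] for dy, dx in offsets]
        cost = match_cost(shape_id, pixels)
        if best is None or cost < best[0]:
            best = (cost, shape_id, offsets)
    return best[1], best[2]
```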
- In the moving image decoding apparatus, it is preferable that the prediction signal determining means selects one template from a plurality of templates having different shapes. According to this configuration, the moving image data encoded by the above moving image encoding device can be correctly decoded.
- It is also preferable that the prediction signal determining means selects one template with reference to the reproduction signal of the reproduced moving image data stored by the storage means or to information related to the reproduction signal. According to this configuration, it is possible to correctly decode the moving image data encoded by the above moving image encoding device.
- It is further preferable that the decoding means decodes information specifying the selected template, and that the prediction signal determining means selects one template from a plurality of templates having different shapes with reference to the information, decoded by the decoding means, that specifies the selected template. According to this configuration, it is possible to correctly decode the moving image data encoded by the above-described moving image encoding device.
- The present invention can be described not only as the inventions of the moving image encoding device and the moving image decoding device as above, but also, as follows, as the inventions of a moving image encoding method, a moving image encoding program, a moving image decoding method, and a moving image decoding program. These are substantially the same inventions, differing only in category and the like, and provide the same operations and effects.
- A moving image encoding method according to the present invention is a moving image encoding method in a moving image encoding apparatus that encodes moving image data in units of blocks, and comprises: a dividing step of dividing a frame image constituting the moving image data into a plurality of encoding target blocks; an encoding step of encoding the encoding target block; a reproduced image generation step of generating a decoded block that is a reproduction signal of the encoding target block; a storage step of storing reproduced moving image data generated from the reproduction signal; and a prediction signal generation step of generating a prediction block, which is a prediction signal of the encoding target block, using a template generated from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored in the storage step. In the encoding step, the prediction block is subtracted from the encoding target block in units of pixels to generate a difference block that is a difference signal of the encoding target block, and the difference block is encoded; in the reproduced image generation step, a decoded difference block that is a reproduction signal of the difference block encoded in the encoding step is generated, and the decoded difference block and the prediction block are added in units of pixels to generate a decoded block.
- A moving image encoding method according to the present invention is a moving image encoding method in a moving image encoding apparatus that encodes moving image data in units of blocks, and comprises: a dividing step of dividing a frame image constituting the moving image data into a plurality of encoding target blocks; an encoding step of encoding the encoding target block; a reproduced image generation step of generating a decoded block that is a reproduction signal of the encoding target block; a storage step of storing reproduced moving image data generated from the reproduction signal; a search step of searching the reproduced moving image data stored in the storage step for a pixel group having a high correlation with a template generated from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored in the storage step; and a prediction signal determination step of determining a prediction block, which is a prediction signal of the encoding target block, from the reproduced moving image data stored in the storage step based on the pixel group searched in the search step and the predetermined positional relationship. In the encoding step, the prediction block is subtracted from the encoding target block in units of pixels to generate a difference block that is a difference signal of the encoding target block, and the difference block is encoded; in the reproduced image generation step, a decoded difference block that is a reproduction signal of the difference block encoded in the encoding step is generated, and the decoded difference block and the prediction block are added in units of pixels to generate a decoded block.
- A moving picture decoding method according to the present invention is a moving picture decoding method in a moving picture decoding apparatus that reproduces encoded data of moving picture data into reproduced moving picture data in units of blocks, and comprises: a decoding step of decoding encoded data required for reproduction of a decoding target block to be decoded; a reproduced image generation step of generating, from the encoded data decoded in the decoding step, a decoded block that is a reproduction signal of the decoding target block; a storage step of storing reproduced moving picture data generated from the reproduction signal; and a prediction signal generation step of generating a prediction block, which is a prediction signal of the decoding target block, using a template generated from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving picture data stored in the storage step. In the decoding step, a decoded difference block that is a difference signal of the decoding target block is generated, and in the reproduced image generation step, the decoded difference block and the prediction block are added in units of pixels to generate a decoded block.
- A moving image decoding method according to the present invention is a moving image decoding method in a moving image decoding apparatus that reproduces encoded data of moving image data into reproduced moving image data in units of blocks, and comprises: a decoding step of decoding encoded data required for reproduction of a decoding target block to be decoded; a reproduced image generation step of generating, from the encoded data decoded in the decoding step, a decoded block that is a reproduction signal of the decoding target block; a storage step of storing reproduced moving image data generated from the reproduction signal; a search step of searching the reproduced moving image data stored in the storage step for a pixel group having a high correlation with a template generated from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored in the storage step; and a prediction signal determination step of determining a prediction block, which is a prediction signal of the decoding target block, from the reproduced moving image data stored in the storage step based on the pixel group searched in the search step and the predetermined positional relationship. In the decoding step, a decoded difference block that is a difference signal of the decoding target block is generated, and in the reproduced image generation step, the decoded difference block and the prediction block are added in units of pixels to generate a decoded block.
- A moving image encoding method according to the present invention is a moving image encoding method in a moving image encoding apparatus that encodes moving image data in units of blocks, and comprises: a dividing step of dividing a frame image constituting the moving image data into a plurality of encoding target blocks as regions to be encoded; an encoding step of encoding the encoding target block; a reproduced image generation step of generating a decoded block that is a reproduction signal of the encoding target block; a storage step of storing reproduced moving image data generated from the reproduction signal; and a prediction signal generation step of generating, by a predetermined method, a prediction block that is a prediction signal of the encoding target block from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored in the storage step. In the encoding step, the difference block, which is the difference signal obtained by subtracting the prediction block from the encoding target block in units of pixels, is reduced by a reduction process that reduces the number of pixels by a predetermined method to generate a reduced difference block having a smaller number of pixels than the difference block, and the reduced difference block is encoded; in the reproduced image generation step, a decoded reduced difference block that is a reproduction signal of the reduced difference block is generated, and a decoded block is generated from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method.
- A moving image decoding method according to the present invention is a moving image decoding method in a moving image decoding apparatus that reproduces encoded data of moving image data into reproduced moving image data in units of blocks, and comprises: a decoding step of decoding encoded data required for reproduction of a decoding target block to be decoded; a reproduced image generation step of generating, from the encoded data decoded in the decoding step, a decoded block that is a reproduction signal of the decoding target block; a storage step of storing reproduced moving image data generated from the reproduction signal; and a prediction signal generation step of generating, by a predetermined method, a prediction block that is a prediction signal of the decoding target block from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored in the storage step. In the decoding step, a decoded reduced difference block having a smaller number of pixels than the difference block that is the difference signal of the decoding target block is generated by decoding the encoded data, and in the reproduced image generation step, a decoded block is generated from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method.
- A moving picture encoding program according to the present invention is a moving picture encoding program for controlling a moving picture encoding apparatus that encodes moving picture data in units of blocks, and causes the moving picture encoding apparatus to function as: dividing means for dividing a frame image constituting the moving picture data into a plurality of encoding target blocks; encoding means for encoding the encoding target block; reproduced image generating means for generating a decoded block that is a reproduction signal of the encoding target block; storage means for storing reproduced moving picture data generated from the reproduction signal; and prediction signal generating means for generating a prediction block, which is a prediction signal of the encoding target block, using a template generated from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving picture data stored by the storage means. The encoding means subtracts the prediction block from the encoding target block in units of pixels to generate a difference block that is a difference signal of the encoding target block, and encodes the difference block; the reproduced image generating means generates a decoded difference block that is a reproduction signal of the difference block encoded by the encoding means, and adds the decoded difference block and the prediction block in units of pixels to generate a decoded block.
- A moving picture encoding program according to the present invention is a moving picture encoding program for controlling a moving picture encoding apparatus that encodes moving picture data in units of blocks, and causes the moving picture encoding apparatus to function as: dividing means for dividing a frame image constituting the moving picture data into a plurality of encoding target blocks; encoding means for encoding the encoding target block; reproduced image generating means for generating a decoded block that is a reproduction signal of the encoding target block; storage means for storing reproduced moving picture data generated from the reproduction signal; search means for searching the reproduced moving picture data stored by the storage means for a pixel group having a high correlation with a template generated from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving picture data stored by the storage means; and prediction signal determining means for determining a prediction block, which is a prediction signal of the encoding target block, from the reproduced moving picture data stored by the storage means based on the pixel group searched by the search means and the predetermined positional relationship. The encoding means subtracts the prediction block from the encoding target block in units of pixels to generate a difference block that is a difference signal of the encoding target block, and encodes the difference block; the reproduced image generating means generates a decoded difference block that is a reproduction signal of the difference block encoded by the encoding means, and adds the decoded difference block and the prediction block in units of pixels to generate a decoded block.
- A moving image decoding program according to the present invention is a moving image decoding program for controlling a moving image decoding apparatus that reproduces encoded data of moving image data into reproduced moving image data in units of blocks.
- The program causes the moving image decoding apparatus to function as: decoding means for decoding encoded data required for reproduction of a decoding target block to be decoded; reproduced image generating means for generating, from the encoded data decoded by the decoding means, a decoded block that is a reproduction signal of the decoding target block; storage means for storing reproduced moving image data generated from the reproduction signal; and prediction signal generating means for generating a prediction block, which is a prediction signal of the decoding target block, using a template generated from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored by the storage means. The decoding means generates a decoded difference block that is a difference signal of the decoding target block, and the reproduced image generating means adds the decoded difference block and the prediction block in units of pixels to generate a decoded block.
- A moving image decoding program according to the present invention is a moving image decoding program for controlling a moving image decoding apparatus that reproduces encoded data of moving image data into reproduced moving image data in units of blocks, and causes the moving image decoding apparatus to function as: decoding means for decoding encoded data required for reproduction of a decoding target block to be decoded; reproduced image generating means for generating, from the encoded data decoded by the decoding means, a decoded block that is a reproduction signal of the decoding target block; storage means for storing reproduced moving image data generated from the reproduction signal; search means for searching the reproduced moving image data stored by the storage means for a pixel group having a high correlation with a template generated from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored by the storage means; and prediction signal determining means for determining a prediction block, which is a prediction signal of the decoding target block, from the reproduced moving image data stored by the storage means based on the pixel group searched by the search means and the predetermined positional relationship. The decoding means generates a decoded difference block that is a difference signal of the decoding target block, and the reproduced image generating means adds the decoded difference block and the prediction block in units of pixels to generate a decoded block.
- A moving picture encoding program according to the present invention is a moving picture encoding program for controlling a moving picture encoding apparatus that encodes moving picture data in units of blocks.
- The program causes the moving picture encoding apparatus to function as: a dividing unit that divides a frame image constituting the moving picture data into a plurality of encoding target blocks as regions to be encoded; an encoding unit that encodes the encoding target block; a reproduced image generation unit that generates a decoded block that is a reproduction signal of the encoding target block; a storage unit that stores reproduced moving picture data generated from the reproduction signal; and a prediction signal generation unit that generates, by a predetermined method, a prediction block that is a prediction signal of the encoding target block from a reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and belongs to the reproduced moving picture data stored by the storage unit. The encoding unit reduces the difference block, which is the difference signal obtained by subtracting the prediction block from the encoding target block in units of pixels, by a reduction process that reduces the number of pixels by a predetermined method to generate a reduced difference block having a smaller number of pixels than the difference block, and encodes the reduced difference block; the reproduced image generation unit generates a decoded reduced difference block that is a reproduction signal of the reduced difference block, and generates a decoded block from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method.
- A moving image decoding program according to the present invention is a moving image decoding program for controlling a moving image decoding apparatus that reproduces encoded data of moving image data into reproduced moving image data in units of blocks, and causes the moving image decoding apparatus to function as: decoding means for decoding encoded data required for reproduction of a decoding target block to be decoded; reproduced image generating means for generating, from the encoded data decoded by the decoding means, a decoded block that is a reproduction signal of the decoding target block; storage means for storing reproduced moving image data generated from the reproduction signal; and prediction signal generation means for generating, by a predetermined method, a prediction block that is a prediction signal of the decoding target block from a reproduction signal that is adjacent to the decoding target block in a predetermined positional relationship and belongs to the reproduced moving image data stored by the storage means. The decoding means decodes the encoded data to generate a decoded reduced difference block having a smaller number of pixels than the difference block that is the difference signal of the decoding target block, and the reproduced image generation means generates a decoded block from the decoded reduced difference block by an enlargement process that increases the number of pixels by a predetermined method.
- A moving image encoding apparatus according to the present invention is an apparatus that encodes moving image data, and comprises: dividing means for dividing a frame image constituting the moving image data into a plurality of regions as regions to be encoded; encoding means for encoding the image of each region divided by the dividing means; reproduced image generating means for generating a reproduced image of the image encoded by the encoding means; storage means for storing the reproduced image generated by the reproduced image generating means; search means for searching the reproduced image stored by the storage means for an image region having a high correlation with the reproduced image of a template region that is adjacent, in a predetermined positional relationship, to the region of the image to be encoded by the encoding means and is a part of the reproduced image stored by the storage means; and prediction signal determining means for determining the prediction signal of the region to be encoded from the reproduced image stored by the storage means, based on the region searched by the search means and the predetermined positional relationship. The encoding means generates a difference signal between the prediction signal determined by the prediction signal determining means and the image of the region to be encoded, and encodes the difference signal.
- In the moving image encoding apparatus according to the present invention, first, an image region having a high correlation with the reproduced image of the template region adjacent, in a predetermined positional relationship, to the region of the image to be encoded is searched for in the reproduced image. Subsequently, based on the searched region and the predetermined positional relationship, the prediction signal of the region to be encoded is determined from the reproduced image. This prediction signal is then used for encoding. That is, according to the moving picture encoding apparatus of the present invention, a prediction signal can be determined without using a motion vector, and efficient encoding is possible.
- A moving picture decoding apparatus according to the present invention is a moving picture decoding apparatus that decodes moving picture data in which a frame image divided into a plurality of regions has been encoded, and comprises: decoding means for decoding the encoded data of each region; reproduced image generating means for generating a reproduced image from the image decoded by the decoding means; storage means for storing the reproduced image generated by the reproduced image generating means; search means for searching the reproduced image stored by the storage means for an image region having a high correlation with the reproduced image of a template region that is adjacent, in a predetermined positional relationship, to the region of the image to be decoded by the decoding means and is a part of the reproduced image stored by the storage means; and prediction signal determining means for determining the prediction signal of the region to be decoded from the reproduced image stored by the storage means, based on the region searched by the search means and the predetermined positional relationship. The reproduced image generating means obtains a reproduced image by generating a sum signal of the prediction signal determined by the prediction signal determining means and the image decoded by the decoding means.
- With the moving picture decoding apparatus according to the present invention, a moving picture can be decoded by determining a prediction signal in the same manner as in the above moving picture encoding apparatus. That is, according to the moving picture decoding apparatus of the present invention, it is possible to correctly decode moving picture data that has been efficiently encoded by the above moving picture encoding apparatus.
- The moving image encoding apparatus preferably further comprises: estimation means for comparing the reproduced image of the template region with the reproduced image stored by the storage means and estimating, based on the comparison result, the spatial continuity of the image of the region to be encoded; and setting means for further dividing the region to be encoded based on the spatial continuity of the image estimated by the estimation means, setting the divided regions as new regions to be encoded, and setting template regions for the new regions to be encoded. According to this configuration, the size of the prediction signal region can be appropriately selected based on the spatial continuity of the reproduced image, so that encoding efficiency is improved even for moving image data in which the amount of motion changes drastically.
- The moving image decoding apparatus preferably further comprises: estimation means for comparing the reproduced image of the template region with the reproduced image stored by the storage means and estimating, based on the comparison result, the spatial continuity of the image of the region to be decoded; and setting means for further dividing the region to be decoded based on the spatial continuity of the image estimated by the estimation means, setting the divided regions as new regions to be decoded, and setting template regions for the new regions to be decoded. According to this configuration, it is possible to correctly decode the moving image data encoded by the above moving image encoding device.
- A moving image encoding method according to the present invention is a moving image encoding method in a moving image encoding apparatus that encodes moving image data, and comprises: a dividing step of dividing a frame image constituting the moving image data into a plurality of regions as regions to be encoded; an encoding step of encoding the image of each region divided in the dividing step; a reproduced image generation step of generating a reproduced image of the image encoded in the encoding step; a storage step of storing the reproduced image generated in the reproduced image generation step; a search step of searching the reproduced image stored in the storage step for an image region having a high correlation with the reproduced image of a template region that is adjacent, in a predetermined positional relationship, to the region of the image to be encoded in the encoding step and is a part of the reproduced image stored in the storage step; and a prediction signal determination step of determining the prediction signal of the region to be encoded from the reproduced image stored in the storage step, based on the region searched in the search step and the predetermined positional relationship. In the encoding step, a difference signal between the prediction signal determined in the prediction signal determination step and the image of the region to be encoded is generated, and the difference signal is encoded.
- A moving image encoding program according to the present invention is a moving image encoding program for controlling a moving image encoding apparatus that encodes moving image data, and causes the moving image encoding apparatus to function as: dividing means for dividing a frame image constituting the moving image data into a plurality of regions as regions to be encoded; encoding means for encoding the image of each region divided by the dividing means; reproduced image generating means for generating a reproduced image of the image encoded by the encoding means; storage means for storing the reproduced image generated by the reproduced image generating means; search means for searching the reproduced image stored by the storage means for an image region having a high correlation with the reproduced image of a template region that is adjacent, in a predetermined positional relationship, to the region of the image to be encoded by the encoding means and is a part of the reproduced image stored by the storage means; and prediction signal determining means for determining the prediction signal of the region to be encoded from the reproduced image stored by the storage means, based on the region searched by the search means and the predetermined positional relationship.
- The encoding means generates a difference signal between the prediction signal determined by the prediction signal determining means and the image of the region to be encoded, and encodes the difference signal.
- A moving image decoding method according to the present invention is a moving image decoding method in a moving image decoding apparatus that decodes moving image data in which a frame image divided into a plurality of regions has been encoded, and comprises: a decoding step of decoding the encoded data of each region; a reproduced image generation step of generating a reproduced image from the image decoded in the decoding step; a storage step of storing the reproduced image generated in the reproduced image generation step; a search step of searching the reproduced image stored in the storage step for an image region having a high correlation with the reproduced image of a template region that is adjacent, in a predetermined positional relationship, to the region of the image to be decoded in the decoding step and is a part of the reproduced image stored in the storage step; and a prediction signal determination step of determining the prediction signal of the region to be decoded from the reproduced image stored in the storage step, based on the region searched in the search step and the predetermined positional relationship. In the reproduced image generation step, a sum signal of the prediction signal determined in the prediction signal determination step and the image decoded in the decoding step is generated to obtain a reproduced image.
- A moving picture decoding program according to the present invention is a moving picture decoding program for controlling a moving picture decoding apparatus that decodes moving picture data in which a frame image divided into a plurality of regions has been encoded.
- The program causes the moving picture decoding apparatus to function as: decoding means for decoding the encoded data of each region; reproduced image generating means for generating a reproduced image from the image decoded by the decoding means; storage means for storing the reproduced image generated by the reproduced image generating means; search means for searching the reproduced image stored by the storage means for an image region having a high correlation with the reproduced image of a template region that is adjacent, in a predetermined positional relationship, to the region of the image to be decoded by the decoding means and is a part of the reproduced image stored by the storage means; and prediction signal determining means for determining the prediction signal of the region to be decoded from the reproduced image stored by the storage means, based on the region searched by the search means and the predetermined positional relationship. The reproduced image generating means produces a reproduced image by generating a sum signal of the prediction signal determined by the prediction signal determining means and the image decoded by the decoding means.
- According to the present invention, a reproduction signal having a high correlation with the template region adjacent, in a predetermined positional relationship, to the region of the image to be encoded is searched for, and a prediction signal is determined based on the searched region and the above positional relationship, so that efficient encoding can be performed without using a motion vector.
- FIG. 1 is a diagram showing a configuration of a moving image encoding device according to a first embodiment of the present invention.
- FIG. 2 is a diagram showing a configuration of a prediction generation unit in the moving picture encoding apparatus.
- FIG. 3 is a diagram showing a positional relationship between a template region and a prediction target region.
- FIG. 4 is a diagram for explaining a detailed operation of determining a prediction signal by template matching.
- FIG. 5 is a flowchart showing processing executed by the video encoding apparatus according to the first embodiment of the present invention.
- FIG. 6 is a diagram showing a configuration of a video decoding device according to the first embodiment of the present invention.
- FIG. 7 is a flowchart showing processing executed by the video decoding device according to the first embodiment of the present invention.
- FIG. 8 is a diagram showing a configuration of a prediction generation unit in the second embodiment.
- FIG. 9 is a diagram showing prediction target areas divided in the second embodiment.
- FIG. 10 is a flowchart showing processing executed by the video encoding apparatus according to the second embodiment of the present invention.
- FIG. 11 is a flowchart showing processing executed by the video decoding device according to the second embodiment of the present invention.
- FIG. 12 is a diagram illustrating an encoding order.
- FIG. 13 is a diagram illustrating an example of the positional relationship between a template region and a prediction target region according to the encoding order.
- FIG. 14 is a diagram for explaining a function of a determination unit in the second embodiment.
- FIG. 15 is a diagram for explaining a function of a determination unit in the second embodiment.
- FIG. 16 is a diagram showing a configuration of a moving picture encoding program according to an embodiment of the present invention.
- FIG. 17 is a diagram showing a configuration of a video decoding program according to an embodiment of the present invention.
- FIG. 18 is a diagram showing a configuration of a modified example of the video encoding device according to the first embodiment.
- FIG. 19 is a diagram illustrating a configuration of a modified example of the video decoding device according to the first embodiment.
- FIG. 20 is a diagram showing a configuration of a modified example of the prediction generation unit according to the first embodiment.
- FIG. 22 is a diagram illustrating a configuration of a video decoding device according to the third embodiment of the present invention.
- FIG. 23 is a diagram showing block reduction / enlargement processing in the third embodiment.
- FIG. 24 is a flowchart showing a process executed by the video encoding apparatus according to the third embodiment of the present invention.
- FIG. 25 is a flowchart showing processing executed by the video decoding device according to the third embodiment of the present invention.
- FIG. 26 is a diagram showing a configuration of a modified example of the video encoding device according to the third embodiment.
- FIG. 27 is a diagram showing a configuration of a modified example of the video decoding device according to the third embodiment.
- FIG. 28 is a diagram showing another example of block reduction / enlargement processing in the third embodiment.
- FIG. 29 is a diagram showing another example of block reduction / enlargement processing in the third embodiment.
- FIG. 30 is a diagram showing a configuration of a modified example of the video encoding device according to the third embodiment.
- FIG. 31 is a diagram showing a configuration of a modified example of the video decoding device according to the third embodiment.
- FIG. 32 is a diagram showing a configuration of a modified example of the video encoding device according to the third embodiment.
- FIG. 33 is a diagram showing a configuration of a modified example of the video decoding device according to the third embodiment.
- FIG. 34 is a diagram showing a configuration of a modified example of the video encoding device according to the third embodiment.
- FIG. 35 is a diagram illustrating a configuration of a modified example of the video decoding device according to the third embodiment.
- FIG. 36 is a diagram showing a configuration of a modified example of the prediction generation unit according to the third embodiment.
- FIG. 37 is a diagram showing an example of prediction processing in the third embodiment.
- ... Video encoding device; 600, 1900, 2200, 2700, 3100, 3300, 3500 ... Video decoding device; 101 ... Dividing unit; 102, 2102 ... Subtracting unit; 103, 2103 ... Transform unit; 104, 2104, 3004 ... Encoding unit; 105, 602, 2105, 2202 ...
- FIG. 1 shows a moving picture coding apparatus 100 according to the first embodiment.
- The moving image encoding apparatus 100 is an apparatus that encodes moving image data in units of blocks.
- The moving image encoding apparatus 100 receives the frame images constituting the moving image data as input and encodes them sequentially, thereby encoding the moving image data.
- The moving image encoding apparatus 100 is realized by hardware such as an information processing apparatus including a CPU (Central Processing Unit), a frame memory, and a hard disk.
- The moving image encoding apparatus 100 realizes the functional components described below by operating these hardware components.
- The moving image encoding apparatus 100 includes an area dividing unit 101, a subtracting unit 102, a transform unit 103, an encoding unit 104, an inverse transform unit 105, an adding unit 106, a storage unit 107, and a prediction generation unit 108.
- The area dividing unit 101 is dividing means that divides a frame image constituting the input moving image data into a plurality of areas as areas to be encoded. That is, the area dividing unit 101 is dividing means that divides the frame image constituting the input moving image data into a plurality of encoding target blocks. Specifically, the area dividing unit 101 divides the frame image into blocks of a predetermined size (for example, 8 pixels x 8 pixels), which serve as the encoding target blocks.
- The divided original images are output in the order in which they are encoded and are input to the subtracting unit 102.
- The output order may be the raster scan order, which proceeds from the upper left to the lower right of the frame image, or it may be a zigzag order in which scanning proceeds to the right end of a row and then, one row lower, from the right end back to the left end, alternating direction row by row. Note that the output order is stored in advance in the area dividing unit 101.
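- For illustration (these helpers are not part of the patent), the two block output orders mentioned here can be generated as follows for a frame divided into blocks_high x blocks_wide blocks:

```python
def raster_order(blocks_high, blocks_wide):
    # Upper-left to lower-right, every row scanned left to right.
    return [(r, c) for r in range(blocks_high) for c in range(blocks_wide)]

def zigzag_order(blocks_high, blocks_wide):
    # Alternate the scan direction row by row: left-to-right, then right-to-left.
    order = []
    for r in range(blocks_high):
        cols = range(blocks_wide) if r % 2 == 0 else range(blocks_wide - 1, -1, -1)
        order.extend((r, c) for c in cols)
    return order
```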
- The subtracting unit 102 is a component of the encoding means that generates and outputs a difference signal between the original signal of the encoding target block and a prediction signal described later.
- The difference signal is generated by subtracting, in units of pixels, the prediction signal of the encoding target block output from the prediction generation unit 108 from the original signal of the encoding target block output from the area dividing unit 101.
- The output difference signal is the signal to be encoded, and is input to the transform unit 103 for encoding.
- The transform unit 103 is transform means that transforms the difference signal input from the subtracting unit 102 based on a predetermined transform method and outputs transform coefficient data.
- As the predetermined transform method, for example, an orthogonal transform represented by the discrete cosine transform (DCT) can be used. Relational expressions for the transform and the like are stored in the transform unit 103 in advance. This transform may be reversible or irreversible, and is performed to make the subsequent encoding more efficient.
- The output transform coefficient data is input to the encoding unit 104 and the inverse transform unit 105. For information compression, the coefficients are quantized after the orthogonal transform.
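- As an aside (an illustration, not the normative transform of this invention), an orthogonal transform such as the DCT followed by quantization of the coefficients can be sketched as below; the quantization step size is an arbitrary assumption:

```python
import numpy as np
from scipy.fftpack import dct, idct

def forward_transform_and_quantize(difference_block, q_step=16.0):
    # 2-D type-II DCT (orthonormal) applied to the difference block.
    coeffs = dct(dct(difference_block, axis=0, norm="ortho"), axis=1, norm="ortho")
    # Quantization compresses the information but makes the transform irreversible.
    return np.round(coeffs / q_step).astype(int)

def dequantize_and_inverse_transform(levels, q_step=16.0):
    coeffs = levels.astype(float) * q_step
    return idct(idct(coeffs, axis=1, norm="ortho"), axis=0, norm="ortho")
```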
- The encoding unit 104 is a component of an encoding means that performs entropy encoding on the transform coefficient data input from the transform unit 103.
- the encoded data is output from the moving image encoding apparatus 100.
- As the entropy coding method, a variable-length coding method such as Huffman coding or an arithmetic coding method represented by CABAC (Context-based Adaptive Binary Arithmetic Coding) can be used. In either case, the amount of information can be compressed by adapting the coding to the bias in the occurrence probabilities of the transform coefficient data.
- the inverse conversion unit 105 is a component of a reproduction image generation unit that generates a differential signal used for generation of a reproduction image.
- The difference signal is generated by performing, on the transform coefficient data input from the conversion unit 103, the inverse of the conversion process performed in the conversion unit 103. Relational expressions for the inverse conversion are stored in advance in the inverse conversion unit 105. The difference signal generated by the inverse conversion unit 105 is input to the addition unit 106.
- The adding unit 106 is a constituent element of a reproduction image generation means that generates a reproduction signal as a sum signal by adding the prediction signal output from the prediction generation unit 108 described later (the same prediction signal that is input to the subtraction unit 102) and the difference signal generated by the inverse conversion unit 105.
- the reproduction signal constitutes a reproduction image.
- the reproduction signal generated by the adding unit 106 is the same as the reproduction signal generated in the decoding device.
- the reproduction signal generated by the adding unit 106 is input to the storage unit 107.
- The storage unit 107 stores the reproduction signal input from the addition unit 106 as reproduced moving image data in a storage device, such as a frame memory, provided in the moving image encoding device 100. All reproduction signals are stored until the encoding of the entire moving image data is completed, so the reproduced moving image data is accumulated sequentially.
- the prediction generation unit 108 is a feature of the present invention.
- The prediction generation unit 108 reads out the reproduced image stored in the storage unit 107 and generates a prediction signal of the prediction target (encoding target) block based on that reproduction signal.
- Figure 2 shows functional blocks that further refine the functions of the prediction generation unit 108.
- the prediction generation unit 108 includes a template region determination unit 201, a matching unit 202, and a compensation unit 203.
- The template region determination unit 201 is a component of a search means that determines, based on the input from the storage unit 107, the template region used to generate the prediction signal and the signal (template) of that region. That is, the template area determination unit 201 is a component of a search means that generates a template from the reproduction signal that is adjacent to the encoding target block in a predetermined positional relationship and that belongs to the reproduced moving image data stored in the storage unit 107. The template region determination unit 201 is also a component of a prediction signal generation means that generates a prediction block, that is, the prediction signal of the encoding target block, using the template. As shown in the figure,
- the template area 301 is a pixel area that is adjacent to the encoding target area 302 in a predetermined positional relationship and that belongs to the reproduced image stored in the storage unit 107, that is, an area composed of reproduction signals of already reproduced moving image data.
- For the template region, a pixel group of a predetermined size is used that belongs to an already reproduced area of the same frame as the prediction target block stored in the storage unit 107 and that is located spatially adjacent to the prediction target block. For this reason, the position of the template region depends on the encoding order of the blocks (the order in which they are output from the area dividing unit 101 and encoded).
- the template region determination unit 201 stores in advance conditions as described below for determining the template region.
- FIG. 13 shows an example of the positional relationship between the template area and the prediction target area.
- the template region 1301 is a region located on the left and upper side of the prediction target block 1302.
- regions 1303 located on the left and upper side of the prediction target block 1302 are regions where reproduced images are accumulated in the storage unit 107.
- When the encoding order is the zigzag order, the position of the template area can change with the progress of the encoding. For example, the template region 1304 is located on the right and upper side of the prediction target block 1305 while the scan proceeds from right to left, and on the left and upper side while the scan proceeds from left to right.
- The matching section 202 performs template matching over the reproduced image stored in the storage section 107 as a search area, using the reproduction signal of the template area determined by the template area determination section 201, and searches the search area for the region having the highest correlation with the pixel group of the template region.
- the matching unit 202 is also a constituent element of a prediction signal generation unit that generates a prediction block that is a prediction signal of the encoding target block using a template. Template matching will be described in more detail later.
- The compensation unit 203 is a prediction signal determination means that sets and determines, from the reproduced image, a prediction signal of the same size as the prediction target block. In other words, the compensation unit 203 is a prediction signal determination means that determines the prediction block, that is, the prediction signal of the encoding target block, from the reproduced moving image data stored in the storage unit 107, based on the pixel group searched by the matching unit 202 and on a positional relationship. The compensation unit 203 is also a constituent element of a prediction signal generation means that generates the prediction block using the template.
- The positional relationship between the searched highly correlated region and the image region used as the prediction signal is the same as the positional relationship between the template region and the prediction target block. For example, if the block encoding order is the raster scan order, the region adjacent to the right and lower sides of the highly correlated region becomes the region of the prediction signal.
- the determined prediction signal is input as an output from the prediction generation unit 108 to the subtraction unit 102 and the addition unit 106.
- The matching unit 202 performs template matching, searching within the search ranges 403 and 404 for a portion similar to the image of the template region 401.
- the search range includes a reproduced pixel region 403 in a frame including the template region 401 and the prediction target region 402 (prediction target frame), and an image 404 of the other reproduced frame.
- the correlation between the signal of the template region 401 and the signal of the pixel group having the same shape as the template region 401 at an arbitrary location within the search range is measured.
- As the index value indicating the correlation, SAD (sum of absolute differences), which is the sum of the absolute values of the difference signal, MSE (mean square error), which is the mean square error of the difference signal, or the like can be used. Correlation index values are obtained for all candidate pixel groups within the search range, and information (an address) indicating the pixel group with the smallest index value (excluding the image of the template area 401 itself) is output as the search result.
- The compensation unit 203 sets the pixel group adjacent to the highly correlated region found by template matching as the prediction signal. Where the correlation with the template region is high, the region adjacent to that location is also likely to have a high correlation with the prediction target block, which is adjacent to the template region. For this reason, prediction by this method is well founded.
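- To make the template matching concrete, the following hedged sketch searches a reproduced image for the position whose pixel group minimizes the SAD against the template. For brevity it uses a rectangular template and a square search window; in the embodiment the template is the L-shaped reproduced region adjacent to the prediction target block, and the search range may span the current and other reproduced frames.

```python
import numpy as np

def sad(a: np.ndarray, b: np.ndarray) -> float:
    """Sum of absolute differences between two pixel groups."""
    return float(np.abs(a.astype(np.int64) - b.astype(np.int64)).sum())

def template_match(reproduced: np.ndarray, template: np.ndarray,
                   tpl_top: int, tpl_left: int, search: int = 16):
    """Return the top-left position of the pixel group in `reproduced` with the
    smallest SAD against `template`, excluding the template's own position."""
    th, tw = template.shape
    best_cost, best_pos = None, (tpl_top, tpl_left)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            if dy == 0 and dx == 0:
                continue  # exclude the template region itself
            y, x = tpl_top + dy, tpl_left + dx
            if y < 0 or x < 0 or y + th > reproduced.shape[0] or x + tw > reproduced.shape[1]:
                continue
            cost = sad(reproduced[y:y + th, x:x + tw], template)
            if best_cost is None or cost < best_cost:
                best_cost, best_pos = cost, (y, x)
    return best_pos

# The prediction block is then taken from the reproduced image at the same offset
# from the found position as the prediction target block has from the template.
```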
- When moving image data to be encoded is input to the moving image encoding apparatus 100, each frame image constituting the moving image is input to the region dividing unit 101.
- the input frame image is divided into a plurality of blocks of a predetermined size by the area dividing unit 101 (S501, dividing step). All subsequent processing is performed in units of blocks.
- the block is input to the subtraction unit 102 as an image of the region to be encoded.
- a prediction signal for the block to be encoded is generated by the prediction generation unit 108 as follows.
- The prediction generation unit 108 is notified of the encoding order of the blocks by a controller (not shown) that supervises the encoding process in the moving picture coding apparatus 100.
- Alternatively, the prediction generation unit 108 may store the encoding order of the blocks in advance.
- the template area determination unit 201 determines a template area on the reproduced image adjacent to the block (S502, search step).
- the matching unit 202 performs template matching on the playback image in the same and different frames as the encoding target block, and searches for a region having a high correlation with the playback signal of the template region (S503, Search step).
- Next, the compensation unit 203 sets, as the prediction signal, a region of the same size as the encoding target block that is adjacent, in a predetermined positional relationship (to the right and below in the example of FIG. 4), to the highly correlated region obtained by template matching (S504, prediction signal determination step).
- the set prediction signal is input to the subtraction unit 102 and the addition unit 106.
- The subtraction unit 102 generates a difference signal by subtracting, in units of pixels, the prediction signal input from the prediction generation unit 108 (compensation unit 203) from the original image input from the region division unit 101 (S505, encoding step).
- the generated difference signal is input to the conversion unit 103 and converted by the conversion unit 103 (S506).
- the converted difference signal is input to the encoding unit 104 and the inverse conversion unit 105.
- The converted differential signal input to the encoding unit 104 is entropy encoded by the encoding unit 104 to generate compressed encoded data (S507, encoding step).
- the converted differential signal input to the inverse transform unit 105 is subjected to inverse transform by the inverse transform unit 105.
- a differential signal after inverse transformation is generated (S508, reproduction image generation step).
- The difference signal after the inverse transformation is input to the addition unit 106, and the addition unit 106 adds to it the prediction signal input from the prediction generation unit 108 (compensation unit 203) to form a sum signal, thereby generating a reproduction signal (S509, reproduction image generation step).
- The generated reproduction signal is input to the storage unit 107 and is stored at a predetermined address, corresponding to the frame being encoded, in the frame memory or the like (S510, storage step).
- The reproduction signal stored in the storage unit 107 is based on the difference signal that was converted by the conversion unit 103 and then inversely converted by the inverse conversion unit 105. This is because the encoding by the encoding unit 104 is assumed to be reversible; by the above processing, a reproduced image identical to the one reproduced in the decoding device is obtained.
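- The per-block encoding steps S505 to S510 can be summarized by the following hedged sketch; the DCT, quantization, and entropy coding of the embodiment are replaced here by a simple uniform quantization of the raw difference, so `qstep` and that simplification are assumptions for illustration only.

```python
import numpy as np

def encode_block(original_block: np.ndarray, prediction_block: np.ndarray, qstep: float = 8.0):
    """One pass of S505-S510 for a single block (transform and entropy coding simplified)."""
    diff = original_block.astype(np.int64) - prediction_block.astype(np.int64)  # S505: difference signal
    quantized = np.round(diff / qstep)                                          # S506: stand-in for transform + quantization
    # S507: entropy-code `quantized` here (omitted)
    reconstructed_diff = quantized * qstep                                      # S508: inverse of the stand-in
    reproduction = prediction_block + reconstructed_diff                        # S509: sum signal (reproduction)
    return quantized, reproduction                                              # reproduction is stored (S510)
```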
- In the above description, prediction is performed only by compensation based on template matching, but other processing may also be included.
- For example, prediction based on motion vectors may be used in combination. In that case, for example, by adding to the header of the block information an indication of whether the prediction method is based on this scheme or on motion vectors, together with the motion vector value, the two methods can be used efficiently and distinguished from each other.
- As described above, according to the moving image encoding apparatus 100 of the present embodiment, inter-frame prediction can be performed using already encoded (reproduced) images without using motion vectors, enabling efficient encoding. That is, the data encoded by the moving image encoding apparatus 100 of the present embodiment is substantially the result of encoding only the converted differential signal, and compared with the conventional encoding method, the motion vectors are eliminated. In addition, since only the area that has already been reproduced at that point is used when determining the prediction signal, encoding can always proceed in the scanning order of the moving image data.
- FIG. 6 shows a moving picture decoding apparatus 600 according to this embodiment.
- the video decoding device 600 is a device that decodes the video data encoded by the video encoding device 100 and generates reproduced video data.
- the moving image decoding apparatus 600 is realized by hardware such as an information processing apparatus including a CPU (Central Processing Unit), a frame memory, a hard disk, and the like.
- the moving image decoding apparatus 600 realizes functional components described below by operating the above hardware components.
- The moving picture decoding apparatus 600 includes a decoding section 601, an inverse transform section 602, an addition section 603, a storage section 604, and a prediction generation section 605.
- The function of each part is described below.
- Decoding section 601 is a decoding means for decoding input compressed encoded data.
- The compressed encoded data has been encoded by the moving image encoding apparatus 100 according to the present embodiment, with the frame image divided into a plurality of area signals (encoding target blocks, which here become decoding target blocks) and encoded.
- the decoding scheme in the decoding unit 601 corresponds to the entropy coding scheme used by the moving picture coding apparatus 100, and information for decoding is stored in advance by the decoding unit 601. Also, decoding and output in the decoding unit 601 are performed in encoded units (block units), and are performed in the encoded order.
- The decoded data is input to the inverse transform unit 602.
- The inverse transform unit 602 is a component of a reproduction image generation means that generates the difference signal used to generate the reproduced image by performing, on the data input from the decoding unit 601, the inverse of the transform process performed by the moving image encoding device 100.
- the inverse transform unit 602 corresponds to the inverse transform unit 105 in the moving image encoding device 100.
- Relational expressions and the like for the inverse transformation are stored in the inverse conversion unit 602 in advance.
- the difference signal generated in the inverse conversion unit 602 is input to the addition unit 603.
- The addition unit 603 is a component of a reproduction image generation means that adds the prediction signal output from the prediction generation unit 605 described later and the difference signal generated by the inverse transformation unit 602 to generate a sum signal, which is the reproduction signal.
- The adder 603 corresponds to the adder 106 in the moving image encoding apparatus 100.
- The reproduction signal generated by the adding unit 603 is input to the storage unit 604 and is output from the video decoding device 600.
- The storage unit 604 stores the reproduction signal, that is, the decoded block (the decoded encoding target block) input from the adder 603, as reproduced moving image data in a storage device, such as a frame memory, provided in the video decoding device 600.
- the storage unit 604 corresponds to the storage unit 107 in the video encoding device 100. All decoding blocks are stored until all decoding of moving image data is completed. In this way, the reproduced moving image data is sequentially accumulated.
- the prediction generation unit 605 reads out the reproduced image stored in the storage unit 604, and generates a prediction signal of a prediction target (decoding target) block based on the reproduced image.
- the prediction generation unit 605 corresponds to the prediction generation unit 108 in the video encoding device 100 and has the same function, and thus the description thereof is omitted here.
- First, decoding is performed by the decoding unit 601 (S701, decoding step).
- By this decoding, the converted data in units of blocks is extracted.
- This converted data is input to the inverse conversion unit 602 by the decoding unit 601.
- Position information within the frame of the decoding target block is input to the prediction generation unit 605 from a controller (not shown) that supervises the decoding process in the moving image decoding apparatus 600. Note that the position of the block to be decoded depends on the encoding order.
- The prediction generation unit 605 generates a prediction signal of the decoding target block as follows. First, a template area is set on the reproduced image adjacent to the block by the template area determining unit 201 (S702, search step). Next, the matching unit 202 performs template matching on the reproduced images in the same frame as, and in frames different from, the decoding target block, and searches for an area having a high correlation with the reproduction signal of the template area (S703, search step). Next, the compensation unit 203 sets, as the prediction signal, a region of the same size as the decoding target block that is adjacent, in a predetermined positional relationship (to the right and below in the example of FIG. 4), to the highly correlated region obtained by template matching (S704, prediction signal determination step). The set prediction signal is input to the adding unit 603.
- the conversion data input from the decoding unit 601 is inversely converted by the inverse conversion unit 602 to generate a differential signal (S705, reproduction image generation step).
- Note that the series of processing from S702 to S704 and the processing of S705 only need to be performed before the processing from S706 onwards described below, so their order may be reversed.
- The difference signal input from the inverse transformation unit 602 and the prediction signal input from the prediction generation unit 605 are added by the addition unit 603, and a sum signal, which is the reproduction signal, is generated (S706, reproduction image generation step).
- the generated decoded block is input to the storage unit 604, and stored in the storage unit 604 at a predetermined address corresponding to the frame to be decoded in the frame memory or the like (S707, storage step).
- According to the moving picture decoding apparatus 600 of the present embodiment, a prediction signal can be determined and a moving picture decoded in the same way as in the moving picture encoding apparatus 100. That is, according to the moving image decoding apparatus 600 of the present embodiment, the moving image data efficiently encoded by the moving image encoding apparatus 100 can be correctly decoded to generate a reproduced image.
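- Mirroring the encoding sketch above, the decoding steps S705 to S707 for one block can be illustrated as follows; the same simplifying assumptions (uniform quantization as a stand-in for the transform, assumed `qstep`) apply.

```python
import numpy as np

def decode_block(quantized: np.ndarray, prediction_block: np.ndarray, qstep: float = 8.0):
    """One pass of S705-S707 for a single block."""
    reconstructed_diff = quantized * qstep                  # S705: inverse of the stand-in quantization
    reproduction = prediction_block + reconstructed_diff    # S706: sum signal (reproduction)
    return reproduction                                      # stored as the decoded block (S707)
```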
- the following modifications can be considered. The following modifications are described with respect to the moving image encoding device and the moving image decoding device, but the same can be applied to the moving image encoding process and the moving image decoding process.
- In the first embodiment, the search range targeted by template matching consists of the already reproduced area 403 within the frame to be encoded and the reproduced image 404 of an already reproduced frame.
- If the search range is limited to the reproduced region 403, the present invention can also be applied to an intra frame in which prediction uses only the reproduced signal within the frame.
- If the search range is limited to the reproduced image 404, there is an effect that the amount of calculation when decoding an inter frame, which involves inter-frame prediction, can be reduced.
- A method in which a prediction mode using the reproduced region 403 as the search target of template matching (intra template matching prediction) and a prediction mode using the reproduced image 404 as the search target (inter template matching prediction) are prepared and selected between in appropriate units is also effective, because it reduces the amount of computation during decoding.
- A reproduced image area within the frame to be encoded may also be included in the search range.
- Hereinafter, the reproduced image of a reproduced frame that is the target of template matching and the reproduced image area of the encoding target frame are collectively referred to as the reference image of a reference frame.
- The reference image may be a high-resolution image containing not only integer pixels but also fractional-accuracy pixels generated by filter processing. A method for generating such fractional-accuracy pixels is described in Non-Patent Document 1, for example.
- The reference frame number may be selected in units of blocks or in units of frames, and may also be selected without additional information by using an index value such as SAD.
- In that case, the original signal of the prediction target area (within the encoding target block) is compared with the prediction signals of the prediction target area generated from the reference images of a plurality of reference frames, and one reference frame is selected.
- This encoding process can be realized by adding a selection unit 109 between the prediction generation unit 108 and the subtraction unit 102 of the video encoding apparatus 100, as shown in FIG. 18.
- The selection unit 109 calculates an index value (SAD, MSE, etc.) between each prediction signal generated by the prediction generation unit 108 for a plurality of reference frames and the original signal of the encoding target block, and selects the reference frame with the smallest index value.
- The selected reference frame number is entropy-coded by the encoding unit 104. Note that the same result is obtained even if the processing of the selection unit 109 is included in the prediction generation unit 108, so this modification can also be implemented with that configuration.
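- The reference frame selection in the selection unit 109 can be sketched as follows; the candidate predictions would come from the prediction generation unit 108, and SAD is used here as the index value (MSE or another measure could equally be used).

```python
import numpy as np

def select_reference_frame(original_block: np.ndarray, candidate_predictions) -> int:
    """Return the index of the reference frame whose prediction minimizes the SAD
    against the original signal of the encoding target block."""
    best_idx, best_cost = 0, None
    for idx, pred in enumerate(candidate_predictions):
        cost = np.abs(original_block.astype(np.int64) - pred.astype(np.int64)).sum()
        if best_cost is None or cost < best_cost:
            best_idx, best_cost = idx, cost
    return best_idx  # this reference frame number is entropy-coded
```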
- On the decoding side, the prediction generation unit 606 generates the prediction signal using the reference frame corresponding to the reference frame number decoded by the decoding unit 601.
- Alternatively, two reference frames may be selected, and the prediction signals of the encoding target block obtained from the selected reference frames may be averaged pixel by pixel to calculate the final prediction signal (averaging process).
- The selection candidates for the prediction signal of the encoding target block may also be taken from the same reference frame. Selecting two prediction signals from the same reference frame at positions shifted by one pixel (or 1/2 pixel or 1/4 pixel) also has the effect of increasing, by interpolation, the accuracy of the motion to be searched. Since the resulting smoothing suppresses noise components in the prediction error signal, it generally has good compatibility with transform coding.
- The final prediction signal of the prediction target area may also be calculated not by simple averaging but by weighted averaging for each pixel (weighted averaging process).
- The method for setting the weighting coefficients and the method for encoding them are not particularly limited, but, for example, the method described in Non-Patent Document 1 can be applied.
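- A minimal sketch of the (weighted) averaging of two candidate prediction signals follows; the weight value is an assumption, and a weight of 0.5 reproduces the plain averaging process.

```python
import numpy as np

def combine_predictions(pred_a: np.ndarray, pred_b: np.ndarray, weight_a: float = 0.5) -> np.ndarray:
    """Pixel-wise weighted average of two candidate prediction signals."""
    return np.round(weight_a * pred_a + (1.0 - weight_a) * pred_b)
```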
- The averaging process, the weighted averaging process, and a median prediction process can be realized by a prediction generation unit 1108, shown in Fig. 20, in which a signal generation unit 204 is added to the prediction generation unit 108 shown in Fig. 2 (this applies to the prediction generation unit 108 in FIG. 1 and the prediction generation unit 605 in FIG. 6).
- the signal generation unit 204 receives a prediction signal of a prediction target region generated from a plurality of frames, and generates a final prediction signal by the processing method described above.
- The processing of the signal generation unit 204 can be implemented using information derived from the reproduced image and from feature data belonging to the reproduced image (such as motion vectors), that is, data related to the reproduced image.
- A method is also conceivable in which a plurality of processing methods are prepared, such as a one-frame selection process, the averaging process, the weighted averaging process, and the median prediction process, and the processing method is selected in units of blocks or frames.
- In prediction processing using a template composed of decoded values, there is no guarantee that the motion yielding the optimal index value will minimize the prediction error signal. It is therefore effective to select an appropriate method from multiple processing methods whose prediction signals in the prediction target region have different characteristics.
- As a method for selecting the processing method, first, a method can be considered in which the encoding side (the selection unit 109 in Fig. 18) selects the method that minimizes the sum of absolute prediction errors (or the sum of squared prediction errors) of the prediction target region and transmits the selection to the decoding side.
- This selection method can be realized by replacing the prediction generation unit 108 in FIG. 18 and the prediction generation unit 606 in FIG. 19 with the prediction generation unit 1108 in FIG. 20.
- In this case, the selection unit 109 in FIG. 18 outputs information on the selected processing method to the encoding unit 104 instead of the selected reference frame number. Since the same result is obtained in a configuration in which the processing of the selection unit 109 is included in the prediction generation unit 1108, that configuration can also be used.
- Another conceivable method is to generate prediction signals for the template region and automatically select the processing method from index values (SAD, MSE, etc.) calculated from those template-region prediction signals.
- For example, in the averaging process, two prediction signal candidates for the template area are averaged pixel by pixel to calculate a prediction signal, and an index value is calculated between that prediction signal and the reproduction signal of the template area; the processing method giving the smallest index value can then be selected.
- With this selection method, the processing method can be uniquely determined using information derived from the reproduced image and from feature data (such as motion vectors) belonging to the reproduced image, so there is no need to encode information about the processing method.
- For example, a method such as the following is conceivable: the averaging process is used when both TaV and EvV are smaller than their thresholds, the weighted averaging process when only TaV is smaller than its threshold, the median prediction process when only EvV is smaller than its threshold, and the one-frame selection process when both TaV and EvV are larger than their thresholds. In this case, the strength of the spatial features of the template is evaluated by the variance of the reproduction signal in the template region (TaV), and the strength of the temporal features of the template region is evaluated by the variance of the index values (EvV).
- Such an automatic selection method can be realized by replacing the prediction generation unit 108 in FIG. 18 with the prediction generation unit 1108 in FIG. 20 and introducing the selection method described above into the selection unit 109 in FIG. 18. With this selection method as well, the processing method can be uniquely determined using information derived from the reproduced image and from the feature data (variance values, etc.) belonging to the reproduced image, so there is no need to encode information about the processing method. Therefore, the output from the selection unit 109 to the encoding unit 104 can be omitted.
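- The threshold-based automatic selection just described can be sketched as below; the threshold values are purely illustrative assumptions, and the mapping of TaV and EvV to the variances follows the description above.

```python
import numpy as np

def choose_processing(template_signal: np.ndarray, index_values: np.ndarray,
                      tav_threshold: float = 100.0, evv_threshold: float = 50.0) -> str:
    """Select a prediction combination method from the variance of the template
    reproduction signal (TaV, spatial feature) and the variance of the matching
    index values (EvV, temporal feature)."""
    tav = float(np.var(template_signal))
    evv = float(np.var(index_values))
    if tav < tav_threshold and evv < evv_threshold:
        return "averaging"
    if tav < tav_threshold:
        return "weighted_averaging"
    if evv < evv_threshold:
        return "median_prediction"
    return "one_frame_selection"
```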
- To apply this selection method, the input to the selection unit 109 must be changed from the original signal of the encoding target block to the reproduction signal of the template region, and the index values of the multiple reference frames must be added to the input from the prediction generation unit 1108 to the selection unit 109. Note that the same result is obtained with a configuration in which the prediction generation unit 1108 includes the processing of the selection unit 109, so that configuration can also be used.
- The processing on the decoding side can be realized by replacing the prediction generation unit 605 in FIG. 6 with a combination of the prediction generation unit 1108 in FIG. 20 and the selection unit 109 in FIG. 18.
- The automatic selection method is not limited to the methods described here; any method that uses only information derived from the reproduced image or from feature data belonging to the reproduced image can be used. [0109] (3) Configuration of the prediction generation unit
- the prediction generation unit 108 is configured by the template region determination unit 201, the matching unit 202, and the compensation unit 203.
- The present invention is not limited to this configuration. For example, if the reproduction signal of the template area is obtained directly from the reproduction signal of the encoding target frame according to a predetermined procedure, the template area determination unit 201 is unnecessary. Further, when the matching unit 202 acquires the prediction signal of the template region from the reference frame, the prediction signal of the prediction target region can be acquired at the same time, so the prediction signal can be generated without the compensation unit.
- In the above description, the size of the encoding target block is 8 pixels × 8 pixels, but since the present invention can be implemented with other block sizes, it is not limited to this size.
- Similarly, the size of the template area is not limited. For example, for an 8 pixel × 8 pixel block, the template area and the prediction target area may together form a 12 pixel × 12 pixel region, or a 10 pixel × 10 pixel region. It is also effective to change the size of the encoding target block and the size of the template area in units of blocks or frames. As shown in Non-Patent Document 1, preparing sets of encoding target blocks and template areas of different sizes is effective because different patterns in an image can be handled. In addition, when intra template matching prediction and inter template matching prediction are considered, the prediction efficiency can be expected to improve by reducing the block size for intra template matching prediction, where the redundancy between the template region and the search range is generally low.
- Furthermore, the prediction of the present invention can be performed with a block size different from those of the prediction target block and the encoding target block.
- In the above description, the reproduction signal of the template area is composed of reproduced pixels of the target frame and of other reference frames, which are known to the decoding side. However, the following configuration is also possible.
- For example, consider the case where the prediction target area of an 8 pixel × 8 pixel block is divided into 4 pixel × 4 pixel blocks and a prediction signal is generated in units of 4 × 4 blocks. If the template area and the prediction target area are combined into a 6 pixel × 6 pixel block, the template area of the 4 × 4 block at the upper left of the 8 × 8 block can be composed of the reproduced pixels of the encoding target frame.
- For the next 4 × 4 block, the 6 × 2 pixels above the block can be composed of the reproduced pixels of the target frame, but the 2 × 2 pixels to the left of the block have not yet been encoded, so the prediction signal of the upper-left 4 × 4 block is used in their place.
- Likewise, when reproduced pixels of the encoding target frame are not available for all of the pixels in the template area, the prediction signal is used instead.
- In the above description, the reproduction signal of the template area is configured by acquiring the reproduced pixels of the blocks adjacent to the encoding target block. However, the reproduction signal of the template area may also be generated by applying a filter or the like for removing noise to the reproduced signal. For example, in an image with a lot of noise, motion detection that is not affected by the noise can be performed by filtering the reproduction signal of the template region and the reference image.
- The index value used to generate the prediction signal of the target area by template matching is not limited to the sum of absolute differences (SAD) or the mean square error (MSE) between the prediction signal and the target signal of the template area.
- For example, a value that takes into account the magnitude of the differential motion vector is also applicable as the index value of the present invention.
- The index value may also be a weighted sum of absolute differences in which the weight of the absolute difference is, for example, 4 at the pixels on the boundary between the template area and the prediction area and decreases to 3, 2, 1, and so on with increasing distance from the boundary.
- By giving priority in this way to the pixels close to the boundary, the prediction performance can be improved.
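- A hedged sketch of such a boundary-weighted SAD follows; `dist_from_boundary` is an assumed per-pixel distance map supplied by the caller, and the weights 4, 3, 2, 1 follow the example above.

```python
import numpy as np

def weighted_sad(template: np.ndarray, candidate: np.ndarray, dist_from_boundary: np.ndarray) -> float:
    """SAD whose per-pixel weights decrease with distance from the
    template/prediction boundary (4, 3, 2, 1, ...)."""
    weights = np.maximum(4 - dist_from_boundary, 1)
    diff = np.abs(template.astype(np.int64) - candidate.astype(np.int64))
    return float((weights * diff).sum())
```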
- In the above description, the input of the inverse transform unit 105 is the output from the transform unit 103, but the output from the encoding unit 104 may also be used.
- In that case, the processing of the decoding unit 601 in FIG. 6 is performed before the processing of the inverse transform unit 105.
- The present invention can also be implemented in a way that unifies the processing of the video encoding device and the video decoding device. That is, a configuration in which the output of the encoding unit 104 is processed by the decoding unit 601 of FIG. 6 and the decoded image is input to the storage unit 107 is also conceivable.
- the device configurations of the video encoding device and the video decoding device in the second embodiment are the same as the device configuration in the first embodiment except for the detailed configuration of the prediction generation unit.
- differences between the prediction generation unit in the present embodiment and the prediction generation units 108 and 605 in the first embodiment will be described.
- the prediction generation unit 800 of this embodiment includes a determination unit 801, a template region determination unit 802, a matching unit 803, and a compensation unit 804.
- The determination unit 801 is an estimation means that compares the reproduction signal of the template region with the reproduced image stored in the storage unit 107 or 604 and, based on the comparison result, estimates the spatial continuity of the image signal in the region to be encoded or decoded (the prediction target block). Spatial continuity is an index indicating how well features such as the direction of motion match within a spatial region. That is, when the motion characteristics differ between, for example, the upper half and the lower half of a certain area, that area has no spatial continuity.
- The determination unit 801 also serves as a setting means that, based on the estimated spatial continuity of the image, further divides the region to be encoded or decoded, sets each divided region as a new region to be encoded or decoded, and sets a template region for that new region.
- Specifically, the determination unit 801 analyzes the reproduced image stored in the storage unit 107 or 604, determines prediction parameters including the size of the template region and the size of the prediction target region, and outputs this information to the template region determination unit 802 and the compensation unit 804. A specific method for determining the prediction parameters will be described later.
- Based on the size information of the template region input from the determination unit 801, the template region determination unit 802 sets the template region used to generate the prediction signal and the image of that region; it is a component of the search means.
- The template area determination unit 802 corresponds to the template area determination unit 201 in the first embodiment and has the same function.
- the matching unit 803 uses the template region image set by the template region determining unit 802 to perform template matching using the reproduced images stored in the storage units 107 and 604 as search regions, and performs search. This is a search means for searching for an area having the highest correlation with the pixel group of the template area within the area.
- the matching unit 803 corresponds to the matching unit 202 in the first embodiment and has the same function.
- The compensation unit 804 is a prediction signal determination means that, based on the region searched by the matching unit 803 (the highly correlated region) and on the positional relationship between the prediction target block and the template region, sets and determines from the reproduced image a prediction signal of the same size as the prediction target block.
- the size of the prediction target block at this time is set by the determination unit 801.
- the compensation unit 804 corresponds to the compensation unit 203 in the first embodiment and has a similar function.
- FIG. 14 is a diagram showing the pixels of the prediction target block 1401 and the reproduced pixels 1402 around it.
- As templates, four areas are prepared: an area A that covers the entire reproduced region 1402 adjacent to the prediction target block 1401, and areas B, C, and D obtained by dividing the region 1402, each covering a part of it (areas B, C, and D do not overlap one another and together add up to the region 1402).
- Template matching is performed on the reproduced image stored in the storage unit using each of the regions A, B, C, and D as the template region, and the corresponding highly correlated regions are obtained.
- SAD is used as the correlation value.
- Writing the SADs obtained for the regions A, B, C, and D as SAD_A, SAD_B, SAD_C, and SAD_D, SAD_A is compared with (SAD_B + SAD_C + SAD_D).
- When these values differ greatly, it is judged that the spatial continuity of the image within the block is low, and the size of the template region and the size of the prediction target region are set smaller than the block size by further dividing the block.
- The size of the prediction target area in that case can be set, for example, according to the division into the areas B, C, and D, and the template area is sized according to the size of the prediction target area.
- On the other hand, when these values are close to each other, it is estimated that there is spatial continuity in the prediction target block 1401. Based on this estimation, it is determined that template matching using area A is effective, and the prediction target area is set to the block size (that is, the block is not divided). Note that the division of the area 1402 adjacent to the prediction target block 1401 used for the above judgment is not limited to the pattern of areas B, C, and D shown in the figure; other division patterns may be used, and the areas may be subdivided further.
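- One possible formalization of this continuity check is sketched below; the exact comparison rule and the threshold are assumptions for illustration, since the embodiment only states that SAD_A is compared with the summed SADs of the sub-templates.

```python
def has_spatial_continuity(sad_a: float, sad_b: float, sad_c: float, sad_d: float,
                           ratio_threshold: float = 1.5) -> bool:
    """Judge spatial continuity by comparing the SAD of the full template A with
    the summed SADs of the sub-templates B, C, and D."""
    return sad_a <= ratio_threshold * (sad_b + sad_c + sad_d)
```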
- In general, prediction based on template matching as in the present invention is not guaranteed to be accurate, so mispredictions need to be avoided. Misprediction tends to occur when the size of the template area is small. On the other hand, if the template area or the prediction target area is large in a portion where the motion is not spatially continuous, fine motion cannot be handled and the prediction error increases. It is therefore effective to reduce the size of the template area and the size of the prediction target area, as in the method according to the present invention, to increase the probability of adapting to fine motion.
- Next, the prediction procedure when the size of the template area and the size of the prediction target area are changed will be described with reference to FIG. 9. If there is spatial continuity in the area adjacent to the prediction target block 901 and the prediction target area is the entire prediction target block 901, processing is performed by a single template matching as in the first embodiment. The following describes the case where, as shown in FIG. 9, the prediction target block 901 is divided into four regions 901a, 901b, 901c, and 901d, the prediction target region size is reduced, and the template region size is adjusted accordingly. It should be noted that the regions to the left of and above the prediction target block 901 are areas of an image that has already been reproduced.
- First, the left and upper regions 902a are set as the template region for the upper-left region 901a of the prediction target block 901, and a prediction signal is set by template matching.
- Next, the upper region 902b is set as the template region for the region 901b to the right of the region 901a, and a prediction signal is set by template matching.
- Next, the left region 902c is set as the template region for the region 901c below the region 901a, for which the prediction signal was set first, and a prediction signal is set by template matching.
- Finally, for the region 901d, the left and upper regions that include parts of the regions 901a, 901b, and 901c are set as the template region; since those parts have not yet been reproduced, their prediction signals are used as the target signal of the template region, and the prediction signal is set by template matching. As a result, prediction signals are set for all regions of the prediction target block 901, and encoding and decoding become possible.
- The change of the sizes of the template area and the prediction target area is not limited to dividing the block in both the vertical and horizontal directions as in Figs. 9(a) to (d); as in Figs. 9(e) and (f), the block may be divided in only the vertical or only the horizontal direction. For example, in Fig. 14(b), when the highly correlated region found for region A contains the highly correlated regions found for regions B, C, D, and E and only the highly correlated region for region F is not contained, the block is divided in the vertical direction as in Figs. 9(e) and (f). In such a case it can be judged that spatial continuity is lost between the upper half and the lower half of the prediction target block.
- In that case, first, the left and upper regions 902e are set as the template region for the upper-half region 901e of the prediction target block 901, and a prediction signal is set by template matching.
- Next, the left region 902f is set as the template region for the region 901f below the region 901e, and a prediction signal is set by template matching.
- As a result, prediction signals are set for all regions of the prediction target block 901, and encoding and decoding become possible.
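- The sub-block-wise prediction order of Figs. 9(a) to (d) can be sketched as follows; `match_fn` stands in for the template-matching search and is an assumption, and the canvas holds reproduced pixels around the block so that each quarter's L-shaped template can reuse the predictions of quarters processed earlier.

```python
import numpy as np

def predict_quarters(canvas: np.ndarray, top: int, left: int, match_fn, sub: int = 4) -> np.ndarray:
    """Predict the four sub-blocks of an 8x8 block in the order upper-left,
    upper-right, lower-left, lower-right, writing each prediction back into
    the canvas so later sub-blocks can use it as template pixels."""
    order = [(0, 0), (0, sub), (sub, 0), (sub, sub)]
    for dy, dx in order:
        y, x = top + dy, left + dx
        template = np.concatenate([canvas[y - 2:y, x - 2:x + sub].ravel(),  # strip above
                                   canvas[y:y + sub, x - 2:x].ravel()])     # strip to the left
        canvas[y:y + sub, x:x + sub] = match_fn(template)                   # prediction written back
    return canvas[top:top + 2 * sub, left:left + 2 * sub]
```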
- When moving image data to be encoded is input to the moving image encoding device, each frame image constituting the moving image is input to the area dividing unit 101.
- the input frame image is divided into a plurality of blocks of a predetermined size by the area dividing unit 101 (S1001). All subsequent processing is performed in units of blocks.
- the block is input to the prediction generation unit 800 and the subtraction unit 102 as a pixel in the encoding target area.
- the prediction generation unit 800 generates a prediction signal of the target block for encoding as follows.
- First, the determination unit 801 determines the prediction parameters to be used for the encoding target block, using the reproduced pixels adjacent to the encoding target block (S1002, estimation step and determination step).
- the determined prediction parameter is input to the template area determination unit 802.
- Next, the template region determination unit 802 sets an encoding target region, and sets as a template a reproduced pixel group in the reproduced pixel region (template region) adjacent to that region (S1003).
- The encoding target area set here may be obtained by dividing the encoding target block as described above.
- Next, the matching unit 803 performs template matching on the reproduced images in the same frame as, and in frames different from, the encoding target block, and searches for a region having a high correlation with the pixel group of the template region (S1004).
- Next, the compensation unit 804 sets, as the prediction signal, a region of the same size as the region to be encoded that is adjacent, in a predetermined positional relationship, to the highly correlated region obtained by template matching (S1005).
- It is then determined whether the prediction signal has been set for all areas of the encoding target block (S1006).
- If not, the series of processing from S1003 to S1005 is repeated until the prediction signal has been set for all areas of the encoding target block. Note that the above determination may be made, for example, by any one of the components described above, or by providing a means that supervises the encoding process in the moving image encoding apparatus.
- the subsequent processes are the same as the corresponding processes (S505 to S511) in the first embodiment.
- As described above, according to the moving image encoding apparatus of the present embodiment, inter-frame prediction can be performed using already encoded, reproduced images without using motion vectors, enabling efficient encoding.
- Moreover, since the size of the prediction signal area can be selected appropriately based on the spatial continuity of the reproduced signal, the coding efficiency is further improved.
- First, decoding is performed by the decoding unit 601 (S1101).
- By this decoding, the converted data in units of blocks is extracted.
- This converted data is input to the inverse conversion unit 602 by the decoding unit 601.
- Position information within the frame of the decoding target block is input from the decoding unit 601 to the prediction generation unit 800. Note that the position of the block to be decoded depends on the encoding order.
- the prediction generation unit 800 generates a prediction signal of the decoding target block as follows.
- the determination unit 801 uses the reproduced image adjacent to the decoding target block to determine a prediction parameter to be used for the decoding target block (S1102, estimation step and determination step).
- the determined prediction parameter is input to the template region determination unit 802.
- the template area determination unit 802 sets a decoding target area based on the set prediction parameter, and sets a reproduced pixel group adjacent to the area as a template (S1103).
- the decoding target area set here may be obtained by dividing the decoding target block as described above.
- Next, the matching unit 803 performs template matching on the reproduced images in the same frame as, and in frames different from, the decoding target block, and searches for a region having a high correlation with the pixel group of the template region (S1104).
- Next, the compensation unit 804 sets, as the prediction signal, a region of the same size as the region to be decoded that is adjacent, in a predetermined positional relationship, to the highly correlated region obtained by template matching (S1105).
- It is then determined whether the prediction signal has been set for all areas of the decoding target block (S1106); if not, the processing from S1103 to S1105 is repeated. Note that this determination may be made, for example, by any one of the components described above, or by providing a means that supervises the decoding process in the moving picture decoding apparatus.
- the subsequent processing (S1107 to S1110) is the same as the corresponding processing (S705 to S708) in the first embodiment.
- Note that the series of processing from S1102 to S1106 and the processing of S1107 only need to be performed before the processing from S1108 onwards, so their order may be reversed.
- According to the moving picture decoding apparatus of the present embodiment, a prediction signal can be determined and a moving picture decoded in the same way as in the moving picture encoding apparatus of the present embodiment. That is, according to the moving picture decoding apparatus of the present embodiment, the moving picture data efficiently encoded by the moving picture encoding apparatus of the present embodiment can be correctly decoded to generate a reproduced image.
- the determination unit 801 in FIG. 8 may determine the size and shape of the prediction target region and the template region at the same time. Therefore, the present invention can be applied to the case where the size and shape of the prediction target area are fixed and the size or shape of the template area is to be switched adaptively. In this case, output from the determination unit 801 to the compensation unit 804 is not necessary.
- a method of selecting a template for the prediction target area 1401 from the areas A, B, C, and D in FIGS. 14 (a) and 14 (b) can be considered.
- As a template for the prediction target region 1401, it is preferable that there is continuity of the pattern with the prediction target region 1401 and that the number of pixels constituting the template is large.
- the determination processing in the determination unit 801 is not limited to the method described above.
- For example, a method is conceivable in which the prediction signal of the template region is compared with the prediction signal obtained when the pixels of that region were actually encoded, and the template shape and size with the smallest mean absolute difference are selected.
- Alternatively, the prediction signal may be generated again using the reproduction signal of the encoded block to which the region belongs, instead of the prediction signal obtained at the time of encoding.
- Another possible method is to select the template shape and size with the smallest mean absolute difference between the prediction signal of the template region and its target signal (reproduced signal).
- A method that ensures continuity of motion instead of continuity of the pattern is also effective. For example, the difference between the motion vector detected using the region A as the template region and the motion vector of an adjacent block (or a predicted motion vector calculated from adjacent blocks) is calculated. If the differential motion vector is smaller than a predetermined threshold, the detected motion vector is used as the motion vector of the prediction region. On the other hand, if the differential motion vector is larger than the predetermined threshold, a motion vector is detected for a different template shape (for example, the regions B, C, and D in Fig. 14(b) or the regions B, D, and F in Fig. 14(c)).
- Then, the motion vector used when the pixels of that region were encoded is compared with the detected motion vector, and if the difference is small, the detected motion vector is selected as the motion vector of the prediction target region.
- In this case too, the motion vector may be detected again using the reproduction signal of the encoded block to which the region belongs. Either method can be implemented using information derived from the reproduction signal or from feature data (such as motion vectors) belonging to the reproduction signal (information relating to the reproduction signal). It is also possible to determine the template shape and the size of the prediction region by comparing the magnitudes of the motion vectors detected with multiple template shapes, or, similarly, by comparing them with the magnitudes of the motion vectors of adjacent blocks.
- The shapes and sizes of the candidate templates are not limited to template divisions such as those shown in FIG. 14. For example, for an 8 pixel × 8 pixel block, a case where the combined size of the template region and the prediction target region is selected from 12 pixels × 12 pixels, 10 pixels × 10 pixels, and 14 pixels × 14 pixels is also included in this modification. [0146] (2) Size determination of the template area and the prediction target area
- In the above description, the determination unit 801 in FIG. 8 determines the size and shape of the template region and the prediction target region by calculation, but the optimal size and shape may instead be determined using the original signal (encoding target block) of the prediction target region, and that information may be encoded.
- In this determination method, the prediction generation unit 108 outputs prediction signals of the prediction target region generated using a plurality of types of templates, the selection unit selects the template size and shape that minimize an index value (SAD, MSE, etc.), and that information is entropy-coded by the encoding unit 104.
- A configuration in which the processing of the selection unit 109 is included in the prediction generation unit 108 is also possible.
- At the edges of an image, part of the target signal of the template region may not exist.
- In that case, the template is generated using only the target signals that do exist.
- For example, the region C shown in Fig. 14(b) is used as the template region at the left end of the image,
- and the region D is used as the template region at the upper end of the image.
- In template matching, the motion that minimizes the index value of the difference signal between the target signal of the template region and its prediction signal is detected within the search range. Therefore, if the target signal of the template region has a distinct feature, appropriate motion prediction can be performed based on that feature.
- When a distinct feature does not appear in the target signal of the template area, however, as in a flat area, there is a high possibility of detecting a motion different from the actual one even if the index value is minimal. In that case, if the difference between the prediction signal of the prediction target region and its target signal is large, the code amount also increases.
- On the other hand, a flat region without distinct features contains few high-frequency components, so the spatial similarity between the original signal and a signal with reduced resolution is high. Therefore, even if the target signal of the prediction target region is encoded at a lower resolution and the low-resolution reproduced signal is enlarged on the decoding side by a simple method, the degradation from the original signal can be suppressed.
- As a differential encoding method suitable for such flat regions, the third embodiment uses a method in which the resolution of the target signal of the prediction target region and of the prediction signal is reduced and the low-resolution difference signal is encoded (that is, the encoding target block and the prediction block are reduced, and the reduced difference block is encoded).
- In the following, a block composed of a prediction signal is referred to as a prediction block,
- a block composed of a difference signal is referred to as a difference block,
- and a block composed of a reproduction signal is referred to as a decoding block.
- The encoding target block is a block composed of the original signal of the encoding target frame of the moving image data.
- FIG. 21 shows a video encoding device 2100 that implements the third embodiment.
- This can be realized by providing a reduction unit 2110 (a collective term for the reduction unit 2110-1 and the reduction unit 2110-2) and an enlargement unit 2111 in the moving image encoding apparatus 100 of FIG. 1.
- The functions of the subtraction unit 2102, the conversion unit 2103, the encoding unit 2104, the inverse conversion unit 2105, and the addition unit 2106 are the same as those of the subtraction unit 102, the conversion unit 103, the encoding unit 104, the inverse conversion unit 105, and the addition unit 106 of FIG. 1, except that the block size they handle is small (for example, 4 pixels × 4 pixels).
- The conversion unit 2103 and the inverse conversion unit 2105 can handle 4 pixel × 4 pixel units, as shown in Non-Patent Document 1, in the same way as the conversion unit 103 and the inverse conversion unit 105 of FIG. 1.
- The difference between the conversion unit 103 and the inverse conversion unit 105 in FIG. 1 and the conversion unit 2103 and the inverse conversion unit 2105 in FIG. 21 is that the number of blocks to be processed is reduced from four to one.
- The reduction unit 2110-1 and the reduction unit 2110-2 respectively reduce the encoding target block obtained from the region dividing unit 101 and the prediction block obtained from the prediction generation unit 108 to a reduced encoding target block and a reduced prediction block, and output them to the subtraction unit 2102.
- the subtraction unit 2102 calculates the difference between the two reduced blocks in units of pixels and outputs the reduced difference block to the conversion unit 2103.
- the conversion unit 2103 performs conversion (and quantization) processing, and the converted data (quantized data) is encoded by the encoding unit 2104.
- The transform data (quantized data) is also subjected to (inverse quantization and) inverse transform processing in the inverse conversion unit 2105, and the decoded reduced difference block is output to the addition unit 2106.
- the adding unit 2106 adds the decoded reduced difference block and the reduced prediction block in units of pixels to generate a decoded reduced block.
- The enlargement unit 2111 enlarges the decoded reduced block to a decoded block having the same size as the encoding target block, and outputs the decoded block to the storage unit 107.
- The processing of the reduction unit 2110 and the enlargement unit 2111 will be described later with reference to FIG. 23.
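- As an aid to following this data flow, the sketch below (Python, not part of the patent) traces one encoding target block through the TMP-L path of FIG. 21; the transform and entropy coding are replaced by a toy quantizer, and the reduction/enlargement are simplified to 2x2 averaging and nearest-neighbour enlargement (FIG. 23, described later, gives the interpolation actually assumed in this embodiment).

```python
import numpy as np

def tmp_l_encode_block(target, pred):
    """Trace one block through the TMP-L path of FIG. 21 (illustrative).
    The transform/entropy coding is replaced by a toy quantizer, and the
    reduction/enlargement are simplified to 2x2 averaging and
    nearest-neighbour enlargement."""
    reduce = lambda b: (b[0::2, 0::2] + b[1::2, 0::2] +
                        b[0::2, 1::2] + b[1::2, 1::2]) / 4.0
    enlarge = lambda b: np.repeat(np.repeat(b, 2, axis=0), 2, axis=1)

    r_target = reduce(target.astype(np.float64))   # reduction unit 2110-1
    r_pred = reduce(pred.astype(np.float64))       # reduction unit 2110-2
    r_diff = r_target - r_pred                     # subtraction unit 2102
    dec_r_diff = np.rint(r_diff / 8.0) * 8.0       # units 2103/2104/2105 (toy quantizer)
    dec_r_block = r_pred + dec_r_diff              # addition unit 2106
    return np.clip(enlarge(dec_r_block), 0, 255)   # enlargement unit 2111

target = np.full((8, 8), 120, dtype=np.uint8)      # hypothetical 8x8 encoding target block
pred = np.full((8, 8), 118, dtype=np.uint8)        # hypothetical prediction block
print(tmp_l_encode_block(target, pred))
```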
- FIG. 22 shows a video decoding device 2200 that implements the third embodiment.
- This can be realized by providing the video decoding device 600 of FIG. 6 with a reduction unit 2207 and an enlargement unit 2208.
- The processes of the reduction unit 2207 and the enlargement unit 2208 have the same functions as the reduction unit 2110 and the enlargement unit 2111 in FIG. 21.
- The decoding unit 2201, the inverse conversion unit 2202, and the addition unit 2203 differ only in the block size they handle (for example, 4 pixels x 4 pixels); their functions are otherwise the same as those of the decoding unit 601, the inverse conversion unit 602, and the addition unit 603 in FIG. 6.
- As with the encoding device in FIG. 21, the inverse conversion unit 2202 can be realized because the inverse conversion unit 602 in FIG. 6 can handle processing in units of 4 pixels x 4 pixels, as shown in Non-Patent Document 1.
- The difference between the inverse conversion unit 602 in FIG. 6 and the inverse conversion unit 2202 in FIG. 22 is that the number of 4 x 4 blocks to be processed is reduced from four to one.
- the reduction unit 2207 reduces the prediction block obtained from the prediction generation unit 605 to a reduced prediction block, and outputs the reduced prediction block to the addition unit 2203.
- The addition unit 2203 adds, in units of pixels, the decoded reduced difference block obtained by the processing of the decoding unit 2201 and the inverse conversion unit 2202 to the reduced prediction block, and generates a decoded reduced block. As in the encoding device of FIG. 21, the processing of the inverse conversion unit 2202 may include an inverse quantization process.
- The enlargement unit 2208 enlarges the decoded reduced block to a decoded block having the same size as the block to be decoded, and outputs the decoded block to the storage unit 604.
- FIG. 23 shows reduction / enlargement processing in the reduction units 2110 and 2207 and the enlargement units 2111 and 2208.
- Block 2301 shows the block before being reduced.
- Process 2304 shows the pixel generation method for the reduced block in the reduction process. Pixels j, k, m, and n indicate pixels on the block 2301, and pixel P indicates the pixel generated from them on the reduced block. In process 2304, averaging is performed in units of four pixels to calculate each pixel of the reduced block.
- Block 2302 indicates a reduced block obtained by the reduction process.
- a process 2305 shows a pixel generation method on the enlarged block in the enlargement process.
- Pixels A to D indicate pixels on the block 2302
- pixels a to i indicate pixels on the enlarged image.
- In the enlargement process, pixel interpolation/extrapolation is performed by a different method depending on the pixel position. For pixel a, there is only one adjacent pixel on the reduced block (pixel A), so pixel A is copied as pixel a.
- The pixels indicated by white circles in block 2303 are calculated by copying the adjacent pixel on the reduced block. For pixels b to e, there are two adjacent pixels on the reduced block, so these pixels are calculated by linear interpolation using the two adjacent pixels on the reduced block.
- The pixels indicated by black squares in block 2303 are calculated by extrapolation using two adjacent pixels on the reduced block. For pixels f to i, there are four adjacent pixels on the reduced block, so these pixels are calculated by linear interpolation using the four adjacent pixels on the reduced block. Similarly, the pixels indicated by black circles in block 2303 are calculated by linear interpolation using four adjacent pixels on the reduced block.
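- A concrete sketch of the reduction and enlargement of FIG. 23 (Python, illustrative): reduction averages each 2 x 2 group of pixels as in process 2304, and enlargement interpolates linearly from the nearest reduced pixels; the exact copy/extrapolation rules of process 2305 near the block boundary are simplified here to clamping at the edge of the reduced block.

```python
import numpy as np

def reduce_block(block):
    """Process 2304 style reduction: average each 2x2 group of pixels
    (the rounding convention is an assumption)."""
    b = block.astype(np.int32)
    return ((b[0::2, 0::2] + b[1::2, 0::2] +
             b[0::2, 1::2] + b[1::2, 1::2] + 2) // 4).astype(block.dtype)

def enlarge_block(reduced):
    """Process 2305 style enlargement: each enlarged pixel is linearly
    interpolated from its nearest reduced pixels; the copy/extrapolation
    rules near the block boundary are simplified to clamping here."""
    h, w = reduced.shape
    out = np.empty((2 * h, 2 * w), dtype=np.float64)
    for y in range(2 * h):
        for x in range(2 * w):
            ry, rx = (y - 0.5) / 2.0, (x - 0.5) / 2.0   # position on the reduced grid
            y0 = int(np.clip(np.floor(ry), 0, h - 1))
            x0 = int(np.clip(np.floor(rx), 0, w - 1))
            y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
            fy = float(np.clip(ry - y0, 0.0, 1.0))
            fx = float(np.clip(rx - x0, 0.0, 1.0))
            top = (1 - fx) * reduced[y0, x0] + fx * reduced[y0, x1]
            bot = (1 - fx) * reduced[y1, x0] + fx * reduced[y1, x1]
            out[y, x] = (1 - fy) * top + fy * bot
    return np.rint(out).astype(reduced.dtype)

block = np.arange(64, dtype=np.uint8).reshape(8, 8)  # hypothetical 8x8 block
print(enlarge_block(reduce_block(block)))
```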
- FIG. 24 and FIG. 25 show a moving image encoding process and a moving image decoding process that realize the third embodiment, respectively.
- FIGS. 24 and 25 correspond to FIGS. 5 and 7 in the first embodiment, respectively.
- the prediction signal generation processing (S2404, S2504) is collectively described.
- S2401, S2405, S2406, S2407, S2408, S2410, and S2411 in FIG. 24 respectively correspond to S501, S505, S506, S507, S508, S510, and S511 in FIG.
- The reduction unit 2110-1 reduces the encoding target block input from the region dividing unit 101 into a reduced encoding target block, and outputs the reduced block to the subtraction unit 2102.
- The reduction unit 2110-2 reduces the prediction block input from the prediction generation unit 108 into a reduced prediction block, and outputs the reduced block to the subtraction unit 2102 and the addition unit 2106.
- the reduced difference block is encoded and decoded to generate a decoded reduced difference block.
- the adding unit 2106 adds the reduced prediction block and the decoded reduced difference block in pixel units to generate a decoded reduced block.
- The enlargement unit 2111 enlarges the decoded reduced block into a decoded block.
- In the decoding process, the reduction unit 2207 reduces the prediction block input from the prediction generation unit 605 into a reduced prediction block, and outputs the reduced block to the addition unit 2203.
- the reduced differential block is decoded, and a decoded reduced differential block is generated.
- the addition unit 2203 adds the reduced prediction block and the decoded reduced difference block in units of pixels to generate a decoded reduced block.
- the enlargement unit 2208 enlarges the decoded reduced block to a decoded block.
- The template matching method in the prediction generation unit is not limited to the method shown in FIG. That is, the prediction signal generation methods using template matching shown in the first embodiment, the second embodiment, and their modifications can also be applied in the present embodiment and its modifications.
- a selection unit can be added, and the prediction generation units 108 and 605 can be replaced with the prediction generation unit 1108 shown in FIG.
- replacing the prediction generation units 108 and 605 with the prediction generation unit 800 shown in FIG. 8 can be applied as it is because the signal input / output flow does not change.
- FIG. 36 shows the configuration of the prediction generation unit in this modification.
- the figure shows an example of intra prediction in which a prediction signal is generated from a template signal.
- A method for generating a prediction signal from the reproduced signal of the template can be implemented by replacing the prediction generation units 108 and 605 in FIGS. 21 and 22 with the prediction generation unit 3608 in FIG. 36.
- A template is composed of 13 reproduced pixels adjacent to the encoding target block.
- The compensation unit 3603 generates a prediction block from the pixels in the template by the method shown in process 3711 in FIG. 37. In FIG. 37, nine types of compensation methods are presented, but the present invention can be implemented by predefining at least one of them.
- one type may be selected from a plurality of compensation methods in the compensation unit, and a prediction block generated by the selected compensation method may be output.
- the selection method of the compensation method is not limited in the present invention.
- Information on the selected compensation method may be transmitted, or a method of determining using only data shared by the encoder (moving image encoding device) and the decoder (moving image decoding device) may be used.
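- As one hypothetical example of such a compensation method (the nine methods of FIG. 37 are not enumerated in this text), the sketch below fills the prediction block with the mean of the 13 reproduced pixels adjacent to the encoding target block, a DC-style compensation that both encoder and decoder can perform without extra information:

```python
import numpy as np

def dc_compensation(template_pixels, block_size=4):
    """Hypothetical DC-style compensation: fill the prediction block with
    the mean of the reproduced pixels adjacent to the encoding target
    block, using only data shared by encoder and decoder."""
    dc = int(np.rint(np.mean(template_pixels)))
    return np.full((block_size, block_size), dc, dtype=np.uint8)

# 13 reproduced pixels adjacent to a 4x4 encoding target block (made-up values)
template = np.array([100, 102, 101, 99, 98, 103, 104, 102, 101, 100, 99, 98, 97])
print(dc_compensation(template))
```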
- the shape of the template is not limited to that shown in FIG.
- The present invention can be realized not only with pixels at the block boundary but also with a template composed of pixels separated from the block boundary, as long as the pixels are in the reproduced region of the frame to which the encoding target block belongs.
- FIGS. 32 and 34 show other examples of the moving picture coding apparatus, and FIGS. 33 and 35 show other examples of the moving picture decoding apparatus.
- In the video encoding device 3200 in FIG. 32, the reduction unit 3210 reduces not the prediction block but the difference block generated by subtracting the prediction block from the encoding target block in units of pixels. Then, the decoded reduced difference block is enlarged by the enlargement unit 3211, and the enlarged block and the prediction block are added in units of pixels to form a decoded block.
- the video decoding device 3300 in Fig. 33 is a decoding device corresponding to the video encoding device 3200 shown in Fig. 32.
- In the video decoding device 3300, the prediction block is not reduced; the decoded reduced difference block is enlarged by the enlargement unit 3308, and the enlarged block and the prediction block are added in units of pixels to form a decoded block.
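- A minimal sketch of this difference-block variant (Python, illustrative; a toy quantizer stands in for the transform, quantization, and entropy coding of FIGS. 32 and 33):

```python
import numpy as np

def difference_block_variant(target, pred, quant_step=8.0):
    """FIGS. 32/33 style variant (illustrative): only the difference block
    is reduced; the prediction block keeps its original resolution.
    A toy quantizer stands in for transform/quantization/entropy coding."""
    reduce = lambda b: (b[0::2, 0::2] + b[1::2, 0::2] +
                        b[0::2, 1::2] + b[1::2, 1::2]) / 4.0
    enlarge = lambda b: np.repeat(np.repeat(b, 2, axis=0), 2, axis=1)

    diff = target.astype(np.float64) - pred                   # subtraction in units of pixels
    r_diff = reduce(diff)                                     # reduction unit 3210
    dec_r_diff = np.rint(r_diff / quant_step) * quant_step    # encode + local decode (toy)
    enlarged_diff = enlarge(dec_r_diff)                       # enlargement unit 3211 / 3308
    return np.clip(pred + enlarged_diff, 0, 255)              # decoded block
```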
- The moving image encoding device 3400 in FIG. 34 has a configuration in which the function of the reduction unit 2110-2 in FIG. 21 is included in the compensation unit in the prediction generation unit 3408.
- moving picture decoding apparatus 3500 in FIG. 35 has a configuration in which reduction section 2207 in FIG. 22 is included in the compensation section of the prediction generation section.
- the compensation unit can combine the reduction process and the compensation process so as to directly generate the reduced prediction block.
- In that case, the memory size of the compensation unit 203 only needs to be large enough to hold the reduced prediction block.
- The compensation unit may also acquire only the necessary information directly from the reproduced moving image data and generate the reduced prediction block.
- The template region determination unit and the matching unit may acquire all the pixels in the template region from the reproduced moving image data in the storage unit 107 as described above, or the matching process may be performed on a reduced template by acquiring only the necessary information.
- the template area determination unit need only acquire necessary information directly from the reproduced moving picture data in the storage unit 604.
- The configuration of the prediction generation unit in the above-described video encoding devices 3200 and 3400 and video decoding devices 3300 and 3500 can be realized by either the configuration shown in FIG. 2 or the configuration shown in FIG. 36. In the case where a selection unit is included, as in the video encoding device 1800 and the video decoding device 1900 shown in FIG. 18 and FIG. 19, it can be realized by adding a selection unit, and the prediction generation unit shown there can also be replaced. Furthermore, replacing the configuration of FIG. 2 with the prediction generation unit shown in FIG. 8 can be applied as it is, because the signal input/output flow does not change.
- In FIG. 21, the enlargement process for the decoded reduced block may be omitted, and the decoded reduced block may be stored in the frame memory as it is.
- In this case, template matching is performed on the reduced image obtained by integrating the decoded reduced blocks to generate a reduced prediction block. Then, differential encoding between the reduced encoding target block and the reduced prediction block is performed.
- the reduced prediction block may be enlarged and differential encoding with the encoding target block may be performed.
- Similarly, on the decoding side, the enlargement process for the decoded reduced block may be omitted, and the decoded reduced block may be stored in the frame memory as it is.
- template matching is performed on a reduced image obtained by integrating reduced blocks, and a reduced predicted block is generated.
- The decoded reduced difference block and the reduced prediction block are added to reproduce the reduced block. Alternatively, the reduced prediction block may be enlarged, and the decoded difference block and the enlarged block may be added to reproduce the decoded block. Thus, even if the way the reduction and enlargement processes are applied is changed, the effect of reducing the code amount can be obtained.
- the method of the reduction process and the enlargement process may be the method shown in FIG. 23 or another example described later.
- FIGS. 21 and 22 show configurations in which the block reduction/enlargement process is always applied, on the assumption that all the images in the template region are flat. In practice, since flat parts and characteristic parts are mixed in the pattern of an image, it is effective to combine this with the configuration of the first embodiment, in which the block reduction/enlargement process is not performed.
- FIG. 26 shows a video encoding device 2600 that combines the video encoding devices 100 and 2100 shown in FIGS. 1 and 21, and FIG. 27 shows a video decoding device 2700 that combines the video decoding devices 600 and 2200 shown in FIGS. 6 and 22.
- blocks having the same numbers as those in FIGS. 1, 6, 21, and 22 indicate the same functions, and thus detailed description thereof is omitted here.
- The encoding target blocks divided by the region dividing unit 101 are input to the subtraction unit 102 and the reduction unit 2110-1, respectively.
- There are two types of predictive encoding methods (TMP-E mode: the encoding method of the moving picture encoding device 100 shown in FIG. 1; TMP-L mode: the encoding method of the moving picture encoding device 2100 shown in FIG. 21), and the encoding target block is encoded by both methods.
- The conversion unit 103 and the conversion unit 2103 output two types of transform data (quantized data) to the switching unit 2613.
- two types of decoding blocks that are locally decoded by two methods are output to selection section 2612.
- Selection section 2612 selects one type from the two types of decoding blocks, and outputs the selection information to switching section 2613 and sign key section 2604.
- The switching unit 2613 outputs the transform data (quantized data) to the encoding unit 2604 according to the selection information.
- The encoding unit 2604 entropy-codes the selection information and the transform data (quantized data) together.
- As a selection method in the selection unit 2612, the difference signals between the separately input encoding target block and the two types of decoded blocks are compared, and the one with the smaller sum of squared differences is selected. However, since this method does not take the code amount into account, the encoding method with the highest encoding efficiency is not necessarily selected.
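- A minimal sketch of this squared-error comparison (Python, illustrative; it does not account for the code amount, as noted above):

```python
import numpy as np

def select_by_ssd(target, decoded_tmp_e, decoded_tmp_l):
    """Select the decoded block candidate with the smaller sum of squared
    differences from the encoding target block (illustrative; this is the
    simple criterion above, not the rate-distortion method of
    Non-Patent Document 2)."""
    ssd_e = np.sum((target.astype(np.int64) - decoded_tmp_e) ** 2)
    ssd_l = np.sum((target.astype(np.int64) - decoded_tmp_l) ** 2)
    return "TMP-E" if ssd_e <= ssd_l else "TMP-L"
```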
- As a selection method that takes the encoding efficiency into account, for example, the method described in Non-Patent Document 2 can be cited. According to this method, the transform data (quantized data) generated by the two types of predictive encoding methods are each virtually encoded in order to calculate the code amount and distortion.
- This virtual encoding of the transform data may be executed by the selection unit 2612 with the transform data input to it, or it may be executed by the encoding unit 2604 and the information on the code amount input to the selection unit 2612.
- In the video decoding device 2700, the decoding unit 2701 entropy-decodes the selection information for the TMP-E and TMP-L modes and the transform data (quantized data) corresponding to the selected predictive encoding method. Based on the selection information, the switching unit 2709 outputs the transform data (quantized data) to the inverse conversion unit 602 when the selection information indicates the TMP-E mode, and to the inverse conversion unit 2202 when it indicates the TMP-L mode. The transform data (quantized data) is thus decoded by the decoding method indicated by the selection information.
- TMP-E and TMP-L are treated as different prediction code encoding methods, and the selection information is encoded on the encoding side.
- However, the two types of predictive encoding methods may also be treated as one type, and the selection may be made automatically using information shared by the encoding side and the decoding side (the reproduced data and characteristic data derived from the reproduced image).
- The target signal in the template region can be used as the characteristic data. For example, there is a method using the variance of the target signal in the template region: a threshold is set in advance, TMP-E is selected if the variance is larger than the threshold, and TMP-L is selected if the variance is smaller.
- Another conceivable method calculates the pixel gradient (difference values between adjacent pixels) of the target signal in the template region, selecting TMP-E if the number of pixels whose difference value exceeds a threshold is greater than a predetermined number, and TMP-L if it is smaller.
- It is also possible to compare the motion vector detected by template matching with the motion vector used when the pixel group in the template region was decoded, selecting TMP-E if the difference is smaller than a predetermined threshold and TMP-L if it is larger. The magnitude of the detected motion vector, or the motion vector of an adjacent block, may also be used as a reference. These selections based on variance values, pixel gradients, and motion vectors may be combined; when the methods disagree, the final selection may be made by majority vote, or selection information may be transmitted only when they disagree.
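- A minimal sketch of such an automatic selection, combining the variance and pixel-gradient criteria on the reproduced template signal (Python; the threshold values and the order of the tests are assumptions, not taken from the patent):

```python
import numpy as np

def auto_select_mode(template, var_threshold=100.0,
                     grad_threshold=16, grad_count=8):
    """Automatic TMP-E / TMP-L selection from the already reproduced
    target signal in the template region (a 2D array), so that encoder
    and decoder reach the same decision without extra signalling.
    All threshold values here are made-up assumptions."""
    t = template.astype(np.float64)
    # criterion 1: variance of the template signal
    if np.var(t) > var_threshold:
        return "TMP-E"                      # textured template
    # criterion 2: pixel gradients (differences between adjacent pixels)
    strong = (np.count_nonzero(np.abs(np.diff(t, axis=1)) > grad_threshold) +
              np.count_nonzero(np.abs(np.diff(t, axis=0)) > grad_threshold))
    return "TMP-E" if strong > grad_count else "TMP-L"
```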
- When this automatic selection is used, the decoding device also includes a selection unit that performs the same operation as the selection unit of the encoding device and outputs the selection information to the switching unit 2709.
- As in modification (1), the prediction generation unit in each of the devices 2600 and 2700 shown in FIGS. 26 and 27 is not limited to the configuration shown in FIG. 2.
- the configurations shown in FIGS. 8, 20 and 36 can also be applied.
- the prediction generation unit 3608 shown in FIG. 36 performs TMP only in the case of a predetermined compensation method.
- The predictive encoding methods TMP-E and TMP-L described above can be used selectively in combination with the plurality of predictive encoding methods described in Non-Patent Document 1 (inter prediction modes that encode motion vectors and intra prediction modes). In that case, a plurality of block sizes may be prepared for each predictive encoding method.
- The optimum selection of the predictive encoding method and the block size can be realized, for example, by the method shown in Non-Patent Document 2. This can be realized by combining the devices 2600 and 2700 shown in FIG. 26 and FIG. 27 (the prediction generation method may be modified as in modification (1)) with the conventional predictive encoding methods and extending the selection unit. It is also possible to add only TMP-L to the conventional predictive encoding methods.
- The conventional predictive encoding methods may also be combined with each of the devices 3200 and 3300 shown in FIGS. 32 and 33 and each of the devices 3400 and 3500 shown in FIGS. 34 and 35.
- The methods of block reduction processing by the reduction unit and block enlargement processing by the enlargement unit are not limited to the method of FIG. 23. Other examples are shown in FIGS. 28 and 29.
- In FIG. 28, block 2801 indicates a block before being reduced
- a block 2802 indicates a reduced block.
- a reduced block is generated by simple pixel sampling that does not involve filtering processing such as processing 2304.
- Process 2805 shows a pixel generation method on the enlargement block in the enlargement process.
- Pixels A to D are pixels on the block 2802
- pixels a to c are pixels on the enlarged image. Since the pixels A to D are originally pixels before the reduction process, they are copied to the enlarged block 2803 as they are.
- The pixels removed by the reduction process, such as pixels a to c, are calculated by a simple linear interpolation process, similar to process 2305.
- the pixels indicated by the squares in block 2803 are similarly calculated by linear interpolation using adjacent pixels.
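- A minimal sketch of this FIG. 28 style reduction and enlargement (Python, illustrative; which pixel of each 2 x 2 group is kept, and the handling of the last row and column, are assumptions):

```python
import numpy as np

def reduce_by_sampling(block):
    """FIG. 28 style reduction: simple 2:1 subsampling with no filtering
    (which pixel of each 2x2 group is kept is an assumption)."""
    return block[0::2, 0::2].copy()

def enlarge_sampled(reduced):
    """Corresponding enlargement: sampled pixels are copied back to their
    original positions and the removed pixels are filled by simple linear
    interpolation; the last row/column just repeats the nearest sample."""
    h, w = reduced.shape
    out = np.zeros((2 * h, 2 * w), dtype=np.float64)
    out[0::2, 0::2] = reduced                              # pixels A to D copied as they are
    out[0::2, 1:-1:2] = (out[0::2, 0:-2:2] + out[0::2, 2::2]) / 2.0
    out[0::2, -1] = out[0::2, -2]                          # right edge
    out[1:-1:2, :] = (out[0:-2:2, :] + out[2::2, :]) / 2.0
    out[-1, :] = out[-2, :]                                # bottom edge
    return np.rint(out).astype(reduced.dtype)

block = np.arange(64, dtype=np.uint8).reshape(8, 8)        # hypothetical 8x8 block
print(enlarge_sampled(reduce_by_sampling(block)))
```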
- In FIG. 29, block 2901 indicates a block before being reduced
- a block 2902 indicates a reduced block.
- Process 2904 shows the reduction processing method.
- In process 2904, the pixel P on the reduced block is calculated by filter processing using pixel p and the eight pixels adjacent to it (j, k, l, m, n, o, q, r).
- Process 2905 shows the pixel generation method for the enlarged block 2903 in the enlargement process. Since this processing is the same as process 2805 in FIG. 28, its description is omitted. In this case, since the pixel group 2906 is stored in the storage unit, an input path from the storage unit to the enlargement unit is required in FIGS. 21, 22, 26, and 27 in order to realize this processing.
- In the above examples, the size of the reduced block is 1/2 in both the vertical and horizontal reduction ratios, but the reduction ratio is not limited to this.
- The reduction ratio may be 1/4, or the vertical and horizontal reduction ratios may differ.
- The reduction/enlargement method is not limited to one type; one method may be selected from multiple methods.
- For example, the method that minimizes the sum of absolute values or the sum of squares of the encoding error may be selected on a frame basis or a block basis on the encoding side, and the selection information may be encoded. Alternatively, the decoded block may be determined automatically from a plurality of decoded block candidates.
- As the determination method, it is only necessary to use the reproduced data and characteristic data derived from the reproduced image; for example, a method of calculating the average value in units of pixels or a method of selecting the median value in units of pixels can be considered.
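- A minimal sketch of such a pixel-wise combination of decoded block candidates (Python, illustrative; the choice between average and median is exposed as a parameter here):

```python
import numpy as np

def combine_candidates(candidates, use_median=False):
    """Combine several decoded block candidates pixel by pixel using only
    data available to both encoder and decoder: per-pixel average or
    per-pixel median (the pairing of these two options is an assumption)."""
    stack = np.stack([c.astype(np.int32) for c in candidates], axis=0)
    combined = np.median(stack, axis=0) if use_median else np.mean(stack, axis=0)
    return np.rint(combined).astype(candidates[0].dtype)
```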
- Note that the reduction method for the encoding target block does not affect the decoding device. Therefore, as long as the number of pixels in the reduced block is the same, different reduction methods may be applied to the prediction block and to the encoding target block; the decoding device and the decoding process do not need to specify a method for reducing the encoding target block.
- In the devices 2600 and 2700 of FIGS. 26 and 27, the block enlarged by the enlargement units 2111 and 2208 is treated as a candidate for the decoded block; it is also possible to treat it as a prediction block candidate and to adaptively select between it and the prediction block generated by the prediction generation units 108 and 605. Since the high-frequency components of the block enlarged by the enlargement unit are limited by the filter processing, re-encoding this block has the effect of improving the image quality.
- FIG. 30 and FIG. 31 show a video encoding device 3000 and a video decoding device 3100, respectively, that implement this modification.
- The moving image encoding device 3000 in FIG. 30 differs in the functions of the selection unit 3012 and the encoding unit 3004 and in the handling of the transform data (quantized data) output from the conversion unit 2103.
- In the selection unit 2612 of FIG. 26, two types of decoded block candidates are input, whereas in the selection unit 3012 of this modification, two types of prediction block candidates are input from the prediction generation unit 108 and the enlargement unit 2111.
- As the selection method, the methods described for FIG. 26 can be used. However, when using the method of Non-Patent Document 2, it is necessary to virtually encode and decode the two types of prediction block candidates in order to calculate the total distortion and code amount.
- the conversion data (quantized data) output from the conversion unit 2103 must also be virtually encoded and converted into a code amount.
- The selected prediction block is output to the addition unit 106 and the subtraction unit 102, and transform and encoding are performed.
- When the prediction block candidate from the enlargement unit 2111 (TMP-L) is selected, the switch 3013 is turned on, and the transform data (quantized data) output from the conversion unit 2103 is output to the encoding unit 3004.
- the encoding unit 3004 encodes data from the conversion unit 103, the conversion unit 2103 (in the case of TMP-L), and the selection unit (if necessary).
- the decoding unit 3101 first performs entropy decoding on the selection information.
- When the predictive encoding method is the TMP-L mode, entropy decoding is performed on the transform data (quantized data) of the reduced block.
- the transformed data (quantized data) of the reduced block is output to the inverse transform unit 2202 under the control of the switching unit 3109.
- Enlarged block transform data (quantized data) is entropy decoded and output to the inverse transform unit 602 under the control of the switching unit 3109.
- the prediction block generated by template matching in the prediction generation unit 605 is output to the reduction unit 2207 by the control of the switch 3110 based on the selection information.
- The addition unit 603 adds the difference block obtained from the inverse transform unit 602 and the prediction block obtained from the enlargement unit 2208 to generate a decoded block.
- When the predictive encoding method is the TMP-E mode, the entropy-decoded transform data is output to the inverse transform unit 602 under the control of the switching unit 3109.
- the prediction block generated by template matching in the prediction generation unit 605 is output to the addition unit 603 by the control of the switch 3110 based on the selection information.
- the adding unit 603 adds the difference block obtained from the inverse transform unit 602 and the prediction block obtained from the prediction generation unit 605 through the switch 3110 to generate a decoded block.
- the moving picture coding program 1601 is stored in a program storage area 1600a formed on a recording medium 1600 that can be read or provided by the moving picture coding apparatus.
- The moving picture encoding program 1601 includes a main module 1601a that centrally controls moving picture encoding processing, a region dividing module 1601b, a subtraction module 1601c, a conversion module 1601d, an encoding module 1601e, an inverse conversion module 1601f, an addition module 1601g, a storage module 1601h, and a prediction generation module 1601i.
- the prediction generation module 1601i includes a template area determination module 1601j, a matching module 1601k, and a compensation module 1601m.
- The functions realized by executing the respective modules are the same as the functions of the corresponding components of the moving picture coding apparatus 100 described above. That is, the functions realized by executing the area division module 1601b, the subtraction module 1601c, the conversion module 1601d, the encoding module 1601e, the inverse conversion module 1601f, the addition module 1601g, the storage module 1601h, and the prediction generation module 1601i are the same as those of the area dividing unit 101, the subtraction unit 102, the conversion unit 103, the encoding unit 104, the inverse conversion unit 105, the addition unit 106, the storage unit 107, and the prediction generation unit 108 in the video encoding device 100 of the above embodiment.
- The functions realized by executing the template area determination module 1601j, the matching module 1601k, and the compensation module 1601m are the same as those of the template area determination unit 201, the matching unit 202, and the compensation unit 203 in the moving image encoding apparatus 100 of the above embodiment.
- the moving picture decoding program 1701 is stored in a program storage area 1700a formed on a recording medium 1700 that can be read or provided by the moving picture decoding apparatus.
- the moving picture decoding program 1701 includes a main module 1701a that comprehensively controls moving picture decoding processing, a decoding module 1701b, an inverse conversion module 1701c, an addition module 1701d, a storage module 1701e, And a prediction generation module 1701f.
- the prediction generation module 1701f includes a template region determination module 1701g, a matching module 1701h, and a compensation module 1701i.
- The functions realized by executing the respective modules are the same as the functions of the corresponding components of the moving picture decoding apparatus 600 described above. That is, the functions realized by executing the decoding module 1701b, the inverse conversion module 1701c, the addition module 1701d, the storage module 1701e, and the prediction generation module 1701f are the same as those of the decoding unit 601, the inverse conversion unit 602, the addition unit 603, the storage unit 604, and the prediction generation unit 605 in the moving picture decoding device 600 of the above embodiment.
- The functions realized by executing the template region determination module 1701g, the matching module 1701h, and the compensation module 1701i are the same as those of the template region determination unit 201, the matching unit 202, and the compensation unit 203 in the video encoding device 100 or the video decoding device 600 of the above embodiment.
- A configuration in which part or all of the moving image encoding program 1601 and the moving image decoding program 1701 is transmitted via a transmission medium such as a communication line and is received and recorded (installed) by another device is also included in the present invention.
- Similarly, by preparing modules for the functions of the moving image encoding device and the moving image decoding device of the modification of the first embodiment, the second and third embodiments, and their modifications, a moving image encoding program and a moving image decoding program can be configured, and these are also included in the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Image Processing (AREA)
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP06780792A EP1919223A4 (en) | 2005-07-05 | 2006-07-05 | DEVICE, METHOD AND PROGRAM FOR DYNAMIC IMAGE ENCODING AND DYNAMIC IMAGE DECODING DEVICE, METHOD, AND PROGRAM |
US11/994,712 US8369628B2 (en) | 2005-07-05 | 2006-07-05 | Video encoding device, video encoding method, video encoding program, video decoding device, video decoding method, and video decoding program |
KR1020127029681A KR20120138248A (ko) | 2005-07-05 | 2006-07-05 | 동화상 부호화 장치, 동화상 부호화 방법, 동화상 부호화 프로그램, 동화상 복호 장치, 동화상 복호 방법 및 동화상 복호 프로그램 |
US13/596,672 US9282340B2 (en) | 2005-07-05 | 2012-08-28 | Video encoding device, video encoding method, video encoding program, video decoding device, video decoding method, and video decoding program |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2005196351 | 2005-07-05 | ||
JP2005-196351 | 2005-07-05 | ||
JP2006-094391 | 2006-03-30 | ||
JP2006094391A JP2007043651A (ja) | 2005-07-05 | 2006-03-30 | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/994,712 A-371-Of-International US8369628B2 (en) | 2005-07-05 | 2006-07-05 | Video encoding device, video encoding method, video encoding program, video decoding device, video decoding method, and video decoding program |
US13/596,672 Continuation US9282340B2 (en) | 2005-07-05 | 2012-08-28 | Video encoding device, video encoding method, video encoding program, video decoding device, video decoding method, and video decoding program |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007004678A1 true WO2007004678A1 (ja) | 2007-01-11 |
Family
ID=37604537
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2006/313416 WO2007004678A1 (ja) | 2005-07-05 | 2006-07-05 | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム |
Country Status (7)
Country | Link |
---|---|
US (2) | US8369628B2 (ja) |
EP (3) | EP2475175A3 (ja) |
JP (1) | JP2007043651A (ja) |
KR (4) | KR20100019541A (ja) |
CN (1) | CN103024382B (ja) |
RU (2) | RU2391794C2 (ja) |
WO (1) | WO2007004678A1 (ja) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007093629A1 (en) * | 2006-02-17 | 2007-08-23 | Thomson Licensing | Process for coding images using intra prediction mode |
WO2008126843A1 (ja) * | 2007-04-09 | 2008-10-23 | Ntt Docomo, Inc. | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法および画像予測復号プログラム |
JP2008283662A (ja) * | 2007-04-09 | 2008-11-20 | Ntt Docomo Inc | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法および画像予測復号プログラム |
JP2008289005A (ja) * | 2007-05-18 | 2008-11-27 | Ntt Docomo Inc | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法および画像予測復号プログラム |
JP2008300943A (ja) * | 2007-05-29 | 2008-12-11 | Sharp Corp | 画像復号装置及び画像符号化装置 |
EP2262270A1 (en) * | 2008-04-11 | 2010-12-15 | Huawei Technologies Co., Ltd. | Method, device and system for interframe prediction encoding and decoding |
JP2011205693A (ja) * | 2011-06-14 | 2011-10-13 | Sharp Corp | 画像復号装置及び画像符号化装置 |
RU2472305C2 (ru) * | 2007-02-23 | 2013-01-10 | Ниппон Телеграф Энд Телефон Корпорейшн | Способ кодирования видео и способ декодирования видео, устройства для этого, программы для этого и носители хранения, на которых хранятся программы |
US9565442B2 (en) | 2011-11-08 | 2017-02-07 | Kt Corporation | Method and apparatus for coefficient scan based on partition mode of prediction unit |
JP2018513571A (ja) * | 2016-02-17 | 2018-05-24 | テレフオンアクチーボラゲット エルエム エリクソン(パブル) | ビデオピクチャを符号化および復号する方法および装置 |
US11700384B2 (en) | 2011-07-17 | 2023-07-11 | Qualcomm Incorporated | Signaling picture size in video coding |
Families Citing this family (114)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2007244443A1 (en) * | 2006-04-28 | 2007-11-08 | Ntt Docomo, Inc. | Image predictive coding device, image predictive coding method, image predictive coding program, image predictive decoding device, image predictive decoding method and image predictive decoding program |
JP4994767B2 (ja) * | 2006-10-03 | 2012-08-08 | 株式会社エヌ・ティ・ティ・ドコモ | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法及び画像予測復号プログラム |
CN101573982B (zh) * | 2006-11-03 | 2011-08-03 | 三星电子株式会社 | 利用运动矢量跟踪编码/解码图像的方法和装置 |
KR101365567B1 (ko) * | 2007-01-04 | 2014-02-20 | 삼성전자주식회사 | 영상의 예측 부호화 방법 및 장치, 그 복호화 방법 및 장치 |
JP2010509799A (ja) * | 2006-11-03 | 2010-03-25 | サムスン エレクトロニクス カンパニー リミテッド | 映像の予測符号化方法及び装置、その復号化方法及び装置 |
JP4685825B2 (ja) * | 2007-04-06 | 2011-05-18 | 学校法人東京理科大学 | 動画像復号装置、方法及びプログラム、並びに、動画像符号化装置、方法及びプログラム |
KR101378338B1 (ko) * | 2007-06-14 | 2014-03-28 | 삼성전자주식회사 | 영상 복구를 이용한 인트라 예측 부호화, 복호화 방법 및장치 |
US20090003443A1 (en) * | 2007-06-26 | 2009-01-01 | Nokia Corporation | Priority-based template matching intra prediction video and image coding |
EP2034742A3 (en) | 2007-07-25 | 2009-10-14 | Hitachi Ltd. | Video coding method and device |
KR101403343B1 (ko) * | 2007-10-04 | 2014-06-09 | 삼성전자주식회사 | 부화소 움직임 추정을 이용한 인터 예측 부호화, 복호화방법 및 장치 |
CN101415122B (zh) * | 2007-10-15 | 2011-11-16 | 华为技术有限公司 | 一种帧间预测编解码方法及装置 |
EP2232869A1 (fr) * | 2007-11-28 | 2010-09-29 | France Telecom | Codage de mouvement sans transmission d' information de mouvement, et decodage |
EP2234403A4 (en) * | 2007-12-28 | 2011-12-21 | Sharp Kk | MOBILE IMAGE ENCODER AND DECODER |
JP5011138B2 (ja) * | 2008-01-25 | 2012-08-29 | 株式会社日立製作所 | 画像符号化装置、画像符号化方法、画像復号化装置、画像復号化方法 |
KR20090095012A (ko) * | 2008-03-04 | 2009-09-09 | 삼성전자주식회사 | 연속적인 움직임 추정을 이용한 영상 부호화, 복호화 방법및 장치 |
JP2011515060A (ja) * | 2008-03-09 | 2011-05-12 | エルジー エレクトロニクス インコーポレイティド | ビデオ信号のエンコーディングまたはデコーディング方法及び装置 |
JP4623111B2 (ja) | 2008-03-13 | 2011-02-02 | ソニー株式会社 | 画像処理装置、画像処理方法及びプログラム |
JP5413923B2 (ja) * | 2008-04-11 | 2014-02-12 | トムソン ライセンシング | 変位イントラ予測およびテンプレート・マッチングのためのデブロッキング・フィルタリング |
JP5406465B2 (ja) * | 2008-04-24 | 2014-02-05 | 株式会社Nttドコモ | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法及び画像予測復号プログラム |
JP2010035137A (ja) * | 2008-07-01 | 2010-02-12 | Sony Corp | 画像処理装置および方法、並びにプログラム |
JP2010016454A (ja) * | 2008-07-01 | 2010-01-21 | Sony Corp | 画像符号化装置および方法、画像復号装置および方法、並びにプログラム |
JP2010016453A (ja) * | 2008-07-01 | 2010-01-21 | Sony Corp | 画像符号化装置および方法、画像復号装置および方法、並びにプログラム |
CN102106149A (zh) * | 2008-08-08 | 2011-06-22 | 夏普株式会社 | 动态图像编码装置以及动态图像解码装置 |
US20110170605A1 (en) * | 2008-09-24 | 2011-07-14 | Kazushi Sato | Image processing apparatus and image processing method |
CN102160381A (zh) * | 2008-09-24 | 2011-08-17 | 索尼公司 | 图像处理设备和方法 |
WO2010035733A1 (ja) * | 2008-09-24 | 2010-04-01 | ソニー株式会社 | 画像処理装置および方法 |
WO2010035735A1 (ja) * | 2008-09-24 | 2010-04-01 | ソニー株式会社 | 画像処理装置および方法 |
JP5422168B2 (ja) | 2008-09-29 | 2014-02-19 | 株式会社日立製作所 | 動画像符号化方法および動画像復号化方法 |
CN102224734B (zh) * | 2008-10-02 | 2013-11-13 | 索尼公司 | 图像处理设备和方法 |
KR101098739B1 (ko) * | 2008-11-24 | 2011-12-23 | 한국전자통신연구원 | 비디오 신호의 부호화/복호화 장치 및 방법 |
EP2392141A1 (fr) * | 2009-01-28 | 2011-12-07 | France Telecom | Procede et dispositif de codage d'une image utilisant un masque de prediction, procede et dispositif de decodage, signal et programmes d'ordinateur correspondants |
TW201032600A (en) * | 2009-02-20 | 2010-09-01 | Sony Corp | Image processing device and method |
TWI405469B (zh) * | 2009-02-20 | 2013-08-11 | Sony Corp | Image processing apparatus and method |
DK3567852T3 (da) * | 2009-03-23 | 2023-01-16 | Ntt Docomo Inc | Billedforudsigelsesafkodningsindretning og billedforudsigelsesafkodningsfremgangsmåde |
JP2010258738A (ja) * | 2009-04-24 | 2010-11-11 | Sony Corp | 画像処理装置および方法、並びにプログラム |
JP2010268259A (ja) * | 2009-05-15 | 2010-11-25 | Sony Corp | 画像処理装置および方法、並びにプログラム |
US8873626B2 (en) * | 2009-07-02 | 2014-10-28 | Qualcomm Incorporated | Template matching for video coding |
CN102577391A (zh) * | 2009-10-20 | 2012-07-11 | 夏普株式会社 | 图像编码装置、图像解码装置、以及编码数据的数据结构 |
KR101457418B1 (ko) * | 2009-10-23 | 2014-11-04 | 삼성전자주식회사 | 계층적 부호화 단위의 크기에 따른 비디오 부호화 방법과 그 장치, 및 비디오 복호화 방법과 그 장치 |
JP5321426B2 (ja) * | 2009-11-26 | 2013-10-23 | 株式会社Jvcケンウッド | 画像符号化装置、画像復号化装置、画像符号化方法、及び画像復号化方法 |
KR101601854B1 (ko) * | 2009-12-04 | 2016-03-10 | 에스케이 텔레콤주식회사 | 공간적 예측장치 및 그 예측방법, 그것을 이용한 영상 부호화 장치 및 방법, 및 영상 복호화 장치 및 방법 |
WO2011071514A2 (en) * | 2009-12-08 | 2011-06-16 | Thomson Licensing | Methods and apparatus for adaptive residual updating of template matching prediction for video encoding and decoding |
JP5321439B2 (ja) | 2009-12-15 | 2013-10-23 | 株式会社Jvcケンウッド | 画像符号化装置、画像復号化装置、画像符号化方法、及び、画像復号化方法 |
CN102804774B (zh) | 2010-01-19 | 2016-08-24 | 汤姆逊许可证公司 | 用于视频编解码的降低了复杂度的模板匹配预测方法和装置 |
US9813707B2 (en) | 2010-01-22 | 2017-11-07 | Thomson Licensing Dtv | Data pruning for video compression using example-based super-resolution |
KR101789845B1 (ko) | 2010-01-22 | 2017-11-20 | 톰슨 라이센싱 | 샘플링 기반 초 해상도 비디오 인코딩 및 디코딩을 위한 방법 및 장치 |
EP3703369B1 (en) * | 2010-04-13 | 2024-07-24 | GE Video Compression, LLC | Sample region merging |
CN106067983B (zh) | 2010-04-13 | 2019-07-12 | Ge视频压缩有限责任公司 | 解码数据流的方法、生成数据流的方法及解码器 |
CN106231337B (zh) | 2010-04-13 | 2020-06-19 | Ge视频压缩有限责任公司 | 解码器、解码方法、编码器以及编码方法 |
BR122020008249B1 (pt) | 2010-04-13 | 2021-02-17 | Ge Video Compression, Llc | herança em amostra de arranjo em subdivisão multitree |
JP5455229B2 (ja) * | 2010-04-26 | 2014-03-26 | 株式会社Kddi研究所 | 画像符号化装置及び画像復号装置 |
PL3104616T3 (pl) | 2010-07-09 | 2017-10-31 | Samsung Electronics Co Ltd | Urządzenie do entropijnego dekodowania współczynników przekształcenia |
WO2012005537A2 (ko) * | 2010-07-09 | 2012-01-12 | 한국전자통신연구원 | 템플릿 매칭을 이용한 영상 부호화 방법 및 장치, 그리고 복호화 방법 및 장치 |
US10091529B2 (en) * | 2010-07-09 | 2018-10-02 | Samsung Electronics Co., Ltd. | Method and apparatus for entropy encoding/decoding a transform coefficient |
KR101836981B1 (ko) * | 2010-07-09 | 2018-03-09 | 한국전자통신연구원 | 템플릿 매칭을 이용한 영상 부호화 방법 및 장치, 그리고 복호화 방법 및 장치 |
US9338477B2 (en) | 2010-09-10 | 2016-05-10 | Thomson Licensing | Recovering a pruned version of a picture in a video sequence for example-based data pruning using intra-frame patch similarity |
US20130163674A1 (en) * | 2010-09-10 | 2013-06-27 | Thomson Licensing | Encoding of the Link to a Reference Block in Video Compression by Image Content Based on Search and Ranking |
US9544598B2 (en) | 2010-09-10 | 2017-01-10 | Thomson Licensing | Methods and apparatus for pruning decision optimization in example-based data pruning compression |
CN102447895B (zh) * | 2010-09-30 | 2013-10-02 | 华为技术有限公司 | 扫描方法及装置、反扫描方法及装置 |
PL3637778T3 (pl) * | 2010-10-06 | 2024-09-23 | Ntt Docomo, Inc. | Sposób dwupredykcyjnego dekodowania obrazu |
US8428375B2 (en) * | 2010-11-17 | 2013-04-23 | Via Technologies, Inc. | System and method for data compression and decompression in a graphics processing system |
KR101950419B1 (ko) | 2010-11-24 | 2019-02-21 | 벨로스 미디어 인터내셔널 리미티드 | 움직임 벡터 산출 방법, 화상 부호화 방법, 화상 복호 방법, 움직임 벡터 산출 장치 및 화상 부호화 복호 장치 |
JP5711514B2 (ja) * | 2010-12-14 | 2015-04-30 | 日本電信電話株式会社 | 符号化装置、復号装置、符号化方法、復号方法、符号化プログラム及び復号プログラム |
CN103392341A (zh) | 2010-12-23 | 2013-11-13 | 三星电子株式会社 | 用于对图像预测单元的帧内预测模式进行编码的方法和装置,以及用于对图像预测单元的帧内预测模式进行解码的方法和装置 |
JP5594841B2 (ja) | 2011-01-06 | 2014-09-24 | Kddi株式会社 | 画像符号化装置及び画像復号装置 |
US8913662B2 (en) * | 2011-01-06 | 2014-12-16 | Qualcomm Incorporated | Indicating intra-prediction mode selection for video coding using CABAC |
CN106851306B (zh) * | 2011-01-12 | 2020-08-04 | 太阳专利托管公司 | 动态图像解码方法和动态图像解码装置 |
WO2012096150A1 (ja) * | 2011-01-12 | 2012-07-19 | 三菱電機株式会社 | 動画像符号化装置、動画像復号装置、動画像符号化方法及び動画像復号方法 |
JP5498972B2 (ja) * | 2011-01-21 | 2014-05-21 | 日本放送協会 | 符号化装置、復号装置及びプログラム |
JP6108309B2 (ja) | 2011-02-22 | 2017-04-05 | サン パテント トラスト | 動画像符号化方法、動画像符号化装置、動画像復号方法、および、動画像復号装置 |
MX2013009864A (es) | 2011-03-03 | 2013-10-25 | Panasonic Corp | Metodo de codificacion de imagenes en movimiento, metodo de decodificacion de imagenes en movimiento, aparato de codificacion de imagenes en movimiento, aparato de decodificacion de imagenes en movimiento y aparato de codificacion y decodificacion de imagenes en movimiento. |
JPWO2012120840A1 (ja) * | 2011-03-07 | 2014-07-17 | パナソニック株式会社 | 画像復号方法、画像符号化方法、画像復号装置および画像符号化装置 |
US9832460B2 (en) * | 2011-03-09 | 2017-11-28 | Canon Kabushiki Kaisha | Image coding apparatus, method for coding image, program therefor, image decoding apparatus, method for decoding image, and program therefor |
US9648334B2 (en) | 2011-03-21 | 2017-05-09 | Qualcomm Incorporated | Bi-predictive merge mode based on uni-predictive neighbors in video coding |
KR101383775B1 (ko) | 2011-05-20 | 2014-04-14 | 주식회사 케이티 | 화면 내 예측 방법 및 장치 |
JP5807402B2 (ja) * | 2011-06-15 | 2015-11-10 | 富士通株式会社 | 動画像復号装置、動画像符号化装置、動画像復号方法、動画像符号化方法、動画像復号プログラム及び動画像符号化プログラム |
US9313494B2 (en) * | 2011-06-20 | 2016-04-12 | Qualcomm Incorporated | Parallelization friendly merge candidates for video coding |
JP5649524B2 (ja) | 2011-06-27 | 2015-01-07 | 日本電信電話株式会社 | 映像符号化方法,装置,映像復号方法,装置およびそれらのプログラム |
JP5729817B2 (ja) * | 2011-06-29 | 2015-06-03 | 日本電信電話株式会社 | 動画像符号化装置、動画像復号装置、動画像符号化方法、動画像復号方法、動画像符号化プログラム及び動画像復号プログラム |
MX2014000159A (es) * | 2011-07-02 | 2014-02-19 | Samsung Electronics Co Ltd | Metodo y aparato para la codificacion de video, y metodo y aparato para la decodificacion de video acompañada por inter prediccion utilizando imagen co-localizada. |
EP3675497B1 (en) * | 2011-10-17 | 2021-12-29 | Kabushiki Kaisha Toshiba | Encoding method and decoding method |
JP2013102295A (ja) * | 2011-11-07 | 2013-05-23 | Canon Inc | 画像符号化方法、画像符号化装置及びプログラム、画像復号方法、画像復号装置及びプログラム |
EP3280139B1 (en) * | 2011-11-08 | 2020-03-18 | Kabushiki Kaisha Toshiba | Image decoding method and image decoding apparatus |
JP2013115583A (ja) * | 2011-11-28 | 2013-06-10 | Canon Inc | 動画像符号化装置及びその制御方法並びにプログラム |
CA2825767C (en) * | 2011-12-21 | 2018-11-27 | Panasonic Corporation | Image coding method, image decoding method, image coding apparatus and image decoding apparatus |
TWI720750B (zh) * | 2011-12-28 | 2021-03-01 | 日商Jvc建伍股份有限公司 | 動態影像編碼裝置及動態影像編碼方法 |
EP2615832A1 (en) * | 2012-01-13 | 2013-07-17 | Thomson Licensing | Method and device for encoding a block of an image and corresponding reconstructing method and device |
WO2013145642A1 (ja) * | 2012-03-28 | 2013-10-03 | 株式会社Jvcケンウッド | 画像符号化装置、画像符号化方法、画像符号化プログラム、送信装置、送信方法及び送信プログラム、並びに画像復号装置、画像復号方法、画像復号プログラム、受信装置、受信方法及び受信プログラム |
CA2847304C (en) | 2012-06-27 | 2017-08-22 | Kabushiki Kaisha Toshiba | Encoding device, decoding device, encoding method, and decoding method |
SI2869563T1 (en) | 2012-07-02 | 2018-08-31 | Samsung Electronics Co., Ltd. | The process of entropic video decoding |
WO2014017809A1 (ko) * | 2012-07-24 | 2014-01-30 | 한국전자통신연구원 | 영상의 복호화 방법 및 이를 이용하는 장치 |
JP5798539B2 (ja) * | 2012-09-24 | 2015-10-21 | 株式会社Nttドコモ | 動画像予測符号化装置、動画像予測符号化方法、動画像予測復号装置及び動画像予測復号方法 |
US20150229957A1 (en) * | 2012-09-24 | 2015-08-13 | Qualcomm Incorporated | Depth map coding |
RU2619199C1 (ru) * | 2012-11-08 | 2017-05-12 | Кт Корпорейшен | Способ декодирования видеосигнала |
WO2014094829A1 (en) * | 2012-12-18 | 2014-06-26 | Siemens Aktiengesellschaft | A method for coding a sequence of digital images |
BR112015025151B1 (pt) * | 2013-04-09 | 2022-11-29 | Siemens Aktiengesellschaft | Método de codificação, método de decodificação, método de codificação e decodificação, aparelho para codificação, aparelho para decodificação e codec para codificar e decodificar uma sequência de imagens digitais |
CN104104961B (zh) * | 2013-04-10 | 2018-09-21 | 华为技术有限公司 | 一种视频编码方法、解码方法和装置 |
EP3139605A4 (en) * | 2014-04-28 | 2017-05-17 | Panasonic Intellectual Property Corporation of America | Encoding method, decoding method, encoding apparatus, and decoding apparatus |
JP2019213242A (ja) * | 2014-04-28 | 2019-12-12 | パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America | 符号化方法、復号方法、符号化装置および復号装置 |
CN105338351B (zh) | 2014-05-28 | 2019-11-12 | 华为技术有限公司 | 基于模板匹配的帧内预测编、解码、阵列扫描方法及装置 |
CN104363449B (zh) * | 2014-10-31 | 2017-10-10 | 华为技术有限公司 | 图像预测方法及相关装置 |
CN115278228A (zh) | 2015-11-11 | 2022-11-01 | 三星电子株式会社 | 对视频进行解码的方法和对视频进行编码的方法 |
RU2638756C2 (ru) * | 2016-05-13 | 2017-12-15 | Кабусики Кайся Тосиба | Устройство кодирования, устройство декодирования, способ кодирования и способ декодирования |
US10397569B2 (en) * | 2016-06-03 | 2019-08-27 | Mediatek Inc. | Method and apparatus for template-based intra prediction in image and video coding |
WO2017220164A1 (en) * | 2016-06-24 | 2017-12-28 | Huawei Technologies Co., Ltd. | Devices and methods for video coding using segmentation based partitioning of video coding blocks |
WO2018097077A1 (ja) * | 2016-11-22 | 2018-05-31 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 符号化装置、復号装置、符号化方法及び復号方法 |
MX2020008575A (es) * | 2018-02-15 | 2020-11-12 | Arris Entpr Llc | Tamaño de plantilla variable para coincidencia de plantilla. |
WO2019185808A1 (en) * | 2018-03-29 | 2019-10-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Intra-prediction mode concept for block-wise picture coding |
RU2767973C1 (ru) | 2019-03-08 | 2022-03-22 | ДжейВиСиКЕНВУД Корпорейшн | Устройство для кодирования видео, способ кодирования видео, устройство для декодирования видео и способ декодирования видео |
TWI749676B (zh) * | 2020-08-03 | 2021-12-11 | 緯創資通股份有限公司 | 影像品質評估裝置及其影像品質評估方法 |
WO2024091002A1 (ko) * | 2022-10-26 | 2024-05-02 | 주식회사 윌러스표준기술연구소 | 비디오 신호 처리 방법 및 이를 위한 장치 |
WO2024148106A2 (en) * | 2023-01-04 | 2024-07-11 | Qualcomm Incorporated | Multiple modes and multiple templates for template matching related video coding tools |
WO2024155170A1 (ko) * | 2023-01-20 | 2024-07-25 | 주식회사 윌러스표준기술연구소 | 비디오 신호 처리 방법 및 이를 위한 장치 |
WO2024159187A1 (en) * | 2023-01-27 | 2024-08-02 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. . | Systems and methods for indicating intra template matching prediction |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0262180A (ja) * | 1988-08-26 | 1990-03-02 | Fujitsu Ltd | 動画像の動き補償予測符号化方式 |
JP2001028756A (ja) * | 1999-06-07 | 2001-01-30 | Lucent Technol Inc | コンテクストベースでフレーム内コーディングモードとフレーム間コーディングモードとの間の選択を行なうための方法および装置 |
US6289052B1 (en) * | 1999-06-07 | 2001-09-11 | Lucent Technologies Inc. | Methods and apparatus for motion estimation using causal templates |
JP2002118849A (ja) * | 2000-10-06 | 2002-04-19 | Nec Corp | 動画像符号化方法、動画像符号化装置、動画像復号化装置及びそれらを備えた動画像通信システム |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SU1730724A1 (ru) | 1990-04-17 | 1992-04-30 | Всесоюзный научно-исследовательский институт телевидения | Кодер сигнала изображени |
JPH0795566A (ja) | 1993-09-21 | 1995-04-07 | Nippon Telegr & Teleph Corp <Ntt> | 画像符号化方法および装置 |
KR100365555B1 (ko) * | 1994-10-19 | 2003-08-27 | 마츠시타 덴끼 산교 가부시키가이샤 | 화상부호화/복호화장치 |
RU2093968C1 (ru) | 1995-08-02 | 1997-10-20 | Закрытое акционерное общество "Техно-ТМ" | Способ кодирования-декодирования изображений и устройство для его осуществления |
KR100371130B1 (ko) * | 1996-05-28 | 2003-02-07 | 마쯔시다덴기산교 가부시키가이샤 | 화상예측 복호화 장치 및 그 방법과 화상예측 부호화 장치및 그 방법 |
JP3343554B1 (ja) | 1996-05-28 | 2002-11-11 | 松下電器産業株式会社 | 画像予測復号化方法及び画像予測符号化装置 |
JP3466032B2 (ja) | 1996-10-24 | 2003-11-10 | 富士通株式会社 | 動画像符号化装置および復号化装置 |
TW358296B (en) | 1996-11-12 | 1999-05-11 | Matsushita Electric Ind Co Ltd | Digital picture encoding method and digital picture encoding apparatus, digital picture decoding method and digital picture decoding apparatus, and data storage medium |
JP3604864B2 (ja) * | 1997-04-25 | 2004-12-22 | シャープ株式会社 | 動画像符号化装置 |
US6359929B1 (en) * | 1997-07-04 | 2002-03-19 | Matsushita Electric Industrial Co., Ltd. | Image predictive decoding method, image predictive decoding apparatus, image predictive coding apparatus, and data storage medium |
SG116400A1 (en) * | 1997-10-24 | 2005-11-28 | Matsushita Electric Ind Co Ltd | A method for computational graceful degradation inan audiovisual compression system. |
JPH11341496A (ja) * | 1998-05-28 | 1999-12-10 | Matsushita Electric Ind Co Ltd | 画像処理方法,画像処理装置,及びデータ記憶媒体 |
JP3675429B2 (ja) * | 2002-09-12 | 2005-07-27 | 独立行政法人産業技術総合研究所 | 適応型予測符号化、復号化方法およびそれらの装置ならびに適応型予測符号化、復号化プログラムを記録した記録媒体 |
JP3715283B2 (ja) * | 2003-02-04 | 2005-11-09 | 株式会社半導体理工学研究センター | 動画像の画像圧縮符号化方法及び装置 |
CN100534192C (zh) * | 2003-10-28 | 2009-08-26 | 松下电器产业株式会社 | 帧内预测编码方法 |
JP2006174415A (ja) * | 2004-11-19 | 2006-06-29 | Ntt Docomo Inc | 画像復号装置、画像復号プログラム、画像復号方法、画像符号化装置、画像符号化プログラム及び画像符号化方法 |
US8014613B2 (en) * | 2007-04-16 | 2011-09-06 | Sharp Laboratories Of America, Inc. | Methods and systems for inter-layer image parameter prediction |
-
2006
- 2006-03-30 JP JP2006094391A patent/JP2007043651A/ja active Pending
- 2006-07-05 KR KR1020097027156A patent/KR20100019541A/ko not_active Application Discontinuation
- 2006-07-05 US US11/994,712 patent/US8369628B2/en active Active
- 2006-07-05 EP EP20110192765 patent/EP2475175A3/en not_active Withdrawn
- 2006-07-05 EP EP06780792A patent/EP1919223A4/en not_active Withdrawn
- 2006-07-05 RU RU2008104131A patent/RU2391794C2/ru not_active IP Right Cessation
- 2006-07-05 KR KR1020127029681A patent/KR20120138248A/ko active IP Right Grant
- 2006-07-05 EP EP13166788.3A patent/EP2627091B1/en active Active
- 2006-07-05 KR KR20087001224A patent/KR20080019294A/ko not_active Application Discontinuation
- 2006-07-05 WO PCT/JP2006/313416 patent/WO2007004678A1/ja active Application Filing
- 2006-07-05 KR KR1020127002838A patent/KR20120030585A/ko not_active Application Discontinuation
- 2006-07-05 CN CN201210539613.3A patent/CN103024382B/zh active Active
-
2009
- 2009-12-30 RU RU2009149744/09A patent/RU2009149744A/ru not_active Application Discontinuation
-
2012
- 2012-08-28 US US13/596,672 patent/US9282340B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0262180A (ja) * | 1988-08-26 | 1990-03-02 | Fujitsu Ltd | 動画像の動き補償予測符号化方式 |
JP2001028756A (ja) * | 1999-06-07 | 2001-01-30 | Lucent Technol Inc | コンテクストベースでフレーム内コーディングモードとフレーム間コーディングモードとの間の選択を行なうための方法および装置 |
US6289052B1 (en) * | 1999-06-07 | 2001-09-11 | Lucent Technologies Inc. | Methods and apparatus for motion estimation using causal templates |
JP2002118849A (ja) * | 2000-10-06 | 2002-04-19 | Nec Corp | 動画像符号化方法、動画像符号化装置、動画像復号化装置及びそれらを備えた動画像通信システム |
Non-Patent Citations (2)
Title |
---|
See also references of EP1919223A4 * |
SUGIMOTO K. ET AL.: "Inter frame coding with template matching spatio-temporal prediction", 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), vol. 1, 24 October 2004 (2004-10-24), pages 465 - 468, XP010784855 * |
Cited By (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007093629A1 (en) * | 2006-02-17 | 2007-08-23 | Thomson Licensing | Process for coding images using intra prediction mode |
RU2472305C2 (ru) * | 2007-02-23 | 2013-01-10 | Ниппон Телеграф Энд Телефон Корпорейшн | Способ кодирования видео и способ декодирования видео, устройства для этого, программы для этого и носители хранения, на которых хранятся программы |
US8472522B2 (en) | 2007-02-23 | 2013-06-25 | Nippon Telegraph And Telephone Corporation | Video encoding method and decoding method, apparatuses therefor, programs therefor, and storage media which store the programs |
CN103354613B (zh) * | 2007-04-09 | 2016-01-06 | 株式会社Ntt都科摩 | 图像预测编码装置、图像预测编码方法、图像预测解码装置、以及图像预测解码方法 |
JP2008283662A (ja) * | 2007-04-09 | 2008-11-20 | Ntt Docomo Inc | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法および画像予測復号プログラム |
EP2154901A1 (en) * | 2007-04-09 | 2010-02-17 | NTT DoCoMo, Inc. | Image prediction/encoding device, image prediction/encoding method, image prediction/encoding program, image prediction/decoding device, image prediction/decoding method, and image prediction decoding program |
WO2008126843A1 (ja) * | 2007-04-09 | 2008-10-23 | Ntt Docomo, Inc. | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法および画像予測復号プログラム |
US9031130B2 (en) | 2007-04-09 | 2015-05-12 | Ntt Docomo, Inc. | Image prediction/encoding device, image prediction/encoding method, image prediction/encoding program, image prediction/decoding device, image prediction/decoding method, and image prediction decoding program |
EP2154901A4 (en) * | 2007-04-09 | 2011-06-22 | Ntt Docomo Inc | DEVICE, METHOD AND PROGRAM FOR PICTURE PREDICTING / CODING, DEVICE, METHOD AND PROGRAM FOR PICTURE DECODING / DECODING |
CN104023242A (zh) * | 2007-04-09 | 2014-09-03 | 株式会社Ntt都科摩 | 图像预测编码装置及方法、图像预测解码装置及方法 |
EP2453655A1 (en) * | 2007-04-09 | 2012-05-16 | NTT DoCoMo, Inc. | Image coding using template matching |
CN103997655A (zh) * | 2007-04-09 | 2014-08-20 | 株式会社Ntt都科摩 | 图像预测编码装置及方法、图像预测解码装置及方法 |
EP2571272A1 (en) * | 2007-04-09 | 2013-03-20 | NTT DoCoMo, Inc. | Image coding using template matching |
CN104023242B (zh) * | 2007-04-09 | 2017-07-07 | 株式会社Ntt都科摩 | 图像预测编码装置及方法、图像预测解码装置及方法 |
JP2008289005A (ja) * | 2007-05-18 | 2008-11-27 | Ntt Docomo Inc | 画像予測符号化装置、画像予測符号化方法、画像予測符号化プログラム、画像予測復号装置、画像予測復号方法および画像予測復号プログラム |
JP2008300943A (ja) * | 2007-05-29 | 2008-12-11 | Sharp Corp | 画像復号装置及び画像符号化装置 |
US8693543B2 (en) | 2008-04-11 | 2014-04-08 | Huawei Technologies Co., Ltd. | Inter-frame prediction coding method, device and system |
EP2262270A4 (en) * | 2008-04-11 | 2011-04-06 | Huawei Tech Co Ltd | METHOD, DEVICE AND SYSTEM FOR PICTURE-TO-PICTURE PREDICTION CODING / DECODING |
EP2262270A1 (en) * | 2008-04-11 | 2010-12-15 | Huawei Technologies Co., Ltd. | Method, device and system for interframe prediction encoding and decoding |
JP2011205693A (ja) * | 2011-06-14 | 2011-10-13 | Sharp Corp | 画像復号装置及び画像符号化装置 |
US11700384B2 (en) | 2011-07-17 | 2023-07-11 | Qualcomm Incorporated | Signaling picture size in video coding |
US9648331B2 (en) | 2011-11-08 | 2017-05-09 | Kt Corporation | Method and apparatus for coefficient scan based on partition mode of prediction unit |
US9565442B2 (en) | 2011-11-08 | 2017-02-07 | Kt Corporation | Method and apparatus for coefficient scan based on partition mode of prediction unit |
US9854245B2 (en) | 2011-11-08 | 2017-12-26 | Kt Corporation | Method and apparatus for coefficient scan based on partition mode of prediction unit |
US10080023B2 (en) | 2011-11-08 | 2018-09-18 | Kt Corporation | Method and apparatus for coefficient scan based on partition mode of prediction unit |
JP2018513571A (ja) * | 2016-02-17 | 2018-05-24 | テレフオンアクチーボラゲット エルエム エリクソン(パブル) | ビデオピクチャを符号化および復号する方法および装置 |
Also Published As
Publication number | Publication date |
---|---|
KR20120138248A (ko) | 2012-12-24 |
KR20080019294A (ko) | 2008-03-03 |
EP1919223A4 (en) | 2011-06-15 |
EP2475175A3 (en) | 2012-11-07 |
EP1919223A1 (en) | 2008-05-07 |
CN103024382A (zh) | 2013-04-03 |
JP2007043651A (ja) | 2007-02-15 |
RU2391794C2 (ru) | 2010-06-10 |
EP2627091B1 (en) | 2016-09-07 |
KR20120030585A (ko) | 2012-03-28 |
US20120320976A1 (en) | 2012-12-20 |
US8369628B2 (en) | 2013-02-05 |
CN103024382B (zh) | 2015-03-25 |
US20090116759A1 (en) | 2009-05-07 |
RU2008104131A (ru) | 2009-08-10 |
KR20100019541A (ko) | 2010-02-18 |
RU2009149744A (ru) | 2011-07-10 |
US9282340B2 (en) | 2016-03-08 |
EP2475175A2 (en) | 2012-07-11 |
EP2627091A1 (en) | 2013-08-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007004678A1 (ja) | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム | |
JP5650294B2 (ja) | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム、動画像復号装置、動画像復号方法及び動画像復号プログラム | |
JP6220023B2 (ja) | 画像予測復号方法 | |
RU2608264C2 (ru) | Способ и устройство для кодирования/декодирования вектора движения | |
JP4213646B2 (ja) | 画像符号化装置、画像符号化方法、画像符号化プログラム、画像復号装置、画像復号方法、及び画像復号プログラム。 | |
JP4495580B2 (ja) | 面内予測装置および面内予測方法 | |
JP5373626B2 (ja) | 複数の動きベクトル・プレディクタを使用して動きベクトルを推定する方法、装置、エンコーダ、デコーダ及びデコーディング方法 | |
JP4763422B2 (ja) | イントラ予測装置 | |
JP5686499B2 (ja) | 画像予測符号化装置、方法及びプログラム、画像予測復号装置、方法及びプログラム、並びに、符号化・復号システム及び方法 | |
KR100510137B1 (ko) | 고속 움직임 추정을 위한 참조 픽쳐 및 블록 모드 결정방법, 그 장치, 블록 모드 결정 방법 및 그 장치 | |
KR20100019537A (ko) | 화상 예측 부호화 장치, 화상 예측 복호 장치, 화상 예측 부호화 방법, 화상 예측 복호 방법, 화상 예측 부호화 프로그램, 및 화상 예측 복호 프로그램 | |
KR20110027480A (ko) | 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
JP5367097B2 (ja) | 動きベクトル予測符号化方法、動きベクトル予測復号方法、動画像符号化装置、動画像復号装置およびそれらのプログラム | |
JP5216710B2 (ja) | 復号化処理方法 | |
KR20060084483A (ko) | 동영상 코덱에서의 주파수 변환 계수 예측 방법 및 장치,이를 구비한 부호화 및 복호화 장치와 방법 | |
US20090028241A1 (en) | Device and method of coding moving image and device and method of decoding moving image | |
JP2006005659A (ja) | 画像符号化装置及びその方法 | |
WO2011002091A1 (ja) | 動画像復号化方法及び動画像復号化装置 | |
KR101432777B1 (ko) | 참조 이미지 기반 2차 예측을 통한 동영상 부호화 방법, 장치 및 기록 매체 | |
KR20120035769A (ko) | 움직임 정보 부호화 및 복호화 방법과 이를 이용한 장치 | |
JP2007243859A (ja) | 画像符号化装置及び画像符号化プログラム | |
JP2012124542A (ja) | 動画像符号化方法及び動画像復号化方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200680024625.5 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020087001224 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006780792 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008104131 Country of ref document: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11994712 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020097027156 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020127002838 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020127029681 Country of ref document: KR |