
US20130028326A1 - Moving image encoding device and moving image decoding device

Info

Publication number: US20130028326A1
Application number: US13/639,134
Authority: US
Inventors: Yoshimi Moriya; Shunichi Sekiguchi; Kazuo Sugimoto; Kohtaro Asai; Tokumichi Murakami
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2010-04-09
Filing date: 2011-03-31
Publication date: 2013-01-31
Also published as: JP6605063B2; BR112012025206B1; US10412385B2; CA2795425A1; JP2020017972A; KR101389163B1; KR101817481B1; JP2020017970A; RU2627101C2; TWI601415B; JP2015029348A; CN102934438A; RU2663374C1; US10554970B2; TWI688267B; EP2557792A1; RU2014116111A; SG184528A1; CN107046644B; EP3101897A1

Abstract

An encoding controlling unit 3 selects the one transformation block size which provides an optimal degree of encoding efficiency from a set of transformation block sizes determined in accordance with an encoding mode 7, and includes the selected transformation block size in optimal compression parameters 20a to notify a transformation/quantization unit 19 of that size. The transformation/quantization unit 19 divides an optimal prediction differential signal 13a into blocks having the transformation block size included in the optimal compression parameters 20a, and carries out a transformation and quantization process on each of the blocks to generate compressed data 21.
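
The selection step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the device's actual procedure: the table `CANDIDATE_SIZES`, the mode names, and the cost function `toy_cost` are all hypothetical stand-ins, and a real encoder would evaluate the actual transformed and quantized data for each candidate size.

```python
# Hypothetical table: candidate transformation block sizes per encoding mode.
# The mode names and size sets are assumptions made for this sketch.
CANDIDATE_SIZES = {
    "inter_16x16": [16, 8, 4],
    "inter_8x8": [8, 4],
    "intra_4x4": [4],
}


def split_blocks(signal, size):
    """Yield size x size blocks of a square 2-D list in raster order."""
    n = len(signal)
    for y in range(0, n, size):
        for x in range(0, n, size):
            yield [row[x:x + size] for row in signal[y:y + size]]


def toy_cost(block, overhead=8.0):
    """Stand-in cost (assumption): deviation from the block mean plus a
    fixed per-block overhead. It is not a real rate-distortion measure."""
    values = [v for row in block for v in row]
    mean = sum(values) / len(values)
    return sum(abs(v - mean) for v in values) + overhead


def select_transform_size(residual, mode, cost_fn=toy_cost):
    """Try every candidate size allowed for `mode` and keep the cheapest."""
    best_size, best_cost = None, float("inf")
    for size in CANDIDATE_SIZES[mode]:
        cost = sum(cost_fn(b) for b in split_blocks(residual, size))
        if cost < best_cost:
            best_size, best_cost = size, cost
    return best_size


flat = [[0] * 16 for _ in range(16)]
tiles = [[100 * ((y // 4 + x // 4) % 2) for x in range(16)] for y in range(16)]
print(select_transform_size(flat, "inter_16x16"))   # 16
print(select_transform_size(tiles, "inter_16x16"))  # 4
```

With this toy cost, a flat prediction differential signal favors the largest block size because fewer blocks carry less overhead, while a signal that changes every 4 pixels favors the smallest size.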

Description

FIELD OF THE INVENTION

The present invention relates to a moving image encoding device which divides a moving image into predetermined areas and encodes the moving image in units of one area, and a moving image decoding device which decodes an encoded moving image in units of one predetermined area.

BACKGROUND OF THE INVENTION

Conventional international standard video encoding systems, such as MPEG and ITU-T H.26x, define block data (referred to as a “macroblock” from here on) as a unit, each macroblock combining 16×16 pixels of the brightness signal with the corresponding 8×8 pixels of the color difference signal, and compress each frame of a video signal in units of macroblocks by using a motion compensation technique and an orthogonal transformation/transform coefficient quantization technique.
The motion compensation technique reduces the redundancy of a signal in the time direction for each macroblock by exploiting the high correlation between video frames. A previously encoded frame is stored in a memory as a reference image. A predetermined search range in the reference image is then searched for the block area which provides the smallest difference in signal power relative to the current macroblock, which is the target block for the motion-compensated prediction. The spatial displacement between the position of the current macroblock and the position of the block area found by the search is then encoded as a motion vector.
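
The search described above can be sketched as an exhaustive block-matching loop. This is a minimal sketch, assuming that the "difference in power" is measured as a sum of squared differences; the function names and the full-search strategy are illustrative choices, and practical encoders often use other error measures and faster search patterns.

```python
def squared_error(cur, ref, bx, by, dx, dy, size):
    """Sum of squared differences between the current block at (bx, by)
    and the reference block displaced by (dx, dy)."""
    total = 0
    for y in range(size):
        for x in range(size):
            d = cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x]
            total += d * d
    return total


def full_search(cur, ref, bx, by, size, search_range):
    """Return ((dx, dy), error) for the best match inside the search range."""
    height, width = len(ref), len(ref[0])
    best = None  # (error, dx, dy)
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            x0, y0 = bx + dx, by + dy
            # Skip candidates that fall outside the reference image.
            if x0 < 0 or y0 < 0 or x0 + size > width or y0 + size > height:
                continue
            err = squared_error(cur, ref, bx, by, dx, dy, size)
            if best is None or err < best[0]:
                best = (err, dx, dy)
    return (best[1], best[2]), best[0]


# Synthetic reference image and a current frame whose content sits
# 2 pixels to the right and 1 pixel down in the reference.
ref = [[x * x + 31 * y for x in range(16)] for y in range(16)]
cur = [[ref[min(y + 1, 15)][min(x + 2, 15)] for x in range(16)] for y in range(16)]
print(full_search(cur, ref, bx=4, by=4, size=4, search_range=3))  # ((2, 1), 0)
```

Only the displacement `(dx, dy)` and the remaining prediction error need to be encoded, which is why a good match reduces the amount of information.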
Further, in accordance with the orthogonal transformation/transform coefficient quantization technique, a differential signal, which is acquired by subtracting the prediction signal resulting from the above-mentioned motion-compensated prediction from the current macroblock, is orthogonally transformed and quantized so that the amount of information is compressed.
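
The subtraction, orthogonal transformation, and quantization steps can be illustrated with a small example. This is a sketch under assumptions: it uses a floating-point orthonormal DCT-II and a single uniform quantization step `qstep`, both chosen for clarity, whereas the standards define their own transforms and quantizer designs.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n matrix (rows are basis vectors)."""
    return [
        [math.sqrt((1 if k == 0 else 2) / n)
         * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
         for i in range(n)]
        for k in range(n)
    ]


def matmul(a, b):
    """Plain matrix product of two square 2-D lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def transpose(m):
    return [list(col) for col in zip(*m)]


def transform_and_quantize(block, prediction, qstep):
    """Subtract the prediction, apply a 2-D DCT, and quantize uniformly."""
    n = len(block)
    residual = [[block[y][x] - prediction[y][x] for x in range(n)]
                for y in range(n)]
    c = dct_matrix(n)
    coeffs = matmul(matmul(c, residual), transpose(c))
    return [[round(v / qstep) for v in row] for row in coeffs]


block = [[110] * 4 for _ in range(4)]
prediction = [[100] * 4 for _ in range(4)]
levels = transform_and_quantize(block, prediction, qstep=8)
print(levels[0][0])  # 5: a flat residual of 10 collapses into one DC level
```

A well-predicted block leaves a small, smooth differential signal, so most quantized coefficients become zero and only a few values remain to be encoded.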
In the case of MPEG-4 Visual, each block which is used as a unit for motion-compensated prediction has a minimum size of 8×8 pixels, and a DCT (discrete cosine transform) having an 8×8 pixel size is also used for the orthogonal transformation. In contrast, in the case of MPEG-4 AVC (Moving Picture Experts Group-4 Advanced Video Coding; ITU-T H.264), motion-compensated prediction with block sizes smaller than 8×8 pixels is provided in order to encode efficiently even an area, such as a boundary between objects, which has a small correlation between pixels in the spatial direction. Further, in the orthogonal transformation, compression and encoding can be carried out by adaptively switching between an 8×8-pixel DCT and a 4×4-pixel DCT, each having integer accuracy, on a per-macroblock basis.
In such a conventional international standard video encoding method, the macroblock size is fixed. Particularly as the resolution of the image becomes higher, the area covered by each macroblock therefore tends to become localized relative to the image. As a result, neighboring macroblocks often end up in the same encoding mode or are allocated the same motion vector. In such cases, the overhead of the encoding mode information, motion vector information, and so on increases because this information is encoded even though it does not improve the prediction efficiency, and the encoding efficiency of the entire encoder is reduced.
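
A short calculation makes the overhead argument concrete. The resolutions and the 32×32 macroblock size below are illustrative choices that do not come from the text above; the sketch only counts how many macroblocks, and hence how many sets of encoding mode and motion vector information, a frame requires.

```python
def macroblock_count(width, height, mb_size=16):
    """Number of macroblocks needed to cover a frame (partial blocks count)."""
    cols = -(-width // mb_size)   # ceiling division
    rows = -(-height // mb_size)
    return cols * rows


print(macroblock_count(352, 288))        # 396
print(macroblock_count(1920, 1080))      # 8160
print(macroblock_count(1920, 1080, 32))  # 2040
```

With a fixed 16×16 macroblock, the higher-resolution frame in this example needs roughly twenty times as many macroblocks as the lower-resolution one, and each macroblock carries its own mode and motion information even when its neighbors are identical. Enlarging the macroblock reduces the count by about a factor of four.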
To solve such a problem, a device which switches between macroblock sizes in accordance with the resolution or the contents of an image is disclosed (for example, refer to patent reference 1). The moving image encoding device disclosed by patent reference 1 can carry out compression and encoding by switching between selectable orthogonal transformation block sizes or between selectable sets of orthogonal transformation block sizes in accordance with the macroblock size.

Claims

1.-3. (canceled)

4. A moving image encoding device comprising:

a block dividing unit for dividing an inputted image into macroblock images of two or more blocks each having a predetermined size and dividing each of the macroblock images into a block image of one or more blocks in accordance with an encoding mode to output the block image;

an intra-prediction unit for, when said block image is inputted thereto, carrying out an intra-frame prediction on said block image by using an image signal in a frame to generate a prediction image;

a motion-compensated prediction unit for, when said block image is inputted thereto, carrying out a motion-compensated prediction on said block image by using one or more frames of reference images to generate a prediction image;

a transformation/quantization unit for carrying out a transformation and quantization process on a prediction difference signal which is generated by subtracting the prediction image outputted from either one of said intra-prediction unit and said motion-compensated prediction unit from said block image outputted from said block dividing unit to generate compressed data;

a variable length encoding unit for entropy-encoding said compressed data to multiplex said compressed data entropy-encoded thereby into a bitstream; and

an encoding controlling unit for selecting a certain transformation block size from a set of transformation block sizes predetermined in accordance with a block size of said block image to notify said transformation block size selected thereby to said transformation/quantization unit, wherein

said transformation/quantization unit divides said prediction difference signal into blocks each having said transformation block size notified thereto from said encoding controlling unit, and carries out a transformation and quantization process on each of the blocks to generate compressed data.

5. The moving image encoding device according to claim 4, wherein said encoding controlling unit notifies a certain transformation block size among a set of transformation block sizes fixed for each encoding mode of said block image to said transformation/quantization unit, and said variable length encoding unit entropy-encodes information indicating said transformation block size, and multiplexes said information into the bitstream.

6. The moving image encoding device according to claim 5, wherein said moving image encoding device includes a switching unit for inputting said block image outputted from said block dividing unit to either one of said intra-prediction unit and said motion-compensated prediction unit according to an encoding mode of said block image, and a subtraction unit for subtracting the prediction image outputted from either one of said intra-prediction unit and said motion-compensated prediction unit from said block image outputted from said block dividing unit to generate the prediction difference signal.

7. The moving image encoding device according to claim 5, wherein said encoding controlling unit notifies said transformation/quantization unit of each of one or more transformation block sizes included in the set of transformation block sizes, acquires compressed data associated with said each of one or more transformation block sizes to evaluate a degree of encoding efficiency, and selects one transformation block size from said set of transformation block sizes on a basis of results of the evaluation, said transformation/quantization unit divides the prediction difference signal into blocks for each of cases in which said blocks have a size equal to each of the one or more transformation block sizes, which are included in the set of transformation block sizes notified thereto from said encoding controlling unit, and for a case in which said blocks have said transformation block size selected from said set, and carries out a transformation and quantization process on each of the blocks to generate compressed data in each of said cases, and said variable length encoding unit entropy-encodes information specifying said transformation block size selected from said set and the compressed data associated with said selected transformation block size in units of one block of the block image to multiplex said information and said compressed data entropy-encoded thereby into the bitstream.

8. A moving image decoding device comprising:

a variable length decoding unit for receiving a bitstream inputted thereto and compression-encoded in units of each of macroblocks having a predetermined size into which an image is divided and then entropy-decoding an encoding mode in units of one of said macroblocks from said bitstream, and for entropy-decoding prediction parameters, information indicating a transformation block size, and compressed data in units of one of the macroblocks into which the image is divided in accordance with said decoded encoding mode;

an intra-prediction unit for, when said prediction parameters are inputted thereto, generating a prediction image by using an intra prediction mode and a decoded image signal in a frame which are included in the prediction parameters;

a motion-compensated prediction unit for, when said prediction parameters are inputted thereto, carrying out a motion-compensated prediction by using a motion vector included in the prediction parameters and a reference image specified by a reference image index included in the prediction parameters to generate a prediction image;

an inverse quantization/inverse transformation unit for carrying out an inverse quantization and inverse transformation process on said compressed data by using said information indicating the transformation block size to generate a decoded prediction difference signal; and

an adding unit for adding the prediction image outputted from either one of said intra-prediction unit and said motion-compensated prediction unit to said decoded prediction difference signal to output a decoded image signal, wherein

said inverse quantization/inverse transformation unit determines a transformation block size on a basis of said decoded information indicating the transformation block size, and carries out an inverse quantization and inverse transformation process on said compressed data in units of one block having said transformation block size.

9. The moving image decoding device according to claim 8, wherein said information indicating the transformation block size is identification information for identifying the transformation block size among a set of transformation block sizes fixed for each encoding mode of said block image.

10. The moving image decoding device according to claim 9, wherein said moving image decoding device includes a switching unit for inputting the prediction parameters decoded by said variable length decoding unit to either one of said intra-prediction unit and said motion-compensated prediction unit according to said decoded encoding mode.
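
The decoding path recited in claims 8 and 9 can be sketched as follows. This is an illustration under assumptions rather than the claimed implementation: the table `SIZE_SETS`, the mode names, the index-based identification information, the floating-point inverse DCT, and the uniform quantization step `qstep` are all hypothetical choices made for the example.

```python
import math

# Hypothetical per-mode sets of transformation block sizes (assumption).
SIZE_SETS = {"inter_16x16": [16, 8, 4], "inter_8x8": [8, 4]}


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n matrix (rows are basis vectors)."""
    return [
        [math.sqrt((1 if k == 0 else 2) / n)
         * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
         for i in range(n)]
        for k in range(n)
    ]


def inverse_block(levels, qstep):
    """Inverse quantization followed by a 2-D inverse DCT on one block."""
    n = len(levels)
    c = dct_matrix(n)
    coeffs = [[v * qstep for v in row] for row in levels]
    # residual = C^T * coeffs * C, the inverse of coeffs = C * residual * C^T
    tmp = [[sum(c[k][i] * coeffs[k][j] for k in range(n)) for j in range(n)]
           for i in range(n)]
    return [[sum(tmp[i][k] * c[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def decode_macroblock(levels, prediction, mode, size_index, qstep):
    """Rebuild a macroblock in units of the signalled transformation block size."""
    size = SIZE_SETS[mode][size_index]  # identification info -> block size
    n = len(levels)
    decoded = [row[:] for row in prediction]
    for by in range(0, n, size):
        for bx in range(0, n, size):
            block = [row[bx:bx + size] for row in levels[by:by + size]]
            residual = inverse_block(block, qstep)
            for y in range(size):
                for x in range(size):
                    # Adding unit: prediction + decoded difference signal.
                    decoded[by + y][bx + x] += round(residual[y][x])
    return decoded


levels = [[0] * 8 for _ in range(8)]
for y0 in (0, 4):
    for x0 in (0, 4):
        levels[y0][x0] = 5  # one DC level per 4x4 block
prediction = [[100] * 8 for _ in range(8)]
decoded = decode_macroblock(levels, prediction, "inter_8x8", size_index=1, qstep=8)
print(decoded[0][0])  # 110
```

Because the set of sizes is fixed for each encoding mode, the bitstream only needs to carry a small index identifying one size within the set, which the decoder resolves after decoding the encoding mode.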