WO2020019316A1

WO2020019316A1 - Intra-frame prediction mode searching device, video encoding method and device, and recording medium

Info

Publication number: WO2020019316A1
Application number: PCT/CN2018/097528
Authority: WO
Inventors: 缪泽翔; 郑萧桢; 李蔚然
Original assignee: 深圳市大疆创新科技有限公司
Priority date: 2018-07-27
Filing date: 2018-07-27
Publication date: 2020-01-30
Also published as: CN110419214A

Abstract

An intra-frame prediction mode searching device, comprising: step 1, on the basis of X object prediction modes, generating X first prediction signals of the current encoding block under the X object prediction modes by using an original pixel of a neighboring block at the periphery of the current encoding block, and calculating a residual between the X first prediction signals and an original pixel value of the encoding block as X first residuals; step 2, calculating a first cost of the X object prediction modes according to the first residual by means of a first cost calculation method, and using Y object prediction modes at one side having the smallest first cost in the object prediction mode as an intermediate prediction mode; step 3, on the basis of the intermediate prediction mode, generating a second prediction signal of the current encoding block under the intermediate prediction mode by using a reconstruction pixel of the neighboring block, and calculating a residue between the second prediction signal and the original pixel value of the encoding block as a second residue; and step 4, calculating a second cost of the intermediate prediction mode according to the second residue by means of a second cost calculation method, and using a prediction mode having the smallest second cost in the intermediate prediction mode as a searched final prediction mode, wherein Y is a natural number less than X.

Description

Method and device for searching intra prediction mode, method and device for video encoding, and recording medium

Technical field

Embodiments of the present invention relate to the field of video coding, and in particular, to a method and a device for searching an intra prediction mode, a video coding method and device, and a recording medium.

Background technique

Visual information is one of the most important sources for humans to obtain external information, but the uncompressed raw video data collected by the camera takes up a huge storage space with a resolution of 1920 × 1080, a video format of yuv420 (8bit), and a frame rate As an example, a video of 30 frames per second and 10 minutes in length is used to store the video without compression, which requires 52.14 Gbytes of storage space. In order to reduce the bandwidth occupied by video storage and transmission, the video data needs to be encoded and compressed.

However, in the encoding and compression processing of the prior art, when the calculation is performed, the coded blocks of each size depend on the reconstruction values of neighboring pixels on the left and above to generate prediction values. The prediction value of the image boundary (outside of the coding block) must wait for the coding blocks on the left and above of it to be encoded before it can be performed, so there is a strong dependency that may cause coding delay.

The problem caused by this delay is even more significant when coding compression is performed by the chip. Specifically, when a chip implements an algorithm, the algorithm steps are usually divided into several pipeline stages, and each pipeline stage processes different data in parallel. This approach can improve processing speed and resource utilization.

However, since the intra prediction algorithm of the high-efficiency video coding standard calculates the prediction mode of each size coded block, as described above, it depends on the reconstruction values of adjacent pixels on the left and above to generate prediction values. The prediction value of each coding block (except the coding block located at the boundary of the image) must wait for the coding blocks on the left and above of it to be encoded first. Because the coding block has a strong dependence on the reconstructed pixels of neighboring coding blocks, the chip cannot implement parallelization within the intra prediction algorithm when implementing the algorithm, resulting in insufficient chip processing speed and efficiency.

Summary of the Invention

The present invention is made in response to the above problems, and provides a video encoding method, a video encoding device, and a recording medium, which effectively reduce prediction value calculation resources and time overhead and reduce data dependence on neighboring encoding blocks. This is because in the case of encoding by a chip, a more prominent technical effect can be obtained.

A method for searching an intra prediction mode according to a first aspect of the present invention includes:

In the first step, based on the X object prediction modes, the original pixels of neighboring blocks around the current encoding block are used to generate X first prediction signals of the current encoding block in the X object prediction mode, and obtain all The residuals of the X first prediction signals and the original image values of the coding block are taken as the X first residuals;

In a second step, the first cost of the X object prediction modes is calculated by the first cost calculation method according to the first residual, and the Y objects on the least side of the first cost in the object prediction mode are calculated. Prediction mode as an intermediate prediction mode;

In a third step, based on the intermediate prediction mode, a reconstructed pixel of the neighboring block is used to generate a second prediction signal of the current coding block in the intermediate prediction mode, and the second prediction signal and the obtained prediction signal are obtained. State the residual of the original image value of the current coding block as the second residual; and,

In a fourth step, the second cost of the intermediate prediction mode is calculated by the second cost calculation method according to the second residual, and the prediction mode with the smallest second cost in the intermediate prediction mode is used as the searched final Prediction model,

Here, Y is a natural number smaller than X.

A video encoding method according to a second aspect of the present invention includes:

The video is encoded using the final prediction mode searched out by the aforementioned intra prediction mode search method.

The intra prediction mode search apparatus according to the third aspect of the present invention is configured to perform the foregoing intra prediction mode search method, wherein:

Including: N-2 pipeline stage, N-1 pipeline stage and N pipeline stage,

The first step and the second step are performed by the N-2 pipeline stage,

The third step is performed by the N-1 pipeline stage,

The fourth step is performed by the N pipeline stage.

An apparatus for searching an intra prediction mode according to a fourth aspect of the present invention includes: a memory and a processor, where:

The memory is used to store program instructions;

The processor calls the program instruction, and when the program instruction is executed, is used to perform the following operations:

In a second step, the first cost of the X object prediction modes is calculated by the first cost calculation method according to the first residual, and the Y object predictions in which the first cost in the object prediction mode is the smallest one are predicted. Mode as an intermediate prediction mode;

In a third step, based on the intermediate prediction mode, a reconstructed pixel of the neighboring block is used to generate a second prediction signal of the current coding block in the intermediate prediction mode, and the second prediction signal and the obtained prediction signal are obtained. State the residual of the original image value of the current coding block as the second residual; and

In a fourth step, a second cost of the intermediate prediction mode is calculated by a second cost calculation method according to the second residual, and the prediction mode with the smallest second cost in the intermediate prediction mode is used as the searched final result. Prediction model,

Here, Y is a natural number smaller than X.

A video encoding device according to a fifth aspect of the present invention includes the above-mentioned intra prediction mode search device.

A recording medium according to a sixth aspect of the present invention stores a program that causes a computer to execute the intra prediction mode search method as described above.

A recording medium according to a seventh aspect of the present invention stores a program for causing a computer to execute the video encoding method as described above.

According to the solution of an embodiment of the present invention, by pre-analyzing the number of intra prediction modes that need to be searched when the texture gradient information of the current coding block is reduced, the prediction value calculation resource and time overhead are reduced, and the original pixel information and Reconstructing pixel information reduces dependence on neighboring coded blocks.

For example, in the case of 35 prediction modes, at least 35 prediction value calculations are required for each coding block, which is reduced to only 16 prediction value calculations for each coding block, and by using adjacent in the first stage The original pixel values of the pixels are calculated for prediction, eliminating dependencies.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to explain the technical solutions in the embodiments of the present invention more clearly, the drawings used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are some embodiments of the present invention. Those skilled in the art can also obtain other drawings according to these drawings without paying creative labor.

Figure 1 shows the current popular hybrid coding framework structure.

Fig. 2 shows that the intra prediction mode corresponds to 35 prediction modes.

FIG. 3 is a schematic diagram showing an example of division of a 64 × 64 encoding block.

FIG. 4 is a flowchart showing an embodiment of an intra prediction mode search method according to the present invention.

FIG. 5 is a frame flowchart illustrating a specific example of the intra prediction mode search method of the present invention.

FIG. 6 is a flowchart illustrating a specific example of the intra prediction mode search method according to the present invention.

FIG. 7 is a block diagram showing a configuration of an intra prediction mode search device 70 according to the present invention.

FIG. 8 is a diagram for explaining an example of a coding block for pipeline processing performed by the intra prediction mode search device 70.

FIG. 9 is a timing chart for explaining an example of pipeline processing performed by the intra prediction mode search device 70 based on the framework flowchart shown in FIG. 5.

detailed description

The technical solutions in the embodiments of the present invention will be clearly described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

It should be noted that when a component is called "fixed to" another component, it may be directly on another component or a centered component may exist. When a component is considered to be "connected" to another component, it can be directly connected to another component or a centered component may exist at the same time.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terminology used herein in the description of the invention is for the purpose of describing particular embodiments only and is not intended to limit the invention. The term "and / or" as used herein includes any and all combinations of one or more of the associated listed items.

The video encoding is briefly described below. Generally, the video encoding process includes the steps of prediction, transform, quantization, and entropy encoding. Figure 1 shows the current popular hybrid encoding framework structure. The prediction includes two types of intra prediction and inter prediction, and the purpose is to remove redundant information of the current image block to be encoded by using prediction block information.

Intra prediction uses the information of the frame image to obtain prediction block data, and the process includes dividing the image block to be encoded into several sub-image blocks; then, for each sub-image block, using neighboring pixels around the image block to generate a prediction of the current image block Pixel blocks.

Inter prediction uses the information of the reference frame to obtain prediction block data, and the process includes dividing the image block to be encoded into several sub-image blocks; then, for each sub-image block, searching in the reference image for an image that best matches the current sub-image block The block is used as a prediction block.

An image frame that uses only the intra prediction mode in encoding is called an I frame, and an image frame that uses both intra prediction and inter prediction is called a P or B frame. After obtaining a predicted pixel block by using intra prediction or inter prediction, the corresponding pixel values of the sub-image block and the predicted block are subtracted to obtain a residual.

The residual can use the transformation matrix to remove the correlation of the residual of the image block, that is, remove the redundant information of the image block in order to improve the coding efficiency. The transformation of the data block in the image block usually uses two-dimensional transformation, that is, the data is encoded at the encoding end. The block residual information is multiplied with an NxN transform matrix and its transpose matrix, and the transform coefficients are obtained after multiplication. Transform coefficients can be quantized by quantizing the quantization parameters. Finally, the quantized coefficients are entropy coded. Finally, the bit stream obtained by entropy coding and the coding mode information after coding, such as intra prediction mode, motion vector information, etc. Store or send to the decoder.

At the decoding side of the image, the entropy-coded bitstream is first obtained and then the entropy decoding is performed to obtain the corresponding residuals. According to the predicted image block corresponding to the information image block such as the motion vector or intra prediction obtained by the decoding, according to the predicted image block and the image block Residual to get the reconstructed value of each pixel in the current sub-image block.

Figure 2 shows the 35 prediction modes corresponding to the intra prediction mode: including 33 directional modes and DC and Planar modes. When intra prediction is performed for each image block, the neighboring pixel information around the block is used to calculate 35 candidate prediction pixel blocks according to the calculation formula corresponding to the 35 prediction modes, and then from the 35 prediction modes according to the optimization principle Choose the best prediction mode.

As an example of video encoding, the basic encoding unit is a 64x64 encoding block. In the intra prediction encoding mode, it may be further divided into one 64x64 encoding block or four 32x32 encoding blocks, or 16 16x16 encoding blocks, or 64 8x8 coded blocks, or 256 4x4 coded blocks or several blocks of different sizes are combined.

FIG. 3 is a schematic diagram showing an example of division of a 64 × 64 encoding block. The encoding unit with a size of 64x64 on the left is finally divided into one 32x32 encoding block, ten 16x16 encoding blocks, seven 8x8 encoding blocks, and four 4x4 encoding blocks after encoding. In order to obtain the optimal partition structure, it is necessary to first generate the result of predicting pixel blocks for all 1 + 4 + 16 + 64 + 256 = 341 possible prediction blocks in actual coding, and then determine the optimal structure in order. Since each possible prediction block corresponds to 35 possible prediction modes, 35x341 = 11935 predictions need to be calculated when encoding a 64x64 encoded block, which consumes a huge amount of hardware resources and time.

As an embodiment of the present invention, a method for searching an intra prediction mode is proposed, which is particularly suitable for encoding and compression processing by a chip. These include:

Pipeline N-2:

Based on X object prediction modes, using the original pixels of neighboring blocks around the current encoding block to generate X first prediction signals of the current encoding block in the X object prediction modes, and obtaining the X number 1 the residual of the prediction signal and the original image value of the coding block is taken as the first residual of the X object prediction modes;

Calculate the first costs of the X object prediction modes through the first cost calculation method according to the first residual, and use the Y with the least one example of the first cost in the object prediction mode as the intermediate prediction mode;

Pipeline N-1:

Further performing comprehensive processing on the Y intermediate prediction modes and a final prediction mode of at least one neighboring block of the current coding block to adjust the intermediate prediction modes to Z,

Based on the intermediate prediction mode, using the reconstructed pixels of the neighboring block to generate a second prediction signal of the current encoding block in the intermediate prediction mode, and obtain the second prediction signal and the current encoding block The residual of the original image value is taken as the second residual;

Pipeline stage N: Calculate a second cost of the intermediate prediction mode by a second cost calculation method according to the second residual, and use the prediction mode with the smallest second cost in the intermediate prediction mode as the searched final Forecasting mode.

Among them, X, Y, and Z are natural numbers that satisfy X> Y and X> Z. In addition, it is preferable that Y ≧ Z, but it is not limited to this.

Hereinafter, some embodiments of the present invention will be described in detail with reference to the drawings. In the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.

[Step S401]

Based on X object prediction modes, using the original pixels of neighboring blocks around the current encoding block to generate X first prediction signals of the current encoding block in the X object prediction modes, and obtaining the X number The residuals of the 1 prediction signal and the original image values of the coding block are taken as X first residuals.

The object prediction mode refers to a prediction mode that is a search target of the intra prediction mode search method of the present invention. For example, as shown in Figure 2, X is 35 prediction modes, including 33 directional modes and DC and Planar modes. However, it is not limited to this, and may be appropriately determined in accordance with the coding standard and the like to be applied.

Optionally, all prediction modes in the coding standard may not be used as target prediction modes. For example, all prediction modes (for example, 35 prediction modes) in intra prediction may be divided into at least two groups, and one group is selected as the prediction mode. Object prediction mode.

As a basis for grouping, the grouping may be performed according to at least one of an attribute of an object prediction mode, such as a direction or a category, of the object prediction mode. The number of prediction modes in each of the groups may be the same or different. The prediction modes in each group may or may not include the prediction modes in other groups. Taking the 35 prediction modes shown in FIG. 2 as an example, all the 33 direction modes can be divided into a plurality of groups such as 2, 3, 4, 5, 6, 7, and the like in a manner of adjacent numbers. Take three groups as an example, as follows:

Group 1: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12

Group 2: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23

Group 3: 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34

In addition, the groups may be grouped in such a manner that the direction pattern numbers in the groups are not continuous. Take three groups as an example, as follows:

Group 1: 2, 5, 8, 11, 14, 17, 20, 23, 26, 29, 32

Group 2: 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33

Group 3: 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34

In a case where the prediction mode includes a directional mode, a DC mode (Mode 1), and a Planar mode (Mode 0), each packet may be further included in the DC mode and / or the Planar mode.

Optionally, as a basis for selecting a group, the selection may be performed randomly, or according to various characteristic indexes of the current coding block. For example, one group is selected as the target prediction mode according to the directivity index of the current coding block in each prediction direction.

Optionally, the directionality index is gradient information (Gradient Information) of the current coding block in each prediction direction.

Optionally, the current coding block is divided into a plurality of regions, and gradient information of the plurality of regions in the prediction direction is determined according to a prediction direction in a group, and a set of prediction modes with a minimum gradient information is selected as all The object prediction mode is described. As a specific method of dividing the coding block, for example, the coding block can be divided from left to right, right to left, or from top to bottom, and from bottom to top in four-grid, nine-grid, or the like. Randomly divide the area in such a way that it fills the entire area. The divided areas may be adjacent to each other, separated from each other, or overlapped with each other.

Optionally, generating a prediction signal (a first prediction signal) in a prediction mode and obtaining a residual between the prediction signal and the original pixel, etc., may be performed by an existing algorithm or algorithms specified in various video coding standards.

Optionally, the coding block includes NxN pixel blocks, N is a natural number, and the original pixels of neighboring blocks around the current coding block are used to generate a first number of the current coding block in the X object prediction mode. The prediction signal includes generating a prediction signal of a current coding block from reference pixels of a surrounding pixel block of the coding block. Here, the pixel blocks around the coding block may be pixel blocks adjacent to each pixel block in the coding block, or pixel blocks smaller than a predetermined distance, and may be set as required.

[Step S402]

The first cost of the X object prediction modes is calculated by the first cost calculation method according to the first residual, and the Y object prediction mode with the least one example of the first cost in the object prediction mode is used as the intermediate prediction. mode. Among them, X and Y are natural numbers and satisfy X> Y.

Here, as the first cost calculation method, any existing method capable of calculating the cost of the prediction mode may be used, and may be performed by an existing algorithm or algorithms specified in various video coding standards. As an example, the sum of squared or absolute sum of residuals can be used as the cost of the prediction mode. As another example, the cost of the prediction mode is calculated based on the transform coefficients obtained by performing a two-dimensional matrix transformation on the residual and the number of bits of the prediction mode. As an example of a two-dimensional matrix transformation, a discrete sine / cosine transformation or a Hadamard transformation may be used.

Optionally, according to the first cost calculated by the first cost calculation method, Y are selected from the X object prediction modes as the intermediate prediction mode, where Y may be any natural number less than X. For example, when X is 35, 2, 3, 4, 5, 6, 10, 15, 20, 25, 30, etc. may be taken. Here, the value of Y can be a preset value, a value obtained according to factors such as accuracy and / or efficiency, or a value obtained according to hardware conditions and the like.

[Step S403]

Based on the intermediate prediction mode, using the reconstructed pixels of the neighboring block to generate a second prediction signal of the current encoding block in the intermediate prediction mode, and obtain the second prediction signal and the current encoding block The residual of the original image value is taken as the second residual.

Here, as with the generation of the first prediction signal, the existing or various video encodings can be used to generate the prediction signal (second prediction signal) in a certain prediction mode and to obtain the residuals between the prediction signal and the original pixel. The algorithm specified in the standard is performed.

In addition, in the above description, it is described that the original pixels of adjacent blocks are used in the generation of the first prediction signal, and the reconstructed pixels of adjacent blocks are used in the generation of the second prediction signal. However, the original pixels and Reconstructed pixels also include pixels obtained by padding. For example, when generating the first prediction signal and / or the second prediction signal, at least a part of pixel blocks in adjacent blocks may be filled (Padding), and the filled pixels are used as the original pixels and the The pixels are reconstructed and used to generate the first prediction signal and / or the second prediction signal. As a method of filling, various methods such as copying adjacent pixel values, interpolation, and inserting 0 values can be used.

As a specific example, the pixel block to be filled is a pixel block at a position where a reconstructed pixel cannot be obtained when the second prediction signal is generated. For example, the neighboring blocks that are close to the corners of the video image and do not actually exist, or because of the coding order of the coded blocks, etc., have not been encoded when the second prediction signal is generated without the phase of reconstructed pixels Adjacent blocks, etc.

[Step S404]

A second cost of the intermediate prediction mode is calculated according to a second cost calculation method according to the second residual, and a prediction mode with the smallest second cost in the intermediate prediction mode is used as a searched final prediction mode.

Here, as the second cost calculation method, any existing method capable of calculating the cost of the prediction mode may also be used, and may be performed by an existing algorithm or algorithms specified in various video coding standards. As an example, the sum of squared or absolute sum of residuals can be used as the cost of the prediction mode. As another example, the cost of the prediction mode is calculated based on the transform coefficients obtained by performing a two-dimensional matrix transformation on the residual and the number of bits of the prediction mode. The second cost calculation method may be the same as or different from the first cost calculation method.

In step S403 described above, the intermediate prediction mode may be further adjusted. Specifically, after using the Y on the least cost side of the target prediction mode as the intermediate prediction mode, the Y intermediate prediction modes may be further compared with at least one of the current coding block. The final prediction modes of neighboring blocks are comprehensively processed to adjust the intermediate prediction modes to Z, where Z is a natural number and satisfies that X is greater than Z. Preferably, the Z intermediate prediction modes are 5 intermediate prediction modes.

The final prediction mode here may be a prediction mode used by at least one neighboring block during inter-coding, and may specifically be the final prediction mode searched by the intra prediction mode search method of the present invention, or may be The final prediction mode obtained by a standard algorithm specified in the video coding standard.

Optionally, the adjacent block is a pixel block in a coding block adjacent to the current coding block, may be a pixel block in a coding block connected to the current coding block, or may be a block that is not related to the current coding block. A block of pixels in which the coded blocks are located next to each other.

Optionally, in the comprehensive processing, at least one prediction mode among the Y intermediate prediction modes and at least one final prediction mode of the neighboring block may be used as the intermediate prediction mode.

Optionally, it is preferable to discard at least a part of the Y intermediate prediction modes that is different from the final prediction mode of the neighboring block, and retain all the final prediction modes of the neighboring block. As one way, the Y intermediate prediction modes that are discarded are those with the first cost. In other words, the reserved intermediate prediction mode is one or more prediction modes that are different from the prediction mode of the neighboring block and that the first cost is the smallest.

In the comprehensive processing, a final prediction mode of at least one of the neighboring blocks in three directions of a left side, an upper side, and an upper left side of the current coding block is used.

In order to make the intra prediction mode search method of the present invention better understandable, the following uses FIG. 5 to describe in detail a specific example using the present invention as an example.

The overall technical framework of this specific example is shown in Figure 5, and is divided into three major phases:

[S501] Intra mode estimation

At this stage, the original pixel values of the current coded block are used to filter out 11 prediction modes from a total of 35 prediction modes, and then the predicted values are generated from the original pixel values of neighboring blocks around the current coded block, and further to the 11 The prediction residuals corresponding to the prediction mode are respectively subjected to a Hadamard transformation to screen out the five prediction modes with the least cost.

[S502] Prediction refinement

At this stage, the three prediction modes (Most Probable Modes) derived from the neighboring information are combined with the five prediction modes obtained in S501 to select five prediction modes, and the reconstructed pixel values of surrounding neighboring blocks are used to regenerate prediction Pixel values.

[S503] Final mode selection

At this stage, the prediction residuals corresponding to the five prediction modes reselected in S502 are respectively subjected to a Hadamard transformation to select the final prediction mode with the least cost.

S501 Intra mode estimation

The intra mode estimation phase of this specific example includes the stages of preliminary mode determination (Rough mode Decision) S5011, first prediction (Predict1) S5012, and first Hadama mode determination (Had mode Decision1) S5013. Be specific.

S5011: Preliminary mode determination

At this stage, by extracting the gradient information of the original pixel value of the current coding block, 11 prediction modes are selected from 35 intra prediction modes (including 33 directional modes and DC and Planar modes). Since no real predicted value is generated in this stage, it will not increase the consumption of computing resources.

First, the 35 intra prediction modes shown in FIG. 2 are divided into 0 °, 45 °, 90 °, and 135 ° for 0 °, 45 °, 90 °, and 135 ° in the prediction direction. The corresponding 4 groups are selected as the target prediction mode according to the directivity index of the current coding block in each prediction direction.

Specifically, according to the prediction direction, it is divided into 4 groups of 0 °, 45 °, 90 °, and 135 °, each group has 11 modes, and each group contains mode 0 (DC mode) and mode 1 (Planar mode), such as Formulas (3-1) to (3-4) are shown.

0 ° = {0,1,6,7,8,9,10,11,12,13,14} (3-1)

45 ° = {0,1,2,3,4,5,30,31,32,33,34} (3-2)

90 ° = {0, 1, 22, 23, 24, 25, 26, 27, 28, 29, 30} (3-3)

135 ° = {0,1,14,15,16,17,18,19,20,21,22} (3-4)

The above-mentioned directionality index is, for example, gradient information (Gradient Information) of the current coding block in each prediction direction. As an example, the current encoding block may be divided into multiple regions, and gradient information of the multiple regions in the prediction direction is determined according to a prediction direction in a group, and a set of prediction modes with a minimum gradient information is selected as all The object prediction mode is described.

For example, first divide the current coding block into sub-blocks. FIG. 6 is a schematic diagram showing an example of a sub-block division method in this specific example. As shown in FIG. 6, the current coding block is divided into five equally-sized sub-blocks a, b, c, d, and e in the order of upper left, upper right, lower left, lower right, and center. Take 5 sub-blocks from the current coding block. Next, the mean values of the sub-blocks a, b, c, d, and e are calculated, respectively. Then use the mean value to obtain the gradient information of the corresponding direction group according to the equations (3-5) to (3-8), and select 11 prediction modes (Index values) in the mode set corresponding to the smallest median value of GI1 to GI4 as the output.

GI1 = | b-a | + | d-c | ---- 0 ° (3-5)

GI2 = | c-e | + | e-b | ---- 45 ° (3-6)

GI3 = | c-a | + | d-b | ---- 90 ° (3-7)

GI4 = | d-e | + | e-a | ---- 135 ° (3-8)

In addition, although the preliminary mode determination stage S5011 is included in this specific example, it is not limited to this, and it may be omitted, and 35 prediction modes may be directly used as input to the first prediction S5012 stage described later.

S5012: Predict1

The role of this stage is to use the original pixels of neighboring blocks to generate 11 sets (corresponding to the 11 prediction modes output in the preliminary mode determination stage) of the prediction signals of the current encoding block. According to the existing video coding standards, the practice of this link should be to use the reconstructed pixels of neighboring blocks to generate 35 sets of prediction signals, which will cause the link to wait for the generation of reconstructed pixels and cause delay. According to the scheme of the present invention, the parallelism of calculation can be greatly improved and the calculation complexity can be reduced.

Taking a coding block containing NxN pixel blocks as an example, a prediction signal of a current coding block is generated from reference pixels of surrounding 4 * N + 1 neighboring blocks during prediction, where N is a natural number. FIG. 7 is a schematic diagram showing an example of the distribution positions of adjacent blocks used in the first prediction when the coding block is 4 × 4, that is, N is 4. In this example, the prediction signal of the current coding block is generated from 4 * N + 1, that is, reference pixels of 17 pixel blocks, on the left, upper left, and upper sides of the coding block.

In the first prediction stage, a prediction signal of a current coding block is generated from reference pixels of 17 neighboring blocks.

Prediction _{i, j} = f _m (NeighbourPixels) i, j = 0 ... N-1, m = 0 ... 34 (3-9)

Equation (3-9) describes a process of generating predicted pixels from reference pixels of neighboring blocks, where f _m , m = 0 ... 34 represents 35 different prediction algorithms corresponding to the optional 35 prediction modes. An algorithm for generating a prediction signal of a current coding block by reference pixels of neighboring neighboring blocks is an algorithm specified in an existing video coding standard, and is not described here.

After obtaining the predicted pixel values corresponding to the 11 prediction modes, they are respectively different from the original pixel values of the current coding block to obtain the residuals. The prediction residuals corresponding to the 11 prediction modes are determined as the subsequent first Hadamard mode determination. Phase input.

S5013: The first Hadama Mode determination (Had ModeDecision1)

In this stage, the 11 sets of residuals output from the first prediction stage S5012 are used as input, and the 11 sets of residuals are respectively subjected to a Hadamard transform. The corresponding prediction mode is calculated by combining the absolute value of the transformation coefficients and the number of bits representing the 11 prediction modes. The price. Table 1 shows an example of the number of bits corresponding to the 35 patterns.

【Table 1】

Index Index	RateRate

模式0Mode 0	1bit 1bit

模式1Mode 1	2bit 2bit

模式26Pattern 26	3bit3bit
其他模式Other modes	5bit5bit

As shown in equation (3-10), λ is a weighting coefficient calculated according to a specified algorithm in the existing video encoding method.

cost _i = Sum (| Hadamard (Resi _i ) |) + λ * Rate (3-10)

After the 11 costs are calculated, the smallest 5 corresponding prediction modes are selected as the output from the 11 costs.

In addition, although the Hadamard transform has been described as an example in this specific example, the present invention is not limited to this, and may be another two-dimensional matrix transform.

S502 Prediction refinement

The prediction improvement phase of this specific example includes a comprehensive processing (Combine) S5021 and a second prediction (Predict2) S5022. Each phase will be described in detail below.

S5021: Comprehensive processing

In this stage, the prediction modes obtained from the neighboring encoding blocks and the five prediction modes obtained during the intra mode estimation stage are comprehensively processed. The prediction modes obtained from the adjacent encoding blocks and the 5 This model is compared and combined, and 5 prediction modes are obtained.

Here, the prediction mode obtained from neighboring coding blocks may be a prediction mode selected by the method of the present invention, or three prediction modes derived from neighboring coding blocks according to a standard algorithm in an existing video coding method. (Most Probable Mode (MPM)).

As an optional example of comprehensive processing, 5 modes can be selected from a total of 8 modes selected from the intra mode estimation stage and 3 prediction modes obtained from adjacent coding blocks: first retain 3 Prediction modes obtained from neighboring coding blocks, and then from the prediction modes selected in the 5 intra-mode estimation stages, 2 prediction modes obtained from neighboring coding blocks that are different from the prediction modes obtained at the least cost are selected , Take a total of 5 prediction modes as the output of the comprehensive processing stage.

In addition, although the comprehensive processing stage S5021 is included in this specific example, it is not limited to this, and it may be omitted, and the five prediction modes output from the S5013 stage may be directly used as input to the second prediction S5022 stage described later.

S5022: Second prediction

The inputs at this stage are the five prediction modes output by S5021, and the reconstructed pixels of neighboring blocks are used to generate the prediction signals of the five groups of currently coded blocks. An algorithm for generating a prediction signal of a current coding block from reconstructed pixels (reference pixels) of neighboring neighboring blocks is a prescribed algorithm in an existing video coding method, and will not be described here.

After obtaining the predicted pixel values corresponding to the five prediction modes, the differences are respectively different from the original pixel values of the current coding block, and the prediction residuals corresponding to the five prediction modes are obtained as the input of the subsequent final mode selection stage.

S503 Final mode selection

The final mode selection stage of this specific example includes the second Hadama Mode Decision 2 (S5031). Each of these stages will be specifically described below.

2nd Hadamard Mode Confirmed

In this stage, the five groups of residuals output from the second prediction stage S5022 are used as input. The five groups of residuals are respectively subjected to a Hadamard transform. The absolute value of the transform coefficients and the number of bits representing the five prediction modes are combined to calculate the corresponding prediction mode. The price. The cost calculation here adopts the same method as the cost calculation in the first Hadamard mode determination phase S5013. In addition, it is not limited to this, and other cost calculation methods different from the first Hadamard mode determination stage S5013 may also be adopted.

After the 5 costs are calculated, a corresponding prediction mode with the lowest cost is selected from the 5 costs as the finally selected 1 prediction mode. When video coding is performed, the residual output is entered into subsequent stages such as transform quantization and entropy coding RDO.

FIG. 7 is a block diagram showing a configuration of an intra prediction mode search device 70 according to the present invention. The intra-prediction mode searching device 70 of the present invention may be, for example, a chip, including: an N-2 pipeline stage 701, an N-1 pipeline stage 702, and an N pipeline stage 703. When such an intra prediction mode search device 70 is used to implement the intra prediction mode search method shown in FIG. 4, for example, steps 401 and 402 are performed by an N-2 pipeline stage, and step 403 is performed by an N-1 pipeline stage. 404 is performed by the N pipeline stage.

In the intra-prediction mode search device 70, the N-2 pipeline stage 701, the N-1 pipeline stage 702, and the N pipeline stage 703 work in parallel, so that the N pipeline stage 703 pairs the encoding blocks processed by the N-1 pipeline stage 702. The processing, the N-1 pipeline stage 702 processing of the coding block processed by the N pipeline stage 701, and the N-2 pipeline stage 701 processing of the current coding block are performed in parallel. This will be specifically described below.

FIG. 8 is a diagram for explaining an example of a coding block for pipeline processing performed by the intra prediction mode search device 70. For convenience of explanation, the same coding block as the 64 × 64 coding block in FIG. 3 is exemplified here. Among them, the upper left four coding blocks are respectively marked with 0, 1, 2, and 3.

In the N-2 pipeline stage 701, the same processing as the intra mode estimation in S501 described above is performed.

That is, in the N-2 pipeline stage 701, the original pixel values of the current encoding block are used to filter out 11 prediction modes from a total of 35 prediction modes, and then the prediction values are generated from the original pixel values of neighboring blocks around the current encoding block. Furthermore, the five prediction modes with the least cost are screened by performing Hadamard transform on the prediction residuals corresponding to the 11 prediction modes.

In the N-1 pipeline stage 702, the same processing as the prediction refinement in S502 described above is performed.

That is, in the N-1 pipeline stage 702, the three prediction modes (Most, Probable Modes) derived from the neighboring information are combined with the 5 prediction modes selected in the N-2 pipeline stage 701 to select 5 prediction modes, and use the surrounding The reconstructed pixel values of the neighboring blocks of the image are used to regenerate predicted pixel values.

In the N pipeline stage 703, the same processing as in the final mode selection in S503 described above is performed.

At this stage, the prediction residuals corresponding to the five prediction modes reselected in the N-1 pipeline stage 702 are respectively subjected to a Hadamard transformation to screen the final prediction mode with the least cost.

Specifically, at the initial time t0, the encoding block 0 is processed by the N-2 pipeline stage 701 (intra mode estimation S501). At this time, the N-1 pipeline stage 702 and the N pipeline stage 703 are idle.

Next, at time t1, the N-2 pipeline stage 701 processes the new coded block 1 (intra mode estimation S501), and the N-1 pipeline stage 702 performs the N-2 pipeline process obtained from the N-2 pipeline stage 701. The encoding block 0 processed by the stage 701 is processed (prediction improvement S502). At this time, the N pipeline stage 703 is idle.

Next, at time t2, the N-2 pipeline stage 701 processes the new coded block 2 (intra-frame mode estimation S501), and the N-1 pipeline stage 702 processes the N-pipeline stage 701 obtained from the N-2 pipeline stage 701. 2 The encoding block 1 processed by the pipeline 701 is processed (prediction improvement S502), and the N-2 pipeline 703 processes the encoding block 0 obtained by the N-1 pipeline 702 and processed by the N-1 pipeline 702 ( The final mode is S503).

Thereafter, each pipeline stage performs the above-mentioned processing of S501 to S503 in parallel until the processing is completed for all the coding blocks.

By adopting a staged intra prediction mode search algorithm, the original pixels are introduced into the intra prediction mode to release the key dependency on the search phase, and then the entire intra prediction algorithm is divided into three pipeline stages to improve the parallelism, as shown in the figure As shown in FIG. 9, starting from time t2, each of the N-2 to N pipeline stages 701 to 703 starts processing the coded blocks in parallel, so that the chip processing speed and efficiency can be improved.

In addition, since the N-2 pipeline stage 702 does not involve any data dependency, as an optional solution of the present invention, it can be split out. For example, all possible coding blocks in FIG. 8 are processed in advance, and the obtained results are stored. Wait for N-1 pipeline stage 701 to take.

In the several embodiments provided by the present invention, it should be understood that the disclosed device and method may be implemented in other ways. For example, the device embodiments described above are only schematic. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, which may be electrical, mechanical or other forms.

The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objective of the solution of this embodiment.

In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit. The above integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.

The above integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium or memory. The software functional unit is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) or a processor to execute the methods described in the embodiments of the present invention. Some steps. The aforementioned storage media include: U disks, mobile hard disks, read-only memory (ROM), random access memory (RAM), magnetic disks or compact discs, and other media that can store program codes .

Those skilled in the art can clearly understand that for the convenience and brevity of the description, only the above-mentioned division of the functional modules is used as an example. In practical applications, the above-mentioned functions can be allocated by different functional modules according to needs, that is, the device The internal structure is divided into different functional modules to complete all or part of the functions described above. For the specific working process of the device described above, reference may be made to the corresponding process in the foregoing method embodiment, and details are not described herein again.

Finally, it should be noted that the above embodiments are only used to illustrate the technical solution of the present invention, but not limited thereto. Although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that: The technical solutions described in the foregoing embodiments can still be modified, or some or all of the technical features can be equivalently replaced; and these modifications or replacements do not depart from the essence of the corresponding technical solutions of the technical solutions of the embodiments of the present invention. range.

Claims

A method for searching an intra prediction mode includes:

In the first step, based on the X object prediction modes, the original pixels of neighboring blocks around the current encoding block are used to generate X first prediction signals of the current encoding block in the X object prediction mode, and obtain all The residuals of the X first prediction signals and the original image values of the coding block are taken as the X first residuals;

In a second step, the first cost of the X object prediction modes is calculated by the first cost calculation method according to the first residual, and the Y objects on the least side of the first cost in the object prediction mode are calculated. Prediction mode as an intermediate prediction mode;

In a third step, based on the intermediate prediction mode, a reconstructed pixel of the neighboring block is used to generate a second prediction signal of the current coding block in the intermediate prediction mode, and the second prediction signal and the obtained prediction signal are obtained. State the residual of the original image value of the current coding block as the second residual; and,

In a fourth step, a second cost of the intermediate prediction mode is calculated by a second cost calculation method according to the second residual, and the prediction mode with the smallest second cost in the intermediate prediction mode is used as the searched final result. Prediction model,

Here, Y is a natural number smaller than X.
The method of searching for an intra prediction mode according to claim 1, wherein:

In the third step, the Y intermediate prediction modes obtained in the second step and the final prediction mode of at least one neighboring block of the current coding block are comprehensively processed to integrate the intermediate prediction mode. Adjusted to Z,

Here, Z is a natural number smaller than X.
The method of searching for an intra prediction mode according to claim 2, wherein:

In the third step, a final prediction mode of at least one of the neighboring blocks in three directions of a left side, an upper side, and an upper left side of the current coding block is used in the comprehensive processing.
The method of searching for an intra prediction mode according to claim 2, wherein:

In the third step, at least one prediction mode among the Y intermediate prediction modes is used as the intermediate prediction mode together with the final prediction mode of the neighboring block in the comprehensive processing.
The intra prediction mode search method according to claim 4, wherein:

At least one prediction mode among the Y intermediate prediction modes is a prediction mode that is different from the final prediction mode of the neighboring block and has the least first cost.
The method of searching for an intra prediction mode according to claim 1, wherein:

The X object prediction modes are 35 object prediction modes, and the Y intermediate prediction modes are 5 intermediate prediction modes.
The method of searching for an intra prediction mode according to claim 2, wherein:

The Z intermediate prediction modes are 5 intermediate prediction modes.
The method of searching for an intra prediction mode according to claim 1, wherein:

In the first step, all prediction modes of intra prediction are divided into at least two groups, and one group is selected as the target prediction mode.
The method for searching an intra prediction mode according to claim 8, wherein:

In the first step, the grouping is performed based on at least one of a direction or a type of the target prediction mode.
The method of searching for an intra prediction mode according to claim 9, wherein:

The number of prediction modes in each said group is the same.
The method for searching an intra prediction mode according to claim 8, wherein:

The prediction mode includes a direction mode, a DC mode, and a Planar mode,

Each packet includes the DC mode and / or the Planar mode.
The method for searching an intra prediction mode according to claim 8, wherein:

In the first step, one set of prediction modes is selected randomly or according to a feature index of a current coding block as the target prediction mode.
The method for searching an intra prediction mode according to claim 8, wherein:

In the first step, the prediction modes are divided into 4 groups corresponding to 0 °, 45 °, 90 °, and 135 ° with respect to 0 °, 45 °, 90 °, and 135 °.

One group is selected as the target prediction mode according to the directivity index of the current coding block in each prediction direction.
The method of searching for an intra prediction mode according to claim 13, wherein:

The directionality index is gradient information of the current coding block in each prediction direction.
The method for searching an intra prediction mode according to claim 14, wherein:

In the first step, the current coding block is divided into a plurality of regions, and gradient information of the plurality of regions in the prediction direction is determined according to a prediction direction in a group, and a group of gradient information having a minimum value is selected. A prediction mode is used as the target prediction mode.
The method of searching for an intra prediction mode according to claim 13, wherein:

In the first step, the current coding block is divided into five sub-blocks a, b, c, d, and e of equal size in the order of upper left, upper right, lower left, lower right, and center, and the sub blocks are calculated respectively The average value of a, b, c, d, e, and then use the average value to find the gradient information at 0 °, 45 °, 90 °, and 135 °.

0 °: GI1 = | b-a | + | d-c |

45 °: GI2 = | c-e | + | e-b |

90 °: GI3 = | c-a | + | d-b |

135 °: GI4 = | d-e | + | e-a |

One set of prediction modes in the prediction direction corresponding to the minimum value among GI1 to GI4 is selected as the target prediction mode.
The method of searching for an intra prediction mode according to claim 1, wherein:

The neighboring block is a pixel block in a coding block adjacent to the current coding block.
The method of searching for an intra prediction mode according to claim 1, wherein:

The original pixels and the reconstructed pixels include pixels obtained by performing padding.
The method of searching for an intra prediction mode according to claim 18, wherein:

The pixel block to be filled is a pixel block at a position where a reconstructed pixel cannot be obtained when the second prediction signal is generated.
The method of searching for an intra prediction mode according to claim 1, wherein:

The coding block includes NxN pixel blocks, where N is a natural number,

The generating a first prediction signal of the current coding block under the X object prediction modes by using the original pixels of neighboring blocks around the current coding block includes: the left side, the upper left side of the coding block And the reference pixels of the 4 * N + 1 pixel blocks on the upper side generate a prediction signal of the current coding block.
The method of searching for an intra prediction mode according to claim 1, wherein:

In the first cost calculation method and / or the second cost calculation method, a sum of squares or absolute values of residuals is used as a cost of the prediction mode.
The method of searching for an intra prediction mode according to claim 1, wherein:

In the first cost calculation method and / or the second cost calculation method, the cost of the prediction mode is calculated according to the transform coefficient and the number of bits of the prediction mode obtained by performing a two-dimensional matrix transformation on the residual.
The intra prediction mode search method according to claim 21, wherein:

The two-dimensional matrix transformation is a discrete sine / cosine transformation.
The intra prediction mode search method according to claim 21, wherein:

The two-dimensional matrix transformation is a Hadamard transformation.
The method of searching for an intra prediction mode according to claim 1, wherein:

The first cost calculation method is the same as or different from the second cost calculation method.
The method of searching for an intra prediction mode according to claim 1, wherein:

The intra prediction mode search method is executed by a chip including an N-2 pipeline stage, an N-1 pipeline stage, and an N pipeline stage,

The first step and the second step are performed by an N-2 pipeline stage,

The third step is performed by an N-1 pipeline stage,

The fourth step is performed by an N pipeline stage.
The method of searching for an intra prediction mode according to claim 26, wherein:

The N-2 pipeline stage performs pre-processing on all coding blocks, and stores the processed results for the N-1 pipeline stage to call.
The method of searching for an intra prediction mode according to claim 26, wherein:

The N-2 pipeline stage, the N-1 pipeline stage, and the N pipeline stage work in parallel, so that the N pipeline stage processes the coding blocks processed by the N-1 pipeline stage, the N The -1 pipeline stage processes the encoding blocks processed by the N pipeline stage in parallel with the N-2 pipeline stage processes the current encoding blocks in parallel.
A video encoding method includes:

The video is encoded using the final prediction mode searched by the intra prediction mode search method according to any one of claims 1 to 28.
An intra prediction mode search device for performing the intra prediction mode search method according to any one of claims 1 to 25, wherein:

Including: N-2 pipeline stage, N-1 pipeline stage and N pipeline stage,

The first step and the second step are performed by the N-2 pipeline stage,

The third step is performed by the N-1 pipeline stage,

The fourth step is performed by the N pipeline stage.
The intra prediction mode search method according to claim 30, wherein:

The N-2 pipeline stage performs pre-processing on all coding blocks, and stores the processed results for the N-1 pipeline stage to call.
The intra prediction mode search method according to claim 30, wherein:

The N-2 pipeline stage, the N-1 pipeline stage, and the N pipeline stage work in parallel, so that the N pipeline stage processes the coding blocks processed by the N-1 pipeline stage, the N The -1 pipeline stage processes the encoding blocks processed by the N pipeline stage in parallel with the N-2 pipeline stage processes the current encoding blocks in parallel.
An intra prediction mode search device includes: a memory and a processor, wherein:

The memory is used to store program instructions;

The processor calls the program instruction, and when the program instruction is executed, is used to perform the following operations:

In the first step, based on the X object prediction modes, the original pixels of neighboring blocks around the current encoding block are used to generate X first prediction signals of the current encoding block in the X object prediction mode, and obtain all The residuals of the X first prediction signals and the original image values of the coding block are taken as the X first residuals;

In a second step, the first cost of the X object prediction modes is calculated by the first cost calculation method according to the first residual, and the Y objects on the least side of the first cost in the object prediction mode are calculated. Prediction mode as an intermediate prediction mode;

In a third step, based on the intermediate prediction mode, a reconstructed pixel of the neighboring block is used to generate a second prediction signal of the current coding block in the intermediate prediction mode, and the second prediction signal and the obtained prediction signal are obtained. State the residual of the original image value of the current coding block as the second residual; and

In a fourth step, a second cost of the intermediate prediction mode is calculated by a second cost calculation method according to the second residual, and the prediction mode with the smallest second cost in the intermediate prediction mode is used as the searched final result. Prediction model,

Here, Y is a natural number smaller than X.
The intra prediction mode search device according to claim 33, wherein:

In the third step, the Y intermediate prediction modes obtained in the second step and the final prediction mode of at least one neighboring block of the current coding block are comprehensively processed to integrate the intermediate prediction mode. Adjusted to Z,

Here, Z is a natural number smaller than X.
The intra prediction mode search device according to claim 34, wherein:

In the third step, a final prediction mode of at least one of the neighboring blocks in three directions of a left side, an upper side, and an upper left side of the current coding block is used in the comprehensive processing.
The intra prediction mode search device according to claim 34, wherein:

In the third step, in the comprehensive processing, at least one prediction mode among the Y intermediate prediction modes is used as the intermediate prediction mode together with a final prediction mode of the neighboring block.
The intra prediction mode search device according to claim 36, wherein:

At least one prediction mode among the Y intermediate prediction modes is a prediction mode that is different from the final prediction mode of the neighboring block and has the least first cost.
The intra prediction mode search device according to claim 33, wherein:

The X object prediction modes are 35 object prediction modes, and the Y intermediate prediction modes are 5 intermediate prediction modes.
The intra prediction mode search device according to claim 34, wherein:

The Z intermediate prediction modes are 5 intermediate prediction modes.
The intra prediction mode search device according to claim 33, wherein:

In the first step, all prediction modes of intra prediction are divided into at least two groups, and one group is selected as the target prediction mode.
The intra prediction mode search device according to claim 40, wherein:

The grouping is performed according to at least one of a direction or a kind of the target prediction mode.
The intra prediction mode search device according to claim 41, wherein:

The number of prediction modes in each said group is the same.
The intra prediction mode search device according to claim 40, wherein:

The prediction mode includes a direction mode, a DC mode, and a Planar mode,

Each packet includes the DC mode and / or the Planar mode.
The intra prediction mode search device according to claim 40, wherein:

In the first step, one set of prediction modes is selected randomly or according to a feature index of a current coding block as the target prediction mode.
The intra prediction mode search device according to claim 40, wherein:

In the first step, the prediction modes are divided into 4 groups corresponding to 0 °, 45 °, 90 °, and 135 ° with respect to 0 °, 45 °, 90 °, and 135 °.

One group is selected as the target prediction mode according to the directivity index of the current coding block in each prediction direction.
The intra prediction mode search device according to claim 45, wherein:

The directionality index is gradient information of the current coding block in each prediction direction.
The intra prediction mode search device according to claim 46, wherein:

In the first step, the current coding block is divided into a plurality of regions, and gradient information of the plurality of regions in the prediction direction is determined according to a prediction direction in a group, and a group of gradient information having a minimum value is selected. A prediction mode is used as the target prediction mode.
The intra prediction mode search device according to claim 45, wherein:

In the first step, the current coding block is divided into five sub-blocks a, b, c, d, and e of equal size in the order of upper left, upper right, lower left, lower right, and center, and the sub blocks are calculated respectively The average value of a, b, c, d, e, and then use the average value to find the gradient information at 0 °, 45 °, 90 °, and 135 °.

0 °: GI1 = | b-a | + | d-c |

45 °: GI2 = | c-e | + | e-b |

90 °: GI3 = | c-a | + | d-b |

135 °: GI4 = | d-e | + | e-a |

One set of prediction modes in the prediction direction corresponding to the minimum value among GI1 to GI4 is selected as the target prediction mode.
The intra prediction mode search device according to claim 33, wherein:

The neighboring block is a pixel block in a coding block adjacent to the current coding block.
The intra prediction mode search device according to claim 33, wherein:

The original pixels and the reconstructed pixels include pixels obtained by performing padding.
The intra prediction mode search device according to claim 50, wherein:

The pixel block to be filled is a pixel block at a position where a reconstructed pixel cannot be obtained when the second prediction signal is generated.
The intra prediction mode search device according to claim 33, wherein:

The coding block includes NxN pixel blocks, where N is a natural number,

The generating a first prediction signal of the current coding block under the X object prediction modes by using the original pixels of neighboring blocks around the current coding block includes: the left side, the upper left side of the coding block And the reference pixels of the 4 * N + 1 pixel blocks on the upper side generate a prediction signal of the current coding block.
The intra prediction mode search device according to claim 33, wherein:

In the first cost calculation method and / or the second cost calculation method, a sum of squares or absolute values of residuals is used as a cost of the prediction mode.
The intra prediction mode search device according to claim 33, wherein:

In the first cost calculation method and / or the second cost calculation method, the cost of the prediction mode is calculated according to the transform coefficient and the number of bits of the prediction mode obtained by performing a two-dimensional matrix transformation on the residual.
The intra prediction mode search device according to claim 53, wherein:

The two-dimensional matrix transformation is a discrete sine / cosine transformation.
The intra prediction mode search device according to claim 53, wherein:

The two-dimensional matrix transformation is a Hadamard transformation.
The intra prediction mode search device according to claim 33, wherein:

The first cost calculation method is the same as or different from the second cost calculation method.
A video encoding device includes the intra prediction mode search device according to any one of claims 30 to 57.
A recording medium storing a program for causing a computer to execute the method for searching an intra prediction mode according to claims 1 to 28.
A recording medium storing a program that causes a computer to execute the video encoding method according to claim 29.