CN113992911A - Intra-frame prediction mode determination method and device for panoramic video H264 coding - Google Patents

Intra-frame prediction mode determination method and device for panoramic video H264 coding Download PDF

Info

Publication number
CN113992911A
CN113992911A CN202111128793.1A CN202111128793A CN113992911A CN 113992911 A CN113992911 A CN 113992911A CN 202111128793 A CN202111128793 A CN 202111128793A CN 113992911 A CN113992911 A CN 113992911A
Authority
CN
China
Prior art keywords
mode
image
coded
coding
calibration area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111128793.1A
Other languages
Chinese (zh)
Inventor
王成成
朱伟
王寿峰
白俊奇
苗锋
邱文嘉
王扬红
朱恒
刘羽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Laisi Electronic Equipment Co ltd
Original Assignee
Nanjing Laisi Electronic Equipment Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Laisi Electronic Equipment Co ltd filed Critical Nanjing Laisi Electronic Equipment Co ltd
Priority to CN202111128793.1A priority Critical patent/CN113992911A/en
Publication of CN113992911A publication Critical patent/CN113992911A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/11Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention provides a method and a device for determining an intra-frame prediction mode of panoramic video H264 coding, belonging to the field of image processing and comprising the following steps: plotting the off-line panoramic image, identifying a designated observation area as a calibration area, and storing a polygon point set of the calibration area; judging whether the macro block to be coded is in the calibration area or not according to the polygon point set; for the macro blocks to be coded in the calibration area, an intra16 × 16 mode is used; for the macro blocks to be coded in the non-calibration area, calculating the smoothness of the image, and determining whether an intra16 × 16 mode or an intra4 × 4 mode is used according to the smoothness; for the macro block which is determined to use the intra4 multiplied by 4 mode, calculating the edge direction by using a Kirsch operator, and determining a coding alternative mode according to the edge direction; and finally, selecting an optimal coding mode under the alternative mode. Experimental results show that the panoramic improvement-based intra-frame prediction mode rapid determination method can greatly reduce the encoding time under the condition of ensuring the video encoding quality of a key attention area.

Description

Intra-frame prediction mode determination method and device for panoramic video H264 coding
Technical Field
The invention relates to the technical field of image processing, in particular to a method and equipment for determining an intra-frame prediction mode of panoramic video H264 coding.
Background
With the development of video encoding and decoding and streaming media technologies, performance indexes such as video image resolution, video real-time performance and the like put higher requirements on image processing algorithms, and airport panoramic video encoding is a typical example of such applications.
The h.264 coding standard proposes an intra-frame prediction method, i.e. the prediction of the current block is realized by using decoded and reconstructed neighboring blocks, and the corresponding subsequent processing is performed on the residual between the predicted block and the actual block. According to "Thomas Wiegand, Heiko Schwarz, Anthony Joch, Faouzi Kossenti, and Gary J. Sullivan Rate-Constrained Coder control and Complex of Video Coding standards.2005", H.264 intra-frame Coding has two cost function models, respectively, a Sum of Absolute Difference (SAD) cost model and a Rate-Distortion Optimization (RDO) model, the RDO model is more accurate for module selection than the SAD model. The RDO model rationale for intra mode selection is: in the optional macroblock type, each available coding mode is traversed, the number of coding bits in each mode and the corresponding distortion degree of the reconstructed image are calculated, and the mode with the minimum distortion degree is selected as the optimal mode. However, the intra-frame prediction technique faces huge computation workload, and a certain gap exists between the coding time and the real-time performance of the video, so that how to optimize the intra-frame prediction technique becomes an important problem.
Disclosure of Invention
The purpose of the invention is as follows: aiming at the defects of the prior art, the invention provides the intra-frame prediction mode determination method for the H264 coding of the panoramic video, which can quickly select the intra-frame prediction mode and improve the video coding speed.
The present invention also provides a computer device and a computer readable storage medium for implementing the method for determining intra prediction mode of H264 coding of a panoramic video.
The technical scheme is as follows: according to a first aspect of the present invention, there is provided an intra prediction mode determination method for panoramic video H264 coding, comprising the steps of:
plotting the off-line panoramic image, identifying a designated observation area as a calibration area, and storing a polygon point set of the calibration area;
carrying out macro block division on an input panoramic video image to be coded, and judging whether a macro block to be coded is in a calibration area or not according to a polygon point set;
for the macro block to be coded in the calibration area, an intra16 × 16 mode is used, and 4 coding alternative modes are corresponding to the macro block to be coded;
for the macro blocks to be coded in the non-calibration area, calculating the smoothness of the image, and determining whether an intra16 × 16 mode or an intra4 × 4 mode is used according to the smoothness; for the macro block which is determined to use the intra4 multiplied by 4 mode, calculating the edge direction by using a Kirsch operator, and determining a coding alternative mode according to the edge direction;
and according to the selected alternative mode, selecting the mode with the minimum cost as the final intra-frame prediction mode by calculating a cost function.
According to certain embodiments of the first aspect, the designated viewing area is a non-motion area in the airport panoramic image.
According to some embodiments of the first aspect, performing macroblock division on an input panoramic video image to be encoded, and determining whether a macroblock to be encoded is in a calibration region according to a polygon point set includes:
dividing an input airport panoramic image according to 16 multiplied by 16 macro blocks, combining a polygon point set, calculating an attribute value of a current macro block to be coded about a calibration area, and when the calculated attribute value is greater than or equal to a specified threshold th1, considering that the macro block is in the calibration area, otherwise, not in the calibration area.
According to some embodiments of the first aspect, the attribute value of the macroblock to be currently encoded with respect to the calibration area is calculated according to the following formula:
Figure BDA0003279765070000021
wherein, SuminThe number of pixels in the macroblock is indicated, and the part indicates the ratio of the macroblock to the target area.
According to some embodiments of the first aspect, for a macroblock to be encoded within a non-nominal region, the intra4 × 4 mode is used when the image smoothness is greater than the set threshold th2, and otherwise the intra16 × 16 mode is used.
According to certain embodiments of the first aspect, the image smoothness calculation formula is:
Figure BDA0003279765070000022
where f (x, y) is the pixel value of the macroblock at (x, y) and m is the average of all pixels of the macroblock.
According to some embodiments of the first aspect, computing an edge direction using the Kirsch operator, determining the coding candidate pattern from the edge direction comprises: the 8 directions in the Kirsch operator are respectively 0 °, 45 °, 90 °, 135 °, 180 °, 225 °, 270 °, 315 °, and correspond to the direction numbers of 1,8,7,6,5,4,3,2, respectively, and compared with the 8 modes except the DC prediction mode in the intra4 × 4 mode, there is no corresponding prediction mode corresponding to the 5,6,7,8 directions, so the directional derivative of each pixel point in the image is calculated according to the templates in the four directions of 1,2,3,4, and then the reciprocal is calculated from the partial derivatives in the 4 directions, the largest template corresponds to the edge direction of the pixel, and the alternative mode is selected according to the corresponding rule.
According to some embodiments of the first aspect, the rule for selecting the alternative mode is as follows:
if the direction of the Kirsch edge is direction 1, selecting the prediction modes 1, 6 and 8 as alternative modes;
if the direction of the Kirsch edge is direction 2, selecting prediction modes 4, 5 and 6 as alternative modes;
if the direction of the Kirsch edge is direction 3, selecting prediction modes 0, 5 and 7 as alternative modes;
if the direction of the Kirsch edge is direction 4, prediction modes 3, 7 are selected as alternative modes.
According to some embodiments of the first aspect, for intra4 × 4 mode macroblocks, a DC mode is added as an alternative mode according to the resulting alternative mode.
According to a second aspect of the present invention, there is provided a computer apparatus, the apparatus comprising:
one or more processors;
a memory; and
one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs when executed by the one or more processors implement the intra prediction mode determination method for panoramic video H264 encoding as described in the first aspect of the present invention.
According to a third aspect of the present invention, there is provided a computer-readable storage medium having stored thereon a computer program executable by one or more processors to implement the intra prediction mode determination method for panoramic video H264 encoding.
Has the advantages that: the invention discloses a method for determining an intra-frame prediction mode of panoramic video H264 coding, which can quickly select an H.264 intra-frame prediction mode of panoramic coding, firstly, an off-line panoramic plotting mode is adopted to plot a panoramic non-key attention area, and macroblocks to be coded in a calibration area can be quickly separated to use an intra16 multiplied by 16 mode; for the macro block to be coded in the non-calibration area, determining a proper intra-frame coding mode by utilizing image smoothness; aiming at the problem of excessive intra-frame brightness intra4 multiplied by 4 mode prediction modes, an improved Kirsch operator is utilized to obtain edge directions, and an intra-frame prediction alternative mode is selected; and finally, determining a final prediction mode according to the cost function. The invention solves the problem of the selection of the prediction mode of the ultrahigh-resolution panoramic video in the H.264 coding frame, reduces the coding time and improves the coding efficiency while ensuring the coding quality of the panoramic core area. The on-site verification and performance test are carried out in a plurality of airport environments, the coding time can be reduced by 38.92%, and the method has wide application prospect.
Drawings
Fig. 1 is a flowchart illustrating an intra prediction mode determination method for H264 coding of a panoramic video according to an embodiment of the present invention;
FIG. 2 is a schematic of an off-line plotting of non-emphasized regions of interest of a panoramic image in accordance with an embodiment of the present invention;
FIG. 3 is a diagram of 8 directions of the Kirsch operator and 8 prediction modes of the intra4 × 4 mode according to an embodiment of the present invention;
fig. 4 is a schematic diagram of macroblock-coded blocks of a panorama image according to an embodiment of the present invention.
Detailed Description
The technical scheme of the invention is further explained by combining the attached drawings.
In order to solve the problem of higher algorithm complexity in the intra-frame prediction mode selection process in the ultrahigh-resolution panoramic video coding process, the invention provides a method for quickly selecting an intra-frame prediction mode for panoramic video image coding. The general idea of the method is as follows: and stripping non-key attention areas through image prior information, and quickly separating intra4 multiplied by 4 and intra16 multiplied by 16 modes. Meanwhile, alternative modes are reduced in the intra4 × 4 mode by using the Kirsch algorithm. In an embodiment, the description of the method is made in terms of airport panoramic video images. As shown in fig. 1, the method comprises the following steps:
step 1, plotting the off-line panoramic image.
The airport panorama has the important characteristics of high image resolution and large field angle, and the important observation area is a moving area such as a runway and an airport apron. Therefore, starting from narrowing the selection range of the prediction mode, part of details of the image in the non-key observation area are lost, and the improvement of the real-time performance of the algorithm is completely acceptable. And plotting the spliced panoramic image under the forward line, using plotting software to demarcate a plotting area polygon on the panoramic image, and recording a polygon point set to realize the extraction of a non-key observation area. The non-key observation area is used as a calibration area, and the calibrated information is called prior information for subsequent judgment of the position of the macro block to be coded. Fig. 2 is a schematic diagram of plotting the non-emphasized region of interest under the panoramic image line in this embodiment.
And 2, judging the position of the macro block to be coded and preliminarily selecting a prediction mode.
The h.264 macroblock size is 16 × 16, which is also a basic processing unit, and intra4 × 4 and intra16 × 16 predictions are performed simultaneously for the current luminance macroblock during intra coding. H.264 luminance block intra-frame coding needs to traverse all intra16 × 16 and intra4 × 4 modes, the intra4 × 4 mode has 9 prediction directions, the prediction is more accurate, the method is suitable for macro blocks with larger variation and more details, and the intra16 × 16 mode only has 4 prediction directions and is more suitable for flat macro blocks. The best mode can be selected by two modes distortion degrees according to the technology disclosed in "t.wiegand and b.gird ladder multiple selection in hybrid Video decoder control, Processing coding.2001international reference on Image Processing, vol.3 Oct,2001, pp.542-545". The H.264 intra-frame coding selection mode can be combined with video image contents, an intra16 × 16 mode can be selected for areas with flat image texture changes, and an intra4 × 4 mode can be selected for areas with severe image detail changes. Since the airport panoramic focus area is a moving area such as a runway apron, if the area is a non-moving area, the intra16 × 16 mode is directly adopted. Firstly, stripping intra16 × 16 mode to-be-coded blocks in a calibration area.
Inputting a panoramic video image of an airport to be coded, dividing a macro block, judging whether pixels in the macro block are in each polygon of a calibration area according to a polygon point set obtained by calibrating the panoramic image, calculating the ratio of the macro block to the calibration area, and considering that the pixels are in the calibration area if the ratio is greater than a specified threshold value. The calculation formula of the macro block occupying the calibration area ratio is as follows;
Figure BDA0003279765070000051
wherein, SuminThe number of pixels in the macroblock is shown, the part is the ratio of the macroblock to the scaling area, if the part is greater than the threshold th1, the macroblock is considered to be in the scaling area and is considered to be a non-important area of interest, the macroblock adopts an intra16 × 16 mode, and there are 4 alternative prediction modes.
And if the macro block to be coded is not in the calibration area, selecting a proper macro block intra-frame coding mode according to the image smoothness. Defining the image smoothness Thr as:
Figure BDA0003279765070000052
where f (x, y) is the pixel value of the macroblock at (x, y), m is the average of all pixels of the macroblock, and x, y are the horizontal and vertical coordinates representing the location of the pixel value. A larger Thr indicates a more drastic change in the image, whereas the image is flatter. Image smoothness is also referred to as flatness. According to the statistical properties of the sequence of video images, a threshold th2 is set, the smaller the value, the sharper the image. When Thr < th2, the intra16 × 16 mode is adopted, otherwise the intra4 × 4 mode is selected, which is called detail macroblock.
And 3, obtaining the texture direction of the macro block according to the Kirsch operator, thereby preferably selecting the intra-frame coding alternative mode.
For the macro block of the intra4 × 4 intra coding mode selected in step 2, the Kirsch operator determines the gradient magnitude and direction using 8 template directions, and calculates the difference values in different directions respectively, wherein the direction corresponding to the maximum value is regarded as the edge direction. The detailed introduction of the Kirsch operator can refer to the "Gong phonophorus digital image processing and analysis Beijing: the Qinghua university Press: 2006.".
The step 3 specifically comprises the following steps:
step 3-1, calculating a directional derivative of each pixel point in the image according to the templates in the eight directions, referring to fig. 3, where the 8 directions are respectively 0 °, 45 °, 90 °, 135 °, 180 °, 225 °, 270 °, 315 °, and correspond to 1,8,7,6,5,4,3,2 direction numbers, and compared with 8 modes except for the DC prediction mode in h.264 intra prediction, there is no corresponding prediction mode in the 5,6,7,8 directions in Kirsch operator, so that 8 directions of Kirsch can be reduced to 4 directions, and a corresponding partial derivative calculation method is as follows:
(1) the partial derivative in the 0 ° direction is calculated as follows:
Figure BDA0003279765070000061
(2) the partial derivative in the 315 ° direction is calculated as follows:
Figure BDA0003279765070000062
(3) the partial derivative in the 270 ° direction is calculated as follows:
Figure BDA0003279765070000063
(4) the partial derivative in the 225 ° direction is calculated as follows:
Figure BDA0003279765070000064
Figure BDA0003279765070000071
after having solved for G0On the basis of (i, j), the partial derivative value of the remaining direction can be iterated through the following formula, and the operation amount is reduced:
G315(i,j)=G0(i,j)+8(f(i-1,j+1)-f(i+1,j))
G270(i,j)=G315(i,j)+8(f(i+1,j-1)-f(i,j+1))
G225(i,j)=G270(i,j)+8(f(i,j-1)-f(i+1,j+1))
where (i, j) represents the coordinates of pixel points x and y.
And 3-2, calculating the reciprocal of the partial derivatives in the 4 directions, determining the direction corresponding to the maximum template as the edge direction of the pixel, and selecting an alternative mode according to the following rule:
if the direction of the Kirsch edge is direction 1, selecting the prediction modes 1, 6 and 8 as alternative modes;
if the direction of the Kirsch edge is direction 2, selecting prediction modes 4, 5 and 6 as alternative modes;
if the direction of the Kirsch edge is direction 3, selecting prediction modes 0, 5 and 7 as alternative modes;
if the direction of the Kirsch edge is direction 4, prediction modes 3, 7 are selected as alternative modes.
And 4, calculating a cost function RD _ cost according to the alternative modes, and selecting the mode with the minimum cost as the best coding mode.
If the intra16 × 16 mode is selected, the four prediction modes of 16 × 16 vertical, horizontal, DC and planar are used as the candidate modes, the cost function RD _ cost is calculated, and the smallest is used as the final mode.
If the mode is a4 × 4 luminance mode, a DC mode is added as an alternative mode according to the obtained alternative mode, and RD _ cost is calculated, and the smallest is used as a final mode. The formula for selecting the mode with the minimum distortion degree as the best mode in the RD _ cost calculation is as follows:
RD_cost=SSD(s,c,MODE|QP)+λMODER(s,c,MODE|QP)
λMoDEfor the current lagrange coefficient, MODE is an encoding MODE selectable for the current macroblock, QP is a macroblock quantization parameter, s, c respectively represents pixel values of the original image and the reconstructed image in the corresponding MODE, R (s, c, MODE | QP) represents the number of encoding output bits of the current macroblock at a specific QP and MODE, SSD (s, c, MODE | QP) represents the distortion of the image, and the formula is as follows:
Figure BDA0003279765070000081
a schematic diagram of macroblock-coded blocks of a panorama image according to the present embodiment is shown in fig. 4.
The method for quickly selecting the H264 intra-frame prediction mode for the airport panorama coding, which is described by the embodiment of the invention, solves the problem of quickly selecting the H264 intra-frame prediction mode for the airport ultrahigh-resolution panorama H264 coding. The on-site verification and performance test are carried out in a plurality of airport environments, the coding time can be reduced by 38.92%, and the method has wide application prospect.
Based on the same technical concept as the method embodiment, according to another embodiment of the present invention, there is provided a computer apparatus including: one or more processors; a memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs, when executed by the one or more processors, implement the steps in the method embodiments.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting the same, and although the present invention is described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that: modifications and equivalents may be made to the embodiments of the invention without departing from the spirit and scope of the invention, which is to be covered by the claims.

Claims (11)

1. An intra prediction mode determination method for H264 coding of panoramic video, characterized by comprising the following steps:
plotting the off-line panoramic image, identifying a designated observation area as a calibration area, and storing a polygon point set of the calibration area;
carrying out macro block division on an input panoramic video image to be coded, and judging whether a macro block to be coded is in a calibration area or not according to a polygon point set;
for the macro block to be coded in the calibration area, an intra16 × 16 mode is used, and 4 coding alternative modes are corresponding to the macro block to be coded;
for the macro blocks to be coded in the non-calibration area, calculating the smoothness of the image, and determining whether an intra16 × 16 mode or an intra4 × 4 mode is used according to the smoothness; for the macro block which is determined to use the intra4 multiplied by 4 mode, calculating the edge direction by using a Kirsch operator, and determining a coding alternative mode according to the edge direction;
and according to the selected alternative mode, selecting the mode with the minimum cost as the final intra-frame prediction mode by calculating a cost function.
2. The method of claim 1, wherein the specified observation region is a non-motion region in the airport panoramic image.
3. The method of claim 1, wherein the step of performing macroblock partition on the input panoramic video image to be encoded, and the step of determining whether the macroblock to be encoded is in the calibration area according to the polygon point set comprises:
dividing an input airport panoramic image according to 16 multiplied by 16 macro blocks, combining a polygon point set, calculating an attribute value of a current macro block to be coded about a calibration area, and when the calculated attribute value is greater than or equal to a specified threshold th1, considering that the macro block is in the calibration area, otherwise, not in the calibration area.
4. The method of claim 3, wherein the property value of the macroblock to be coded with respect to the anchor region is calculated according to the following formula:
Figure FDA0003279765060000011
wherein, SuminRepresenting a current macroblockThe number of intra pixels in the macroblock area, and the part represents the ratio of the current macroblock as the macroblock area.
5. The method of claim 1, wherein for macroblocks to be coded in non-anchor regions, the intra4 x 4 mode is used when image smoothness is greater than a set threshold th2, and otherwise the intra16 x 16 mode is used.
6. The method of claim 5, wherein the image smoothness calculation formula is:
Figure FDA0003279765060000021
where f (x, y) is the pixel value of the macroblock at (x, y) and m is the average of all pixels of the macroblock.
7. The method of claim 1, wherein the edge direction is calculated by using a Kirsch operator, and determining the encoding candidate mode according to the edge direction comprises: the 8 directions in the Kirsch operator are respectively 0 °, 45 °, 90 °, 135 °, 180 °, 225 °, 270 °, 315 °, and correspond to the direction numbers of 1,8,7,6,5,4,3,2, respectively, and compared with the 8 modes except the DC prediction mode in the intra4 × 4 mode, there is no corresponding prediction mode corresponding to the 5,6,7,8 directions, so the directional derivative of each pixel point in the image is calculated according to the templates in the four directions of 1,2,3,4, and then the reciprocal is calculated from the partial derivatives in the 4 directions, the largest template corresponds to the edge direction of the pixel, and the alternative mode is selected according to the corresponding rule.
8. The method of claim 7, wherein the alternative mode is selected by the following rules:
if the direction of the Kirsch edge is direction 1, selecting the prediction modes 1, 6 and 8 as alternative modes;
if the direction of the Kirsch edge is direction 2, selecting prediction modes 4, 5 and 6 as alternative modes;
if the direction of the Kirsch edge is direction 3, selecting prediction modes 0, 5 and 7 as alternative modes;
if the direction of the Kirsch edge is direction 4, prediction modes 3, 7 are selected as alternative modes.
9. The method of claim 8, wherein for intra4 x 4 macroblocks, a DC mode is added to an alternative mode obtained according to a rule as an alternative mode.
10. Providing a computer device, the device comprising:
one or more processors;
a memory; and
one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs when executed by the one or more processors implement the intra prediction mode determination method for panoramic video H264 encoding as described in the first aspect of the present invention.
11. A computer-readable storage medium having a computer program stored thereon, the computer program being executable by one or more processors to implement the method for intra prediction mode determination for panoramic video H264 coding.
CN202111128793.1A 2021-09-26 2021-09-26 Intra-frame prediction mode determination method and device for panoramic video H264 coding Pending CN113992911A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111128793.1A CN113992911A (en) 2021-09-26 2021-09-26 Intra-frame prediction mode determination method and device for panoramic video H264 coding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111128793.1A CN113992911A (en) 2021-09-26 2021-09-26 Intra-frame prediction mode determination method and device for panoramic video H264 coding

Publications (1)

Publication Number Publication Date
CN113992911A true CN113992911A (en) 2022-01-28

Family

ID=79736674

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111128793.1A Pending CN113992911A (en) 2021-09-26 2021-09-26 Intra-frame prediction mode determination method and device for panoramic video H264 coding

Country Status (1)

Country Link
CN (1) CN113992911A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115209181A (en) * 2022-06-09 2022-10-18 咪咕视讯科技有限公司 Video synthesis method based on surround view angle, controller and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115209181A (en) * 2022-06-09 2022-10-18 咪咕视讯科技有限公司 Video synthesis method based on surround view angle, controller and storage medium
CN115209181B (en) * 2022-06-09 2024-03-22 咪咕视讯科技有限公司 Video synthesis method based on surrounding view angle, controller and storage medium

Similar Documents

Publication Publication Date Title
WO2018010492A1 (en) Rapid decision making method for intra-frame prediction mode in video coding
EP3389276B1 (en) Hash-based encoder decisions for video coding
TWI600317B (en) Image decoding apparatus, picture decoding method, picture encoding apparatus, picture encoding method, and data structure of encoded data
CN109068142B (en) 360-degree video intra-frame prediction rapid decision-making method, device, coder-decoder and medium
WO2022104498A1 (en) Intra-frame prediction method, encoder, decoder and computer storage medium
JP2008227670A (en) Image coding device
Zhu et al. An improved inter-frame prediction algorithm for video coding based on fractal and H. 264
US20240031576A1 (en) Method and apparatus for video predictive coding
US10542277B2 (en) Video encoding
Heidari et al. Reduction of intra-coding time for HEVC based on temporary direction map
Chen et al. Rough mode cost–based fast intra coding for high-efficiency video coding
JP4994877B2 (en) Method and system for selecting a macroblock coding mode in a video frame sequence
CN117156133B (en) Inter-frame prediction mode selection method and device for video coding
CN113992911A (en) Intra-frame prediction mode determination method and device for panoramic video H264 coding
JP5748225B2 (en) Moving picture coding method, moving picture coding apparatus, and moving picture coding program
RU2562414C1 (en) Method for fast selection of spatial prediction mode in hevc coding system
JP2008271127A (en) Coding apparatus
TWI510050B (en) Quantization control method and apparatus, and quantization control program
JP2005348008A (en) Moving picture coding method, moving picture coder, moving picture coding program and computer-readable recording medium with record of the program
JP2012129791A (en) Image encoder
CN110035285B (en) Depth prediction method based on motion vector sensitivity
CN110677646B (en) Intra-frame coding prediction method based on CPU + GPU hybrid coding
CN109168000B (en) HEVC intra-frame prediction rapid algorithm based on RC prediction
KR100771641B1 (en) Fast mehtod of determinning h.264 mode
WO2020258052A1 (en) Image component prediction method and device, and computer storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination