CN104135662B

CN104135662B - Improved H.264 compressed encoding method for video file under limited bandwidth and emission rate condition

Info

Publication number: CN104135662B
Application number: CN201410409829.7A
Authority: CN
Inventors: 李卓明; 王鹏; 高峰; 沙学军; 赵洪林; 李德志; 高玉龙
Original assignee: Harbin Institute of Technology
Current assignee: Harbin Institute of Technology
Priority date: 2014-08-19
Filing date: 2014-08-19
Publication date: 2017-04-26
Anticipated expiration: 2034-08-19
Also published as: CN104135662A

Abstract

The invention discloses an improved H.264 compressed encoding method for a video file under the limited bandwidth and transmission rate condition, and relates to the field of digital image processing. The improved H.264 compressed encoding method aims at solving the problem that adaptive network resources of a video downloading transmission method in an existing wireless mobile communication network are limited, and the high-speed and high-definition transmission requirement is met. The improved H.264 compressed encoding method includes the steps that in the initial phase, each frame of image is compressed in advance, characteristics of the images in different directions are extracted through plural wavelet transformation, and meanwhile information redundancies are reduced; in the inter-frame prediction phase, according to the directive characteristics obtained through the wavelet transformation, the spatial correlation between inter-frame images is judged, the complexity and the dimensionality of motion vector calculation are reduced, and a P frame and a B frame are obtained; in the intra-frame prediction phase, detail characteristics are eliminated, prediction and integer cosine transformation are only carried out on the overall characteristics, and an I frame is obtained. The improved H.264 compressed encoding method is suitable for improved H.264 compressed encoding on the video file under the limited bandwidth and transmission rate condition.

Description

For under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 compaction coding method

Technical field

The present invention relates to digital image processing field, and in particular to H.264 compression coding technology.

Background technology

The purpose of Video coding is to realize the compression of video.According to the viewpoint of Shannon information theory, with comentropy come table The information of data is levied, remaining information is redundancy.The redundancy of a large amount of fragment types is still existed in video information, therefore, remove Redundancy also becomes the core content in video coding technique, to reach the purpose of compression, video coding technique mainly for Redundancy have following a few classes：

(1), the time redundancy between frame and frame.The vision of people is that have a brief stay effect, and the scene flashed is not Can disappear immediately, therefore frequency is higher than this value of the mankind if picture flashes, human eye would not feel the pause between frame and frame, General video council using the 25 frames/more than second frame sampling rate.So, the time interval between two successive frames will be less than 1/25= 0.04 second, then when the speed of moving objects in video will not be too fast, the degree of association between adjacent two frame will be very high, shape Into redundancy.

(2), the spatial redundancy in each frame in portion.One sub-picture is exactly a frame, and spatial redundancy is existed in frame of video.

(3), the statistical redundancy of generally existing.Signal in our daily life, the value in its correspondence signal space The probability of the rule of order is very little, and would generally get maximum value or minimum value in one or several particular points.So, Can be by distributing compared with short code word for the larger data point of probability, being that probability is less distributes compared with long code word, so improves pressure Contracting efficiency.

(4), the visual redundancy based on human eye.Source video sequence is extremely wide, abundant in content colorful, it is desirable to which wherein all of information is all It is the hardly possible thing accomplished to receive this by human eye.Therefore, it can the blind area perceived for human eye, it suitably simplified, To eliminate visual redundancy.

In sum, technology of video compressing encoding should have following feature：

1), algorithm is simple：The high resolution of video image, frame rate is fast, therefore, the data volume of process is very big, in order that The process time for obtaining algorithm is reduced, and needs to reduce its complexity；

2) it is, practical：Current most application scenarios should be able to be extensively supported, if desired for high-resolution numeral electricity Depending on etc.；

3), various dimensions are processed：Image is compressed using redundancy of the image in time, space and frequency space.

Currently existing scheme and its pluses and minuses：Digital video decoding industry is rapidly grown up like the mushrooms after rain, it is contemplated that and Capacitive and intercommunity problem, different coding and decoding video producers are all made that tremendous contribution.The mark of video coding and decoding technology Quasi-ization problem seems and becomes more and more important that the Ge great worlds are organized in digital video decoding standardization aspect and expand many work and make A series of video encoding and decoding standards are determined.

MPEG tissues start to define the video compression algorithm for completing from nineteen ninety during MPEG-X series standards, employ block side The technologies such as motion compensation, discrete cosine transform (DCT), the quantization of formula.Speed from the 1.5Mbps of MPEG-1, MPEG-2 3～ The dynamic code rate of 100Mbit/s.MPEG-4 is pushed out within 1999, it is intended to in Internet or mobile communication equipment in real time Transmission sound, video signal and the MPEG standards formulated, numerous multimedia application are integrated in a complete frame, are multimedia Communication and applied environment provide canonical algorithm and instrument, can be applied by multimedia transmission, storage, retrieval etc. so as to set up one kind The Uniform data format that field generally adopts.

MPEG-4 is adopted and compressed based on object approach, and relatively before standard compression ratio is greatly improved, and compression multiple is 450 times (still image can reach 800 times), resolution input can from 320X240 to 1280X1204, this be homogenous quantities MPEG-1 and More than 10 times of MJEPG.

H.26X series standard is another kind of Video coding mode proposed by ITU-T, and at present application is most ripe to be exactly H.264.H.264 it is the MPEG (Motion Picture Expert Group) of the VCEG (Video Coding Experts Group) and ISO/IEC of ITU-T Joint video team (JVT：Joint video team) exploitation a new digital video coding standard, it is both ITU-T's H.264, be again ISO/IEC MPEG-4 the 10th part, formally issue in March, 2003.H.264 strengthen to various channels Adaptability, using the structure and grammer of network friendliness, be conducive to the process to the bit error rate and packet loss, identical rebuild scheme As under quality, can be than H.263 saving the code check of half or so.The network adaptability of code flow structure H.264 is strong, increased difference Wrong recovery capability, can be well adapted for IP and wireless network.

But, any video encoding and decoding standard while higher compression ratio and rate respectively is pursued, its algorithm Complexity generates restriction to real-time application, it is difficult to meet under wireless channel environment, channel is obtained and holding time, random The requirement of access delay.Meanwhile, the transmission requirement code stream of wireless channel itself has very strong error recovery capabilities.Radio communication The compressed encoding for requiring video supports that auto-adaptive parameter is arranged, and can pass through in the case of channel capacity or power limited Sacrificial section image quality, and do not bring the reduction of obvious Consumer's Experience.

The content of the invention

The present invention is to adapt to Internet resources to solve the video download transmission method of existing mobile radio networks It is limited, and in order to adapt to high speed, the demand of the transmission of high image quality, so as to provide a kind of being used in finite bandwidth and emission rate Under the conditions of to the modified model of video file H.264 compaction coding method.

For under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 compaction coding method, Under the limited wireless channel environment of frequency range, for the compressed encoding of the video of each two field picture, comprise the following steps：

Step one, initialization：Judge whether current frame image is start frame, i.e.,：Judge whether current frame image is video First frame of stream or the first frame of encoding stream, if it is judged that being yes, then execution step is one by one；If judged result is no, Then execution step one or two；

Step carries out Complex wavelet change one by one, to current frame image, obtains the level of present frame, vertical and 45° angle side To information, and execution step four；

Step one two, Complex wavelet change is carried out to current frame image, obtain present frame level, vertical and 45° angle direction Information, execution step two；

Step 2, prediction mode are selected：According to current frame image and the direction character dependency of reference frame image, judge to adopt Prediction mode, according to be inter prediction mode, then execution step three；According to be infra-frame prediction mode, then hold Row step 4；The reference frame image is, by block elimination effect filter, to rebuild original image and all directions by previous frame image Characteristic information after obtain image；

Step 3, offset dependence operation is carried out to present frame in level, vertical direction respectively, and judged according to peak value Current frame image relative to reference frame image motion principal direction, calculation of motion vectors, and execution step four；

If step 4, current frame image use inter prediction, execution step 41；What if current frame image was adopted It is infra-frame prediction, execution step four or two；

Step 4 one, according to motion vector and reference frame image, be residual error by the main constituent compression of images after wavelet transformation Signal, removes minutia, i.e.,：Remove diagonally opposed information, and execution step five；

Step 4 two, the Intra prediction mode selection to the execution of main constituent image H.264, while remove minutia, i.e.,： Remove diagonally opposed information, execution step five；

Step 5, integer cosine transformation and quantization, execution step six are done to the current frame image after prediction；

Step 6, judge that whether current frame image is the last frame of video flowing or encoding stream, if it is determined that judging knot Fruit is no, then execution step 61；If it is judged that being yes, then six or two are performed；

Step 6 one, re-quantization and inverse transformation are carried out to current frame image, while entropy code is carried out to current frame image, and Transmit under different host-host protocols, complete the compressed encoding of current frame image；

Step 6 two, entropy code is directly carried out, transmitted, complete the compressed encoding of current frame image.

It also includes：Step 7, after step 6 one is completed, by block elimination effect filter, rebuild original image and each The characteristic information in direction, as the reference frame of next frame.

Prediction mode selection course in step 2 is specially：

In the relative operation being both horizontally and vertically segmented respectively, operation in the horizontal direction is specially：

Step 2 one, according to the resolution of horizontal direction characteristic image, by horizontal direction characteristic image in the horizontal direction Carry out N deciles, and execution step two or two；N is positive integer；

Step 2 two, the skew for carrying out on the subimage of the by stages such as n-th n-1 units, it is corresponding with reference frame interval Image carry out relative operation, calculate cross correlation value, obtain N number of cross correlation value；

Step 2 three, by obtain N number of cross correlation value do normalization operation, determine whether more than 50% cross correlation value More than 0.5, if it is judged that being yes, then execution step two or four；If it is judged that being no, then execution step two or five；

Step 2 four, determine and perform inter prediction, calculate initial level motion excursion amount and be equal to N number of interval off set amount and mutually The weighted sum form that correlation is multiplied；

Step 2 five, determination perform infra-frame prediction.

Complex wavelet changing method of the step one by one or used in step one two is specially：Will according to the image quality of video image Summation application scenarios, using different types of small echo directional pattern picture is extracted.

Complex wavelet change of the step one by one or used in step one two is realized using Haar or dbN small echos.

The present invention is a kind of for H.264 compressing to the modified model of video file under the conditions of finite bandwidth and emission rate Coded method, solves the problems, such as that the video download transmission scheme of existing mobile radio networks adapts to Internet resources and is limited. High speed, the demand of the transmission of high image quality are adapted to.

Description of the drawings

Fig. 1 is the H.264 video compression coding hierarchy schematic diagram that the present invention is used；

Fig. 2 is the present invention for H.264 compressing to the modified model of video file under the conditions of finite bandwidth and emission rate Coding structure schematic diagram；

Fig. 3 is the CABAC entropy code schematic flow sheets used in the present invention；

Fig. 4 is the complete video transmission plan schematic flow sheet in the present invention；

Specific embodiment

Specific embodiment one, under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 Compaction coding method, it is realized by following steps：

Step one, initialization：Whether present frame is start frame, if the first of the first frame of video flowing or encoding stream Frame, execution step one by one, if it is not, execution step one or two；

Step carries out Complex wavelet change one by one, to image, obtains level, vertical and 45° angle directional information, execution step Four；

Step one two, Complex wavelet change is carried out to image, obtain level, vertical and 45° angle directional information, execution step Two；

Step 2, predictive behavior are selected：According to present frame and the direction character dependency of reference frame, the prediction for adopting is judged Mode, if selecting inter prediction, execution step three, if selecting infra-frame prediction, execution step four；

Step 2 one, according to the resolution of horizontal direction characteristic image, N deciles, execution step are carried out in the horizontal direction Two or two；

Step 2 two, the skew for carrying out on the subimage of the by stages such as n-th n-1 units, the corresponding area with reference picture Between image carry out relative operation, calculate cross correlation value；

Step 2 three, N number of cross correlation value is obtained, normalization operation is done, if more than 50% cross correlation value is more than 0.5, execution step two or four, if it is not, execution step two or five；

Step 2 five, determination perform infra-frame prediction；

Step 3, in level, vertical direction offset dependence operation is carried out respectively, judge that present frame is suitable according to peak value In the motion principal direction of reference frame, calculation of motion vectors, execution step four；

If step 4, present frame use inter prediction, execution step 41, if using infra-frame prediction, performing Step 4 two；

Step 4 one, according to motion vector and reference frame, be residual signals by the main constituent compression of images after wavelet transformation, Remove minutia, i.e., diagonally opposed information, execution step five；

Step 4 two, the Intra prediction mode selection process to the execution of main constituent image H.264, while it is special to remove details Levy, execution step five；

Step 5, integer cosine transformation and quantization, execution step six are done to the present frame after prediction；

If step 6, present frame are not the last frames of video flowing or encoding stream, if performing 61, being carried out six or two；

Step 6 one, the process of re-quantization and inverse transformation is carried out to present frame, while entropy code is carried out to it, according to difference Host-host protocol transmit, execution step seven；

Step 6 two, entropy code is directly carried out, transmitted, complete the compressed encoding work of video.

Step 7, by block elimination effect filter, the characteristic information of original image and all directions is rebuild, as next frame Reference frame is used.

The present invention carries out precommpression in the starting stage to the image of each frame, extracts image by Complex Wavelet Transform and exists The feature of different directions, while the redundancy of the information of reduction；It is the inter prediction stage, special according to the directivity obtained through Wavelet transformation Levy, judge the spatial coherence between inter frame image, reduce the complexity and dimension of motion vector calculation, obtain P frames and B frames； In the infra-frame prediction stage, minutia is weeded out, only global feature is predicted and integer cosine transformation, obtain I frames.

The compaction coding method complexity of the present invention is greatly lowered, and transmission speed is greatly improved.And permit Perhaps it is transmitted according to different transmission requirements, the video download transmission scheme for solving existing mobile radio networks is adapted to The limited problem of Internet resources.

Claims

1. the H.264 compaction coding method of the modified model to video file is used under the conditions of finite bandwidth and emission rate, and it is special Levying is：Under the wireless channel environment that frequency range is limited, for the compressed encoding of the video of each two field picture, including following step Suddenly：

Step one, initialization：Judge whether current frame image is start frame, i.e.,：Judge whether present frame is the first of video flowing First frame of frame or encoding stream, if it is judged that being yes, then execution step is one by one；If judged result is no, step is performed Rapid 1；

Step carries out Complex wavelet change one by one, to current frame image, obtains the level of current frame image, vertical and 45° angle side To information, and execution step four；

Step one two, Complex wavelet change is carried out to current frame image, obtain the level of current frame image, vertical and 45° angle side To information, execution step two；

Step 2, prediction mode are selected：According to current frame image and the direction character dependency of reference frame image, judge what is adopted Prediction mode, according to be inter prediction mode, then execution step three；According to be infra-frame prediction mode, then perform step Rapid four；The reference frame image is by block elimination effect filter, the spy for rebuilding original image and all directions by previous frame image The image obtained after reference breath；

Step 3, offset dependence operation is carried out to current frame image in level, vertical direction respectively, and judged according to peak value Current frame image relative to reference frame image motion principal direction, calculation of motion vectors, and execution step four；

If step 4, current frame image use inter prediction, execution step 41；If current frame image uses frame Interior prediction, execution step four or two；

Step 4 one, according to motion vector and reference frame image, be residual signals by the main constituent compression of images after wavelet transformation, Remove minutia, i.e.,：Remove diagonally opposed information, and execution step five；

Step 4 two, the Intra prediction mode selection to the execution of main constituent image H.264, while remove minutia, i.e.,：Remove Diagonally opposed information, execution step five；

Step 6, judge whether current frame image is the last frame of video flowing or encoding stream, if it is judged that be it is no, then Execution step 61；If it is judged that being yes, then six or two are performed；

Step 6 one, re-quantization and inverse transformation are carried out to current frame image, while entropy code is carried out to current frame image, and not Transmit under same host-host protocol, complete the compressed encoding of current frame image；

2. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 compaction coding method, it is characterised in that it also includes：

Step 7, after step 6 one is completed, by block elimination effect filter, rebuild the feature letter of original image and all directions Breath, as the reference frame of next frame.

3. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 compaction coding method, it is characterised in that the prediction mode selection course in step 2 is specially：

The relative operation being segmented in the horizontal direction, operation in the horizontal direction is specially：

Step 2 one, according to the resolution of horizontal direction characteristic image, horizontal direction characteristic image is carried out in the horizontal direction N Decile, and execution step two or two；N is positive integer；

Step 2 two, the skew for carrying out on the subimage of the by stages such as n-th n-1 units, interval figure corresponding with reference frame As carrying out relative operation, cross correlation value is calculated, obtain N number of cross correlation value；

Step 2 three, by obtain N number of cross correlation value do normalization operation, the cross correlation value for determining whether more than 50% is more than 0.5, if it is judged that being yes, then execution step two or four；If it is judged that being no, then execution step two or five；

Step 2 four, determination perform inter prediction, calculate initial level motion excursion amount and are equal to N number of interval off set amount and cross-correlation The weighted sum form that value is multiplied；

Step 2 five, determination perform infra-frame prediction.

4. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 compaction coding method, it is characterised in that Complex wavelet changing method of the step one by one or used in step one two is specially： Required according to the image quality of video image and application scenarios, using different types of small echo directional pattern picture is extracted.

5. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 compaction coding method, it is characterised in that Complex wavelet change of the step one by one or used in step one two using Haar or DbN small echos are realized.