CN104135662B - Improved H.264 compressed encoding method for video file under limited bandwidth and emission rate condition - Google Patents
Improved H.264 compressed encoding method for video file under limited bandwidth and emission rate condition Download PDFInfo
- Publication number
- CN104135662B CN104135662B CN201410409829.7A CN201410409829A CN104135662B CN 104135662 B CN104135662 B CN 104135662B CN 201410409829 A CN201410409829 A CN 201410409829A CN 104135662 B CN104135662 B CN 104135662B
- Authority
- CN
- China
- Prior art keywords
- frame image
- image
- frame
- current frame
- execution step
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Abstract
The invention discloses an improved H.264 compressed encoding method for a video file under the limited bandwidth and transmission rate condition, and relates to the field of digital image processing. The improved H.264 compressed encoding method aims at solving the problem that adaptive network resources of a video downloading transmission method in an existing wireless mobile communication network are limited, and the high-speed and high-definition transmission requirement is met. The improved H.264 compressed encoding method includes the steps that in the initial phase, each frame of image is compressed in advance, characteristics of the images in different directions are extracted through plural wavelet transformation, and meanwhile information redundancies are reduced; in the inter-frame prediction phase, according to the directive characteristics obtained through the wavelet transformation, the spatial correlation between inter-frame images is judged, the complexity and the dimensionality of motion vector calculation are reduced, and a P frame and a B frame are obtained; in the intra-frame prediction phase, detail characteristics are eliminated, prediction and integer cosine transformation are only carried out on the overall characteristics, and an I frame is obtained. The improved H.264 compressed encoding method is suitable for improved H.264 compressed encoding on the video file under the limited bandwidth and transmission rate condition.
Description
Technical field
The present invention relates to digital image processing field, and in particular to H.264 compression coding technology.
Background technology
The purpose of Video coding is to realize the compression of video.According to the viewpoint of Shannon information theory, with comentropy come table
The information of data is levied, remaining information is redundancy.The redundancy of a large amount of fragment types is still existed in video information, therefore, remove
Redundancy also becomes the core content in video coding technique, to reach the purpose of compression, video coding technique mainly for
Redundancy have following a few classes:
(1), the time redundancy between frame and frame.The vision of people is that have a brief stay effect, and the scene flashed is not
Can disappear immediately, therefore frequency is higher than this value of the mankind if picture flashes, human eye would not feel the pause between frame and frame,
General video council using the 25 frames/more than second frame sampling rate.So, the time interval between two successive frames will be less than 1/25=
0.04 second, then when the speed of moving objects in video will not be too fast, the degree of association between adjacent two frame will be very high, shape
Into redundancy.
(2), the spatial redundancy in each frame in portion.One sub-picture is exactly a frame, and spatial redundancy is existed in frame of video.
(3), the statistical redundancy of generally existing.Signal in our daily life, the value in its correspondence signal space
The probability of the rule of order is very little, and would generally get maximum value or minimum value in one or several particular points.So,
Can be by distributing compared with short code word for the larger data point of probability, being that probability is less distributes compared with long code word, so improves pressure
Contracting efficiency.
(4), the visual redundancy based on human eye.Source video sequence is extremely wide, abundant in content colorful, it is desirable to which wherein all of information is all
It is the hardly possible thing accomplished to receive this by human eye.Therefore, it can the blind area perceived for human eye, it suitably simplified,
To eliminate visual redundancy.
In sum, technology of video compressing encoding should have following feature:
1), algorithm is simple:The high resolution of video image, frame rate is fast, therefore, the data volume of process is very big, in order that
The process time for obtaining algorithm is reduced, and needs to reduce its complexity;
2) it is, practical:Current most application scenarios should be able to be extensively supported, if desired for high-resolution numeral electricity
Depending on etc.;
3), various dimensions are processed:Image is compressed using redundancy of the image in time, space and frequency space.
Currently existing scheme and its pluses and minuses:Digital video decoding industry is rapidly grown up like the mushrooms after rain, it is contemplated that and
Capacitive and intercommunity problem, different coding and decoding video producers are all made that tremendous contribution.The mark of video coding and decoding technology
Quasi-ization problem seems and becomes more and more important that the Ge great worlds are organized in digital video decoding standardization aspect and expand many work and make
A series of video encoding and decoding standards are determined.
MPEG tissues start to define the video compression algorithm for completing from nineteen ninety during MPEG-X series standards, employ block side
The technologies such as motion compensation, discrete cosine transform (DCT), the quantization of formula.Speed from the 1.5Mbps of MPEG-1, MPEG-2 3~
The dynamic code rate of 100Mbit/s.MPEG-4 is pushed out within 1999, it is intended to in Internet or mobile communication equipment in real time
Transmission sound, video signal and the MPEG standards formulated, numerous multimedia application are integrated in a complete frame, are multimedia
Communication and applied environment provide canonical algorithm and instrument, can be applied by multimedia transmission, storage, retrieval etc. so as to set up one kind
The Uniform data format that field generally adopts.
MPEG-4 is adopted and compressed based on object approach, and relatively before standard compression ratio is greatly improved, and compression multiple is 450 times
(still image can reach 800 times), resolution input can from 320X240 to 1280X1204, this be homogenous quantities MPEG-1 and
More than 10 times of MJEPG.
H.26X series standard is another kind of Video coding mode proposed by ITU-T, and at present application is most ripe to be exactly
H.264.H.264 it is the MPEG (Motion Picture Expert Group) of the VCEG (Video Coding Experts Group) and ISO/IEC of ITU-T
Joint video team (JVT:Joint video team) exploitation a new digital video coding standard, it is both ITU-T's
H.264, be again ISO/IEC MPEG-4 the 10th part, formally issue in March, 2003.H.264 strengthen to various channels
Adaptability, using the structure and grammer of network friendliness, be conducive to the process to the bit error rate and packet loss, identical rebuild scheme
As under quality, can be than H.263 saving the code check of half or so.The network adaptability of code flow structure H.264 is strong, increased difference
Wrong recovery capability, can be well adapted for IP and wireless network.
But, any video encoding and decoding standard while higher compression ratio and rate respectively is pursued, its algorithm
Complexity generates restriction to real-time application, it is difficult to meet under wireless channel environment, channel is obtained and holding time, random
The requirement of access delay.Meanwhile, the transmission requirement code stream of wireless channel itself has very strong error recovery capabilities.Radio communication
The compressed encoding for requiring video supports that auto-adaptive parameter is arranged, and can pass through in the case of channel capacity or power limited
Sacrificial section image quality, and do not bring the reduction of obvious Consumer's Experience.
The content of the invention
The present invention is to adapt to Internet resources to solve the video download transmission method of existing mobile radio networks
It is limited, and in order to adapt to high speed, the demand of the transmission of high image quality, so as to provide a kind of being used in finite bandwidth and emission rate
Under the conditions of to the modified model of video file H.264 compaction coding method.
For under the conditions of finite bandwidth and emission rate to the modified model of video file H.264 compaction coding method,
Under the limited wireless channel environment of frequency range, for the compressed encoding of the video of each two field picture, comprise the following steps:
Step one, initialization:Judge whether current frame image is start frame, i.e.,:Judge whether current frame image is video
First frame of stream or the first frame of encoding stream, if it is judged that being yes, then execution step is one by one;If judged result is no,
Then execution step one or two;
Step carries out Complex wavelet change one by one, to current frame image, obtains the level of present frame, vertical and 45° angle side
To information, and execution step four;
Step one two, Complex wavelet change is carried out to current frame image, obtain present frame level, vertical and 45° angle direction
Information, execution step two;
Step 2, prediction mode are selected:According to current frame image and the direction character dependency of reference frame image, judge to adopt
Prediction mode, according to be inter prediction mode, then execution step three;According to be infra-frame prediction mode, then hold
Row step 4;The reference frame image is, by block elimination effect filter, to rebuild original image and all directions by previous frame image
Characteristic information after obtain image;
Step 3, offset dependence operation is carried out to present frame in level, vertical direction respectively, and judged according to peak value
Current frame image relative to reference frame image motion principal direction, calculation of motion vectors, and execution step four;
If step 4, current frame image use inter prediction, execution step 41;What if current frame image was adopted
It is infra-frame prediction, execution step four or two;
Step 4 one, according to motion vector and reference frame image, be residual error by the main constituent compression of images after wavelet transformation
Signal, removes minutia, i.e.,:Remove diagonally opposed information, and execution step five;
Step 4 two, the Intra prediction mode selection to the execution of main constituent image H.264, while remove minutia, i.e.,:
Remove diagonally opposed information, execution step five;
Step 5, integer cosine transformation and quantization, execution step six are done to the current frame image after prediction;
Step 6, judge that whether current frame image is the last frame of video flowing or encoding stream, if it is determined that judging knot
Fruit is no, then execution step 61;If it is judged that being yes, then six or two are performed;
Step 6 one, re-quantization and inverse transformation are carried out to current frame image, while entropy code is carried out to current frame image, and
Transmit under different host-host protocols, complete the compressed encoding of current frame image;
Step 6 two, entropy code is directly carried out, transmitted, complete the compressed encoding of current frame image.
It also includes:Step 7, after step 6 one is completed, by block elimination effect filter, rebuild original image and each
The characteristic information in direction, as the reference frame of next frame.
Prediction mode selection course in step 2 is specially:
In the relative operation being both horizontally and vertically segmented respectively, operation in the horizontal direction is specially:
Step 2 one, according to the resolution of horizontal direction characteristic image, by horizontal direction characteristic image in the horizontal direction
Carry out N deciles, and execution step two or two;N is positive integer;
Step 2 two, the skew for carrying out on the subimage of the by stages such as n-th n-1 units, it is corresponding with reference frame interval
Image carry out relative operation, calculate cross correlation value, obtain N number of cross correlation value;
Step 2 three, by obtain N number of cross correlation value do normalization operation, determine whether more than 50% cross correlation value
More than 0.5, if it is judged that being yes, then execution step two or four;If it is judged that being no, then execution step two or five;
Step 2 four, determine and perform inter prediction, calculate initial level motion excursion amount and be equal to N number of interval off set amount and mutually
The weighted sum form that correlation is multiplied;
Step 2 five, determination perform infra-frame prediction.
Complex wavelet changing method of the step one by one or used in step one two is specially:Will according to the image quality of video image
Summation application scenarios, using different types of small echo directional pattern picture is extracted.
Complex wavelet change of the step one by one or used in step one two is realized using Haar or dbN small echos.
The present invention is a kind of for H.264 compressing to the modified model of video file under the conditions of finite bandwidth and emission rate
Coded method, solves the problems, such as that the video download transmission scheme of existing mobile radio networks adapts to Internet resources and is limited.
High speed, the demand of the transmission of high image quality are adapted to.
Description of the drawings
Fig. 1 is the H.264 video compression coding hierarchy schematic diagram that the present invention is used;
Fig. 2 is the present invention for H.264 compressing to the modified model of video file under the conditions of finite bandwidth and emission rate
Coding structure schematic diagram;
Fig. 3 is the CABAC entropy code schematic flow sheets used in the present invention;
Fig. 4 is the complete video transmission plan schematic flow sheet in the present invention;
Specific embodiment
Specific embodiment one, under the conditions of finite bandwidth and emission rate to the modified model of video file H.264
Compaction coding method, it is realized by following steps:
Step one, initialization:Whether present frame is start frame, if the first of the first frame of video flowing or encoding stream
Frame, execution step one by one, if it is not, execution step one or two;
Step carries out Complex wavelet change one by one, to image, obtains level, vertical and 45° angle directional information, execution step
Four;
Step one two, Complex wavelet change is carried out to image, obtain level, vertical and 45° angle directional information, execution step
Two;
Step 2, predictive behavior are selected:According to present frame and the direction character dependency of reference frame, the prediction for adopting is judged
Mode, if selecting inter prediction, execution step three, if selecting infra-frame prediction, execution step four;
Step 2 one, according to the resolution of horizontal direction characteristic image, N deciles, execution step are carried out in the horizontal direction
Two or two;
Step 2 two, the skew for carrying out on the subimage of the by stages such as n-th n-1 units, the corresponding area with reference picture
Between image carry out relative operation, calculate cross correlation value;
Step 2 three, N number of cross correlation value is obtained, normalization operation is done, if more than 50% cross correlation value is more than
0.5, execution step two or four, if it is not, execution step two or five;
Step 2 four, determine and perform inter prediction, calculate initial level motion excursion amount and be equal to N number of interval off set amount and mutually
The weighted sum form that correlation is multiplied;
Step 2 five, determination perform infra-frame prediction;
Step 3, in level, vertical direction offset dependence operation is carried out respectively, judge that present frame is suitable according to peak value
In the motion principal direction of reference frame, calculation of motion vectors, execution step four;
If step 4, present frame use inter prediction, execution step 41, if using infra-frame prediction, performing
Step 4 two;
Step 4 one, according to motion vector and reference frame, be residual signals by the main constituent compression of images after wavelet transformation,
Remove minutia, i.e., diagonally opposed information, execution step five;
Step 4 two, the Intra prediction mode selection process to the execution of main constituent image H.264, while it is special to remove details
Levy, execution step five;
Step 5, integer cosine transformation and quantization, execution step six are done to the present frame after prediction;
If step 6, present frame are not the last frames of video flowing or encoding stream, if performing 61, being carried out six or two;
Step 6 one, the process of re-quantization and inverse transformation is carried out to present frame, while entropy code is carried out to it, according to difference
Host-host protocol transmit, execution step seven;
Step 6 two, entropy code is directly carried out, transmitted, complete the compressed encoding work of video.
Step 7, by block elimination effect filter, the characteristic information of original image and all directions is rebuild, as next frame
Reference frame is used.
The present invention carries out precommpression in the starting stage to the image of each frame, extracts image by Complex Wavelet Transform and exists
The feature of different directions, while the redundancy of the information of reduction;It is the inter prediction stage, special according to the directivity obtained through Wavelet transformation
Levy, judge the spatial coherence between inter frame image, reduce the complexity and dimension of motion vector calculation, obtain P frames and B frames;
In the infra-frame prediction stage, minutia is weeded out, only global feature is predicted and integer cosine transformation, obtain I frames.
The compaction coding method complexity of the present invention is greatly lowered, and transmission speed is greatly improved.And permit
Perhaps it is transmitted according to different transmission requirements, the video download transmission scheme for solving existing mobile radio networks is adapted to
The limited problem of Internet resources.
Claims (5)
1. the H.264 compaction coding method of the modified model to video file is used under the conditions of finite bandwidth and emission rate, and it is special
Levying is:Under the wireless channel environment that frequency range is limited, for the compressed encoding of the video of each two field picture, including following step
Suddenly:
Step one, initialization:Judge whether current frame image is start frame, i.e.,:Judge whether present frame is the first of video flowing
First frame of frame or encoding stream, if it is judged that being yes, then execution step is one by one;If judged result is no, step is performed
Rapid 1;
Step carries out Complex wavelet change one by one, to current frame image, obtains the level of current frame image, vertical and 45° angle side
To information, and execution step four;
Step one two, Complex wavelet change is carried out to current frame image, obtain the level of current frame image, vertical and 45° angle side
To information, execution step two;
Step 2, prediction mode are selected:According to current frame image and the direction character dependency of reference frame image, judge what is adopted
Prediction mode, according to be inter prediction mode, then execution step three;According to be infra-frame prediction mode, then perform step
Rapid four;The reference frame image is by block elimination effect filter, the spy for rebuilding original image and all directions by previous frame image
The image obtained after reference breath;
Step 3, offset dependence operation is carried out to current frame image in level, vertical direction respectively, and judged according to peak value
Current frame image relative to reference frame image motion principal direction, calculation of motion vectors, and execution step four;
If step 4, current frame image use inter prediction, execution step 41;If current frame image uses frame
Interior prediction, execution step four or two;
Step 4 one, according to motion vector and reference frame image, be residual signals by the main constituent compression of images after wavelet transformation,
Remove minutia, i.e.,:Remove diagonally opposed information, and execution step five;
Step 4 two, the Intra prediction mode selection to the execution of main constituent image H.264, while remove minutia, i.e.,:Remove
Diagonally opposed information, execution step five;
Step 5, integer cosine transformation and quantization, execution step six are done to the present frame after prediction;
Step 6, judge whether current frame image is the last frame of video flowing or encoding stream, if it is judged that be it is no, then
Execution step 61;If it is judged that being yes, then six or two are performed;
Step 6 one, re-quantization and inverse transformation are carried out to current frame image, while entropy code is carried out to current frame image, and not
Transmit under same host-host protocol, complete the compressed encoding of current frame image;
Step 6 two, entropy code is directly carried out, transmitted, complete the compressed encoding of current frame image.
2. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file
H.264 compaction coding method, it is characterised in that it also includes:
Step 7, after step 6 one is completed, by block elimination effect filter, rebuild the feature letter of original image and all directions
Breath, as the reference frame of next frame.
3. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file
H.264 compaction coding method, it is characterised in that the prediction mode selection course in step 2 is specially:
The relative operation being segmented in the horizontal direction, operation in the horizontal direction is specially:
Step 2 one, according to the resolution of horizontal direction characteristic image, horizontal direction characteristic image is carried out in the horizontal direction N
Decile, and execution step two or two;N is positive integer;
Step 2 two, the skew for carrying out on the subimage of the by stages such as n-th n-1 units, interval figure corresponding with reference frame
As carrying out relative operation, cross correlation value is calculated, obtain N number of cross correlation value;
Step 2 three, by obtain N number of cross correlation value do normalization operation, the cross correlation value for determining whether more than 50% is more than
0.5, if it is judged that being yes, then execution step two or four;If it is judged that being no, then execution step two or five;
Step 2 four, determination perform inter prediction, calculate initial level motion excursion amount and are equal to N number of interval off set amount and cross-correlation
The weighted sum form that value is multiplied;
Step 2 five, determination perform infra-frame prediction.
4. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file
H.264 compaction coding method, it is characterised in that Complex wavelet changing method of the step one by one or used in step one two is specially:
Required according to the image quality of video image and application scenarios, using different types of small echo directional pattern picture is extracted.
5. it is according to claim 1 under the conditions of finite bandwidth and emission rate to the modified model of video file
H.264 compaction coding method, it is characterised in that Complex wavelet change of the step one by one or used in step one two using Haar or
DbN small echos are realized.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410409829.7A CN104135662B (en) | 2014-08-19 | 2014-08-19 | Improved H.264 compressed encoding method for video file under limited bandwidth and emission rate condition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410409829.7A CN104135662B (en) | 2014-08-19 | 2014-08-19 | Improved H.264 compressed encoding method for video file under limited bandwidth and emission rate condition |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104135662A CN104135662A (en) | 2014-11-05 |
CN104135662B true CN104135662B (en) | 2017-04-26 |
Family
ID=51808174
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410409829.7A Active CN104135662B (en) | 2014-08-19 | 2014-08-19 | Improved H.264 compressed encoding method for video file under limited bandwidth and emission rate condition |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104135662B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3318060B1 (en) * | 2015-07-27 | 2021-05-26 | Mediatek Inc. | Reference picture storage in intra block copy mode |
WO2018106752A1 (en) * | 2016-12-06 | 2018-06-14 | Nissan North America, Inc. | Bandwidth constrained image processing for autonomous vehicles |
CN110545402A (en) * | 2019-08-18 | 2019-12-06 | 宁波职业技术学院 | underground monitoring video processing method, computer equipment and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1794818A (en) * | 2005-12-01 | 2006-06-28 | 西安交通大学 | Control method of high performance three-dimensional code rate in flexible video coding |
CN101031083A (en) * | 2007-04-04 | 2007-09-05 | 中山大学 | Method for realizing realtime video transmission by wavelet |
CN101127903A (en) * | 2007-09-29 | 2008-02-20 | 华南理工大学 | DCT-based resolution flexible image coding and decoding method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7848584B2 (en) * | 2005-09-08 | 2010-12-07 | Monro Donald M | Reduced dimension wavelet matching pursuits coding and decoding |
-
2014
- 2014-08-19 CN CN201410409829.7A patent/CN104135662B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1794818A (en) * | 2005-12-01 | 2006-06-28 | 西安交通大学 | Control method of high performance three-dimensional code rate in flexible video coding |
CN101031083A (en) * | 2007-04-04 | 2007-09-05 | 中山大学 | Method for realizing realtime video transmission by wavelet |
CN101127903A (en) * | 2007-09-29 | 2008-02-20 | 华南理工大学 | DCT-based resolution flexible image coding and decoding method |
Non-Patent Citations (1)
Title |
---|
基于H.264和小波变换的视频监控系统的设计与实现;刘辉;《东北师范大学硕士学位论文》;20140215;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN104135662A (en) | 2014-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9071841B2 (en) | Video transcoding with dynamically modifiable spatial resolution | |
US10321138B2 (en) | Adaptive video processing of an interactive environment | |
KR100781525B1 (en) | Method and apparatus for encoding and decoding FGS layers using weighting factor | |
JP5639687B2 (en) | Method and apparatus for encoding / decoding picture boundary coding unit | |
CN107040781A (en) | The Real-Time Video Encoder speed control switched using dynamic resolution | |
KR101266667B1 (en) | Dual-mode compression of images and videos for reliable real-time transmission | |
MX2007000810A (en) | Method and apparatus for encoder assisted-frame rate up conversion (ea-fruc) for video compression. | |
US8243117B2 (en) | Processing aspects of a video scene | |
US20140119456A1 (en) | Encoding video into lower resolution streams | |
CN107071422B (en) | Low complex degree HEVC rate adaption transformation coding method based on image correlation model | |
CN103442228B (en) | Code-transferring method and transcoder thereof in from standard H.264/AVC to the fast frame of HEVC standard | |
CN101663895B (en) | Video coding mode selection using estimated coding costs | |
US20180324453A1 (en) | Image coding/decoding method, device, and system | |
CN109905717A (en) | A kind of H.264/AVC Encoding Optimization based on Space-time domain down-sampling and reconstruction | |
CN106165420A (en) | For showing the system and method for the Pingdu detection of stream compression (DSC) | |
CN104135662B (en) | Improved H.264 compressed encoding method for video file under limited bandwidth and emission rate condition | |
CN107343202B (en) | Feedback-free distributed video coding and decoding method based on additional code rate | |
KR20130103140A (en) | Preprocessing method before image compression, adaptive motion estimation for improvement of image compression rate, and image data providing method for each image service type | |
CN106028029A (en) | Lossless video compression method and system for digital KVM switcher having low delay and low complexity | |
US9210424B1 (en) | Adaptive prediction block size in video coding | |
Tan et al. | Image compression algorithms based on super-resolution reconstruction technology | |
CN100586185C (en) | Mode selection method for transcoding 264 video to reduce resolving capability | |
CN102811349B (en) | No-feedback multi-description distributed video coder-decoder capable of adjusting redundancy automatically and coding method thereof | |
CN109660804A (en) | A kind of video frame coding/decoding method based on model selection | |
KR20140030535A (en) | Apparatus and method for encoding image, apparatus and method for decoding image |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |