CN104837019B - AVS to HEVC optimization video transcoding methods based on SVMs - Google Patents
AVS to HEVC optimization video transcoding methods based on SVMs Download PDFInfo
- Publication number
- CN104837019B CN104837019B CN201510215888.5A CN201510215888A CN104837019B CN 104837019 B CN104837019 B CN 104837019B CN 201510215888 A CN201510215888 A CN 201510215888A CN 104837019 B CN104837019 B CN 104837019B
- Authority
- CN
- China
- Prior art keywords
- avs
- hevc
- transcoding
- motion vector
- depth
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Abstract
A kind of AVS to HEVC optimization video transcoding methods based on SVMs, by the characteristic vector for gathering AVS code streams, and it is learnt using SVMs and obtains training pattern, the CU that the AVS characteristic vectors extracted are divided into the relevant position in HEVC divides or not divided two classes, whether need to divide with training pattern prediction CU in the transcoding stage, when obtain current CU need division when, carry out 2N × 2N patterns and SKIP mode computations respectively under current HEVC depth again, and select optimal prediction modes from both patterns, it need not be divided when prediction obtains current CU, then optimization model selection is carried out according to HEVC standard cataloged procedure.Present invention incorporates machine learning basic thought, whole transcoding process training stage and transcoding stage have been divided into, training pattern is obtained by study, predict the division of CU in HEVC, and combine fast mode decision algorithm, both the speed of transcoding had been improved, in turn ensure that the overall video quality of transcoding rear video.
Description
Technical field
It is specifically a kind of based on SVMs the present invention relates to a kind of technology of video signal processing field
AVS to HEVC optimizes video transcoding method.
Background technology
Video Transcoding Technology, it is that the code stream that will have been compressed obtains satisfactory target code stream by decoding re-encoding.With
The extensive use and fast development of multimedia technology and internet etc., is had become in the various video datas of transmission over networks existing
In the trend of network technical development, there has been various video coding standard, including MPEG-4, MPEG-2, H.264,
AVS, HEVC etc..Because video resource is varied, and the display capabilities of different terminal devices, storage capacity, to code stream
Disposal ability have differences, user is also not quite similar to the demand of video under different situations, therefore, how to realize height
The transcoding of effect, it is allowed to be adapted in different hardware equipment and network transmission environment, is paid close attention to extensively by industry always.HEVC
It is newest video encoding standard at present, compression efficiency improves about 50% than the standard such as H.264, and it will obtain more and more wider
General application.AVS is the standard of China's independent research, have with H.264 suitable coding efficiency and lower encoder complexity,
There is important influence power in field of video applications.Many AVS code streams compressed existing at present, it will grow in standards such as HEVC
Phase coexists, therefore realizes that AVS turns into important research direction to the Video Quality Metric between HEVC standard.
Machine learning is applied to multiple research fields, including artificial intelligence, machine translation, data mining, Text region with
And commercial field etc., with the development of Digital object identifier and transmission network etc., it be also constantly applied to video search,
Video analysis and video compile the research directions such as transcoding, and machine learning algorithm is applied to the application in video code conversion field at present
It is being on the increase.SVMs is an important method of machine learning, is extracted from input code flow front face framing
Decoded information etc., the corresponding relation for learning between AVS information and HEVC coding modes using SVMs, encoded below
During, the pattern that HEVC is encoded directly just is predicted, without being changed completely according to the AVS decoded information
The process of generation traversal, so as to reach the purpose for reducing Video Quality Metric complexity.Therefore, by using this machine of SVMs
Learning method, how to learn to obtain accurate training pattern, realize Accurate Prediction to reduce the complexity of Video Quality Metric, raising regards
The speed of frequency transcoding turns into an important topic of current research.
Found by the retrieval to prior art, Chinese patent literature CN104320667A discloses (bulletin) day
2015.01.28, a kind of more process optimizing coded systems, including several parallel encoders, prediction buffer and two are disclosed
The output end of secondary encoder, the input of buffer of looking forward to the prospect and parallel encoder connects, the output end for buffer of looking forward to the prospect with it is secondary
The input connection of encoder, and its method is disclosed, including the first coding stage, optimization selection stage and the second coding rank
3 steps of section, the first coding stage are encoded simultaneously by several parallel encoders, and prediction buffer is to the first coding stage
Resulting result carries out optimization selection to obtain optimum code path, and secondary coding device is obtained according to the optimization selection stage
Second of the optimum code path obtained coding, obtains final and optimal coding result.The technical performance, quality, bandwidth efficiency
Higher, coding/transcoding result is more preferable, is highly susceptible to configuring and very flexibly, can be not only used for the 4K and superelevation of high video quality
Clear application, it can also be used to the mobile video application of ultra high efficiency bandwidth.It is no to utilize but the input of the technology is source code flow
The coding information to be included during compressed bit stream is inputted, and for encoder more complicated HEVC, coding path is various, realizes multiple
Miscellaneous degree also can be relatively high.
The content of the invention
The present invention is directed to deficiencies of the prior art, proposes that a kind of AVS to HEVC based on SVMs is excellent
Change video transcoding method, simplified video code conversion framework is employed, in the characteristic vector that the training stage will extract from AVS code streams
Training obtains training pattern, and then carrying out region to the coding unit dividing condition in HEVC using the model in the transcoding stage turns
Code, in combination with fast mode decision algorithm, reduces recodification stage coding dividing elements and the complexity of model selection,
Video code conversion speed is substantially increased, while ensure that transcoded video Quality Down limited.
The present invention is achieved by the following technical solutions:
The present invention is learnt and instructed to it using SVMs by gathering the characteristic vector of AVS code streams
Practice model, the AVS characteristic vectors extracted are divided into the CU divisions of the relevant position in HEVC or do not divide two classes, in transcoding rank
Whether section needs to divide with training pattern prediction CU.
Described AVS characteristic vectors are gathered from AVS code streams, including:Macro-block coding pattern, motion vector and conversion coefficient etc.
Information.
Described macro-block coding pattern refers to:The pattern information of each macro block in AVS.
Described motion vector refers to:The average motion vector size of macro block in AVS.
Described conversion coefficient refers to:The number of nonzero coefficient in discrete cosine transform (DCT) coefficient of AVS codings.
Described corresponding relation refers to:Whether AVS characteristic vectors need division by learning with coding unit in HEVC
The training pattern arrived, namely mapping relations.
Described prediction, when obtaining current CU and needing division, then carry out 2N × 2N respectively under current HEVC depth
Pattern and SKIP mode computations, and optimal prediction modes are selected from both patterns, do not needed when prediction obtains current CU
Divided, then carry out optimization model selection according to HEVC standard cataloged procedure.
Technique effect
Compared with prior art, the present invention combine machine learning basic thought and SVMs learning method, to regarding
Frequency study obtains training pattern and predicts HEVC coding modes in follow-up transcoding process, so as to reduce transcoding complexity, improves
Transcoding speed.Whole transcoding process is divided into two stages, i.e. training stage and transcoding stage.In the training stage, AVS spies are extracted
The CU division informations of relevant position in vector sum HEVC are levied, are trained to obtain both correspondences according to algorithm of support vector machine and instrument
Relation, i.e. training pattern;The model obtained using the training stage, it is 0 to depth according to AVS characteristic vector in the transcoding stage
It is predicted with whether HEVC relevant positions CU when 1 divides, if prediction obtains current CU and needs to divide, to the CU of current depth
SKIP and 2Nx2N model selections are only carried out, if prediction obtains current CU and need not divided, according to HEVC standard process to optimal
Pattern is selected.Combined by machine learning and fast schema selection method, each sequence of basis itself that can be adaptive
Feature obtain corresponding training pattern, ensure transcoded video Quality Down it is limited in the case of, greatly reduce transcoding process
In computation complexity, so as to improve transcoding speed, save the transcoding time.
Brief description of the drawings
Fig. 1 is transcoding frame diagram of the present invention;
Fig. 2 is flow chart of the present invention.
Embodiment
Embodiments of the invention are elaborated below, the present embodiment is carried out lower premised on technical solution of the present invention
Implement, give detailed embodiment and specific operating process, but protection scope of the present invention is not limited to following implementation
Example.
Embodiment 1
As shown in Fig. 2 the present embodiment is divided into following four step:
Step 1: the characteristic vector of collection AVS code streams, is specially:It is with depth in corresponding HEVC in collection AVS code streams
The information that relevant position CU is divided when 0 and 1.
Described AVS characteristic vectors are gathered from AVS code streams, including:Macro-block coding pattern, motion vector and conversion coefficient etc.
Information.
Described macro-block coding pattern refers to:When depth is 0 in HEVC corresponding to AVS, include 16 in each CU
Macro block, therefore contain 16 features;When depth is 1, includes 4 macro blocks in each CU, there are 4 pattern features accordingly.
Described motion vector refers to:Motion vector is for unit with 8 × 8 in AVS, and 4 motions are included in a macro block
Vectorial elementary cell, each motion vector distinguish modulus, then using the average value of four motion vector moulds in a macro block as
One feature.Equally, 16 motion features are contained when depth is 0 in CU, depth contains 4 motion features when being 1.Motion vector
The average value v of mouldavgIt is equal toWherein N is for 8 × 8 size blocks that current coded unit is included
Number, vxAnd v (i)y(i) be respectively i-th of 8 × 8 pieces of motion vectors in coding unit horizontal component and vertical component.
Described conversion coefficient refers to:DCT is also for unit, by all DCT coefficients in a macro block with 8 × 8 pieces in AVS
The number of middle nonzero coefficient is as a feature, and therefore, depth contains 16 DCT coefficient features when being 0, and depth contains 4 when being 1
Individual DCT coefficient feature.
Step 2: the AVS characteristic vectors obtained using SVMs to step 1 are learnt and obtain training pattern,
For realize to HEVC depth be 0 and 1 when coding unit classify, the AVS characteristic vectors extracted are divided into two classes, i.e.,
The CU of relevant position is divided or not divided in HEVC.
Step 3: directly current CU is made whether using the training pattern obtained in step 2 to need the prediction divided,
When prediction result for that need not divide, then division terminates and no longer carries out further model selection, otherwise will enter HEVC
Next depth iteration and perform step 4.
Step 4: carry out 2N × 2N patterns and SKIP mode computations respectively under current HEVC depth, and from both
Optimal prediction modes are selected in pattern, need not be divided when prediction obtains current CU, then it is encoded according to HEVC standard
Cheng Jinhang optimization models select, and so as to reduce the computation complexity in mode selection processes, improve whole transcoding process
Transcoding speed.
Described 2N × 2N mode computations refer to:Represent that current coded unit can not be divided again;
Described SKIP mode computations refer to:Represent that current coding macro block does not need coded prediction residual information;
Described optimal prediction modes refer to:Rate distortion is carried out to 2N × 2N and SKIP patterns to compare, the rate selected is lost
The minimum coding mode of true cost (rate-distortion cost);
Described optimization model selection refers to:The comparison of rate distortion is carried out to coding mode all in HEVC, so as to select
Select out the minimum process of rate distortion costs.
In summary, advantage of this embodiment is that:
1) feature of video itself is combined, each sequence can train to obtain the model suitable for current sequence.
2) computation complexity for determining CU optimal dividings in transcoding in HEVC cataloged procedures is predicted according to training pattern, so as to
In the case where ensureing predictablity rate, transcoding computation complexity is greatly reduced, saves the transcoding time, improves transcoding effect
Rate.
3) fast mode decision algorithm is combined, unnecessary calculating process is further reduced, improves transcoding speed.
Algorithm proposed by the present invention is realized on the basis of HEVC identifying codes HM10.1.By AVS to HEVC complete solutions
Full transcoder of compiling is compared as standard transcoder, by comparing three standards come the validity of assessment algorithm, i.e. Δ T, Δ
PSNR and Δ BR.Calculation is as follows:
Δ PSNR=PSNRref-PSNRprop
Δ T, Δ PSNR and Δ BR represent scramble time percentage, PSNR decreasing value and the code check growth rate saved respectively.
Experimental result is as shown in table 1.
AVS to HEVC transcoding algorithm experimental result of the table 1 based on SVMs.
By test result in table 1 can be seen that based on the transcoding algorithm of SVMs for 1920 × 1080 sequence
Row, transcoding speed can be made, which averagely to improve 63.68%, PSNR, averagely reduces 0.083dB, while code check rises only
1.82%, for 1280 × 720 sequence so that transcoding speed averagely improves 71.78%, PSNR and averagely reduced
0.09dB, code check rise 1.55%.Test result indicates that inventive algorithm while video code conversion speed is greatly improved, is protected
Limited video quality has been demonstrate,proved to decline.
Claims (4)
1. a kind of AVS to HEVC optimization video transcoding methods based on SVMs, it is characterised in that by gathering AVS codes
The characteristic vector of stream, and it is learnt using SVMs and obtains training pattern, the AVS characteristic vectors that will be extracted
It is divided into the CU divisions of the relevant position in HEVC or does not divide two classes, whether needs to draw with training pattern prediction CU in the transcoding stage
Point, when obtaining current CU and needing division, then carry out 2N × 2N patterns and SKIP pattern meters respectively under current HEVC depth
Calculate, and select optimal prediction modes from both patterns, need not be divided when prediction obtains current CU, then according to
HEVC standard cataloged procedure carries out optimization model selection;
Described AVS characteristic vectors are gathered from AVS code streams, including:Macro-block coding pattern, motion vector and conversion coefficient;
Described optimal prediction modes refer to:Rate distortion is carried out to 2N × 2N and SKIP patterns to compare, the rate distortion generation selected
The minimum coding mode of valency.
2. according to the method for claim 1, it is characterized in that, described macro-block coding pattern refers to:Each macro block in AVS
Pattern information;Described motion vector refers to:The average motion vector size of macro block in AVS;Described conversion coefficient refers to:
The number of nonzero coefficient in the discrete cosine transform coefficient of AVS codings.
3. method according to claim 1 or 2, it is characterized in that, described macro-block coding pattern refers to:Corresponding to AVS
When depth is 0 in HEVC, include 16 macro blocks in each CU, therefore contain 16 features;When depth is 1, in each CU
Include 4 macro blocks, there are 4 pattern features accordingly;Described motion vector refers to:Motion vector is to be with 8 × 8 in AVS
Unit, 4 motion vector elementary cells are included in a macro block, each motion vector distinguishes modulus, then by a macro block
The average value of four motion vector moulds contains 16 motion features, when depth is 1 when depth is 0 as a feature in CU
Contain 4 motion features;Described conversion coefficient refers to:DCT is also for unit, by institute in a macro block with 8 × 8 pieces in AVS
There is the number of nonzero coefficient in DCT coefficient as a feature, contain 16 DCT coefficient features, depth 1 when depth is 0
4 DCT coefficient features of Shi Hanyou.
4. according to the method for claim 1, it is characterized in that, the selection of described optimization model refers to:To all in HEVC
Coding mode carries out the comparison of rate distortion, so as to select the minimum process of rate distortion costs.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510215888.5A CN104837019B (en) | 2015-04-30 | 2015-04-30 | AVS to HEVC optimization video transcoding methods based on SVMs |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510215888.5A CN104837019B (en) | 2015-04-30 | 2015-04-30 | AVS to HEVC optimization video transcoding methods based on SVMs |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104837019A CN104837019A (en) | 2015-08-12 |
CN104837019B true CN104837019B (en) | 2018-01-02 |
Family
ID=53814621
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510215888.5A Expired - Fee Related CN104837019B (en) | 2015-04-30 | 2015-04-30 | AVS to HEVC optimization video transcoding methods based on SVMs |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104837019B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105306947B (en) * | 2015-10-27 | 2018-08-07 | 中国科学院深圳先进技术研究院 | video transcoding method based on machine learning |
CN105430391B (en) * | 2015-12-04 | 2018-11-16 | 上海交通大学 | The intraframe coding unit fast selecting method of logic-based recurrence classifier |
CN106791828B (en) * | 2016-12-21 | 2019-05-07 | 四川大学 | Video transcoding method and its transcoder based on machine learning |
CN107071479B (en) * | 2017-04-28 | 2020-06-05 | 南京理工大学 | 3D video depth image prediction mode selection method based on viewpoint correlation |
CN107071496B (en) * | 2017-05-14 | 2020-01-17 | 北京工业大学 | Fast depth selection method for H.265/HEVC inter-frame coding unit |
CN108184114B (en) * | 2018-01-06 | 2021-08-31 | 北京航空航天大学 | Method for rapidly judging Intra prediction mode in P frame based on Support Vector Machine (SVM) |
CN108769736B (en) * | 2018-05-24 | 2019-09-17 | 重庆瑞景信息科技有限公司 | The foundation and parameter determination method of video code conversion code rate decision model towards display |
CN111953988B (en) * | 2020-07-15 | 2022-08-09 | 广州柯维新数码科技有限公司 | Video transcoding method, computer device and storage medium |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102025999A (en) * | 2010-12-31 | 2011-04-20 | 北京工业大学 | Video transcoding fast intra-frame predicating method based on support vector machine |
CN103248893A (en) * | 2013-05-10 | 2013-08-14 | 四川大学 | Method for quick inter-frame transcoding from H. 264/AVC standard to HEVC standard and transcoder thereof |
CN103475880A (en) * | 2013-09-11 | 2013-12-25 | 浙江大学 | Method for low-complexity video transcoding from H.264 to HEVC based on statistic analysis |
CN103491334A (en) * | 2013-09-11 | 2014-01-01 | 浙江大学 | Video transcode method from H264 to HEVC based on region feature analysis |
CN103796023A (en) * | 2014-03-05 | 2014-05-14 | 武汉拓宝电子系统有限公司 | H.265/HEVC transcoding method and transcoder based on H.264/AVC macro block structure and texture |
WO2014137268A1 (en) * | 2013-03-07 | 2014-09-12 | Telefonaktiebolaget L M Ericsson (Publ) | Video transcoding |
CN104079937A (en) * | 2014-07-16 | 2014-10-01 | 山东大学 | H.264-to-HEVC fast inter-frame transcoding method and device based on motion vector analysis |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015031891A2 (en) * | 2013-08-30 | 2015-03-05 | ARRIS Enterprises, Inc | Multipass encoder with heterogeneous codecs |
-
2015
- 2015-04-30 CN CN201510215888.5A patent/CN104837019B/en not_active Expired - Fee Related
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102025999A (en) * | 2010-12-31 | 2011-04-20 | 北京工业大学 | Video transcoding fast intra-frame predicating method based on support vector machine |
WO2014137268A1 (en) * | 2013-03-07 | 2014-09-12 | Telefonaktiebolaget L M Ericsson (Publ) | Video transcoding |
CN103248893A (en) * | 2013-05-10 | 2013-08-14 | 四川大学 | Method for quick inter-frame transcoding from H. 264/AVC standard to HEVC standard and transcoder thereof |
CN103475880A (en) * | 2013-09-11 | 2013-12-25 | 浙江大学 | Method for low-complexity video transcoding from H.264 to HEVC based on statistic analysis |
CN103491334A (en) * | 2013-09-11 | 2014-01-01 | 浙江大学 | Video transcode method from H264 to HEVC based on region feature analysis |
CN103796023A (en) * | 2014-03-05 | 2014-05-14 | 武汉拓宝电子系统有限公司 | H.265/HEVC transcoding method and transcoder based on H.264/AVC macro block structure and texture |
CN104079937A (en) * | 2014-07-16 | 2014-10-01 | 山东大学 | H.264-to-HEVC fast inter-frame transcoding method and device based on motion vector analysis |
Non-Patent Citations (1)
Title |
---|
H.264到HEVC视频转码技术研究;蒋炜;《浙江大学博士学位论文》;20140917;第36-38页,第45-47页,第67页 * |
Also Published As
Publication number | Publication date |
---|---|
CN104837019A (en) | 2015-08-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104837019B (en) | AVS to HEVC optimization video transcoding methods based on SVMs | |
CN105359531B (en) | Method and system for determining for the coder side of screen content coding | |
CN104221377B (en) | A kind of video decoding method, device and computer-readable storage medium | |
CN104038764B (en) | A kind of H.264 arrive video transcoding method H.265 and transcoder | |
CN101610413B (en) | Video coding/decoding method and device | |
CN104796694B (en) | Optimization intraframe video coding method based on video texture information | |
CN102223526B (en) | Method and related device for coding and decoding image | |
CN102665077A (en) | Rapid and efficient encoding-transcoding method based on macro block classification | |
CN106559669A (en) | The method and device of image prediction | |
CN107566848A (en) | The method and device of encoding and decoding | |
CN101061515A (en) | Coding scheme for a data stream representing a temporally varying graphics model | |
CN103248893A (en) | Method for quick inter-frame transcoding from H. 264/AVC standard to HEVC standard and transcoder thereof | |
CN101068355A (en) | MPEG2-H.264 code fast converting method | |
CN103782597A (en) | Method and apparatus for coding video and method and apparatus for decoding video accompanied with arithmetic coding | |
CN103369315B (en) | The coding of the intra prediction mode of chromatic component, coding/decoding method, equipment and system | |
CN104883565A (en) | Decision-making method and device for intra-frame prediction mode of high efficiency video coding | |
CN104702958A (en) | HEVC intraframe coding method and system based on spatial correlation | |
CN102804772A (en) | Image encoding/decoding apparatus and method using multi-dimensional integer transform | |
CN104702957A (en) | Motion vector compression method and device | |
CN103873864A (en) | Object flag bit efficient encoding method applied to video object retrieval | |
CN103561270A (en) | Coding control method and device for HEVC | |
CN103888770B (en) | A kind of video code conversion system efficiently and adaptively based on data mining | |
CN106791828A (en) | High performance video code-transferring method and its transcoder based on machine learning | |
CN101588487B (en) | Video intraframe predictive coding method | |
CN109819260A (en) | Video steganography method and device based on the fusion of multi-embedding domain |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
EXSB | Decision made by sipo to initiate substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20180102 Termination date: 20200430 |
|
CF01 | Termination of patent right due to non-payment of annual fee |