CN104811722B - A kind of decoding method and device of video data - Google Patents

A kind of decoding method and device of video data Download PDF

Info

Publication number
CN104811722B
CN104811722B CN201510180497.4A CN201510180497A CN104811722B CN 104811722 B CN104811722 B CN 104811722B CN 201510180497 A CN201510180497 A CN 201510180497A CN 104811722 B CN104811722 B CN 104811722B
Authority
CN
China
Prior art keywords
information
frame
processed
mode
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510180497.4A
Other languages
Chinese (zh)
Other versions
CN104811722A (en
Inventor
宋锦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201510180497.4A priority Critical patent/CN104811722B/en
Publication of CN104811722A publication Critical patent/CN104811722A/en
Priority to PCT/CN2016/079034 priority patent/WO2016165603A1/en
Application granted granted Critical
Publication of CN104811722B publication Critical patent/CN104811722B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • H04N19/463Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The embodiment of the invention discloses a kind of coding methods of video data, comprising: obtains the specify information of the frame to be processed in video data, the specify information includes: at least one of frame per second information, time complexity information and spatial complexity information;If the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, then frame to be processed is pre-processed respectively according to the first coding mode and the second coding mode, wherein, the first coding mode and the second coding mode are different coding mode;According to the pre-processed results of the pre-processed results of the first coding mode and the second coding mode, the selection target coding mode from the first coding mode and the second coding mode, and the frame to be processed is encoded according to the target code mode.The embodiment of the invention also discloses a kind of code devices of video data.Using the present invention, the advantages of there is the quality for guaranteeing video data processing, enhance the user experience of video data encoding.

Description

A kind of decoding method and device of video data
Technical field
The present invention relates to field of communication technology more particularly to the decoding methods and device of a kind of video data.
Background technique
During current video codec, video data will be become by processes such as prediction, transformation, quantization and entropy codings At code stream, the redundancy between data is eliminated, improves the efficiency of transmission of video data.Switch technology is a kind of in frame rate Low frame-rate video is transformed into high frame per second and regarded by Video post-processing technology by way of being inserted into intermediate frame in original video frame Frequently, better visual quality is provided.Multi-reference frame technology is to transport under multi-reference frame mode to a macro block or sub-block When dynamic compensation, coding can select a frame as reference frame from several past encoded frames, seek current coding macro block or The best matching blocks of sub-block, to obtain better prediction effect.When many objects have masking, multi-reference frame draws Enter to can be improved code efficiency.
Coding side does not do the selection of the processing mode of video frame in the video data decoding method of the prior art, is not required to yet The video datas such as the processing mode of video frame are transmitted to decoding end.Decoding end is based on one or several kinds of adaptation rules, choosing Carrying out to selecting property the operation converted in frame rate, adaptation rule may include the severity of movement, the model selection of coding, The texture structure etc. of video content.If Fig. 1, Fig. 1 are the behaviour that decoding end adaptively carries out video frame processing in the prior art Make mode.When decoding end handles the data of present frame, the motion vector information of statistics available previous decoded frame judges above-mentioned solved Whether the average length of the motion vector of code frame is more than threshold value.If the average length of motion vector is more than threshold value, repeat previous Frame is inserted into before present frame, and then can then handle next frame.If the average length of motion vector is less than threshold value, then Interpolated frame is obtained with transfer algorithm in frame rate, before interpolated frame is inserted into present frame, and then can then handle next frame.
Decoding end uses adaptive method (processing mode that video data can not be known from coding side) in the prior art And then not can guarantee that the processing mode of not video data is generated to judge by accident and determine, it not can guarantee for interpolation algorithm (in such as frame rate Transfer algorithm) applicability.The prior art is using simple interpolation method not to motion conditions (including violent and gentle fortune Emotionally condition) it distinguishes, there is no particular requirement to the content of the reference frame of falling in lines of reference frame lists, so that in high frame-rate video source In the case where, it is unable to give full play the performance of reference frame algorithm, not can guarantee video quality.
Summary of the invention
The embodiment of the present invention provides the decoding method and device of a kind of video data, and the more of coding mode selection can be improved Sample and flexibility guarantee the processing quality of video data, enhance the user experience of video data encoding.
First aspect of the embodiment of the present invention provides a kind of coding method of video data, can include:
The specify information of the frame to be processed in video data is obtained, the specify information includes: frame per second information, time complexity Spend at least one of information and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath;
If the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, according to the One coding mode and the second coding mode respectively pre-process the frame to be processed, wherein first coding mode and Second coding mode is different coding mode;
According to the pre-processed results of the pre-processed results of first coding mode and second coding mode, from described Selection target coding mode in first coding mode and second coding mode, and according to the target code mode to described Frame to be processed is encoded.
With reference to first aspect, in the first possible implementation, described to designate the information as frame per second information;
The specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, comprising:
Frame per second indicated by the frame per second information is greater than preset frame per second threshold value.
With reference to first aspect, in the second possible implementation, described to designate the information as time complexity information;
The specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, comprising:
Time complexity indicated by the time complexity information is greater than preset time complexity threshold value.
With reference to first aspect, in the third possible implementation, described to designate the information as spatial complexity information;
The specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, comprising:
Space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold.
It is any in the third possible implementation to first aspect with reference to first aspect, in the 4th kind of possible realization It is described that the frame to be processed is pre-processed respectively according to the first coding mode and the second coding mode in mode, comprising:
The first video quality processing is carried out to the frame to be processed according to first coding mode to obtain to meet and specify The video data of video quality index, wherein the pre-processed results of first coding mode include first video quality The spent first coding data amount size of processing;
The second video quality processing is carried out to obtain described in satisfaction to the frame to be processed according to second coding mode The video data of designated quality index, wherein the pre-processed results of second coding mode include second video Second amount of coded data size spent by quality treatment.
4th kind of possible implementation with reference to first aspect, it is in a fifth possible implementation, described according to institute State the pre-processed results of the first coding mode and the pre-processed results of second coding mode, from first coding mode and Selection target coding mode in second coding mode, comprising:
Judge whether the first coding data amount size is less than the second amount of coded data size, if judging result is Be, then by first coding mode selection be target code mode, if judging result be it is no, by second coding mode It is selected as target code mode.
It is any in the third possible implementation to first aspect with reference to first aspect, in the 6th kind of possible realization It is described that the frame to be processed is pre-processed respectively according to the first coding mode and the second coding mode in mode, comprising:
The first video quality processing is carried out to the frame to be processed according to first coding mode to obtain to expend and specify Video data in the case where amount of coded data size, wherein the pre-processed results of first coding mode include described First quality index of the video data that one video quality is handled;
The second video quality processing is carried out to obtain described in consuming to the frame to be processed according to second coding mode Video data in the case where prescribed coding data volume size, wherein the pre-processed results of second coding mode include institute State the second quality index of the video data that the second video quality is handled.
6th kind of possible implementation with reference to first aspect, it is described according to institute in the 7th kind of possible implementation State the pre-processed results of the first coding mode and the pre-processed results of second coding mode, from first coding mode and Selection target coding mode in second coding mode, comprising:
Judge whether first quality index is higher than second quality index, it if the determination result is YES, then will be described First coding mode selection is target code mode, is target by second coding mode selection if judging result is no Coding mode.
The 6th kind of possible implementation of 4th kind of possible implementation and first aspect with reference to first aspect, the 8th In kind possible implementation, the pre-processed results according to first coding mode and second coding mode it is pre- Processing result, the selection target coding mode from first coding mode and second coding mode, comprising:
Obtain the first coding data amount size and the second amount of coded data size and first quality index and Second quality index;
Judge the ratio of first quality index Yu the first coding data amount size, if be higher than second matter The ratio of figureofmerit and the second amount of coded data size;
If the determination result is YES, then by first coding mode selection be target code mode, if judging result be it is no, It is then target code mode by second coding mode selection.
6th kind of possible implementation with reference to first aspect, it is described according to institute in the 9th kind of possible implementation State the pre-processed results of the first coding mode and the pre-processed results of second coding mode, from first coding mode and Selection target coding mode in second coding mode, comprising:
Judge whether the frame to be processed is video frame in monitor video data or feature video data;
If the determination result is YES, then first quality index and second quality index are obtained;
Judge whether first quality index is higher than second quality index, it if the determination result is YES, then will be described First coding mode selection is target code mode.
With reference to first aspect, described from first coding mode and described in the tenth kind of possible implementation In two coding modes after selection target coding mode, the method also includes:
The target code mode is marked by the way of index, to the index information of the target code mode It is encoded to obtain the identification information in a manner of the target code, and the identification information is passed into solution in the form of code stream Code end;Or
Use the physical quantity of characterization time-domain information that the target code mode is marked to obtain the target code The identification information of mode, and decoding end is passed to using the identification information of the target code mode as control information;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
Tenth kind of possible implementation with reference to first aspect, in a kind of the tenth possible implementation, the basis After the target code mode encodes the frame to be processed, the method also includes:
Judge whether the target code mode is video encoding standard technology;
If the target code mode is the video encoding standard technology, the frame to be processed that processing obtains is added Enter the reference frame lists, to generate the corresponding reference frame lists of next frame to be processed.
Second aspect of the embodiment of the present invention provides a kind of coding/decoding method of video data, can include:
The specify information of the frame to be processed in video data is obtained, the specify information includes: frame per second information, time complexity Spend at least one of information and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath;
If the specify information meets the preset condition that starting target decoder mode handles the frame to be processed, according to institute The identification information carried in the code stream information of frame to be processed is stated, determines the target decoder mode of the frame to be processed;
The auxiliary information being decoded to the frame to be processed is determined according to the target decoder mode, according to the target Decoding process and the auxiliary information decode the frame to be processed.
It is in the first possible implementation, described to designate the information as frame per second information in conjunction with second aspect;
The specify information meets the preset condition that starting target decoder mode handles the frame to be processed, comprising:
Frame per second indicated by the frame per second information is greater than preset frame per second threshold value.
It is in the second possible implementation, described to designate the information as time complexity information in conjunction with second aspect;
The specify information meets the preset condition that starting target decoder mode handles the frame to be processed, comprising:
Time complexity indicated by the time complexity information is greater than preset time complexity threshold value.
It is in the third possible implementation, described to designate the information as spatial complexity information in conjunction with second aspect;
The specify information meets the preset condition that starting target decoder mode handles the frame to be processed, comprising:
Space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold.
In conjunction with any in the third possible implementation of second aspect to second aspect, in the 4th kind of possible realization In mode, the identification information carried in the code stream information according to the frame to be processed determines the target of the frame to be processed Decoding process, comprising:
The code stream information of the frame to be processed is decoded, obtain carried in the code stream information for marking target The index information of coding mode determines the target code mode according to the index information;
According to the corresponding relationship of preset coding mode and decoding process, in conjunction with the target code mode determine described in Handle the target decoder mode of frame.
In conjunction with any in the third possible implementation of second aspect to second aspect, in the 5th kind of possible realization In mode, the identification information carried in the code stream information according to the frame to be processed determines the target of the frame to be processed Decoding process, comprising:
The code stream information of the frame to be processed is decoded, whether obtains the identification information that carries in the code stream information Include image display information or timestamp;
If in the identification information including image display information or timestamp, it is determined that the target code mode is institute Video encoding standard technology is stated, and according to the corresponding relationship of preset coding mode and decoding process, determines the Video coding The corresponding target decoder mode of standard technique;
If there is no image display information or timestamp in the identification information, it is determined that the target code mode is institute Switch technology or the resolution ratio zoom technology in frame rate are stated, and according to the correspondence of preset coding mode and decoding process Relationship determines that the corresponding target decoder mode of switch technology or the resolution ratio zoom technology are corresponding in the frame rate Target decoder mode.
In conjunction with the 5th kind of possible implementation of the 4th kind of possible implementation of second aspect or second aspect, It is described that the auxiliary being decoded to the frame to be processed is determined according to the target decoder mode in six kinds of possible implementations Information, comprising:
When the target decoder mode is the corresponding decoding process of the video encoding standard technology, obtain to it is described to The auxiliary information that processing frame is decoded, the auxiliary information are the corresponding residual error data information of the frame to be processed and control head Information;
When target decoder mode decoding process corresponding for switch technology in frame rate, obtain to described to be processed The auxiliary information that frame is decoded, the auxiliary information are sky;
When target decoder mode decoding process corresponding for resolution ratio zoom technology, obtain to the frame to be processed The auxiliary information being decoded, the auxiliary information be the frame to be processed zoom in and out the corresponding residual error data information of processing and Control head information.
It is described according to institute in the 7th kind of possible implementation in conjunction with the 6th kind of possible implementation of second aspect It states target decoder mode and determines the auxiliary information that is decoded to the frame to be processed, according to the target decoder mode and described After auxiliary information decodes the frame to be processed, the method also includes:
Judge whether the target decoder mode is the corresponding decoding process of the video encoding standard technology;
If the target decoder mode is the corresponding decoding process of the video encoding standard technology, processing is obtained The reference frame lists are added in the frame to be processed, to generate the corresponding reference frame lists of next frame to be processed.
The third aspect of the embodiment of the present invention provides a kind of code device of video data, can include:
Module is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame per second At least one of information, time complexity information and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath;
Preprocessing module, the specify information for obtaining in the acquisition module meet at a variety of coding modes of starting When managing the preset condition of the frame to be processed, according to the first coding mode and the second coding mode respectively to the frame to be processed into Row pretreatment, wherein first coding mode and second coding mode are different coding mode;
Processing module, the pre-processed results of first coding mode for being handled according to the preprocessing module and institute The pre-processed results for stating the second coding mode, selection target encodes from first coding mode and second coding mode Mode, and the frame to be processed is encoded according to the target code mode.
It is in the first possible implementation, described to designate the information as frame per second information in conjunction with the third aspect;
The preprocessing module, is specifically used for:
Judge whether frame per second indicated by the frame per second information is greater than preset frame per second threshold value, if the determination result is YES, then It determines that the specify information meets and starts the preset condition that a variety of coding modes handle the frame to be processed.
It is in the second possible implementation, described to designate the information as time complexity information in conjunction with the third aspect;
The preprocessing module, is specifically used for:
Judge whether time complexity indicated by the time complexity information is greater than preset time complexity threshold value, If the determination result is YES, it is determined that the specify information, which meets, starts the default item that a variety of coding modes handle the frame to be processed Part.
It is in the third possible implementation, described to designate the information as spatial complexity information in conjunction with the third aspect;
The preprocessing module, is specifically used for:
Judge whether space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold, if Judging result is yes, it is determined that the specify information, which meets, starts the default item that a variety of coding modes handle the frame to be processed Part.
In conjunction with any in the third possible implementation of the third aspect to the third aspect, in the 4th kind of possible realization In mode, the preprocessing module is specifically used for:
The first video quality processing is carried out to the frame to be processed according to first coding mode to obtain to meet and specify The video data of video quality index, wherein the pre-processed results of first coding mode include first video quality The spent first coding data amount size of processing;
The second video quality processing is carried out to obtain described in satisfaction to the frame to be processed according to second coding mode The video data of designated quality index, wherein the pre-processed results of second coding mode include second video Second amount of coded data size spent by quality treatment.
In conjunction with the 4th kind of possible implementation of the third aspect, in a fifth possible implementation, the processing mould Block is specifically used for:
Judge whether the first coding data amount size that the preprocessing module is handled is less than second volume First coding mode selection is then if the determination result is YES target code mode, if judging result by code data volume size It is no, then is target code mode by second coding mode selection.
In conjunction with any in the third possible implementation of the third aspect to the third aspect, in the 6th kind of possible realization In mode, the preprocessing module is specifically used for:
The first video quality processing is carried out to the frame to be processed according to first coding mode to obtain to expend and specify Video data in the case where amount of coded data size, wherein the pre-processed results of first coding mode include described First quality index of the video data that one video quality is handled;
The second video quality processing is carried out to obtain described in consuming to the frame to be processed according to second coding mode Video data in the case where prescribed coding data volume size, wherein the pre-processed results of second coding mode include institute State the second quality index of the video data that the second video quality is handled.
In conjunction with the 6th kind of possible implementation of the third aspect, in the 7th kind of possible implementation, the processing mould Block is specifically used for:
Whether first quality index for judging that the preprocessing module is handled is higher than second quality index, If the determination result is YES, then by first coding mode selection be target code mode, if judging result be it is no, will described in Second coding mode selection is target code mode.
In conjunction with the 6th kind of possible implementation of the 4th kind of possible implementation of the third aspect and the third aspect, the 8th In the possible implementation of kind, the processing module is specifically used for:
The first coding data amount size and the second amount of coded data size of the preprocessing module processing are obtained, with And first quality index and second quality index;
Judge the ratio of first quality index Yu the first coding data amount size, if be higher than second matter The ratio of figureofmerit and the second amount of coded data size;
If the determination result is YES, then by first coding mode selection be target code mode, if judging result be it is no, It is then target code mode by second coding mode selection.
In conjunction with the 6th kind of possible implementation of the third aspect, in the 9th kind of possible implementation, the processing mould Block is specifically used for:
Judge whether the frame to be processed is video frame in monitor video data or feature video data;
If the determination result is YES, then first quality index and second matter of the preprocessing module processing are obtained Figureofmerit;
Judge whether first quality index is higher than second quality index, it if the determination result is YES, then will be described First coding mode selection is target code mode.
In conjunction with the third aspect, in the tenth kind of possible implementation of the third aspect, the code device further includes label Module, the mark module are used for:
The target code mode is marked by the way of index, to the index information of the target code mode It is encoded to obtain the identification information in a manner of the target code, and the identification information is passed into solution in the form of code stream Code end;Or
Use the physical quantity of characterization time-domain information that the target code mode is marked to obtain the target code The identification information of mode, and decoding end is passed to using the identification information of the target code mode as control information;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
In conjunction with the tenth kind of possible implementation of the third aspect, in a kind of the tenth possible implementation, the coding Device, further includes:
Reference frame judgment module, for judging the target code mode that the processing module selects whether for video volume Code standard technique obtains processing described to be processed if the target code mode is the video encoding standard technology The reference frame lists are added in frame, to generate the corresponding reference frame lists of next frame to be processed.
Fourth aspect of the embodiment of the present invention provides a kind of decoding apparatus of video data, can include:
Module is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame per second At least one of information, time complexity information and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath;
Determining module, the specify information for obtaining in the acquisition module meet starting target decoder mode and handle When the preset condition of the frame to be processed, according to the identification information carried in the code stream information of the frame to be processed, determine described in The target decoder mode of frame to be processed;
Processing module, the target decoder mode for being determined according to the determining module are determined to the frame to be processed The auxiliary information being decoded decodes the frame to be processed according to the target decoder mode and the auxiliary information.
It is in the first possible implementation, described to designate the information as frame per second information in conjunction with fourth aspect;
The determining module, is specifically used for:
Judge whether frame per second indicated by the frame per second information is greater than preset frame per second threshold value, if the determination result is YES, then Determine that the specify information meets the preset condition that starting target decoder mode handles the frame to be processed.
It is in the second possible implementation, described to designate the information as time complexity information in conjunction with fourth aspect;
The determining module, is specifically used for:
Judge whether time complexity indicated by the time complexity information is greater than preset time complexity threshold value, If the determination result is YES, it is determined that the specify information meets the default item that starting target decoder mode handles the frame to be processed Part.
It is in the third possible implementation, described to designate the information as spatial complexity information in conjunction with fourth aspect;
The determining module, is specifically used for:
Judge whether space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold, if Judging result is yes, it is determined that the specify information meets the default item that starting target decoder mode handles the frame to be processed Part.
In conjunction with any in the third possible implementation of fourth aspect to fourth aspect, in the 4th kind of possible realization In mode, the determining module is specifically used for:
The code stream information of the frame to be processed is decoded, obtain carried in the code stream information for marking target The index information of coding mode determines the target code mode according to the index information;
According to the corresponding relationship of preset coding mode and decoding process, in conjunction with the target code mode determine described in Handle the target decoder mode of frame.
In conjunction with any in the third possible implementation of fourth aspect to fourth aspect, in the 5th kind of possible realization In mode, the determining module is specifically used for:
The code stream information of the frame to be processed is decoded, whether obtains the identification information that carries in the code stream information Include image display information or timestamp;
If in the identification information including image display information or timestamp, it is determined that the target code mode is institute Video encoding standard technology is stated, and according to the corresponding relationship of preset coding mode and decoding process, determines the Video coding The corresponding target decoder mode of standard technique;
If there is no image display information or timestamp in the identification information, it is determined that the target code mode is institute Switch technology or the resolution ratio zoom technology in frame rate are stated, and according to the correspondence of preset coding mode and decoding process Relationship determines that the corresponding target decoder mode of switch technology or the resolution ratio zoom technology are corresponding in the frame rate Target decoder mode.
In conjunction with the 5th kind of possible implementation of the 4th kind of possible implementation of fourth aspect or fourth aspect, In six kinds of possible implementations, the processing module is specifically used for:
When the target decoder mode that the determining module determines is the corresponding decoding of the video encoding standard technology When mode, the auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is that the frame to be processed is corresponding Residual error data information and control head information;
When the target decoder mode that the determining module determines is the corresponding decoding process of switch technology in frame rate When, the auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is sky;
When the target decoder mode that the determining module determines decoding process corresponding for resolution ratio zoom technology, The auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is that the frame to be processed zooms in and out processing pair The residual error data information and control head information answered.
In conjunction with the 6th kind of possible implementation of fourth aspect, in the 7th kind of possible implementation, the decoding dress It sets further include:
Reference frame judgment module, whether the target decoder mode for judging that the determining module determines is the view The corresponding decoding process of frequency coding standard technology;
If the target decoder mode is the corresponding decoding process of the video encoding standard technology, processing is obtained The reference frame lists are added in the frame to be processed, to generate the corresponding reference frame lists of next frame to be processed.
The 5th aspect of the embodiment of the present invention provides a kind of coding/decoding system of video data, comprising: the above-mentioned third aspect The decoding apparatus that the code device of offer and above-mentioned fourth aspect provide.
The embodiment of the present invention can be according to the specify information of the frame to be processed of video data, it is determined whether uses a variety of coding staffs Formula handles frame to be processed, and then can be carried out according to above-mentioned first coding mode and the second coding mode to frame to be processed pre- The pre-processed results of processing, which determine, selects specific any coding mode to encode for target code mode to frame to be processed.This Inventive embodiments can select a kind of specific coding mode to carry out coded treatment to frame to be processed from a variety of coding modes, improve The video processing quality of the frame to be processed of video data enhances the user experience of video data processing.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with It obtains other drawings based on these drawings.
Fig. 1 is the schematic diagram that decoding end adaptively carries out video frame processing in the prior art;
Fig. 2 is the first embodiment flow diagram of the coding method of video data provided in an embodiment of the present invention;
Fig. 3 is the second embodiment flow diagram of the coding method of video data provided in an embodiment of the present invention;
Fig. 4 is the 3rd embodiment flow diagram of the coding method of video data provided in an embodiment of the present invention;
Fig. 5 is the embodiment flow diagram of the coding/decoding method of video data provided in an embodiment of the present invention;
Fig. 6 is the schematic structural diagram of the first embodiment of the code device of video data provided in an embodiment of the present invention;
Fig. 7 is the schematic structural diagram of the second embodiment of the code device of video data provided in an embodiment of the present invention;
Fig. 8 is the 3rd embodiment structural schematic diagram of the code device of video data provided in an embodiment of the present invention;
Fig. 9 is a structural schematic diagram of the embodiment of the decoding apparatus of video data provided in an embodiment of the present invention;
Figure 10 is another structural schematic diagram of the embodiment of the decoding apparatus of video data provided in an embodiment of the present invention;
Figure 11 is the structural schematic diagram of the embodiment of the coding/decoding system of video data provided in an embodiment of the present invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
The coding method of video data described in the embodiment of the present invention and coding/decoding method can wait locating according to video data Managing the frame per second of frame, perhaps the spatial complexity informations such as image complexity determine pair in time complexity information or video data Video data carries out encoding used coding mode, and the coding quality and decoding quality of video data can be improved.It below will knot Fig. 2-Figure 11 is closed, respectively the coding method from coding side and decoding end to video data provided in an embodiment of the present invention and decoding side Method and device are specifically described.
Referring to fig. 2, be video data provided in an embodiment of the present invention coding method first embodiment flow diagram. The coding method of video data described in the embodiment of the present invention, comprising steps of
S101, obtain video data in frame to be processed specify information, the specify information include: frame per second information, when Between at least one of complexity information and spatial complexity information.
It in some possible embodiments, can be first before coding side encodes the frame to be processed of video data Frame to be processed is parsed, obtains the specify information of frame to be processed, and then can adopt according to the determination of the specify information of frame to be processed Which type of frame to be processed is encoded with coding mode.Specifically, coding side can preset a variety of coding modes of starting When carrying out coded treatment to frame to be processed, the condition met needed for the specify information of above-mentioned frame to be processed, and then can obtain To after the specify information of frame to be processed, according to the specify information of frame to be processed judge above-mentioned specify information whether preset condition. If the specify information of the above-mentioned frame to be processed acquired meets preset condition, a variety of coding modes can be started to frame to be processed It is pre-processed.It, can be according to general coding if the specify information of the above-mentioned frame to be processed acquired is unsatisfactory for preset condition Mode encodes video data to be processed.Specifically, above-mentioned general coding mode concretely existing coding and decoding video The coding mode of prescribed by standard, such as H.264 mixed architecture coding mode specified in standard, herein with no restrictions.
In the specific implementation, the specify information of frame to be processed described in the embodiment of the present invention can include: frame per second information, when Between complexity information and spatial complexity information etc..Wherein, the time complexity information of above-mentioned frame to be processed can include: wait locate Manage the index information of the length information of the motion vector of frame or the reference frame of frame to be processed.That is, in embodiments of the present invention, Coding side can be according to reflecting times complexities such as the index informations of the length information of the motion vector of frame to be processed or reference frame Physical quantity judge the motion intense degree of the motion picture for including in frame to be processed.The physics of above-mentioned reflecting time complexity Amount is only citing, and non exhaustive, including but not limited to above- mentioned information.The spatial complexity information of above-mentioned frame to be processed can include: Variation range information, image texture information and picture coding patterns information of image chroma etc..That is, in the embodiment of the present invention In, coding side can be according to the variation range for the image chroma for including in frame to be processed, image texture or picture coding patterns etc. Reflect the physical quantity of space complexity to judge the picture material for including in frame to be processed.
S102, if the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, The frame to be processed is pre-processed respectively according to the first coding mode and the second coding mode, wherein first coding Mode and second coding mode are different coding mode.
In some possible embodiments, since the frame per second of video data is different, the code requirement of video data also will Difference, in order to guarantee that the coding quality of video data, coding side can determine the coding specifically used according to the frame per second of video data Mode.If above-mentioned frame to be processed designates the information as frame per second information, coding side can refer in the frame per second information of above-mentioned frame to be processed When the frame per second shown is greater than preset frame per second threshold value, determine the specify information of frame to be processed meet the processing of starting a variety of coding modes to Handle the preset condition of frame.That is, in embodiments of the present invention, coding side can determine whether directly to adopt according to the frame per second of frame to be processed With the coding mode of video encoding and decoding standard defined, or whether need to use a variety of coding modes carry out processing simultaneously with from Screening obtains target code mode in a variety of coding modes.For example, when coding side judges to obtain the frame per second of video data to be processed When greater than 60 frames/second, a variety of coding modes to handle video data to be processed, and then can be used therefrom select processing result More target code mode of the coding mode of meet demand as video data to be processed, the no general coding that then be used directly Mode, i.e., the coding mode of defined in video encoding and decoding standard, details are not described herein.
In some possible embodiments, it if above-mentioned frame to be processed designates the information as time complexity information, compiles Code end can the time complexity indicated by the time complexity information of above-mentioned frame to be processed be greater than preset time complexity threshold When value, determines that the specify information of frame to be processed meets and start the preset condition that a variety of coding modes handle frame to be processed.For example, working as When the length of the average motion vector of video data to be processed is greater than 5, then it can determine whether the time complexity information institute of frame to be processed The time complexity of instruction is greater than preset time complexity threshold value.Further, a variety of coding modes can be started to be processed Video data is carried out while being handled, with therefrom selection target coding mode.If the time complexity of video data to be processed is discontented Sufficient preset condition, the general coding mode that then be used directly encode video data to be processed, and details are not described herein.
In some possible embodiments, it if above-mentioned frame to be processed designates the information as spatial complexity information, compiles Code end can the space complexity indicated by the spatial complexity information of above-mentioned frame to be processed be greater than pre-set space complexity threshold When, it determines that the specify information of frame to be processed meets and starts the preset condition that a variety of coding modes handle frame to be processed.For example, when to When the texture number of some specific texture strength is more than 40% in processing frame, coding side can determine whether that the image texture of frame to be processed is big In pre-set space complexity threshold, and then a variety of coding modes can be started, pending data is encoded, otherwise using general Coding mode handled.Concretely video compiles solution to general coding mode described in the embodiment of the present invention at this stage Code standard specified in coding mode, refer generally to include: prediction, transformation, quantization and entropy coding and etc. mixed architecture coding Mode can also be the coding modes such as other time domains, the prediction of frequency domain and transformation, herein with no restrictions.
In some possible embodiments, coding side determines that the specify information of above-mentioned frame to be processed meets a variety of volumes of starting After code mode handles frame preset condition to be processed, then it can start the first coding mode and the second coding mode, and then can be according to Above-mentioned first coding mode and the second coding mode handle frame to be processed.Specifically, coding side can be according to above-mentioned first Coding mode and the second coding mode respectively pre-process frame to be processed, to be determined according to above-mentioned pretreated processing result Specific any coding mode is selected to encode frame to be processed.In the specific implementation, coding side locates frame to be processed in advance The purpose of reason is can be much better in order to determine that specific any coding mode treats the encoding efficiency of processing mode, and indirect Frame to be processed is encoded using above-mentioned first coding mode and the second coding mode, so, coding side to frame to be processed into When row pretreatment, first video frame to be processed can be carried out to simplify processing (such as compression processing etc.), obtained compared with small data size Frame data, then compressed frame data are handled, and then can determine target code mode by the comparison of processing result.Tool During body is realized, above-mentioned compression processing, which carries out one of pretreatment mode implementation, specific processing mode, to answer according to using It is limited with scene, details are not described herein.
In the specific implementation, the first coding mode described in the embodiment of the present invention concretely video encoding and decoding standard institute Defined video encoding standard technology perhaps switch technology or resolution ratio zoom technology etc. in frame rate.Wherein, above-mentioned view The video encoding standard technology of frequency encoding and decoding standard defined may include the video encoding standards such as H.263, H.264 and H.265 The coding techniques of middle defined, herein with no restrictions.Above-mentioned coding mode is only citing, and non exhaustive, including but not limited to upper State coding mode.In addition, the second coding mode described in the embodiment of the present invention is the different codings with the first coding mode Technology specifically may include the video encoding standard technology different from the video encoding and decoding standard defined of the first coding mode, or Person is different from switch technology in the frame rate of the first coding mode, or the resolution ratio zoom technology different from the first coding mode Deng.Above-mentioned coding mode is only citing, and non exhaustive, including but not limited to above-mentioned coding mode.That is, the first coding mode can be Any one of above-mentioned a variety of coding techniques, the second coding mode can also be any in above-mentioned a variety of coding techniques, but the The selected coding techniques of two coding modes is different from the coding techniques of the first coding mode.For example, when the first coding mode is frame When rate upconversion techniques A, the second coding mode is only in the video encoding standard technology of video encoding and decoding standard defined Any or any resolution ratio zoom technology or frame rate on switch technology B.That is the first coding mode is frame Rate upconversion techniques A, the second coding mode are not then switch technology A in frame rate.In the specific implementation, above-mentioned first coding staff The selection of formula and the second coding mode can be determining according to time application scenarios demand, herein with no restrictions.
S103, according to the pre-processed results of the pre-processed results of first coding mode and second coding mode, The selection target coding mode from first coding mode and second coding mode, and according to the target code mode The frame to be processed is encoded.
In some possible embodiments, coding side uses a variety of codings such as the first coding mode and the second coding mode It, can be according to the pre-processed results of above-mentioned a variety of coding modes from above-mentioned a variety of after mode simultaneously pre-processes frame to be processed Selection target coding mode in coding mode.Specifically, the first coding mode and the second coding mode can be used to treat for coding side The video for the video data that processing frame is pre-processed, and then can be pre-processed the first coding mode to frame to be processed Quality or the first coding mode carry out amount of coded data size spent in preprocessing process to frame to be processed, compile with second The video quality for the video data that code mode pre-processes frame to be processed or the second coding mode are to frame to be processed It carries out the information such as amount of coded data size spent in preprocessing process to be compared, be compiled according to comparison result from above-mentioned first The coding mode for meeting concrete application scene demand is selected to have as to frame to be processed in code mode and the second coding mode The target code mode of body coded treatment.
In some possible embodiments, after coding side has selected target code mode, target code side can be used Formula encodes frame to be processed.In addition, target code mode can be accused first after coding side has selected target code mode Know that, to decoding end, decoding end is used when can determine that coding side encodes frame to be processed according to the information that coding side transmits Coding mode, and then corresponding decoding process can be used to be decoded, without voluntarily judging, reduce the erroneous judgement of coding mode Rate.In the specific implementation, target code mode can be informed decoding end by a variety of sending methods by coding side, wherein different hairs Mode is sent to correspond to different identification informations, coding side can be by different sending methods by the identification information of target code mode It is sent to decoding end.Specifically, coding side may be selected to send the sending method of index information, the mode of index is can be used in coding side Target code mode is marked, the index information of target code mode is encoded to obtain the mark in a manner of target code Know information, and then the identification information of target code mode can be passed to decoding end in the form of code stream.Decoding end can pass through solution Code stream information is analysed, target code mode is determined according to the index information carried in code stream information, and then can be according to target code side The corresponding decoding process of formula is decoded processing to frame to be processed.Coding side encodes the index information of target code mode When, can in a manner of equiprobable code index information, for example, if there are two types of coding mode, the first coding mode is encoded to 0, the second coding mode is encoded to 1.Specifically, coding side can also be compiled in the way of unequal probability with actual count characteristic Code index information.For example, if the first coding mode may be encoded as 0, and the second coding mode may be encoded as there are three types of coding mode 10, third coding mode may be encoded as 110.Further, equiprobability can also be used in coding side and unequal probability two ways is mixed Compile in collaboration with the mode code index information of code.For example, if the first coding mode may be encoded as 0, the second coding there are three types of coding mode Mode may be encoded as 10, and third coding mode may be encoded as 11.In the specific implementation, coding side can according to use application demand determine The coding mode of index information, herein with no restrictions.
In addition, target code mode is marked to obtain mesh in the physical quantity that characterization time-domain information can also be used in coding side The identification information of coding mode is marked, and passes to decoding end for the identification information of target code mode as control information.Decoding End can determine target code mode according to the control information that coding side transmits, and then can be according to the corresponding decoding of target code mode Mode is decoded processing to frame to be processed.The transfer mode of above-mentioned target code mode is only citing, and non exhaustive, include but It is not limited to above-mentioned implementation, herein with no restrictions.Wherein, the physical quantity of above-mentioned characterization time-domain information can include: image is shown Information or timestamp etc., herein with no restrictions.For example, " image display information " is each frame image of characterization in the aobvious of decoding end The physical quantity for showing sequence can be passed to decoding end as a kind of control information.Coding side and decoding end could dictate that when some In the presence of Image display position corresponding " image display information ", frame to be processed is handled using the first coding mode, it is no Then, frame to be processed is handled using the second coding mode.Or " timestamp " is being passed in transmission of video information Which is the control information that defeated system layer is added be characterized in decoding end at and play corresponding video information at time point.Coding End and decoding end can specify that when the timestamp information space character regulation of some video clips, use the first coding mode It is encoded, otherwise, when the degree of rarefication at above-mentioned timestamp information interval is more than preset threshold, is carried out using the second coding mode Coding.Coding side can pass through figure using information such as image display information or timestamps as the identification information of target code mode As target code mode is passed to decoding end by the display information such as information or timestamp.
Coding side can be complicated according to the frame per second information of the frame to be processed of video data or time in embodiments of the present invention The specify information of the degree frame to be processed such as information or spatial complexity information judges whether using a variety of coding modes to be processed Frame to be processed in video is handled, or according to video encoding and decoding standard defined general coding mode to frame to be processed It is handled.If the specify informations such as the frame per second of video data to be processed or time complexity meet a variety of coding modes of starting into The preset condition of row processing, then can be used the first coding mode and the second coding mode respectively to the frame to be processed in video data Pre-processed, so can be determined according to the pre-processed results of above-mentioned first coding mode and the second coding mode selection it is specific which A kind of coding mode is that frame to be processed is encoded.After coding side selection target coding mode, it can be sent according to specify information Mode informs target code mode to decoding end, so that decoding end treats place according to the corresponding decoding process of target code mode Reason video data is decoded processing.The embodiment of the present invention can select a kind of specific coding mode to treat from a variety of coding modes It handles frame and carries out coded treatment, improve the video processing quality of the frame to be processed of video data, enhance video data processing User experience.
It is the second embodiment flow diagram of the coding method of video data provided in an embodiment of the present invention referring to Fig. 3. The coding method of video data described in the embodiment of the present invention, comprising steps of
S201, obtain video data in frame to be processed specify information, the specify information include: frame per second information, when Between at least one of complexity information and spatial complexity information.
In some possible embodiments, coding side described in the embodiment of the present invention obtains the specified of frame to be processed The specific implementation process of information can be found in the step S101 in above-mentioned first embodiment, and details are not described herein.
S202, if the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, The first video quality processing is carried out to the frame to be processed according to first coding mode and meets designated quality to obtain The video data of index, wherein the pre-processed results of first coding mode include that the first video quality processing is consumed The first coding data amount size taken.
S203 carries out the second video quality processing to the frame to be processed according to second coding mode to be met The video data of the designated quality index, wherein the pre-processed results of second coding mode include described second The second spent amount of coded data size of video quality processing.
In some possible embodiments, coding side described in the embodiment of the present invention according to the first coding mode and Second coding mode, which carries out pretreated specific implementation process to frame to be processed, can be found in step in above-mentioned first embodiment S102, details are not described herein.
Further, in some possible embodiments, algorithm Starting mode described in the embodiment of the present invention, i.e., Determine whether the Starting mode handled using a variety of coding modes frame to be processed, can be each frame and all individually sentenced Break, determine whether to start, it can is the judgment mode of frame level.Further, coding side determines whether using a variety of coding staffs The Starting mode that formula handles video data to be processed can also be the judgment mode of image group grade.For example, in coding one When a image group, will can once it be judged in the multiframe data set for including in an image group, so that more in the image group Frame data are all handled using two kinds of coding modes, then therefrom selection target coding mode, or all use three kinds of coding staffs Formula is handled, reselection target code mode.Further, it can also be the judgment mode of sequence-level, i.e. sequence Frame data (being made of multiple images group) once judge, it is determined whether start a variety of coding modes and handled.Either band The judgment mode of grade or the judgment mode of chip level, wherein above-mentioned band and piece are the two ways of configuration frame.That is, coding side Can in the way of band perhaps in the way of piece to frame data split by a frame data be split as multiple strip datas or Multislice data, and then can be judged in the way of slice level, determine whether strip data starts a variety of coding modes and carry out Whether processing or chip level data start a variety of coding modes and handle etc., herein with no restrictions.
Further, in some possible embodiments, the first coding mode described in the embodiment of the present invention and Second coding mode can be two kinds of different coding modes, can also be a variety of coding modes such as different three kinds or four kinds, This is with no restrictions.The present invention will be that two different coding mode such cases are with the first coding mode and the second coding mode Example, is specifically described.
In some possible embodiments, coding side can carry out the first video to frame to be processed according to the first coding mode Quality treatment, to obtain the video data for meeting designated quality index.Further, coding side may further determine that according to first Coding mode carries out spent amount of coded data size (the i.e. first coding data amount of the first video quality processing to frame to be processed Size), wherein the spent first coding data amount size of above-mentioned first video quality processing is that the first coding mode is treated It handles frame and carries out pretreated pre-processed results.In addition, coding side can also carry out the to frame to be processed according to the second coding mode Two video qualities processing, to obtain the video data for meeting above-mentioned designated quality index.Wherein, above-mentioned second coding mode Carrying out pretreated pre-processed results to frame to be processed includes that the second spent amount of coded data of the second video quality processing is big It is small.In the specific implementation, above-mentioned designated quality index may include subjective quality index or objective quality index.Wherein, on State objective quality index can include: Y-PSNR or structural similarity etc..The quality index of above-mentioned video objective quality is only It is citing, and it is non exhaustive, including but not limited to above-mentioned quality index.Above-mentioned subjective quality index can include: subjective evaluation scoring, Psychological model scoring, minimum visible error scoring, time-space domain human eye cover up scoring of effect or physilogical characteristics etc..Above-mentioned view The quality index of frequency subjective quality is only citing, and non exhaustive, including but not limited to above-mentioned quality index.
S204, judges whether the first coding data amount size is less than the second amount of coded data size, if judgement As a result be it is yes, then follow the steps S205, it is no to then follow the steps S206.
First coding mode selection is target code mode by S205.
Second coding mode selection is target code mode by S206.
In some possible embodiments, coding side is according to the first coding mode and the second coding mode to frame to be processed It then can will include first coding data amount size in the pre-processed results of the first coding mode, with second after being pre-processed The the second amount of coded data size for including in the pre-processed results of coding mode is compared, and is encoded according to comparison result from first Selection target coding mode in mode and the second coding mode.Specifically, coding side can determine whether that first coding data amount size is It is no less than the second amount of coded data size, if the determination result is YES, then in the case where can determine same treatment target, first coding Mode handle to frame to be processed that spent amount of coded data is smaller, i.e., the process performance of the first coding mode is better than second Coding mode, and then can be target code mode by the first coding mode selection.If first coding data amount size is greater than second The process performance of second coding mode can be then better than the first coding mode by amount of coded data size, and then can be by the second coding Mode is determined as target code mode.
S207 encodes the frame to be processed according to the target code mode.
In the specific implementation, coding side can be joined using the specific implementation process that target code mode encodes frame to be processed See implementation described in the step S103 in first embodiment, details are not described herein.
S208 judges whether the target code mode is that video encoding standard technology if the determination result is YES then executes Step S209.
S209, the reference frame lists are added in the frame to be processed that processing is obtained, to generate next frame to be processed Corresponding reference frame lists.
In some possible embodiments, coding side has determined target code mode, is treated using target code mode After processing frame is encoded, it can also be determined whether frame to be processed accessing next frame pair to be processed according to target code mode In the reference frame lists answered.That is, can treated when coding side encodes frame to be processed (such as nth frame, N are natural number) Processed frame (such as N-1 frame) before processing frame carries out the reference frame that frame to be processed (nth frame) is generated after coding is completed List.After coding side handles the frame data of N-1 frame, it can determine that N-1 frame is according to the coding mode of N-1 frame The reference frame of the no frame data coding that can be used as its next frame (i.e. nth frame).If so, above-mentioned N-1 frame can be added to In reference frame lists, that is, N-1 frame is added in the reference frame lists of its coding, generates a new reference frame lists, it will The reference frame lists that the reference frame lists are encoded as first frame (nth frame).Coding side carries out handling it to the frame data of nth frame Afterwards, it can determine whether nth frame can be used as the frame data coding of its next frame (i.e. N+1 frame) according to the coding mode of nth frame Reference frame.If so, above-mentioned nth frame can be added in reference frame lists, that is, nth frame is added to the reference of its coding In frame list, a new reference frame lists, the reference frame which is encoded as first frame (N+1 frame) are generated List.
In some possible embodiments, if above-mentioned target code mode is the video of video encoding and decoding standard defined Above-mentioned frame to be processed can be then added in reference frame lists by coding standard technology, generate the corresponding ginseng of next frame to be processed Examine frame list.If above-mentioned target code mode is not the video encoding standard technology of video encoding and decoding standard defined, will not Frame to be processed is linked into reference frame lists.In the specific implementation, if above-mentioned target code mode is advised by video encoding and decoding standard Fixed video encoding standard technology, then coding side can be by way of sending index information by the selection result of target code mode Decoding end is passed to, or by the selection result of target code mode by way of sending the control information such as image display information Pass to decoding end.Decoding end receive above-mentioned index information perhaps control after information then can to above-mentioned index information or Control information is parsed, and determines target code mode.If above-mentioned target code mode is the coding mode converted in frame rate, After coding side has determined the coding mode encoded to frame to be processed, without sending the video data information of any present frame To decoding end.If decoding end does not receive any information from coding side, can default objects coding mode be frame rate on convert Coding mode.
In the specific implementation, coding side to the video data of each frame be all according to above-mentioned implementation, so, coding side pair When frame to be processed is handled, the target code mode of the processed frame before can receive frame to be processed, and then can according to The target code mode of processing frame determines whether processed frame can be the reference frame of currently pending frame.Specifically, working as coding side Using the method for sending index information when sending target code mode to decoding end, then coding side can compile solution to using video The processing frame of the coding mode of code prescribed by standard assigns " image display information ", and will have the place of " image display information " Frame is managed according to the far and near position principle close to frame to be processed, generates the reference frame lists of frame to be processed.For example, coding side can obtain 5 video frames before frame to be processed, it is assumed that be F1 to F5, wherein the coding mode that F1, F3 and F4 are used is coding and decoding video The coding mode of prescribed by standard, F2 and F5 use the coding mode converted in frame rate, then coding side can get F1 The case where " image display information " that is carried into each video frame of F5.Specifically, coding side can be according to above-mentioned each video frame The coding mode used determine " image display information " of each video frame for 0, nothing, 1,2, nothing, at this point, coding side can determine to The reference frame for handling frame is F1, F3 and F4, and then can be determined according to the positional relationship of each reference frame and frame to be processed to be processed The reference frame lists of frame are F4, F3, F1.
In some possible embodiments, it is controlled to decoding end using transmission if coding side transmits target code mode The method of information processed, then coding side can will have the processed frame of " image display information " according to close to the far and near position of frame to be processed Principle is set, generates reference frame lists, and then also discontinuous " image display information " can be mapped as continuous reference key.Example Such as, coding side can obtain 5 video frames before frame to be processed, it is assumed that be F1 to F5, wherein the coding that F1, F3 and F4 are used Mode is the coding mode of video encoding and decoding standard defined, and F2 and F5 use the coding mode converted in frame rate, then Coding side can get the case where " image display information " carried in each video frame of F1 to F5 be 0, nothing, 2,3, without into one Step, coding side can determine that the reference frame of frame to be processed is F1, F3 and F4, and then can be according to each reference frame and frame to be processed Positional relationship determines that the reference frame lists of frame to be processed are F4, F3, F1.Wherein, the reference key of above-mentioned reference frame F4, F3, F1 There can be (3,2,0) to be mapped as (3,2,1).
Coding side can be according to information such as the frame per second of video data to be processed or time complexities in embodiments of the present invention Judge whether to handle the frame to be processed in video to be processed using a variety of coding modes, or according to coding and decoding video mark The general coding mode of quasi- defined handles frame to be processed.If the frame per second or time complexity of video data to be processed Etc. specify informations meet preset condition, then the first coding mode and the second coding mode can be used respectively to video data to be processed In frame to be processed pre-processed, and then can be true according to the pre-processed results of above-mentioned first coding mode and the second coding mode Specific any coding mode is selected to be encoded for frame to be processed calmly.It, can be according to after coding side selection target coding mode Specify information sending method informs target code mode to decoding end, so that decoding end is according to the corresponding solution of target code mode Code mode is decoded processing to video data to be processed, and decoding end can be improved to coding mode without voluntarily judging in decoding end Judgement accuracy.Coding side can also be according to the pre-generated corresponding reference frame lists of frame to be processed, according to target code Mode encodes frame to be processed, improves the coded video quality of video to be processed, enhances the use of video data processing Family experience.
Referring to fig. 4, be video data provided in an embodiment of the present invention coding method 3rd embodiment flow diagram. The coding method of video data described in the embodiment of the present invention, comprising steps of
S301, obtain video data in frame to be processed specify information, the specify information include: frame per second information, when Between at least one of complexity information and spatial complexity information.
In some possible embodiments, coding side described in the embodiment of the present invention obtains the specified of frame to be processed The specific implementation process of information can be found in the step S101 in above-mentioned first embodiment, and details are not described herein.
S302 is pressed if the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed The first video quality processing is carried out to the frame to be processed according to first coding mode and expends prescribed coding data volume to obtain Video data in the case where size, wherein the pre-processed results of first coding mode include first video quality Handle the first quality index of the obtained video data.
S303 carries out the second video quality processing to the frame to be processed according to second coding mode to be expended Video data in the case where the prescribed coding data volume size, wherein the pre-processed results packet of second coding mode Include the second quality index of the video data that second video quality is handled.
In some possible embodiments, coding side described in the embodiment of the present invention according to the first coding mode and Second coding mode, which carries out pretreated specific implementation process to frame to be processed, can be found in step in above-mentioned first embodiment Step S202 and S203 in S102 and second embodiment, details are not described herein.
Further, coding side can carry out the first video quality processing to the frame to be processed according to the first coding mode, To obtain the video data in the case where expending prescribed coding data volume size.Further, coding side may further determine that according to One coding mode carries out the quality index (i.e. first of obtained video data after the first video quality processing to frame to be processed Quality index).Wherein, the first quality index of the video data that above-mentioned first video quality is handled is the first coding staff Formula carries out pretreated pre-processed results to frame to be processed.In addition, coding side can also be according to the second coding mode to frame to be processed The second video quality processing is carried out, to obtain the video data in the case where expending above-mentioned prescribed coding data volume size, wherein The pre-processed results of second coding mode include the second quality index of the video data that the second video quality is handled. Wherein, above-mentioned quality index may include objective quality index and subjective quality index, for details, reference can be made to above-mentioned second embodiment, This is repeated no more.
S304, judges whether first quality index is higher than second quality index and if the determination result is YES then holds Otherwise row step S305 executes step S306.
First coding mode selection is target code mode by S305.
Second coding mode selection is target code mode by S306.
In some possible embodiments, coding side is according to the first coding mode and the second coding mode to frame to be processed It then can will include the first quality index in the pre-processed results of the first coding mode, with the second coding staff after being pre-processed The second quality index for including in the pre-processed results of formula is compared, and is compiled according to comparison result from the first coding mode and second Selection target coding mode in code mode.Specifically, coding side can determine whether the first quality index is higher than the second quality index, If the determination result is YES, then it can determine that the first coding mode is to frame to be processed in the case where expending identical amount of coded data size The video quality handled is higher, i.e., the process performance of the first coding mode is better than the second coding mode, and then can incite somebody to action First coding mode selection is target code mode.It, can be by the second coding if the first quality index is lower than the second quality index The process performance of mode is better than the first coding mode, and then the second coding mode can be determined as target code mode.
In some possible embodiments, coding side selection target from the first coding mode and the second coding mode is compiled It, can also be according to first coding data amount size and the second amount of coded data size and the first quality index and the when code mode Two quality index carry out comprehensive descision.Coding side can be obtained from the pre-processed results of the first coding mode and the second coding mode Above-mentioned first quality index and the second quality index and first coding data amount size and the second amount of coded data size are taken, And then can determine the ratio of the first quality index and first coding data amount size, and, the second quality index and the second coding The ratio of data volume size.When coding side judgement learns that the ratio of the first quality index and first coding data amount size is higher than the When the ratio of two quality index and the second amount of coded data size, then the first coding mode can be determined as to the target of frame to be processed Otherwise second coding mode is then determined as the target code mode of frame to be processed by coding mode.
Further, coding side may be used also from the first coding mode and the second coding mode when selection target coding mode It is selected according to the concrete application scene of frame to be processed.If frame to be processed is the video data of specified application scenarios, can be strong System is encoded using the coding mode for guaranteeing video quality, that is, may specify that some specific coding mode is specified application scenarios Video data coding mode.Specifically, coding side can determine whether above-mentioned frame to be processed is monitor video data or spy Write the video frame in video data.If above-mentioned frame to be processed is the video frame in monitor video data or feature video data, Higher-quality coding mode may be used to handle above-mentioned frame to be processed.For example, for monitor video, it can be according to first Include in the pre-processed results of the first quality index and the second coding mode that include in the pre-processed results of coding mode Two selection of quality index target code modes.If the first quality index is higher than the second quality index, the first coding mode is selected As the target code mode of monitor video, otherwise, target code mode of second coding mode as monitor video is selected.
S307 encodes the frame to be processed according to the target code mode.
In the specific implementation, coding side can be joined using the specific implementation process that target code mode encodes frame to be processed See implementation described in the step S103 in first embodiment, details are not described herein.
S308 judges whether the target code mode is that video encoding standard technology if the determination result is YES then executes Step S309.
S309, the reference frame lists are added in the frame to be processed that processing is obtained, to generate next frame to be processed Corresponding reference frame lists.
In the specific implementation, the specific implementation process of above-mentioned steps S308 and S309 can be found in the step in second embodiment Implementation described in S208 and S209, details are not described herein.
Coding side can be multiple according to the frame per second or time complexity of video data to be processed, space in embodiments of the present invention The information such as miscellaneous degree judge whether to handle the frame to be processed in video to be processed using a variety of coding modes, or according to view The general coding mode of frequency encoding and decoding standard defined handles frame to be processed.If the frame per second of video data to be processed or The specify informations such as time complexity meet preset condition, then the first coding mode can be used and the second coding mode treats place respectively Frame to be processed in reason video data is pre-processed, and then can be according to the pre- of above-mentioned first coding mode and the second coding mode Processing result, which determines, selects specific any coding mode to be encoded for frame to be processed.Coding side selection target coding mode it Afterwards, target code mode can be informed to decoding end according to specify information sending method, so that decoding end is according to target code side The corresponding decoding process of formula is decoded processing to video data to be processed, and decoding end can be improved without voluntarily judging in decoding end To the accuracy of the judgement of coding mode.Coding side can also be according to the pre-generated corresponding reference frame lists of frame to be processed, root Frame to be processed is encoded according to target code mode, the coded video quality of video to be processed is improved, enhances video counts According to the user experience of processing.
It is the first embodiment flow diagram of the coding/decoding method of video data provided in an embodiment of the present invention referring to Fig. 5. The coding/decoding method of video data described in the embodiment of the present invention comprising steps of
S401, obtain video data in frame to be processed specify information, the specify information include: frame per second information, when Between at least one of complexity information and spatial complexity information.
In some possible embodiments, the realization process of decoding end described in the embodiment of the present invention and above-mentioned coding The inverse mistake that the realization process at end inverse process each other, i.e. realization process described in decoding end are the realization process of above-mentioned coding side Journey.The specify information of the frame to be processed of video data described in the embodiment of the present invention includes: frame per second information, time complexity Information and spatial complexity information etc..Wherein, the time complexity information of above-mentioned video data to be processed includes: motion vector Length information or reference frame index information;Above-mentioned spatial complexity information includes: the variation range letter of image chroma Breath, image texture information and picture coding patterns information etc..In the specific implementation, decoding end described in the embodiment of the present invention The specify information for obtaining frame to be processed, judges whether above-mentioned specify information meets starting target decoder mode and handle frame to be processed The specific implementation process of preset condition can be found in implementation described in above-mentioned coding side embodiment, and details are not described herein.
S402, if the specify information meets the preset condition that starting target decoder mode handles the frame to be processed, According to the identification information carried in the code stream information of the frame to be processed, the target decoder mode of the frame to be processed is determined.
In some possible embodiments, when decoding end judgement learns that above-mentioned specify information meets preset condition, then Determine that coding side carries out frame to be processed to encode used mesh according to the identification information carried in the code stream information of frame to be processed Coding mode is marked, and then can determine the target decoder mode of frame to be processed according to the mode of decoding end and the preparatory agreement of coding side. Specifically, decoding end can receive the code stream information of coding side transmission, code stream information is decoded, is carried according in code stream information Information determine target code mode used by coding side.When the index that the identification information that decoding end receives is coding mode When information, decoding end can determine target code according to the encoded information carried in the coding mode combination code stream information of index information Mode, and then the corresponding relationship for the coding mode and decoding process that can be determined according to coding side and the preparatory agreement of decoding end, determine The corresponding target decoder mode of target code mode.When the identification information that decoding end receives is aobvious for the corresponding image of coding mode When showing that information etc. controls information, decoding end can be assigned in mode combination code stream information according to the image display information of regulation and be carried Image display information determines target code mode.When code stream information of the decoding end to frame to be processed is decoded, code is acquired Include image display information or timestamp in the identification information carried in stream information, then can determine that target code mode is video Coding standard technology, and then can determine that target code mode is corresponding according to the corresponding relationship of preset coding mode and decoding process Target decoder mode.If decoding end is decoded the code stream information of frame to be processed, acquires and carried in code stream information There is no image display information or timestamp in identification information, it is determined that target code mode is to convert skill in the frame rate Art or the resolution ratio zoom technology, and then institute can be determined according to the corresponding relationship of preset coding mode and decoding process State the corresponding target decoder mode of switch technology in frame rate or the corresponding target decoder side of the resolution ratio zoom technology Formula.
For example, decoding end can arrange the coding mode of the index information of target code mode with coding side, for example, if having two Kind coding mode, could dictate that the first coding mode is encoded to 0, and the second coding mode is encoded to 1, then decoding end receives code stream letter After breath, then code stream information can be parsed, the target code information carried in code stream is determined according to stipulated form.Alternatively, Decoding end can arrange the control information of target code mode and the corresponding relationship of image display information with coding side, for example, if having Two kinds of coding modes then could dictate that the first coding mode can assign " image display information ", and the second coding mode does not assign " image Show information ".After decoding end receives control information, it can determine if including " image display information " in above-mentioned control information Target code mode is the first coding mode.If not including " image display information " in above-mentioned control information, target can determine Coding mode is the second coding mode.The mode of above-mentioned determining target code mode is only citing in specific implementation, can specifically be joined See implementation described in above-mentioned coding side, herein with no restrictions.
In some possible embodiments, if above-mentioned specify information is unsatisfactory for preset condition, decoding end can be according to one As decoding process processing is decoded to video data to be processed.Wherein, above-mentioned general decoding process is video encoding and decoding standard The mixed structure decoding process of defined, the specific implementation of above-mentioned mixed architecture decoding process can be found in video data and compile solution Executive mode specified in code standard, details are not described herein.
S403 determines the auxiliary information being decoded to the frame to be processed according to the target decoder mode, according to institute It states target decoder mode and the auxiliary information decodes the frame to be processed.
In some possible embodiments, after decoding end has determined target code mode used by coding side, then The frame to be processed in video data to be processed can be decoded according to the corresponding target decoder mode of target code mode.Specifically , it, can be corresponding with each coding mode of the preparatory agreement of coding side according to it when decoding end is decoded frame to be processed Information method of determination determines the corresponding specify information of target code mode that above-mentioned coding side uses, and then can be from view to be processed Frequency obtains above-mentioned specify information in, is decoded according to above-mentioned specify information to video data to be processed, obtains video counts According to.In the specific implementation, if decoding end can when above-mentioned target decoder mode is video encoding standard technology corresponding decoding process Obtain the auxiliary information being decoded to frame to be processed, wherein above-mentioned auxiliary information is the corresponding residual error data letter of frame to be processed Breath and control head information.When target decoder mode decoding process corresponding for switch technology in frame rate, decoding end is then not necessarily to Obtain the auxiliary information handled frame to be processed, that is, be sky to the auxiliary information that the frame to be processed is decoded.Work as mesh When to mark decoding process be resolution ratio zoom technology corresponding decoding process, the auxiliary information being decoded to frame to be processed is obtained, Wherein, above-mentioned auxiliary information is that frame to be processed zooms in and out the corresponding residual error data information of processing and control head information.For example, if Coding side is encoded the video encoding standard technology using video encoding and decoding standard defined to frame to be processed, then is decoded End needs to obtain complete residual error data information and control head letter after determining the corresponding decoding process of above-mentioned target code mode Breath is decoded video to be processed.If coding side encodes using frame rate conversion techniques frame to be processed, solve Code end does not need to obtain any data information, it is only necessary to be decoded i.e. according to the corresponding decoding process of frame rate conversion techniques It can.If coding side encodes using resolution ratio zoom technology frame to be processed, after decoding end needs to obtain diminution Residual error data information and control head information are decoded video to be processed.Further, other volumes can also be used in coding side Code mode, the information for needing to transmit may also include that filter factor information or update information etc., herein with no restrictions.
In some possible embodiments, decoding end according to target decoder mode and get auxiliary information decoding to Frame is handled, decoding end can also determine whether frame to be processed can be used as next frame to be processed according to the decoding process of frame to be processed Reference frame.Specifically, decoding end can determine whether target decoder mode is the corresponding decoding process of video encoding standard technology, if Target decoder mode is the corresponding decoding process of video encoding standard technology, then the ginseng is added in the frame to be processed obtained processing Frame list is examined, to generate the corresponding reference frame lists of next frame to be processed.In the specific implementation, decoding end generates frame to be processed The specific implementation of reference frame lists can be found in implementation described in coding side embodiment, and details are not described herein.
In embodiments of the present invention, decoding end can be determined according to video data to be processed and be advised in starting video encoding and decoding standard Fixed decoding process is decoded video data to be processed, can also determine coding side pair according to the identification information that coding side is sent The target decoder mode that video data to be processed is encoded, and then frame to be processed is decoded according to target decoder mode. Decoding end is not necessarily to voluntarily determine decoding process according to Adaptive Criterion, reduces the coding mode of video data encoding to be processed Erroneous judgement property, enhances the accuracy of frame video data decoding to be processed, improves the quality of video data decoding.
It is the schematic structural diagram of the first embodiment of the code device of video data provided in an embodiment of the present invention referring to Fig. 6. Code device described in the embodiment of the present invention, comprising:
Module 10 is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame At least one of rate information, time complexity information and spatial complexity information.
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath.
Preprocessing module 20, the specify information for obtaining in the acquisition module meet a variety of coding modes of starting When handling the preset condition of the frame to be processed, according to the first coding mode and the second coding mode respectively to the frame to be processed It is pre-processed, wherein first coding mode and second coding mode are different coding mode.
Processing module 30, the pre-processed results of first coding mode for being handled according to the preprocessing module and The pre-processed results of second coding mode, selection target is compiled from first coding mode and second coding mode Code mode, and the frame to be processed is encoded according to the target code mode.
In some possible embodiments, above-mentioned to designate the information as frame per second information;
Above-mentioned preprocessing module 20, is specifically used for:
Judge whether frame per second indicated by the frame per second information is greater than preset frame per second threshold value, if the determination result is YES, then It determines that the specify information meets and starts the preset condition that a variety of coding modes handle the frame to be processed.
In some possible embodiments, above-mentioned to designate the information as time complexity information;
Above-mentioned preprocessing module 20, is specifically used for:
Judge whether time complexity indicated by the time complexity information is greater than preset time complexity threshold value, If the determination result is YES, it is determined that the specify information, which meets, starts the default item that a variety of coding modes handle the frame to be processed Part.
In some possible embodiments, above-mentioned to determine information for spatial complexity information;
Above-mentioned preprocessing module 20, is specifically used for:
Judge whether space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold, if Judging result is yes, it is determined that the specify information, which meets, starts the default item that a variety of coding modes handle the frame to be processed Part.
In the specific implementation, code device described in the embodiment of the present invention is video counts provided in an embodiment of the present invention According to coding method executing subject, coding side described in the embodiment of as above-mentioned coding method.The embodiment of the present invention Described in code device can pass through that it obtains module 10, preprocessing module 20 and processing module 30 execute the embodiment of the present invention Implementation described in the first embodiment of the coding method of the video data of offer, specific implementation process can be found in above-mentioned Each step in the first embodiment of the coding method of video data (implementation described in step S101 to S103), herein It repeats no more.
In embodiments of the present invention, code device can be multiple according to the frame per second information of the frame to be processed of video data or time The specify information of the miscellaneous degree frame to be processed such as information or spatial complexity information judges whether to treat place using a variety of coding modes Reason video in frame to be processed handled, or according to video encoding and decoding standard defined general coding mode to be processed Frame is handled.If the specify informations such as the frame per second of video data to be processed or time complexity meet a variety of coding modes of starting The first coding mode and the second coding mode then can be used respectively to be processed in video data in the preset condition handled Frame is pre-processed, and then can determine that selection is specific according to the pre-processed results of above-mentioned first coding mode and the second coding mode Any coding mode is that frame to be processed is encoded.It, can be according to specify information after code device selection target coding mode Sending method informs target code mode to decoding apparatus, so that decoding apparatus is according to the corresponding decoding side of target code mode Formula is decoded processing to video data to be processed.Code device described in the embodiment of the present invention can be from a variety of coding modes Specifically a kind of coding mode carries out coded treatment to frame to be processed for selection, improves the video processing of the frame to be processed of video data Quality enhances the user experience of video data processing.
It is the schematic structural diagram of the second embodiment of the code device of video data provided in an embodiment of the present invention referring to Fig. 7. Code device described in the embodiment of the present invention, comprising:
Module 10 is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame At least one of rate information, time complexity information and spatial complexity information.
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath.
In the specific implementation, the acquisition module 10 in the first embodiment of above-mentioned code device can be performed in above-mentioned acquisition module 10 The step of performed implementation, specific implementation process can be found in the first embodiment of the coding method of above-mentioned video data Implementation described in S101, details are not described herein.
Preprocessing module 21, the specify information for obtaining in the acquisition module meet a variety of coding modes of starting When handling the preset condition of the frame to be processed, according to the first coding mode and the second coding mode respectively to the frame to be processed It is pre-processed, wherein first coding mode and second coding mode are different coding mode.
In the specific implementation, the pretreatment mould in the first embodiment of above-mentioned code device can be performed in above-mentioned preprocessing module 21 Implementation performed by block 20 further can also carry out following operation:
In some possible embodiments, above-mentioned preprocessing module 21, is specifically used for:
The first video quality processing is carried out to the frame to be processed according to first coding mode to obtain to meet and specify The video data of video quality index, wherein the pre-processed results of first coding mode include first video quality The spent first coding data amount size of processing;
The second video quality processing is carried out to obtain described in satisfaction to the frame to be processed according to second coding mode The video data of designated quality index, wherein the pre-processed results of second coding mode include second video Second amount of coded data size spent by quality treatment.
Processing module 31, the pre-processed results of first coding mode for being handled according to the preprocessing module and The pre-processed results of second coding mode, selection target is compiled from first coding mode and second coding mode Code mode, and the frame to be processed is encoded according to the target code mode.
In the specific implementation, the first reality of above-mentioned code device can be performed in processing module 31 described in the embodiment of the present invention Implementation performed by the processing module 30 in example is applied, further, can also carry out following operation:
In some possible embodiments, above-mentioned processing module 31, is specifically used for:
Judge whether the first coding data amount size that the preprocessing module is handled is less than second volume First coding mode selection is then if the determination result is YES target code mode, if judging result by code data volume size It is no, then is target code mode by second coding mode selection.
In some possible embodiments, code device described in the embodiment of the present invention further includes mark module 40, above-mentioned mark module 40 is used for:
The target code mode is marked by the way of index, to the index information of the target code mode It is encoded to obtain the identification information in a manner of the target code, and the identification information is passed into solution in the form of code stream Code end;Or
Use the physical quantity of characterization time-domain information that the target code mode is marked to obtain the target code The identification information of mode, and decoding end is passed to using the identification information of the target code mode as control information;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
In some possible embodiments, code device described in the embodiment of the present invention further include:
Reference frame judgment module 50, for judging the target code mode that the processing module selects whether for video Coding standard technology will be handled described in obtaining if the target code mode is the video encoding standard technology wait locate It manages frame and the reference frame lists is added, to generate the corresponding reference frame lists of next frame to be processed.
In the specific implementation, code device provided in an embodiment of the present invention concretely video counts provided in an embodiment of the present invention According to coding method executing subject, i.e., described in the embodiment of the coding method of video data provided in an embodiment of the present invention Coding side, in the specific implementation, code device (including can obtain module 10, preprocessing module by modules built in it 21, processing module 31, mark module 40 and reference frame judgment module 50) execute above-mentioned video data coding method it is first real Implementation described in example and second embodiment is applied, specific implementation process can be found in the coding method of above-mentioned video data Implementation described in each step in first embodiment and second embodiment, details are not described herein.
Code device can be believed according to the frame per second of video data to be processed or time complexity etc. in embodiments of the present invention Breath judges whether to handle the frame to be processed in video to be processed using a variety of coding modes, or according to coding and decoding video The general coding mode of prescribed by standard handles frame to be processed.If the frame per second of video data to be processed or time are complicated The specify informations such as degree meet preset condition, then the first coding mode and the second coding mode can be used respectively to video counts to be processed Frame to be processed in is pre-processed, and then can be according to the pre-processed results of above-mentioned first coding mode and the second coding mode It determines and specific any coding mode is selected to be encoded for frame to be processed.It, can after code device selection target coding mode Target code mode is informed to decoding apparatus according to specify information sending method, so that decoding apparatus is according to target code mode Corresponding decoding process is decoded processing to video data to be processed, and decoding dress can be improved without voluntarily judging in decoding apparatus Set the accuracy of the judgement to coding mode.Code device can also be according to the pre-generated corresponding reference frame list of frame to be processed Table encodes frame to be processed according to target code mode, improves the coded video quality of video to be processed, enhance view The user experience of frequency data processing.
It is the 3rd embodiment structural schematic diagram of the code device of video data provided in an embodiment of the present invention referring to Fig. 8. Code device described in the embodiment of the present invention, comprising:
Module 10 is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame At least one of rate information, time complexity information and spatial complexity information.
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath.
In the specific implementation, the specific implementation process of above-mentioned acquisition module 10 can be found in the coding method of above-mentioned video data Implementation described in the step S101 of first embodiment, details are not described herein.
Preprocessing module 22, the specify information for obtaining in the acquisition module meet a variety of coding modes of starting When handling the preset condition of the frame to be processed, according to the first coding mode and the second coding mode respectively to the frame to be processed It is pre-processed, wherein first coding mode and second coding mode are different coding mode.
In the specific implementation, the pretreatment mould in the first embodiment of above-mentioned code device can be performed in above-mentioned preprocessing module 22 Implementation performed by preprocessing module 21 in the second embodiment of block 20 and above-mentioned code device, further, also Executable following operation:
In some possible embodiments, above-mentioned preprocessing module 22, is specifically used for:
The first video quality processing is carried out to the frame to be processed according to first coding mode to obtain to expend and specify Video data in the case where amount of coded data size, wherein the pre-processed results of first coding mode include described First quality index of the video data that one video quality is handled;
The second video quality processing is carried out to obtain described in consuming to the frame to be processed according to second coding mode Video data in the case where prescribed coding data volume size, wherein the pre-processed results of second coding mode include institute State the second quality index of the video data that the second video quality is handled.
Processing module 32, the pre-processed results of first coding mode for being handled according to the preprocessing module and The pre-processed results of second coding mode, selection target is compiled from first coding mode and second coding mode Code mode, and the frame to be processed is encoded according to the target code mode.
In the specific implementation, the first reality of above-mentioned code device can be performed in processing module 32 described in the embodiment of the present invention Implementation performed by the processing module 30 in example and the processing module in second embodiment 31 is applied further may be used also It performs the following operations:
In some possible embodiments, above-mentioned processing module 32, is specifically used for:
Whether first quality index for judging that the preprocessing module is handled is higher than second quality index, If the determination result is YES, then by first coding mode selection be target code mode, if judging result be it is no, will described in Second coding mode selection is target code mode.
In some possible embodiments, above-mentioned processing module 32 can also be specifically used for:
The first coding data amount size and the second amount of coded data size of the preprocessing module processing are obtained, with And first quality index and second quality index;
Judge the ratio of first quality index Yu the first coding data amount size, if be higher than second matter The ratio of figureofmerit and the second amount of coded data size;
If the determination result is YES, then by first coding mode selection be target code mode, if judging result be it is no, It is then target code mode by second coding mode selection.
In some possible embodiments, above-mentioned processing module 32 can also be specifically used for:
Judge whether the frame to be processed is video frame in monitor video data or feature video data;
If the determination result is YES, then first quality index and second matter of the preprocessing module processing are obtained Figureofmerit;
Judge whether first quality index is higher than second quality index, it if the determination result is YES, then will be described First coding mode selection is target code mode.
In some possible embodiments, code device described in the embodiment of the present invention further includes mark module 40, above-mentioned mark module 40 is used for:
The target code mode is marked by the way of index, to the index information of the target code mode It is encoded to obtain the identification information in a manner of the target code, and the identification information is passed into solution in the form of code stream Code end;Or
Use the physical quantity of characterization time-domain information that the target code mode is marked to obtain the target code The identification information of mode, and decoding end is passed to using the identification information of the target code mode as control information;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
In some possible embodiments, code device described in the embodiment of the present invention further include:
Reference frame judgment module 50, for judging the target code mode that the processing module selects whether for video Coding standard technology will be handled described in obtaining if the target code mode is the video encoding standard technology wait locate It manages frame and the reference frame lists is added, to generate the corresponding reference frame lists of next frame to be processed.
In the specific implementation, code device provided in an embodiment of the present invention concretely video counts provided in an embodiment of the present invention According to coding method executing subject, i.e., described in the embodiment of the coding method of video data provided in an embodiment of the present invention Coding side, in the specific implementation, code device (including can obtain module 10, preprocessing module by modules built in it 22, processing module 32, mark module 40 and reference frame judgment module 50) execute above-mentioned video data coding method it is first real Implementation described in example and second embodiment is applied, specific implementation process can be found in the coding method of above-mentioned video data Implementation described in each step in first embodiment and second embodiment, details are not described herein.
Code device can be according to the frame per second or time complexity of video data to be processed, space in embodiments of the present invention The information such as complexity judge whether to handle the frame to be processed in video to be processed using a variety of coding modes, or according to The general coding mode of video encoding and decoding standard defined handles frame to be processed.If the frame per second of video data to be processed or The specify informations such as person's time complexity meet preset condition, then the first coding mode can be used and the second coding mode is treated respectively Frame to be processed in processing video data is pre-processed, and then can be according to above-mentioned first coding mode and the second coding mode Pre-processed results, which determine, selects specific any coding mode to be encoded for frame to be processed.Code device selection target coding staff After formula, target code mode can be informed to decoding apparatus according to specify information sending method, so that decoding apparatus is according to mesh The corresponding decoding process of mark coding mode is decoded processing to video data to be processed, and decoding apparatus, can without voluntarily judging Decoding apparatus is improved to the accuracy of the judgement of coding mode.Code device can also be corresponding according to pre-generated frame to be processed Reference frame lists encode frame to be processed according to target code mode, improve the coded video quality of video to be processed, Enhance the user experience of video data processing.
It is the example structure schematic diagram of the decoding apparatus of video data provided in an embodiment of the present invention referring to Fig. 9.This hair Decoding apparatus described in bright embodiment, comprising:
Module 60 is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame At least one of rate information, time complexity information and spatial complexity information.
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame; The spatial complexity information includes: variation range information, image texture information and the picture coding patterns letter of image chroma At least one of breath.
Determining module 70, the specify information for obtaining in the acquisition module meet at starting target decoder mode When managing the preset condition of the frame to be processed, according to the identification information carried in the code stream information of the frame to be processed, institute is determined State the target decoder mode of frame to be processed.
Processing module 80, the target decoder mode for being determined according to the determining module are determined to described to be processed The auxiliary information that frame is decoded decodes the frame to be processed according to the target decoder mode and the auxiliary information.
In some possible embodiments, above-mentioned to designate the information as frame per second information;
Above-mentioned determining module 70, is specifically used for:
Judge whether frame per second indicated by the frame per second information is greater than preset frame per second threshold value, if the determination result is YES, then Determine that the specify information meets the preset condition that starting target decoder mode handles the frame to be processed.
In some possible embodiments, above-mentioned to designate the information as time complexity information;
Above-mentioned determining module 70, is specifically used for:
Judge whether time complexity indicated by the time complexity information is greater than preset time complexity threshold value, If the determination result is YES, it is determined that the specify information meets the default item that starting target decoder mode handles the frame to be processed Part.
In some possible embodiments, above-mentioned to designate the information as spatial complexity information;
Above-mentioned determining module 70, is specifically used for:
Judge whether space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold, if Judging result is yes, it is determined that the specify information meets the default item that starting target decoder mode handles the frame to be processed Part.
In some possible embodiments, above-mentioned determining module 70 is specifically used for:
The code stream information of the frame to be processed is decoded, obtain carried in the code stream information for marking target The index information of coding mode determines the target code mode according to the index information;
According to the corresponding relationship of preset coding mode and decoding process, in conjunction with the target code mode determine described in Handle the target decoder mode of frame.
In some possible embodiments, above-mentioned determining module 70 is specifically used for:
The code stream information of the frame to be processed is decoded, whether obtains the identification information that carries in the code stream information Include image display information or timestamp;
If in the identification information including image display information or timestamp, it is determined that the target code mode is institute Video encoding standard technology is stated, and according to the corresponding relationship of preset coding mode and decoding process, determines the Video coding The corresponding target decoder mode of standard technique;
If there is no image display information or timestamp in the identification information, it is determined that the target code mode is institute Switch technology or the resolution ratio zoom technology in frame rate are stated, and according to the correspondence of preset coding mode and decoding process Relationship determines that the corresponding target decoder mode of switch technology or the resolution ratio zoom technology are corresponding in the frame rate Target decoder mode.
In some possible embodiments, above-mentioned processing module 80, is specifically used for:
When the target decoder mode that the determining module determines is the corresponding decoding of the video encoding standard technology When mode, the auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is that the frame to be processed is corresponding Residual error data information and control head information;Or
When the target decoder mode that the determining module determines is the corresponding decoding process of switch technology in frame rate When, the auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is sky;
When the target decoder mode that the determining module determines decoding process corresponding for resolution ratio zoom technology, The auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is that the frame to be processed zooms in and out processing pair The residual error data information and control head information answered.
In some possible embodiments, decoding apparatus (such as Figure 10) described in the embodiment of the present invention further include:
Reference frame judgment module 90, for judging whether the target decoder mode that the determining module determines is described The corresponding decoding process of video encoding standard technology;
If the target decoder mode is the corresponding decoding process of the video encoding standard technology, processing is obtained The reference frame lists are added in the frame to be processed, to generate the corresponding reference frame lists of next frame to be processed.
In the specific implementation, decoding apparatus provided in an embodiment of the present invention is the solution of video data provided in an embodiment of the present invention The executing subject of code method, i.e., decoding described in the embodiment of the coding/decoding method of video data provided in an embodiment of the present invention End.Decoding apparatus can execute present invention implementation by that can obtain module, determining module, processing module and reference frame judgment module etc. Implementation described in the embodiment of the coding/decoding method for the video data that example provides, specific implementation process can be found in above-mentioned view The embodiment of the coding/decoding method of frequency evidence, details are not described herein.
In embodiments of the present invention, decoding apparatus can determine in starting video encoding and decoding standard according to video data to be processed Defined decoding process is decoded video data to be processed, and the identification information that can be also sent according to code device determines coding The target decoder mode that device encodes video data to be processed, and then frame to be processed is carried out according to target decoder mode Decoding.Decoding apparatus is not necessarily to voluntarily determine decoding process according to Adaptive Criterion, reduces the volume of video data encoding to be processed The erroneous judgement of code mode, enhances the accuracy of frame video data decoding to be processed, improves the quality of video data decoding.
It is the example structure schematic diagram of the coding/decoding system of video data provided in an embodiment of the present invention referring to Figure 11, It include: code device 1000 provided in an embodiment of the present invention and decoding apparatus 2000.
In the specific implementation, the specific implementation process of above-mentioned encoding apparatus and decoding apparatus can be found in above-mentioned each embodiment Implementation described in each step, details are not described herein.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the program can be stored in a computer-readable storage medium In, the program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, the storage medium can be magnetic Dish, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access Memory, RAM) etc..
The above disclosure is only the preferred embodiments of the present invention, cannot limit the right model of the present invention with this certainly It encloses, therefore equivalent changes made in accordance with the claims of the present invention, is still within the scope of the present invention.

Claims (38)

1. a kind of coding method of video data characterized by comprising
The specify information of the frame to be processed in video data is obtained, the specify information includes: frame per second information, time complexity letter At least one of breath and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame;It is described Spatial complexity information includes: at least one of variation range information and picture coding patterns information of image chroma;
If the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, compiled according to first Code mode and the second coding mode respectively pre-process the frame to be processed, wherein the preset condition is described specified Information is greater than the threshold value of the preset correspondence specify information, and first coding mode and second coding mode are different Coding mode;
According to the pre-processed results of the pre-processed results of first coding mode and second coding mode, from described first Selection target coding mode in coding mode and second coding mode, and according to the target code mode to described wait locate Reason frame is encoded;
It is wherein, described that the frame to be processed is pre-processed respectively according to the first coding mode and the second coding mode, comprising:
The first video quality processing is carried out to the frame to be processed according to first coding mode and meets designated to obtain The video data of quality index, wherein the pre-processed results of first coding mode include the first video quality processing Spent first coding data amount size;
The second video quality processing is carried out to the frame to be processed according to second coding mode to obtain and meet described specify The video data of video quality index, wherein the pre-processed results of second coding mode include second video quality The second spent amount of coded data size of processing;
It is described according to the pre-processed results of first coding mode and the pre-processed results of second coding mode, from described Selection target coding mode includes: in first coding mode and second coding mode
Judge whether the first coding data amount size is less than the second amount of coded data size, if the determination result is YES, Then by first coding mode selection be target code mode, if judging result be it is no, by second coding mode choosing It is selected as target code mode.
2. the method as described in claim 1, which is characterized in that described to designate the information as frame per second information;
The specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, comprising:
Frame per second indicated by the frame per second information is greater than preset frame per second threshold value.
3. the method as described in claim 1, which is characterized in that described to designate the information as time complexity information;
The specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, comprising:
Time complexity indicated by the time complexity information is greater than preset time complexity threshold value.
4. the method as described in claim 1, which is characterized in that described to designate the information as spatial complexity information;
The specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, comprising:
Space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold.
5. the method as described in claim 1, which is characterized in that described from first coding mode and second coding staff In formula after selection target coding mode, the method also includes:
The target code mode is marked by the way of index, the index information of the target code mode is carried out It encodes to obtain the identification information in a manner of the target code, and the identification information is passed into decoding in the form of code stream End;Or
The physical quantity of characterization time-domain information is used the target code mode to be marked to obtain in a manner of the target code Identification information, and using the identification information of the target code mode as control information pass to decoding end;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
6. method as claimed in claim 5, which is characterized in that it is described according to the target code mode to the frame to be processed After being encoded, the method also includes:
Judge whether the target code mode is video encoding standard technology;
If the target code mode is the video encoding standard technology, institute is added in the frame to be processed that processing is obtained Reference frame lists are stated, to generate the corresponding reference frame lists of next frame to be processed.
7. a kind of coding method of video data characterized by comprising
The specify information of the frame to be processed in video data is obtained, the specify information includes: frame per second information, time complexity letter At least one of breath and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame;It is described Spatial complexity information includes: at least one of variation range information and picture coding patterns information of image chroma;
If the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, compiled according to first Code mode and the second coding mode respectively pre-process the frame to be processed, wherein the preset condition is described specified Information is greater than the threshold value of the preset correspondence specify information, and first coding mode and second coding mode are different Coding mode;
According to the pre-processed results of the pre-processed results of first coding mode and second coding mode, from described first Selection target coding mode in coding mode and second coding mode, and according to the target code mode to described wait locate Reason frame is encoded;
It is wherein, described that the frame to be processed is pre-processed respectively according to the first coding mode and the second coding mode, comprising:
The first video quality processing is carried out to obtain consuming prescribed coding to the frame to be processed according to first coding mode Video data in the case where data volume size, wherein the pre-processed results of first coding mode include first view First quality index of the video data that frequency quality treatment obtains;
The second video quality processing is carried out to the frame to be processed according to second coding mode to obtain and expend described specify Video data in the case where amount of coded data size, wherein the pre-processed results of second coding mode include described Second quality index of the video data that two video qualities are handled;
Obtain the first coding data amount size and the second amount of coded data size and first quality index and described Second quality index;
Judge the ratio of first quality index Yu the first coding data amount size, if refer to higher than second mass The ratio of mark and the second amount of coded data size;
If the determination result is YES, then by first coding mode selection be target code mode, if judging result be it is no, general Second coding mode selection is target code mode.
8. a kind of coding method of video data characterized by comprising
The specify information of the frame to be processed in video data is obtained, the specify information includes: frame per second information, time complexity letter At least one of breath and spatial complexity information;
If the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed, the preset condition It is greater than the threshold value of the preset correspondence specify information for the specify information, then according to the first coding mode and the second coding staff Formula respectively pre-processes the frame to be processed, wherein first coding mode and second coding mode are different Coding mode;
According to the pre-processed results of the pre-processed results of first coding mode and second coding mode, from described first Selection target coding mode in coding mode and second coding mode, and according to the target code mode to described wait locate Reason frame is encoded;Wherein, described that the frame to be processed is carried out in advance respectively according to the first coding mode and the second coding mode Processing, comprising:
The first video quality processing is carried out to obtain consuming prescribed coding to the frame to be processed according to first coding mode Video data in the case where data volume size, wherein the pre-processed results of first coding mode include first view First quality index of the video data that frequency quality treatment obtains;
The second video quality processing is carried out to the frame to be processed according to second coding mode to obtain and expend described specify Video data in the case where amount of coded data size, wherein the pre-processed results of second coding mode include described Second quality index of the video data that two video qualities are handled;
It is described according to the pre-processed results of first coding mode and the pre-processed results of second coding mode, from described Selection target coding mode includes: in first coding mode and second coding mode
Judge whether first quality index is higher than second quality index, if the determination result is YES, then by described first Coding mode selection is target code mode, is target code by second coding mode selection if judging result is no Mode.
9. method according to claim 8, which is characterized in that the frame to be processed includes monitor video data or feature view Video frame of the frequency in.
10. method according to claim 8, which is characterized in that the method also includes:
The target code mode is marked by the way of index, the index information of the target code mode is carried out It encodes to obtain the identification information in a manner of the target code, and the identification information is passed into decoding in the form of code stream End;Or
The physical quantity of characterization time-domain information is used the target code mode to be marked to obtain in a manner of the target code Identification information, and using the identification information of the target code mode as control information pass to decoding end;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
11. a kind of coding/decoding method of video data characterized by comprising
The specify information of the frame to be processed in video data is obtained, the specify information includes: frame per second information, time complexity letter At least one of breath and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame;It is described Spatial complexity information includes: at least one of variation range information and picture coding patterns information of image chroma;
If the specify information meets the preset condition that starting target decoder mode handles the frame to be processed, the preset condition It is greater than the threshold value of the preset correspondence specify information for the specify information, then is received in the form of code stream information from coding side The identification information of transmitting, the identification information are carried out encoding used target code by the coding side to the frame to be processed The index information of mode encodes to obtain, or receives control information from the coding side, carries in the control information and uses table The identification information that the target code mode is marked in the physical quantity of sign time-domain information;
The target code mode is determined according to the identification information or the control information, and according to preset coding mode With the corresponding relationship of decoding process, the target decoder mode of the frame to be processed is determined in conjunction with the target code mode;
The auxiliary information being decoded to the frame to be processed is determined according to the target decoder mode, according to the target decoder Mode and the auxiliary information decode the frame to be processed.
12. method as claimed in claim 11, which is characterized in that described to designate the information as frame per second information;
The specify information meets the preset condition that starting target decoder mode handles the frame to be processed, comprising:
Frame per second indicated by the frame per second information is greater than preset frame per second threshold value.
13. method as claimed in claim 11, which is characterized in that described to designate the information as time complexity information;
The specify information meets the preset condition that starting target decoder mode handles the frame to be processed, comprising:
Time complexity indicated by the time complexity information is greater than preset time complexity threshold value.
14. method as claimed in claim 11, which is characterized in that described to designate the information as spatial complexity information;
The specify information meets the preset condition that starting target decoder mode handles the frame to be processed, comprising:
Space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold.
15. such as the described in any item methods of claim 11-14, which is characterized in that described to determine institute according to the identification information State target code mode, comprising:
The code stream information is decoded, obtain carried in the code stream information for marking the target code mode Index information determines the target code mode according to the index information.
16. such as the described in any item methods of claim 11-14, which is characterized in that described to determine institute according to the control information Target code mode is stated, and according to the corresponding relationship of preset coding mode and decoding process, in conjunction with the target code mode The target decoder mode for determining the frame to be processed includes:
The control information is decoded, obtains whether the identification information carried in the control information includes image display letter Breath or timestamp;
If in the identification information including image display information or timestamp, it is determined that the target code mode is video volume Code standard technique, and according to the corresponding relationship of preset coding mode and decoding process, determine the video encoding standard technology Corresponding target decoder mode;
If there is no image display information or timestamp in the identification information, it is determined that the target code mode is frame rate Upper switch technology or resolution ratio zoom technology, and according to the corresponding relationship of preset coding mode and decoding process, determine institute State the corresponding target decoder mode of switch technology in frame rate or the corresponding target decoder side of the resolution ratio zoom technology Formula;
Wherein, the physical quantity of the characterization time-domain information includes: that described image shows information or the timestamp.
17. the method described in claim 16, which is characterized in that it is described according to the target decoder mode determine to it is described to The auxiliary information that processing frame is decoded, comprising:
When target decoder mode decoding process corresponding for the video encoding standard technology, obtain to described to be processed The auxiliary information that frame is decoded, the auxiliary information are the corresponding residual error data information of the frame to be processed and control head letter Breath;
When the target decoder mode is the corresponding decoding process of switch technology in frame rate, obtain to the frame to be processed into The decoded auxiliary information of row, the auxiliary information are sky;
When target decoder mode decoding process corresponding for resolution ratio zoom technology, obtains and the frame to be processed is carried out Decoded auxiliary information, the auxiliary information are that the frame to be processed zooms in and out the corresponding residual error data information of processing and control Head information.
18. method as claimed in claim 17, which is characterized in that it is described according to the target decoder mode determine to it is described to The auxiliary information that is decoded of processing frame, according to the target decoder mode and the auxiliary information decode the frame to be processed it Afterwards, the method also includes:
Judge whether the target decoder mode is the corresponding decoding process of the video encoding standard technology;
If the target decoder mode is the corresponding decoding process of the video encoding standard technology, will handle described in obtaining The reference frame lists are added in frame to be processed, to generate the corresponding reference frame lists of next frame to be processed.
19. a kind of code device of video data characterized by comprising
Module is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame per second letter At least one of breath, time complexity information and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame;It is described Spatial complexity information includes: in the variation range information, image texture information and picture coding patterns information of image chroma At least one;
Preprocessing module, the specify information for obtaining in the acquisition module meet starting a variety of coding mode processing institute When stating the preset condition of frame to be processed, the frame to be processed is carried out respectively according to the first coding mode and the second coding mode pre- Processing, wherein threshold value of the preset condition for the specify information greater than the preset correspondence specify information, described first Coding mode and second coding mode are different coding mode;
Processing module, the pre-processed results of first coding mode for being handled according to the preprocessing module and described The pre-processed results of two coding modes, the selection target coding staff from first coding mode and second coding mode Formula, and the frame to be processed is encoded according to the target code mode
Wherein, the preprocessing module, is specifically used for:
The first video quality processing is carried out to the frame to be processed according to first coding mode and meets designated to obtain The video data of quality index, wherein the pre-processed results of first coding mode include the first video quality processing Spent first coding data amount size;
The second video quality processing is carried out to the frame to be processed according to second coding mode to obtain and meet described specify The video data of video quality index, wherein the pre-processed results of second coding mode include second video quality The second spent amount of coded data size of processing;
The processing module is specifically used for:
Whether the first coding data amount size for judging that the preprocessing module is handled is less than second coded number It is then target code mode by first coding mode selection, if judging result is if the determination result is YES according to amount size It is no, then it is target code mode by second coding mode selection.
20. code device as claimed in claim 19, which is characterized in that described to designate the information as frame per second information;
The preprocessing module, is specifically used for:
Judge whether frame per second indicated by the frame per second information is greater than preset frame per second threshold value, if the determination result is YES, it is determined that The specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed.
21. code device as claimed in claim 19, which is characterized in that described to designate the information as time complexity information;
The preprocessing module, is specifically used for:
Judge whether time complexity indicated by the time complexity information is greater than preset time complexity threshold value, if sentencing Disconnected result is yes, it is determined that the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed.
22. code device as claimed in claim 19, which is characterized in that described to designate the information as spatial complexity information;
The preprocessing module, is specifically used for:
Judge whether space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold, if judgement It as a result is yes, it is determined that the specify information, which meets, starts the preset condition that a variety of coding modes handle the frame to be processed.
23. such as the described in any item code devices of claim 21-22, which is characterized in that the preprocessing module is also used to:
The first video quality processing is carried out to obtain consuming prescribed coding to the frame to be processed according to first coding mode Video data in the case where data volume size, wherein the pre-processed results of first coding mode include first view First quality index of the video data that frequency quality treatment obtains;
The second video quality processing is carried out to the frame to be processed according to second coding mode to obtain and expend described specify Video data in the case where amount of coded data size, wherein the pre-processed results of second coding mode include described Second quality index of the video data that two video qualities are handled;
The processing module is also used to:
Obtain the first coding data amount size and the second amount of coded data size of the preprocessing module processing, Yi Jisuo State the first quality index and second quality index;
Judge the ratio of first quality index Yu the first coding data amount size, if refer to higher than second mass The ratio of mark and the second amount of coded data size;
If the determination result is YES, then by first coding mode selection be target code mode, if judging result be it is no, general Second coding mode selection is target code mode.
24. code device as claimed in claim 19, which is characterized in that the code device further includes mark module, described Mark module is used for:
The target code mode is marked by the way of index, the index information of the target code mode is carried out It encodes to obtain the identification information in a manner of the target code, and the identification information is passed into decoding in the form of code stream End;Or
The physical quantity of characterization time-domain information is used the target code mode to be marked to obtain in a manner of the target code Identification information, and using the identification information of the target code mode as control information pass to decoding end;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
25. code device as claimed in claim 24, which is characterized in that the code device, further includes:
Reference frame judgment module, for judging the target code mode that the processing module selects whether for Video coding mark Quasi- technology adds the frame to be processed that processing obtains if the target code mode is the video encoding standard technology Enter the reference frame lists, to generate the corresponding reference frame lists of next frame to be processed.
26. a kind of code device of video data characterized by comprising
Module is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame per second letter At least one of breath, time complexity information and spatial complexity information;
Preprocessing module, the specify information for obtaining in the acquisition module meet starting a variety of coding mode processing institute The preset condition of frame to be processed is stated, the preset condition is the threshold that the specify information is greater than the preset correspondence specify information When value, the frame to be processed is pre-processed respectively according to the first coding mode and the second coding mode, wherein described first Coding mode and second coding mode are different coding mode;
Processing module, the pre-processed results of first coding mode for being handled according to the preprocessing module and described The pre-processed results of two coding modes, the selection target coding staff from first coding mode and second coding mode Formula, and the frame to be processed is encoded according to the target code mode;
Wherein, the preprocessing module, is specifically used for:
The first video quality processing is carried out to obtain consuming prescribed coding to the frame to be processed according to first coding mode Video data in the case where data volume size, wherein the pre-processed results of first coding mode include first view First quality index of the video data that frequency quality treatment obtains;
The second video quality processing is carried out to the frame to be processed according to second coding mode to obtain and expend described specify Video data in the case where amount of coded data size, wherein the pre-processed results of second coding mode include described Second quality index of the video data that two video qualities are handled;
The processing module, is specifically used for:
Judge whether first quality index is higher than second quality index, if the determination result is YES, then by described first Coding mode selection is target code mode, is target code by second coding mode selection if judging result is no Mode.
27. code device as claimed in claim 26, which is characterized in that the frame to be processed include monitor video data or Video frame in feature video data.
28. code device as claimed in claim 26, which is characterized in that the code device further includes mark module, described Mark module is used for:
The target code mode is marked by the way of index, the index information of the target code mode is carried out It encodes to obtain the identification information in a manner of the target code, and the identification information is passed into decoding in the form of code stream End;Or
The physical quantity of characterization time-domain information is used the target code mode to be marked to obtain in a manner of the target code Identification information, and using the identification information of the target code mode as control information pass to decoding end;
Wherein, the physical quantity of the characterization time-domain information includes: image display information or timestamp.
29. a kind of decoding apparatus of video data characterized by comprising
Module is obtained, for obtaining the specify information of the frame to be processed in video data, the specify information includes: frame per second letter At least one of breath, time complexity information and spatial complexity information;
Wherein, the time complexity information includes: the length information of motion vector or the index information of reference frame;It is described Spatial complexity information includes: in the variation range information, image texture information and picture coding patterns information of image chroma At least one;
Determining module, the specify information for being obtained in the acquisition module meet starting target decoder mode handle it is described The preset condition of frame to be processed, the preset condition are the threshold value that the specify information is greater than the preset correspondence specify information When, the identification information transmitted in the form of code stream information is received from coding side, the identification information is by the coding side to described The index information of target code mode used by frame to be processed is encoded encodes to obtain, or receives and control from the coding side Information processed, it is described to control what carrying in information was marked the target code mode using the physical quantity of characterization time-domain information Identification information;The target code mode is determined according to the identification information or the control information, and according to preset volume The corresponding relationship of code mode and decoding process determines the target decoder side of the frame to be processed in conjunction with the target code mode Formula;
Processing module, the target decoder mode for being determined according to the determining module, which determines, carries out the frame to be processed Decoded auxiliary information decodes the frame to be processed according to the target decoder mode and the auxiliary information.
30. decoding apparatus as claimed in claim 29, which is characterized in that described to designate the information as frame per second information;
The determining module, is specifically used for:
Judge whether frame per second indicated by the frame per second information is greater than preset frame per second threshold value, if the determination result is YES, it is determined that The specify information meets the preset condition that starting target decoder mode handles the frame to be processed.
31. decoding apparatus as claimed in claim 29, which is characterized in that described to designate the information as time complexity information;
The determining module, is specifically used for:
Judge whether time complexity indicated by the time complexity information is greater than preset time complexity threshold value, if sentencing Disconnected result is yes, it is determined that the specify information meets the preset condition that starting target decoder mode handles the frame to be processed.
32. decoding apparatus as claimed in claim 29, which is characterized in that described to designate the information as spatial complexity information;
The determining module, is specifically used for:
Judge whether space complexity indicated by the spatial complexity information is greater than pre-set space complexity threshold, if judgement It as a result is yes, it is determined that the specify information meets the preset condition that starting target decoder mode handles the frame to be processed.
33. such as the described in any item decoding apparatus of claim 29-32, which is characterized in that the determining module is specifically used for:
The code stream information is decoded, obtain carried in the code stream information for marking the target code mode Index information determines the target code mode according to the index information.
34. such as the described in any item decoding apparatus of claim 29-32, which is characterized in that the determining module is specifically used for:
The control information is decoded, obtains whether the identification information carried in the control information includes image display letter Breath or timestamp;
If in the identification information including image display information or timestamp, it is determined that the target code mode is video volume Code standard technique, and according to the corresponding relationship of preset coding mode and decoding process, determine the video encoding standard technology Corresponding target decoder mode;
If there is no image display information or timestamp in the identification information, it is determined that the target code mode is frame rate Upper switch technology or resolution ratio zoom technology, and according to the corresponding relationship of preset coding mode and decoding process, determine institute State the corresponding target decoder mode of switch technology in frame rate or the corresponding target decoder side of the resolution ratio zoom technology Formula.
35. decoding apparatus as claimed in claim 34, which is characterized in that the processing module is specifically used for:
When the target decoder mode that the determining module determines is the corresponding decoding process of the video encoding standard technology When, the auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is the corresponding residual error of the frame to be processed Data information and control head information;
When the target decoder mode that the determining module determines is the corresponding decoding process of switch technology in the frame rate When, the auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is sky;
When the target decoder mode that the determining module determines decoding process corresponding for the resolution ratio zoom technology, The auxiliary information being decoded to the frame to be processed is obtained, the auxiliary information is that the frame to be processed zooms in and out processing pair The residual error data information and control head information answered.
36. decoding apparatus as claimed in claim 35, which is characterized in that the decoding apparatus further include:
Reference frame judgment module, for judging whether the target decoder mode that the determining module determines is that the video is compiled The corresponding decoding process of code standard technique;
If the target decoder mode is the corresponding decoding process of the video encoding standard technology, will handle described in obtaining The reference frame lists are added in frame to be processed, to generate the corresponding reference frame lists of next frame to be processed.
37. a kind of coding/decoding system of video data characterized by comprising such as the described in any item volumes of claim 19-25 Code device, and such as the described in any item decoding apparatus of claim 29-36.
38. a kind of coding/decoding system of video data characterized by comprising such as the described in any item volumes of claim 26-28 Code device, and such as the described in any item decoding apparatus of claim 29-36.
CN201510180497.4A 2015-04-16 2015-04-16 A kind of decoding method and device of video data Active CN104811722B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201510180497.4A CN104811722B (en) 2015-04-16 2015-04-16 A kind of decoding method and device of video data
PCT/CN2016/079034 WO2016165603A1 (en) 2015-04-16 2016-04-12 Encoding and decoding method and device for video data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510180497.4A CN104811722B (en) 2015-04-16 2015-04-16 A kind of decoding method and device of video data

Publications (2)

Publication Number Publication Date
CN104811722A CN104811722A (en) 2015-07-29
CN104811722B true CN104811722B (en) 2019-05-07

Family

ID=53696151

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510180497.4A Active CN104811722B (en) 2015-04-16 2015-04-16 A kind of decoding method and device of video data

Country Status (2)

Country Link
CN (1) CN104811722B (en)
WO (1) WO2016165603A1 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104811722B (en) * 2015-04-16 2019-05-07 华为技术有限公司 A kind of decoding method and device of video data
CN107343208B (en) * 2016-04-29 2019-10-11 掌赢信息科技(上海)有限公司 A kind of control video code rate method and electronic equipment
CN105959700B (en) * 2016-05-31 2020-04-14 腾讯科技(深圳)有限公司 Video image coding method, device, storage medium and terminal equipment
CN107635142B (en) * 2016-07-18 2020-06-26 浙江大学 Video data processing method and device
EP3474225B1 (en) * 2017-10-18 2019-09-25 Axis AB Method and encoder for encoding a video stream in a video coding format supporting auxiliary frames
CN107682675A (en) * 2017-10-19 2018-02-09 佛山市章扬科技有限公司 A kind of method using a variety of compress mode recorded videos
CN110139104B (en) * 2018-02-09 2023-02-28 腾讯科技(深圳)有限公司 Video decoding method, video decoding device, computer equipment and storage medium
CN108960384B (en) * 2018-06-07 2020-04-28 阿里巴巴集团控股有限公司 Decoding method of graphic code and client
CN118042135A (en) * 2019-03-19 2024-05-14 华为技术有限公司 Point cloud encoding method, point cloud decoding method, device and storage medium
CN110740317B (en) * 2019-09-18 2021-10-15 浙江大华技术股份有限公司 Subblock motion prediction method, subblock motion encoding method, subblock motion encoder, and storage device
CN114827723B (en) * 2022-04-25 2024-04-09 阿里巴巴(中国)有限公司 Video processing method, device, electronic equipment and storage medium
CN117294683A (en) * 2022-06-16 2023-12-26 中兴通讯股份有限公司 Video processing method, transmitting end, receiving end, storage medium and program product
CN116233438B (en) * 2023-03-09 2023-08-29 上海华期信息技术有限责任公司 Data prediction acquisition system using weighting algorithm

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6845214B1 (en) * 1999-07-13 2005-01-18 Nec Corporation Video apparatus and re-encoder therefor
CN101018337A (en) * 2006-02-10 2007-08-15 富士施乐株式会社 Coding apparatus, decoding apparatus, coding method, decoding method and computer readable medium
CN101615910A (en) * 2009-05-31 2009-12-30 华为技术有限公司 The method of compressed encoding, device and equipment and comprssing coding/decoding method
CN104052992A (en) * 2014-06-09 2014-09-17 联想(北京)有限公司 Image processing method and electronic equipment
CN104410861A (en) * 2014-11-24 2015-03-11 华为技术有限公司 Video encoding method and device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100952340B1 (en) * 2008-01-24 2010-04-09 에스케이 텔레콤주식회사 Method and Apparatus for Determing Encoding Mode by Using Temporal and Spartial Complexity
CN103458237B (en) * 2012-05-29 2016-09-21 北京数码视讯科技股份有限公司 The determination method and apparatus of Video coding mode and method for video coding and device
CN104519368B (en) * 2013-09-30 2017-12-01 华为技术有限公司 Image Coding, decoding and reconstituting processing method and processing device
CN104811722B (en) * 2015-04-16 2019-05-07 华为技术有限公司 A kind of decoding method and device of video data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6845214B1 (en) * 1999-07-13 2005-01-18 Nec Corporation Video apparatus and re-encoder therefor
CN101018337A (en) * 2006-02-10 2007-08-15 富士施乐株式会社 Coding apparatus, decoding apparatus, coding method, decoding method and computer readable medium
CN101615910A (en) * 2009-05-31 2009-12-30 华为技术有限公司 The method of compressed encoding, device and equipment and comprssing coding/decoding method
CN104052992A (en) * 2014-06-09 2014-09-17 联想(北京)有限公司 Image processing method and electronic equipment
CN104410861A (en) * 2014-11-24 2015-03-11 华为技术有限公司 Video encoding method and device

Also Published As

Publication number Publication date
WO2016165603A1 (en) 2016-10-20
CN104811722A (en) 2015-07-29

Similar Documents

Publication Publication Date Title
CN104811722B (en) A kind of decoding method and device of video data
CN105981380B (en) Utilize the method and apparatus of the encoded video data block of palette coding
CN108322760A (en) A kind of method and device of coding and decoding video
US9462279B2 (en) Image encoding/decoding method and device
CN105141955B (en) Picture decoding method and image decoding apparatus
CN106105228B (en) A kind of method, apparatus and computer-readable medium handling video data
KR101782280B1 (en) Method and apparatus for palette table prediction
KR101500781B1 (en) Method for processing images and the corresponding electronic device
CN108702501A (en) The method and device that the compartmentalization luma prediction modes of colorimetric prediction for Video coding are inherited
EP3007442A1 (en) Method of pulse-code modulation and palette coding for video coding
KR102114641B1 (en) Method of video coding by prediction of the partitioning of a current block, method of decoding, coding and decoding devices and computer programs corresponding thereto
CN110233949A (en) The method of palette table initialization and management
US20180249162A1 (en) Method and apparatus for decoding a video using an intra prediction
CN110191338A (en) It will jump out method of the pixel as fallout predictor in index graph code
CN107465924A (en) The method and apparatus that quantization parameter predicted value is determined from multiple adjacent quantization parameters
RU2008106777A (en) IMAGE CODER AND DECODER IMAGES picture encoding method and method for decoding an image, the encoding image management software and image decoding, and computer-readable recording medium, in which the Program image coding, and computer-readable recording medium, in which the Program image decoding
CN108353180A (en) Video coding with delay reconstruction
CN110476422A (en) Picture coding device, image encoding method and image encoding program, picture decoding apparatus, picture decoding method and image decoding program
Erfurt et al. Multiple feature-based classifications adaptive loop filter
CN114868390A (en) Video encoding method, decoding method, encoder, decoder, and AI accelerator
JPH0217777A (en) Image transmission system
CN104581184B (en) Coding, decoding processing method and the device of depth image
CN109862358A (en) A kind of picture frame coding/decoding method based on environmental information
CN115190309A (en) Video frame processing method, training method, device, equipment and storage medium
CN115474052A (en) Point cloud encoding processing method, point cloud decoding processing method and related equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant