CN113873248A - Digital video data encoding and decoding method and equipment - Google Patents

Digital video data encoding and decoding method and equipment Download PDF

Info

Publication number
CN113873248A
CN113873248A CN202111130687.7A CN202111130687A CN113873248A CN 113873248 A CN113873248 A CN 113873248A CN 202111130687 A CN202111130687 A CN 202111130687A CN 113873248 A CN113873248 A CN 113873248A
Authority
CN
China
Prior art keywords
frame
sub
processing
data
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111130687.7A
Other languages
Chinese (zh)
Inventor
陈红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Wanlixiang Industrial Co ltd
Original Assignee
Shenzhen Wanlixiang Industrial Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Wanlixiang Industrial Co ltd filed Critical Shenzhen Wanlixiang Industrial Co ltd
Priority to CN202111130687.7A priority Critical patent/CN113873248A/en
Publication of CN113873248A publication Critical patent/CN113873248A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/182Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/567Motion estimation based on rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention provides a method and a device for encoding and decoding a mobile digital video, wherein the method comprises the steps of collecting video data and processing the video data to obtain the mobile digital video; separating the mobile digital video to obtain a plurality of sub-frame digital images, and optimizing a processing frame; and coding and decoding the sub-frame digital images based on the optimized processing frame, so as to realize the compression of the digital video data in the coding and decoding processes under the condition of ensuring that high-quality video can be shown.

Description

Digital video data encoding and decoding method and equipment
Technical Field
The present invention relates to the field of digital video encoding and decoding technologies, and in particular, to a method and an apparatus for encoding and decoding digital video data.
Background
At present, in daily life, video codecs are widely applied. Such as on the internet in various satellite and terrestrial television broadcast systems. On-line video material is typically compressed using a wide variety of different codecs, and in order to be able to view the material correctly, the user needs to download and install a codec package-a compiled codec component ready for the PC.
The video technology has a wide application range, such as services of online video conferences, online video e-commerce, online government affairs, online shopping, online schools, telemedicine, online seminars, online exhibition halls, personal online chatting, visual consultation and the like. However, all of the above applications must be compressed and are very labor intensive.
Accordingly, the present invention is directed to a digital video data encoding and decoding method and apparatus.
Disclosure of Invention
The invention provides a digital video data coding and decoding method, which comprises the following steps:
step 1, collecting video data, and processing the video data to obtain a mobile digital video;
step 2, separating the mobile digital video to obtain a plurality of sub-frame digital images, and optimizing a processing frame;
and 3, coding and decoding the sub-frame digital image based on the optimized processing frame.
Preferably, the processing framework comprises an encoder, a decoder and an entropy parameter model module, wherein the encoder comprises a main encoder and a super-a-encoder, and the decoder comprises a main decoder and a super-a-decoder.
Preferably, step 2, the process of separating the mobile digital video to obtain a plurality of sub-frame digital images and optimizing the processing frame further comprises:
the method comprises the steps of separating the mobile digital video to obtain a plurality of subframes, wherein subframe intervals exist between adjacent frames, and further a frame interval set of the mobile digital video is obtained;
acquiring a pixel column corresponding to each interval time point in the mobile digital video based on the frame interval set, obtaining the relative position of the pixel column corresponding to each subframe interval, binding each subframe interval and the relative position of the corresponding pixel column to obtain binding information, inserting the binding information into the corresponding subframe interval, and obtaining an interval calibration block.
Preferably, step 3, in the process of encoding and decoding the sub-frame digital image based on the optimized processing frame, further includes:
step 3.1, the super-prior encoder and the main encoder respectively encode the images corresponding to the interval calibration blocks and the subframes to generate encoded data with a first preset length and encoded data with a second preset length;
step 3.2, the entropy parameter model module carries out data compression on the coded data with the first preset length and the coded data with the second preset length;
and 3.3, the super-prior decoder and the main decoder respectively decode the compressed coded data with the first preset length and the compressed coded data with the second preset length.
Preferably, step 3, the process of encoding and decoding the sub-frame digital image based on the optimized processing frame further includes:
carrying out continuity detection on the interval calibration blocks;
judging whether the adjacent frames based on the current calibration block are continuous or not according to the continuity detection result;
if the frame is continuous, continuously decoding the adjacent frame corresponding to the next calibration block;
otherwise, resending the coded data of the adjacent frame of the current calibration block and alarming.
Preferably, step 1, acquiring video data, and processing the video data to obtain a mobile digital video, includes:
cutting a sub-video frame image in video data into n image blocks with the same size according to a preset cutting mode to obtain a plurality of image block data, and analyzing each image block data based on a preset algorithm to obtain corresponding sub-fingerprint data;
performing parallel processing on the n image block data after the sub-video frame image is cut, and performing fingerprint comparison on the image block data at the same position as the previous sub-video frame image according to the unique fingerprint data;
screening out small image block data with inconsistent fingerprint comparison from the current sub video frame image, carrying out data coding processing to obtain a plurality of coded data, collecting the plurality of coded data to form a data queue, and sending the data queue to a receiving party through a network;
after the receiving party receives the data queue, carrying out parallel decoding processing on a plurality of coded data in the data queue to restore and obtain corresponding small image block data; judging whether the small image block data obtained by restoration are complete, if not, determining a vacant position, and finding out the small image block data corresponding to the vacant position from the previous sub-video frame image for completion;
and splicing the supplemented small image block data to generate a complete mobile digital video and displaying the complete mobile digital video.
Preferably, step 2, the mobile digital video is partitioned to obtain a plurality of sub-frame digital images, and the optimization processing framework includes:
performing a first optimization on the processing framework: training the processing frame based on a preset first quantizer until the processing frame meets a first preset condition for stopping training, obtaining first sub-output values of all modules in the processing frame, and taking the first sub-output values as sub-initial solutions of corresponding modules;
processing the main encoder and the pre-check encoder, adjusting the first quantizer based on a preset adjusting method to obtain a second quantizer, training a processing frame based on the second quantizer to obtain second sub-weight values respectively corresponding to a decoder, an entropy parameter model module and a factorization entropy model module in the processing frame, and processing sub-initial solutions of the corresponding modules based on the second sub-weight values to obtain first sub-optimal solutions corresponding to the decoder, the entropy parameter model module and the factorization entropy model module;
and performing second optimization on the processing frame: processing the main decoder, the super-prior decoder and the entropy parameter model module, training a processing frame based on a preset third quantizer to obtain third sub-weight values respectively corresponding to the main encoder and the pre-prior encoder in the processing frame, and processing corresponding sub-initial solutions of the main encoder and the pre-prior encoder based on the second sub-weight values to obtain second sub-optimal solutions corresponding to the main encoder and the pre-prior encoder;
and optimizing the processing frame based on the first sub-optimization solution and the second sub-initial solution to obtain the optimized processing frame.
Preferably, step 3, encoding and decoding the sub-frame digital image based on the optimized processing frame, includes:
determining first parameter information of an entropy parameter model based on the optimized processing frame, coding the first parameter information based on a super-prior coder to obtain coded parameter information, and decoding the coded parameter information by the super-prior decoder to obtain second parameter information;
substituting the second parameter information into the optimized processing frame to obtain a first output value and a second output value output by the entropy parameter model, and predicting the code rate of the optimized processing frame based on the first output value and the second output value and a preset algorithm to obtain the code rate of the optimized processing frame;
enhancing the subframe digital images based on a preset algorithm to obtain first subframe digital images, performing quantization processing on the first subframe digital images to obtain second subframe digital images, performing image reconstruction processing on the second subframe digital images based on the optimized code rate of the processing frame, outputting reconstructed images, and performing image optimization performance evaluation calculation on encoding results based on the subframe digital images and the reconstructed images to obtain a distortion function;
estimating an encoding result generated by the optimized processing frame based on the code rate, wherein the encoding result comprises the code rate consumed by transmitting the encoded data in the optimized processing frame and the code rate of the second parameter information transmitted by the prior decoder; constructing a distortion loss function for performing end-to-end optimization on the image according to the distortion function and the code rate result; and performing end-to-end compression and optimization processing on the target image based on the distortion loss function.
Preferably, the super-first encoder is configured to encode the interval scaling block to generate encoded data of a first preset length, and includes:
acquiring a calibration attribute and a corresponding calibration identifier of each calibration point in the interval calibration block to obtain a first symbol;
predicting and acquiring a corresponding second symbol from a preset sequence according to the property of each calibration point;
performing fusion processing on the first symbol and the second symbol to obtain a fusion symbol corresponding to the calibration point;
and acquiring coded data with a first preset length according to all the fusion symbols.
An apparatus for mobile digital video encoding and decoding, comprising:
the acquisition unit is used for acquiring video data and processing the video data to obtain a mobile digital video;
the optimization unit is used for separating the mobile digital video to obtain a plurality of sub-frame digital images and optimizing a processing frame;
and the processing unit is used for encoding and decoding the sub-frame digital image based on the optimized processing frame.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a flow chart of a method for encoding and decoding a mobile digital video according to an embodiment of the present invention;
FIG. 2 is a flow chart of another method for encoding and decoding a mobile digital video according to an embodiment of the present invention;
fig. 3 is a block diagram of an apparatus for encoding and decoding a mobile digital video according to an embodiment of the present invention.
Detailed Description
The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.
Example 1
An embodiment of the present invention provides a method for encoding and decoding a mobile digital video, as shown in fig. 1, including:
step 1, collecting video data, and processing the video data to obtain a mobile digital video;
step 2, separating the mobile digital video to obtain a plurality of sub-frame digital images, and optimizing a processing frame;
and 3, coding and decoding the sub-frame digital image based on the optimized processing frame.
In this embodiment, the separation is performed by dividing sub-frame digital images in the mobile digital video to obtain a plurality of sub-frame digital images, and inserting binding information into two adjacent sub-frame digital images to obtain an interval scaling block.
In this embodiment, the optimization processing frame is configured to process the modules in the processing frame respectively to obtain a first sub-optimization solution and a second sub-initial solution, and optimize the processing frame based on the first sub-optimization solution and the second sub-initial solution, where the first sub-optimization solution and the second sub-optimization solution are sub-initial solutions obtained by training the processing frame with the first quantizer, and the optimization solutions corresponding to the modules are obtained by processing the sub-initial solutions based on the second sub-weight value and the third sub-weight value.
The beneficial effect of above-mentioned scheme does: by separating the mobile digital video and optimizing the processing frame, the redundancy removal in huge data is realized, the storage space is saved, and the coding and decoding efficiency is improved.
Example 2
Based on embodiment 1, the processing framework comprises an encoder, a decoder and an entropy parameter model module, wherein the encoder comprises a main encoder and a super-a-encoder, and the decoder comprises a main decoder and a super-a-decoder.
Example 3
Based on embodiment 1, the process of the encoding and decoding method further includes:
the method comprises the steps of separating the mobile digital video to obtain a plurality of subframes, wherein subframe intervals exist between adjacent frames, and further a frame interval set of the mobile digital video is obtained;
acquiring a pixel column corresponding to each interval time point in the mobile digital video based on a frame interval set, acquiring the relative position of the pixel column corresponding to each subframe interval, binding each subframe interval and the relative position of the corresponding pixel column to acquire binding information, inserting the binding information into the corresponding subframe interval, and acquiring an interval calibration block;
in this embodiment, the interval set is a set formed by intervals between every two adjacent sub-video frame images;
in this embodiment, the binding information is obtained by binding the relative position of the pixel corresponding to each sub-video frame image in the mobile digital video and the sub-frame interval corresponding to the sub-video frame image, and inserting the binding information into the corresponding sub-frame interval to obtain an interval calibration block, so as to encode and decode the mobile digital video;
the beneficial effect of above-mentioned scheme does: by segmenting the mobile digital video and inserting the interval calibration block, the sequence of the mobile digital video is ensured, and the mobile digital video is prevented from being out of order in the encoding and decoding processes, so that the encoding and decoding efficiency is influenced, the encoded data to be decoded is ensured to be subjected to lossless reconstruction, and the encoding and decoding efficiency is improved.
Example 4
Based on embodiment 1, as shown in fig. 2, step 3, in the process of encoding and decoding the sub-frame digital image based on the optimized processing frame, further includes:
step 3.1, the super-prior encoder and the main encoder respectively encode the images corresponding to the interval calibration blocks and the subframes to generate encoded data with a first preset length and encoded data with a second preset length;
step 3.2, the entropy parameter model module carries out data compression on the coded data with the first preset length and the coded data with the second preset length;
and 3.3, the super-prior decoder and the main decoder respectively decode the compressed coded data with the first preset length and the compressed coded data with the second preset length.
In this embodiment, the first preset length and the second preset length are respectively 4 bytes and 16 bytes, and the interval calibration block and the sub-video frame image are coded to different lengths, so that the interval calibration block and the sub-video frame image are distinguished conveniently, error coding or error decoding is prevented from occurring in the coding and decoding process, and the coding and decoding accuracy is improved.
The beneficial effect of above-mentioned scheme does: the super-first encoder and the main encoder respectively encode images corresponding to the interval calibration blocks and the sub-frames, and the entropy parameter model module performs data compression on the encoded data with the first preset length and the encoded data with the second preset length; the super-first-check decoder and the main decoder respectively decode the compressed coded data with the first preset length and the compressed coded data with the second preset length to prevent the mobile digital video from being out of order in the coding and decoding processes, so that the coding and decoding efficiency is influenced, the lossless reconstruction of the coded data to be decoded is ensured, and the coding and decoding efficiency is improved.
Example 5
Based on embodiment 1, step 3, the process of encoding and decoding the sub-frame digital image based on the optimized processing framework, further includes:
carrying out continuity detection on the interval calibration blocks;
judging whether the adjacent frames based on the current calibration block are continuous or not according to the continuity detection result;
if the frame is continuous, continuously decoding the adjacent frame corresponding to the next calibration block;
otherwise, resending the coded data of the adjacent frame of the current calibration block and alarming.
The beneficial effect of above-mentioned scheme does: whether the adjacent frames based on the current calibration block are continuous or not is judged according to the continuity detection result, and an alarm is given when the adjacent frames are discontinuous, so that error coding or error decoding is prevented from occurring in the coding and decoding process, and the coding and decoding accuracy is improved.
Example 6
Based on embodiment 1, step 1 is to collect video data and process the video data to obtain a mobile digital video, and includes:
cutting a sub-video frame image in video data into n image blocks with the same size according to a preset cutting mode to obtain a plurality of image block data, and analyzing each image block data based on a preset algorithm to obtain corresponding sub-fingerprint data;
performing parallel processing on the n image block data after the sub-video frame image is cut, and performing fingerprint comparison on the image block data at the same position as the previous sub-video frame image according to the unique fingerprint data;
screening out small image block data with inconsistent fingerprint comparison from the current sub video frame image, carrying out data coding processing to obtain a plurality of coded data, collecting the plurality of coded data to form a data queue, and sending the data queue to a receiving party through a network;
after the receiving party receives the data queue, carrying out parallel decoding processing on a plurality of coded data in the data queue to restore and obtain corresponding small image block data; judging whether the small image block data obtained by restoration are complete, if not, determining a vacant position, and finding out the small image block data corresponding to the vacant position from the previous sub-video frame image for completion;
and splicing the supplemented small image block data to generate a complete mobile digital video and displaying the complete mobile digital video.
In this embodiment, the predetermined algorithm is a frequency complex analysis algorithm. And analyzing each image block data to obtain corresponding sub-fingerprint data, performing fingerprint comparison on the sub-fingerprint data of each image block and the image block data at the same position of the previous sub-video frame image according to the unique fingerprint data, and performing data coding processing on the small image block data with inconsistent fingerprint comparison, so that the workload is reduced, and the coding and decoding efficiency is improved.
In this embodiment, the splicing is to splice the complemented small image blocks into a complete mobile digital video according to the corresponding sequence;
the beneficial effect of above-mentioned scheme does: by analyzing the image block data, the sub-fingerprint data of each image block is obtained and the image blocks with different sub-fingerprint data are encoded, so that the encoded data is simplified, the storage space is reduced, the transmission efficiency is improved, the corresponding relation between the encoded data and the image is established, a path and a calling relation are provided for the subsequent decoding of the encoded data, and the subsequent decoding of the currently obtained encoded data is facilitated.
Example 7
Based on the embodiment 1, the step 2 of partitioning the mobile digital video to obtain a plurality of sub-frame digital images and optimizing a processing frame includes:
performing a first optimization on the processing framework: training the processing frame based on a preset first quantizer until the processing frame meets a first preset condition for stopping training, obtaining first sub-output values of all modules in the processing frame, and taking the first sub-output values as sub-initial solutions of corresponding modules;
processing the main encoder and the pre-check encoder, adjusting the first quantizer based on a preset adjusting method to obtain a second quantizer, training a processing frame based on the second quantizer to obtain second sub-weight values respectively corresponding to a decoder, an entropy parameter model module and a factorization entropy model module in the processing frame, and processing sub-initial solutions of the corresponding modules based on the second sub-weight values to obtain first sub-optimal solutions corresponding to the decoder, the entropy parameter model module and the factorization entropy model module;
and performing second optimization on the processing frame: processing the main decoder, the super-prior decoder and the entropy parameter model module, training a processing frame based on a preset third quantizer to obtain third sub-weight values respectively corresponding to the main encoder and the pre-prior encoder in the processing frame, and processing corresponding sub-initial solutions of the main encoder and the pre-prior encoder based on the second sub-weight values to obtain second sub-optimal solutions corresponding to the main encoder and the pre-prior encoder;
and optimizing the processing frame based on the first sub-optimization solution and the second sub-initial solution to obtain the optimized processing frame.
In this embodiment, the processing framework is an integrated encoding and decoding module composed of an encoder, a decoder, and an entropy parameter model module, and is used for compressing, encoding, and decoding digital video data;
in this embodiment, the first quantizer performs 4 × 4 integer discrete cosine transform on the digital video data, and inputs the transformed digital video data into the processing frame until the processing frame meets a first preset condition for stopping training;
in this embodiment, the first preset condition is that the definition of the digital video data subjected to the frame encoding and decoding reaches a preset range;
in the embodiment, the sub-initial solution is the first sub-output value of each module;
in this embodiment, the processing of the main encoder and the super-first-pass encoder is to freeze the main encoder and the super-first-pass encoder, operate only the other modules, obtain second sub-weight values corresponding to the other modules, and process the sub-initial solutions of the corresponding modules based on the second sub-weight values, so as to obtain first sub-optimal solutions of the corresponding modules;
in this embodiment, the second quantizer is to perform 2 × 2 integer discrete cosine transform on the digital video data and input the transformed digital video data into the processing frame;
in this embodiment, the third quantizer is to perform 8 × 8 integer discrete cosine transform on the digital video data and input the transformed digital video data into the processing frame;
the beneficial effect of above-mentioned scheme does: the optimized processing frame is obtained by performing the first optimization and the second optimization on the processing frame, so that the encoded data is simplified, the storage space is reduced, the transmission efficiency is improved, and the encoded data obtained at present can be conveniently decoded subsequently.
Example 8
Based on embodiment 1, step 3, based on the optimized processing framework, encodes and decodes the sub-frame digital image, including:
determining first parameter information of an entropy parameter model based on the optimized processing frame, coding the first parameter information based on a super-prior coder to obtain coded parameter information, and decoding the coded parameter information by the super-prior decoder to obtain second parameter information;
substituting the second parameter information into the optimized processing frame to obtain a first output value and a second output value output by the entropy parameter model, and predicting the code rate of the optimized processing frame based on the first output value and the second output value and a preset algorithm to obtain the code rate of the optimized processing frame;
enhancing the subframe digital images based on a preset algorithm to obtain first subframe digital images, performing quantization processing on the first subframe digital images to obtain second subframe digital images, performing image reconstruction processing on the second subframe digital images based on the optimized code rate of the processing frame, outputting reconstructed images, and performing image optimization performance evaluation calculation on encoding results based on the subframe digital images and the reconstructed images to obtain a distortion function;
estimating an encoding result generated by the optimized processing frame based on the code rate, wherein the encoding result comprises the code rate consumed by transmitting the encoded data in the optimized processing frame and the code rate of the second parameter information transmitted by the prior decoder; constructing a distortion loss function for performing end-to-end optimization on the image according to the distortion function and the code rate result; and performing end-to-end compression and optimization processing on the target image based on the distortion loss function.
In this embodiment, the first parameter information is a compression parameter of the entropy parameter model, and is used for compressing the sub-frame digital image, where the compression parameter represents a compression mode of data;
in this embodiment, the second parameter information is obtained by encoding and decoding the first parameter information;
in this embodiment, the first output value and the second output value are two values output by the entropy parameter model after the second parameter information is substituted into the optimized processing frame;
in this embodiment, the code rate prediction is to predict the transmission speed of the optimized processing frame based on a preset code rate prediction algorithm and the first output value and the second output value, so as to obtain the data transmission speed, i.e. the code rate, of the optimized processing frame;
in this embodiment, the quantization process is to divide the first sub-frame digital image into a plurality of intervals, and replace the intervals with the average pixel value of each interval, so as to simplify the first sub-frame digital image and obtain the second sub-frame digital image;
in the embodiment, the image reconstruction processing is to convert the digital image of the second subframe from digital information into image information, and evaluate and calculate the image optimization performance of the encoding result to obtain a distortion function;
in this embodiment, the distortion function is a distortion value between an image output by the optimized processing framework and an input image;
the beneficial effect of above-mentioned scheme does: and determining first parameter information of the entropy parameter model based on the optimized processing frame, decoding the first parameter information to obtain second parameter information, performing code rate prediction on the optimized processing frame based on the second parameter information to obtain a distortion function, and performing end-to-end compression and optimization processing on the target image based on the distortion function, so that the lossless restoration degree of the image in the encoding and decoding processes is improved.
Example 9
Based on embodiment 7, the process of performing image reconstruction processing on the digital image of the second subframe based on the optimized code rate of the processing frame includes:
calculating a first loss value for image reconstruction according to the following formula:
Figure BDA0003280417020000131
wherein D1I is the number of pixels of the second sub-frame digital image, j is a preset weight value,
Figure BDA0003280417020000132
for each pixel corresponding to a predicted two-dimensional coordinate,
Figure BDA0003280417020000133
corresponding to each pixel to mark two-dimensional coordinates;
by the formula
Figure BDA0003280417020000134
A first loss function for the image reconstruction can be obtained, for example, when i 1000, j 1,
Figure BDA0003280417020000135
then D is11.998, and calculating a second loss value of the image reconstruction according to the following formula:
Figure BDA0003280417020000136
wherein D is2For the second loss value of the image reconstruction,
Figure BDA0003280417020000137
a predicted reconstruction error corresponding to each pixel;
by the formula
Figure BDA0003280417020000138
Second loss values for image reconstruction can be obtained, e.g.
Figure BDA0003280417020000139
Then D is2Judging whether the image reconstruction reaches the standard or not based on a second loss value of the image reconstruction;
when D is present2When the second subframe digital image is smaller than the preset threshold value, the second subframe digital image is qualified for reconstruction and is not processed;
when D is present2And when the number of the second subframe digital images is not less than the preset threshold value, the reconstruction of the second subframe digital images does not reach the standard, and the secondary reconstruction is carried out on the images until the reconstruction of the images reaches the standard.
The beneficial effect of above-mentioned scheme does: and whether the image reconstruction reaches the standard is judged based on the distortion function, and secondary reconstruction is carried out when the image reconstruction does not reach the standard until the image reconstruction reaches the standard, so that the lossless restoration degree of the image in the encoding and decoding processes is improved.
Example 10
Based on embodiment 4, the super-first encoder is configured to encode the interval calibration block to generate encoded data of a first preset length, and includes:
acquiring a calibration attribute and a corresponding calibration identifier of each calibration point in the interval calibration block to obtain a first symbol;
predicting and acquiring a corresponding second symbol from a preset sequence according to the property of each calibration point;
performing fusion processing on the first symbol and the second symbol to obtain a fusion symbol corresponding to the calibration point;
and acquiring coded data with a first preset length according to all the fusion symbols.
The beneficial effect of above-mentioned scheme does: the work load of coding and decoding is reduced, the coding and decoding efficiency is improved, and the lossless restoration of the coded data to be decoded is ensured.
Example 11
An embodiment of the present invention provides a device for encoding and decoding a mobile digital video, as shown in fig. 3, including:
the acquisition unit is used for acquiring video data and processing the video data to obtain a mobile digital video;
the optimization unit is used for separating the mobile digital video to obtain a plurality of sub-frame digital images and optimizing a processing frame;
and the processing unit is used for encoding and decoding the sub-frame digital image based on the optimized processing frame.
The beneficial effects of the above scheme are already explained in example 1.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (10)

1. A method of mobile digital video encoding and decoding, comprising:
step 1, collecting video data, and processing the video data to obtain a mobile digital video;
step 2, separating the mobile digital video to obtain a plurality of sub-frame digital images, and optimizing a processing frame;
and 3, coding and decoding the sub-frame digital image based on the optimized processing frame.
2. The method of claim 1, wherein the processing framework comprises an encoder, a decoder, and an entropy parameter model module, wherein the encoder comprises a main encoder and a super-a-priori encoder, and the decoder comprises a main decoder and a super-a-priori decoder.
3. The method as claimed in claim 1, wherein the step 2 of partitioning the mobile digital video to obtain a plurality of sub-frame digital images and optimizing the processing frame further comprises:
the method comprises the steps of separating the mobile digital video to obtain a plurality of subframes, wherein subframe intervals exist between adjacent frames, and further a frame interval set of the mobile digital video is obtained;
acquiring a pixel column corresponding to each interval time point in the mobile digital video based on the frame interval set, obtaining the relative position of the pixel column corresponding to each subframe interval, binding each subframe interval and the relative position of the corresponding pixel column to obtain binding information, inserting the binding information into the corresponding subframe interval, and obtaining an interval calibration block.
4. A method of mobile digital video encoding and decoding as defined in claim 2, wherein: step 3, in the process of encoding and decoding the sub-frame digital image based on the optimized processing frame, the method further comprises the following steps:
step 3.1, the super-prior encoder and the main encoder respectively encode the images corresponding to the interval calibration blocks and the subframes to generate encoded data with a first preset length and encoded data with a second preset length;
step 3.2, the entropy parameter model module carries out data compression on the coded data with the first preset length and the coded data with the second preset length;
and 3.3, the super-prior decoder and the main decoder respectively decode the compressed coded data with the first preset length and the compressed coded data with the second preset length.
5. The method of claim 3, wherein in the step 3, the encoding and decoding of the sub-frame digital image based on the optimized processing frame further comprises:
carrying out continuity detection on the interval calibration blocks;
judging whether the adjacent frames based on the current calibration block are continuous or not according to the continuity detection result;
if the frame is continuous, continuously decoding the adjacent frame corresponding to the next calibration block;
otherwise, resending the coded data of the adjacent frame of the current calibration block and alarming.
6. A method for encoding and decoding a mobile digital video according to claim 1, wherein step 1, acquiring video data and processing the video data to obtain the mobile digital video comprises:
cutting a sub-video frame image in video data into n image blocks with the same size according to a preset cutting mode to obtain a plurality of image block data, and analyzing each image block data based on a preset algorithm to obtain corresponding sub-fingerprint data;
performing parallel processing on the n image block data after the sub-video frame image is cut, and performing fingerprint comparison on the image block data at the same position as the previous sub-video frame image according to the unique fingerprint data;
screening out small image block data with inconsistent fingerprint comparison from the current sub video frame image, carrying out data coding processing to obtain a plurality of coded data, collecting the plurality of coded data to form a data queue, and sending the data queue to a receiving party through a network;
after the receiving party receives the data queue, carrying out parallel decoding processing on a plurality of coded data in the data queue to restore and obtain corresponding small image block data; judging whether the small image block data obtained by restoration are complete, if not, determining a vacant position, and finding out the small image block data corresponding to the vacant position from the previous sub-video frame image for completion;
and splicing the supplemented small image block data to generate a complete mobile digital video and displaying the complete mobile digital video.
7. The method for encoding and decoding a mobile digital video according to claim 1, wherein the step 2 of partitioning the mobile digital video to obtain a plurality of sub-frame digital images and optimizing the processing framework comprises:
performing a first optimization on the processing framework: training the processing frame based on a preset first quantizer until the processing frame meets a first preset condition for stopping training, obtaining first sub-output values of all modules in the processing frame, and taking the first sub-output values as sub-initial solutions of corresponding modules;
processing the main encoder and the pre-check encoder, adjusting the first quantizer based on a preset adjusting method to obtain a second quantizer, training a processing frame based on the second quantizer to obtain second sub-weight values respectively corresponding to a decoder, an entropy parameter model module and a factorization entropy model module in the processing frame, and processing sub-initial solutions of the corresponding modules based on the second sub-weight values to obtain first sub-optimal solutions corresponding to the decoder, the entropy parameter model module and the factorization entropy model module;
and performing second optimization on the processing frame: processing the main decoder, the super-prior decoder and the entropy parameter model module, training a processing frame based on a preset third quantizer to obtain third sub-weight values respectively corresponding to the main encoder and the pre-prior encoder in the processing frame, and processing corresponding sub-initial solutions of the main encoder and the pre-prior encoder based on the second sub-weight values to obtain second sub-optimal solutions corresponding to the main encoder and the pre-prior encoder;
and optimizing the processing frame based on the first sub-optimization solution and the second sub-initial solution to obtain the optimized processing frame.
8. The method of claim 1, wherein the step 3 of encoding and decoding the sub-frame digital image based on the optimized processing frame comprises:
determining first parameter information of an entropy parameter model based on the optimized processing frame, coding the first parameter information based on a super-prior coder to obtain coded parameter information, and decoding the coded parameter information by the super-prior decoder to obtain second parameter information;
substituting the second parameter information into the optimized processing frame to obtain a first output value and a second output value output by the entropy parameter model, and predicting the code rate of the optimized processing frame based on the first output value and the second output value and a preset algorithm to obtain the code rate of the optimized processing frame;
enhancing the subframe digital images based on a preset algorithm to obtain first subframe digital images, performing quantization processing on the first subframe digital images to obtain second subframe digital images, performing image reconstruction processing on the second subframe digital images based on the optimized code rate of the processing frame, outputting reconstructed images, and performing image optimization performance evaluation calculation on encoding results based on the subframe digital images and the reconstructed images to obtain a distortion function;
estimating an encoding result generated by the optimized processing frame based on the code rate, wherein the encoding result comprises the code rate consumed by transmitting the encoded data in the optimized processing frame and the code rate of the second parameter information transmitted by the prior decoder; constructing a distortion loss function for performing end-to-end optimization on the image according to the distortion function and the code rate result; and performing end-to-end compression and optimization processing on the target image based on the distortion loss function.
9. The method of claim 4, wherein the super-a-priori encoder is configured to encode the inter-block to generate the encoded data of the first predetermined length, and comprises:
acquiring a calibration attribute and a corresponding calibration identifier of each calibration point in the interval calibration block to obtain a first symbol;
predicting and acquiring a corresponding second symbol from a preset sequence according to the property of each calibration point;
performing fusion processing on the first symbol and the second symbol to obtain a fusion symbol corresponding to the calibration point;
and acquiring coded data with a first preset length according to all the fusion symbols.
10. A device for mobile digital video encoding and decoding according to claim 1, comprising:
the acquisition unit is used for acquiring video data and processing the video data to obtain a mobile digital video;
the optimization unit is used for separating the mobile digital video to obtain a plurality of sub-frame digital images and optimizing a processing frame;
and the processing unit is used for encoding and decoding the sub-frame digital image based on the optimized processing frame.
CN202111130687.7A 2021-09-26 2021-09-26 Digital video data encoding and decoding method and equipment Pending CN113873248A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111130687.7A CN113873248A (en) 2021-09-26 2021-09-26 Digital video data encoding and decoding method and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111130687.7A CN113873248A (en) 2021-09-26 2021-09-26 Digital video data encoding and decoding method and equipment

Publications (1)

Publication Number Publication Date
CN113873248A true CN113873248A (en) 2021-12-31

Family

ID=78990749

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111130687.7A Pending CN113873248A (en) 2021-09-26 2021-09-26 Digital video data encoding and decoding method and equipment

Country Status (1)

Country Link
CN (1) CN113873248A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116600119A (en) * 2023-07-18 2023-08-15 腾讯科技(深圳)有限公司 Video encoding method, video decoding method, video encoding device, video decoding device, computer equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116600119A (en) * 2023-07-18 2023-08-15 腾讯科技(深圳)有限公司 Video encoding method, video decoding method, video encoding device, video decoding device, computer equipment and storage medium
CN116600119B (en) * 2023-07-18 2023-11-03 腾讯科技(深圳)有限公司 Video encoding method, video decoding method, video encoding device, video decoding device, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
RU2737315C1 (en) Image encoding method and device and image decoding method and device
US10594338B1 (en) Adaptive quantization
US20020009143A1 (en) Bandwidth scaling of a compressed video stream
EP3677027B1 (en) Mixing of probabilities for entropy coding in video compression
US20180115787A1 (en) Method for encoding and decoding video signal, and apparatus therefor
US10965948B1 (en) Hierarchical auto-regressive image compression system
EP2262270A1 (en) Method, device and system for interframe prediction encoding and decoding
RU2007106081A (en) METHOD AND DEVICE FOR TRANSFORMING WITH INCREASING FRAME RATE BY USING A CODER (EA-FRUC) FOR COMPRESSING A VIDEO IMAGE
CN110650349B (en) Image encoding method, decoding method, encoder, decoder and storage medium
CN102792695A (en) Method and apparatus for encoding and decoding image by using large transform unit
CN101389029B (en) Method and apparatus for video image encoding and retrieval
EP3355580A1 (en) Method and apparatus for processing video signals using coefficient derivation prediction
CN114513655A (en) Live video quality evaluation method, video quality adjustment method and related device
CN107005702B (en) System and method for processing blocks of a digital image
CN101072344A (en) Video coding method, decoding method and device
CN111263243A (en) Video coding method and device, computer readable medium and electronic equipment
CN113873248A (en) Digital video data encoding and decoding method and equipment
CN109274682B (en) Data processing method and apparatus thereof
US9667958B2 (en) Image coding and decoding methods and apparatuses
CN111385576B (en) Video coding method and device, mobile terminal and storage medium
EP2888877B1 (en) Method and apparatus for estimating content complexity for video quality assessment
US10469874B2 (en) Method for encoding and decoding a media signal and apparatus using the same
US20040057518A1 (en) Compression of motion vectors
CN116982313A (en) Method and apparatus for encoding video for machine vision
US20220377342A1 (en) Video encoding and video decoding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination