WO2019210822A1 - 视频编解码方法、装置、系统及存储介质 - Google Patents

视频编解码方法、装置、系统及存储介质 Download PDF

Info

Publication number
WO2019210822A1
WO2019210822A1 PCT/CN2019/084836 CN2019084836W WO2019210822A1 WO 2019210822 A1 WO2019210822 A1 WO 2019210822A1 CN 2019084836 W CN2019084836 W CN 2019084836W WO 2019210822 A1 WO2019210822 A1 WO 2019210822A1
Authority
WO
WIPO (PCT)
Prior art keywords
compression
macroblocks
macroblock
text
character
Prior art date
Application number
PCT/CN2019/084836
Other languages
English (en)
French (fr)
Inventor
高俊平
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Priority to US17/052,256 priority Critical patent/US11445201B2/en
Publication of WO2019210822A1 publication Critical patent/WO2019210822A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/12Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/12Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
    • H04N19/122Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/14Coding unit complexity, e.g. amount of activity or edge presence estimation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/182Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/188Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a video data packet, e.g. a network abstraction layer [NAL] unit
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression

Definitions

  • Embodiments of the present disclosure relate to, but are not limited to, the field of communication technology.
  • a video encoding method includes: performing macroblock partitioning on pixel data of a screen change region; and classifying the divided macroblock into a text macroblock or image according to pixel data in the macroblock. a macroblock; determining a compression type of each of the character macroblocks according to color data in each of the character macroblocks; and encoding and compressing each of the character macroblocks according to a compression type of each of the character macroblocks by using a corresponding compression quality.
  • a video decoding method including: receiving and parsing a packet sent by a video encoding apparatus, where the packet carries compressed data and a compression type of the compressed data; The compression type decodes the compressed data in the packet.
  • a video encoding apparatus includes: a macro dividing module configured to perform macroblock partitioning on pixel data of a screen change area; and a macro classification module configured to perform pixel data according to the macroblock Classifying the divided macroblocks into a text macroblock or an image macroblock; a type determining module, configured to determine a compression type of each of the text macroblocks according to color data in each of the text macroblocks; and an encoding module, configured to The compression type of each character macroblock encodes and compresses each of the character macroblocks with a corresponding compression quality.
  • a video decoding apparatus including: a receiving module, configured to receive a packet sent by a video encoding apparatus, where the packet carries compressed data and a compression type of the compressed data; a module, configured to parse the packet, and obtain the compression type; and a decoding module, configured to perform decoding processing on the compressed data according to the compression type.
  • a video codec system including a video encoding device according to the present disclosure and a video decoding device according to the present disclosure.
  • a storage medium having stored thereon one or more programs, the one or more programs being executed by one or more processors, the one or more processes The video encoding method according to the present disclosure is performed.
  • a storage medium having stored thereon one or more programs, the one or more programs being executed by one or more processors, the one or more processes The video decoding method according to the present disclosure is performed.
  • FIG. 1 is a flowchart of a video encoding method according to an embodiment of the present disclosure
  • FIG. 2 is another flow chart of a video encoding method in accordance with an embodiment of the present disclosure
  • FIG. 3 is a block diagram showing a structure of a video encoding apparatus according to an embodiment of the present disclosure
  • FIG. 4 is a flowchart of a video decoding method in accordance with an embodiment of the present disclosure
  • FIG. 5 is another flowchart of a video decoding method according to an embodiment of the present disclosure.
  • FIG. 6 is a block diagram showing a structure of a video decoding apparatus according to an embodiment of the present disclosure
  • FIG. 7 is a schematic diagram of a video codec system in accordance with an embodiment of the present disclosure.
  • the remote transmission classification coding technology is roughly divided into two categories: the first category, which roughly divides the entire screen into equally divided regions, roughly divided into texts and pictures according to the colors of the regions, and then uses lossless compression on the text to damage the image. Compression, but the use of lossless compression for text can not meet the transmission requirements under low bandwidth; the second category, in the video compression technology, the intra prediction macroblocks are classified, the text and image are distinguished, and then the text and image are transformed.
  • the high-frequency coefficient and the low-frequency coefficient are divided into multiple levels of compression transmission. This classification method requires multiple compressions, and the compression efficiency of the text is not high, especially for a desktop office scene containing a large amount of text, and the applicability is not strong.
  • the interactive features of screen sharing put forward high requirements on the definition of video encoding and the real-time transmission.
  • the screen content contains a lot of thin lines such as fonts, borders, etc., some even one pixel. Monochrome lines.
  • the video coding technology is not efficient for compressing non-continuous tone content such as text, especially for desktop office scenes with a large amount of text, and the applicability is not strong.
  • Embodiments of the present disclosure provide a video encoding and decoding method, apparatus, system, and storage medium to improve compression efficiency of non-contiguous tone content such as text, especially for desktop office scenes containing a large amount of text, and to improve video coding and decoding. Sex.
  • FIG. 1 is a flowchart of a video encoding method in accordance with an embodiment of the present disclosure.
  • a video encoding method includes a device S104 of step S101.
  • step S101 macroblock division is performed on the pixel data of the screen change area.
  • step S102 the divided macroblocks are classified into character macroblocks or image macroblocks based on the pixel data in the macroblock.
  • step S103 the compression type of each character macroblock is determined based on the color data in each character macroblock.
  • each character macroblock is encoded and compressed according to the compression type of each character macroblock by using the corresponding compression quality.
  • the division of macroblocks may be performed according to the coding unit size of 16x16, 32x32, 64x64, etc. according to the needs of the encoder.
  • step S102 can include analyzing gradient features of the internal pixel data distribution of each macroblock and classifying the macroblocks into image macroblocks or text macroblocks according to the gradient features.
  • the distribution of the pixel data of the image has graduality, the gradient of the pixel data is small, the boundary of the text is more, and the gradient of the pixel is larger. Therefore, the threshold can be set, and the macroblock whose gradient change is larger than the threshold is classified into a character macroblock, and The remaining macroblocks are classified as image macroblocks.
  • step S103 may include: establishing statistical information according to pixel data of each character macroblock; classifying each character macroblock according to a color feature in the statistical information; and color features according to different categories of text macroblocks The difference between the foreground color and the background color determines the compression type of the text macroblock of each category, wherein the compression type includes high lossy compression, moderate lossy compression, high definition lossy compression, lossy compression requiring compensation, and losslessness. At least one of compression.
  • a plurality of sets of data for example, the combination of the luminance and chrominance signals (YUV) can be selected.
  • a non-color text macroblock a red text macroblock, a blue text macroblock, and a mixed color text macroblock (that is, a red macro and a blue text macroblock). class.
  • the difference between the foreground color and the background color of each character macroblock may be further analyzed, and the difference between the foreground color and the background color is greater than the first threshold.
  • the compression type is determined to be highly lossy compression; the compression type of the achromatic text macroblock whose difference between the foreground color and the background color is less than or equal to the first threshold and greater than the second threshold is determined as moderate lossy compression;
  • the compression type of the macroblock or the blue-text macroblock of the human eye is determined to be lossy compression requiring compensation;
  • the compression type of the mixed color macroblock is determined to be lossless compression; and the compression type of the remaining character macroblock is Determined for high definition lossy compression.
  • an appropriate encoding manner may be selected according to the compression type of the text macroblock. Coding quality to ensure the clarity of the picture.
  • the coding method and the quality of the coding can be selected according to the following principle: for a macroblock of text whose compression type is highly lossy compression, the text is achromatic text, and the brightness of the foreground and the background are very different, and can be compressed with a larger compression quality.
  • step S104 may include: encoding and compressing each of the character macroblocks according to a compression type of each of the character macroblocks, wherein the installation is highly lossy compression, moderately lossy compression The sequential compression ratios of high-definition lossy compression, lossy compression and lossless compression requiring compensation are sequentially reduced.
  • the compression quality can be measured by the compression ratio of the code.
  • the compression ratio corresponding to the larger compression quality, the medium compression quality, and the higher resolution compression quality is sequentially reduced, that is, the compression ratio of the larger compression quality is > medium.
  • Compression ratio of compression quality > compression ratio of compression quality of higher definition compression ratios corresponding to different compression qualities can be selected according to actual needs.
  • the video encoding method according to the present disclosure may further include: encoding the compressed text macroblock and carrying the compression type of the macroblock in the packet data; and transmitting the packet to the video decoding device. decoding.
  • Text macroblocks with lossy compression and lossless compression can be distinguished, for example, by using the "NAL type" field of the packet data.
  • the pixel data of the screen change area is subjected to macroblock division, and the compression type of the character macroblock is determined according to the color data of the divided character macroblock, and the character macroblock is determined according to different compression types.
  • the coding compression is performed with different compression qualities, so that the corresponding compression quality is selected while the text clarity is satisfied, the compression efficiency is improved as much as possible, and the bandwidth is reduced to adapt to a desktop office scene containing a large amount of text.
  • FIG. 2 is another flow diagram of a video encoding method in accordance with an embodiment of the present disclosure.
  • another flow of the video encoding method according to an embodiment of the present disclosure includes steps S201 to S207.
  • step S201 for the changed area of the cloud desktop, the pixels of the original screen data are color-space converted, and the converted data is sub-sampled as needed.
  • the original pixels captured in the cloud desktop are based on the RGB color space data, which can be converted to the YUV color space required for video encoding, that is, the data of the RGB color space is converted into the YUV color space, and converted as needed.
  • YUV data is sub-sampled.
  • the YUV color space contains a Y luminance component and two chrominance components U and V.
  • step S202 the pixel data of the converted screen change area is subjected to macroblock division according to the size of the coding unit according to the needs of the encoder.
  • step S203 the macroblock is classified into a character macroblock or an image macroblock according to the pixel gradient of the macroblock.
  • step S204 the image macroblock is encoded and compressed.
  • step S205 the pixel color information in each character macroblock is analyzed, and each character macroblock is classified according to high-loss lossy compression, moderate lossy compression, high-definition lossy compression, lossy compression and lossless compression requiring compensation. .
  • step S206 according to the compression type of each character macroblock, the corresponding coding mode and compression quality are selected to encode and compress each character macroblock.
  • the compression type of the text macroblock is lossy compression (for example, highly lossy compression, moderate lossy compression, high definition lossy compression, lossy compression requiring compensation)
  • the corresponding coding is selected according to the lossy compression coding method.
  • the compression quality (such as compression ratio) is encoded and compressed; if the compression type is lossless compression, encoding compression is performed according to the lossless compression encoding method.
  • step S207 the encoded image macroblock and the character macroblock are encapsulated one by one, and the encapsulated packet data is sent to the decoding end.
  • step 207 packet data for lossy compression and lossless compression needs to be distinguished, for example, using a different "NAL type" for differentiation.
  • Loss compression for example, a unique qp value can be used to identify the compression quality, and a qp value can also be used to distinguish between a text macroblock and an image macroblock.
  • the pixel data of the screen change area is subjected to macroblock division, and the compression type of the character macroblock is determined according to the color data of the divided character macroblock, and the character macroblock is determined according to different compression types.
  • the coding compression is performed with different compression qualities, so that the corresponding compression quality is selected while the text clarity is satisfied, the compression efficiency is improved as much as possible, and the bandwidth is reduced to adapt to a desktop office scene containing a large amount of text.
  • FIG. 3 is a block diagram showing a structure of a video encoding apparatus according to an embodiment of the present disclosure.
  • a video encoding apparatus may include a macro partitioning module 301, a macro classification module 302, a type determining module 303, and an encoding module 304.
  • the macro division module 301 is configured to perform macroblock division on pixel data of the screen change area.
  • the macro classification module 302 is configured to classify the divided macroblocks into text macroblocks or image macroblocks according to pixel data in the macroblock.
  • the type determining module 303 is configured to determine, according to the color data in each character macroblock, a compression type of each character macroblock;
  • the encoding module 304 is configured to encode and compress each character macroblock according to the compression type of each character macroblock by using a corresponding compression quality.
  • the macro partitioning module 301 can perform macroblock partitioning according to the coding unit size of 16x16, 32x32, 64x64, etc. according to the needs of the encoder.
  • the macro classification module 302 is further configured to analyze the gradient features of the internal pixel data distribution of each macroblock and classify the macroblocks into image macroblocks or text macroblocks according to the gradient features.
  • the type determining module 303 may include: a statistical submodule configured to establish statistical information according to pixel data of each character macroblock; and a classification submodule configured to perform each character macroblock according to a color feature in the statistical information. And a compression type determining sub-module, configured to determine a compression type of each type of text macroblock according to a difference between a foreground color and a background color of a color feature of a different type of text macroblock, wherein the compression type includes a highly lossy compression At least one of moderate lossy compression, high definition lossy compression, lossy compression and lossless compression requiring compensation.
  • the encoding module 304 is further configured to perform encoding compression on each character macroblock according to a compression type of each character macroblock, wherein the height is lossy compression, medium lossy compression, and high definition.
  • the sequential compression ratios of lossy compression, lossy compression, and lossless compression that need to be compensated are sequentially reduced.
  • the video encoding apparatus further includes: a packet module, configured to encapsulate the encoded compressed macroblock and carry a compression type of the macroblock in the packet data; and a sending module, configured to: The packet is sent to a video decoding device for decoding.
  • Text macroblocks with lossy compression and lossless compression can be distinguished, for example, by using the "NAL type" field of the packet data.
  • Loss compression for example, a unique qp value can be used to identify the compression quality, and a qp value can also be used to distinguish between a text macroblock and an image macroblock.
  • the video encoding apparatus may further include a color conversion and sampling module for performing color conversion on the original pixels of the changed area of the desktop to be converted into video encoding before performing macroblock division. Color space and sample color as necessary.
  • the pixel data of the screen change area is subjected to macroblock division, the compression type of the character macroblock is determined according to the color data of the divided character macroblock, and the character macroblock is determined according to different compression types.
  • the coding compression is performed with different compression qualities, so that the corresponding compression quality is selected while the text clarity is satisfied, the compression efficiency is improved as much as possible, and the bandwidth is reduced to adapt to a desktop office scene containing a large amount of text.
  • FIG. 4 is a flow chart of a video decoding method in accordance with an embodiment of the present disclosure.
  • a video decoding method includes steps S401 and S402.
  • step S401 the packet sent by the video encoding apparatus is received and parsed, and the packet carries compressed data and a compression type of the compressed data.
  • step S402 the compressed data is subjected to decoding processing according to the type of compression.
  • the compression type of the compressed data is lossy compression or lossless compression.
  • step S402 may include: if the compression type of the compressed data is lossless compression, performing decoding of the compressed data without loss compression; if the compression type of the compressed data is lossy compression, performing the compressed data without Destruction of lossy compression.
  • the text macroblock can be decoded by using different compression qualities according to the compression type of the character macroblock, and the desktop office scene containing a large amount of text can be adapted while satisfying the text definition.
  • FIG. 5 is another flow chart of a video decoding method in accordance with an embodiment of the present disclosure.
  • FIG. 5 another flow of the video decoding method according to an embodiment of the present disclosure includes steps S501 to S510.
  • step S501 the packet data sent by the encoding end is received.
  • step S502 the received packet data is parsed, and it is judged whether it is lossy compression or lossless compression according to the "NAL type" field of the packet data. If it is lossless compression, step S503 is performed, and if lossy compression is performed, step S504 is performed.
  • step S503 the packet data is subjected to lossless compression decoding.
  • step S504 the compressed macroblock in each packet data is subjected to lossy compression decoding.
  • the packet data can carry the compression quality of the text macroblock to identify the compression type of each text macroblock (including highly lossy compression, moderate lossy compression, high definition lossy compression, lossy compression requiring compensation) And the unique qp value can be used to identify the compression quality, and the qp value can also be used to distinguish between the text macroblock and the image macroblock.
  • the decoding end may process the macroblock by a preset algorithm to increase the clarity of the decoded text.
  • step S505 according to the color sampling format of the current encoding end, it is determined whether it is necessary to construct a data frame based on 4:4:4 full sampling. If necessary, step S506 is performed; otherwise, step S509 is performed.
  • step S505 is performed after decoding of all macroblocks is completed.
  • the decoded sub-sampling format of the encoded end is required to be performed, and the sub-sampled data is filled into the decoded frame.
  • the basis for judging whether it is necessary to construct a 4:4:4 fully sampled data frame is: if the encoding end uses a 4:4:4 fully sampled data frame, no construction is required; if the encoding end has a sampling format of 4 : 2:2 or 4:2:0 sub-sampling, the data decoded by the decoding end is 4:2:2 or 4:2:0 YUV data, in which case a 4:4:4 sampled data frame needs to be constructed. For storing complete YUV data.
  • the 4:4:4 fully sampled data frame is only used for the client's display, and is not used as a reference for the decoder.
  • a data frame based on 4:4:4 full sampling is constructed.
  • the pixel data In fully sampled data frames, the pixel data needs to be expanded, corrected, and backfilled.
  • step S507 it is detected whether the lossy decoded macroblock has a macroblock identified as requiring color correction, and if so, step S508 is performed, otherwise step S509 is performed.
  • the macroblocks that need to be color corrected are color corrected based on the 4:4:4 fully sampled data frame.
  • the decoded data is padded into a 4:4:4 fully sampled data frame.
  • the decoded 4:4:4 fully sampled data frame is converted from the YUV color space to the desired RGB color space.
  • the text macroblock can be decoded by using different compression qualities according to the compression type of the character macroblock, and the desktop office scene containing a large amount of text can be adapted while satisfying the text definition.
  • FIG. 6 is a block diagram showing a structure of a video decoding apparatus according to an embodiment of the present disclosure.
  • a video decoding apparatus includes a receiving module 601, a parsing module 602, and a decoding module 603.
  • the receiving module 601 is configured to receive a packet sent by the video encoding device, where the packet carries compressed data and a compression type of the compressed data.
  • the parsing module 602 is configured to parse the packet and obtain a compression type.
  • the decoding module 603 is configured to perform decoding processing on the compressed data according to the compression type.
  • a video decoding apparatus may perform a video decoding method according to the present disclosure, and a specific decoding process is not described herein.
  • a video decoding apparatus further includes a data restoration module for creating a fully sampled data frame as needed, and expanding the decoded pixel data in the fully sampled data frame. , correction and backfilling.
  • a video decoding apparatus further includes a color conversion module for performing color conversion on pixel data in a 4:4:4 fully sampled data frame according to a color space requirement required for final display. .
  • the character macroblock can be decoded by using different compression qualities according to the compression type of the character macroblock, and the desktop office scene containing a large amount of text can be accommodated while satisfying the text definition.
  • FIG. 7 is a schematic diagram of a video codec system in accordance with an embodiment of the present disclosure.
  • a video codec system includes a video encoding device 701 and a video decoding device 702.
  • Video encoding device 701 may employ video encoding devices in accordance with various embodiments of the present disclosure
  • video decoding device 702 may employ video decoding devices in accordance with various embodiments of the present disclosure.
  • the pixel data of the screen change area is subjected to macroblock division at the time of encoding, and the compression type of the character macroblock is determined according to the color data of the divided character macroblock, and according to different compression types.
  • the coding macro compression is performed on the macroblocks with different compression qualities, so that the corresponding compression quality is selected while the text clarity is satisfied, the compression efficiency is improved as much as possible, and the bandwidth is reduced to adapt to the desktop office scene containing a large amount of text.
  • Embodiments of the present disclosure also provide a storage medium having stored thereon one or more programs, the one or more processors being executed by one or more processors, the one or more processors executing according to the present disclosure Video coding method of each embodiment.
  • Embodiments of the present disclosure also provide a storage medium having stored thereon one or more programs, the one or more processors being executed by one or more processors, the one or more processors executing according to the present disclosure Video decoding method of each embodiment.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Discrete Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)

Abstract

本公开提供了一种视频编解码方法、装置、系统及存储介质。所述视频编码方法包括:对屏幕变化区域的像素数据进行宏块划分;根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块;根据各文字宏块内的色彩数据确定所述各文字宏块的压缩类型;以及根据所述各文字宏块的压缩类型采用对应的压缩质量对所述各文字宏块进行编码压缩。

Description

视频编解码方法、装置、系统及存储介质 技术领域
本公开实施例涉及(但不限于)通讯技术领域。
背景技术
随着云计算、虚拟桌面等技术的普及,大量的解码端数据需要通过网络传输到用户端进行处理,为了满足窄带宽下网络状况下桌面内容的传输,基于屏幕内容的视频编码技术被广泛应用在虚拟桌面领域。
发明内容
根据本公开实施例的一个方面,提供一种视频编码方法,包括:对屏幕变化区域的像素数据进行宏块划分;根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块;根据各文字宏块内的色彩数据确定所述各文字宏块的压缩类型;以及根据所述各文字宏块的压缩类型采用对应的压缩质量对所述各文字宏块进行编码压缩。
根据本公开实施例的另一个方面,提供一种视频解码方法,包括:接收并解析视频编码装置发送的封包,所述封包中携带有压缩数据以及所述压缩数据的压缩类型;以及根据所述压缩类型对所述封包中的压缩数据进行解码处理。
根据本公开实施例的另一个方面,提供一种视频编码装置,包括:宏划分模块,用于对屏幕变化区域的像素数据进行宏块划分;宏分类模块,用于根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块;类型确定模块,用于根据各文字宏块内的色彩数据确定所述各文字宏块的压缩类型;以及编码模块,用于根据所述各文字宏块的压缩类型采用对应的压缩质量对所述各文字宏块进行编码压缩。
根据本公开实施例的另一个方面,提供一种视频解码装置,包括:接收模块,用于接收视频编码装置发送的封包,所述封包中携带 有压缩数据以及所述压缩数据的压缩类型;解析模块,用于解析所述封包,并获取所述压缩类型;以及解码模块,用于根据所述压缩类型对所述压缩数据进行解码处理。
根据本公开实施例的另一个方面,提供一种视频编解码系统,包括根据本公开的视频编码装置以及根据本公开的视频解码装置。
根据本公开实施例的另一个方面,提供一种存储介质,其上存储有一个或者多个程序,所述一个或者多个程序被一个或者多个处理器执行时,所述一个或多个处理器执行根据本公开的视频编码方法。
根据本公开实施例的另一个方面,提供一种存储介质,其上存储有一个或者多个程序,所述一个或者多个程序被一个或者多个处理器执行时,所述一个或多个处理器执行根据本公开的视频解码方法。
附图说明
图1为根据本公开实施例的视频编码方法的流程图;
图2为根据本公开实施例的视频编码方法的另一流程图;
图3为根据本公开实施例的视频编码装置的模块结构示意图;
图4为根据本公开实施例的视频解码方法的流程图;
图5为根据本公开实施例的视频解码方法的另一流程图;
图6为根据本公开实施例的视频解码装置的模块结构示意图;以及
图7为根据本公开实施例的视频编解码系统的示意图。
本公开实施例目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。
具体实施方式
为了使本公开实施例所要解决的技术问题、技术方案及有益效果更加清楚、明白,以下结合附图和实施例,对本公开实施例进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本公开实施例,并不用于限定本公开实施例。
远程传输分类编码技术大致分为两类:第一类,把整个屏幕粗 略的划分成等分区域,根据区域的色彩粗略的分为文字和图片,然后对文字使用无损压缩,对图片进行有损压缩,但是对文字使用无损压缩不能满足低带宽下的传输要求;第二类,在视频压缩技术中,对帧内预测的宏块进行分类,区分出文字和图像,然后对文字和图像变换后的高频系数和低频系数分多层次的压缩传输,这种分类方式需要多次压缩,并且对文字压缩效率不高,特别是对含有大量文字的桌面办公场景,适用性不强。
屏幕共享的交互特征对视频编码的清晰度和传输的实时性提出较高要求,例如,在云桌面办公的场景中,屏幕内容含有大量细线条如字体、边框等,有些甚至是仅为一个像素的单色线条。目前,视频编码技术对于文字等非连续色调内容的压缩效率不高,特别是对含有大量文字的桌面办公场景,适用性不强。
本公开各实施例提出一种视频编解码方法、装置、系统及存储介质,以提高对文字等非连续色调内容的压缩效率,特别是对含有大量文字的桌面办公场景,提高视频编解码的适用性。
图1为根据本公开实施例的视频编码方法的流程图。
参见图1,根据本公开实施例的视频编码方法包括步骤S101装置S104。
在步骤S101,对屏幕变化区域的像素数据进行宏块划分。
在步骤S102,根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块。
在步骤S103,根据各文字宏块内的色彩数据确定各文字宏块的压缩类型。
在步骤S104,根据各文字宏块的压缩类型采用对应的压缩质量对各文字宏块进行编码压缩。
在一个实施例中,在执行步骤S101时,可以根据编码器的需要,按照16x16、32x32、64x64等编码单元大小进行宏块的划分。
在一个实施例中,步骤S102可以包括:分析每一个宏块的内部像素数据分布的梯度特征,并根据所述梯度特征把宏块分类为图像宏块或文字宏块。
图像的像素数据的分布具有渐进性,像素数据的变化梯度小,文字的边界较多,像素的变化梯度较大,因此可以设置阈值,把梯度变化大于阈值宏块分类为文字宏块,并把其余的宏块分类为图像宏块。
在一个实施例中,步骤S103可以包括:根据各文字宏块的像素数据建立统计信息;根据统计信息中的颜色特征将各文字宏块进行分类;以及根据不同类别的文字宏块的颜色特征的前景色和背景色的差异,确定各类别的文字宏块的压缩类型,其中,压缩类型包括高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩中的至少一种。
具体的,对文字宏块内的像素建立统计信息之后,可以选取例如亮度色度信号(YUV)组合最多的若干组数据。
可以根据颜色特征分类为非彩色系文字宏块、红色系文字宏块、蓝色系文字宏块、混合色系文字宏块(即同时包含了红色系和蓝色系的文字宏块)四大类。
在确定各类别的文字宏块的压缩类型时,可以进一步分析各文字宏块的前景色和背景色的差异,将前景色和背景色的差值大于第一阈值的非彩色系文字宏块的压缩类型确定为高度有损压缩;将前景色和背景色的差值小于或等于第一阈值并大于第二阈值的非彩色系文字宏块的压缩类型确定为中度有损压缩;将红色系文字宏块或者人眼比较敏感的蓝色系文字宏块的压缩类型确定为需要补偿的有损压缩;将混合色系宏块的压缩类型确定为无损压缩;并且将其余文字宏块的压缩类型确定为高清晰度有损压缩。
在一个实施例中,在根据各文字宏块的压缩类型采用对应的压缩质量对各文字宏块进行编码压缩(即,步骤S104)时,可以根据文字宏块的压缩类型选取适当的编码方式和编码质量,以保证画面的清晰度。
编码方式和编码质量的选择可以遵循以下原则:对压缩类型为高度有损压缩的文字宏块,其文字是非彩色文字,且前景和背景的亮度相差很大,可以使用较大的压缩质量来压缩,不会造成文字的模糊或者产生色差;对压缩类型为中度有损压缩的文字宏块,其文字是非 彩色文字,且前景和背景的亮度差别不太小,可以使用中等的压缩质量来压缩,不会造成文字模糊或者产生色差;对压缩类型为高清晰度有损压缩的文字宏块,其文字是非彩色或者同色系的文字,且前景和背景的亮度是差别很小,因此需要使用较高清晰度的压缩质量来压缩才不会造成模糊;对压缩类型为需要补偿的有损压缩的文字宏块,其文字的前景和背景亮度差别较大,在使用非4:4:4完全采样时,容易造成色差的,需要在解码端对非完全采样复原时对色度进行补偿校正;并且对压缩类型为无损压缩的文字宏块,其文字的前景和背景都属于包含色彩且色度相差较大,在使用非4:4:4完全采样时,容易造成模糊的文字,需要使用完全无损的压缩方式进行压缩。
在一个实施例中,步骤S104可以包括:根据所述各文字宏块的压缩类型采用对应的压缩比对所述各文字宏块进行编码压缩,其中,安装高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩的顺序压缩比依次减小。
压缩质量可以用编码的压缩比衡量,较大的压缩质量、中等的压缩质量、较高清晰度的压缩质量对应的压缩比是依次降低的,即,较大的压缩质量的压缩比>中等的压缩质量的压缩比>较高清晰度的压缩质量的压缩比,不同的压缩质量对应的压缩比可以根据实际需要进行选择。
在一个实施例中,根据本公开的视频编码方法还可以包括:对编码压缩后的文字宏块进行封包并在封包数据中携带文字宏块的压缩类型;以及将该封包发送给视频解码装置进行解码。
可以对有损压缩和无损压缩的文字宏块进行区分,例如,利用封包数据的“NAL type”字段进行区分。
根据本实施例的视频编码方法,将屏幕变化区域的像素数据进行宏块划分,根据划分出的文字宏块的色彩数据确定个文字宏块的压缩类型,并根据不同的压缩类型对文字宏块采用不同的压缩质量执行编码压缩,从而,在满足文字清晰度的同时,选择相应的压缩质量,尽可能的提高压缩效率,降低带宽,以适应含有大量文字的桌面办公场景。
图2为根据本公开实施例的视频编码方法的另一流程图。
参见图2,根据本公开实施例的视频编码方法的另一流程包括步骤S201至S207。
在步骤S201,针对云桌面的变化区域,对原始屏幕数据的像素进行色彩空间转换,并根据需要对转换后的数据进行亚采样。
在云桌面中捕获到的原始像素是基于RGB色彩空间的数据,可以转换到视频编码所需要的YUV色彩空间,即,将RGB色彩空间的数据转换到YUV色彩空间,并根据需要对转换后的YUV数据进行亚采样。YUV色彩空间包含了一个Y亮度分量和U、V两个色度分量。
在步骤S202,根据编码器的需要,对转换后的屏幕变化区域的像素数据按照编码单元的大小进行宏块划分。
在步骤S203,根据宏块的像素梯度将宏块分类为文字宏块或图像宏块。
在步骤S204,对图像宏块进行编码压缩。
在步骤S205,分析各文字宏块内的像素色彩信息,把各文字宏块按照高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩和无损压缩进行分类。
具体的分类方法可参考前述实施例,此处不在赘述。
在步骤S206,根据各文字宏块的压缩类型,选取对应的编码方式和压缩质量对各文字宏块进行编码压缩。
如果文字宏块的压缩类型是有损压缩(例如,高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩),则按照有损压缩的编码方式选择对应的压缩质量(如压缩比)进行编码压缩;如果压缩类型是无损压缩,则按照无损压缩的编码方式进行编码压缩。
在步骤S207,将编码压缩后的图像宏块及文字宏块逐一进行封包,并将封包后的封包数据发送给解码端。
在步骤207中,需要对有损压缩和无损压缩的封包数据进行区分,例如,使用不同的“NAL type”进行区分。
还可以在封包数据中携带文字宏块的压缩质量,以用于标识每个文字宏块的压缩类型(包括高度有损压缩、中度有损压缩、高清晰 度有损压缩、需要补偿的有损压缩),例如,可以采用专属的qp值来标识压缩质量,也可以通过qp值来区分文字宏块和图像宏块。
根据本实施例的视频编码方法,将屏幕变化区域的像素数据进行宏块划分,根据划分出的文字宏块的色彩数据确定个文字宏块的压缩类型,并根据不同的压缩类型对文字宏块采用不同的压缩质量执行编码压缩,从而,在满足文字清晰度的同时,选择相应的压缩质量,尽可能的提高压缩效率,降低带宽,以适应含有大量文字的桌面办公场景。
图3为根据本公开实施例的视频编码装置的模块结构示意图。
参见图3,根据本公开实施例的视频编码装置可以包括宏划分模块301、宏分类模块302、类型确定模块303和编码模块304。
宏划分模块301用于对屏幕变化区域的像素数据进行宏块划分。
宏分类模块302用于根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块。
类型确定模块303用于根据各文字宏块内的色彩数据确定各文字宏块的压缩类型;
编码模块304用于根据各文字宏块的压缩类型采用对应的压缩质量对各文字宏块进行编码压缩。
在一个实施例中,宏划分模块301可以根据编码器的需要,按照16x16、32x32、64x64等编码单元大小进行宏块的划分。
在一个实施例中,宏分类模块302还用于分析每一个宏块的内部像素数据分布的梯度特征,并根据梯度特征把宏块分类为图像宏块或文字宏块。
在一个实施例中,类型确定模块303可以包括:统计子模块,用于根据各文字宏块的像素数据建立统计信息;分类子模块,用于根据统计信息中的颜色特征将各文字宏块进行分类;以及压缩类型确定子模块,用于根据不同类别的文字宏块的颜色特征的前景色和背景色的差异,确定各类别的文字宏块的压缩类型,其中,压缩类型包括高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩中的至少一种。
具体的分类方法可参考前述实施例,此处不在赘述。
在一个实施例中,编码模块304还用于根据各文字宏块的压缩类型采用对应的压缩比对各文字宏块进行编码压缩,其中,按照高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩的顺序压缩比依次减小。
在一个实施例中,根据本公开的视频编码装置还包括:封包模块,用于对编码压缩后的文字宏块进行封包并在封包数据中携带文字宏块的压缩类型;以及发送模块,用于将封包发送给视频解码装置进行解码。
可以对有损压缩和无损压缩的文字宏块进行区分,例如,利用封包数据的“NAL type”字段进行区分。
还可以在封包数据中携带文字宏块的压缩质量,以用于标识每个文字宏块的压缩类型(包括高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩),例如,可以采用专属的qp值来标识压缩质量,也可以通过qp值来区分文字宏块和图像宏块。
在一个实施例中,根据本公开的视频编码装置还可以包括色彩转换和采样模块,用于在进行宏块划分前,对桌面的变化区域的原始像素进行色彩转换,以转换成视频编码使用的色彩空间,并在必要时进行色彩的抽样。
根据本实施例的视频编码装置,将屏幕变化区域的像素数据进行宏块划分,根据划分出的文字宏块的色彩数据确定个文字宏块的压缩类型,并根据不同的压缩类型对文字宏块采用不同的压缩质量执行编码压缩,从而,在满足文字清晰度的同时,选择相应的压缩质量,尽可能的提高压缩效率,降低带宽,以适应含有大量文字的桌面办公场景。
图4为根据本公开实施例的视频解码方法的流程图。
参见图4,根据本公开实施例的视频解码方法包括步骤S401和S402。
在步骤S401,接收并解析视频编码装置发送的封包,该封包中携带有压缩数据以及压缩数据的压缩类型。
在步骤S402,根据压缩类型对压缩数据进行解码处理。
可以根据,例如,封包数据的“NAL type”字段判断压缩数据的压缩类型是有损压缩还是无损压缩。
在一个实施例中,步骤S402可以包括:若压缩数据的压缩类型为无损压缩,则对该压缩数据进行无损压缩的解码;若该压缩数据的压缩类型为有损压缩,则对压缩数据进行无有损压缩的解码。
根据本实施例的视频解码方法,可以根据文字宏块的压缩类型对文字宏块采用不同的压缩质量执行解码,在满足文字清晰度的同时,能够适应含有大量文字的桌面办公场景。
图5为根据本公开实施例的视频解码方法的另一流程图。
参见图5,根据本公开实施例的视频解码方法的另一流程包括步骤S501至S510。
在步骤S501,接收编码端发送的封包数据。
在步骤S502,解析接收的封包数据,并根据封包数据的“NAL type”字段判断是有损压缩还是无损压缩,若是无损压缩,则执行步骤S503,若是有损压缩,则执行步骤S504。
在步骤S503,对封包数据进行无损压缩的解码。
在步骤S504,对每一个封包数据中的压缩后的宏块进行有损压缩的解码。
封包数据可以携带文字宏块的压缩质量,以用于标识每个文字宏块的压缩类型(包括高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩),并且可以采用专属的qp值来标识压缩质量,也可以通过qp值来区分文字宏块和图像宏块。
在解码时,如果qp值标识出某个宏块为文字宏块,则解码端可以采用预设的算法对该文字宏块进行处理,以增加解码后文字显示的清晰程度。
在步骤S505,根据当前编码端的色彩采样格式,判断是否需要构造基于4:4:4完全采样的数据帧,若需要,则执行步骤S506,否则,执行步骤S509。
实际应用中,步骤S505在完成所有宏块的解码后执行。
如果编码端采用的是非4:4:4完全采样,则需要对解码后的无损的数据进行编码端相同的亚采样格式,并把亚采样后的数据填写到解码帧中。
判断是否需要构造基于4:4:4完全采样的数据帧的依据是:如果编码端采用的是基于4:4:4完全采样的数据帧,则不需要进行构造;如果编码端的采样格式为4:2:2或者4:2:0亚采样,则解码端解码出的数据是4:2:2或者4:2:0的YUV数据,此时需要构建一个4:4:4采样的数据帧,用于存放完整的YUV数据。
需要说明的是,4:4:4完全采样的数据帧只用于客户端的显示,而不做为解码端的参考使用。
在步骤S506,构造基于4:4:4完全采样的数据帧。
在完全采样的数据帧中,需要对像素数据进行扩充、校正和回填。
在步骤S507,检测有损解码的宏块是否有标识为需要进行颜色校正的宏块,如果有,则执行步骤S508,否则执行步骤S509。
在步骤S508,基于4:4:4完全采样的数据帧对需要进行颜色校正的宏块进行颜色校正。
在步骤S509,将解码的数据填充到基于4:4:4完全采样的数据帧中。
在步骤S510,将完成解码的4:4:4完全采样的数据帧,从YUV色彩空间转换到显示需要的RGB色彩空间。
根据本实施例的视频解码方法,可以根据文字宏块的压缩类型对文字宏块采用不同的压缩质量执行解码,在满足文字清晰度的同时,能够适应含有大量文字的桌面办公场景。
图6为根据本公开实施例的视频解码装置的模块结构示意图。
参见图6,根据本公开实施例的视频解码装置包括接收模块601、解析模块602和解码模块603。
接收模块601用于接收视频编码装置发送的封包,该封包中携带有压缩数据以及压缩数据的压缩类型。
解析模块602用于解析该封包,并获取压缩类型。
解码模块603用于根据压缩类型对压缩数据进行解码处理。
据本公开实施例的视频解码装置可以执行根据本公开的视频解码方法,具体解码过程此处不在赘述。
在一个实施例中,据本公开实施例的视频解码装置还包括数据复原模块,用于根据需要创建完全抽样的数据帧,并在该完全抽样的数据帧中,对解码后的像素数据进行扩充、校正和回填。
具体创建过程请参见前述实施例,此处不在赘述。
在一个实施例中,据本公开实施例的视频解码装置还包括色彩转换模块,用于根据最终显示需要的色彩空间要求,对4:4:4完全采样的数据帧中的像素数据进行色彩转换。
根据本实施例的视频解码装置,可以根据文字宏块的压缩类型对文字宏块采用不同的压缩质量执行解码,在满足文字清晰度的同时,能够适应含有大量文字的桌面办公场景。
图7为根据本公开实施例的视频编解码系统的示意图。
参见图7,根据本公开实施例的视频编解码系统包括视频编码装置701和视频解码装置702。
视频编码装置701可以采用根据本公开各实施例的视频编码装置,视频解码装置702可以采用根据本公开各实施例的视频解码装置。
根据本实施例的视频编解码系统,在编码时将屏幕变化区域的像素数据进行宏块划分,根据划分出的文字宏块的色彩数据确定个文字宏块的压缩类型,并根据不同的压缩类型对文字宏块采用不同的压缩质量执行编码压缩,从而,在满足文字清晰度的同时,选择相应的压缩质量,尽可能的提高压缩效率,降低带宽,以适应含有大量文字的桌面办公场景。
本公开实施例还提供一种存储介质,其上存储有一个或者多个程序,所述一个或者多个程序被一个或者多个处理器执行时,所述一个或多个处理器执行根据本公开各实施例的视频编码方法。
本公开实施例还提供一种存储介质,其上存储有一个或者多个程序,所述一个或者多个程序被一个或者多个处理器执行时,所述一个或多个处理器执行根据本公开各实施例的视频解码方法。
以上参照附图对本公开各实施例进行了说明,而非旨在限制本公开的范围。本领域技术人员在不脱离本公开的范围和实质内容的情况下所作的任何修改、等同替换和改进,均应在本公开的范围之内。

Claims (15)

  1. 一种视频编码方法,包括:
    对屏幕变化区域的像素数据进行宏块划分;
    根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块;
    根据各文字宏块内的色彩数据确定所述各文字宏块的压缩类型;以及
    根据所述各文字宏块的压缩类型采用对应的压缩质量对所述各文字宏块进行编码压缩。
  2. 如权利要求1所述的视频编码方法,其中,根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块的步骤包括:
    分析每一个宏块的内部像素数据分布的梯度特征,并根据所述梯度特征把宏块分类为图像宏块或文字宏块。
  3. 如权利要求1所述的视频编码方法,其中,根据各文字宏块内的色彩数据确定所述各文字宏块的压缩类型的步骤包括:
    根据所述各文字宏块的像素数据建立统计信息;
    根据所述统计信息中的颜色特征将所述各文字宏块进行分类;以及
    根据不同类别的文字宏块的颜色特征的前景色和背景色的差异,确定各类别的文字宏块的压缩类型,
    其中,所述压缩类型包括高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩中的至少一种。
  4. 如权利要求3所述的视频编码方法,其中,根据所述各文字宏块的压缩类型采用对应的压缩质量对所述各文字宏块进行编码压缩的步骤包括:
    根据所述各文字宏块的压缩类型采用对应的压缩比对所述各文 字宏块进行编码压缩,
    其中,按照高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩的顺序压缩比依次减小。
  5. 如权利要求1所述的视频编码方法,还包括:
    对编码压缩后的文字宏块进行封包并在封包数据中携带文字宏块的压缩类型;以及
    将所述封包发送给视频解码装置进行解码。
  6. 一种视频解码方法,包括:
    接收并解析视频编码装置发送的封包,所述封包中携带有压缩数据以及所述压缩数据的压缩类型;以及
    根据所述压缩类型对所述压缩数据进行解码处理。
  7. 一种视频编码装置,包括:
    宏划分模块,用于对屏幕变化区域的像素数据进行宏块划分;
    宏分类模块,用于根据宏块内的像素数据将划分出的宏块分类为文字宏块或图像宏块;
    类型确定模块,用于根据各文字宏块内的色彩数据确定所述各文字宏块的压缩类型;以及
    编码模块,用于根据所述各文字宏块的压缩类型采用对应的压缩质量对所述各文字宏块进行编码压缩。
  8. 如权利要求7所述的视频编码装置,其中,所述分类子模块还用于分析每一个宏块的内部像素数据分布的梯度特征,并根据所述梯度特征把宏块分类为图像宏块或文字宏块。
  9. 如权利要求7所述的视频编码装置,其中,所述类型确定模块包括:
    统计子模块,用于根据所述各文字宏块的像素数据建立统计信 息;
    分类子模块,用于根据所述统计信息中的颜色特征将所述各文字宏块进行分类;以及
    压缩类型确定子模块,用于根据不同类别的文字宏块的颜色特征的前景色和背景色的差异,确定各类别的文字宏块的压缩类型,
    其中,所述压缩类型包括高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩中的至少一种。
  10. 如权利要求9所述的视频编码装置,其中,所述编码模块还用于根据所述各文字宏块的压缩类型采用对应的压缩比对所述各文字宏块进行编码压缩,
    其中,按照高度有损压缩、中度有损压缩、高清晰度有损压缩、需要补偿的有损压缩及无损压缩的顺序压缩比依次减小。
  11. 如权利要求7所述的视频编码装置,还包括:
    封包模块,用于对编码压缩后的文字宏块进行封包并在封包数据中携带文字宏块的压缩类型;以及
    发送模块,用于将所述封包发送给视频解码装置进行解码。
  12. 一种视频解码装置,包括:
    接收模块,用于接收视频编码装置发送的封包,所述封包中携带有压缩数据以及所述压缩数据的压缩类型;
    解析模块,用于解析所述封包,并获取所述压缩类型;以及
    解码模块,用于根据所述压缩类型对所述压缩数据进行解码处理。
  13. 一种视频编解码系统,包括如权利要求7至11中任一项所述的视频编码装置以及如权利要求12所述的视频解码装置。
  14. 一种存储介质,其上存储有一个或者多个程序,所述一个 或者多个程序被一个或者多个处理器执行时,所述一个或者多个处理器执行如权利要求1-5任一项所述的视频编码方法。
  15. 一种存储介质,其上存储有一个或者多个程序,所述一个或者多个程序被一个或者多个处理器执行时,所述一个或者多个处理器执行如权利要求6所述的视频解码方法。
PCT/CN2019/084836 2018-05-02 2019-04-28 视频编解码方法、装置、系统及存储介质 WO2019210822A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US17/052,256 US11445201B2 (en) 2018-05-02 2019-04-28 Video encoding and decoding method, device, and system, and storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810410250.0 2018-05-02
CN201810410250.0A CN110446041B (zh) 2018-05-02 2018-05-02 一种视频编解码方法、装置、系统及存储介质

Publications (1)

Publication Number Publication Date
WO2019210822A1 true WO2019210822A1 (zh) 2019-11-07

Family

ID=68387058

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/084836 WO2019210822A1 (zh) 2018-05-02 2019-04-28 视频编解码方法、装置、系统及存储介质

Country Status (3)

Country Link
US (1) US11445201B2 (zh)
CN (1) CN110446041B (zh)
WO (1) WO2019210822A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112839226A (zh) * 2019-11-25 2021-05-25 深信服科技股份有限公司 一种图像编码、解码方法、相关装置及存储介质

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020177072A1 (en) * 2019-03-05 2020-09-10 Zte Corporation Cross-component quantization in video coding
CN111010582B (zh) * 2019-12-18 2022-01-18 深信服科技股份有限公司 一种云桌面图像处理方法、装置、设备及可读存储介质
CN111277829B (zh) * 2020-02-25 2023-03-24 西安万像电子科技有限公司 编解码方法及装置
CN111476853B (zh) * 2020-03-17 2024-05-24 西安万像电子科技有限公司 文字图像的编解码方法、设备及系统
CN111556317A (zh) * 2020-04-23 2020-08-18 西安万像电子科技有限公司 编码方法、装置及编解码系统
CN111953983A (zh) * 2020-07-17 2020-11-17 西安万像电子科技有限公司 视频编码方法及装置
CN111950042B (zh) * 2020-08-25 2023-11-24 西安万像电子科技有限公司 图像数据的处理方法、系统以及装置
CN112422973A (zh) * 2020-10-28 2021-02-26 西安万像电子科技有限公司 编码方法及装置
CN113365071B (zh) * 2021-06-08 2023-01-17 北京数科网维技术有限责任公司 一种图像分层压缩方法及图像分层压缩装置
CN115908165A (zh) * 2022-11-02 2023-04-04 阿里巴巴(中国)有限公司 云桌面图像的处理方法和图像处理方法、设备及存储介质
CN116781906A (zh) * 2023-07-03 2023-09-19 深圳市青葡萄科技有限公司 一种图文清晰度优化方法、设备及存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005184168A (ja) * 2003-12-17 2005-07-07 Mitsubishi Electric Corp 映像符号化装置および映像符号化方法
CN1687969A (zh) * 2005-05-12 2005-10-26 北京航空航天大学 基于文档图像内容分析与特征提取的文档图像压缩方法
CN107509079A (zh) * 2017-08-16 2017-12-22 西安万像电子科技有限公司 文字块编码方法及装置

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6314208B1 (en) * 1998-07-21 2001-11-06 Hewlett-Packard Company System for variable quantization in JPEG for compound documents
US6567559B1 (en) * 1998-09-16 2003-05-20 Texas Instruments Incorporated Hybrid image compression with compression ratio control
US6853755B2 (en) * 2001-03-28 2005-02-08 Sharp Laboratories Of America, Inc. Method and apparatus for adaptive compression of scanned documents
US7085420B2 (en) * 2002-06-28 2006-08-01 Microsoft Corporation Text detection in continuous tone image segments
KR100643759B1 (ko) * 2004-12-01 2006-11-10 삼성전자주식회사 문서 압축 장치 및 그 방법
US8064516B2 (en) * 2005-06-02 2011-11-22 Broadcom Corporation Text recognition during video compression
JP4717562B2 (ja) * 2005-09-02 2011-07-06 キヤノン株式会社 画像処理装置及び方法
JP4871015B2 (ja) * 2006-04-27 2012-02-08 株式会社リコー 画像処理装置、および画像処理方法
JP5003443B2 (ja) * 2007-12-04 2012-08-15 セイコーエプソン株式会社 画像処理装置および画像形成装置並びに画像処理方法,プログラム
JP4538514B2 (ja) * 2008-07-30 2010-09-08 シャープ株式会社 画像圧縮方法、画像圧縮装置、画像形成装置、コンピュータプログラム及び記録媒体
US8648858B1 (en) * 2009-03-25 2014-02-11 Skyfire Labs, Inc. Hybrid text and image based encoding
US11074495B2 (en) * 2013-02-28 2021-07-27 Z Advanced Computing, Inc. (Zac) System and method for extremely efficient image and pattern recognition and artificial intelligence platform
US10412414B2 (en) * 2012-08-13 2019-09-10 Gurulogic Microsystems Oy Decoder and method for decoding encoded input data containing a plurality of blocks or packets
JP6184196B2 (ja) * 2013-06-26 2017-08-23 キヤノン株式会社 画像処理方法及び画像処理装置
KR101599888B1 (ko) * 2014-05-02 2016-03-04 삼성전자주식회사 적응적 영상 데이터 압축 방법 및 장치
EP3304912A4 (en) * 2015-06-07 2018-06-06 Sharp Kabushiki Kaisha Systems and methods for optimizing video coding based on a luminance transfer function or video color component values
US10652583B2 (en) * 2016-08-19 2020-05-12 Apple Inc. Compression of image assets
CN106686384B (zh) * 2016-11-29 2019-05-24 西安万像电子科技有限公司 文字块的压缩方法和装置以及图像压缩方法和装置
CN106888380B (zh) * 2017-01-04 2019-05-03 西安万像电子科技有限公司 图像中文字块的编码方法及装置
US10455259B2 (en) * 2017-08-23 2019-10-22 Intel Corporation Enhanced text rendering and readability in images
CN107770539A (zh) * 2017-10-13 2018-03-06 西安万像电子科技有限公司 文字块的编码方法和装置及其解码方法和装置
CN107749989B (zh) * 2017-10-17 2019-11-05 西安万像电子科技有限公司 图像的编码方法和装置

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005184168A (ja) * 2003-12-17 2005-07-07 Mitsubishi Electric Corp 映像符号化装置および映像符号化方法
CN1687969A (zh) * 2005-05-12 2005-10-26 北京航空航天大学 基于文档图像内容分析与特征提取的文档图像压缩方法
CN107509079A (zh) * 2017-08-16 2017-12-22 西安万像电子科技有限公司 文字块编码方法及装置

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TONG YUBING ET AL: "Effectively Compress Document-snapping Pictures Based on Content Analysis and Characteristic Extract", RADIO ENGINEERING OF CHINA, vol. 34, no. 11, 20041130, pages 8 - 10, ISSN: 1003-3106 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112839226A (zh) * 2019-11-25 2021-05-25 深信服科技股份有限公司 一种图像编码、解码方法、相关装置及存储介质
CN112839226B (zh) * 2019-11-25 2024-02-23 深信服科技股份有限公司 一种图像编码、解码方法、相关装置及存储介质

Also Published As

Publication number Publication date
US11445201B2 (en) 2022-09-13
CN110446041B (zh) 2021-11-19
CN110446041A (zh) 2019-11-12
US20210377542A1 (en) 2021-12-02

Similar Documents

Publication Publication Date Title
WO2019210822A1 (zh) 视频编解码方法、装置、系统及存储介质
KR102562874B1 (ko) 피드백 채널에 의한 컬러 영역 적응
CN107950028B (zh) 用于处理视频数据的方法和设备
TWI670969B (zh) 至高動態範圍及/或廣色域視訊資料之固定轉移功能之內容自適應應用
US10200701B2 (en) HDR and WCG coding architecture with SDR backwards compatibility in a single bitstream for video coding
US9706212B2 (en) Palette mode in high efficiency video coding (HEVC) screen content coding (SCC)
RU2710291C2 (ru) Способы и устройства для кодирования и декодирования цветного изображения hdr
CN113115047B (zh) 视频编解码方法和设备
KR102523233B1 (ko) 컬러 픽처를 디코딩하는 방법 및 디바이스
CN109831668B (zh) 数据压缩方法及装置、数据编码/解码方法及装置
KR102509504B1 (ko) 코딩 및 디코딩 방법과 대응하는 디바이스
CN112954367B (zh) 使用调色板译码的编码器、解码器和相应方法
TW201931853A (zh) 具有聯合像素/變換為基礎之量化之視頻寫碼之量化參數控制
TWI713354B (zh) 用於顯示器調適之色彩重映射資訊sei信息發信號
WO2019203973A1 (en) Method and device for encoding an image or video with optimized compression efficiency preserving image or video fidelity
EP3026908A1 (en) Method and device for quantizing and de-quantizing a picture using scaling factors for chrominance based on luminance
TW202141979A (zh) 用於針對具有基於聯合圖元/變換的量化的視訊解碼的量化參數控制的方法

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19796804

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 11.03.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19796804

Country of ref document: EP

Kind code of ref document: A1