CN112218087A - Image encoding and decoding method, encoding and decoding device, encoder and decoder - Google Patents

Publication number
CN112218087A
Authority
CN
China
Prior art keywords
image
standard
standard image
image data
prediction residual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011359566.5A
Other languages
Chinese (zh)
Other versions
CN112218087B (en)
Inventor
罗伟节
刘星宇
滕波
任鹏
张峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Visual Intelligence Innovation Center Co ltd
Original Assignee
Zhejiang Smart Video Security Innovation Center Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Smart Video Security Innovation Center Co Ltd filed Critical Zhejiang Smart Video Security Innovation Center Co Ltd
Priority to CN202011359566.5A priority Critical patent/CN112218087B/en
Publication of CN112218087A publication Critical patent/CN112218087A/en
Application granted granted Critical
Publication of CN112218087B publication Critical patent/CN112218087B/en
Priority to PCT/CN2021/109724 priority patent/WO2022110870A1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H04N19/513 Processing of motion vectors
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses an image encoding and decoding method, an image encoding and decoding device, an image encoder and an image decoder. The image encoding processing method comprises the following steps: extracting a current image to be encoded from image data; identifying a standard image with a preset compression mode from the current image; calculating a motion vector and a prediction residual by taking the standard image as a reference frame; and encoding the current image according to the motion vector and the prediction residual to obtain encoded image data, and sending the encoded image data to a decoder for image decoding. Because the image encoding processing method provided by the embodiment of the disclosure calculates the motion vector and the prediction residual with the standard image as the reference frame and encodes the current image accordingly, the obtained encoded image data is richer.

Description

Image encoding and decoding method, encoding and decoding device, encoder and decoder
Technical Field
The present invention relates to the field of computer technologies, and in particular, to an image encoding and decoding method, an image encoding and decoding apparatus, an image encoder, and an image decoder.
Background
Consecutive video images contain a large amount of redundant information in both the temporal and spatial directions, so video image information can be stored and transmitted efficiently by reducing this redundancy. In the prior art, redundancy in video information is usually eliminated through predictive coding. Predictive coding is widely used in various video codec standards, including MPEG-2, MPEG-4, H.264, H.265/HEVC, and H.266/VVC.
Video, or the images that make up video, often contains large amounts of data related to human prior knowledge. For example, the shapes of characters rendered in a standard font are fixed in advance. Exploiting such prior knowledge can further improve the efficiency of image data compression, but little research has addressed this so far.
In video or image data transmitted over a network, a large number of standard images related to prior knowledge coexist with other image data in a single image. The standard images include characters in standard fonts, popular expressions, and even complete popular video segments. However, these standard image/video data are not handled by a dedicated data compression process; as a result, compression efficiency is low and the quality of the decompressed image/video is poor.
Disclosure of Invention
In view of this, it is necessary to provide an image encoding and decoding method, an encoding and decoding apparatus, an encoder, and a decoder that address the low compression efficiency and poor decompressed image quality caused by the absence of a dedicated data compression method for standard images.
In a first aspect, an embodiment of the present application provides an image encoding processing method, where the method includes:
extracting a current image to be encoded from image data;
identifying a standard image with a preset compression mode from a current image;
calculating a motion vector and a prediction residual by taking the standard image as a reference frame;
and coding the current image according to the motion vector and the prediction residual error to obtain coded image data, and sending the coded image data to a decoder for image decoding, wherein the coded image data carries the number information of the standard image.
In one embodiment, before the identifying the standard image with the preset compression mode from the current image, the method further includes:
and establishing a standard image library comprising a plurality of standard images, wherein each standard image has a corresponding preset compression mode.
In one embodiment, the standard image includes at least one of:
a standard image composed of characters of any standard font, a standard image including expressions, a dynamic standard image composed of animation, and any standard image with independent image elements.
In one embodiment, the number of the standard image is generated by using a message digest algorithm, or the number of the standard image is generated by using a cyclic redundancy check algorithm.
In one embodiment, in the same image frame, there is at least one first coding block capable of supporting a preset coding mode of the standard image, and at least one second coding block capable of supporting any one of an inter-coding mode, an intra-coding mode and an inter-intra-frame joint coding mode.
In a second aspect, an embodiment of the present application provides an image decoding processing method, where the method includes:
acquiring a video code stream comprising encoded image data;
extracting, from the video code stream, the number of a standard image, the prediction residual between a current image and the standard image, and first image data corresponding to a non-standard image area in the current image;
restoring second image data corresponding to the standard image according to the number of the standard image and the prediction residual;
and restoring the processed image embedded with the high-resolution standard image according to the first image data and the second image data.
In a third aspect, an embodiment of the present application provides an image encoding processing apparatus, including:
an acquisition unit configured to extract a current image to be encoded from image data;
the identification unit is used for identifying a standard image with a preset compression mode from the current image acquired by the acquisition unit;
a calculation unit configured to calculate a motion vector and a prediction residual using the standard image identified by the identification unit as a reference frame;
and the encoding unit is used for encoding the current image according to the motion vector and the prediction residual calculated by the calculation unit to obtain encoded image data so as to send the encoded image data to a decoder for image decoding, wherein the encoded image data carries the number information of the standard image.
In a fourth aspect, an embodiment of the present application provides an image decoding processing apparatus, including:
an acquisition unit configured to acquire a video stream including encoded image data;
the extraction unit is used for extracting the serial number of the standard image, the prediction residual between the current image and the standard image and first image data corresponding to a non-standard image area in the current image from the video code stream acquired by the acquisition unit;
a decoding unit configured to restore second image data corresponding to the standard image based on the number of the standard image and the prediction residual extracted by the extraction unit, and to restore the processed image embedded with the high-resolution standard image according to the first image data extracted by the extraction unit and the restored second image data.
In a fifth aspect, an embodiment of the present application provides an encoder including the image encoding processing apparatus described above.
In a sixth aspect, an embodiment of the present application provides a decoder, which includes the image decoding processing apparatus described above.
The technical scheme provided by the embodiment of the application can have the following beneficial effects:
in the embodiment of the application, a current image to be coded is extracted from image data; identifying a standard image with a preset compression mode from a current image; calculating a motion vector and a prediction residual by taking a standard image as a reference frame; and coding the current image according to the motion vector and the prediction residual error to obtain coded image data, and sending the coded image data to a decoder for image decoding. The image coding processing method provided by the embodiment of the present disclosure can use a standard image as a reference frame, calculate a motion vector and a prediction residual, and code a current image according to the motion vector and the prediction residual, so that the obtained coded image data is richer, and in addition, the coded image data carries the number information of the standard image, so that the standard image can be accurately positioned according to the number information.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention.
Fig. 1 is a schematic flowchart of an image encoding processing method provided in an embodiment of the present disclosure;
Fig. 2 is a schematic flowchart of an image decoding processing method provided in an embodiment of the present disclosure;
Fig. 3 is a schematic structural diagram of an image encoding processing apparatus according to an embodiment of the present disclosure;
Fig. 4 is a schematic structural diagram of an image decoding processing apparatus according to an embodiment of the present disclosure;
Fig. 5 is a block diagram of an encoder according to an embodiment of the present disclosure;
Fig. 6 is a block diagram of a decoder according to an embodiment of the present disclosure.
Detailed Description
The following description and the drawings sufficiently illustrate specific embodiments of the invention to enable those skilled in the art to practice them.
It should be understood that the described embodiments are only some embodiments of the invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Alternative embodiments of the present disclosure are described in detail below with reference to the accompanying drawings.
As shown in fig. 1, an embodiment of the present disclosure provides a flowchart of an image encoding processing method, which is applied to a server side, and specifically includes the following method steps:
S102: extracting a current image to be encoded from image data;
In this embodiment, the image data may be obtained from an image library or extracted from a video code stream; the manner of obtaining the image data is not particularly limited.
S104: identifying a standard image with a preset compression mode from a current image;
In an embodiment of the application, the standard image comprises at least one of:
a standard image composed of characters of any standard font, a standard image including expressions, a dynamic standard image composed of animation, and any standard image with independent image elements.
The above lists only common standard images, and other forms of standard images may also be introduced according to different application scenarios, and the standard images are not particularly limited herein.
S106: calculating a motion vector and a prediction residual by taking a standard image as a reference frame;
In the embodiment of the present application, the standard image in the standard image database that is closest to the identified target image is used as the reference image; the target image is compared with this standard image, and a motion vector and a residual value are calculated. This process can still use block-based prediction, measuring the difference between the predicted value and the actual value by the absolute difference or the mean absolute difference. For example, if the pixel value of the target image at coordinates (x1, y1) is a(x1, y1), and the pixel value of the standard image at the corresponding position (x2, y2) is b(x2, y2), then the prediction residual is c(x1, y1) = a(x1, y1) - b(x2, y2), and the motion vector is (x1 - x2, y1 - y2). In contrast to H.264, H.265/HEVC, and H.266/VVC, the reference picture is no longer an I-frame but an image in the standard image database. The subsequent transformation, quantization, and entropy coding can follow the prior art and are not described here again.
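The block-based prediction described above can be illustrated with a minimal sketch. It assumes grayscale images stored as lists of rows, an exhaustive search over the whole standard image, and the sum of absolute differences as the matching cost; the function names `sad`, `crop`, and `predict_from_standard` are hypothetical and are not taken from the application.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(abs(p - q)
               for row_a, row_b in zip(block_a, block_b)
               for p, q in zip(row_a, row_b))


def crop(image, x, y, w, h):
    """Return the w-by-h block of `image` whose top-left corner is (x, y)."""
    return [row[x:x + w] for row in image[y:y + h]]


def predict_from_standard(target, standard, x1, y1, w, h):
    """Predict the target block at (x1, y1) from the standard image.

    Every block position (x2, y2) in the standard image is tried and the
    one with the smallest SAD is kept. Returns the motion vector
    (x1 - x2, y1 - y2) and the residual c = a - b for each pixel.
    """
    block = crop(target, x1, y1, w, h)
    best = None  # (cost, x2, y2) of the best candidate so far
    for y2 in range(len(standard) - h + 1):
        for x2 in range(len(standard[0]) - w + 1):
            cost = sad(block, crop(standard, x2, y2, w, h))
            if best is None or cost < best[0]:
                best = (cost, x2, y2)
    _, x2, y2 = best
    reference = crop(standard, x2, y2, w, h)
    residual = [[p - q for p, q in zip(row_a, row_b)]
                for row_a, row_b in zip(block, reference)]
    return (x1 - x2, y1 - y2), residual
```

In a real encoder the search would normally be restricted to a window and the residual would then go through transformation, quantization, and entropy coding.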
S108: coding the current image according to the motion vector and the prediction residual error to obtain coded image data, and sending the coded image data to a decoder for image decoding, wherein the coded image data carries the number information of the standard image; because the coded image data carries the number information of the standard image, the standard image can be accurately positioned according to the number information.
In the image coding processing method provided by the embodiment of the application, the reference image is stored in both the encoder and the decoder and does not need to be transmitted as part of the video code stream. The encoder therefore needs to indicate to the decoder, in some way, the number of the standard image it used.
In the embodiment of the application, the number of the standard image is generated by using a message digest algorithm, or the number of the standard image is generated by using a cyclic redundancy check algorithm, and the number of the standard image is used as a method for uniquely identifying the standard image. The algorithm based on the message digest and the cyclic redundancy check algorithm are both conventional algorithms, and are not described herein again.
The above lists two common generation algorithms used for numbering the standard images, and other algorithms for generating numbering the standard images can also be introduced according to the requirements of different application scenes, and are not described herein again.
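As a minimal sketch of the two numbering options, the following uses Python's standard `hashlib` and `zlib` modules. The choice of SHA-256 as the message digest algorithm and CRC-32 as the cyclic redundancy check is an assumption made for illustration; the application does not name specific algorithms, and the function names are hypothetical.

```python
import hashlib
import zlib


def digest_number(image_bytes: bytes) -> str:
    """Number a standard image by a message digest (SHA-256 assumed here)."""
    return hashlib.sha256(image_bytes).hexdigest()


def crc_number(image_bytes: bytes) -> str:
    """Number a standard image by a cyclic redundancy check (CRC-32 assumed)."""
    return format(zlib.crc32(image_bytes) & 0xFFFFFFFF, "08x")
```

A CRC number is much shorter than a digest, so it costs fewer bits in the code stream, but collisions between different standard images become more likely as the library grows.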
All or part of the number of the standard image is coded into the video code stream; after receiving it, the decoding end uses it to uniquely determine one standard image in the standard image database.
In the embodiment of the present application, in order to improve compression efficiency, if a part of the number of the standard image can also uniquely identify a certain standard image in the standard image database, only a part of the number of the standard image is transmitted.
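One way to transmit only part of a number is to send its shortest prefix that no other number in the library shares. The sketch below assumes numbers are strings and that "part of the number" means a leading prefix; both are illustrative assumptions, and `shortest_unique_prefix` and `resolve_prefix` are hypothetical names.

```python
def shortest_unique_prefix(number, library_numbers):
    """Encoder side: shortest prefix of `number` shared by no other number."""
    others = [n for n in library_numbers if n != number]
    for length in range(1, len(number) + 1):
        prefix = number[:length]
        if not any(other.startswith(prefix) for other in others):
            return prefix
    return number  # fall back to the full number


def resolve_prefix(prefix, library_numbers):
    """Decoder side: map a received prefix back to exactly one number."""
    matches = [n for n in library_numbers if n.startswith(prefix)]
    if len(matches) != 1:
        raise LookupError("prefix does not identify exactly one standard image")
    return matches[0]
```

This only works if the encoder and decoder hold the same set of numbers; otherwise a prefix that is unique at the encoder may be ambiguous at the decoder.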
In the embodiment of the application, in the same image frame, at least one first coding block can support a preset coding mode of a standard image, and at least one second coding block can support any one coding mode of an interframe coding mode, an intraframe coding mode and an interframe and intraframe joint coding mode; therefore, the finally obtained encoded image data can be effectively ensured to be richer, and the method is favorable for effectively decoding the standard image in the encoded image data.
In the embodiment of the present application, the image regions unrelated to the standard image may be coded by inter-frame, intra-frame, or joint inter-intra coding according to the methods specified in existing standards such as H.264, H.265/HEVC, and H.266/VVC, to generate prediction residual data; the prediction residual data is converted into a video code stream after processing such as transformation, quantization, and entropy coding.
In the embodiment of the present application, the standard image constitutes one image frame together with other image information. When image data is transmitted, image frames are generally rectangular, and in order to save transmission bandwidth, image data is often compressed in advance. In consideration of the fact that the standard image may exist at both the transmitting end and the receiving end, in order to further save transmission bandwidth, when transmitting image data, it is not necessary to transmit the complete standard image data, but only image data not available at the receiving end is required to be transmitted.
In a possible implementation manner, before the image is encoded, the decoding-end device may further signal to the encoding end some or all of the formats of the standard image database that it supports; this can effectively improve the efficiency of the image encoding process.
In one possible implementation, before encoding an image frame, an object detection function of an object detection unit is activated, which determines whether the image frame contains an image of interest using an object detection algorithm.
In the image coding processing method provided in the embodiment of the present application, the image of interest is an image that is highly matched with at least one standard image in a standard image database. The standard images in the standard image database may both exist in the encoder and the decoder.
In the embodiment of the present application, the object detection unit may employ various target detection algorithms, for example, Fast R-CNN, Faster R-CNN, Mask R-CNN, R-FCN, FPN, and the like. These are all conventional target detection algorithms and are not described here again.
In block-based video coding, an image frame to be processed is divided into several basic coding units. In this case, the target detection process may be performed within a coding block. If no standard image of interest is detected in the coding block, the block is processed in a conventional video coding manner, for example as specified by standards such as H.264, H.265/HEVC, and H.266/VVC. If the block contains at least one object whose image data is sufficiently similar to at least one standard image in the standard image database, the image coding method provided by the embodiment of the application is used for encoding.
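The per-block decision described above can be sketched as follows. This is a simplified illustration that replaces the target detection algorithm with a direct SAD comparison against every library entry, and assumes each library entry has the same size as the coding block; `choose_block_mode`, the mode labels, and the `max_sad` threshold are hypothetical.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(abs(p - q)
               for row_a, row_b in zip(block_a, block_b)
               for p, q in zip(row_a, row_b))


def choose_block_mode(block, library, max_sad):
    """Pick the coding mode for one coding block.

    `library` maps standard image numbers to block-sized images.
    Returns ("standard", number) when the closest library entry is within
    `max_sad`, otherwise ("conventional", None) so the block is coded in a
    conventional inter/intra manner.
    """
    best_number, best_cost = None, None
    for number, standard in library.items():
        cost = sad(block, standard)
        if best_cost is None or cost < best_cost:
            best_number, best_cost = number, cost
    if best_cost is not None and best_cost <= max_sad:
        return ("standard", best_number)
    return ("conventional", None)
```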
Whether the object image data is sufficiently similar to at least one standard image in the standard image database may be measured by SSD, SAD, MSE, or PSNR; the two are considered sufficiently similar as long as the chosen metric satisfies a certain threshold (for SSD, SAD, and MSE a smaller value indicates greater similarity; for PSNR a larger value does). These measures are conventional and are not described here again.
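For reference, the four measures can be written out as a short sketch, assuming 8-bit grayscale images stored as lists of rows. The function `is_sufficiently_similar` and its `min_psnr` threshold are hypothetical; the application does not specify threshold values.

```python
import math


def ssd(a, b):
    """Sum of squared differences."""
    return sum((p - q) ** 2 for ra, rb in zip(a, b) for p, q in zip(ra, rb))


def sad(a, b):
    """Sum of absolute differences."""
    return sum(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb))


def mse(a, b):
    """Mean squared error: SSD divided by the number of pixels."""
    return ssd(a, b) / (len(a) * len(a[0]))


def psnr(a, b, peak=255):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    error = mse(a, b)
    if error == 0:
        return math.inf
    return 10 * math.log10(peak ** 2 / error)


def is_sufficiently_similar(a, b, min_psnr):
    """Treat two images as similar when their PSNR reaches `min_psnr`."""
    return psnr(a, b) >= min_psnr
```

SSD, SAD, and MSE are distances, so a threshold on them is an upper bound, while a PSNR threshold is a lower bound.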
When the size of the object identified by the target detection algorithm is inconsistent with the size of at least one standard image in the standard image database, the standard image may also need to be scaled until the sizes of the two are consistent, and then the two are compared.
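A minimal sketch of the scaling step, assuming nearest-neighbour resampling on an image stored as a list of rows. The application does not specify the interpolation method, so this choice and the name `resize_nearest` are illustrative only.

```python
def resize_nearest(image, new_w, new_h):
    """Scale `image` to new_w by new_h pixels by nearest-neighbour sampling."""
    old_h, old_w = len(image), len(image[0])
    return [[image[y * old_h // new_h][x * old_w // new_w]
             for x in range(new_w)]
            for y in range(new_h)]
```

Once the standard image has been scaled to the size of the detected object, the two can be compared pixel by pixel with any of the similarity measures mentioned above.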
The image encoding processing method provided in the embodiment of the present application can encode not only images from an image library but also images extracted from a video.
In the embodiment of the present disclosure, a current image to be encoded is extracted from image data; identifying a standard image with a preset compression mode from a current image; calculating a motion vector and a prediction residual by taking a standard image as a reference frame; and coding the current image according to the motion vector and the prediction residual error to obtain coded image data, and sending the coded image data to a decoder for image decoding. The image coding processing method provided by the embodiment of the disclosure can calculate the motion vector and the prediction residual by using the standard image as the reference frame, and code the current image according to the motion vector and the prediction residual, so that the obtained coded image data is richer.
As shown in fig. 2, an embodiment of the present disclosure provides a flowchart of an image decoding processing method, which is applied to a server side, and specifically includes the following method steps:
S202: acquiring a video code stream comprising encoded image data;
In the embodiment of the application, the video code stream at least comprises the number of the standard image, the prediction residual between the current image and the standard image, and the first image data corresponding to the non-standard image area in the current image.
S204: and extracting the number of the standard image, the prediction residual between the current image and the standard image and first image data corresponding to a non-standard image area in the current image from the video code stream.
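The fields carried by the code stream can be pictured with a toy container. This sketch uses JSON purely for readability; a real video code stream would be entropy-coded binary data, and the class `CodedFrame`, its field names, and the inclusion of a motion vector field are assumptions made for illustration.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class CodedFrame:
    standard_number: str      # number (or partial number) of the standard image
    motion_vector: tuple      # (dx, dy) relative to the standard image
    residual: list            # prediction residual for the standard image area
    first_image_data: list    # data for the non-standard image area


def pack(frame: CodedFrame) -> bytes:
    """Serialize the fields into a byte payload (toy stand-in for the stream)."""
    return json.dumps(asdict(frame)).encode("utf-8")


def unpack(payload: bytes) -> CodedFrame:
    """Extract the fields again on the decoding side."""
    fields = json.loads(payload.decode("utf-8"))
    fields["motion_vector"] = tuple(fields["motion_vector"])
    return CodedFrame(**fields)
```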
In an embodiment of the application, the standard image comprises at least one of:
a standard image composed of characters of any standard font, a standard image including expressions, a dynamic standard image composed of animation, and any standard image with independent image elements.
The above lists only common standard images, and other forms of standard images may also be introduced according to different application scenarios, and the standard images are not particularly limited herein.
In the embodiment of the present application, the number of the standard image is generated by using a message digest algorithm, or,
the number of the standard image is generated by adopting a cyclic redundancy check algorithm.
The above lists two common generation algorithms used for numbering the standard images, and other algorithms for generating numbering the standard images can also be introduced according to the requirements of different application scenes, and are not described herein again.
S206: restoring second image data corresponding to the standard image according to the number of the standard image and the prediction residual; thus, the image decoding processing method provided by the embodiment of the application can recover the standard image with high resolution.
In the embodiment of the application, at least one first decoding block is arranged in the same image frame, and the first decoding block can support a preset decoding mode of a standard image.
S208: restoring the processed image embedded with the high-resolution standard image according to the first image data and the second image data; in this way, the finally restored processed image includes not only the embedded high-resolution standard image but also a high-resolution image restored from the first image data of the non-standard image region.
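Combining the two kinds of image data can be sketched as pasting the restored standard-image region into the frame rebuilt from the first image data. The position parameters `x` and `y` and the name `embed_region` are hypothetical; the application does not state how the region position is conveyed.

```python
def embed_region(first_image_data, second_image_data, x, y):
    """Return a copy of the frame with the restored region pasted at (x, y)."""
    frame = [row[:] for row in first_image_data]  # leave the input untouched
    for j, row in enumerate(second_image_data):
        frame[y + j][x:x + len(row)] = row
    return frame
```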
In the embodiment of the application, after the decoding end receives all or part of the standard image number from the video code stream, one standard image in the standard image database is uniquely determined. The standard image residual is obtained after entropy decoding, inverse quantization, inverse transformation, and other processing. These processing methods are all conventional and are not described here again.
After the standard image residual is acquired, the uniquely determined standard image in the standard image database is used as the reference image to restore the image of the area related to the standard image. Similarly, the data of the other regions, obtained by entropy decoding, inverse quantization, inverse transformation, and other processing, can be reconstructed in the manner specified by standards such as H.264, H.265/HEVC, and H.266/VVC.
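The restoration of the standard-image area can be sketched as the inverse of the encoder-side prediction: look up the standard image by its number, locate the reference block through the motion vector, and add the residual. The sketch assumes the residual has already been entropy-decoded, dequantized, and inverse-transformed, and that the motion vector is defined as (x1 - x2, y1 - y2); `reconstruct_standard_region` is a hypothetical name.

```python
def reconstruct_standard_region(library, number, motion_vector, residual, x1, y1):
    """Rebuild the block at (x1, y1) from the standard image and the residual.

    `library` maps standard image numbers to images stored as lists of rows.
    """
    standard = library[number]
    dx, dy = motion_vector
    x2, y2 = x1 - dx, y1 - dy  # position of the reference block
    return [[standard[y2 + j][x2 + i] + value
             for i, value in enumerate(row)]
            for j, row in enumerate(residual)]
```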
In the embodiment of the present application, in the same image frame, there is at least one second decoding block, and the second decoding block can support any one of an inter-frame decoding method, an intra-frame decoding method, and an inter-frame and intra-frame joint decoding method.
In one possible implementation, before extracting, from the video code stream, the number of the standard image, the prediction residual between the current image and the standard image, and the first image data corresponding to the non-standard image region in the current image, the method further includes:
and establishing a standard image library comprising a plurality of standard images, wherein each standard image has a corresponding preset compression mode.
In a possible implementation manner, in order to effectively improve the image decoding processing efficiency, in a real-time video communication environment, the decoding-end device may further send, by signaling, all or a part of the standard images in the standard image database that the decoding-end device can support, to the encoding end.
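The effect of this signaling can be sketched as the encoder restricting itself to the standard images the decoder has reported. The sketch assumes the decoder reports a list of standard image numbers; the message format and the name `usable_standard_images` are hypothetical.

```python
def usable_standard_images(encoder_library, decoder_supported_numbers):
    """Keep only the encoder-side standard images the decoder also supports."""
    supported = set(decoder_supported_numbers)
    return {number: image
            for number, image in encoder_library.items()
            if number in supported}
```

Any coding block whose best match falls outside this subset would have to be coded in a conventional manner, since the decoder could not resolve the number.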
The following is an embodiment of an image encoding processing apparatus, which can be used to execute embodiments of an image encoding processing method according to the present disclosure. For details not disclosed in the embodiments of the image coding processing apparatus in the embodiments of the present disclosure, please refer to the embodiments of the image coding processing method in the embodiments of the present disclosure.
Referring to Fig. 3, a schematic structural diagram of an image encoding processing apparatus according to an exemplary embodiment of the present invention is shown. The image encoding processing apparatus 30 may be implemented as all or part of a terminal by software, hardware, or a combination of both. The image encoding processing apparatus 30 includes an acquisition unit 302, an identification unit 304, a calculation unit 306, and an encoding unit 308;
specifically, the acquiring unit 302 is configured to extract a current image to be encoded from image data;
an identifying unit 304, configured to identify a standard image having a preset compression mode from the current image acquired by the acquiring unit 302;
a calculation unit 306 for calculating a motion vector and a prediction residual with the standard image identified by the identification unit 304 as a reference frame;
and an encoding unit 308, configured to encode the current image according to the motion vector and the prediction residual calculated by the calculating unit 306, to obtain encoded image data, so as to send the encoded image data to a decoder for image decoding, where the encoded image data carries number information of a standard image.
Optionally, the apparatus further comprises:
a standard image library establishing unit (not shown in fig. 3) configured to establish a standard image library including a plurality of standard images, each standard image having a corresponding preset compression manner, before the identifying unit 304 identifies the standard image having the preset compression manner from the current image.
Optionally, the standard image comprises at least one of:
a standard image composed of characters of any standard font, a standard image including expressions, a dynamic standard image composed of animation, and any standard image with independent image elements.
Optionally, the number of the standard image is generated by using a message digest algorithm, or the number of the standard image is generated by using a cyclic redundancy check algorithm.
Optionally, in the same image frame, there is at least one first coding block, where the first coding block can support a preset coding mode of a standard image, and there is at least one second coding block, where the second coding block can support any one of an inter-frame coding mode, an intra-frame coding mode, and an inter-frame intra-frame joint coding mode.
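How first and second coding blocks might coexist in one frame can be sketched as a per-block mode map. The fixed block size, the rectangle format `(x, y, w, h)` for detected standard-image regions, the overlap rule, and the names `CodingMode` and `assign_modes` are all illustrative assumptions rather than details taken from the embodiment.

```python
from enum import Enum


class CodingMode(Enum):
    STANDARD_IMAGE = "standard_image"        # first coding block
    INTER = "inter"                          # second coding block options
    INTRA = "intra"
    INTER_INTRA_JOINT = "inter_intra_joint"


def assign_modes(frame_w, frame_h, block, standard_regions,
                 fallback=CodingMode.INTRA):
    """Map each block's top-left (x, y) to a coding mode.

    A block overlapping any standard-image rectangle is treated as a first
    coding block; every other block gets the fallback conventional mode.
    """
    modes = {}
    for y in range(0, frame_h, block):
        for x in range(0, frame_w, block):
            overlaps = any(
                x < rx + rw and rx < x + block and
                y < ry + rh and ry < y + block
                for rx, ry, rw, rh in standard_regions
            )
            modes[(x, y)] = CodingMode.STANDARD_IMAGE if overlaps else fallback
    return modes


# A 32x16 frame split into two 16x16 blocks; the left block holds a standard image.
modes = assign_modes(32, 16, 16, [(0, 0, 16, 16)])
print(modes[(0, 0)])   # CodingMode.STANDARD_IMAGE
print(modes[(16, 0)])  # CodingMode.INTRA
```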
It should be noted that the division into the functional units described above is only an example of how the image encoding processing apparatus of the foregoing embodiment performs the image encoding processing method. In practical applications, the functions may be allocated to different functional units as needed; that is, the internal structure of the apparatus may be divided into different functional units to perform all or part of the functions described above. In addition, the image encoding processing apparatus and the image encoding processing method provided by the above embodiments belong to the same concept; for details of the implementation, refer to the embodiment of the image encoding processing method, which are not repeated here.
In the embodiment of the present disclosure, the acquisition unit extracts a current image to be encoded from image data; the identification unit identifies a standard image having a preset compression mode from the current image acquired by the acquisition unit; the calculation unit calculates a motion vector and a prediction residual using the standard image identified by the identification unit as a reference frame; and the encoding unit encodes the current image according to the motion vector and the prediction residual calculated by the calculation unit to obtain encoded image data, and sends the encoded image data to a decoder for image decoding. Because the image encoding processing apparatus provided by the embodiment of the present disclosure calculates the motion vector and the prediction residual using a standard image as a reference frame and encodes the current image according to them, the resulting encoded image data is richer.
Fig. 4 is a schematic structural diagram of an image decoding processing apparatus according to an embodiment of the present disclosure. The image decoding processing apparatus 40 may be implemented as all or part of a terminal in software, hardware, or a combination of both. The image decoding processing apparatus 40 includes an acquisition unit 402, an extraction unit 404, and a decoding unit 406. Specifically:
the acquisition unit 402 is configured to acquire a video code stream including encoded image data;
the extraction unit 404 is configured to extract, from the video code stream acquired by the acquisition unit 402, the number of a standard image, a prediction residual between a current image and the standard image, and first image data corresponding to a non-standard image region in the current image; and
the decoding unit 406 is configured to restore second image data corresponding to the standard image according to the number of the standard image and the prediction residual extracted by the extraction unit 404, and to restore the processed image embedded with the high-resolution standard image according to the second image data and the first image data extracted by the extraction unit 404.
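The decoding path can be sketched as follows, under several assumptions: the standard image library is a dictionary keyed by number, the prediction residual has the same dimensions as the standard image, and the placement of the restored region (`position`) is available to the decoder. The embodiment does not say how the placement is signalled, and the function name `restore_image` is hypothetical.

```python
def restore_image(library, number, residual, first_image, position):
    """Rebuild the standard-image region and embed it in the decoded background.

    library:     dict mapping a standard image number to its samples
    residual:    prediction residual between the current and standard image
    first_image: decoded samples of the non-standard image region
    position:    (top, left) placement of the standard image (assumed known)
    """
    standard = library[number]
    # Second image data = standard image + prediction residual.
    second = [
        [s + r for s, r in zip(std_row, res_row)]
        for std_row, res_row in zip(standard, residual)
    ]
    top, left = position
    restored = [row[:] for row in first_image]  # leave the input untouched
    for y, row in enumerate(second):
        restored[top + y][left:left + len(row)] = row
    return restored


# Illustrative data only.
library = {7: [[9, 8], [7, 6]]}
background = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
out = restore_image(library, 7, [[0, 0], [0, 1]], background, (0, 1))
print(out)  # [[1, 9, 8], [1, 7, 7], [1, 1, 1]]
```

Because the decoder holds the same library as the encoder, the standard image itself never has to be transmitted; only its number and the residual do.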
Optionally, the apparatus further comprises:
a standard image library establishing unit (not shown in fig. 4), configured to establish a standard image library including multiple standard images before the extracting unit 404 extracts, from the video code stream, the serial number of the standard image, the prediction residual between the current image and the standard image, and the first image data corresponding to the non-standard image region in the current image, where each standard image has a corresponding preset compression mode.
Optionally, the standard image comprises at least one of:
a standard image composed of characters of any standard font, a standard image including expressions, a dynamic standard image composed of animation, and any standard image with independent image elements.
Optionally, the number of the standard image is generated by using a message digest algorithm, or the number of the standard image is generated by using a cyclic redundancy check algorithm.
Optionally, in the same image frame, there is at least one first decoding block capable of supporting a preset decoding mode of a standard image, and there is at least one second decoding block capable of supporting any one of an inter-frame decoding mode, an intra-frame decoding mode, and an inter-frame and intra-frame joint decoding mode.
It should be noted that the division into the functional units described above is only an example of how the image decoding processing apparatus of the foregoing embodiment performs the image decoding processing method. In practical applications, the functions may be allocated to different functional units as needed; that is, the internal structure of the apparatus may be divided into different functional units to perform all or part of the functions described above. In addition, the image decoding processing apparatus and the image decoding processing method provided by the above embodiments belong to the same concept; for details of the implementation, refer to the embodiment of the image decoding processing method, which are not repeated here.
In the embodiment of the present disclosure, the decoding unit is configured to restore the second image data corresponding to the standard image according to the number and the prediction residual of the standard image extracted by the extracting unit; and restoring the processed image embedded with the high-resolution standard image according to the first image data and the second image data extracted by the extraction unit. With the image decoding processing apparatus according to the embodiment of the present disclosure, the finally restored processed image includes not only the embedded high-resolution standard image but also a high-resolution image restored from the first image data of the non-standard image region.
Fig. 5 is a block diagram showing the structure of an encoder according to an embodiment of the present invention, and as shown in fig. 5, the encoder 50 includes the image encoding processing device 30 of any one of the above.
Fig. 6 is a block diagram showing a configuration of a decoder according to an embodiment of the present invention, and as shown in fig. 6, the decoder 60 includes the image decoding processing apparatus 40 according to any one of the above.
In an embodiment of the present invention, an electronic device is further provided, and the electronic device includes an encoding processing device (such as the encoder 50 described above) and/or a decoding processing device (such as the decoder 60 described above).
The encoding processing device may process an HDR video (i.e., an input original HDR video), for example by the encoding processing method described above, and generate an HDR video stream, or a transport stream or media file containing the HDR video stream.
The decoding processing device may process the HDR video stream, or a transport stream or media file containing the HDR video stream, for example by the decoding processing method described above, to generate a restored HDR video suitable for playback.
The electronic device in this embodiment may be a code stream generating device or a receiving and playing device in a video communication application, for example, a mobile phone, a computer, a server, a set top box, a portable mobile terminal, a digital camera, a television broadcasting system device, and the like.
It should be noted that the above units may be implemented in software or hardware. In the latter case, this may be achieved in, but is not limited to, the following ways: the units are all located in the same processor; or the units are located in a plurality of processors, respectively.
The embodiment of the invention also provides a storage medium. Optionally, in this embodiment, the storage medium may be configured to store program code for performing the following steps:
extracting a current image to be encoded from image data;
identifying a standard image with a preset compression mode from a current image;
calculating a motion vector and a prediction residual by taking a standard image as a reference frame;
and coding the current image according to the motion vector and the prediction residual error to obtain coded image data, and sending the coded image data to a decoder for image decoding, wherein the coded image data carries the number information of the standard image.
Optionally, in this embodiment, the storage medium may include, but is not limited to, various media capable of storing program code, such as a USB flash drive, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk.
Optionally, in this embodiment, the processor executes the steps in the above method embodiments according to the program code stored in the storage medium.
Optionally, the specific examples in this embodiment may refer to the examples described in the above embodiments and optional implementation manners, and this embodiment is not described herein again.
It will be apparent to those skilled in the art that the units or steps of the invention described above may be implemented with a general-purpose computing device, either centralized on a single computing device or distributed across a network of computing devices. Alternatively, they may be implemented as program code executable by a computing device, so that the code may be stored in a memory device and executed by the computing device; in some cases, the steps shown or described may be performed in an order different from that described herein. They may also be fabricated separately as individual integrated circuit units, or several of the units or steps may be fabricated as a single integrated circuit unit. Thus, the present invention is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. An image encoding processing method, characterized in that the method comprises:
extracting a current image to be encoded from image data;
identifying a standard image with a preset compression mode from a current image;
calculating a motion vector and a prediction residual by taking the standard image as a reference frame;
and coding the current image according to the motion vector and the prediction residual error to obtain coded image data, and sending the coded image data to a decoder for image decoding, wherein the coded image data carries the number information of the standard image.
2. The method according to claim 1, wherein before said identifying the standard image having the preset compression mode from the current image, the method further comprises:
and establishing a standard image library comprising a plurality of standard images, wherein each standard image has a corresponding preset compression mode.
3. The method of claim 1, wherein the standard image comprises at least one of:
a standard image composed of characters of any standard font, a standard image including expressions, a dynamic standard image composed of animation, and any standard image with independent image elements.
4. The method of claim 1, characterized in that
the number of the standard image is generated using a message digest algorithm; or
the number of the standard image is generated using a cyclic redundancy check algorithm.
5. The method of claim 1, characterized in that
in the same image frame, there is at least one first coding block that supports a preset coding mode of the standard image, and there is at least one second coding block that supports any one of an inter-frame coding mode, an intra-frame coding mode, and an inter-frame and intra-frame joint coding mode.
6. An image decoding processing method, characterized by comprising:
acquiring a video code stream comprising encoded image data;
extracting the serial number of a standard image, the prediction residual between a current image and the standard image and first image data corresponding to a non-standard image area in the current image from the video code stream;
restoring second image data corresponding to the standard image according to the serial number of the standard image and the prediction residual error;
and restoring the processed image embedded with the high-resolution standard image according to the first image data and the second image data.
7. An image encoding processing apparatus, characterized in that the apparatus comprises:
an acquisition unit configured to extract a current image to be encoded from image data;
the identification unit is used for identifying a standard image with a preset compression mode from the current image acquired by the acquisition unit;
a calculation unit configured to calculate a motion vector and a prediction residual using the standard image identified by the identification unit as a reference frame;
and the encoding unit is used for encoding the current image according to the motion vector and the prediction residual calculated by the calculation unit to obtain encoded image data so as to send the encoded image data to a decoder for image decoding, wherein the encoded image data carries the number information of the standard image.
8. An image decoding processing apparatus, characterized in that the apparatus comprises:
an acquisition unit configured to acquire a video stream including encoded image data;
the extraction unit is used for extracting the serial number of the standard image, the prediction residual between the current image and the standard image and first image data corresponding to a non-standard image area in the current image from the video code stream acquired by the acquisition unit;
a decoding unit configured to restore second image data corresponding to the standard image according to the number of the standard image and the prediction residual extracted by the extraction unit, and
to restore the processed image embedded with the high-resolution standard image according to the second image data and the first image data extracted by the extraction unit.
9. An encoder, characterized in that it comprises the apparatus of claim 7.
10. A decoder, characterized in that it comprises the apparatus of claim 8.
CN202011359566.5A 2020-11-27 2020-11-27 Image encoding and decoding method, encoding and decoding device, encoder and decoder Active CN112218087B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202011359566.5A CN112218087B (en) 2020-11-27 2020-11-27 Image encoding and decoding method, encoding and decoding device, encoder and decoder
PCT/CN2021/109724 WO2022110870A1 (en) 2020-11-27 2021-07-30 Image encoding and decoding method, encoding and decoding apparatus, encoder, and decoder

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011359566.5A CN112218087B (en) 2020-11-27 2020-11-27 Image encoding and decoding method, encoding and decoding device, encoder and decoder

Publications (2)

Publication Number Publication Date
CN112218087A true CN112218087A (en) 2021-01-12
CN112218087B CN112218087B (en) 2021-05-04

Family

ID=74067955

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011359566.5A Active CN112218087B (en) 2020-11-27 2020-11-27 Image encoding and decoding method, encoding and decoding device, encoder and decoder

Country Status (2)

Country Link
CN (1) CN112218087B (en)
WO (1) WO2022110870A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022110870A1 (en) * 2020-11-27 2022-06-02 浙江智慧视频安防创新中心有限公司 Image encoding and decoding method, encoding and decoding apparatus, encoder, and decoder

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070118365A1 (en) * 2003-03-04 2007-05-24 Chu Wai C Methods and apparatuses for variable dimension vector quantization
CN102831599A (en) * 2012-07-17 2012-12-19 南方医科大学 Registering method of medical image with uneven light and shade
CN105163122A (en) * 2015-08-27 2015-12-16 西安空间无线电技术研究所 Image compression and decompression method based on similarity of image blocks
CN105474644A (en) * 2013-06-17 2016-04-06 古如罗技微系统公司 Processing and reproduction of frames

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102238390B (en) * 2011-08-05 2013-03-20 中国科学院深圳先进技术研究院 Image-library-based video and image coding and decoding method and system
CN103475882B (en) * 2013-09-13 2017-02-15 北京大学 Surveillance video encoding and recognizing method and surveillance video encoding and recognizing system
CN103957341B (en) * 2014-04-30 2016-09-28 华为软件技术有限公司 The method of picture transfer and relevant device thereof
US11134253B2 (en) * 2017-02-02 2021-09-28 Hewlett-Packard Development Company, L.P. Video compression
CN109547786B (en) * 2017-09-22 2023-05-09 阿里巴巴集团控股有限公司 Video encoding and video decoding methods and devices
CN111612796A (en) * 2020-04-27 2020-09-01 西安万像电子科技有限公司 Image processing method and device
CN112218087B (en) * 2020-11-27 2021-05-04 浙江智慧视频安防创新中心有限公司 Image encoding and decoding method, encoding and decoding device, encoder and decoder

Also Published As

Publication number Publication date
CN112218087B (en) 2021-05-04
WO2022110870A1 (en) 2022-06-02

Similar Documents

Publication Publication Date Title
CN111010495B (en) Video denoising processing method and device
US9414086B2 (en) Partial frame utilization in video codecs
RU2694442C1 (en) Image decoding device and image decoding method
WO2006073116A1 (en) Video encoding method and device, video decoding method and device, program thereof, and recording medium containing the program
CN112533059B (en) Image rendering method and device, electronic equipment and storage medium
JP2007180808A (en) Video image encoding device, video image decoding device, and video image encoding method
CN109640089B (en) Image coding and decoding method and device
US8594189B1 (en) Apparatus and method for coding video using consistent regions and resolution scaling
JP2009507412A (en) Method and apparatus for video error resilience encoding / decoding
CN109862365B (en) Image data processing method and device
JP2013229674A (en) Image coding device, image decoding device, image coding method, image decoding method, image coding program, and image decoding program
US8594197B2 (en) System and method for relative storage of video data
CN112218087B (en) Image encoding and decoding method, encoding and decoding device, encoder and decoder
CN108401185B (en) Reference frame selection method, video transcoding method, electronic device and storage medium
CN111669600B (en) Video coding method, device, coder and storage device
WO2024078066A1 (en) Video decoding method and apparatus, video encoding method and apparatus, storage medium, and device
CN103581688A (en) Video image coding and decoding method and device
JP5938424B2 (en) Method for reconstructing and encoding image blocks
WO2016179261A1 (en) Methods and apparatus for optical blur modeling for improved video encoding
CN111212288B (en) Video data encoding and decoding method and device, computer equipment and storage medium
CN112672164B (en) Video compression system and method, and video decompression system and method
KR20230062862A (en) Image processing method and apparatus, device, and computer readable storage medium
CN113852816A (en) Video frame processing method and device, computer equipment and readable storage medium
CN111970508A (en) Video encoding method, video encoding device, computer equipment and storage medium
CN111491166A (en) Dynamic compression system and method based on content analysis

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20210112

Assignee: Institute of Information Technology, Zhejiang Peking University

Assignor: Zhejiang smart video security Innovation Center Co.,Ltd.

Contract record no.: X2021330000326

Denomination of invention: Image encoding and decoding method, encoding and decoding device, encoder and decoder

Granted publication date: 20210504

License type: Common License

Record date: 20210927

EE01 Entry into force of recordation of patent licensing contract
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20210112

Assignee: Hangzhou Weiming Information Technology Co.,Ltd.

Assignor: Zhejiang smart video security Innovation Center Co.,Ltd.

Contract record no.: X2021330000536

Denomination of invention: Image encoding and decoding method, encoding and decoding device, encoder and decoder

Granted publication date: 20210504

License type: Common License

Record date: 20211021

EE01 Entry into force of recordation of patent licensing contract
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20210112

Assignee: HANGZHOU SHIHUI TECHNOLOGY Co.,Ltd.

Assignor: Zhejiang smart video security Innovation Center Co.,Ltd.

Contract record no.: X2022330000061

Denomination of invention: Image encoding and decoding method, encoding and decoding device, encoder and decoder

Granted publication date: 20210504

License type: Common License

Record date: 20220325

EE01 Entry into force of recordation of patent licensing contract
EC01 Cancellation of recordation of patent licensing contract

Assignee: HANGZHOU SHIHUI TECHNOLOGY Co.,Ltd.

Assignor: Zhejiang smart video security Innovation Center Co.,Ltd.

Contract record no.: X2022330000061

Date of cancellation: 20220707

EC01 Cancellation of recordation of patent licensing contract
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20210112

Assignee: HANGZHOU SHIHUI TECHNOLOGY Co.,Ltd.

Assignor: Zhejiang smart video security Innovation Center Co.,Ltd.

Contract record no.: X2022330000278

Denomination of invention: Image encoding and decoding method, encoding and decoding device, encoder and decoder

Granted publication date: 20210504

License type: Common License

Record date: 20220707

EE01 Entry into force of recordation of patent licensing contract
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20210112

Assignee: Hangzhou Lezhi Weiming Technology Co.,Ltd.

Assignor: Zhejiang smart video security Innovation Center Co.,Ltd.

Contract record no.: X2022330000334

Denomination of invention: Image coding and decoding method, coding and decoding device, encoder and decoder

Granted publication date: 20210504

License type: Common License

Record date: 20220726

EE01 Entry into force of recordation of patent licensing contract
CP01 Change in the name or title of a patent holder

Address after: 311215 unit 1, building 1, area C, Qianjiang Century Park, ningwei street, Xiaoshan District, Hangzhou City, Zhejiang Province

Patentee after: Zhejiang Visual Intelligence Innovation Center Co.,Ltd.

Address before: 311215 unit 1, building 1, area C, Qianjiang Century Park, ningwei street, Xiaoshan District, Hangzhou City, Zhejiang Province

Patentee before: Zhejiang smart video security Innovation Center Co.,Ltd.

CP01 Change in the name or title of a patent holder