CN116489371A - Image decoding method and device based on frequency domain bit width enhancement and electronic equipment - Google Patents

Image decoding method and device based on frequency domain bit width enhancement and electronic equipment Download PDF

Info

Publication number
CN116489371A
CN116489371A CN202310231091.9A CN202310231091A CN116489371A CN 116489371 A CN116489371 A CN 116489371A CN 202310231091 A CN202310231091 A CN 202310231091A CN 116489371 A CN116489371 A CN 116489371A
Authority
CN
China
Prior art keywords
frequency response
image
response sequence
value
bit width
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310231091.9A
Other languages
Chinese (zh)
Inventor
刘建伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Aixin Yuanzhi Technology Co ltd
Original Assignee
Shenzhen Aixin Yuanzhi Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Aixin Yuanzhi Technology Co ltd filed Critical Shenzhen Aixin Yuanzhi Technology Co ltd
Priority to CN202310231091.9A priority Critical patent/CN116489371A/en
Publication of CN116489371A publication Critical patent/CN116489371A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/182Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The application provides an image decoding method and device based on frequency domain bit width enhancement and electronic equipment, and relates to the technical field of image processing. The method comprises the following steps: decoding the received code stream to obtain a first frequency response sequence corresponding to each pixel point in the first image; in response to the absence of the value in the first frequency response sequence, supplementing the absence of the value in the first frequency response sequence to obtain a second frequency response sequence; determining the first frequency response sequence as a second frequency response sequence in response to the absence of a missing value in the first frequency response sequence; and performing spatial domain transformation on the second frequency response sequence according to the first bit width of the first image and the preset second bit width to obtain a second image under the converted second bit width. The method and the device can recover high-quality image data from the compressed code stream, relieve bandwidth limitation, enable the overall image effect to be as close as possible to that of the original image, and better keep information in the original image.

Description

Image decoding method and device based on frequency domain bit width enhancement and electronic equipment
Technical Field
The present disclosure relates to the field of image processing technologies, and in particular, to an image decoding method and apparatus based on frequency domain bit width enhancement, and an electronic device.
Background
The data transfer capability provided by the memory bus is limited, and memory bandwidth is an important resource in a system-on-chip, particularly in a high resolution image signal processor (Image Signal Processor, ISP) system. In the related art, data can be transmitted as much as possible under the condition of limited bandwidth by using a compression and decompression algorithm, but stepped image defects (artifacts) and noise may occur in the compression and decompression process, so how to recover high-quality image data from a compressed code stream, alleviate bandwidth limitation in a chip system, better retain information in an original image has become one of important research directions.
Disclosure of Invention
The object of the present application is to solve, at least to some extent, one of the technical problems in the art described above.
The first aspect of the present application provides an image decoding method based on frequency domain bit width enhancement, including:
receiving a code stream of a first image sent by an encoder, and decoding the code stream to obtain a first frequency response sequence corresponding to each pixel point in the first image;
identifying a first frequency response sequence corresponding to any pixel point, and determining whether a missing value exists in the first frequency response sequence;
in response to the absence of the value in the first frequency response sequence, supplementing the absence of the value in the first frequency response sequence to obtain a second frequency response sequence;
determining the first frequency response sequence as a second frequency response sequence in response to the absence of a missing value in the first frequency response sequence;
and aiming at a second frequency response sequence corresponding to any pixel point, performing spatial domain transformation on the second frequency response sequence according to the first bit width of the first image and the preset second bit width so as to acquire a second image under the second bit width after the conversion of the first image, wherein the second bit width is larger than or equal to the first bit width.
The image decoding method based on frequency domain bit width enhancement provided in the first aspect of the present application further has the following technical characteristics, including:
according to an embodiment of the present application, supplementing the missing value in the first frequency response sequence to obtain a second frequency response sequence includes:
for any missing value in the first frequency response sequence, responding to the fact that the bit of the missing value is smaller than or equal to a preset bit threshold value, and carrying out random processing on the missing value; or (b)
And determining the missing value as a preset value in response to the bit in which the missing value is located being greater than the bit threshold.
According to an embodiment of the present application, performing spatial transform on the second frequency response sequence according to the first bit width of the first image and the preset second bit width to obtain a second image with the second bit width after the conversion of the first image, including:
obtaining a difference value between the second bit width and the first bit width, and obtaining a target transformation factor according to the difference value;
aiming at the second frequency response sequence corresponding to any pixel point, performing space domain transformation on the second frequency response sequence according to a target transformation factor to obtain a target pixel value corresponding to any pixel point;
the target pixel value of each pixel is quantized to obtain a second image.
According to an embodiment of the present application, obtaining a target transformation factor according to a difference value includes:
obtaining a target base number according to a preset carry number system;
and carrying out exponential operation according to the target base number and the difference value to obtain a target transformation factor.
According to an embodiment of the present application, performing spatial transform on the second frequency response sequence according to a target transform factor to obtain a target pixel value corresponding to any pixel point, including:
performing spatial domain transformation on any second frequency response sequence to obtain candidate pixel values retaining preset decimal places;
and obtaining a target pixel value according to the product of the candidate pixel value and the target transformation factor.
According to an embodiment of the present application, quantizing each target pixel value to obtain a second image includes:
each target pixel value is rounded down to obtain a second image.
According to an embodiment of the present application, decoding a code stream to obtain a first frequency response sequence corresponding to each pixel point in a first image includes:
acquiring the coding sequence of each bit in the code stream;
and decoding the code stream according to the coding sequence to obtain a first frequency response sequence corresponding to each pixel point in the first image.
A second aspect of the present application provides an image decoding apparatus based on frequency domain bit width enhancement, comprising:
the decoding module is used for receiving the code stream of the first image sent by the encoder, decoding the code stream and obtaining a first frequency response sequence corresponding to each pixel point in the first image;
the identification module is used for identifying the first frequency response sequence corresponding to any pixel point and determining whether a missing value exists in the first frequency response sequence;
the processing module is used for responding to the missing value in the first frequency response sequence, supplementing the missing value in the first frequency response sequence and obtaining a second frequency response sequence; determining the first frequency response sequence as a second frequency response sequence in response to the absence of a missing value in the first frequency response sequence;
the acquisition module is used for carrying out space domain transformation on the second frequency response sequence according to the first bit width of the first image and the preset second bit width aiming at the second frequency response sequence corresponding to any pixel point so as to acquire a second image under the second bit width after the conversion of the first image, wherein the second bit width is larger than or equal to the first bit width.
An image decoding device based on frequency domain bit width enhancement provided in a second aspect of the present application further has the following technical features, including:
according to an embodiment of the present application, the processing module is further configured to:
for any missing value in the first frequency response sequence, responding to the fact that the bit of the missing value is smaller than or equal to a preset bit threshold value, and carrying out random processing on the missing value; or (b)
And determining the missing value as a preset value in response to the bit in which the missing value is located being greater than the bit threshold.
According to an embodiment of the present application, the obtaining module is further configured to:
obtaining a difference value between the second bit width and the first bit width, and obtaining a target transformation factor according to the difference value;
aiming at the second frequency response sequence corresponding to any pixel point, performing space domain transformation on the second frequency response sequence according to a target transformation factor to obtain a target pixel value corresponding to any pixel point;
the target pixel value of each pixel is quantized to obtain a second image.
According to an embodiment of the present application, the obtaining module is further configured to:
obtaining a target base number according to a preset carry number system;
and carrying out exponential operation according to the target base number and the difference value to obtain a target transformation factor.
According to an embodiment of the present application, the obtaining module is further configured to:
performing spatial domain transformation on any second frequency response sequence to obtain candidate pixel values retaining preset decimal places;
and obtaining a target pixel value according to the product of the candidate pixel value and the target transformation factor.
According to an embodiment of the present application, the obtaining module is further configured to:
each target pixel value is rounded down to obtain a second image.
According to an embodiment of the present application, the decoding module is further configured to:
acquiring the coding sequence of each bit in the code stream;
and decoding the code stream according to the coding sequence to obtain a first frequency response sequence corresponding to each pixel point in the first image.
An embodiment of a third aspect of the present application provides an electronic device, including: at least one processor; and a memory communicatively coupled to the at least one processor; the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the frequency domain bit width enhancement based image decoding method provided in the first aspect of the present application.
An embodiment of a fourth aspect of the present application provides a computer-readable storage medium storing computer instructions for causing a computer to perform the frequency domain bit width enhancement-based image decoding method provided in the first aspect of the present application.
An embodiment of a fifth aspect of the present application provides a computer program product which, when executed by an instruction processor in the computer program product, performs the frequency domain bit width enhancement based image decoding method provided in the first aspect of the present application.
Additional aspects and advantages of the application will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the application.
According to the method and the device for decoding the image, the high-quality image data can be recovered from the compressed code stream, bandwidth limitation in a chip system is relieved, and non-important lossy signal parts are adaptively generated in a frequency domain, so that the overall image effect is as close as possible to that of an original image, information in the original image is better reserved, and the accuracy of image decoding is improved.
Drawings
The foregoing and/or additional aspects and advantages of the present application will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings, in which:
FIG. 1 is a flow chart of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application;
FIG. 2 is a schematic diagram of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application;
FIG. 3 is a schematic diagram of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application;
FIG. 4 is a schematic diagram of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application;
FIG. 5 is a schematic diagram of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application;
FIG. 6 is a flow chart of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application;
FIG. 7 is a flow chart of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application;
FIG. 8 is a schematic diagram of an image decoding apparatus based on frequency domain bit width enhancement according to an embodiment of the present application;
fig. 9 is a block diagram of an electronic device of an embodiment of the present application.
Detailed Description
Embodiments of the present application are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals refer to the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the drawings are exemplary and intended for the purpose of explaining the present application and are not to be construed as limiting the present application.
The following describes an image decoding method, an image decoding device and an electronic device based on frequency domain bit width enhancement according to the embodiments of the present application with reference to the accompanying drawings.
Fig. 1 is a flowchart of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application, and as shown in fig. 1, the method is performed by a decoder, and includes:
s101, receiving a code stream of a first image sent by an encoder, and decoding the code stream to obtain a first frequency response sequence corresponding to each pixel point in the first image.
In this embodiment of the present application, after the encoder acquires the first image, the encoder compresses the first image according to the frequency domain conversion algorithm to acquire a code stream of the first image, and then sends the code stream of the first image to the decoder.
In order to improve coding efficiency and accuracy, the encoder may send the coding sequence of each bit in the code stream to the decoder, and the decoder obtains the coding sequence of each bit in the code stream, and then decodes the code stream according to the coding sequence, so as to obtain a first frequency response sequence corresponding to each pixel in the first image.
In some implementations, the process of the encoder compressing the first image to obtain the code stream of the first image may include: firstly, converting a first image from a space domain to a frequency domain, and converting the first image from the space domain to the frequency domain to obtain a frequency domain image, wherein optionally, an image substrate corresponding to a pixel point corresponding to the frequency domain image is shown in fig. 2. Because the low frequency data has higher importance, the coding sequence of each bit data can be obtained according to the frequency data corresponding to any pixel point, namely, the coding sequence of the low frequency data is higher than the coding sequence of the high frequency data, further, the first image can be coded according to the coding sequence, all pixels in the image are changed into a bit code stream with sequence, the required bit number is determined according to the set code stream requirement, and the generated code stream is cut off. Alternatively, in the embodiment of the present application, the more important the earlier the code stream is with respect to the image quality, the encoder may intercept the first n bit code streams.
Alternatively, the coding sequence may also be a preset coding sequence.
Alternatively, the encoder may transform the first image spatially to frequency by a frequency domain transform algorithm, such as a LeGall 5/3 frequency domain transform algorithm, a Daubechies 9/7 frequency domain transform algorithm, a discrete cosine transform (Discrete Cosine Transform, DCT), or the like.
Taking the size of the first image as h×w pixels as an example for illustration, the coding sequence in the embodiment of the present application may be as shown in fig. 3, where the highest bit of the (0, 0) pixel in the image coordinate system is coded, then the highest bit of the (0, 1) pixel is coded, then the next highest bit of the (0, 0) pixel is coded, and finally the lowest bit of the (h-1, w-1) pixel is coded. Wherein the number of each bit indicates the coding order, the smaller the number the earlier the coding order. Thus, the first image is encoded in the encoding order, and all pixels in the image can be changed into an ordered bit stream. Wherein h, w and n are positive integers.
In this embodiment of the present application, the decoder may decode the received code stream to obtain the data after the frequency domain recovery.
S102, identifying a first frequency response sequence corresponding to any pixel point, and determining whether a missing value exists in the first frequency response sequence.
As shown in fig. 4, since the received code stream is a truncated code stream, the recovered data may have different importance, and the number of recovered bit widths is not uniform. In this embodiment, the first frequency response sequence is identified, and whether a missing value (i.e., a bit of the data is not decoded) exists in the first frequency response sequence is determined, so that the first frequency response sequence with the missing value is supplemented later, and a second frequency response sequence with consistent bit width is obtained.
In this embodiment, the bit width of the second frequency response sequence is identical to the bit width of the first image.
As shown in fig. 4, a column of data in the coordinates of each pixel point is the first frequency response sequence corresponding to the pixel point.
S103, in response to the missing value in the first frequency response sequence, supplementing the missing value in the first frequency response sequence to obtain a second frequency response sequence.
In some implementations, missing values in the first frequency response sequence are then processed to obtain a second frequency response sequence.
In some implementations, in order to improve the sharpness of the image, to avoid the occurrence of a stepped image defect, for any missing value in the first frequency response sequence, the missing value is randomly processed in response to the bit at which the missing value is located being less than or equal to a preset bit threshold, and the missing value is determined to be the preset value in response to the bit at which the missing value is located being greater than the bit threshold. The bit threshold is 2, and the preset value is 0, that is, the missing values of the last 2 bits in the first frequency response sequence of each pixel point are randomly processed, as shown in fig. 5, the missing value exists in the last bit of the (0, 1) pixel point, the missing value exists in the last 3 bits of the (h-1, w-1) pixel point, the missing value of the last bit of the (0, 1) pixel point and the missing value of the last 2 bits of the (h-1, w-1) pixel point can be randomly processed, and the missing value of the 2 nd bit of the (h-1, w-1) pixel point is set to 0.
S104, determining the first frequency response sequence as a second frequency response sequence in response to the absence of the missing value in the first frequency response sequence.
As shown in fig. 5, the first frequency response sequence of the (0, 0) pixel point has no missing value, and the first frequency response sequence of the pixel point is determined to be the second frequency response sequence without supplementing the first frequency response sequence.
In the embodiment of the application, according to the set random strength, 0/1 random is carried out on unrecovered bits, and for important pixels with high recovery degree, the random is less, and the whole signal is better reserved; for unimportant pixels with low restoration degree, more randomization is performed, thereby avoiding occurrence of image defects.
S105, aiming at a second frequency response sequence corresponding to any pixel point, performing spatial domain transformation on the second frequency response sequence according to the first bit width of the first image and the preset second bit width so as to obtain a second image with the second bit width converted by the first image, wherein the second bit width is larger than or equal to the first bit width.
In this embodiment of the present application, the high-precision spatial transform may be performed on the second frequency response sequence according to the inverse transform of the frequency domain transform algorithm, so as to obtain the second image under the second bit width after the conversion of the first image. For example, the bit width difference value can be obtained according to the first bit width and the second bit width, and then the bit width difference value is acted on the airspace transformation process of the second frequency response sequence, so that the airspace image with higher precision is obtained, at the moment, the information quantity carried by the airspace image is larger than or equal to the information quantity of the first bit width, and further the airspace image is quantized to the second bit width, so that the second image under the second bit width after the conversion of the first image can be obtained.
In the embodiment of the application, decoding is performed on a received code stream to obtain a first frequency response sequence corresponding to each pixel point in a first image; in response to the absence of the value in the first frequency response sequence, supplementing the absence of the value in the first frequency response sequence to obtain a second frequency response sequence; determining the first frequency response sequence as a second frequency response sequence in response to the absence of a missing value in the first frequency response sequence; and performing spatial domain transformation on the second frequency response sequence according to the first bit width of the first image and the preset second bit width so as to obtain a second image under the second bit width after the conversion of the first image. The method and the device can recover high-quality image data from the compressed code stream, relieve bandwidth limitation in a chip system, and adaptively generate non-important lossy signal parts in a frequency domain, so that the overall image effect is as close as possible to an original image, information in the original image is better reserved, and the accuracy of image decoding is improved.
Fig. 6 is a flowchart of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application, as shown in fig. 6, performing spatial transform on a second frequency response sequence according to a first bit width of a first image and a preset second bit width, so as to obtain a second image under the second bit width after the conversion of the first image, where the method includes:
s601, obtaining a difference value between the second bit width and the first bit width, and obtaining a target transformation factor according to the difference value.
In some implementations, the target base is obtained according to a preset carry number system, and the target transformation factor is obtained by performing an exponential operation according to the target base and the difference value.
Alternatively, in an implementation where the carry count is made binary, the target transform factor may be obtained using the following formula:
α=2 β-γ
where α is the target transform factor, β is the second bit width, and γ is the first bit width.
For example, if the first bit width is 8 bits, the pixel value is (0-255), the second bit width is 10 bits, (pixel value is 0-1023), the target transform factor is 2 10-8 =4。
S602, for a second frequency response sequence corresponding to any pixel point, performing spatial domain transformation on the second frequency response sequence according to a target transformation factor to obtain a target pixel value corresponding to any pixel point.
In some implementations, for any second frequency response sequence, performing spatial transform on the second frequency response sequence to obtain a candidate pixel value retaining a preset decimal place, and obtaining a target pixel value according to the product of the candidate pixel value and a target transform factor.
For example, if the first bit width is 8 bits, the pixel value is (0-255), the second bit width is 10 bits, (the pixel value is 0-1023), the second frequency response sequence is subjected to high-precision spatial transform, the candidate pixel value with reserved 2 decimal places is obtained to be 10.25, and since more reserved 2 bits of data are reserved, the target pixel value is 10.25×4=41, which is equivalent to changing the pixel value from 0-255 to 0-1023, and thus the data with low bit width is also brought into the target pixel value.
S603, quantizing the target pixel value of each pixel point to obtain a second image.
And (3) rounding down each target pixel value obtained in the previous step to obtain a second image.
In some implementations, the target pixel value for each pixel point may be quantized using the following formula:
F=floor(x)
where F represents the second image, floor (..) represents a function of rounding down the element, and x represents the target pixel value of each pixel point.
In some implementations, the target pixel value for each pixel point may be quantized using the following formula:
in some implementations, the target pixel value for each pixel point may be quantized using the following formula:
in the embodiment of the application, a difference value between a second bit width and a first bit width is obtained, a target transformation factor is obtained according to the difference value, a second frequency response sequence corresponding to any pixel point is subjected to spatial domain transformation according to the target transformation factor, so as to obtain a target pixel value corresponding to any pixel point, and the target pixel value of each pixel point is quantized so as to obtain a second image. The method and the device can recover high-quality image data from the compressed code stream, relieve bandwidth limitation in a chip system, and adaptively generate non-important lossy signal parts in a frequency domain, so that the overall image effect is as close as possible to that of an original image, and information in the original image is better reserved.
Fig. 7 is a flowchart of an image decoding method based on frequency domain bit width enhancement according to an embodiment of the present application, as shown in fig. 7, in the embodiment of the present application, an encoder performs spatial domain to frequency domain conversion on a first image, further encodes the first image according to an encoding sequence, changes all pixels in the image into a sequential bit stream, determines a required bit number according to a set bit stream requirement, intercepts the generated bit stream, and intercepts the first n bit streams to be sent to a decoder. The decoder receives the code stream of the first image sent by the encoder, recovers the frequency domain signal of the code stream according to the encoding sequence, supplements the missing value of the frequency domain, and performs high-precision space domain conversion according to the first bit width of the first image and the preset second bit width so as to obtain a second image under the second bit width after the conversion of the first image.
In some implementations, when the second bit width of the second image is greater than the first bit width of the first image, a higher compression rate may be obtained according to the image decoding method of the present application.
In the embodiment of the application, the high-quality image data can be recovered from the compressed code stream, the bandwidth limitation in a chip system is relieved, and the non-important lossy signal part is adaptively generated in the frequency domain, so that the overall image effect is as close as possible to the original image, the information in the original image is better reserved, and the accuracy of image decoding is improved.
Fig. 8 is a schematic structural diagram of an image decoding device based on frequency domain bit width enhancement according to an embodiment of the present application, as shown in fig. 8, an image decoding device 800 based on frequency domain bit width enhancement includes:
the decoding module 810 is configured to receive a code stream of the first image sent by the encoder, and decode the code stream to obtain a first frequency response sequence corresponding to each pixel point in the first image;
the identifying module 820 is configured to identify, for a first frequency response sequence corresponding to any pixel point, the first frequency response sequence, and determine whether a missing value exists in the first frequency response sequence;
the processing module 830 is configured to supplement the missing value in the first frequency response sequence to obtain a second frequency response sequence in response to the missing value in the first frequency response sequence; determining the first frequency response sequence as a second frequency response sequence in response to the absence of a missing value in the first frequency response sequence;
the obtaining module 840 is configured to perform spatial transform on the second frequency response sequence according to the first bit width of the first image and the preset second bit width for the second frequency response sequence corresponding to any pixel point, so as to obtain a second image under the second bit width after the conversion of the first image, where the second bit width is greater than or equal to the first bit width.
The image decoding device 800 based on frequency domain bit width enhancement provided in the present application further has the following technical features, including:
according to an embodiment of the present application, the processing module 830 is further configured to:
for any missing value in the first frequency response sequence, responding to the fact that the bit of the missing value is smaller than or equal to a preset bit threshold value, and carrying out random processing on the missing value; or (b)
And determining the missing value as a preset value in response to the bit in which the missing value is located being greater than the bit threshold.
According to an embodiment of the present application, the obtaining module 840 is further configured to:
obtaining a difference value between the second bit width and the first bit width, and obtaining a target transformation factor according to the difference value;
aiming at the second frequency response sequence corresponding to any pixel point, performing space domain transformation on the second frequency response sequence according to a target transformation factor to obtain a target pixel value corresponding to any pixel point;
the target pixel value of each pixel is quantized to obtain a second image.
According to an embodiment of the present application, the obtaining module 840 is further configured to:
obtaining a target base number according to a preset carry number system;
and carrying out exponential operation according to the target base number and the difference value to obtain a target transformation factor.
According to an embodiment of the present application, the obtaining module 840 is further configured to:
performing spatial domain transformation on any second frequency response sequence to obtain candidate pixel values retaining preset decimal places;
and obtaining a target pixel value according to the product of the candidate pixel value and the target transformation factor.
According to an embodiment of the present application, the obtaining module 840 is further configured to:
each target pixel value is rounded down to obtain a second image.
According to an embodiment of the present application, the decoding module 810 is further configured to:
acquiring the coding sequence of each bit in the code stream;
and decoding the code stream according to the coding sequence to obtain a first frequency response sequence corresponding to each pixel point in the first image.
The method and the device can recover high-quality image data from the compressed code stream, relieve bandwidth limitation in a chip system, and adaptively generate non-important lossy signal parts in a frequency domain, so that the overall image effect is as close as possible to an original image, information in the original image is better reserved, and the accuracy of image decoding is improved.
To achieve the above embodiments, the present application also provides an electronic device, a computer-readable storage medium, and a computer program product.
Fig. 9 is a block diagram of an electronic device according to an embodiment of the present application, and an image decoding method based on frequency domain bit width enhancement may be implemented according to the electronic device shown in fig. 9.
As shown in fig. 9, the electronic device 900 includes: memory 910 and processor 920, bus 930 connecting different components (including memory 910 and processor 920), memory 910 stores a computer program, and when processor 920 executes the program, implements the method for identifying device identification hops according to the embodiments of the present disclosure.
Bus 930 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, and a local bus using any of a variety of bus architectures. By way of example, and not limitation, such architectures include Industry Standard Architecture (ISA) bus, micro channel architecture (MAC) bus, enhanced ISA bus, video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus.
Electronic device 900 typically includes a variety of electronic device readable media. Such media can be any available media that is accessible by electronic device 900 and includes both volatile and nonvolatile media, removable and non-removable media.
Memory 910 may also include computer-system readable media in the form of volatile memory such as Random Access Memory (RAM) 940 and/or cache memory 950. The electronic device 900 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 960 may be used to read from and write to non-removable, nonvolatile magnetic media (not shown in FIG. 9, commonly referred to as a "hard disk drive"). Although not shown in fig. 9, a magnetic disk drive for reading from and writing to a removable non-volatile magnetic disk (e.g., a "floppy disk"), and an optical disk drive for reading from or writing to a removable non-volatile optical disk (e.g., a CD-ROM, DVD-ROM, or other optical media) may be provided. In such cases, each drive may be coupled to bus 930 via one or more data medium interfaces. Memory 910 may include at least one program product having a set (e.g., at least one) of program modules configured to carry out the functions of the various embodiments of the disclosure.
A program/utility 980 having a set (at least one) of program modules 970 may be stored, for example, in memory 910, such program modules 970 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each or some combination of which may include an implementation of a network environment. Program modules 970 generally perform the functions and/or methods in the embodiments described in this disclosure.
The electronic device 900 may also communicate with one or more external devices 990 (e.g., keyboard, pointing device, display 991, etc.), one or more devices that enable a user to interact with the electronic device 900, and/or any devices (e.g., network card, modem, etc.) that enable the electronic device 900 to communicate with one or more other computing devices. Such communication may occur through an input/output (I/O) interface 992. Also, the electronic device 900 may communicate with one or more networks such as a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the Internet, through a network adapter 993. As shown in fig. 9, the network adapter 993 communicates with other modules of the electronic device 900 over the bus 930. It should be appreciated that although not shown, other hardware and/or software modules may be used in connection with electronic device 900, including, but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, data backup storage systems, and the like.
The processor 920 performs various functional applications and data processing by running programs stored in the memory 910.
To achieve the above embodiments, the present application also provides a computer-readable storage medium storing computer instructions for causing a computer to execute an image decoding method based on frequency domain bit width enhancement.
To achieve the above embodiments, the present application also provides a computer program product which, when executed by an instruction processor in the computer program product, performs an image decoding method based on frequency domain bit width enhancement.
In the description of the present specification, a description referring to terms "one embodiment," "some embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present application. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, the different embodiments or examples described in this specification and the features of the different embodiments or examples may be combined and combined by those skilled in the art without contradiction.
Furthermore, the terms "first," "second," and the like, are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include at least one such feature. In the description of the present application, the meaning of "plurality" is at least two, such as two, three, etc., unless explicitly defined otherwise.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and additional implementations are included within the scope of the preferred embodiment of the present application in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the embodiments of the present application.
Logic and/or steps represented in the flowcharts or otherwise described herein, e.g., a ordered listing of executable instructions for implementing logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). In addition, the computer readable medium may even be paper or other suitable medium on which the program is printed, as the program may be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory.
It is to be understood that portions of the present application may be implemented in hardware, software, firmware, or a combination thereof. In the above-described embodiments, the various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. As with the other embodiments, if implemented in hardware, may be implemented using any one or combination of the following techniques, as is well known in the art: discrete logic circuits having logic gates for implementing logic functions on data signals, application specific integrated circuits having suitable combinational logic gates, programmable Gate Arrays (PGAs), field Programmable Gate Arrays (FPGAs), and the like.
Those of ordinary skill in the art will appreciate that all or a portion of the steps carried out in the method of the above-described embodiments may be implemented by a program to instruct related hardware, where the program may be stored in a computer readable storage medium, and where the program, when executed, includes one or a combination of the steps of the method embodiments.
In addition, each functional unit in each embodiment of the present application may be integrated in one processing module, or each unit may exist alone physically, or two or more units may be integrated in one module. The integrated modules may be implemented in hardware or in software functional modules. The integrated modules may also be stored in a computer readable storage medium if implemented in the form of software functional modules and sold or used as a stand-alone product.
The above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, or the like. Although embodiments of the present application have been shown and described above, it will be understood that the above embodiments are illustrative and not to be construed as limiting the application, and that variations, modifications, alternatives, and variations may be made to the above embodiments by one of ordinary skill in the art within the scope of the application.

Claims (16)

1. An image decoding method based on frequency domain bit width enhancement, performed by a decoder, comprising:
receiving a code stream of a first image sent by an encoder, and decoding the code stream to obtain a first frequency response sequence corresponding to each pixel point in the first image;
identifying a first frequency response sequence corresponding to any pixel point, and determining whether a missing value exists in the first frequency response sequence;
supplementing the missing value in the first frequency response sequence to obtain a second frequency response sequence in response to the missing value in the first frequency response sequence;
determining the first frequency response sequence as the second frequency response sequence in response to the absence of a missing value in the first frequency response sequence;
and aiming at a second frequency response sequence corresponding to any pixel point, performing spatial domain transformation on the second frequency response sequence according to the first bit width of the first image and a preset second bit width so as to acquire a second image under the second bit width after the conversion of the first image, wherein the second bit width is larger than or equal to the first bit width.
2. The method of claim 1, wherein the supplementing missing values in the first frequency response sequence to obtain a second frequency response sequence comprises:
for any missing value in the first frequency response sequence, responding to the fact that the bit of the missing value is smaller than or equal to a preset bit threshold value, and carrying out random processing on the missing value; or (b)
And determining the missing value as a preset value in response to the bit of the missing value being greater than the bit threshold.
3. The method of claim 1, wherein spatially transforming the second frequency response sequence according to the first bit width of the first image and the preset second bit width to obtain a second image at the second bit width after the first image conversion, comprises:
obtaining a difference value between the second bit width and the first bit width, and obtaining a target transformation factor according to the difference value;
performing spatial domain transformation on a second frequency response sequence corresponding to any pixel point according to the target transformation factor to obtain a target pixel value corresponding to any pixel point;
the target pixel value for each pixel is quantized to obtain the second image.
4. A method according to claim 3, wherein said obtaining a target transformation factor from said difference value comprises:
obtaining a target base number according to a preset carry number system;
and carrying out exponential operation according to the target base number and the difference value to obtain the target transformation factor.
5. A method according to claim 3, wherein spatially transforming the second frequency response sequence according to the target transformation factor to obtain a target pixel value corresponding to the arbitrary pixel point comprises:
performing spatial domain transformation on any second frequency response sequence to obtain a candidate pixel value retaining a preset decimal place;
and obtaining the target pixel value according to the product of the candidate pixel value and the target transformation factor.
6. A method according to claim 3, wherein said quantizing each of said target pixel values to obtain said second image comprises:
and rounding down each target pixel value to acquire the second image.
7. The method according to any one of claims 1 to 6, wherein decoding the code stream to obtain a first frequency response sequence corresponding to each pixel in the first image includes:
acquiring the coding sequence of each bit in the code stream;
and decoding the code stream according to the coding sequence to obtain the first frequency response sequence corresponding to each pixel point in the first image.
8. An image decoding apparatus based on frequency domain bit width enhancement, comprising:
the decoding module is used for receiving the code stream of the first image sent by the encoder, decoding the code stream and obtaining a first frequency response sequence corresponding to each pixel point in the first image;
the identification module is used for identifying a first frequency response sequence corresponding to any pixel point, and determining whether a missing value exists in the first frequency response sequence;
the processing module is used for responding to the existence of the missing value in the first frequency response sequence, supplementing the missing value in the first frequency response sequence and obtaining a second frequency response sequence; determining the first frequency response sequence as the second frequency response sequence in response to the absence of a missing value in the first frequency response sequence;
the acquisition module is used for carrying out space domain transformation on the second frequency response sequence according to the first bit width of the first image and the preset second bit width aiming at the second frequency response sequence corresponding to any pixel point so as to acquire a second image under the second bit width after the conversion of the first image, wherein the second bit width is larger than or equal to the first bit width.
9. The apparatus of claim 8, wherein the processing module is further configured to:
for any missing value in the first frequency response sequence, responding to the fact that the bit of the missing value is smaller than or equal to a preset bit threshold value, and carrying out random processing on the missing value; or (b)
And determining the missing value as a preset value in response to the bit of the missing value being greater than the bit threshold.
10. The apparatus of claim 8, wherein the acquisition module is further configured to:
obtaining a difference value between the second bit width and the first bit width, and obtaining a target transformation factor according to the difference value;
performing spatial domain transformation on a second frequency response sequence corresponding to any pixel point according to the target transformation factor to obtain a target pixel value corresponding to any pixel point;
the target pixel value for each pixel is quantized to obtain the second image.
11. The apparatus of claim 10, wherein the acquisition module is further configured to:
obtaining a target base number according to a preset carry number system;
and carrying out exponential operation according to the target base number and the difference value to obtain the target transformation factor.
12. The apparatus of claim 10, wherein the acquisition module is further configured to:
performing spatial domain transformation on any second frequency response sequence to obtain a candidate pixel value retaining a preset decimal place;
and obtaining the target pixel value according to the product of the candidate pixel value and the target transformation factor.
13. The apparatus of claim 10, wherein the acquisition module is further configured to:
and rounding down each target pixel value to acquire the second image.
14. The apparatus of any one of claims 8 to 13, wherein the decoding module is further configured to:
acquiring the coding sequence of each bit in the code stream;
and decoding the code stream according to the coding sequence to obtain the first frequency response sequence corresponding to each pixel point in the first image.
15. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein, the liquid crystal display device comprises a liquid crystal display device,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7.
16. A computer readable storage medium storing computer instructions for causing the computer to perform the method of any one of claims 1-7.
CN202310231091.9A 2023-03-01 2023-03-01 Image decoding method and device based on frequency domain bit width enhancement and electronic equipment Pending CN116489371A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310231091.9A CN116489371A (en) 2023-03-01 2023-03-01 Image decoding method and device based on frequency domain bit width enhancement and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310231091.9A CN116489371A (en) 2023-03-01 2023-03-01 Image decoding method and device based on frequency domain bit width enhancement and electronic equipment

Publications (1)

Publication Number Publication Date
CN116489371A true CN116489371A (en) 2023-07-25

Family

ID=87216780

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310231091.9A Pending CN116489371A (en) 2023-03-01 2023-03-01 Image decoding method and device based on frequency domain bit width enhancement and electronic equipment

Country Status (1)

Country Link
CN (1) CN116489371A (en)

Similar Documents

Publication Publication Date Title
CN104838653B (en) Lossless image compression using differential transmission
JP4895400B2 (en) Improved compression in the representation of non-frame edge blocks of image frames
US10785493B2 (en) Method of compressing and decompressing image data
US8254700B1 (en) Optimized method and system for entropy coding
JP5318256B2 (en) High bit depth image compression
TWI479456B (en) Method and apparatus for data compression using error plane coding
US7983494B2 (en) Method, medium, and system compressing and/or decompressing pixel data
CN113170140A (en) Bit plane encoding of data arrays
EP2787738B1 (en) Tile-based compression for graphic applications
JP2010098352A (en) Image information encoder
US20110052089A1 (en) Image processing system, image processing method, and computer-readable medium
US20060239572A1 (en) Encoding device and method, decoding device and method, and program
US20170201759A1 (en) Method and device for image encoding and image decoding
CN116489371A (en) Image decoding method and device based on frequency domain bit width enhancement and electronic equipment
US10536697B2 (en) Method for re-encoding image and apparatus therefor
US9819969B2 (en) Generalization of methods and systems for image compression while encoding at least one extra bit
US8094951B2 (en) Coding system and method for a bit-plane
CN110545437B (en) Coefficient encoding method, coefficient decoding method, electronic device, and medium
CN114051144A (en) Video compression method and device, computer equipment and storage medium
JP5593468B1 (en) Image coding apparatus and image coding method
CN114501032A (en) Video encoding device and method, video decoding device and method, and encoding/decoding system
CN115250351A (en) Compression method, decompression method and related products for image data
CN116668707A (en) Image encoding method, image encoding device, image decoding method and image decoding device
KR20160022726A (en) Apparatus and method for encoding
TWI484829B (en) System and method for processing image

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination