CN113055669A - Image filtering method and device before coding - Google Patents

Image filtering method and device before coding

Info

Publication number
CN113055669A
CN113055669A
Authority
CN
China
Prior art keywords
pixel
search window
spatial
pixels
acquiring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110057841.6A
Other languages
Chinese (zh)
Other versions
CN113055669B (en)
Inventor
向国庆
黄菊
贾惠柱
范晓东
宋磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Boya Huishi Intelligent Technology Research Institute Co ltd
Original Assignee
Beijing Boya Huishi Intelligent Technology Research Institute Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Boya Huishi Intelligent Technology Research Institute Co ltd filed Critical Beijing Boya Huishi Intelligent Technology Research Institute Co ltd
Priority to CN202110057841.6A
Publication of CN113055669A
Application granted
Publication of CN113055669B
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/117: Filters, e.g. for pre-processing or post-processing
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, the unit being an image region, e.g. an object
    • H04N19/172: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, the unit being an image region, the region being a picture, frame or field
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/182: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, the unit being a pixel
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/86: Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses an image filtering method applied before encoding, which comprises the following steps: for each pixel in the current frame, determining the spatial-domain weight between the pixel and the other pixels in a spatial search window, and acquiring the temporal-domain weight between the pixel and the pixels in a temporal search window of the previous frame; the spatial search window is a window centered on the pixel in the current frame, and the temporal search window is a window centered on the same pixel position in the previous frame. The pixel is then filtered according to the pixels in the temporal search window, the temporal-domain weights between the pixel and those pixels, the other pixels in the spatial search window, and the spatial-domain weights between the pixel and those pixels. By considering the spatial-domain similarity of the pixel to be filtered in the current frame while recursively introducing the temporal-domain similarity of the previously filtered frame, and filtering the pixel with both items of information, the method reduces blocking artifacts even when the code rate is insufficient.

Description

Image filtering method and device before coding
Technical Field
The invention relates to the technical field of image processing, in particular to a method and a device for filtering an image before coding.
Background
In the field of video coding, varying degrees of inter-block discontinuity distortion, i.e., blockiness, often occur in coded video frames. For this reason, blocking artifacts are currently reduced by setting a loop filter algorithm in the coding tool.
However, when the bandwidth is insufficient or the video content is too complex, the code rate becomes insufficient, and the video frames may not achieve satisfactory quality after passing through the encoding tool. Therefore, even if a post-processing method such as loop filtering is used inside the encoding tool, blocking artifacts still occur at low code rates, and relying on the post-processing inside the encoding tool alone is not enough to solve the blocking artifact problem.
Disclosure of Invention
The present invention provides a method and an apparatus for filtering an image before encoding, which are directed to the above-mentioned deficiencies of the prior art, and the object is achieved by the following technical solutions.
The first aspect of the present invention provides a method for filtering an image before encoding, where the method includes:
aiming at each pixel in the current frame, determining the spatial domain weight between the pixel and other pixels in a spatial search window, and acquiring the temporal domain weight between the pixel and other pixels except the pixel position in the temporal search window of the previous frame; the spatial search window is a window which takes the pixel as the center in the current frame, the temporal search window is a window which takes the pixel position as the center in the previous frame, and the previous frame is a coded video frame;
and filtering the pixel according to other pixels except the pixel position in the time search window, the time domain weight between the pixel and other pixels except the pixel position in the time search window, other pixels in the space search window and the space domain weight between the pixel and other pixels in the space search window.
A second aspect of the present invention provides an image filtering apparatus before encoding, the apparatus comprising:
the weight calculation module is used for determining the spatial domain weight between the pixel and other pixels in a spatial search window aiming at each pixel in the current frame and acquiring the time domain weight between the pixel and other pixels except the pixel position in the temporal search window of the previous frame; the spatial search window is a window which takes the pixel as the center in the current frame, the temporal search window is a window which takes the pixel position as the center in the previous frame, and the previous frame is a coded video frame;
and the filtering module is used for filtering the pixel according to other pixels except the pixel position in the time search window, the time domain weight between the pixel and other pixels except the pixel position in the time search window, other pixels in the space search window and the space domain weight between the pixel and other pixels in the space search window.
Based on the image filtering method and device before encoding in the first aspect and the second aspect, the invention has the following beneficial effects:
the method comprises the steps of introducing surrounding pixel information (namely time domain similarity) of a previous filtered frame through recursion while considering surrounding pixel information (namely space domain similarity) of a pixel to be filtered in a current frame, and filtering the pixel to be filtered in the current frame by utilizing the surrounding pixel information (namely time domain similarity) of the previous filtered frame, so that the effect of reducing blocking effect can be achieved under the condition of insufficient code rate, and the perception quality of a video can be improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention, and are not to be construed as limiting the invention. In the drawings:
FIG. 1 is a flow chart illustrating an embodiment of a method for image filtering before encoding according to an exemplary embodiment of the present invention;
FIG. 2 is a diagram illustrating filtering of a pixel i in a current frame according to an exemplary embodiment of the present invention;
FIG. 3 is a diagram comparing encoded images obtained without and with filtering, according to an exemplary embodiment of the present invention;
FIG. 4 is a diagram illustrating a hardware configuration of an electronic device in accordance with an exemplary embodiment of the present invention;
fig. 5 is a schematic structural diagram of an image filtering apparatus before encoding according to an exemplary embodiment of the present invention.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present invention. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the invention, as detailed in the appended claims.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in this specification and the appended claims, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items.
It is to be understood that although the terms first, second, third, etc. may be used herein to describe various information, this information should not be limited by these terms. These terms are only used to distinguish one type of information from another. For example, first information may also be referred to as second information, and similarly, second information may also be referred to as first information, without departing from the scope of the present invention. The word "if" as used herein may be interpreted as "upon" or "when" or "in response to determining", depending on the context.
Fig. 1 is a flowchart illustrating an embodiment of a method for filtering an image before encoding according to an exemplary embodiment of the present invention, where the method for filtering an image before encoding can be applied to any electronic device (e.g., a camera, a PC, a server, etc.), and in this embodiment, the method is directed to an image filtering preprocessing before encoding, and after the filtering preprocessing, the filtered image is input to an encoding tool for encoding, so as to generate a better encoding effect. As shown in fig. 1, the image filtering method before encoding includes the steps of:
step 101: and aiming at each pixel in the current frame, determining the spatial domain weight between the pixel and other pixels in the spatial search window, and acquiring the temporal domain weight between the pixel and other pixels except the pixel position in the temporal search window of the previous frame.
In this embodiment, when filtering the pixels in the current frame, the similarity information in the spatial domain alone is not sufficient to describe perceptual similarity. Therefore, in addition to the spatial-domain similarity information, the temporal-domain similarity information of the previous frame (i.e., the frame that has already been filtered and encoded) is used recursively to filter the pixels in the current frame, so as to obtain a better encoding effect.
The spatial search window refers to a window centered on a current pixel to be filtered in a current frame, the temporal search window refers to a window centered on a current pixel to be filtered in a previous frame, and the previous frame refers to a filtered and encoded video frame.
It should be noted that the size of the spatial search window located in the current frame is equal to the size of the temporal search window located in the previous frame.
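As a concrete illustration of the two co-located, equal-sized windows, they can be sliced out of the current and previous frames with plain array indexing. This is a hypothetical sketch, not from the patent text: the function name, the grayscale numpy frames, and the border clipping are all illustrative assumptions.

```python
import numpy as np

def search_windows(cur_frame, prev_frame, y, x, r):
    """Return the spatial window (current frame) and the co-located
    temporal window (previous frame), both centered on position (y, x).

    Both windows have the same (2r+1) x (2r+1) size, matching the
    requirement that the spatial and temporal search windows be equal.
    """
    h, w = cur_frame.shape
    # Clip so the window stays inside the frame near borders.
    y0, y1 = max(0, y - r), min(h, y + r + 1)
    x0, x1 = max(0, x - r), min(w, x + r + 1)
    spatial = cur_frame[y0:y1, x0:x1]
    temporal = prev_frame[y0:y1, x0:x1]  # same positions, previous frame
    return spatial, temporal
```

Because the temporal window is taken at the same pixel coordinates rather than motion-compensated positions, the two slices always align one-to-one.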
In an embodiment, the calculation process for the spatial domain weight may include the following steps:
step 11: according to the pixel contained in the space search window, a first space distance measurement between the pixel and other pixels in the space search window is obtained.
Here the first spatial distance metric is the accumulated sum of squared block-level pixel differences between the pixel to be filtered and each other pixel in the spatial search window. The smaller the first spatial distance metric, the more similar the two blocks.
Referring to fig. 2, the pixel i to be filtered is located at the center of the spatial search window of the current frame. For each other pixel j in the spatial search window, the block centered on pixel i and the block centered on pixel j both have size d × d, i.e., the size of the filtering kernel is d × d. The first spatial distance metric ||DV_k(i, j)||² is calculated as:

||DV_k(i, j)||² = Σ_{z∈Θ} (I_k(i + z) − I_k(j + z))²    (Equation 1)

where Θ is the set of offsets covering the d × d filtering kernel, and each term (I_k(i + z) − I_k(j + z))² is the squared difference between the pixels at corresponding positions in the block centered on pixel i and the block centered on pixel j.
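The accumulated sum of squared block differences can be sketched as follows; the function name and the numpy-based framing are illustrative assumptions, and d is assumed odd so the block is centered on the pixel. The same function also covers the cross-frame case used later, by passing two different frames.

```python
import numpy as np

def patch_ssd(frame_a, pa, frame_b, pb, d):
    """Accumulated sum of squared differences between the d x d block
    centered on pixel pa in frame_a and the d x d block centered on
    pixel pb in frame_b (within-frame when frame_a is frame_b,
    cross-frame otherwise)."""
    r = d // 2
    (ya, xa), (yb, xb) = pa, pb
    block_a = frame_a[ya - r:ya + r + 1, xa - r:xa + r + 1].astype(float)
    block_b = frame_b[yb - r:yb + r + 1, xb - r:xb + r + 1].astype(float)
    return float(np.sum((block_a - block_b) ** 2))
```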
Step 12: and acquiring the spatial filtering strength according to the size of the spatial search window.
Wherein the spatial filtering strength is an adjustment parameter of the first spatial distance measure for controlling the attenuation of the weights relative to the first spatial distance measure.
In the present embodiment, the spatial filtering strength σ_d² is related to the size d of the filtering kernel. Assuming 2σ_d = d + 1, σ_d is calculated as:

σ_d = (d + 1) / 2    (Equation 2)
step 13: a just noticeable distortion value for each pixel in the spatial search window is obtained.
The just noticeable distortion value is the JND (Just Noticeable Distortion) value; distortion below the JND value cannot be perceived by the human eye.
In an example, in the process of calculating the JND value of a certain pixel, the average luminance of a preset region including the pixel may be obtained, the luminance adaptive factor of the pixel may be calculated by using the average luminance, the luminance contrast of the pixel in the preset region and the number of gradient direction types of the pixel in the current frame may be obtained, the visual occlusion factor of the pixel may be calculated by using the luminance contrast and the number, and the JND value of the pixel may be obtained according to the luminance adaptive factor and the visual occlusion factor.
Continuing with fig. 2, take the JND value of pixel i as an example. The luminance adaptive factor L_A(i) of pixel i is calculated from the average brightness B(i) of the preset region containing pixel i:

[Equation 3: the formula is given only as an image in the source and is not reproduced here]

The visual occlusion factor V_M(i) of pixel i is calculated as:

[Equation 4: the formula is given only as an image in the source and is not reproduced here]

where Lc is the brightness contrast of pixel i in the preset region, and N is the number of gradient direction types of pixel i in the current frame.
From these, the JND value JND(i) of pixel i is calculated as:

JND(i) = L_A(i) + V_M(i) − 0.3 · min(L_A(i), V_M(i))    (Equation 5)
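The combination in Equation 5 is simple enough to transcribe directly; the sample L_A and V_M inputs in the assertions below are made-up illustrative values, not from the patent.

```python
def jnd_value(l_a, v_m):
    """JND(i) = L_A(i) + V_M(i) - 0.3 * min(L_A(i), V_M(i)) (Equation 5).

    The 0.3 * min(...) term discounts the overlap between the luminance
    adaptation effect and the visual occlusion effect, so the combined
    JND is less than the plain sum of the two factors.
    """
    return l_a + v_m - 0.3 * min(l_a, v_m)
```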
Step 14: and acquiring a first perception distance measurement between the pixel and other pixels in the space search window according to the JND value of each pixel in the space search window.
In practical applications, the spatial distance metric alone does not fully reflect the perceptual attributes of the content, so a perceptual distance metric is considered in addition to the spatial distance metric, in order to better describe the content's perceptual features.
The first perceptual distance metric is the accumulated sum of squared block-level JND value differences between the pixel to be filtered and each other pixel in the spatial search window. The smaller the perceptual distance metric, the more similar the two blocks.
With continued reference to fig. 2, the first perceptual distance metric ||DJ_k(i, j)||² is calculated as:

||DJ_k(i, j)||² = Σ_{z∈Θ} (JND_k(i + z) − JND_k(j + z))²    (Equation 6)

where each term (JND_k(i + z) − JND_k(j + z))² is the squared difference between the JND values of the pixels at corresponding positions in the block centered on pixel i and the block centered on pixel j.
Step 15: the perceptual filter strength of the pixel is determined from the quantization parameter used to encode the previous frame.
In an example, a JND variance value of a JND value of a pixel included in a preset block centered on the pixel may be obtained, and then a perceptual filtering strength of the pixel may be determined according to the quantization parameter and the JND variance value.
Specifically, the preset block is the filtering kernel of the pixel, and the JND variance value JND_var of the filtering kernel is calculated as:

JND_var(i) = (1/d²) Σ_{z∈Θ} (JND(i + z) − μ_JND(i))²    (Equation 7)

where JND(i + z) is the JND value of each pixel in the filtering kernel centered on pixel i, and μ_JND(i) = (1/d²) Σ_{z∈Θ} JND(i + z) is the average of the JND values in that kernel.
The perceptual filtering strength σ_p² of pixel i is calculated as:

σ_p²(i) = m · JND_var(i) + n · max(0, QP − QP_Th)    (Equation 8)

where m and n are empirical normalization parameters, QP is the quantization parameter used for the previous frame, and QP_Th is the threshold of QP. (The source gives Equation 8 only as an image; the additive form above is a reconstruction consistent with the behavior described next.)
As can be seen from Equation 8, when the code rate is insufficient, the QP used by the previous frame may exceed the threshold QP_Th, which means that a higher perceptual filtering strength is required to reduce frame complexity. When the code rate is sufficient, the QP used by the previous frame stays below the threshold; in that case max(0, QP − QP_Th) takes 0, the perceptual filtering strength is determined by JND_var, and invisible high-frequency information in the current frame is removed.
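The patent gives the exact strength formula only as an image, so the sketch below assumes a simple additive combination of the JND variance term and the QP-overshoot term, which reproduces the behavior described above; the function name and the default values of m, n, and QP_Th are made up for illustration.

```python
def perceptual_strength(jnd_var, qp, qp_th=32.0, m=1.0, n=0.5):
    """Hedged sketch of the perceptual filtering strength.

    Assumption: strength = m * jnd_var + n * max(0, qp - qp_th).
    Below the QP threshold the strength is driven by the local JND
    variance alone; above it, the strength grows with the QP overshoot,
    i.e. stronger filtering when the code rate is insufficient.
    """
    return m * jnd_var + n * max(0.0, qp - qp_th)
```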
Therefore, in the process of filtering the current frame, for each pixel, the filtering strength is adaptively updated according to the content perception characteristic and the quantization parameter used by the previous frame, and under the condition of insufficient code rate, the sequence complexity is reduced by adjusting the filtering strength, so that the blocking effect after coding is reduced, and the perception quality of the video is improved.
Step 16: and obtaining the spatial domain weight between the pixel and other pixels in the spatial search window according to the first spatial distance measurement, the first perception distance measurement, the spatial filtering strength and the perception filtering strength between the pixel and other pixels in the spatial search window.
Based on the above description, the spatial-domain weight ω_k(i, j) between pixel i and each other pixel j in the spatial search window is calculated as:

ω_k(i, j) = exp(−||DV_k(i, j)||² / σ_d²) · exp(−||DJ_k(i, j)||² / σ_p²)    (Equation 9)

where ||DV_k(i, j)||² is the first spatial distance metric, σ_d² is the spatial filtering strength, ||DJ_k(i, j)||² is the first perceptual distance metric, and σ_p² is the perceptual filtering strength. (The source gives Equation 9 only as an image; the exponential-decay form above is a reconstruction consistent with the stated roles of σ_d² and σ_p² as attenuation controls.)
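Since the weight formula appears only as an image in the source, the sketch below assumes the standard bilateral-style exponential decay over the two distance metrics, with each strength acting as the attenuation control for its metric; the function name is illustrative.

```python
import math

def pixel_weight(dv_sq, dj_sq, sigma_d_sq, sigma_p_sq):
    """Weight between the pixel to be filtered and a candidate pixel.

    Assumed form: exponential decay in both the spatial distance metric
    (dv_sq, attenuated by sigma_d_sq) and the perceptual distance metric
    (dj_sq, attenuated by sigma_p_sq). The same form would serve for the
    spatial and the temporal weights, fed with the within-frame or
    cross-frame distances respectively.
    """
    return math.exp(-dv_sq / sigma_d_sq) * math.exp(-dj_sq / sigma_p_sq)
```

Identical blocks with identical JND maps get weight 1, and the weight decays toward 0 as either distance grows.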
In one embodiment, based on the same calculation principle as the spatial-domain weights, the calculation process for the temporal-domain weights includes the following steps:
step 21: and acquiring a second spatial distance measurement between the pixel and other pixels except the pixel position in the time search window according to the pixel contained in the time search window.
Here the second spatial distance metric is the accumulated sum of squared block-level pixel differences between the pixel to be filtered in the spatial search window and each pixel in the temporal search window.
Referring to fig. 2, the pixel i to be filtered is located at the center of the spatial search window of the current frame, and pixel j ranges over the temporal search window of the previous frame; the block centered on pixel i and the block centered on pixel j both have size d × d, i.e., the size of the filtering kernel is d × d. The second spatial distance metric ||DV_{k−1}(i, j)||² is calculated as:

||DV_{k−1}(i, j)||² = Σ_{z∈Θ} (I_k(i + z) − I_{k−1}(j + z))²    (Equation 10)

where each term (I_k(i + z) − I_{k−1}(j + z))² is the squared difference between the pixels at corresponding positions in the block centered on pixel i in the current frame and the block centered on pixel j in the previous frame.
Step 22: and acquiring the spatial filtering strength according to the size of the time search window.
Based on the above step 12, since the size of the temporal search window is consistent with the size of the spatial search window, the step 22 may also adopt the above equation 2.
Step 23: the JND value of each pixel in the temporal search window is obtained.
Based on the JND value calculation principle described in step 13, step 23 may also be calculated by using the above equations 3, 4, and 5.
Step 24: and acquiring a second perception distance measurement between the pixel and other pixels except the pixel position in the time search window according to the JND value of each pixel in the time search window.
The second perceptual distance metric is the accumulated sum of squared block-level JND value differences between the pixel to be filtered in the spatial search window and each pixel in the temporal search window.
With continued reference to fig. 2, the second perceptual distance metric ||DJ_{k−1}(i, j)||² is calculated as:

||DJ_{k−1}(i, j)||² = Σ_{z∈Θ} (JND_k(i + z) − JND_{k−1}(j + z))²    (Equation 11)

where each term is the squared difference between the JND values of the pixels at corresponding positions in the block centered on pixel i in the spatial search window and the block centered on pixel j in the temporal search window.
Step 25: the perceptual filter strength of the pixel is determined from the quantization parameter used to encode the previous frame.
The perceptual filtering strength used in step 25 is the same as that computed in step 15.
Step 26: and obtaining the time domain weight between the pixel and other pixels except the pixel position in the time search window according to the second spatial distance measurement, the second perception distance measurement, the spatial filtering strength and the perception filtering strength between the pixel and other pixels except the pixel position in the time search window.
Based on the above description, the temporal-domain weight ω_{k−1}(i, j) between pixel i and each pixel j in the temporal search window is calculated as:

ω_{k−1}(i, j) = exp(−||DV_{k−1}(i, j)||² / σ_d²) · exp(−||DJ_{k−1}(i, j)||² / σ_p²)    (Equation 12)

where ||DV_{k−1}(i, j)||² is the second spatial distance metric, σ_d² is the spatial filtering strength, ||DJ_{k−1}(i, j)||² is the second perceptual distance metric, and σ_p² is the perceptual filtering strength. (The source gives this formula only as an image; the exponential-decay form above is a reconstruction.)
It should be noted that the first perceptual distance metric described above refers to a JND similarity distance metric between the pixel to be filtered and other pixels in the current frame, and the second perceptual distance metric refers to a JND similarity distance metric between the pixel to be filtered and other pixels in the previous frame.
The first spatial distance measure refers to a pixel similarity distance measure between the pixel to be filtered and other pixels in the current frame, and the second spatial distance measure refers to a pixel similarity distance measure between the pixel to be filtered and other pixels in the previous frame.
Step 102: and filtering the pixel according to other pixels except the pixel position in the time search window, the time domain weight between the pixel and other pixels except the pixel position in the time search window, other pixels in the space search window and the space domain weight between the pixel and other pixels in the space search window.
Based on the description of step 101 above, and referring to fig. 2, the filtered value f_k(i) of pixel i in the current frame is calculated as:

f_k(i) = (1 / W_k) · [ Σ_{j∈ε(k−1)} ω_{k−1}(i, j) · f_{k−1}(j) + Σ_{j∈Ω(k)} ω_k(i, j) · I_k(j) ]    (Equation 13)

where ω_{k−1}(i, j) is the temporal-domain weight between pixel i and pixel j in the previous frame, f_{k−1}(j) is the encoded pixel value of pixel j in the previous frame, ε(k−1) is the temporal search window, ω_k(i, j) is the spatial-domain weight between pixel i and pixel j in the current frame, I_k(j) is the pixel value of pixel j in the current frame, Ω(k) is the spatial search window, and W_k is the sum of the temporal-domain and spatial-domain weights:

W_k = Σ_{j∈ε(k−1)} ω_{k−1}(i, j) + Σ_{j∈Ω(k)} ω_k(i, j)    (Equation 14)
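The normalized combination of the temporal and spatial weighted sums can be sketched as follows; passing the weights and pixel values as flat lists is an illustrative simplification of iterating over the two search windows.

```python
def filter_pixel(temporal_weights, prev_filtered_vals,
                 spatial_weights, cur_vals):
    """Filtered value of pixel i from the two weighted sums.

    temporal_weights / prev_filtered_vals: the temporal-domain weights
    and the pixel values over the temporal search window of the
    previous (already filtered and encoded) frame.
    spatial_weights / cur_vals: the spatial-domain weights and the
    pixel values over the spatial search window of the current frame.
    For the first frame, pass empty temporal lists and this reduces to
    the spatial-only case.
    """
    num = (sum(w * f for w, f in zip(temporal_weights, prev_filtered_vals))
           + sum(w * v for w, v in zip(spatial_weights, cur_vals)))
    # Normalizing constant: sum of temporal and spatial weights.
    w_total = sum(temporal_weights) + sum(spatial_weights)
    return num / w_total
```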
it should be noted that, when the current frame is the first frame, the filtering process of the pixels in the current frame only considers the spatial domain weight, and the calculation formula is as follows:
Figure BDA0002901255930000123
through experimental comparison, as shown in fig. 3, a graph (a) shows a coded image obtained without using the filtering method of the present embodiment, where the local blocking effect indicated by the arrow is relatively severe, and a graph (b) shows a coded image obtained using the filtering method of the present embodiment, where the local blocking effect indicated by the arrow is greatly reduced.
So far, the filtering process shown in fig. 1 is completed, and while taking into account the surrounding pixel information (i.e., spatial domain similarity) of the pixel to be filtered in the current frame, the surrounding pixel information (i.e., temporal domain similarity) of the previous filtered frame is introduced through recursion, and the two items of information are utilized to filter the pixel to be filtered in the current frame, so that even under the condition of insufficient code rate, the effect of reducing blocking effect can be achieved, and the perceptual quality of the video can be improved.
Fig. 4 is a hardware block diagram of an electronic device according to an exemplary embodiment of the present invention, the electronic device including: a communication interface 401, a processor 402, a machine-readable storage medium 403, and a bus 404; wherein the communication interface 401, the processor 402 and the machine-readable storage medium 403 communicate with each other via a bus 404. The processor 402 may execute the pre-encoding image filtering method described above by reading and executing machine executable instructions in the machine readable storage medium 403 corresponding to the control logic of the pre-encoding image filtering method, and the specific content of the method is described in the above embodiments, which will not be described herein again.
The machine-readable storage medium 403 referred to in this disclosure may be any electronic, magnetic, optical, or other physical storage device that can contain or store information such as executable instructions, data, and the like. For example, the machine-readable storage medium may be: volatile memory, non-volatile memory, or similar storage media. In particular, the machine-readable storage medium 403 may be a RAM (Random Access Memory), a flash Memory, a storage drive (e.g., a hard disk drive), any type of storage disk (e.g., a compact disk, a DVD, etc.), or similar storage medium, or a combination thereof.
The invention also provides an embodiment of an image filtering device before coding, corresponding to the embodiment of the image filtering method before coding.
Fig. 5 is a schematic structural diagram of an embodiment of a pre-encoding image filtering apparatus according to an exemplary embodiment of the present invention. The pre-encoding image filtering apparatus may be applied to any electronic device. As shown in fig. 5, the apparatus includes:
a weight calculation module 510, configured to, for each pixel in the current frame, determine spatial-domain weights between the pixel and the other pixels in a spatial search window, and obtain temporal-domain weights between the pixel and the pixels, other than the co-located position, in a temporal search window of the previous frame; the spatial search window is a window centered on the pixel in the current frame, the temporal search window is a window centered on the co-located position in the previous frame, and the previous frame is an already-encoded video frame;
a filtering module 520, configured to filter the pixel according to the pixels other than the co-located position in the temporal search window, the temporal-domain weights between the pixel and those pixels, the other pixels in the spatial search window, and the spatial-domain weights between the pixel and those pixels.
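Read literally, the combination performed by the filtering module 520 amounts to a normalized weighted average over both windows. A minimal NumPy sketch under that reading (the normalization convention is an assumption; the embodiment only lists the inputs to the filter, not the combining formula):

```python
import numpy as np

def filter_pixel(spatial_pixels, spatial_w, temporal_pixels, temporal_w):
    """Normalized spatio-temporal weighted average for one pixel.

    spatial_pixels / temporal_pixels: 1-D arrays of neighbour intensities
    from the spatial and temporal search windows; spatial_w / temporal_w:
    the matching non-negative weights (hypothetical layout -- the patent
    does not fix how the windows are flattened).
    """
    num = np.dot(spatial_w, spatial_pixels) + np.dot(temporal_w, temporal_pixels)
    den = spatial_w.sum() + temporal_w.sum()
    return num / den
```

With two spatial neighbours of weight 1 and one temporal neighbour of weight 2, the filtered value is simply the weight-normalized mean of the three samples.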
In an optional implementation, the weight calculation module 510 is specifically configured to, in determining the spatial-domain weights between the pixel and the other pixels in the spatial search window: obtain a first spatial distance metric between the pixel and each other pixel in the spatial search window according to the pixels contained in the window; obtain a spatial filtering strength according to the size of the spatial search window; obtain a JND value for each pixel in the spatial search window; obtain a first perceptual distance metric between the pixel and each other pixel in the spatial search window according to those JND values; determine a perceptual filtering strength for the pixel according to the quantization parameter used for encoding the previous frame; and obtain the spatial-domain weight between the pixel and each other pixel in the spatial search window from the first spatial distance metric, the first perceptual distance metric, the spatial filtering strength, and the perceptual filtering strength.
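As one concrete reading of these steps, a bilateral-style kernel can combine the spatial distance metric with the JND-based perceptual distance metric. The Gaussian form and the way the two filtering strengths enter are assumptions; the embodiment names the four quantities but not the formula:

```python
import numpy as np

def spatial_weights(jnd, center_yx, h_spatial, h_perceptual):
    """Spatial-domain weights between the centre pixel and every other
    pixel of the spatial search window.

    jnd: 2-D array of per-pixel JND values over the window;
    h_spatial: spatial filtering strength (derived from the window size);
    h_perceptual: perceptual filtering strength (derived from QP and JND
    variance). The Gaussian kernel shape is an illustrative assumption.
    """
    cy, cx = center_yx
    ys, xs = np.indices(jnd.shape)
    d_space2 = (ys - cy) ** 2 + (xs - cx) ** 2        # first spatial distance metric
    d_perc2 = (jnd - jnd[cy, cx]) ** 2                # first perceptual distance metric
    w = np.exp(-d_space2 / (2.0 * h_spatial ** 2)
               - d_perc2 / (2.0 * h_perceptual ** 2))
    w[cy, cx] = 0.0                                   # "other pixels" excludes the centre
    return w
```

With a flat JND map the perceptual term vanishes and the weights decay purely with spatial distance, which matches the intuition that perceptual weighting only reshapes the kernel where JND varies.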
In an optional implementation, the weight calculation module 510 is specifically configured to, in determining the perceptual filtering strength of the pixel according to the quantization parameter used for encoding the previous frame: obtain the variance of the JND values of the pixels contained in a preset block centered on the pixel; and determine the perceptual filtering strength of the pixel according to the quantization parameter and that JND variance value.
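One way this dependence might look in code; the multiplicative form and the constant k are purely illustrative, since the text only states that the QP and the JND variance jointly determine the strength:

```python
import numpy as np

def perceptual_strength(qp, jnd_block, k=0.5):
    """Perceptual filtering strength from the previous frame's QP and the
    variance of JND values in a preset block centred on the pixel.
    Coarser quantization (larger QP) and busier texture (larger JND
    variance) both call for stronger filtering; k is an assumed scale.
    """
    jnd_var = float(np.var(jnd_block))    # JND variance value of the preset block
    return k * qp * (1.0 + jnd_var)
```

A perfectly flat block has zero JND variance, so the strength reduces to the QP-driven baseline.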
In an optional implementation, the weight calculation module 510 is specifically configured to, in obtaining the JND value of each pixel in the spatial search window: for each pixel in the window, obtain the average luminance of a preset region containing the pixel and compute a luminance adaptation factor for the pixel from that average luminance; obtain the luminance contrast of the pixel within the preset region and the number of gradient-direction types of the pixel in the current frame, and compute a visual masking factor for the pixel from the contrast and that number; and obtain the JND value of the pixel from the luminance adaptation factor and the visual masking factor.
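Pixel-domain JND models in the literature typically combine exactly these two factors. A sketch in that spirit, where the piecewise luminance curve, the masking formula, and the constants a, b, c are assumptions borrowed from common JND models rather than from this patent:

```python
import numpy as np

def jnd_value(avg_luma, luma_contrast, n_grad_dirs, a=17.0, b=3.0, c=0.12):
    """JND of one pixel from a luminance adaptation factor and a visual
    masking factor. All constants and curve shapes are illustrative.
    """
    # Luminance adaptation: more distortion is tolerated in dark and very
    # bright regions than at mid grey (assumed piecewise form, 8-bit luma).
    if avg_luma <= 127:
        la = a * (1.0 - np.sqrt(avg_luma / 127.0)) + b
    else:
        la = c * (avg_luma - 127.0) + b
    # Visual masking: grows with local contrast, damped when many gradient
    # directions suggest ordered structure the eye tracks easily.
    vm = luma_contrast / (1.0 + n_grad_dirs)
    return max(la, vm)   # assumed nonlinear combination of the two factors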
In an optional implementation, the weight calculation module 510 is specifically configured to, in obtaining the temporal-domain weights between the pixel and the pixels, other than the co-located position, in the temporal search window of the previous frame: obtain a second spatial distance metric between the pixel and each such pixel according to the pixels contained in the temporal search window; obtain a spatial filtering strength according to the size of the temporal search window; obtain a JND value for each pixel in the temporal search window; obtain a second perceptual distance metric between the pixel and each such pixel according to those JND values; determine a perceptual filtering strength for the pixel according to the quantization parameter used for encoding the previous frame; and obtain the temporal-domain weight between the pixel and each such pixel from the second spatial distance metric, the second perceptual distance metric, the spatial filtering strength, and the perceptual filtering strength.
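The temporal branch mirrors the spatial one, with the co-located pixel excluded. Under the same assumed Gaussian form (again an illustrative kernel, not the patent's formula):

```python
import numpy as np

def temporal_weights(jnd_prev, center_yx, jnd_center, h_spatial, h_perceptual):
    """Temporal-domain weights against the previous frame's temporal
    search window. jnd_prev holds the JND values of the previous-frame
    window, jnd_center the JND of the current pixel; the co-located
    position is excluded, as the claims require.
    """
    cy, cx = center_yx
    ys, xs = np.indices(jnd_prev.shape)
    d_space2 = (ys - cy) ** 2 + (xs - cx) ** 2        # second spatial distance metric
    d_perc2 = (jnd_prev - jnd_center) ** 2            # second perceptual distance metric
    w = np.exp(-d_space2 / (2.0 * h_spatial ** 2)
               - d_perc2 / (2.0 * h_perceptual ** 2))
    w[cy, cx] = 0.0                                   # exclude the co-located pixel
    return w
```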
The implementation of the functions and roles of each unit in the apparatus is detailed in the implementation of the corresponding steps in the method and is not repeated here.
Since the apparatus embodiments substantially correspond to the method embodiments, reference may be made to the relevant parts of the description of the method embodiments. The apparatus embodiments described above are merely illustrative: units described as separate parts may or may not be physically separate, and parts shown as units may or may not be physical units; they may be located in one place or distributed over a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the scheme of the invention. One of ordinary skill in the art can understand and implement the invention without inventive effort.
Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This invention is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.
It should also be noted that the terms "comprises," "comprising," and any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element preceded by the phrase "comprising a …" does not exclude the presence of additional identical elements in the process, method, article, or apparatus that comprises the element.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (10)

1. A method of filtering an image prior to encoding, the method comprising:
for each pixel in a current frame, determining spatial-domain weights between the pixel and the other pixels in a spatial search window, and obtaining temporal-domain weights between the pixel and the pixels, other than the co-located position, in a temporal search window of a previous frame; wherein the spatial search window is a window centered on the pixel in the current frame, the temporal search window is a window centered on the co-located position in the previous frame, and the previous frame is an already-encoded video frame; and
filtering the pixel according to the pixels other than the co-located position in the temporal search window, the temporal-domain weights between the pixel and those pixels, the other pixels in the spatial search window, and the spatial-domain weights between the pixel and those pixels.
2. The method of claim 1, wherein determining the spatial-domain weights between the pixel and the other pixels in the spatial search window comprises:
obtaining, according to the pixels contained in the spatial search window, a first spatial distance metric between the pixel and each other pixel in the spatial search window;
obtaining a spatial filtering strength according to the size of the spatial search window;
obtaining a Just Noticeable Distortion (JND) value for each pixel in the spatial search window;
obtaining, according to the JND values of the pixels in the spatial search window, a first perceptual distance metric between the pixel and each other pixel in the spatial search window;
determining a perceptual filtering strength of the pixel according to a quantization parameter used for encoding the previous frame; and
obtaining the spatial-domain weight between the pixel and each other pixel in the spatial search window according to the first spatial distance metric, the first perceptual distance metric, the spatial filtering strength, and the perceptual filtering strength.
3. The method of claim 2, wherein determining the perceptual filtering strength of the pixel according to the quantization parameter used for encoding the previous frame comprises:
obtaining a JND variance value of the JND values of the pixels contained in a preset block centered on the pixel; and
determining the perceptual filtering strength of the pixel according to the quantization parameter and the JND variance value.
4. The method of claim 2, wherein obtaining the Just Noticeable Distortion (JND) value for each pixel in the spatial search window comprises:
for each pixel in the spatial search window, obtaining an average luminance of a preset region containing the pixel, and computing a luminance adaptation factor of the pixel by using the average luminance;
obtaining a luminance contrast of the pixel within the preset region and the number of gradient-direction types of the pixel in the current frame, and computing a visual masking factor of the pixel by using the luminance contrast and the number; and
obtaining the JND value of the pixel according to the luminance adaptation factor and the visual masking factor.
5. The method of claim 1, wherein obtaining the temporal-domain weights between the pixel and the pixels, other than the co-located position, in the temporal search window of the previous frame comprises:
obtaining, according to the pixels contained in the temporal search window, a second spatial distance metric between the pixel and each pixel other than the co-located position in the temporal search window;
obtaining a spatial filtering strength according to the size of the temporal search window;
obtaining a JND value for each pixel in the temporal search window;
obtaining, according to the JND values of the pixels in the temporal search window, a second perceptual distance metric between the pixel and each pixel other than the co-located position in the temporal search window;
determining a perceptual filtering strength of the pixel according to a quantization parameter used for encoding the previous frame; and
obtaining the temporal-domain weight between the pixel and each pixel other than the co-located position in the temporal search window according to the second spatial distance metric, the second perceptual distance metric, the spatial filtering strength, and the perceptual filtering strength.
6. An apparatus for filtering an image before encoding, the apparatus comprising:
a weight calculation module, configured to, for each pixel in a current frame, determine spatial-domain weights between the pixel and the other pixels in a spatial search window, and obtain temporal-domain weights between the pixel and the pixels, other than the co-located position, in a temporal search window of a previous frame; wherein the spatial search window is a window centered on the pixel in the current frame, the temporal search window is a window centered on the co-located position in the previous frame, and the previous frame is an already-encoded video frame; and
a filtering module, configured to filter the pixel according to the pixels other than the co-located position in the temporal search window, the temporal-domain weights between the pixel and those pixels, the other pixels in the spatial search window, and the spatial-domain weights between the pixel and those pixels.
7. The apparatus according to claim 6, wherein, in determining the spatial-domain weights between the pixel and the other pixels in the spatial search window, the weight calculation module is specifically configured to: obtain, according to the pixels contained in the spatial search window, a first spatial distance metric between the pixel and each other pixel in the spatial search window; obtain a spatial filtering strength according to the size of the spatial search window; obtain a Just Noticeable Distortion (JND) value for each pixel in the spatial search window; obtain, according to the JND values of the pixels in the spatial search window, a first perceptual distance metric between the pixel and each other pixel in the spatial search window; determine a perceptual filtering strength of the pixel according to a quantization parameter used for encoding the previous frame; and obtain the spatial-domain weight between the pixel and each other pixel in the spatial search window according to the first spatial distance metric, the first perceptual distance metric, the spatial filtering strength, and the perceptual filtering strength.
8. The apparatus according to claim 7, wherein, in determining the perceptual filtering strength of the pixel according to the quantization parameter used for encoding the previous frame, the weight calculation module is specifically configured to: obtain a JND variance value of the JND values of the pixels contained in a preset block centered on the pixel; and determine the perceptual filtering strength of the pixel according to the quantization parameter and the JND variance value.
9. The apparatus according to claim 7, wherein, in obtaining the Just Noticeable Distortion (JND) value of each pixel in the spatial search window, the weight calculation module is specifically configured to: for each pixel in the spatial search window, obtain an average luminance of a preset region containing the pixel, and compute a luminance adaptation factor of the pixel by using the average luminance; obtain a luminance contrast of the pixel within the preset region and the number of gradient-direction types of the pixel in the current frame, and compute a visual masking factor of the pixel by using the luminance contrast and the number; and obtain the JND value of the pixel according to the luminance adaptation factor and the visual masking factor.
10. The apparatus according to claim 6, wherein, in obtaining the temporal-domain weights between the pixel and the pixels, other than the co-located position, in the temporal search window of the previous frame, the weight calculation module is specifically configured to: obtain, according to the pixels contained in the temporal search window, a second spatial distance metric between the pixel and each pixel other than the co-located position in the temporal search window; obtain a spatial filtering strength according to the size of the temporal search window; obtain a JND value for each pixel in the temporal search window; obtain, according to the JND values of the pixels in the temporal search window, a second perceptual distance metric between the pixel and each pixel other than the co-located position in the temporal search window; determine a perceptual filtering strength of the pixel according to a quantization parameter used for encoding the previous frame; and obtain the temporal-domain weight between the pixel and each pixel other than the co-located position in the temporal search window according to the second spatial distance metric, the second perceptual distance metric, the spatial filtering strength, and the perceptual filtering strength.
CN202110057841.6A 2021-01-15 2021-01-15 Image filtering method and device before coding Active CN113055669B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110057841.6A CN113055669B (en) 2021-01-15 2021-01-15 Image filtering method and device before coding


Publications (2)

Publication Number Publication Date
CN113055669A true CN113055669A (en) 2021-06-29
CN113055669B CN113055669B (en) 2023-01-17

Family

ID=76508484

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110057841.6A Active CN113055669B (en) 2021-01-15 2021-01-15 Image filtering method and device before coding


Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6281942B1 (en) * 1997-08-11 2001-08-28 Microsoft Corporation Spatial and temporal filtering mechanism for digital motion video signals
US20030039310A1 (en) * 2001-08-14 2003-02-27 General Instrument Corporation Noise reduction pre-processor for digital video using previously generated motion vectors and adaptive spatial filtering
CN1665298A (en) * 2003-12-11 2005-09-07 三星电子株式会社 Method of removing noise from digital moving picture data
US20070058716A1 (en) * 2005-09-09 2007-03-15 Broadcast International, Inc. Bit-rate reduction for multimedia data streams
US20170061582A1 (en) * 2015-08-31 2017-03-02 Apple Inc. Temporal filtering of independent color channels in image data
US20180220129A1 (en) * 2017-01-30 2018-08-02 Intel Corporation Motion, coding, and application aware temporal and spatial filtering for video pre-processing
US20200084460A1 (en) * 2019-09-27 2020-03-12 Intel Corporation Method and system of content-adaptive denoising for video coding


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant