CN113568571A

CN113568571A - Image de-duplication method based on residual error neural network

Info

Publication number: CN113568571A
Application number: CN202110722552.3A
Authority: CN
Inventors: 张跃宇; 徐跃; 苗雅文; 李雪; 李晖; 陈杰; 吕嘉宁; 马佳骥
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2021-06-28
Filing date: 2021-06-28
Publication date: 2021-10-29
Anticipated expiration: 2041-06-28
Also published as: CN113568571B

Abstract

The invention discloses an image de-duplication method based on a residual error neural network, which is applied to a server and comprises the following steps: receiving an uploading request; according to the first accurate matching data and the first perception hash characteristic value, similarity detection is carried out on the image to be processed in the database; when a first image matched with the image to be processed is detected in the database, comparing the image quality scores of the image to be processed and the first image; and if the image quality score of the image to be processed is higher than the image quality score of the first image, sending an uploading instruction to the client, and receiving the image to be processed uploaded by the client. According to the invention, the images with the same visual information can be judged as similar images by comparing the first accurate matching data with the first perception hash characteristic value, and the duplicate removal precision is good. In addition, by comparing the image quality of the image to be processed with that of the first image and deleting the redundant image with poor quality, a good deduplication effect can be obtained and the user experience can be improved.

Description

Image de-duplication method based on residual error neural network

Technical Field

The invention belongs to the field of image processing, and particularly relates to an image de-duplication method based on a residual error neural network.

Background

Images contain rich and intuitive information, and a large number of images are needed to deliver information to users in the fields of social interaction, shopping, tourism and the like. With the increasing number of images, the local storage overhead is increased, and therefore more and more users upload images from the local client to the cloud server for storage.

In order to avoid unnecessary increase of the storage capacity of the cloud server caused by uploading similar or identical images again, how to realize secure deduplication of images from a large amount of data becomes a research hotspot in the field. At present, in the related art, a Client-based Multimedia Data Deduplication method (CSPD) is adopted to achieve image Deduplication, and the method includes three stages of similar image detection, ownership authentication, and quality comparison. However, in the related art, similarity discrimination is performed from two angles of cryptographic hash and discrete cosine transform, so that only an original image and an image with one distortion change can be recognized, and when the original image is subjected to superposition of two or more kinds of distortions, the recognition accuracy of the similar image is reduced, and further the image deduplication precision is reduced.

Disclosure of Invention

In order to solve the above problems in the prior art, the present invention provides an image deduplication method based on a residual error neural network. The technical problem to be solved by the invention is realized by the following technical scheme:

in a first aspect, the present invention provides an image deduplication method based on a residual error neural network, applied to a server, including:

receiving an uploading request from a client; the client side and the server are in communication connection with each other in advance, and the uploading request comprises first accurate matching data and a first perceptual hash characteristic value of the image to be processed;

according to the first accurate matching data and the first perception hash characteristic value, carrying out similarity detection on the image to be processed in a database; the database comprises a plurality of groups of pre-stored images, accurate matching data corresponding to the images and a perception hash characteristic value;

when a first image matching with an image to be processed is detected in the database, comparing the image quality scores of the image to be processed and the first image; and if the image quality score of the image to be processed is higher than the image quality score of the first image, sending an uploading instruction to a client, and receiving the image to be processed uploaded by the client.

In an embodiment of the present invention, the step of performing similarity detection on the to-be-processed image in a database according to the first exact match data and the first perceptual hash feature value includes:

detecting whether second exact match data equal to the first exact match data exist in the exact match data stored in the database;

if the first image exists, determining the image corresponding to the second accurate matching data as a first image; otherwise, calculating a first average absolute error distance between the first perceptual hash characteristic value and each perceptual hash characteristic value stored in the database;

and comparing the first average absolute error distance with a first preset threshold, and determining an image corresponding to the second hash characteristic value as a first image after determining the perceptual hash characteristic value of which the first average absolute error distance is smaller than the preset threshold as the second hash characteristic value.

In an embodiment of the present invention, when a first image matching an image to be processed is detected in the database, the step of comparing the image quality of the image to be processed with that of the first image is preceded by:

and performing ownership verification on the client.

In an embodiment of the present invention, the step of performing ownership verification on the client includes:

after a plurality of image blocks are divided in the first image, selecting a first image block from the first image, and preprocessing the first image block;

the preprocessed first image block and the first image are zoomed to the same size and mixed to obtain a first target image;

inputting the first target image to a first residual error neural network to obtain a perceptual hash characteristic value of the first target image;

generating an ownership verification request, the ownership verification request including network parameters of the first residual neural network;

sending the ownership verification request to a client so that the client can select a second image block from the image to be processed after dividing the image block into a plurality of image blocks, preprocessing the second image block, scaling the preprocessed second image block and the image to be processed to the same size, mixing the preprocessed second image block and the image to be processed to obtain a second target image, setting network parameters of a second residual error neural network according to the network parameters of the first residual error neural network, inputting the second target image to the second residual error neural network, and sending the obtained perceptual hash characteristic value of the second target image to the server;

and judging whether the client passes ownership verification or not according to the perceptual hash characteristic value of the first target image and the perceptual hash characteristic value of the second target image.

In an embodiment of the present invention, the step of determining whether the client passes the ownership verification according to the perceptual hash feature value of the first target image and the perceptual hash feature value of the second target image includes:

calculating a second average absolute error distance between the perceptual hash characteristic value of the first target image and the perceptual hash characteristic value of the second target image;

comparing the second average absolute error distance with a second preset threshold; if the second mean absolute error distance is smaller than a second preset threshold, the client passes ownership verification, otherwise, the client fails ownership verification;

and sending a verification result to the client.

In one embodiment of the invention, the pre-processing comprises a rotation and/or translation operation.

In an embodiment of the present invention, when a first image matching an image to be processed is detected in the database, the step of comparing the image quality scores of the image to be processed and the first image is followed by:

and if the image quality score of the image to be processed is lower than that of the first image, sending the link of the first image to the client.

In an embodiment of the present invention, when there is no first image matching the image to be processed in the database, after the step of performing similarity detection on the image to be processed in the database according to the first exact match data and the first perceptual hash feature value, the method further includes:

and after an uploading instruction is sent to the client, receiving the image to be processed uploaded by the client.

In an embodiment of the present invention, after the step of receiving the image to be processed uploaded by the client, the method further includes:

and storing the image to be processed, the first exact matching data and the first perception hash characteristic value into a database.

In a second aspect, the present invention further provides an image deduplication method based on a residual error neural network, which is applied to a client, and includes:

acquiring an image to be processed;

determining first exact match data of the image to be processed according to a Sha256 algorithm;

determining a first perception hash characteristic value of the image to be processed by utilizing a pre-trained second residual error neural network model;

sending an uploading request of the image to be processed to a server, so that the server performs similarity detection on the image to be processed in a database according to the first accurate matching data and the first perceptual hash characteristic value, and comparing the image quality scores of the image to be processed and a first image when the first image matched with the image to be processed is detected;

if the image quality score of the image to be processed is higher than the image quality score of the first image, receiving an uploading instruction, and uploading the image to be processed to the server; otherwise, a link to the first image is received.

Compared with the prior art, the invention has the beneficial effects that:

in the image deduplication method based on the residual error neural network, the server receives an uploading request from the client, and the uploading request comprises first accurate matching data and a first perceptual hash characteristic value of an image to be processed, so that the same image can be judged by comparing the first accurate matching data with the accurate matching data of the image stored in the database in advance, and the similar image can be judged by comparing the first perceptual hash characteristic value with the perceptual hash characteristic value of the image stored in the database in advance, so that the images with the same visual information can be judged to be similar, and the deduplication precision is good. In addition, when a first image which is the same as or similar to the image to be uploaded exists in the database, the image quality of the first image and the image quality of the second image are compared, and a redundant image with poor quality is deleted, so that good de-duplication effect is obtained, and user experience is improved.

The present invention will be described in further detail with reference to the accompanying drawings and examples.

Drawings

Fig. 1 is a schematic flowchart of an image deduplication method based on a residual neural network according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of an image deduplication method based on a residual error neural network according to an embodiment of the present invention;

fig. 3 is a schematic diagram of performing ownership verification on a server according to an embodiment of the present invention;

fig. 4 is a schematic diagram of interaction between a client and a server in an ownership verification process provided by an embodiment of the present invention;

fig. 5 is a schematic flowchart of another image deduplication method based on a residual neural network according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to specific examples, but the embodiments of the present invention are not limited thereto.

As shown in fig. 1, the image deduplication method based on the residual neural network provided in the embodiment of the present invention is applied to a server, and includes:

s101, receiving an uploading request from a client; the method comprises the steps that communication connection is established between a client and a server in advance, and an uploading request comprises first accurate matching data and a first perception hash characteristic value of an image to be processed;

s102, carrying out similarity detection on the image to be processed in a database according to the first accurate matching data and the first perception hash characteristic value; the database comprises a plurality of groups of pre-stored images, accurate matching data corresponding to the images and perceptual hash characteristic values;

s103, when a first image matched with the image to be processed is detected in the database, comparing the image quality scores of the image to be processed and the first image; and if the image quality score of the image to be processed is higher than the image quality score of the first image, sending an uploading instruction to the client, and receiving the image to be processed uploaded by the client.

In this embodiment, the server and the client establish a communication connection in advance, and the database of the server includes a plurality of sets of images stored in advance, exact matching data corresponding to the images, and perceptual hash feature values, where the images stored in advance may be uploaded to the server by the client. Optionally, the upload request sent by the client to the server includes a feature sequence of the image to be processed, where the feature sequence includes first exact match data and a first perceptual hash feature value. And after receiving the uploading request, the server searches in the database according to the first accurate matching data and the first perceptual hash characteristics, and detects whether an image similar to the image to be processed exists or not. It should be noted that the image similar to the image to be processed in this embodiment includes both an image having the same encoding method as the image to be processed and a similar image having the same semantic information after the image to be processed is subjected to operations such as rotation, filtering, and compression.

Optionally, the client determines first exact match data of the image to be processed by using a Sha256 algorithm, and determines a first perceptual hash feature value of the image to be processed by using a pre-trained second residual neural network model. In the embodiment, the residual neural network and the Hash technology are introduced into the similarity discrimination of the images, and the images with the same visual information can be judged to be similar, so that the robustness and the distinguishability of the image deduplication are improved.

Of course, in some other embodiments of the present invention, other algorithms may be used to calculate the first exact match data and the first perceptual hash feature value of the image to be processed, which is not limited by the present invention.

In the above step S103, when there is a first image similar to the image to be processed in the database, the image quality scores of the two images are further compared, wherein a higher image quality score indicates better image quality. When the image quality of the image to be processed is better than that of the first image, the image to be processed which is requested to be uploaded by the client side is better in quality, therefore, the server sends an uploading instruction to the client side and receives the image to be processed which is uploaded by the client side, the design method can delete the redundant image with poor quality, and therefore good de-duplication effect is achieved, and user experience is improved.

In addition, in the above step S103, if the image quality score of the image to be processed is lower than the image quality score of the first image, a link of the first image is transmitted to the client so that the user uses the first image according to the link download.

Of course, in this embodiment, if the server does not retrieve the first image, it indicates that the image that is the same as or similar to the image to be processed is not uploaded, so the server can send an upload instruction to the client and receive the image to be processed uploaded by the client.

Optionally, in the step S102, the step of performing similarity detection on the image to be processed in the database according to the first exact match data and the first perceptual hash feature value includes:

detecting whether second accurate matching data equal to the first accurate matching data exists in the accurate matching data stored in the database;

if the first image exists, determining the image corresponding to the second accurate matching data as the first image; otherwise, calculating a first average absolute error distance between the first perceptual hash characteristic value and each perceptual hash characteristic value stored in the database;

In this embodiment, after receiving the upload request, the server first detects whether there is second exact match data equal to the first exact match data in the exact match data stored in the database; if the first image exists, determining the image corresponding to the second accurate matching data as a first image, wherein the first image and the image to be processed are the same image; on the contrary, it means that the same image as the image to be processed does not exist in the server, at this time, a first average absolute error distance between the first perceptual hash feature value and each perceptual hash feature value stored in the database is further calculated, after the perceptual hash feature value of which the first average absolute error distance is smaller than a preset threshold value is determined as a second perceptual hash feature value, the image corresponding to the second perceptual hash feature value is determined as a first image, and the first image and the image to be processed are similar images.

Optionally, when the first image matching the image to be processed is detected in the database, the step of comparing the image quality of the image to be processed with that of the first image is preceded by the following step:

and carrying out ownership verification on the client.

In this embodiment, when the first image is stored in the server, the ownership of the client is verified. It should be understood that when deduplication is performed according to the feature sequence, if an attacker guesses or illegally acquires the feature sequence of the image to be processed, the attacker may obtain the use right by using the feature sequence and further acquire the first image, which results in data leakage, and therefore it is necessary to perform security authentication on the client.

Fig. 3 is a schematic diagram of performing ownership verification at a server according to an embodiment of the present invention, and fig. 4 is a schematic diagram of interaction between a client and a server in an ownership verification process according to an embodiment of the present invention.

Referring to fig. 3-4, the step of performing ownership verification on the client includes:

after a plurality of image blocks are divided in a first image, selecting the first image block from the first image, and preprocessing the first image block;

inputting a first target image to a first residual error neural network to obtain a perceptual hash characteristic value of the first target image;

sending an ownership verification request to a client so that the client can select a second image block from the image to be processed after dividing the image block into a plurality of image blocks, preprocessing the second image block, scaling the preprocessed second image block and the image to be processed to the same size, mixing the preprocessed second image block and the image to be processed to obtain a second target image, setting network parameters of a second residual error neural network according to the network parameters of a first residual error neural network, inputting the second target image to the second residual error neural network, and sending the obtained perceptual hash characteristic value of the second target image to a server;

and judging whether the client passes ownership verification according to the perceptual hash characteristic value of the first target image and the perceptual hash characteristic value of the second target image.

Specifically, as shown in fig. 3, in the process of performing ownership verification on the client, the server first divides the first image into a plurality of image blocks, and randomly selects a first image block from the image blocks; then, preprocessing the first image block, wherein the preprocessing comprises rotation and/or translation operation, and then scaling the preprocessed first image block; and finally, after the unprocessed first image is scaled to the size same as that of the first image block, the unprocessed first image and the first image block are mixed to generate a first target image. Further, as shown in fig. 4, after the server generates the first target image, the first target image is input to the first residual neural network to obtain the perceptual hash feature value of the first target image.

It can be understood that in this embodiment, the ownership verification of the client is performed by comparing the first target image generated by the server and the second target image generated by the client, and the generation process of the second target image is the same as that of the first target image, that is, the client divides the image to be processed into a plurality of image blocks, randomly selects the second image block from the image blocks, performs the same preprocessing operation as that of the first image block on the second image block, scales and mixes the image to be processed and the processed second image block, and then inputs the mixed second target image into the second residual neural network, so as to obtain the perceptual hash feature value of the second target image.

It should be noted that, after the server side completes the calculation of the perceptual hash characteristic value of the first target image, an ownership verification request is generated and sent, the ownership verification request contains the network parameters of the first residual error neural network, and the client side sets the network parameters of the second residual error neural network according to the network parameters of the first residual error neural network.

Further, the client sends the perceptual hash feature value of the second target image to the server, and the server can judge whether the client passes ownership verification according to the perceptual hash feature value of the first target image and the perceptual hash feature value of the second target image.

Optionally, with reference to fig. 4, the step of determining whether the client passes the ownership verification according to the perceptual hash feature value of the first target image and the perceptual hash feature value of the second target image includes:

comparing the second average absolute error distance with a second preset threshold; if the second average absolute error distance is smaller than a second preset threshold, the client passes ownership verification, otherwise, the client does not pass ownership verification;

and sending the verification result to the client.

Specifically, the server calculates a second average absolute error distance between a perceptual hash characteristic value of the first target image and a perceptual hash characteristic value of the second target image, and when the second average absolute error distance is smaller than a second preset threshold, the client passes ownership verification; and when the second average absolute error distance is larger than or equal to a second preset threshold, the client side does not pass ownership verification, and the server rejects the uploading request.

Therefore, the invention does not need to introduce a server specially used for ownership verification and set an additional reference image library, and the perceptual hash characteristic value used for ownership verification is not reused, thereby ensuring that the image deduplication method has excellent safety and reliability.

It should be noted that, no matter whether the image to be processed is finally uploaded to the server, the image to be processed, the first exact matching data and the first perceptual hash feature value may be stored in the database, and the updated database is favorable for being used as a basis for similarity determination in the subsequent deduplication process of other images.

As shown in fig. 5, an embodiment of the present invention further provides an image deduplication method based on a residual neural network, which is applied to a client, and includes:

s501, acquiring an image to be processed;

s502, determining first accurate matching data of the image to be processed according to a Sha256 algorithm;

s503, determining a first perception hash characteristic value of the image to be processed by utilizing a pre-trained second residual error neural network model;

s504, sending an uploading request of the image to be processed to a server, so that the server performs similarity detection on the image to be processed in a database according to the first accurate matching data and the first perceptual hash characteristic value, and comparing the image quality scores of the image to be processed and the first image when the first image matched with the image to be processed is detected;

s505, if the image quality score of the image to be processed is higher than that of the first image, receiving an uploading instruction, and uploading the image to be processed to a server;

s506, receiving the link of the first image if the image quality score of the image to be processed is lower than the image quality score of the first image.

In the description of the present invention, the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implying any number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.

In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples described in this specification can be combined and combined by those skilled in the art.

While the present application has been described in connection with various embodiments, other variations to the disclosed embodiments can be understood and effected by those skilled in the art in practicing the claimed application, from a review of the drawings, the disclosure, and the appended claims. In the claims, the word "comprising" does not exclude other elements or steps, and the word "a" or "an" does not exclude a plurality. A single processor or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.

The foregoing is a more detailed description of the invention in connection with specific preferred embodiments and it is not intended that the invention be limited to these specific details. For those skilled in the art to which the invention pertains, several simple deductions or substitutions can be made without departing from the spirit of the invention, and all shall be considered as belonging to the protection scope of the invention.

Claims

1. An image deduplication method based on a residual error neural network is applied to a server and comprises the following steps:

2. The residual neural network-based image deduplication method of claim 1, wherein the step of performing similarity detection on the to-be-processed image in a database according to the first exact match data and the first perceptual hash feature value comprises:

3. The residual neural network-based image deduplication method of claim 2, wherein when a first image matching a to-be-processed image is detected in the database, the step of comparing the image quality of the to-be-processed image with that of the first image is preceded by the step of:

and performing ownership verification on the client.

4. The residual neural network-based image deduplication method of claim 3, wherein the step of performing ownership verification on the client comprises:

5. The residual neural network-based image deduplication method of claim 4, wherein the step of determining whether the client passes ownership verification according to the perceptual hash feature value of the first target image and the perceptual hash feature value of the second target image comprises:

and sending a verification result to the client.

6. The residual neural network-based image deduplication method of claim 4, wherein the preprocessing comprises rotation and/or translation operations.

7. The residual neural network-based image deduplication method of claim 1, wherein when a first image matching a to-be-processed image is detected in the database, the step of comparing the to-be-processed image with an image quality score of the first image is followed by further comprising:

8. The residual neural network-based image deduplication method of claim 1, wherein when there is no first image matching the image to be processed in the database, the step of performing similarity detection on the image to be processed in the database according to the first exact match data and the first perceptual hash feature value further comprises:

9. The residual neural network-based image deduplication method according to claim 1 or 8, wherein after the step of receiving the to-be-processed image uploaded by the client, the method further comprises:

10. An image deduplication method based on a residual error neural network is applied to a client side, and comprises the following steps:

acquiring an image to be processed;