CN112417381B

CN112417381B - Method and device for rapidly positioning infringement image applied to image copyright protection

Info

Publication number: CN112417381B
Application number: CN202011446862.9A
Authority: CN
Inventors: 张季玮; 王泽辉
Original assignee: Chinaso Information Technology Co ltd
Current assignee: Chinaso Information Technology Co ltd
Priority date: 2020-12-11
Filing date: 2020-12-11
Publication date: 2021-06-22
Anticipated expiration: 2040-12-11
Also published as: CN112417381A

Abstract

The invention relates to the field of image copyright protection, and discloses a method and a device for quickly positioning an infringement image for image copyright protection, wherein the method comprises the steps of respectively obtaining copyright image data and an image to be retrieved, and extracting a characteristic vector by using an image infringement detection deep learning model; calculating a perceptual hash value of each image; and establishing a large-scale data retrieval method by using the characteristic vector and the perceptual hash value, and performing positioning infringement retrieval on the image data to be retrieved by the large-scale data retrieval method to obtain an infringement retrieval result. The invention extracts the high-dimensional characteristic vector of the image by combining the CNN network and the R-MAC method, simultaneously calculates the perceptual hash value of the image, and identifies and compares the image by two layers of scales, thereby enhancing the identification of anti-attack means which possibly appears in an infringement image and enhancing the robustness of the model.

Description

Method and device for rapidly positioning infringement image applied to image copyright protection

Technical Field

The invention relates to the field of image copyright protection, in particular to a method and a device for quickly positioning an infringement image applied to image copyright protection.

Background

With the rapid development of multimedia and internet technologies, massive image data has been widely applied to various fields of society, and the problem of image copyright protection is becoming more severe. Generally, an image retrieval technology is used to search and compare an infringement image and an original image, which mainly includes two steps: how to express the characteristic information of the image and an efficient retrieval method. The difference in the way image content is described can be divided into two categories: text-based image retrieval and content-based image retrieval. The text-based image retrieval method mainly utilizes text annotation to describe the content in the image, but with the rapid increase of data magnitude, the manpower and financial resources consumed by manually annotating the text are not enough to support the huge data volume in reality. In recent decades, image retrieval technologies based on content have been rapidly developed, and in the early century, due to the excellent performance of SIFT features in the problem of image scale change, image retrieval methods based on local description operators have been widely researched, most local feature description methods encode features based on gradients to overcome semantic differences, such as a bag-of-words model, a Fisher vector or a vector local aggregation descriptor, and the like, but the encoding methods cannot capture high-level semantic information of images. In recent years, high-dimensional image features extracted by the image representation method based on the deep learning convolutional neural network show extremely high performance, and the method for extracting the features by image retrieval gradually changes from SIFT features to CNN features. With the continuous increase of data scale, the calculation cost for training the deep learning model is also continuously increased, so that the comparison and identification of images under large-scale data and the quick search and sequencing of infringement results need to be met by more reasonably utilizing high-dimensional image features and a more efficient retrieval method.

Disclosure of Invention

The invention provides a method and a device for rapidly positioning an infringement image for image copyright protection, thereby solving the problems in the prior art.

In a first aspect, the present invention provides a method for rapidly positioning an infringement image applied to image copyright protection, comprising the following steps:

s1) respectively acquiring copyright image data and an image to be retrieved, establishing a picture infringement detection deep learning model, and extracting a feature vector of each image in the copyright image data and the image to be retrieved by using the picture infringement detection deep learning model;

s2) calculating the perception hash value of each image in the copyright image data and the image to be retrieved;

s3) establishing a large-scale data retrieval method by utilizing the characteristic vector and the perception hash value, and rapidly positioning and infringing the retrieval of the image data to be retrieved by the large-scale data retrieval method to obtain an infringing retrieval result.

Further, in step S1), acquiring the copyright image data and the image to be retrieved, respectively, establishing a picture infringement detection deep learning model, and extracting a feature vector of each image in the copyright image data and the image to be retrieved by using the picture infringement detection deep learning model, including the following steps:

s11) carrying out first image preprocessing on the copyright image data and each image in the images to be retrieved, wherein the first image preprocessing comprises the step of scaling the copyright image data and each image in the images to be retrieved to a first preset size;

s12), establishing a CNN-based deep learning model, inputting each image preprocessed by a first image into the CNN-based deep learning model, and taking a feature map of the last convolutional layer of the CNN-based deep learning model as output;

s13) extracting a plurality of areas with different scales from each feature map by adopting an R-MAC method to obtain a plurality of area R-MAC features, summing and pooling the plurality of area R-MAC features, and splicing to obtain n-dimensional feature vectors corresponding to each image; the n-dimensional feature vector corresponding to each image includes the n-dimensional feature vector of each image in the copyrighted-image data and the n-dimensional feature vector of the image to be retrieved.

Further, in step S2), calculating a perceptual hash value of each image in the copyrighted-image data and the image to be retrieved, including the following steps:

s21) carrying out second image preprocessing on the copyright image data and each image in the images to be retrieved, wherein the second image preprocessing comprises the step of scaling the copyright image data and each image in the images to be retrieved to a second preset size;

s22) converting each image scaled to the second preset size into a gray image, respectively;

s23) calculating the discrete cosine transform of each image after being converted into the gray level image to obtain a discrete cosine transform coefficient matrix;

s24) extracting a low-frequency matrix with a preset size at the upper left corner of the discrete cosine transform coefficient matrix, calculating the element average value of the low-frequency matrix, setting elements which are larger than or equal to the element average value in the low-frequency matrix to be 1, setting elements which are smaller than the element average value in the low-frequency matrix to be 0, and obtaining the low-frequency matrix with the elements set to be 1 or 0;

s25) flattening the low-frequency matrix with the element set to be 1 or 0 into a one-dimensional vector to obtain copyright image data and a perception hash value of each image in the image to be retrieved.

Further, in step S3), the method for building a large-scale data retrieval by using the feature vector and the perceptual hash value includes constructing a retrieval index file and retrieving by using the retrieval index file, where constructing the retrieval index file includes the following steps:

s31) initializing a retrieval index file data structure, wherein the retrieval index file data structure comprises a reverse arrangement table, a code table, a reverse arrangement vector ID table and/or a reverse arrangement vector coding table;

s32) training data are obtained, and a clustering algorithm is trained by using the training data; the training data comprises a plurality of data points, and the data points are respectively n-dimensional characteristic vectors of each image in the copyright image data; establishing a reverse vector ID table, wherein the reverse vector ID table is used for storing a plurality of data points and IDs of the data points;

s33) determining the number of the centers of the clusters and the number range of elements in the cluster where each cluster center is located according to the data volume of the training data;

s34) randomly initializing all cluster centers and updating all cluster centers and code tables, comprising the following steps:

s3031) initializing all clustering centers;

s3032) calculating any one clustering center C_qShortest distance d (C) to other cluster centers_q，C_w)；d(C_q，C_w) Represents the clustering center C_qAnd a distance clustering center C_qNearest cluster center C_wThe distance of (d);

s3033) obtaining a clustering center C_qThe data point x in the cluster to the cluster center C_qDistance d (C)_qX), judgment 2d (C)_q，x)≤d(C_q，C_w) If yes, the classification position of the data point x is unchanged; if not, entering step S3034);

s3034) calculating the distance from the data point x to other clustering centers, and classifying the data point x into the cluster where the clustering center closest to the other clustering centers is located;

s3035) repeating the steps S3032) to S3034), and sequentially obtaining the clustering center C_qA classification location for each data point in the cluster;

s3036) repeating the steps S3032) to S3035), and sequentially obtaining the classification position of each data point in the cluster where each cluster center is located;

s3037) updating all the clustering centers, judging whether all the clustering centers change, and if so, returning to the step S3032); if not, finishing the cluster updating, obtaining all the updated cluster centers and all the data points in the cluster of each cluster center, wherein each data point corresponds to one ID, adding all the updated cluster centers into a code table, storing all the data points in the cluster of each cluster center and the IDs of the data points into corresponding inverted arrangement tables, each cluster center corresponds to one inverted arrangement table, inverted IDs and inverted code tables are stored in the inverted arrangement tables, the inverted IDs are used for storing the IDs of the data points, and the inverted code tables are used for storing all the data points in the cluster of the cluster center.

Further, the retrieval by using the retrieval index file comprises the following steps:

s35), acquiring the constructed retrieval index file, calculating the vector distance between the N-dimensional feature vector of the image to be retrieved and all the updated clustering centers, and acquiring N clustering center points closest to the N-dimensional feature vector of the image to be retrieved;

s36) obtaining a reverse arrangement table of the N clustering center points, traversing the reverse arrangement coding tables of the N clustering center points in parallel through OpenMP, and calculating the distance between the N-dimensional feature vector of the image to be retrieved and the feature vector in the reverse arrangement coding table of each clustering center point; and obtaining a plurality of feature vectors closest to the n-dimensional feature vector of the image to be retrieved.

Further, in step S3), performing fast positioning infringement search on the image data to be searched by a large-scale data search method to obtain an infringement search result, including the following steps:

s361) respectively calculating Euclidean distances between a plurality of feature vectors closest to the n-dimensional feature vector of the image to be retrieved and the n-dimensional feature vector of the image to be retrieved;

s371) obtaining a plurality of perceptual hash values of the copyright images corresponding to a plurality of feature vectors closest to the n-dimensional feature vector of the image to be retrieved, and calculating Hamming distances between the perceptual hash values of the image to be retrieved and the perceptual hash values of the copyright images respectively;

s381) establishing a first distance scoring function f_i＝w₁d_1i+w₂d_2iI 1, 2, 1, m is the total number of the plurality of copyright images; w is a₁Is a feature vector distance weight, w₂To perceive the hash value distance weight, d_1iIs the Euclidean distance between the ith feature vector nearest to the n-dimensional feature vector of the image to be retrieved and the n-dimensional feature vector of the image to be retrieved, d_2iThe Hamming distance between the perceptual hash value of the ith copyright image and the perceptual hash value of the image to be retrieved is obtained; f. of_iThe value of credit of the ith copyright image; the ith copyright image corresponds to the ith characteristic vector closest to the n-dimensional characteristic vector of the image to be retrieved;

s391) respectively calculating the score values of a plurality of copyright images, sequencing the score values, setting a score threshold, judging whether the maximum score value is greater than the score threshold, if so, acquiring the copyright image corresponding to the maximum score value, and identifying the image to be retrieved as the copyright image infringing the right corresponding to the maximum score value; if not, the image to be retrieved is indicated to have no infringement image retrieved.

s362) respectively calculating Euclidean distances between a plurality of feature vectors closest to the n-dimensional feature vector of the image to be retrieved and the n-dimensional feature vector of the image to be retrieved;

s363) setting a Euclidean distance threshold, and selecting k feature vectors of which Euclidean distances from the n-dimensional feature vectors of the image to be retrieved are larger than the Euclidean distance threshold from a plurality of feature vectors closest to the n-dimensional feature vectors of the image to be retrieved;

s364), acquiring the perceptual hash values of k copyright images corresponding to k eigenvectors, and respectively calculating the Hamming distance between the perceptual hash value of the image to be retrieved and the perceptual hash values of the k copyright images;

s365) setting a Hamming distance threshold, and selecting z feature vectors of which the Hamming distance from the perceptual hash value of the image to be retrieved is larger than the Hamming distance threshold from the k feature vectors; the z feature vectors correspond to the z copyright images;

s366) establishing a second distance scoring function F_j＝W₁D_1j+W₂D_2j，j＝1、2、...、z，W₁Is the second eigenvector distance weight, W₂Is a second perceptual hash value distance weight, D_1jIs Euclidean distance between the feature vector of the jth copyright image in the z copyright images and the n-dimensional feature vector of the image to be retrieved, D_2jThe Hamming distance between the perceptual hash value of the jth copyright image in the z copyright images and the perceptual hash value of the image to be retrieved is obtained; f_jThe value of credit of the j-th copyright image in the z copyright images;

s367) respectively calculating the score values of the z copyright images, sequencing the score values, setting a second score threshold value, judging whether the maximum score value in the score values of the z copyright images is larger than the second score threshold value, if so, acquiring the copyright image corresponding to the maximum score value in the score values of the z copyright images, and identifying the image to be retrieved as the copyright image corresponding to the maximum score value in the score values infringing the z copyright images; if not, the image to be retrieved is indicated to have no infringement image retrieved.

Further, the deep learning model based on CNN is to select VGG16 or AlexNet convolutional neural network pre-trained on ImageNet.

On the other hand, the invention provides a device for rapidly positioning an infringement image for image copyright protection, which comprises an image acquisition unit, a model establishing unit and an infringement image rapid positioning and retrieving unit;

the image acquisition unit is used for acquiring copyright image data and an image to be retrieved;

the model establishing unit is used for establishing a picture infringement detection deep learning model and extracting a feature vector containing copyright image data and global features and local features of each image in an image to be retrieved by using the picture infringement detection deep learning model; calculating the perceptual hash value of the copyright image data and each image in the image to be retrieved;

and the infringement image rapid positioning retrieval unit is used for establishing a large-scale data retrieval method by utilizing the characteristic vector and the perception hash value in the model establishing unit, and rapidly positioning and infringement retrieval is carried out on the image data to be retrieved acquired in the image acquisition unit by the large-scale data retrieval method to obtain an infringement retrieval result.

In a further aspect, the invention provides a computer-readable storage medium, in which a computer program is stored, which, when executed by a processor, causes the processor to carry out the steps of the above-mentioned method.

The invention has the beneficial effects that: firstly, high-dimensional feature vectors (namely global features and local features) of the image are extracted through the combination of a CNN (computer network) and an R-MAC (radio-media access control) method, the perceptual hash value of the image is calculated, the image is identified and compared in two-layer scales, especially, the identification of anti-attack means possibly appearing in an infringing image is enhanced, and the robustness of the model is greatly enhanced; in addition, the invention provides a retrieval method for quickly retrieving the source-tracing infringement images of billions of images, which fully utilizes computing resources to carry out high-efficiency clustering algorithm and inverted list calculation, is suitable for retrieval of single images and retrieval of batch data, can realize millisecond-level query of ten-million-level data under the single-machine effect (such as 48-core cpu 128g RAM), and greatly improves the retrieval efficiency under mass data; meanwhile, aiming at the characteristics of image copyright protection, the method is different from the screening of a single threshold, uses a mode of combining multiple indexes, provides a method for automatically tuning parameters aiming at the characteristics of data, and ensures the accuracy and recall rate of retrieval to the maximum extent.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the embodiments are briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without creative efforts.

Fig. 1 is a flowchart illustrating a method for rapidly positioning an infringement image for image copyright protection according to an embodiment of the present invention.

Fig. 2 is a schematic flow chart of extracting feature vectors by using a picture infringement detection deep learning model according to the first embodiment.

Fig. 3 is a schematic flowchart of calculating a perceptual hash value of each image according to the first embodiment.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings. It should be understood that the detailed description and specific examples, while indicating the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention. It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order, and it should be understood that the terms so used are interchangeable under appropriate circumstances and are merely used to describe the distinguishing manner in which the embodiments of the present invention distinguish between similar elements. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of elements is not necessarily limited to those elements, but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.

In a first aspect, the present invention provides a method for rapidly locating an infringement image for image copyright protection, as shown in fig. 1, including the following steps:

s1) respectively obtaining copyright image data and an image to be retrieved, establishing a picture infringement detection deep learning model, and extracting feature vectors including global features and local features of each image in the copyright image data and the image to be retrieved by using the picture infringement detection deep learning model, as shown in fig. 2, including the following steps:

s12), establishing a CNN-based deep learning model, inputting each image preprocessed by the first image into the CNN-based deep learning model respectively, and taking the feature map of the last convolutional layer of the CNN-based deep learning model as output;

Step S1), extracting the regional characteristics of the characteristic graph obtained by calculating the last convolution layer of the CNN-based deep learning model through an R-MAC method, generating a series of regional vectors for the local region specified in the center of the image by using the R-MAC method, then summing and pooling for vector aggregation, outputting the vectors as 512-dimensional characteristic vectors, and storing all vector results in a database to be used in the retrieval process. The CNN-based deep learning model uses ImageNet pre-training model parameters, and can be selected to be finely adjusted on specific data, so that the requirements of data with different orders of magnitude are met. The deep learning model based on CNN is to select VGG16 or AlexNet convolutional neural network pre-trained on ImageNet.

S2) calculating the perceptual hash value of each image in the copyrighted-image data and the image to be retrieved, as shown in fig. 3, including the steps of:

In order to enhance the attack resisting capability of the model for an infringement image, on the basis of deep learning characteristics, perceptual hash is selected as a posteriori measurement index, firstly, the image is scaled to the size of 32x32 (namely, copyright image data and each image in the image to be retrieved are subjected to second image preprocessing), image redundant information is reduced, meanwhile, the calculation of discrete cosine transform is accelerated, the image is converted into a 64-degree gray image, the calculation amount is further simplified, and the discrete cosine transform of the image is calculated.

In step S3), a large-scale data retrieval method is established using the eigenvectors and the perceptual hash values, including constructing a retrieval index file and retrieving using the retrieval index file, the constructing of the retrieval index file including the steps of:

s3031) initializing all clustering centers;

The retrieval is carried out by utilizing the retrieval index file, and the method comprises the following steps:

Inputting a training data training clustering algorithm, fixing the central number of clustering according to the data volume, firstly checking the number of elements participating in training clustering and the element number range in a preset clustering central point, and judging whether random sampling is carried out or not; randomly initializing a clustering center, adding the clustering center into a code table, and performing clustering iteration for specified times; finding out the nearest clustering center and distance of each element to be clustered, updating the clustering center, replacing the clustering center point before the mean value of each central element obtained by parallel calculation is replaced, finding out the splitting center meeting the conditions aiming at the clustering center with empty elements, modifying the splitting center point value through small symmetrical disturbance, taking the splitting center point and the original center point mean value as a substitute, updating the code table, and continuing iteration until the iteration is finished.

The invention reduces unnecessary distance calculation by using the triangle inequality and accelerates the clustering algorithm. When the clustering algorithm is trained, the element number range of each clustering center is fixed, so that the clustering centers are prevented from having no representativeness due to too few elements, and meanwhile, excessive data elements are prevented from training the model, and the training time is prevented from being increased.

When the nearest cluster center and distance of each element are calculated, excessive time is consumed when the data size is large because all elements need to be traversed, and the calculation process is optimized by adopting the BLAS linear algebra library and the SIMD AVX2 instruction set, so that the calculation time is greatly saved. And when a retrieval index file is created, respectively calculating the distances from all the training data to all the clustering centers, adding the distances to the clustering centers closest to the training data to the clustering centers, and storing the characteristic vectors into an inverted coding list so as to construct the retrieval index file.

The method comprises the steps of loading a constructed retrieval index file, inputting a feature vector of an image to be retrieved (the feature vector of the image to be retrieved is the feature vector of a single image to be queried or the feature vectors of a plurality of images), firstly querying N clustering centers closest to the feature vector of the image to be retrieved, obtaining an inverted arrangement table of the current N clustering centers, traversing the inverted arrangement coding table in parallel through OpenMP, calculating the distance between the vectors and querying, putting the result into a returned maximum heap, circularly accumulating the vectors returned by querying, sequencing the maximum heap, rapidly positioning fragment data by coarsely querying the N clustering centers to reduce the data volume needing to be calculated in the process, and fully utilizing the performance of a multi-core processor through OpenMP to accelerate the retrieval speed.

Step S3), rapidly positioning and infringing the search for the image data to be searched by a large-scale data search method to obtain an infringing search result, comprising the following steps:

s381) constructionEstablishing a first distance scoring function f_i＝w₁d_1i+w₂d_2iI 1, 2, 1, m is the total number of the plurality of copyright images; w is a₁Is a feature vector distance weight, w₂To perceive the hash value distance weight, d_1iIs the Euclidean distance between the ith feature vector nearest to the n-dimensional feature vector of the image to be retrieved and the n-dimensional feature vector of the image to be retrieved, d_2iThe Hamming distance between the perceptual hash value of the ith copyright image and the perceptual hash value of the image to be retrieved is obtained; f. of_iThe value of credit of the ith copyright image; the ith copyright image corresponds to the ith characteristic vector closest to the n-dimensional characteristic vector of the image to be retrieved;

In this embodiment, a result (an euclidean distance) returned by the constructed retrieval index file is used, a plurality of perceptual hash values of the copyrighted images corresponding to a plurality of feature vectors closest to the n-dimensional feature vector of the image to be retrieved are found through the inverted vector ID table, hamming distances between the perceptual hash values of the image to be retrieved and the perceptual hash values of the copyrighted images are calculated and calculated, a distance scoring function is used for screening and sorting, a scoring function and two weighting threshold screening criteria are constructed based on two distance indexes according to infringement data characteristics, and the result is finally scored and sorted.

On the other hand, the embodiment of the invention provides a device for quickly positioning an infringement image for image copyright protection, which comprises an image acquisition unit, a model establishing unit and an infringement image quick positioning and retrieving unit;

In yet another aspect, the present invention provides a computer-readable storage medium, which stores a computer program, and when the computer program is executed by a processor, the processor is caused to execute the steps of the above method.

In step S3), performing fast positioning infringement search on image data to be searched by a large-scale data search method to obtain an infringement search result, where the method is applied to fast positioning infringement images for image copyright protection, and includes the following steps:

The other contents of the second embodiment are the same as those of the first embodiment, and are not described herein again.

By adopting the technical scheme disclosed by the invention, the following beneficial effects are obtained:

firstly, high-dimensional feature vectors (global features and local features) of the image are extracted by combining a CNN network and an R-MAC method, a perceptual hash value of the image is calculated, the image is identified and compared by two layers of scales, and particularly, the identification of anti-attack means possibly appearing in an infringing image is enhanced, so that the robustness of the model is greatly enhanced; in addition, the invention provides a retrieval method for quickly retrieving the source-tracing infringement images of hundred million-level images, which fully utilizes computing resources to carry out high-efficiency clustering algorithm and inverted list calculation, is suitable for retrieval of single images and retrieval of batch data, can realize millisecond-level query of million-level data under the single-machine effect, and greatly improves the retrieval efficiency under mass data; meanwhile, aiming at the characteristics of image copyright protection, the method is different from the screening of a single threshold, uses a mode of combining multiple indexes, provides a method for automatically tuning parameters aiming at the characteristics of data, and ensures the accuracy and recall rate of retrieval to the maximum extent.

The foregoing is only a preferred embodiment of the present invention, and it should be noted that it will be apparent to those skilled in the art that various modifications and improvements can be made without departing from the principle of the present invention, and such modifications and improvements should also be considered within the scope of the present invention.

Claims

1. The method for rapidly positioning the infringement image applied to the image copyright protection is characterized by comprising the following steps of:

s2) calculating the perception hash value of the copyright image data and each image in the image to be retrieved;

s3) establishing a large-scale data retrieval method by using the feature vector and the perceptual hash value, and performing rapid positioning infringement retrieval on the image data to be retrieved by the large-scale data retrieval method to obtain an infringement retrieval result;

in step S3), a large-scale data retrieval method is established using the feature vector and the perceptual hash value, including constructing a retrieval index file and retrieving using the retrieval index file, where constructing the retrieval index file includes the following steps:

s31), initializing a retrieval index file data structure, wherein the retrieval index file data structure comprises a reverse arrangement table, a code table, a reverse arrangement vector ID table and/or a reverse arrangement vector coding table;

s32) obtaining training data, and training a clustering algorithm by using the training data; the training data comprises a plurality of data points, and the data points are respectively n-dimensional feature vectors of each image in the copyright image data; establishing a reverse vector ID table, wherein the reverse vector ID table is used for storing a plurality of data points and IDs of the data points;

s3031) initializing all clustering centers;

s3032) calculating any one clustering center C_qShortest distance d (C) to other cluster centers_q，C_w)，d(C_q，C_w) Represents the clustering center C_qAnd is distant from the cluster center C_qNearest cluster center C_wThe distance of (d);

2. The method for rapidly positioning infringement image for image copyright protection according to claim 1, wherein in step S1), separately acquiring copyright image data and image to be retrieved, establishing a deep learning model for picture infringement detection, and extracting feature vectors of each image in the copyright image data and the image to be retrieved by using the deep learning model for picture infringement detection, the method comprises the following steps:

s11) performing a first image preprocessing on the copyrighted-image data and each of the images to be retrieved, the first image preprocessing including scaling the copyrighted-image data and each of the images to be retrieved to a first preset size;

s13) extracting a plurality of areas with different scales from each feature map by adopting an R-MAC method to obtain a plurality of area R-MAC features, summing and pooling the plurality of area R-MAC features, and splicing to obtain n-dimensional feature vectors corresponding to each image; the n-dimensional feature vector corresponding to each image comprises the n-dimensional feature vector of each image in the copyright image data and the n-dimensional feature vector of the image to be retrieved.

3. The method for rapidly positioning infringed image for image copyright protection according to claim 1 or 2, wherein in step S2), the step of calculating the perceptual hash value of each image in the copyrighted-image data and the image to be retrieved comprises the following steps:

s21) performing second image preprocessing on the copyrighted-image data and each of the images to be retrieved, the second image preprocessing including scaling the copyrighted-image data and each of the images to be retrieved to a second preset size;

s25) flattening the low-frequency matrix with the element set to be 1 or 0 into a one-dimensional vector to obtain the copyright image data and the perception hash value of each image in the image to be retrieved.

4. The method for rapidly positioning infringed image for image copyright protection according to claim 1, wherein the retrieval using the retrieval index file comprises the following steps:

s35), acquiring the constructed retrieval index file, calculating the vector distance between the N-dimensional feature vector of the image to be retrieved and all updated clustering centers, and acquiring N clustering center points closest to the N-dimensional feature vector of the image to be retrieved;

s36) obtaining the inverted arrangement table of the N clustering center points, traversing the inverted arrangement code table of the N clustering center points in parallel through OpenMP, and calculating the distance between the N-dimensional feature vector of the image to be retrieved and the feature vector in the inverted arrangement code table of each clustering center point; and obtaining a plurality of feature vectors closest to the n-dimensional feature vector of the image to be retrieved.

5. The method for rapidly positioning infringement image for image copyright protection according to claim 4, wherein in step S3), the rapid positioning infringement retrieval is performed on the image data to be retrieved by the large-scale data retrieval method to obtain an infringement retrieval result, and the method comprises the following steps:

s371) obtaining a plurality of perceptual hash values of the copyright images corresponding to a plurality of feature vectors closest to the n-dimensional feature vector of the image to be retrieved, and calculating Hamming distances between the perceptual hash values of the image to be retrieved and the perceptual hash values of the plurality of copyright images respectively;

s381) establishing a first distance scoring function f_i＝w₁d_1i+w₂d_2iI 1, 2, and m, wherein m is the total number of the plurality of copyright images; w is a₁Is a feature vector distance weight, w₂To perceive the hash value distance weight, d_1iIs the Euclidean distance between the ith feature vector nearest to the n-dimensional feature vector of the image to be retrieved and the n-dimensional feature vector of the image to be retrieved, d_2iThe Hamming distance between the perceptual hash value of the ith copyright image and the perceptual hash value of the image to be retrieved is obtained; f. of_iThe value of credit of the ith copyright image; the ith copyright image corresponds to the ith characteristic vector closest to the n-dimensional characteristic vector of the image to be retrieved;

s391) respectively calculating the score values of the plurality of copyright images, sequencing the score values, setting a score threshold, judging whether the maximum score value is larger than the score threshold, if so, acquiring the copyright image corresponding to the maximum score value, and determining the image to be retrieved as the copyright image corresponding to the maximum score value infringed; if not, the image to be retrieved is indicated to have no infringement image retrieved.

6. The method for rapidly positioning infringement image for image copyright protection according to claim 4, wherein in step S3), the rapid positioning infringement retrieval is performed on the image data to be retrieved by the large-scale data retrieval method to obtain an infringement retrieval result, and the method comprises the following steps:

s362) respectively calculating Euclidean distances between a plurality of feature vectors nearest to the n-dimensional feature vector of the image to be retrieved and the n-dimensional feature vector of the image to be retrieved;

s363) setting a Euclidean distance threshold, and selecting k eigenvectors of which the Euclidean distance from the n-dimensional eigenvector of the image to be retrieved is larger than the Euclidean distance threshold from a plurality of eigenvectors closest to the n-dimensional eigenvector of the image to be retrieved;

s364), acquiring the perceptual hash values of k copyright images corresponding to the k characteristic vectors, and respectively calculating the Hamming distance between the perceptual hash value of the image to be retrieved and the perceptual hash values of the k copyright images;

s365) setting a Hamming distance threshold, and selecting z feature vectors of which the Hamming distance from the k feature vectors to the perceptual hash value of the image to be retrieved is larger than the Hamming distance threshold; the z feature vectors correspond to z copyrighted images;

s367) calculating the score values of the z copyrighted images, sorting the score values, setting a second score threshold, determining whether a maximum score value of the score values of the z copyrighted images is greater than the second score threshold, if so, obtaining the copyrighted image corresponding to the maximum score value of the score values of the z copyrighted images, and regarding the image to be retrieved as the copyrighted image corresponding to the maximum score value of the score values infringing the z copyrighted images; if not, the image to be retrieved is indicated to have no infringement image retrieved.

7. The method for rapidly positioning infringement images for image copyright protection according to claim 2, wherein the deep learning model based on CNN is a convolutional neural network selected from VGG16 or AlexNet pre-trained on ImageNet.

8. The device for rapidly positioning infringed image for image copyright protection is suitable for the method for rapidly positioning infringed image for image copyright protection according to any one of claims 1 to 7, and is characterized by comprising an image acquisition unit, a model building unit and an infringed image rapid positioning retrieval unit;

the model establishing unit is used for establishing a picture infringement detection deep learning model, and extracting feature vectors containing the copyright image data and global features and local features of each image in the image to be retrieved by using the picture infringement detection deep learning model; calculating the perception hash value of the copyright image data and each image in the image to be retrieved;

the infringement image rapid positioning retrieval unit is used for establishing a large-scale data retrieval method by utilizing the characteristic vector and the perception hash value in the model establishing unit, and rapidly positioning and infringement retrieval is carried out on the image data to be retrieved acquired in the image acquisition unit by the large-scale data retrieval method to obtain an infringement retrieval result.

9. A computer-readable storage medium, storing a computer program which, when executed by a processor, causes the processor to carry out the steps of the method according to any one of claims 1 to 7.