CN114722226B - Self-adaptive retrieval method and device capable of matching images and storage medium - Google Patents


Info

Publication number: CN114722226B
Application number: CN202210645024.7A
Authority: CN (China)
Legal status: Active
Inventors: 王伟玺, 谢林甫, 郭欢, 李晓明, 汤圣君
Original and current assignee: Shenzhen University
Other versions: CN114722226A (application publication)
Other languages: Chinese (zh)
Application filed by Shenzhen University; priority to CN202210645024.7A.
Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50: Information retrieval of still image data
    • G06F16/53: Querying
    • G06F16/538: Presentation of query results
    • G06F16/54: Browsing; Visualisation therefor
    • G06F16/56: Still image data having vectorial format

Abstract

The invention discloses an adaptive retrieval method, device, and storage medium for matchable images, wherein the method comprises the following steps: acquiring an unordered image set to participate in three-dimensional reconstruction, and extracting a similarity vector for each image according to a pre-trained visual dictionary; sorting the similarity values of the similarity vector to obtain a sorted similarity vector; substituting the similarity values of the sorted similarity vector into a high-order polynomial function and solving for the function coefficients, to obtain the high-order polynomial function fitting the image-similarity distribution curve; calculating the inflection point value of the high-order polynomial function, and calculating an adaptive threshold for the image from the inflection point value; and retrieving images according to the calculated adaptive threshold, and outputting similar-image results and the corresponding images according to the retrieval results. The invention can retrieve, from unordered images, image pairs that share same-name image points, reducing unnecessary matching between unrelated image pairs and improving the overall efficiency of three-dimensional reconstruction.

Description

Adaptive retrieval method and device capable of matching images and storage medium
Technical Field
The invention relates to the technical field of matchable images, and in particular to an adaptive retrieval method, device, and storage medium for matchable images.
Background
At present, three-dimensional model reconstruction from unordered images is a research focus and hot spot in digital photogrammetry, computer vision, and related fields. Image feature matching is one of the most time-consuming computational steps in three-dimensional reconstruction from unordered images, i.e., in Structure from Motion (SfM). Because unordered images lack prior information such as POS data, GPS positioning, flight-route planning, and image shooting order, the feature-matching stage usually requires exhaustive matching over all images (N x (N-1)/2 computations), and the large number of unnecessary matches between unrelated images causes great waste of computing resources and time.
At present, the mainstream technical route for matchable-image retrieval oriented to three-dimensional reconstruction is to extract local feature point operators (SIFT, SURF, ORB, and the like) from the images, cluster them based on a bag-of-words model to generate a visual dictionary, convert all images through the visual dictionary into visual dictionary vectors of the same dimension, and then obtain matchable image pairs by similarity measurement: the distance between the high-dimensional vectors (for example the Euclidean, cosine, or Hamming distance) is computed as a similarity value representing the degree of similarity between images (between 0 and 1, with higher values indicating greater similarity).
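As a minimal sketch of the similarity measurement just described, assuming cosine similarity as the vector metric and toy four-word bag-of-words histograms (the dictionary size and word counts are illustrative, not taken from the patent):

```python
import numpy as np

def cosine_similarity(u, v):
    # cosine similarity of two bag-of-words vectors; for non-negative
    # histograms the result lies between 0 and 1 (1 = most similar)
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# toy visual-word histograms for two images over a 4-word dictionary
img_a = np.array([3.0, 0.0, 1.0, 2.0])
img_b = np.array([2.0, 1.0, 0.0, 2.0])

sim = cosine_similarity(img_a, img_b)
```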
However, similarity measurement in the related art mainly uses a fixed-threshold method: an empirical threshold obtained through repeated experiments (retain images whose similarity exceeds some value n, or the top n images by similarity value) serves as the basis for judging image similarity. Because the threshold is fixed, retrieval results are either redundant or too few, causing unnecessary matching between unrelated images or insufficient matching of image features, which in turn degrades the subsequent three-dimensional reconstruction.
In view of this, there is still a need for improvement and development in the art.
Disclosure of Invention
In view of the above-mentioned shortcomings of the prior art, an object of the present invention is to provide an adaptive retrieval method, device, and storage medium for matchable images, so as to solve the technical problem of the low efficiency of existing three-dimensional reconstruction.
The technical scheme adopted by the invention to solve this technical problem is as follows:
In a first aspect, the present invention provides an adaptive retrieval method for matchable images, the method comprising:
acquiring an unordered image set to participate in three-dimensional reconstruction, and extracting a similarity vector for each image according to a pre-trained visual dictionary;
sorting the similarity values of the similarity vector to obtain a sorted similarity vector;
substituting the similarity values of the sorted similarity vector into a high-order polynomial function and solving for the function coefficients, to obtain the high-order polynomial function fitting the image-similarity distribution curve;
calculating the inflection point value of the high-order polynomial function, and calculating the adaptive threshold of the image from the inflection point value;
and retrieving images according to the calculated adaptive threshold, and outputting similar-image results and corresponding images according to the retrieval results.
In one implementation, before acquiring the unordered image set to participate in three-dimensional reconstruction and extracting the similarity vector of each image according to the pre-trained visual dictionary, the method includes:
acquiring a plurality of preset images;
extracting a plurality of local feature points from the preset images, and clustering the extracted local feature points to generate a visual dictionary;
converting the preset images, according to the visual dictionary, into a plurality of visual dictionary vectors of the same dimension;
and computing the distances among the visual dictionary vectors and obtaining, from the distances, the similarity values representing the similarity relations of all images, to obtain a similarity matrix.
In one implementation, acquiring the unordered image set to participate in three-dimensional reconstruction and extracting the similarity vector of each image according to the pre-trained visual dictionary includes:
acquiring the unordered image set to participate in three-dimensional reconstruction and the pre-trained visual dictionary;
extracting the similarity matrix from the pre-trained visual dictionary;
and extracting the similarity vector of each image in the unordered image set according to the similarity matrix.
In one implementation, substituting the similarity values of the sorted similarity vector into a high-order polynomial function and solving for the function coefficients, to obtain the high-order polynomial function fitting the image-similarity distribution curve, includes:
arranging the plurality of similarity values in descending order;
substituting the similarity values of the sorted similarity vector into the high-order polynomial function and solving for the function coefficients:
y = a·x³ + b·x² + c·x + d
wherein y is the similarity value, x is the sorted image sequence number, and a, b, c, and d are the constant-term coefficients;
and obtaining the high-order polynomial function fitting the image-similarity distribution curve from the solved function coefficients.
In one implementation, calculating the inflection point value of the high-order polynomial function and calculating the adaptive threshold of the image from the inflection point value includes:
taking the derivative of the high-order polynomial function, setting the derivative to zero, calculating the inflection point value, and substituting the inflection point value back into the high-order polynomial function to calculate the adaptive threshold of the image.
In one implementation, retrieving images according to the calculated adaptive threshold and outputting similar-image results and corresponding images according to the retrieval results includes:
judging in turn, in descending order, whether the similarity value of each similarity vector is larger than the corresponding adaptive threshold;
selecting all similarity vectors larger than the adaptive threshold, and outputting the matched images for the selected similarity vectors;
and selecting all similarity vectors less than or equal to the adaptive threshold, and excluding them.
In one implementation, retrieving images according to the calculated adaptive threshold and outputting similar-image results and corresponding images according to the retrieval results further includes:
counting, for entries with the same sequence numbers in the similarity matrix, the number of similarity values larger than the adaptive threshold to obtain the number of common partners, until all similarity vectors have been traversed;
and setting a specified value for the number of common partners: if the number of common partners is larger than or equal to the specified value, the two images are regarded as a similar image pair sharing same-name image points.
In one implementation, the adaptive retrieval method for matchable images further includes:
if the number of common partners is smaller than the specified value, the pair is regarded as a gross error of image retrieval and is removed.
In a second aspect, the present invention provides an adaptive retrieval device for matchable images, comprising a memory and a processor; the memory stores a matchable-image adaptive retrieval program which, when executed by the processor, implements the operations of the adaptive retrieval method for matchable images described above.
In a third aspect, the present invention provides a storage medium, which is a computer-readable storage medium storing a matchable-image adaptive retrieval program; when executed by a processor, the program implements the operations of the adaptive retrieval method for matchable images described above.
Compared with the prior art, the invention has the following beneficial effects:
1) The method fits the similarity distribution curve of unordered images with a high-order polynomial function, obtains through differentiation the inflection point where the similarity value changes most sharply, and takes the resulting adaptive threshold of each image as the judgment basis of the similarity measurement, thereby avoiding the redundant or too-few retrieval results caused by inaccurate thresholds in current similarity measurement methods;
2) The method proposes a gross-error rejection strategy based on the idea that similar image pairs should share many common similar images: using the similarity matrix, the adaptive thresholds, and related values, the number of common partners between images is counted and used as the screening basis for retrieval gross errors, thereby avoiding erroneous retrieval results caused by locally similar visual content.
Drawings
In order to illustrate the embodiments or technical solutions of the present invention more clearly, the drawings used in the description of the embodiments or the prior art are briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present invention; for those skilled in the art, other drawings can be obtained from these drawings without creative effort.
FIG. 1 is a flow chart of a method for adaptive searching of images to be matched according to the present invention;
FIG. 2 is a flow chart for establishing a similarity matrix according to the present invention;
FIG. 3 is a flow chart of gross error rejection after similar image results are obtained;
FIG. 4 is a detailed flowchart of the adaptive searching method for matching images shown in FIG. 1;
FIG. 5 is a detailed flowchart of gross error rejection after similar image results are obtained in FIG. 3;
fig. 6 is a functional schematic diagram of the adaptive image retrieval device according to the present invention.
The implementation, functional features and advantages of the present invention will be further described with reference to the accompanying drawings.
Detailed Description
In order to make the objects, technical solutions, and effects of the present application clearer, the present application is described in further detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the application and do not restrict its broad scope.
As used herein, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms "comprises" and/or "comprising", when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element, or intervening elements may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or wirelessly coupled. As used herein, the term "and/or" includes all or any combinations of one or more of the associated listed items.
It will be understood by those within the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
Exemplary method
Existing similarity measurement methods mainly adopt a fixed-threshold method: an empirical threshold obtained through repeated experiments (retain images whose similarity exceeds some value n, or the top n images by similarity value) serves as the basis for judging image similarity. Because the threshold is fixed, retrieval results are either redundant or too few, causing unnecessary matching between unrelated images or insufficient matching of image features, which in turn degrades the subsequent three-dimensional reconstruction.
Meanwhile, existing similarity measurement also includes a mean-threshold method: the mean and standard deviation of the similarity values between the query image and the remaining images are calculated, and the mean threshold of the query image is obtained as a linear function of the mean and the standard deviation. The mean threshold has a certain adaptability and, compared with the fixed-threshold method, can suppress redundant or too-few retrieval results to some extent. However, this method cannot accurately locate the similarity values of the images, the accuracy of the retrieval results is still not high, and current similarity measurement methods need further improvement.
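A sketch of the mean-threshold baseline described in this paragraph; the weight k in the linear function is a hypothetical choice, since the text does not fix it:

```python
import statistics

def mean_threshold(sims_to_rest, k=1.0):
    # linear function of the mean and standard deviation of the similarity
    # values between the query image and all remaining images; k is an
    # assumed weight, not specified in the source
    mu = statistics.fmean(sims_to_rest)
    sigma = statistics.pstdev(sims_to_rest)
    return mu + k * sigma

t = mean_threshold([0.9, 0.8, 0.2, 0.1])
```

Because the threshold moves with each query's similarity distribution it adapts somewhat, but it cannot locate the point where the similarity curve actually breaks.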
In view of the above problems, this embodiment provides an adaptive retrieval method for matchable images: the similarity vector of each image is extracted through a pre-trained visual dictionary; the similarity values of the similarity vector are substituted into a high-order polynomial function and the function coefficients are solved, yielding the high-order polynomial function fitting the image-similarity distribution curve; the inflection point value of the high-order polynomial function is calculated, and the adaptive threshold of the image is derived from it; images are then retrieved according to the calculated adaptive threshold, and similar-image results with their corresponding images are output according to the retrieval results, thereby avoiding the redundant or too-few retrieval results caused by inaccurate thresholds in current similarity measurement methods.
As shown in fig. 1, an embodiment of the present invention provides a method for adaptive searching of a matchable image, where the method for adaptive searching of a matchable image includes the following steps:
step S100: acquiring a disordered image set to be participated in three-dimensional reconstruction, and extracting the similarity vector of each image according to a pre-trained visual dictionary.
The image similarity value change curve is fitted through a high-order polynomial function, the inflection point value of the fitting function is obtained through derivation and serves as a judgment threshold value of image similarity, and then an image similarity self-adaptive measurement method based on the fitting function is obtained, so that the precision of the image retrieval result which can be matched is improved; aiming at the problem of retrieval gross error caused by local visual content similarity, the thinking that a similar image pair participating in three-dimensional reconstruction should have more common similar images is provided, and the retrieval gross error caused by local characteristic point similarity is eliminated through the numerical relationship of a similarity matrix, so that the precision of an image retrieval result is further improved.
Before the adaptive retrieval method of the matchable images is implemented, a visual dictionary is obtained through pre-training, the visual dictionary is formed by combining a plurality of visual words, visual dictionary vectors are extracted from the visual dictionary, and similarity values corresponding to all the images are obtained and used for making a similarity matrix.
Before step S100 is implemented, the similarity matrix needs to be established. Specifically, as shown in fig. 2, establishing the similarity matrix includes the following steps:
step S001: a plurality of preset images are obtained.
The preset images are previously captured images; they are prepared so that local feature points can be extracted in the next step.
Step S002: and extracting a plurality of local characteristic points of the preset image, and clustering the extracted local characteristic points to generate a visual dictionary.
Extracting local feature points from an image generally includes two steps: local feature point detection and local feature description. Local feature point detection uses a suitable mathematical operator to detect the positions or regions of extreme points of the gradient distribution in the image; the regions corresponding to the extreme points contain rich visual information, and the corresponding feature vectors have strong distinguishing and descriptive power. Currently, the main local feature point detection operators include the SIFT, SURF, ORB, MSER, Harris-Affine, and Hessian-Affine operators. After the local regions corresponding to the local feature points are determined, effective local feature descriptions, generally high-dimensional vectors, need to be generated.
Local feature points represent the low-level visual characteristics of an image and are widely used in image content analysis. However, image local feature points mostly live in a high-dimensional space (for example, the SIFT descriptor is 128-dimensional and the SURF descriptor is 64-dimensional), which is inconvenient for storage and subsequent computation. In addition, high-dimensional vectors usually suffer from the "curse of dimensionality" (sparsity, noise, and so on), which causes algorithms that perform well in low-dimensional spaces to deteriorate sharply in high-dimensional spaces. Therefore, it is necessary to map the high-dimensional local features of images into a low-dimensional space for storage, indexing, and computation. A large number of local feature points are mapped into the low-dimensional space and the codes corresponding to the local feature points are obtained; these codes are called visual words, and all the visual words form a visual dictionary.
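A toy version of this mapping, with a minimal k-means standing in for a production bag-of-words pipeline and 2-D points standing in for 128-dimensional SIFT descriptors (the initialisation and iteration count are illustrative choices):

```python
import numpy as np

def build_visual_dictionary(descriptors, k, iters=20):
    # minimal k-means: the k cluster centres act as the visual words,
    # and each descriptor is coded by the index of its nearest centre
    descriptors = np.asarray(descriptors, dtype=float)
    init = np.linspace(0, len(descriptors) - 1, k).astype(int)
    centres = descriptors[init].copy()
    for _ in range(iters):
        dists = np.linalg.norm(descriptors[:, None, :] - centres[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centres[j] = descriptors[labels == j].mean(axis=0)
    return centres, labels

def bow_vector(labels, k):
    # fixed-length visual-word histogram for one image
    return np.bincount(labels, minlength=k).astype(float)

# two well-separated blobs of toy "descriptors"
pts = np.vstack([np.random.default_rng(0).normal(0.0, 0.1, (5, 2)),
                 np.random.default_rng(1).normal(10.0, 0.1, (5, 2))])
centres, labels = build_visual_dictionary(pts, k=2)
```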
Step S003: and converting the preset images into a plurality of visual dictionary vectors with the same dimensionality according to the visual dictionary.
Specifically, the preset images are converted into visual dictionary vectors of the same dimension, for example through the OpenCV and Boost libraries in a C++ implementation.
Step S004: computing the distances among the visual dictionary vectors. For example, as shown in the table below, the pairwise distances between the vectors S1, S2, S3, S4, …, Sn-1, Sn are computed, and the similarity values representing the similarity relations between the images are obtained from the distances, yielding a similarity matrix.
Steps S001 to S004 constitute the dictionary-training stage of the technical route.
The similarity matrix is specifically shown in the following table:
        S1     S2     S3     S4    …    Sn-1   Sn
S1     1.00   0.82   0.91   0.16   …    0.25   0.77
S2     0.82   1.00    …      …     …     …      …
S3     0.91    …     1.00    …     …     …      …
S4     0.16    …      …     1.00   …     …      …
…       …      …      …      …     …     …      …
Sn-1   0.25    …      …      …     …    1.00    …
Sn     0.77    …      …      …     …     …     1.00
As the table shows, the similarity values range from 0 to 1. It should be noted that the higher the similarity value in the table, the more similar the two images, and when the row number equals the column number, the value is the similarity of an image with itself, i.e. 1; the similarity matrix of the images is therefore a symmetric matrix with a diagonal of 1. Similarity values are filled in at the positions corresponding to each row and column number until the similarity values of all images have been traversed.
Specifically, in an implementation manner of this embodiment, the step S100 includes the following steps:
step S101: and acquiring a disordered image set to be participated in three-dimensional reconstruction and the pre-trained visual dictionary.
The unordered image set is all the preset images in the past, and visual words are extracted from the preset images so as to be combined into a visual dictionary.
Step S102: and extracting a similarity matrix in the pre-trained visual dictionary.
The similarity matrix is shown in the table above, and the similarity vector of each image can be read from it.
Step S103: and extracting the similarity vector of each image in the unordered image set according to the similarity matrix.
Extracting the similarity vector yields the corresponding similarity values.
As shown in fig. 1, an embodiment of the present invention provides a method for adaptive searching of a matchable image, where the method further includes the following steps:
step S200: and sequencing according to the similarity value of the similarity vector to obtain the sequenced similarity vector.
In an implementation manner of the embodiment of the present invention, the similarity values of the similarity vectors are arranged in descending order according to a certain row or a certain column of the matrix. Since the higher the similarity value is, the stronger the similarity between the images is, in the process of searching the target image, the searching is started from the image with the strongest similarity until the image with the weakest similarity is searched. All images are retrieved in this manner until the end.
Specifically, for example, in the second column of the similarity matrix, the numerical values corresponding to the similarity values are: 1.00, 0.82, 0.91, 0.16,. Copy., 0.25, 0.77, ordered by similarity value from large to small: 1.00, 0.91, 0.82, 0.77, 0.25, 0.16, resulting in a descending similarity vector ranking.
The embodiment arranges the data in the order from large to small. Because the higher the similarity value is, the stronger the similarity between the images is, in the process of searching the target image, the searching is started from the image with the strongest similarity; the problem of redundant or too few retrieval results caused by inaccurate threshold in the current similarity measurement method is avoided.
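The sorting in this example can be sketched directly; the values are the column quoted above:

```python
sims = [1.00, 0.82, 0.91, 0.16, 0.25, 0.77]  # second-column values from the example
ranked = sorted(sims, reverse=True)

# in practice the original image indices must follow the sort as well
order = sorted(range(len(sims)), key=lambda i: sims[i], reverse=True)
```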
As shown in fig. 1, an embodiment of the present invention provides a method for adaptive searching of a matchable image, where the method further includes the following steps:
step S300: and substituting the similarity value of the sorted similarity vector into a high-order polynomial function for calculation, and analyzing a function coefficient to obtain the high-order polynomial function fitting the image similarity distribution curve.
Specifically, in an implementation manner of this embodiment, the step S300 includes the following steps:
step S301: substituting the similarity value of the sorted similarity vector into a high-order polynomial function for calculation, and resolving a function coefficient:
Figure 813222DEST_PATH_IMAGE003
wherein y is a similarity value, x is a sequenced image sequence number, and a, b, c and d are constant term coefficients; n represents a power, and the value range of n can be taken as required, which is not limited in this embodiment.
As shown in fig. 1, an embodiment of the present invention provides a method for adaptive searching of a matchable image, where the method further includes the following steps:
step S400: and calculating an inflection point value of the high-order polynomial function, and calculating an adaptive threshold value of the image according to the inflection point value.
Specifically, in an implementation manner of this embodiment, the step S400 includes the following steps:
step S401: the step of calculating the inflection point value of the high-order polynomial function and calculating the self-adaptive threshold value of the image according to the inflection point value comprises the following steps:
and carrying out derivation according to the high-order polynomial function, namely:
Figure 119351DEST_PATH_IMAGE004
setting the derivative function to be zero, thereby calculating an x value, wherein the x value is the inflection point value of the high-order polynomial function, and thus obtaining a point of violent change in the similarity value; and substituting the inflection value into the high-order polynomial function to calculate a y value, wherein the y value is the self-adaptive threshold of the image.
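Steps S300 and S400 together can be sketched as follows, assuming a cubic fit of the form y = a·x³ + b·x² + c·x + d; the synthetic test curve is generated from a known cubic so the recovered stationary point is checkable, and taking the earliest stationary point as the knee is an illustrative choice:

```python
import numpy as np

def adaptive_threshold(sorted_sims, degree=3):
    # fit y = a*x**3 + b*x**2 + c*x + d to the sorted similarity values,
    # set the derivative to zero to find the knee x, and evaluate the
    # fitted polynomial there to obtain the image's adaptive threshold
    x = np.arange(1, len(sorted_sims) + 1, dtype=float)
    poly = np.polynomial.Polynomial.fit(x, sorted_sims, degree).convert()
    roots = poly.deriv().roots()
    real = [r.real for r in roots
            if abs(r.imag) < 1e-9 and 1.0 <= r.real <= len(sorted_sims)]
    if not real:                      # no interior stationary point: fall back
        return float(min(sorted_sims))
    knee = min(real)                  # earliest stationary point as the knee
    return float(poly(knee))

# synthetic curve from a known cubic whose derivative vanishes at x = 5 and x = 15
xs = np.arange(1, 21, dtype=float)
ys = 0.0002 * xs**3 - 0.006 * xs**2 + 0.045 * xs + 0.9
threshold = adaptive_threshold(ys)
```

On real data the fitted polynomial only approximates the curve, so the recovered knee and threshold are approximate as well.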
As shown in fig. 1, an embodiment of the present invention provides a method for adaptive searching of a matchable image, where the method further includes the following steps:
step S500: and searching the images according to the self-adaptive threshold value of the calculated images, and outputting similar image results and corresponding images according to the searching results.
Specifically, in an implementation manner of this embodiment, the step S500 includes the following steps:
step S501: and sequentially judging whether the similarity value of each similarity vector is larger than the corresponding adaptive threshold value or not according to the sequence from large to small.
Because the similarity value is higher, the similarity of the two images is stronger, and the similarity values are sequenced from large to small, the most similar images can be searched by the staff in the first time, and meanwhile, the screening mechanism is optimized.
Step S502: and selecting all similarity vectors which are larger than the adaptive threshold value, and outputting matched images to the selected similarity vectors.
When a similarity value is larger than the adaptive threshold, the image pair is guaranteed to have high similarity.
Step S503: all similarity vectors less than or equal to the adaptive threshold are selected and the selected similarity vectors are excluded.
When a similarity value is less than or equal to the adaptive threshold, it indicates that the image pair is insufficiently similar.
Step S100 to step S500 are image retrieval stages of the technical route of the present invention.
In the invention, step S300 yields a high-order polynomial function fitting the image similarity distribution curve; step S400 differentiates this function to obtain the inflection point at which the similarity values change most sharply, together with the adaptive threshold of each image, and uses them as the judgment basis of the similarity measure. This solves the problem of redundant or too few retrieval results caused by inaccurate thresholds in current similarity measure methods.
In the practical application process of the steps S100 to S500 of the present embodiment, as shown in fig. 4, the method includes the following steps:
step S011: acquiring a disordered image set to be participated in three-dimensional reconstruction, and extracting a similarity vector of each image according to a pre-trained visual dictionary;
step S012: sorting according to the similarity value of the similarity vectors to obtain sorted similarity vectors;
step S013: substituting the similarity values of the sorted similarity vectors into a high-order polynomial function for calculation, and analyzing a function coefficient to obtain the high-order polynomial function fitting the image similarity distribution curve;
step S014: calculating an inflection point value of the high-order polynomial function, and calculating an adaptive threshold value of the image according to the inflection point value;
step S015: repeating the steps S011 to S014 until the self-adaptive threshold values of all the images are obtained;
step S016: judging whether the similarity value between the image pairs is larger than a self-adaptive threshold value or not;
step S017: if the similarity value between the image pairs is larger than the adaptive threshold, the similarity degree of the image pairs is higher, and the images with the similarity value larger than the adaptive threshold are screened out, so that an image retrieval result is obtained;
step S018: if the similarity value between the image pairs is less than or equal to the self-adaptive threshold, the similarity degree of the image pairs is low, and the images with the similarity value less than or equal to the self-adaptive threshold are screened out and excluded.
In multi-view-image-based three-dimensional reconstruction, high-precision local feature operators such as SIFT or SURF are adopted, which keeps the extracted image feature points robust to rotation, scale, brightness, affine distortion, noise, and the like. Despite these advantages, describing an image through its local information makes it easy, during image retrieval, to return image pairs that share similar local visual content but do not actually contain same-name image points; such pairs may be called retrieval gross errors. Retrieval gross errors cause wrong image pairs to participate in feature point matching during subsequent image feature matching, wasting computing resources and degrading the precision of the subsequent point clouds and models. Therefore, if the retrieval gross errors caused by similar local visual content can be eliminated, the precision of the matchable-image retrieval results can be further improved, wrong image feature matches can be reduced, and the overall effect of three-dimensional reconstruction can be improved.
To avoid this situation when data are collected in the field, other objects around the building (such as lawns, trees, street lamps, and vehicles) can be photographed to increase the distinctiveness of the images. The essential idea is that image pairs sharing same-name image points also share this additional visual content, whereas gross-error image pairs gain no such shared content. Therefore, based on this idea of adding other visual content during field collection to avoid matching errors caused by local visual content, this embodiment further proposes that if two images have many common similar images, the retrieval gross errors between images caused by the similarity of local visual content can be eliminated by counting that shared visual content.
To address the problem of image retrieval gross errors, this embodiment further provides gross error elimination on the basis of the matchable-image adaptive retrieval method, so as to improve the precision of the matchable-image retrieval results and avoid wasting computer resources.
As shown in fig. 3, in another implementation manner of this embodiment, the method for adaptively retrieving a matchable image further includes the following steps:
step S600: counting the number of similarity values larger than the adaptive threshold in the similarity matrix with the same sequence number to obtain the number of common partners, until all similarity vectors have been traversed; setting a specified value and, if the number of common partners is larger than or equal to the specified value, regarding the two images as a similar image pair with same-name image points.
The greater the number of common partners, the more similar feature points the image pair shares, and the stronger the similarity of the image pair.
Of course, it should be understood that the specified value is an integer of at least 3.
Specifically, an example is made according to the similarity matrix:
for the similar image pair i and j, whose similarity vectors are Si and Sj respectively, the number of positions at which the similarity values of vectors Si and Sj under the same sequence numbers are larger than the adaptive threshold is counted.
Correspondingly, for example, the similarity values in the similarity matrix with the same sequence number S3 × S3 are counted, namely: 1.00, 0.82, 0.91, 0.82, 1.00, 0.73, 0.91, 0.73, 1.00. If the adaptive threshold of the image is 0.77, then 7 similarity values are greater than the adaptive threshold, so the number of common partners is 7; the similarity vectors of all images are traversed in this way. Further, if the specified value is set to 3, the number of common partners is greater than the specified value, and the two images are regarded as a similar image pair with same-name image points.
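The worked example above can be checked in a few lines. This sketch simply counts the entries of the S3 × S3 sub-matrix that exceed the adaptive threshold 0.77 (the function and variable names are illustrative, not from the patent):

```python
# Reproducing the S3 x S3 example: count similarity values above the
# adaptive threshold; the count is the number of common partners.

s3 = [
    [1.00, 0.82, 0.91],
    [0.82, 1.00, 0.73],
    [0.91, 0.73, 1.00],
]

def common_partners(matrix, threshold):
    """Count similarity values in the matrix that exceed the threshold."""
    return sum(1 for row in matrix for v in row if v > threshold)

n = common_partners(s3, 0.77)      # 7 of the 9 entries exceed 0.77
is_similar_pair = n >= 3           # specified value 3 -> similar image pair
```

With the specified value set to 3, the count of 7 marks the images as a similar image pair with same-name image points, as in the text.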
Step S700: if the number of common partners is smaller than the specified value, the pair is regarded as a retrieval gross error of the image and is removed.
For example, taking the similarity matrix with the same sequence number S4 × S4: if the adaptive threshold of the image is calculated to be 0.88, and 5 similarity values (including 1.00, 0.91, and 1.00) are greater than the adaptive threshold, then the number of common partners is 5. Further, if the specified value is set to 6, the number of common partners is smaller than the specified value, so the pair is regarded as a retrieval gross error of the image and is removed.
Of course, it should be understood that the greater the number of common partners between two images, the higher the similarity between the two images, and such images are regarded as a similar image pair with same-name image points; otherwise, the pair is regarded as a retrieval gross error and removed.
Step S600 to step S700 are gross error elimination stages of the technical route.
According to the method, through steps S600 to S700, the number of common similar images between images is counted on the basis of the similarity matrix and the adaptive threshold and used as the criterion for identifying retrieval gross errors, thereby avoiding gross-error retrieval results caused by the similarity of local visual content.
In the practical application process of the steps S600 to S700 of the present embodiment, as shown in fig. 5, the method includes the following steps:
step S019: obtaining a similar image pair and an image similarity matrix of image retrieval and an adaptive threshold;
step S020: for the similar image pairs i and j, the similarity vectors are respectively Si and Sj;
step S021: counting the number of vectors Si and Sj with the similarity values of the same sequence numbers larger than the self-adaptive threshold value, namely the number of common partners;
step S022: judging whether the number of the common partners is larger than or equal to a specified value or not;
step S023: if the number of common partners is larger than or equal to the specified value, the two images share many common points and are regarded as a similar image pair with same-name image points, yielding the final retrieval result after gross errors are removed;
step S024: if the number of common partners is smaller than the specified value, the two images share few common points and the pair is regarded as a retrieval gross error; images whose number of common partners is smaller than the specified value are eliminated.
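Steps S019 to S024 can be sketched as follows, under the interpretation that a common partner is a sequence number at which both similarity vectors Si and Sj exceed the adaptive threshold; the function names and the default specified value of 3 are assumptions for illustration, not the patented implementation:

```python
# Sketch of the gross-error elimination stage (steps S019-S024):
# count sequence numbers where BOTH vectors exceed the threshold,
# then compare the count against the specified value.

def count_common_partners(si, sj, threshold):
    """Number of positions where both Si and Sj exceed the threshold."""
    return sum(1 for a, b in zip(si, sj) if a > threshold and b > threshold)

def is_true_match(si, sj, threshold, specified=3):
    """Keep pair (i, j) only if it has enough common similar images;
    otherwise the pair is treated as a retrieval gross error."""
    return count_common_partners(si, sj, threshold) >= specified
```

A pair surviving this filter has enough shared visual context to be kept as a similar image pair; a pair failing it is removed as a retrieval gross error.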
In the embodiment of the invention, through the three stages of dictionary training, image retrieval, and gross error elimination, the technical problems in the prior art of redundant or too few retrieval results and of retrieval gross errors caused by similar local visual content are solved, the precision of the matchable-image results is improved, and the overall efficiency of three-dimensional reconstruction is further improved.
Exemplary device
Based on the above embodiments, the present invention further provides a self-adaptive searching device for matchable images, and a schematic block diagram thereof can be shown in fig. 6.
The adaptive retrieval device for the matchable images comprises: the system comprises a processor, a memory, an interface, a display screen and a communication module which are connected through a system bus; wherein, the processor of the image self-adaptive searching device is used for providing calculation and control capability; the memory of the image self-adaptive retrieval device capable of being matched comprises a storage medium and an internal memory; the storage medium stores an operating system and a computer program; the internal memory provides an environment for the operation of an operating system and a computer program in the storage medium; the interface is used for connecting external equipment, such as mobile terminals, computers and the like; the display screen is used for displaying corresponding combined navigation information based on deep learning; the communication module is used for communicating with a cloud server or a mobile terminal.
The computer program is used for realizing the self-adaptive searching method of the matchable image when being executed by a processor.
It will be understood by those skilled in the art that the schematic block diagram shown in fig. 6 is only a block diagram of a part of the structure related to the solution of the present invention, and does not constitute a limitation of the matchable image adaptive search device to which the solution of the present invention is applied.
In one embodiment, an adaptive image retrieval device is provided, which includes: a memory and a processor; the memory stores a matchable image adaptive retrieval program, which when executed by the processor is configured to implement the operations of the matchable image adaptive retrieval method as described above.
In one embodiment, a storage medium is provided, which is a computer readable storage medium, and the storage medium stores a matchable image adaptive retrieval program, and the matchable image adaptive retrieval program is used for implementing the operation of the matchable image adaptive retrieval method as described above when being executed by a processor.
It will be understood by those skilled in the art that all or part of the processes of the methods of the above embodiments may be implemented by hardware related to instructions of a computer program, which may be stored in a non-volatile storage medium, and when executed, may include the processes of the embodiments of the methods described above. Any reference to memory, storage, databases, or other media used in embodiments provided herein may include non-volatile and/or volatile memory.
The invention discloses a self-adaptive retrieval method, a self-adaptive retrieval device and a storage medium for a matchable image, wherein the method comprises the following steps: acquiring a disordered image set to be participated in three-dimensional reconstruction, and extracting a similarity vector of each image according to a pre-trained visual dictionary; sorting according to the similarity value of the similarity vector to obtain a sorted similarity vector; substituting the similarity value of the sorted similarity vector into a high-order polynomial function for calculation, and analyzing a function coefficient to obtain the high-order polynomial function fitting the image similarity distribution curve; calculating an inflection point value of the high-order polynomial function, and calculating a self-adaptive threshold value of the image according to the inflection point value; and searching the images according to the self-adaptive threshold value of the calculated images, and outputting similar image results and corresponding images according to the searching results. The invention can search out the image pairs with the same name image points from the disordered images, reduces unnecessary matching between the unrelated image pairs and improves the overall efficiency of three-dimensional reconstruction.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims (6)

1. An adaptive retrieval method for a matchable image, the adaptive retrieval method for a matchable image comprising:
acquiring a disordered image set to be participated in three-dimensional reconstruction, and extracting a similarity vector of each image according to a pre-trained visual dictionary;
the acquiring a disordered image set to participate in three-dimensional reconstruction and extracting a similarity vector of each image according to a pre-trained visual dictionary comprises the following steps:
acquiring a disordered image set to be participated in three-dimensional reconstruction and the pre-trained visual dictionary;
extracting a similarity matrix in the pre-trained visual dictionary;
extracting a similarity vector of each image in the unordered image set according to the similarity matrix;
converting a plurality of preset images into a plurality of visual dictionary vectors with the same dimensionality according to the visual dictionary;
setting the distance between a plurality of visual dictionary vectors, and obtaining similarity values of all representation image similarity relations according to the distance to obtain a similarity matrix;
the value range of the similarity value is between 0 and 1; when the row number and the column number of the similarity matrix are equal, the similarity value is that of an image with itself, namely 1, and the similarity matrix is a symmetric matrix with a diagonal of 1;
sorting according to the similarity value of the similarity vector to obtain a sorted similarity vector;
substituting the similarity value of the sorted similarity vector into a high-order polynomial function for calculation, and analyzing a function coefficient to obtain the high-order polynomial function fitting the image similarity distribution curve;
calculating an inflection point value of the high-order polynomial function, and calculating an adaptive threshold value of the image according to the inflection point value;
the calculating the inflection point value of the high-order polynomial function and calculating the self-adaptive threshold of the image according to the inflection point value comprises the following steps:
performing derivation according to the high-order polynomial function, setting the derivative to zero, calculating an inflection point value, and substituting the inflection point value into the high-order polynomial function to calculate the adaptive threshold of the image;
performing image retrieval according to the self-adaptive threshold value of the image obtained by calculation, and outputting a similar image result and a corresponding image according to a retrieval result;
the image retrieval is carried out according to the self-adaptive threshold value of the image obtained by calculation, and a similar image result and a corresponding image are output according to a retrieval result, and then the method comprises the following steps:
counting the number of the similarity values larger than the self-adaptive threshold value in the similarity matrix with the same sequence number to obtain the number of common partners until all the similarity vectors are traversed;
setting a specified value according to the number of the common partners, and if the number of the common partners is larger than or equal to the specified value, regarding the two images as a similar image pair with the same name image point;
the adaptive retrieval method for the matchable image further comprises the following steps:
and if the number of the common partners is smaller than the specified value, the common partners are regarded as rough retrieval of the image and are removed.
2. The adaptive retrieval method for matchable images according to claim 1, wherein the obtaining a disordered set of images to be involved in three-dimensional reconstruction and extracting the similarity vector of each image according to a pre-trained visual dictionary comprises:
acquiring a plurality of preset images;
and extracting a plurality of local feature points of the preset image, and clustering the extracted local feature points to generate a visual dictionary.
3. The adaptive retrieval method for matchable images according to claim 1, wherein the step of substituting the similarity values of the sorted similarity vectors into a high-order polynomial function to calculate, and analyzing the function coefficients to obtain the high-order polynomial function fitting the image similarity distribution curve comprises:
arranging a plurality of similarity values in a descending order;
substituting the similarity value of the sorted similarity vector into a high-order polynomial function for calculation, and analyzing a function coefficient:
y = ax³ + bx² + cx + d
wherein y is a similarity value, x is a sequenced image sequence number, and a, b, c and d are constant term coefficients;
and obtaining a high-order polynomial function of the image similarity distribution curve according to the analyzed function coefficient.
4. The adaptive retrieval method for matchable images according to claim 1, wherein the image retrieval according to the adaptive threshold of the calculated image and outputting the similar image result and the corresponding image according to the retrieval result comprises:
sequentially judging whether the similarity value of each similarity vector is larger than the corresponding self-adaptive threshold value or not according to the sequence from large to small;
selecting all similarity vectors which are larger than the self-adaptive threshold value, and outputting matched images to the selected similarity vectors;
all similarity vectors less than or equal to the adaptive threshold are selected and the selected similarity vectors are excluded.
5. An adaptive searching device for matchable images, comprising: a memory and a processor; the memory stores a matchable image adaptive retrieval program, which when executed by the processor is configured to implement the operation of the matchable image adaptive retrieval method according to any of claims 1-4.
6. A storage medium, which is a computer-readable storage medium, and which stores a matchable image adaptive search program, when being executed by a processor, for implementing the operation of the matchable image adaptive search method according to any one of claims 1-4.
CN202210645024.7A 2022-06-09 2022-06-09 Self-adaptive retrieval method and device capable of matching images and storage medium Active CN114722226B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210645024.7A CN114722226B (en) 2022-06-09 2022-06-09 Self-adaptive retrieval method and device capable of matching images and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210645024.7A CN114722226B (en) 2022-06-09 2022-06-09 Self-adaptive retrieval method and device capable of matching images and storage medium

Publications (2)

Publication Number Publication Date
CN114722226A CN114722226A (en) 2022-07-08
CN114722226B true CN114722226B (en) 2022-11-15

Family

ID=82233081

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210645024.7A Active CN114722226B (en) 2022-06-09 2022-06-09 Self-adaptive retrieval method and device capable of matching images and storage medium

Country Status (1)

Country Link
CN (1) CN114722226B (en)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7227969B2 (en) * 2018-05-30 2023-02-22 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ Three-dimensional reconstruction method and three-dimensional reconstruction apparatus
CN109902190B (en) * 2019-03-04 2021-04-27 京东方科技集团股份有限公司 Image retrieval model optimization method, retrieval method, device, system and medium
CN112926695A (en) * 2021-04-16 2021-06-08 动员(北京)人工智能技术研究院有限公司 Image recognition method and system based on template matching

Also Published As

Publication number Publication date
CN114722226A (en) 2022-07-08

Similar Documents

Publication Publication Date Title
CN109410321B (en) Three-dimensional reconstruction method based on convolutional neural network
CN110955780B (en) Entity alignment method for knowledge graph
US8798357B2 (en) Image-based localization
CN110458175B (en) Unmanned aerial vehicle image matching pair selection method and system based on vocabulary tree retrieval
CN111027140B (en) Airplane standard part model rapid reconstruction method based on multi-view point cloud data
CN113255714A (en) Image clustering method and device, electronic equipment and computer readable storage medium
CN109919084B (en) Pedestrian re-identification method based on depth multi-index hash
CN110751027B (en) Pedestrian re-identification method based on deep multi-instance learning
CN104199842A (en) Similar image retrieval method based on local feature neighborhood information
CN110969648A (en) 3D target tracking method and system based on point cloud sequence data
CN112328715A (en) Visual positioning method, training method of related model, related device and equipment
Yan et al. Geometrically based linear iterative clustering for quantitative feature correspondence
CN113592015B (en) Method and device for positioning and training feature matching network
CN113936214A (en) Karst wetland vegetation community classification method based on fusion of aerospace remote sensing images
CN111241326B (en) Image visual relationship indication positioning method based on attention pyramid graph network
CN116817887B (en) Semantic visual SLAM map construction method, electronic equipment and storage medium
CN112734818A (en) Multi-source high-resolution remote sensing image automatic registration method based on residual error network and SIFT
CN114722226B (en) Self-adaptive retrieval method and device capable of matching images and storage medium
CN114913330B (en) Point cloud component segmentation method and device, electronic equipment and storage medium
CN107578069B (en) Image multi-scale automatic labeling method
CN113743251B (en) Target searching method and device based on weak supervision scene
CN112651590B (en) Instruction processing flow recommending method
CN111104922B (en) Feature matching algorithm based on ordered sampling
CN113160291A (en) Change detection method based on image registration
CN113420699A (en) Face matching method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant