CN104200240B

CN104200240B - A kind of Sketch Searching method based on content-adaptive Hash coding

Info

Publication number: CN104200240B
Application number: CN201410493545.0A
Authority: CN
Inventors: 赵龙; 梁爽; 贾金原
Original assignee: Individual
Current assignee: Individual
Priority date: 2014-09-24
Filing date: 2014-09-24
Publication date: 2017-07-07
Anticipated expiration: 2034-09-24
Also published as: CN104200240A

Abstract

The invention discloses a kind of Sketch Searching method based on content-adaptive Hash coding, it is characterised in that including step in detail below：Constrained based on appearance constraints and diversity, sketch or profile diagram to being retrieved choose candidate window according to its content-adaptive for feature extraction, realize that the information that whole image is included is evenly distributed in each window；Conspicuousness detection method based on key point detects the conspicuousness of characteristic window；Hash algorithm based on local sensitivity combines the local visual feature of sketch or profile diagram, conspicuousness and structure space feature, is compiled into feature Hash codes；The feature Hash codes of sketch or profile diagram are indexed, the similarity between sketch is measured by calculating the Hamming distance between feature Hash codes, and by similarity it is high return to user.Can have precision higher, wider adaptability and stronger matching capacity using the present invention.

Description

Sketch retrieval method based on content adaptive hash coding

Technical Field

The invention relates to the field of image processing, in particular to a sketch retrieval method based on content adaptive hash coding.

Background

In recent years, searching using sketches (e.g., visual material information such as hand-drawn figures, pictures, and three-dimensional models) as input has been a research focus in the field of computer vision. This is because, with the increasing popularity of touch screen devices, people prefer to use gestures, touch pens, and other methods to complete input and interaction of various information with a computer, and such methods can better express the intention of a user and are very simple to operate. Meanwhile, various retrieval tasks are performed in a hand-drawing sketch mode, so that strong convenience is provided for users using handheld input devices (such as Apple iPhone/iPad, Microsoft Surface and other various tablet computers).

In various sketch interaction tasks, the matching problem of different visual information carriers such as sketches, hand-drawn graphs, pictures, 3D models and the like plays an important role in the whole sketch interaction computing field, and the matching problem is also a basic research problem of a sketch-based retrieval method. In order to obtain the result of the perception and the visual perception of the same person as much as possible in the retrieval task, an efficient algorithm is needed to accurately and quickly measure the similarity between the input sketch and the retrieved information, and the research of the retrieval method based on the sketch aims to solve the problem. From the practical value, on one hand, the retrieval method based on the sketch is closely related to the application of the practical engineering technology, and not only has strong field applicability, but also is a plurality of main core technologies based on the sketch application. On the other hand, it also has a strong scientific research value, which explores the fundamental method of computer cognition to create visual patterns.

In a conventional search task, a sketch is usually regarded as a set of a series of hand-drawn strokes, so as to represent information such as a skeleton and an outline of an object. However, other appearance details possessed by the object, such as color, texture, and the like, are inevitably lost in the process of being converted into a sketch. Due to this characteristic, the sketch-based retrieval method is very different from the conventional picture-based retrieval algorithm. For some time recently, research efforts in this area have been trying to answer a question: how can feature information contained in each sketch be reasonably extracted and compared?

Recently, segmentation-based methods are widely used, and such algorithms have proven to be very effective in the field of sketch retrieval and recognition. They typically reduce computational complexity by segmenting the strokes contained in the sketch during implementation. And then, extracting the attributes of topology, geometric shape and the like of the divided strokes. However, accurate stroke segmentation for sketches is difficult to do, especially for pictures or 3D models that contain natural landscapes. This is because the contour features of such visual information usually contain much noise, and perfect stroke segmentation is an almost impossible task to accomplish. Thus, it is this shortcoming that greatly limits the applicability of one class of methods based on stroke segmentation to be used for sketch-based retrieval of pictures or 3D models.

Other research works are based on the traditional picture search algorithm, and search tasks are completed by comparing the similarity of sketches through regional features (patch). First, a sketch is divided equally into small regions, from which different visual feature descriptors are extracted. In general, such methods cover a whole sketch with a grid with overlapping regions or dense windows of equal size, so as to describe the feature distribution of the sketch. This is very suitable for common pictures, because a picture often contains abundant detailed features such as color or texture, but is not suitable for sparse outline images such as sketches containing only limited strokes. In this case, most of the information contained in a sketch will be contained in a few very salient image areas, leaving the remaining areas almost empty. Due to the extreme imbalance, many invalid feature descriptors are generated, which greatly reduce the effectiveness of the calculation after participating in the feature similarity calculation, and moreover, because the descriptors need to be binarized before indexing them by using the hash algorithm, the situation is further worsened because only visual information is further lost.

Another problem of the search method based on the regional characteristics is that the calculation method of the similarity between the sketches is weak and not strictly accurate. A hand-drawn sketch usually contains various line strokes instead of colors or textures to represent different objects, so that the sketch is greatly different from a traditional picture in two aspects: large intra-class differences (because each sketch renderer has more or less his own subjective understanding of the same east and west) and small inter-class differences (due to the lack of important visual information such as color or texture). This results in even a sketch of the same subject containing a large number of dissimilar regional features. Therefore, "similar pictures often contain a large amount of similar regional features", this concept widely used in picture retrieval algorithms is not accurate for sketch-based retrieval methods, and its definition constraint on similarity is too strict and thus becomes inefficient.

Therefore, in view of the above analysis, it can be seen that the existing sketch-based retrieval technology is not perfect, and needs to be improved and developed. The invention researches a retrieval method based on regional characteristics, because the method has higher applicability compared with a method based on stroke segmentation, and intensively solves two problems in the retrieval method based on regional characteristics.

Disclosure of Invention

The invention aims to provide a sketch retrieval method based on content self-adaptive Hash coding, and aims to solve the problems that the existing sketch retrieval method based on Hash coding does not take account of the content distribution characteristics of a sketch when extracting visual features, the adopted method can greatly reduce the effectiveness of calculation, and the calculation method for the similarity between sketches is weak and not strict and accurate.

The technical scheme of the invention is as follows: a sketch retrieval method based on content adaptive hash coding comprises the following specific steps:

step A: based on appearance constraint and diversity constraint, selecting candidate windows for feature extraction in a self-adaptive manner according to the content of the retrieved sketch or outline map, and realizing that the information contained in the whole image is uniformly distributed in each window;

and B: detecting the significance of the characteristic window based on a significance detection method of the key points;

and C: based on a local sensitive hash algorithm, combining local visual features, saliency and structural space features of a sketch or a contour map to compile a feature hash code;

step D: indexing the characteristic hash codes of the sketch or the outline graph, measuring the similarity between the sketch by calculating the Hamming distance between the characteristic hash codes, and returning the result with high similarity to the user.

The sketch retrieval method can also be used for retrieving the picture and the 3D model, and the picture and the 3D model are preprocessed and converted into the contour line images before being retrieved.

The sketch retrieval method comprises the following steps of converting the picture and the 3D model into the contour line image: calculating a salient outline image of the picture by combining an edge extraction algorithm and a saliency detection algorithm of the picture; for the 3D model, a contour projection graph corresponding to the model is calculated according to a matching algorithm based on a visual angle.

The sketch retrieval method comprises a candidate window selection algorithm for feature extraction: firstly, setting a grid of initialized n x n for an input sketch; uniformly sampling m × m initial seeds from the grid; HOG features h are then computed for the windows in each grid_i(ii) a Finally, global HOG characteristics are calculated

The sketch retrieval method is characterized in that the appearance constraint is marked as C_appSpecifically, it is represented as: c_app(h):＝F_app(h)≥k_app×F_app(H)

Wherein, thereinF_appIs an objective equation of appearance constraint, when F_appWhen the calculated value is higher, it indicates that the feature window contains more visual feature information.

The sketch retrieval method, wherein the diversity constraint is marked as C_varSpecifically, it is represented as: c_var(h):＝F_var(h)≤k_var×F_var(H)

Wherein, thereinF_varIs a diversity-constrained objective equation, if F_varHas a low value of and F_appHigher indicates that the value of each dimension in the window feature vector h is higher.

The sketch retrieval method comprises the following specific steps of detecting the significance of a feature window by a key point-based significance detection method: first, using a Harris-Laplace detector as a sketch saliency extraction tool, for each feature window w_iDefinition of its significance k_iComprises the following steps:

wherein, Number (S)_i) Is shown in the characteristic window w_iThe number of the significant points extracted by using a Harris-Laplace detector is used; area (w)_i) Is w_iIncluding the number of pixels.

The sketch retrieval method comprises the following steps of: let f_iFor the purpose of drawing from the feature window w_iThe extracted feature vector is first binarized into vectorThen, following the calculation process of similar hash algorithm, corresponding to each windowAnd k_iCalculating the value to obtain a characteristic hash code of the window; then, dividing the sketch into two parts in the horizontal direction and the vertical direction respectively to obtain four separated space positions, and performing hash coding on candidate windows positioned on each space position respectively; and finally, sequentially splicing the hash codes on the four spatial positions end to obtain the characteristic hash code representing the whole sketch.

In the sketch retrieval method, the feature descriptor can also be a scale-invariant feature descriptor or a local linear Gabor feature descriptor.

According to the sketch retrieval method, HOG characteristics without normalization processing are selected to describe visual information contained in each sub-window.

The invention has the beneficial effects that: according to the method, the content distribution condition of the characteristics of the sketch or the contour map is taken into consideration in the process of extracting the characteristics of the sketch or the contour map, so that all the characteristics are uniformly distributed in the candidate window as much as possible, and the extracted characteristic vector can better represent the characteristics of a single sketch. Compared with a retrieval algorithm based on stroke segmentation, the characteristics are less influenced by factors such as noise, texture and the like contained in pictures and 3D model contour diagrams; compared with other retrieval methods based on Hash, the algorithm provided by the invention has higher precision, wider adaptability and stronger matching capability. The application of the method can be popularized to other research fields which take Hash coding on the characteristic information as a key technology, and the method has general significance.

Drawings

FIG. 1 is a schematic representation of the results of a search over a variety of data sets using the present invention.

Fig. 2a, 2b, 2c are schematic diagrams of sketch feature window selection strategies.

Fig. 3a, 3b, and 3c are schematic diagrams of calculation results of sketch saliency windows.

FIG. 4 is a flow chart of a sketch region feature hashing algorithm.

Fig. 5 is a schematic diagram of the intermediate result of calculating the picture contour image.

Fig. 6a, 6b are graphs comparing the performance of different components of the method.

Fig. 7a, 7b are graphs comparing performance for searches using different algorithms.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples.

The sketch-based retrieval algorithm provided by the invention mainly comprises the following three components: combining the self characteristics of the input sketch, and selecting a feature extraction window in a self-adaptive manner according to two types of constraint conditions; adaptively detecting the significance of each characteristic window by using the key point information contained in the window; the two types of feature information are combined with the structural information of the sketch, and the two types of feature information are coded into a hash code through an LSH algorithm to create a feature index. The present invention will be described hereinafter in the following sections of this specification as well.

As shown in fig. 2a, given a sketch, it is first split into a uniform grid of n x n. Then, as shown in fig. 2b, m × m points are uniformly selected at the intersection points of these grids as initial seeds for generating all feature windows, and then, a set Δ w (x, y, i) of sub-windows is defined for the seed points at the position of the sketch (x, y), where Δ w (x, y, i) represents all the ith circles of sub-windows around the seeds. As shown in fig. 2c, where the black dots represent the selected seed point, the 4 nearest sub-windows surrounding the seed point are the 1 st circle of sub-windows x surrounding the seed point, which is denoted as Δ w (x, y, 1); all the sub-windows surrounding and closest to the 1 st circle of sub-windows are the 2 nd circle of sub-windows surrounding the seed pointDenoted Δ w (x, y, 2).

To finally generate a window w (x, y) of the appropriate size for each initial seed, Δ w (x, y, i) needs to be added to w (x, y) iteratively in turn until w (x, y) satisfies a certain conditional constraint indicating that it contains sufficient characteristic information, or when w (x, y) becomes an illegal window, e.g., the window has overflowed the edge of the entire sketch or it becomes too large (greater than one-fourth of the entire sketch).

Therefore, the selection and design of conditional constraints become especially important, which is directly related to the quality of the finally selected feature window, and if the constraints are too strict, most of the windows contain insufficient feature information; on the contrary, the feature window becomes too large, so that a large number of regions are repeatedly contained in the windows, so that the feature information becomes too redundant, and the calculation amount of feature similarity comparison in the next step is increased. The present invention proposes the following two effective constraint strategies.

Because the HOG (Histogram of Oriented Gradients) features of an image have been widely used in the field of computer vision research such as object detection and image retrieval, and have achieved a good effect, the method chooses the HOG features without normalization processing to describe the visual information contained in each sub-window. In the standard HOG feature calculation process, normalization processing is often adopted to remove the influence of light and shadow on the image, and sketches are both black-white two-color images formed by strokes and backgrounds and have no illumination transformation, so that the use of unnormalized HOG features does not influence the accuracy of feature extraction, and the calculation speed is accelerated to a certain extent due to the reduction of calculated amount. Notation vector h ═ b₁,b₂,…,b_nThe HOG features extracted from the window w (x, y) are used as the HOG features, and the delta h (x, y, i) is the HOG feature vector of the window set corresponding to the delta w (x, y, i); note H as the HOG characteristic H vector of the whole sketch, because the histogram is additive, when calculating the HOG characteristic vector H of all sub-windows in the sketch_iThe HOG histogram feature H of the whole graph can then be obtained directly by adding up these values, which is then defined asThe appearance constraint of the sketch feature window is defined as follows, denoted C_appIt is defined as follows:

C_app(h):＝F_app(h)≥k_app×F_app(H) the system of equation 1

Wherein, thereinF_appIs an objective equation of appearance constraint, which essentially calculates the mean of the HOG features h. When F is present_appA higher calculated value indicates that the feature window contains more visual feature information, whereas a lower F_appThe value indicates that the entire window is almost empty. Obviously, the constraint C_appEnsures that each feature window needs to be congestedAnd the visual characteristic information is abundant enough.

The second constraint proposed by the method is called diversity constraint and is denoted as C_varIt is defined as follows:

C_var(h):＝F_var(h)≤k_var×F_var(H) the system of equation 2

Wherein, thereinF_varIs a diversity-constrained objective equation, which is actually the variance of the HOG features h. If F_varHas a low value of and F_appA higher value indicates a higher value for each dimension in the window feature vector h. Thus, satisfy C_varThe constrained window will contain more diverse features than a single direction line or segment, etc., which has proven to be very useful in sketch retrieval problems.

Notably, the parameter k_appAnd k_varThe influence of the sketch global HOG characteristic H on two constraints is controlled. Through specific experiments, k is found to be_appAnd k_varSetting to 0.8 and 1, respectively, will enable best search performance. After the above two types of conditional constraints are defined, taking the position of each initial seed as a starting point, iteratively adding surrounding sub-windows into the initial seed until each window meets the two types of conditional constraints, thereby completing the selection of the feature candidate window. The algorithm following the window selection strategy is recorded as a window conditional constraint selection algorithm, and the work flow of the whole algorithm is summarized as follows:

step a: initializing a grid of n x n for the input sketch;

step b: uniformly sampling m × m initial seeds from the grid;

step c: computing HOG features h for windows in each grid_i；

Step d: the global HOG feature H is calculated.

The specific algorithm is as follows:

since it was mentioned above that "two conditional constraints" are to be used "iteratively" to generate candidate windows, a simple and intuitive implementation example is provided here in pseudo code to help the user understand how to use two conditional constraints.

Notably, constraint C_appTarget equation F_appIs incrementally computable, that is to say it satisfies the equation:

F_app(h+Δh)＝F_app(h)+F_app(Δ h) · (3) · equation 3

As the window grows continuously, only F of the feature vector corresponding to the growing part of the sub-windows is needed_appAdding values to the results of the previous iteration without having to recalculate F over a new window resulting from each iteration_appThe value of (c). Thus, it is verified during each iteration whether the feature window satisfies the constraint C_appIs very rapid. Although F_varCannot be incrementally calculated, but until each feature candidate window is satisfying constraint C_appIt does not have to be calculated before, so in summary, the entire characteristic window of the method is calculated relatively quickly.

In experiments, it can be seen that, no matter a scale-invariant Feature descriptor, a Local linear-based Feature descriptor (GALIF) or the aforementioned HOG Feature descriptor is selected, the Feature window selection strategy proposed above in the present invention can significantly improve the retrieval accuracy.

After the selection of the sketch feature windows is completed, the method also provides a method for detecting the significance of each window in the input sketch. For sketch, to characterize its local features, it is more effective to use feature point-based saliency extraction algorithms, such as multi-scale gaussian model, Hessian algorithm, and Harris-Laplace detector, than to use region-based saliency extraction algorithms. This is due to the fact that a sketch typically contains many separate lines and dots, unlike a picture that contains continuous regions. The feature point-based saliency extraction algorithm is designed to find salient points in an image, and is therefore very suitable for detecting nodes or inflection points in a sketch, which are also the key information contained in the sketch. In the invention, the Harris-Laplace detector is found to have better performance in detecting key points of the sketch through experiments, and therefore, the Harris-Laplace detector is used as the sketch significance extraction method.

For each candidate window W belonging to W_i(where W is the result returned by Algorithm 1, described above, and is the set of all selected feature windows), its significance is defined as k_iThe formula is as follows:

wherein, Number (S)_i) Is shown in the characteristic window w_iThe number of the significant points extracted by using a Harris-Laplace detector is used; area (w)_i) Is w_iIncluding the number of pixels. Intuitively, it can be seen that: a window is more prominent when it is smaller and contains more salient points.

It should be noted that, in order to prevent the saliency of the feature window from being too sensitive to its size and resolution, the above equation uses the quadratic root term of the area at the pixel level of the window to characterize its size. FIG. 3 illustrates intermediate results of detecting feature window saliency for a sketch, wherein FIG. 3a is an example of an input sketch; the open circles in FIG. 3b represent all the keypoints found in this sketch using the Harris-Laplace detector; fig. 3c illustrates the most significant candidate feature window with the top 3 mutually overlapping areas less than 20% as calculated by equation 4 in a block.

After computing the visual descriptors of the candidate feature windows and their saliency, these contents need to be organically integrated in order to enable these information to be effectively used in similarity comparison and indexing. The similarity measurement algorithm based on the Hash can effectively retrieve a large-scale data set, and simultaneously ensures extremely high calculation speed, so that the method uses the locally sensitive Hash algorithm to encode the extracted features, and the specific process is as follows. In the implementation process of the method, the HOG is used as a window characteristic description algorithm.

Let f_iFor the purpose of drawing from the feature window w_iThe feature vector extracted in (f) is obtained by extracting f_iThe first 40% of the bits with the highest value are set as 1, and the rest are set as-1, i.e. f can be set_iBinarized into vectorsIt should be noted that, thanks to the two feature window selection constraints proposed by the present algorithm, it is further effectively ensured that each window can contain enough feature information, so that the loss of information is greatly reduced in the binarization process. Then, the invention refers to the calculation process of similar hash algorithm (Sim-hash), and passes through the corresponding window of each windowAnd k_iAnd calculating the value to obtain the characteristic hash code of the window.

In actual calculation, the window significance k is calculated_iAsThe weight of (c). A sketch contains spatial information that has proven to be a very useful class of features in the field of sketch retrieval. Therefore, in order to include the spatial relationship of the local features in the sketch into the feature hash code, first, the sketch is divided into two parts in the horizontal and vertical directions, respectively, to obtain four separated spatial positions, as shown in the flow (a) in fig. 4. Then, the candidate windows located at each spatial position are respectively hash-coded according to the above-mentioned method, and the whole process is shown in flows (b) to (e) in fig. 4. And finally, sequentially splicing the hash codes on the four spatial positions end to end, and obtaining the characteristic hash code representing the whole sketch as shown in the flow (f). Given any two sketches, pictures or 3D model projected outline images, the hamming distance (HammingDistance) between their feature hash codes is their similarity.

The following section introduces specific implementation details of the present invention, including pre-processing of pictures and 3D models prior to retrieval, specific setting of parameters, feature data indexing scheme, and the entire query process.

The sketch is usually a black and white image only containing contour lines, the picture often contains rich colors and various textures, and the 3D model is a patch set in a three-dimensional space, and obviously, the three data features are quite different, and they cannot be directly compared and retrieved. Therefore, before the search process starts, in order to enable the user to search the picture and the 3D model by using the sketch as an input, the two types of information need to be preprocessed and converted into the sketch-like outline image.

For a given picture, first, a contour map E of the entire picture is calculated using Canny or other picture edge extraction algorithms_cHowever, it is clear that the contour image inevitably contains many false edge lines due to background texture. In order to find the main content (i.e. the portion of the picture desired to be retrieved) expressed by the picture, the saliency detection algorithm of the picture is used to mark the portion of the contentAnd the corresponding saliency map is denoted as S. Then, the method also uses a Maximum Filter (MF) to perform a filtering process on S, so that the significant region of the picture can be slightly enlarged, and the loss of the outer contour of the picture main content due to the segmentation error of the algorithm is avoided. Finally, the salient outline E of the picture can be calculated according to the following formula, and the calculation process is shown in fig. 5.

For a given 3D model, the invention computes the profile projection map corresponding to the model according to the View-based matching (View-based matching) algorithm proposed in the paper "Sketch-based shape Retrieval" by Mathias Eitz et al. In the calculation process, in order to ensure the stability of the projection extraction result, the method adopts the Lloyd relaxation algorithm to uniformly sample about 14 projection visual angles from the corresponding surrounding spherical surface of each 3D model through loop iteration. And, heuristic Contours (Suggestive Contours) are used to extract appropriate contour lines from the model projection graph.

Then, whether the image salient profile graph calculated by the method or the projection profile graph of the 3D model is obtained, the image salient profile graph or the projection profile graph of the 3D model is cut out from the original result by using a minimum square bounding box and is scaled to the resolution of 160 × 160, so that the influence of the image size and deformation on the retrieval result is reduced. Next, according to the calculation process of algorithm 1, the input sketch, picture or model profile is divided into 80 × 80 grids, from which 15 × 15 seeds are uniformly sampled. In each feature sub-window, an unnormalized HOG feature histogram containing 8 directions is computed as h in the constraint, so that each feature vector contains 8 dimensions. All feature windows are then scaled to 16 x 16 blocks of pixels for subsequent visual feature extraction. Finally, since the histogram of the HOG features that have not been normalized before is calculated, after normalizing them, the HOG features can be used to describe the feature vector f for each feature window.

Given any sketch as input, measuring the Hamming distance of the characteristic hash codes between the outline diagrams of all sketches, pictures or models in the database and the outline diagrams, and sequencing from small to large, the sketch, the picture or the 3D model which is most similar to the sketch in the database can be found. Since all hash codes are binarized, the hamming distance between them can be calculated very quickly by simple shift and or operation, and the retrieval speed can be very high even if the feature data is not indexed. In order to further accelerate retrieval performance, the characteristic data is indexed according to the invention combined with a Hamming spatial Hash complex indexing algorithm (Fastsearch in Hamming Space with Multi-Index Hash) proposed by Norouzi et al, so that the method can complete each query request in sub-linear time. The flow of the whole search method of the invention is summarized as follows:

firstly, generating a proper contour map E for all pictures or 3D models in a database; secondly, generating candidate characteristic windows w for all the contour maps E according to a conditional constraint selection algorithm; calculating corresponding HOG characteristics f and significance k for each characteristic window w; then generating a characteristic hash code h for each contour map E according to all HOG characteristics f and significance k in the contour map E; and finally, creating an index I for all the characteristic hash codes h.

Calculating the characteristic hash code h of the input sketch S according to the second step to the fourth step of the retrieval process_s(ii) a Calculating the query result R from the index I and returning the result R to the user

In order to support and verify the research method and key technology provided by the invention, the method is compared with other latest and most advanced sketch-based retrieval algorithms respectively on three widely used standard data sets. The Magic Sketch dataset was created by Liang et al, which contained a total of 1100 sketches, each divided into 55 categories according to what was drawn. These sketches are all based on the MPEG-CE1 trademark image database, UKPTO trademark database of the british trademark patent office, refrigerator appliance diagrams and engineering drawings, representative shapes of which are selected and plotted by 10 persons, the data set being used to verify the validity of the various components of the algorithm proposed by the method. The TU Berlin dataset was constructed by Eitz et al, which contained 31 different topics, each with an example sketch and its corresponding 40 test pictures. The data set is used to verify the validity of the method when used in a sketch-based picture retrieval task. The PSB data set is also created by Eitz et al, and contains a sketch that is easily identifiable for each and corresponds to a class of 3D models in a Princeton Shape Benchmark (PSB). Thus, the data set may be used to verify the validity of the method when used in a sketch-based 3D model retrieval task.

In order to evaluate the performance of the sketch-based retrieval technology provided by the invention, the method adopts Recall (Recall), Precision (Precision) and dataset Benchmark Score (Benchmark Score) as performance evaluation indexes. In order to evaluate the existing method more intuitively, the relation between the Recall or Precision and the size of the candidate graph result set is given in the form of a curve (PR curve). As the number of returned results increases, the precision ratio will gradually increase and the precision ratio will gradually decrease, which is determined by the precision ratio and the calculation method of the precision ratio. When the window of the number of the returning structures is increased, the higher the denominator of the precision ratio is, the lower the precision ratio is; for the recall ratio, the larger the window is, the more the number of returned related results is, the larger the numerator of the recall ratio is, and the larger the value of the recall ratio is. Obviously, in the recall ratio graph and the precision ratio graph, the higher the curve is, the better the retrieval effect is, because the higher the curve is, the higher the corresponding recall ratio or precision ratio is in the case of returning the same number of candidate vector graphs; and vice versa. The data set Benchmark score is proposed by Mathias Eitz et al in the paper "Sketch-Based Image Retrieval: Benchmark and Bag-of-FeaturesDescriptors" as another index for evaluating the performance of data Retrieval algorithms. The index judges the quality of the retrieval algorithm by comparing Kendall grade correlation coefficients between ranking results obtained by the retrieval algorithm and artificially labeled real results after given query input under the same retrieval data set. The higher the benchmark score obtained by the search algorithm is, the closer the search result of the algorithm is to the cognitive result of human beings. In addition, the invention tests the time taken by the retrieval algorithm to complete a retrieval task to evaluate the runtime performance of the proposed solution.

Two constraints C for selecting a feature window proposed for verifying the present algorithm_app(formula 1) and C_var(equation 2) effectiveness, first, an algorithm baseline for feature extraction Using overlapping grids (Grid) is implemented according to the algorithm proposed by Konstantinos Bozas in the paper "Large Scale Sketch Based image retrieval Using Patch Hashing". Then, use it with constraint C only_appAnd simultaneous use of constraint C_appAnd C_varThe present algorithm of (a) was compared. Feature window significance k proposed for validating the present algorithm_iValidity of, in the above algorithm implementation, k_iAre all set to 1 and all C are used together_app、C_varAnd k_iThe implementation of (c) was compared. FIG. 6a shows the search performance achieved by the various algorithms on the Magic Sketch dataset as described above, from which it can be seen that the two feature window selection constraints proposed by the present algorithm are complementary, and that C is removed_app、C_varAnd k_iAny one of the components will cause the retrieval performance of the algorithm to be reduced. Therefore, C proposed by the algorithm_app、C_varAnd k_iIs one-out-of-the-box, each component has its effectiveness.

In addition, in order to further show the performance characteristics of the method, the retrieval algorithm implementation after feature extraction is carried out on a uniform grid by using SIFT, GALIF and HOG feature descriptors is respectively compared with the algorithm implementation of feature extraction on the feature extraction area selected by the algorithm by using the SIFT, GALIF and HOG feature descriptors. The PR curve of fig. 6b shows the comparison result of the above algorithm implemented on the Magic Sketch dataset. From the figure, it can be found that the result of feature extraction on the feature extraction area selected by the algorithm is better than that on the uniform grid no matter what kind of feature descriptors are used, thereby proving the universality of the method for different feature descriptors. It is noted that, thanks to the GALIF feature descriptor, a multi-directional and multi-scale feature sampling strategy is used to replace the traditional histogram strategy for characterizing the visual features, so that it can be found from the figure that the best retrieval performance can be achieved by combining the method and the GALIF feature descriptor. However, to ensure the availability of the algorithm, the HOG feature descriptors are still used to compute the visual features of the sketch in a standard implementation of the method, since computing the GALIF feature descriptors incurs a significant time overhead.

At the end of the experiment, the algorithm was also compared to other leading edge sketch-based retrieval algorithms. On the TUBerlin dataset, the algorithm was compared with a search algorithm using Bag-of-Words (BW), Key Shapes (KW), and minimum hash (Min-hash, MH), and the dataset benchmark scores were used to judge the performance of the search algorithm, with the results shown in table 1. In addition, on the Magic Sketch data set, the algorithm is compared with an algorithm based on biased svm (bsvm), geometric relationships (SR), and geometric Proximity (SP); on the PSB dataset, the present algorithm was compared with a search algorithm based on Diffusion Tensor (DT), Grid SIFT (SIFT-Grid), and GALIF (GALIF-Grid). On both data sets, a standard PR curve was used to evaluate the retrieval performance, with the results shown in fig. 7a and 7b, respectively. The above experimental results show that compared with other retrieval algorithms, the method has higher retrieval precision and better performance. Referring to fig. 1, examples of the retrieval results of the method on different data sets are respectively shown.

TABLE 1 reference scores for different algorithms

The Sketch-based retrieval algorithm provided by the invention takes about 1.87 seconds to complete a Sketch retrieval task on a Magic Sketch standard data set. This result was actually measured on a desktop computer equipped with an Intel 3.39GHz quad CPU, 16GB memory, with the implementation code programmed using MATLAB and without parallel optimization. It is worth noting that the method has higher retrieval precision compared with other algorithms on the premise that the basic real-time requirement can still be met.

It is to be understood that the invention is not limited to the examples described above, but that modifications and variations may be effected thereto by those of ordinary skill in the art in light of the foregoing description, and that all such modifications and variations are intended to be within the scope of the invention as defined by the appended claims.

Claims

1. A sketch retrieval method based on content adaptive hash coding is characterized by comprising the following specific steps:

step D: indexing the characteristic hash codes of the sketch or the outline graph, measuring the similarity between the sketch by calculating the Hamming distance between the characteristic hash codes, returning the result with high similarity to the user,

in step B, the method for detecting the saliency of the feature window based on the saliency detection method of the key point includes: first, using a Harris-Laplace detector as a sketch saliency extraction tool, for each feature window w_iDefinition of its significance k_iComprises the following steps:

2. The sketch retrieval method of claim 1, wherein the pictures and the 3D models are retrieved, and wherein the pictures and the 3D models are preprocessed to convert them into contour images before being retrieved.

3. The sketch retrieval method of claim 2, wherein the method of converting the picture and the 3D model into the outline image is as follows: calculating a salient outline image of the picture by combining an edge extraction algorithm and a saliency detection algorithm of the picture; for the 3D model, a contour projection graph corresponding to the model is calculated according to a matching algorithm based on a visual angle.

4. The sketch retrieval method of claim 1, wherein a candidate window selection algorithm for feature extraction: firstly, the methodSetting a grid of initialization n x n for the input sketch; uniformly sampling m × m initial seeds from the grid; HOG features h are then computed for the windows in each grid_i(ii) a Finally, global HOG characteristics are calculated

5. A sketch retrieval method as claimed in claim 1 or 3 wherein said appearance constraint is denoted C_appSpecifically, it is represented as: c_app(h):＝F_app(h)≥k_app×F_app(H)

Wherein,h represents the HOG direction gradient histogram feature vector corresponding to the window, H represents the HOG feature vector corresponding to the whole input sketch, n represents the dimension of the HOG feature vector H, b represents the dimension of the HOG feature vector_iRepresenting the value, k, in the ith dimension of the feature vector h_appIs a predefined coefficient for controlling the global parameter F_app(H) Influence of (A) F_appIs an objective equation of appearance constraint, when F_appWhen the calculated value is higher, it indicates that the feature window contains more visual feature information.

6. The sketch retrieval method as claimed in claim 5, wherein said diversity constraint is denoted as C_varSpecifically, it is represented as: c_var(h):＝F_var(h)≤k_var×F_var(H)

Wherein,h represents the HOG direction gradient histogram feature vector corresponding to the window, H represents the HOG feature vector corresponding to the whole input sketch, n represents the dimension of the HOG feature vector H, b represents the dimension of the HOG feature vector_iRepresenting the value, k, in the ith dimension of the feature vector h_varTo be predeterminedCoefficient of sense for controlling the global parameter F_var(H) Influence of (A) F_varIs a diversity-constrained objective equation, if F_varHas a low value of and F_appHigher indicates that the value of each dimension in the window feature vector h is higher.

7. The sketch retrieval method as claimed in claim 1, wherein the method of compiling the characteristic hash code comprises: let f_iFor the purpose of drawing from the feature window w_iThe extracted feature vector is first binarized into vectorThen, following the calculation process of similar hash algorithm, corresponding to each windowAnd k_iCalculating the value to obtain a characteristic hash code of the window; then, dividing the sketch into two parts in the horizontal direction and the vertical direction respectively to obtain four separated space positions, and performing hash coding on candidate windows positioned on each space position respectively; and finally, sequentially splicing the hash codes on the four spatial positions end to obtain the characteristic hash code representing the whole sketch.

8. The sketch retrieval method of claim 4, wherein the feature descriptor is further selected from a scale-invariant feature descriptor or a local linear Gabor feature descriptor.

9. The sketch retrieval method of claim 4, wherein the HOG features without normalization processing are selected to describe the visual information contained in each sub-window.