CN106649624B - Local feature point verification method based on global relationship consistency constraint - Google Patents

Local feature point verification method based on global relationship consistency constraint

Info

Publication number
CN106649624B
Authority
CN
China
Prior art keywords
feature
points
image
point
feature points
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611109737.2A
Other languages
Chinese (zh)
Other versions
CN106649624A (en)
Inventor
姚金良 (Yao Jinliang)
杨醒龙 (Yang Xinglong)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Electronic Science and Technology University
Original Assignee
Hangzhou Electronic Science and Technology University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Electronic Science and Technology University filed Critical Hangzhou Electronic Science and Technology University
Priority to CN201611109737.2A priority Critical patent/CN106649624B/en
Publication of CN106649624A publication Critical patent/CN106649624A/en
Application granted granted Critical
Publication of CN106649624B publication Critical patent/CN106649624B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/5866 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211 Selection of the most significant subset of features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/23 Clustering techniques
    • G06F18/232 Non-hierarchical techniques
    • G06F18/2321 Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213 Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/46 Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462 Salient features, e.g. scale invariant feature transforms [SIFT]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Library & Information Science (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a local feature point verification method based on a global relationship consistency constraint. The method comprises three parts: offline learning, local feature point quantization, and feature point voting verification. Offline learning constructs the visual vocabulary dictionary. Local feature point quantization comprises three steps: 1. extracting local feature points; 2. quantizing the feature descriptors; 3. quantizing the principal direction, scale, and position. The voting verification part provides two methods: weak relationship consistency verification and strong geometric verification. Both verify candidate feature points with a voting mechanism and follow similar steps: 1. obtain candidate images and candidate feature points; 2. verify the candidate feature points by voting. The invention is robust to transformations such as image cropping, rotation, and scaling, and can be used in visual-vocabulary-based image retrieval and classification.

Description

Local feature point verification method based on global relationship consistency constraint
Technical Field
The invention belongs to the field of computer image processing and machine vision, and relates to two local feature point verification methods based on visual vocabularies.
Background
Image retrieval based on local features is currently the mainstream approach to copy image retrieval: local feature descriptors are quantized into visual words and images are represented with a bag-of-words model, an important method in current image retrieval. However, unlike words in natural language, the visual words obtained by quantizing the descriptors of local features carry no definite meaning and are easily affected by noise. To preserve the discriminative power of visual words, a larger dictionary is generally preferred; but more visual words weaken noise immunity and increase the computation required when local features are quantized into visual words. Some researchers have studied the ambiguity introduced by quantizing local features into visual words and proposed partial solutions.
In order to increase the accuracy of local feature matching and reduce the ambiguity of visual words, researchers have focused on increasing the information content of visual words to improve their recognizability. Quantizing local features with multiple dictionaries can improve the recall rate of image retrieval, but the quantizations of multiple dictionaries overlap, and this intersection is redundant data. A large amount of redundant data not only fails to improve the retrieval effect but also hurts retrieval efficiency. To address this, Zheng proposed a Bayesian combination method over multiple dictionaries to reduce the correlation between dictionaries and reduce redundant data. Mortensen appended a global texture vector to each local feature point so that local features carry global information. This improves the discriminative power of visual words, but the method is not robust enough to scale changes of the image.
Researchers have also sought to improve the descriptive power of visual words by modeling the spatial dependencies of local features. Yao proposed a method for generating context descriptors of visual words, which considers the relationships between a feature point and its neighboring feature points and, during image detection, computes the contextual similarity between visual words to decide whether they are correctly matched. Liu proposed two encoding methods based on the one-to-one and one-to-many relationships between local feature points. Both belong to the class of prior-verification methods: the relationships between feature points are established before image detection, which gives a clear advantage in retrieval speed. However, such methods based on neighboring local features often ignore the global relationships between feature points.
To achieve global robustness, researchers have in recent years considered global constraints on local features. Zheng found that, during image retrieval, correctly matched visual phrases in the query image and a candidate image have a relatively stable positional relationship: for example, when the coordinates of matched visual phrases are subtracted, the correctly matched points fall into a relatively concentrated region. Zhou used a compact spatial coding method to describe the relative positions of visual words. However, this method does not handle rotation of the image well; positional relationships must be constructed in multiple directions to improve robustness to rotation.
Aiming at the low matching accuracy caused by the ambiguity introduced when local features are quantized into visual words, the two methods of the invention use global relationships between local features to enhance the discriminative power of visual words. The methods meet the requirements of compactness and robustness, handle various edits and transformations of the image, and achieve good results.
Disclosure of Invention
The invention aims to provide two local feature point verification methods based on global relationship consistency constraints, overcoming the defects of the prior art.
The technical scheme adopted by the invention for solving the technical problems is as follows:
1. The local feature point verification method based on the global relationship consistency constraint comprises three parts: (1) an offline learning part, (2) a feature point quantization part, and (3) a feature point voting verification part. The offline learning part constructs the visual vocabulary dictionary; the feature point quantization part quantizes local features according to the visual vocabulary dictionary obtained by offline learning; the feature point voting verification part verifies the feature points in the candidate images. The method is realized as follows:
Step (1): perform offline learning, grouping and clustering a large number of samples to obtain a visual vocabulary dictionary.
Step (2): quantize the feature points of the query image with the visual vocabulary dictionary to obtain visual words.
Step (3): match the visual words of the query image against the index database to obtain candidate feature points, and group the candidate feature points by their unique image identifiers to obtain several candidate images.
Step (4): verify the feature points through the global weak relationship consistency constraint or the strong geometric relationship constraint, thereby verifying the candidate images.
(1) The off-line learning part is characterized by constructing a visual vocabulary dictionary of local features, and comprises the following specific steps:
1-1, selecting a large number of images to construct an image library, extracting local feature points and feature descriptors of the images in the image library, and constructing the extracted feature descriptors into a sample library;
1-2, obtaining a visual vocabulary dictionary through a sample library; specifically, feature vectors of feature descriptors in a sample library are grouped, K class centers are obtained on each feature group through K mean value clustering, each class center is a feature vector and represents a root word in a visual vocabulary, and the K class centers are root word sets of the feature groups; combining the root sets constructed on each feature group to obtain a visual vocabulary dictionary;
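As a concrete illustration of steps 1-1 and 1-2, the following is a minimal Python sketch of the grouped dictionary construction (illustrative, not the patent's code); it assumes the sample library is an (N, D) NumPy array of descriptors and uses scikit-learn's KMeans for the per-group clustering:

```python
import numpy as np
from sklearn.cluster import KMeans

def build_dictionary(descriptors: np.ndarray, M: int = 4, K: int = 64):
    """Build a grouped visual vocabulary dictionary (steps 1-1 and 1-2).

    descriptors: (N, D) sample library of local feature descriptors.
    M: number of feature groups; each group covers D // M dimensions.
    K: number of class centers (root words) learned per group.
    Returns a list of M arrays of shape (K, D // M), one root set per group.
    """
    N, D = descriptors.shape
    group_dim = D // M
    dictionary = []
    for g in range(M):
        # Cluster the g-th slice of every descriptor into K class centers.
        block = descriptors[:, g * group_dim:(g + 1) * group_dim]
        centers = KMeans(n_clusters=K, n_init=10, random_state=0).fit(block)
        dictionary.append(centers.cluster_centers_)  # root set of group g
    return dictionary
```

For the embodiment described later (32-dimensional SIFT descriptors, 4 groups of 8 values, 64 roots per group), this would be called as build_dictionary(samples, M=4, K=64).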
(2) The feature point quantization part comprises two parts: quantization of the local feature descriptors, and quantization of the principal direction, scale and coordinates.
2-1. Local feature descriptor quantization: extract the local feature point set S = {P_i, i ∈ [0, Q]} from the input image, where Q is the number of local feature points in the input image and P_i is the i-th local feature point; each local feature point P_i is then quantized into a visual word VW_i by grouped quantization according to the visual vocabulary dictionary. The specific steps are as follows:
2-1-1. Extract, for each local feature point P_i of the input image, its feature descriptor F_i, position (Px_i, Py_i), scale σ_i and principal direction θ_i; that is, the local feature point P_i is represented as [F_i, θ_i, σ_i, Px_i, Py_i];
2-1-2. For each local feature point P_i, obtain a visual word from its feature descriptor F_i by the grouped quantization method according to the visual vocabulary dictionary. Grouped quantization divides the feature descriptor F_i into M groups of D/M features each, where D is the dimensionality of the feature vector of F_i; the feature vector of each group j is then quantized independently into a root index V_j according to the visual vocabulary dictionary trained in step 1-2, and the visual word VW_i of the feature descriptor F_i obtained by grouped quantization is:

$$VW_i = \sum_{j=0}^{M-1} V_j \cdot L^j$$

where L is the number of root words of the corresponding group in the visual vocabulary dictionary. A local feature point P_i is thereby represented as [VW_i, θ_i, σ_i, Px_i, Py_i]. The quantization of each group's feature vector is realized by searching, based on Euclidean distance, the nearest class center in the root set of the group and taking the subscript of that class center as the quantization result.
2-2. Quantization of the principal direction, scale and coordinates: the principal direction θ_i is a floating-point radian value and is quantized here to an integer angle value: θ_i = round(θ_i · 180/π).
Similarly, the position information (Px_i, Py_i) and the scale σ_i are quantized to integers; when quantizing the scale, σ_i is multiplied by 100 before rounding, which preserves some precision.
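A minimal Python sketch of this grouped quantization and attribute quantization (the helper name and tuple layout are illustrative assumptions), reusing the dictionary layout from the sketch above:

```python
import math
import numpy as np

def quantize_feature(F, theta, sigma, px, py, dictionary):
    """Quantize one local feature point [F, theta, sigma, Px, Py] (steps 2-1, 2-2).

    F: (D,) feature descriptor; dictionary: list of M (L, D // M) root sets.
    Returns (VW, theta, sigma, Px, Py) with every attribute quantized to an integer.
    """
    M = len(dictionary)
    L = dictionary[0].shape[0]                 # root words per group
    group_dim = F.shape[0] // M
    VW = 0
    for j in range(M):
        block = F[j * group_dim:(j + 1) * group_dim]
        # Nearest class center by Euclidean distance; its subscript is V_j.
        v_j = int(np.argmin(np.linalg.norm(dictionary[j] - block, axis=1)))
        VW += v_j * L ** j                     # combine groups: VW = sum_j V_j * L^j
    theta_q = int(round(theta * 180.0 / math.pi))  # radians -> integer degrees
    sigma_q = int(round(sigma * 100))              # keep two decimals of the scale
    return VW, theta_q, sigma_q, int(round(px)), int(round(py))
```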
(3) The feature point voting verification part is realized in two ways: the first is local feature point verification based on the global weak relationship consistency constraint; the second is local feature point verification based on the global strong geometric relationship consistency constraint. Both adopt a voting mechanism: in a candidate image, if a candidate feature point and a certain proportion of its context feature points simultaneously satisfy the constraint relation, the point is considered a correctly matched feature point.
During feature point verification, whether feature points match is decided mainly from the position, principal direction and scale attributes of the local feature points. The two ways are presented below:
3-1. Weak relationship consistency verification: this method is based on the principle that the relative order of the principal directions and of the scales is consistent between correctly matched feature points. The specific steps are as follows:
3-1-1. Match the visual words obtained by quantizing the query image's feature points against the feature points in the index database to obtain a large number of candidate feature points. Build a hash table keyed by the image IDs (imgId) of the feature points to find the candidate images.
3-1-2. In the query image, if the scale of a feature point is smaller than that of one of its context feature points, then in the candidate image the corresponding candidate feature points should satisfy the same ordering; at the same time, the principal direction should satisfy the same condition:

$$M_{i,j}=\begin{cases}1, & \operatorname{sign}(Scl_{ai}-Scl_{aj})=\operatorname{sign}(Scl_{bi}-Scl_{bj})\ \text{and}\ \operatorname{sign}(Ori_{ai}-Ori_{aj})=\operatorname{sign}(Ori_{bi}-Ori_{bj})\\ 0, & \text{otherwise}\end{cases}$$

where Scl denotes scale, Ori denotes principal direction, subscripts ai, aj denote the i-th and j-th feature points of image a (the query image), and bi, bj the corresponding feature points of image b (the candidate image). M_{i,j} = 1 indicates that the two feature points i, j satisfy the consistency constraint, and the vote count of feature point j for feature point i is increased by 1. If a feature point satisfies the constraint with a certain proportion of its context feature points, the candidate feature point is considered a correct match. During image retrieval, the vote sum S obtained by the correctly matched feature points is computed:

$$S=\sum_{i}\mathbb{1}\left(V_i \ge Th\right),\qquad V_i=\sum_{j} M_{i,j}$$

where V_i is the number of votes obtained by feature point i and Th is the voting threshold. The vote sum S of a candidate image is used to measure the degree of similarity of that candidate image.
3-2. Strong geometric verification: strong geometric verification computes two angular relationships between feature points, the principal direction difference and the angular position difference. Of the two angles used by the method, the principal direction difference is the difference between the principal directions of the two feature points:

β = |Ori_i − Ori_j|

where Ori_i is the principal direction of the point i to be verified and Ori_j is the principal direction of the context feature point j of point i.
The angular position difference can be computed as:

α = |arctan2(P_i, P_j) − Ori_i|

where arctan2(P_i, P_j) is the angle between the line connecting the two feature points P_i, P_j and the horizontal direction. The strong geometric relationship verification method comprises the following steps:
3-2-1. Match the visual words obtained by quantizing the query image's feature points against the feature points in the index database to obtain a large number of matched feature points. Build a hash table keyed by the image IDs (imgId) of the feature points to find the candidate images.
3-2-2. Between correctly matched feature points, the principal direction difference between two feature points of the query image and the principal direction difference between the corresponding two feature points of the candidate image are approximately equal; likewise, the angular position differences should be approximately equal. Accordingly, the difference of principal direction differences between corresponding feature points,

Dif_Ori_{i,j} = |β_{aij} − β_{bij}|,

where β_{aij} denotes the principal direction difference of feature points i and j in image a and β_{bij} that of the corresponding feature points in image b, and the difference of angular position differences,

Dif_Dir_{i,j} = |α_{aij} − α_{bij}|,

where α_{aij} denotes the angular position difference of feature points i and j in image a and α_{bij} that of the corresponding feature points in image b, should both approach 0. This gives the constraint

$$M_{i,j}=\begin{cases}1, & Dif\_Ori_{i,j}\le T_\beta\ \text{and}\ Dif\_Dir_{i,j}\le T_\alpha\\ 0, & \text{otherwise}\end{cases}$$

where T_β and T_α are small tolerance thresholds. If the two points i and j satisfy the constraint relation, M_{i,j} equals 1. Whether a candidate feature point is a correctly matched feature point is decided by voting: if a feature point satisfies the relation with a certain proportion of the context feature points in the image, it is considered a correctly matched feature point. The correct matching points in each candidate image are counted, and the vote sum S obtained by the correctly matched feature points is computed:

$$S=\sum_{i}\mathbb{1}\left(V_i \ge Th\right),\qquad V_i=\sum_{j} M_{i,j}$$

where V_i is the number of votes obtained by feature point i. The vote sum S of a candidate image is used to measure the degree of similarity of that candidate image.
Compared with the prior art, the invention has the following beneficial effects:
the method can be used for large-scale image retrieval, and the retrieval efficiency and accuracy are improved; in addition, the method adopts the global relation of the local feature points to verify the candidate image, so when the candidate feature points of the candidate image are too many, the feature points with larger scale in the candidate image can be adopted, and the experimental result proves that the effect is better and the speed is higher than that of using all the candidate feature points. Meanwhile, the method has good robustness on image transformation such as scaling, rotation, cutting and the like of the image.
Drawings
FIG. 1 shows a flow chart of the present invention;
FIG. 2 is a diagram of weak relationship consistency verification;
FIG. 3 is a schematic illustration of strong geometry verification;
Detailed Description
In this implementation the algorithm uses the SIFT operator; in the following description, the descriptor of a local feature point refers to the SIFT descriptor unless otherwise indicated. The first method is local feature point verification based on the global weak relationship consistency constraint; the second is local feature point verification based on the global strong geometric relationship consistency constraint. The methods can be used in image retrieval and in image identification and detection methods based on local feature points.
Embodiments of the present invention will be further described with reference to the accompanying drawings.
Fig. 1 is the flow chart of the invention, showing each part and its flow. The two methods comprise the following parts: an offline learning part, a feature point quantization part, and a feature point voting verification part. The offline learning part constructs the visual vocabulary dictionary, including obtaining sample points, clustering the sample points, and generating the dictionary. Feature point quantization comprises three parts: local feature point extraction; quantization of the principal direction, position and scale; and descriptor quantization. The feature point voting verification part comprises two parts: finding candidate images and candidate feature points in the index library, and verifying the candidate feature points by voting.
(1) The offline learning part in Fig. 1 mainly comprises the construction of the visual vocabulary dictionary.
A large number of images are selected and their feature points are extracted as the sample learning library. The feature vectors of the feature descriptors in the sample library are then grouped; K class centers are obtained on each feature group through K-means clustering, each class center being a feature vector that represents a root word of the visual vocabulary, and the K class centers form the root set of that feature group. Selecting one root word from the root set of each feature group generates a visual word, and the root sets constructed on all the feature groups are combined into the visual vocabulary dictionary. In this embodiment, the feature descriptors of the local feature points are divided into 4 groups of 8 feature values each, 64 class centers are constructed per group by K-means clustering, each class center being a root word of that group, and the 4 root sets together form the visual vocabulary dictionary of the method. The class centers of each group are stored in arrays and saved to a file; when quantizing local feature descriptors, the method loads these arrays into memory.
Finally, the local feature descriptors in the sample library are quantized into visual words against the visual vocabulary dictionary by the grouped quantization method. In this embodiment, the extracted SIFT feature descriptors are 32-dimensional feature vectors.
(2) The specific steps of the characteristic point quantization part in fig. 1 are as follows:
firstly, extracting a local feature point set S ═ P from an imagei,i∈[0,Q]Q is the number of local characteristic points in the image; and the local characteristic point P is quantized by grouping according to a visual vocabulary dictionaryiIs quantized into the visual vocabulary VWi. The method comprises the following specific steps:
in the local feature point extraction in fig. 1, in this embodiment, the detected local feature point is described by using a SIFT descriptor; a local feature point (P)i) By SIFT descriptor can be expressed as: [ F ]iii,Pxi,Pyi](ii) a Wherein FiRepresenting the feature descriptor vector by using a gradient histogram; thetaiIs a main direction; sigmaiIs the scale of the local feature point, (Px)i,Pyi) Is a local feature point (P)i) Spatial position in the image. In this embodiment, FiIs set as a 32-dimensional feature vector. Through local feature point extraction and description, an image is represented as a set of SIFT descriptors.
The feature descriptor quantization in Fig. 1 quantizes the feature descriptor F_i of each local feature point into a visual word by the grouped quantization method according to the dictionary. Grouped quantization divides the feature descriptor F_i (a D-dimensional feature vector) into M groups of D/M features each, then quantizes the feature vector of each group j independently into V_j according to the trained dictionary; the visual word VW_i of the feature descriptor obtained by grouped quantization is:

$$VW_i = \sum_{j=0}^{M-1} V_j \cdot L^j$$

where L is the number of root words of the corresponding group in the visual vocabulary dictionary. After quantization of its local feature descriptor, a local feature point P_i is represented as [VW_i, θ_i, σ_i, Px_i, Py_i].
Block 22 of Fig. 1 quantizes the other SIFT feature attributes: the principal direction θ_i, the scale σ_i and the position (Px_i, Py_i). The principal direction θ_i is a floating-point radian value, quantized here to an integer angle value θ_i = round(θ_i · 180/π).
Similarly, the position information (Px_i, Py_i) and the scale are quantized to integers; when quantizing the scale, it is multiplied by 100 before rounding, preserving some precision.
(3) The feature point voting verification part in Fig. 1 mainly comprises part 31, obtaining candidate images and candidate feature points, and part 32, verifying the candidate feature points by voting. In an image retrieval application, a large number of candidate feature points are obtained from the index database according to the visual words of the query image's feature points, and the candidate feature points are grouped by image ID (imgId) into candidate images. Both methods verify the candidate feature points in each candidate image through the global consistency constraint, achieving image copy detection. The feature point verification of the invention comprises two methods: one verifies candidate feature points through the consistency of the relative order of principal direction and of scale between correctly matched feature points, called weak relationship consistency verification; the other verifies feature points through two strong geometric relationships between feature points, the principal direction difference and the angular position difference, called strong geometric verification.
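A minimal sketch of part 31, grouping the matched candidate feature points into candidate images with an ordinary dictionary keyed by imgId as the hash table (the match record layout is an assumption):

```python
from collections import defaultdict

def collect_candidates(matches):
    """Group candidate feature points by image ID (part 31 in Fig. 1).

    matches: iterable of (img_id, feature_point) pairs obtained by looking up
    the query image's visual words in the index database.
    Returns {img_id: [feature_point, ...]}, one entry per candidate image.
    """
    candidates = defaultdict(list)
    for img_id, point in matches:
        candidates[img_id].append(point)
    return candidates
```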
3-1. Weak relationship consistency verification: based on the principle that the relative order of the principal directions and of the scales is consistent between correctly matched feature points, whether feature points are correctly matched is verified by voting. The specific steps are as follows:
3-1-1. Match the visual words obtained by quantizing the query image's feature points against the feature points in the index database to obtain a large number of matched feature points. Build a hash table keyed by the image IDs (imgId) of the feature points to find the candidate images.
3-1-2. As shown in Figs. 2(a) and 2(b), in the query image, if the scale of one feature point is smaller than that of another, then the candidate feature points matched to the same visual words should satisfy the same ordering in the candidate image; the principal direction should likewise satisfy this condition. If a feature point satisfies this relation with a certain proportion of the feature points in the candidate image, it is considered a correctly matched feature point. Formally:

$$M_{i,j}=\begin{cases}1, & \operatorname{sign}(Scl_{ai}-Scl_{aj})=\operatorname{sign}(Scl_{bi}-Scl_{bj})\ \text{and}\ \operatorname{sign}(Ori_{ai}-Ori_{aj})=\operatorname{sign}(Ori_{bi}-Ori_{bj})\\ 0, & \text{otherwise}\end{cases}$$

where Scl denotes scale, Ori denotes principal direction, subscripts ai, aj denote the i-th and j-th feature points of image a (the query image), and bi, bj the corresponding feature points of image b (the candidate image). M_{i,j} = 1 indicates that the two feature points i, j satisfy the consistency constraint, and the vote count of feature point j for feature point i is increased by 1. If a feature point satisfies the constraint with a certain proportion of its context feature points, it is considered a correct point. During image retrieval, the vote sum S obtained by the correctly matched feature points is computed:

$$S=\sum_{i}\mathbb{1}\left(V_i \ge Th\right),\qquad V_i=\sum_{j} M_{i,j}$$

where V_i is the number of votes obtained by feature point i and Th is the voting threshold. The vote sum of each candidate image is used to measure the similarity of that candidate image.
The pseudo code of the specific execution process is as follows:
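The pseudocode of the original is provided as a figure; in its place, here is a minimal Python sketch of the weak-relationship-consistency voting described above. The record layout (vw, ori, scl), the pair list, and the helper name are illustrative assumptions, not the patent's code:

```python
def weak_consistency_votes(query_pts, cand_pts, pairs, th=3):
    """Vote-based weak relationship consistency verification (step 3-1-2).

    query_pts / cand_pts: lists of (vw, ori, scl) tuples for the query image a
    and one candidate image b.
    pairs: list of (qi, ci) index pairs matched through identical visual words.
    th: voting threshold Th.
    Returns the vote sum S of the candidate image.
    """
    votes = [0] * len(pairs)
    for m, (qi, ci) in enumerate(pairs):
        for qj, cj in pairs:
            if qj == qi:
                continue
            # The relative order of scale must agree between image a and image b.
            scl_ok = (query_pts[qi][2] - query_pts[qj][2]) * \
                     (cand_pts[ci][2] - cand_pts[cj][2]) > 0
            # The relative order of principal direction must agree as well.
            ori_ok = (query_pts[qi][1] - query_pts[qj][1]) * \
                     (cand_pts[ci][1] - cand_pts[cj][1]) > 0
            if scl_ok and ori_ok:
                votes[m] += 1          # context point j votes for point i
    # S counts the candidate points whose votes reach the threshold Th.
    return sum(1 for v in votes if v >= th)
```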
3-2. Strong geometric verification: strong geometric verification computes two angular relationships between feature points, the principal direction difference and the angular position difference; the feature point attributes used are the principal direction and the position. Of the two angles used by the method, the principal direction difference is the difference between the principal directions of the two feature points, and the angular position difference is the angle between the principal direction and the line connecting the two feature points. The principal direction difference is computed as

β = |Ori_i − Ori_j|

and the angular position difference as

α = |arctan2(P_i, P_j) − Ori_i|

where arctan2(P_i, P_j) is the angle between the line connecting the two feature points and the horizontal direction.
3-2-1. As in step 3-1-1, visual words are obtained by quantizing the query image's feature points and are matched against the feature points in the index database, yielding a large number of matched feature points. A hash table keyed by the image IDs (imgId) of the feature points is built to find the candidate images.
3-2-2. Between correctly matched feature points, the principal direction difference of two feature points of the query image should be approximately equal to that of the corresponding two feature points of the candidate image; likewise, the angular position differences should be approximately equal. That is, the difference of principal direction differences Dif_Ori_{i,j} = |β_{aij} − β_{bij}| and the difference of angular position differences Dif_Dir_{i,j} = |α_{aij} − α_{bij}| should both approach 0, giving the constraint

$$M_{i,j}=\begin{cases}1, & Dif\_Ori_{i,j}\le T_\beta\ \text{and}\ Dif\_Dir_{i,j}\le T_\alpha\\ 0, & \text{otherwise}\end{cases}$$

where T_β and T_α are small tolerance thresholds.
If the two points i and j satisfy the constraint relation, M_{i,j} equals 1. Whether a candidate feature point is a correctly matched feature point is decided by voting: if a feature point satisfies the relation with a certain proportion of the other candidate feature points in the image, it is considered a correctly matched feature point. The correct matching points in each candidate image are counted, and the vote sum S obtained by the correctly matched feature points is computed:
$$S=\sum_{i}\mathbb{1}\left(V_i \ge Th\right),\qquad V_i=\sum_{j} M_{i,j}$$

where V_i is the number of votes obtained by feature point i and Th is the voting threshold. The vote sum of each candidate image is used to measure the similarity of that candidate image.
The pseudo code of the specific execution process is as follows:
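As above, the pseudocode figure is replaced by a minimal Python sketch of the strong geometric voting. The point record (ori, x, y) with the principal direction in degrees, the tolerance thresholds t_beta and t_alpha, and the angle normalization are illustrative assumptions:

```python
import math

def strong_geometry_votes(query_pts, cand_pts, pairs,
                          t_beta=10.0, t_alpha=10.0, th=3):
    """Vote-based strong geometric verification (step 3-2-2).

    query_pts / cand_pts: lists of (ori, x, y) per feature point, ori in degrees.
    pairs: list of (qi, ci) index pairs matched through identical visual words.
    t_beta / t_alpha: tolerances (degrees) on the differences of principal
    direction differences and of angular position differences.
    Returns the vote sum S of the candidate image.
    """
    def beta(pts, i, j):        # principal direction difference of points i, j
        return abs(pts[i][0] - pts[j][0])

    def alpha(pts, i, j):       # angular position difference of points i, j
        line = math.degrees(math.atan2(pts[j][2] - pts[i][2],
                                       pts[j][1] - pts[i][1]))
        return abs(line - pts[i][0]) % 360.0   # normalize to [0, 360)

    votes = [0] * len(pairs)
    for m, (qi, ci) in enumerate(pairs):
        for qj, cj in pairs:
            if qj == qi:
                continue
            dif_ori = abs(beta(query_pts, qi, qj) - beta(cand_pts, ci, cj))
            dif_dir = abs(alpha(query_pts, qi, qj) - alpha(cand_pts, ci, cj))
            if dif_ori <= t_beta and dif_dir <= t_alpha:
                votes[m] += 1    # context pair j supports candidate point i
    return sum(1 for v in votes if v >= th)
```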
having described embodiments of the invention in detail, it will be appreciated that variations and modifications can be effected within the scope of the invention as described above and as particularly set forth in the claims by a person of ordinary skill in the art without departing from the scope of the invention.

Claims (3)

1. The local feature point verification method based on the global relationship consistency constraint is characterized by comprising the following three parts: (1) an off-line learning part, (2) a characteristic point quantization part, and (3) a characteristic point voting verification part; the off-line learning part is used for constructing a visual vocabulary dictionary; the characteristic point quantization part quantizes local characteristics according to a visual vocabulary dictionary obtained by off-line learning; the feature point voting verification part is used for verifying the feature points in the candidate images, and is specifically realized as follows:
step (1), off-line learning is carried out, and a visual vocabulary dictionary is obtained by grouping and clustering a large number of samples;
step (2), quantizing the feature points of the query image through the visual vocabulary dictionary to obtain visual words;
step (3), matching the visual words of the query image in an index library to obtain candidate feature points, and establishing a relationship by using the unique image identifiers of the candidate feature points to obtain a plurality of candidate images;
step (4), verifying the feature points through the strong geometric relationship constraint, so as to achieve the aim of verifying the candidate images;
step (4), verifying the feature points by adopting the strong geometric relationship constraint; the method verifies the feature points according to the principle that the principal direction difference and the angular position difference between correctly matched feature points satisfy the consistency constraint; the principal direction difference is β = |Ori_i − Ori_j|, where Ori_i is the principal direction of the point i to be verified and Ori_j is the principal direction of the context feature point j of the point i to be verified; the angular position difference is the angle between the line connecting the feature point to be verified with its context feature point and the principal direction of the feature point to be verified, and can be computed by the following formula:

α = |arctan2(P_i, P_j) − Ori_i|

where arctan2(P_i, P_j) computes the angle between the line connecting the two feature points (P_i, P_j) and the horizontal direction; the specific steps are as follows:
4-1, matching visual words obtained by quantizing the characteristic points of the query image with the characteristic points of the built index in the index database to obtain a large number of matched characteristic points; establishing a Hash table by taking the image IDs of the characteristic points as key words to find a plurality of candidate images;
4-2, because SIFT feature points have good rotation robustness, the principal direction difference between two feature points of the query image and the principal direction difference between the corresponding two feature points of the candidate image are approximately equal; similarly, the angular position differences should also be approximately equal; the difference of principal direction differences between corresponding feature points, Dif_Ori_{i,j} = |β_{aij} − β_{bij}|, and the difference of angular position differences, Dif_Dir_{i,j} = |α_{aij} − α_{bij}|, should approach 0:

$$M_{i,j}=\begin{cases}1, & Dif\_Ori_{i,j}\le T_\beta\ \text{and}\ Dif\_Dir_{i,j}\le T_\alpha\\ 0, & \text{otherwise}\end{cases}$$

if the feature points i and j satisfy the constraint relationship, M_{i,j} equals 1; wherein α_{aij} represents the angular position difference of the feature points i and j in image a, α_{bij} represents the angular position difference of the corresponding feature points in image b, β_{aij} represents the principal direction difference of the feature points i and j in image a, and β_{bij} represents the principal direction difference of the corresponding feature points in image b;
judging whether the candidate feature points are correctly matched feature points by voting, and calculating the vote sum S obtained by the correctly matched feature points:

$$S=\sum_{i}\mathbb{1}\left(V_i \ge Th\right),\qquad V_i=\sum_{j} M_{i,j}$$

where V_i is the number of votes obtained by feature point i and Th is the voting threshold; when selecting the result, a number of images are selected as copy images in descending order of the vote sum of the candidate images.
2. The local feature point verification method based on global relationship consistency constraint according to claim 1, characterized in that the off-line learning part comprises the following specific implementation steps:
1-1, selecting a large number of images to construct an image library, extracting local feature points and feature descriptors of the images in the image library, and constructing the extracted feature descriptors into a sample library;
1-2, obtaining a visual vocabulary dictionary through a sample library; specifically, feature vectors of feature descriptors in a sample library are grouped, K class centers are obtained on each feature group through K mean value clustering, each class center is a feature vector and represents a root word in a visual vocabulary, and the K class centers are root word sets of the feature groups; and combining the root sets constructed on each feature group to obtain a visual vocabulary dictionary.
3. The local feature point verification method based on global relationship consistency constraint according to claim 2, wherein the quantization of the feature points in the feature point quantization part comprises two parts: quantizing local feature descriptors, and quantizing main directions, scales and coordinates;
2-1. local feature descriptor quantization: a local feature point set S = {P_i, i ∈ [0, Q]} is extracted from the input image, where Q is the number of local feature points in the input image and P_i refers to the i-th local feature point; each local feature point P_i is quantized into a visual word VW_i by grouped quantization according to the visual vocabulary dictionary; the specific steps are as follows:
2-1-1, extracting, for each local feature point P_i of the input image, its feature descriptor F_i, position (Px_i, Py_i), scale σ_i and principal direction θ_i, i.e. the local feature point P_i is represented as [F_i, θ_i, σ_i, Px_i, Py_i];
2-1-2, for each local feature point P_i, obtaining a visual word from its feature descriptor F_i by the grouped quantization method according to the visual vocabulary dictionary; the grouped quantization based on the visual vocabulary dictionary divides the feature descriptor F_i into M groups of D/M features each, where D is the dimensionality of the feature vector of F_i; the feature vector of each group j is then quantized independently into V_j according to the visual vocabulary dictionary trained in step 1-2, and the visual word VW_i of the feature descriptor F_i obtained by grouped quantization is:

$$VW_i = \sum_{j=0}^{M-1} V_j \cdot L^j$$

where L is the number of root words of the corresponding group in the visual vocabulary dictionary; the local feature point P_i is thereby represented as [VW_i, θ_i, σ_i, Px_i, Py_i]; the quantization of each group's feature vector is realized by searching, based on Euclidean distance, the nearest class center in the root set of the group and taking the subscript of that class center as the quantization result;
2-2. quantization of the principal direction, scale and coordinates: the principal direction θ_i is a floating-point radian value, quantized here to an integer angle value: θ_i = round(θ_i · 180/π);
similarly, the position information (Px_i, Py_i) and the scale σ_i are also quantized to integers; when quantizing the scale, σ_i is multiplied by 100 before rounding, preserving some precision.
CN201611109737.2A 2016-12-06 2016-12-06 Local feature point verification method based on global relationship consistency constraint Active CN106649624B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611109737.2A CN106649624B (en) 2016-12-06 2016-12-06 Local feature point verification method based on global relationship consistency constraint

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611109737.2A CN106649624B (en) 2016-12-06 2016-12-06 Local feature point verification method based on global relationship consistency constraint

Publications (2)

Publication Number Publication Date
CN106649624A CN106649624A (en) 2017-05-10
CN106649624B (en) 2020-03-03

Family

ID=58818347

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611109737.2A Active CN106649624B (en) 2016-12-06 2016-12-06 Local feature point verification method based on global relationship consistency constraint

Country Status (1)

Country Link
CN (1) CN106649624B (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036012A (en) * 2014-06-24 2014-09-10 中国科学院计算技术研究所 Dictionary learning method, visual word bag characteristic extracting method and retrieval system
CN104573681A (en) * 2015-02-11 2015-04-29 成都果豆数字娱乐有限公司 Face recognition method
CN105678349A (en) * 2016-01-04 2016-06-15 杭州电子科技大学 Method for generating context descriptors of visual vocabulary

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Near-duplicate image retrieval based on contextual descriptor; Yao J L et al.; IEEE Signal Processing Letters; 2015-12-31; Vol. 22, No. 9; 1404-1408 *
Visual context analysis based on local features and its applications; Zhou Wengang; University of Science and Technology of China: Signal and Information Processing; 2011-09-19; Section 4.3 of the text *

Also Published As

Publication number Publication date
CN106649624A (en) 2017-05-10

Similar Documents

Publication Publication Date Title
Gao et al. View-based 3D object retrieval: challenges and approaches
Leng et al. A 3D model recognition mechanism based on deep Boltzmann machines
CN102693311B (en) Target retrieval method based on group of randomized visual vocabularies and context semantic information
CN105224937B (en) Fine granularity semanteme color pedestrian recognition methods again based on human part position constraint
CN104615676B (en) One kind being based on the matched picture retrieval method of maximum similarity
Guo et al. 3D free form object recognition using rotational projection statistics
Yue et al. Robust loop closure detection based on bag of superpoints and graph verification
Liu et al. Uniting keypoints: Local visual information fusion for large-scale image search
Gao et al. Democratic diffusion aggregation for image retrieval
Fan et al. Satellite image matching method based on deep convolutional neural network
CN104199842A (en) Similar image retrieval method based on local feature neighborhood information
Furuya et al. Diffusion-on-manifold aggregation of local features for shape-based 3d model retrieval
Johns et al. From images to scenes: Compressing an image cluster into a single scene model for place recognition
JPWO2017006852A1 (en) Image collation apparatus, image collation method, and program
Tao et al. A pipeline for 3-D object recognition based on local shape description in cluttered scenes
CN102208033A (en) Data clustering-based robust scale invariant feature transform (SIFT) feature matching method
CN105760875A (en) Binary image feature similarity discrimination method based on random forest algorithm
CN105678349B (en) A kind of sub- generation method of the context-descriptive of visual vocabulary
CN111597367B (en) Three-dimensional model retrieval method based on view and hash algorithm
JP2018136946A (en) Three-dimensional cad model part search method and three-dimensional cad model search method
CN106649624B (en) Local feature point verification method based on global relationship consistency constraint
CN106991431B (en) Post-verification method for local feature point matching pairs
Altintakan et al. An improved BOW approach using fuzzy feature encoding and visual-word weighting
Lu et al. Efficient image categorization with sparse Fisher vector
CN108959650A (en) Image search method based on symbiosis SURF feature

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20170510

Assignee: Hangzhou Zihong Technology Co., Ltd

Assignor: Hangzhou University of Electronic Science and Technology

Contract record no.: X2021330000654

Denomination of invention: Local feature point verification method based on global relationship consistency constraint

Granted publication date: 20200303

License type: Common License

Record date: 20211104
