CN112541092A

CN112541092A - Three-dimensional image contour retrieval method and system based on tangential domain and storage medium

Info

Publication number: CN112541092A
Application number: CN202011563813.3A
Authority: CN
Inventors: 杜娟; 闫甜甜; 胡晓明
Original assignee: South China University of Technology SCUT
Current assignee: South China University of Technology SCUT
Priority date: 2020-12-25
Filing date: 2020-12-25
Publication date: 2021-03-23
Anticipated expiration: 2040-12-25
Also published as: CN112541092B

Abstract

The invention discloses a tangential domain-based three-dimensional image contour retrieval method, a system and a storage medium, which comprises the steps of clustering K-means to generate a feature view of a projection view of a three-dimensional image and extracting the contour of the feature view. Then, the Gaussian kernel function is used for convolution and is converted into a tangential domain for retrieval. The method can eliminate the influence of the starting point on the contour retrieval, has the invariance of rotation, translation, scale and mirror image, has low feature dimension and high extraction speed, and can obtain higher retrieval rate and better universality compared with other contour retrieval algorithms.

Description

Three-dimensional image contour retrieval method and system based on tangential domain and storage medium

Technical Field

The invention relates to the field of image processing, in particular to a three-dimensional image contour retrieval method and system based on a tangential domain and a storage medium.

Background

The contour of an object is an inherent attribute and does not change along with illumination, color and texture, so a contour-based target identification method is researched in a large quantity, but the contour of the object can be presented differently on an image along with the change of a shooting angle, the extracted contour is deformed due to the movement and the shielding of a non-rigid object, the same type of object also has different properties, the problems bring huge challenges to related researches, and the research scheme needs to make good balance among speed, precision and robustness.

The conventional three-dimensional image-oriented research scheme can directly establish a descriptor from a three-dimensional image and can also indirectly establish the descriptor from the projection of the three-dimensional image, so that the rapidity and the effectiveness of three-dimensional image retrieval are met, the feature dimension can be reduced from the aspect of feature extraction, the feature extraction time is further reduced, the efficiency is improved, a better matching algorithm can be designed from the aspect of feature matching, and the effectiveness of a retrieval result is enhanced.

Disclosure of Invention

In order to overcome the defects and shortcomings of the prior art, the invention provides a three-dimensional image contour retrieval method, a three-dimensional image contour retrieval system and a storage medium based on a tangential domain.

The invention adopts the following technical scheme;

a three-dimensional image contour retrieval method based on a tangential domain comprises the following steps:

generation of S1 projection diagram: inputting a three-dimensional image, zooming and translating the three-dimensional image to the center of a unit sphere, generating viewpoints on the unit sphere, and generating a projection drawing at the position of each viewpoint;

s2 feature view extraction: extracting the sequence contour of each projection drawing, and recording the sequence contour as C_iWherein i is the number of pictures and uses a one-dimensional equation C_iExpressed as x (t) + jy (t), for C_iPerforming Fourier series expansion, selecting the first n Fourier coefficients for description, randomly selecting four clustering centers, clustering by using a K-means algorithm, and taking an image closest to the clustering centers as a characteristic view of the image;

s3 arc length equal resampling: counting the number of pixels of the characteristic view as the total arc length according to

Carrying out arc length parameterization, wherein m represents the arc length of the position of the sequence contour point, carrying out cubic B-spline interpolation and carrying out arc length equal interval resampling to obtain a sequenceA column profile;

s4, convolving the sequence contour obtained in the last step with a Gaussian kernel function;

s5 tangential domain feature extraction: t ═ x (s, σ)_max)′，y(s，σ_max) '), mapping T to a unit circle, dividing the unit circle according to a certain angle interval margin, counting the number n (k) of points in each bin, wherein k is 1,2_maxAnd greater than 0.8n_maxThe angles are respectively used as a main direction and an auxiliary direction, and the characteristic vector is converted to the main direction f^main(i) And an auxiliary direction f^au(i) Then mirror the features f^mir(i) To obtain C_iIs noted as f^j(j＝1，2，3，4；)，f^j＝{f^main(i)，f^au(i)，f^mir(i) J represents the number of feature contours;

s6 defines Dpq:

[·]representing and merging into a vector;

s7 defines the threshold and retrieves: will D_pqAnd according to the sequence from small to large, taking the maximum value in the first four-dimensional minimum values of a certain class as a threshold value for searching the image.

Further, in S1, a viewpoint is generated on the unit sphere by using the gaussian seidel iteration.

Further, the number of viewpoints is 200.

Further, in S2, four feature views are generated.

Further, in S3, the number of resample points is 400.

Further, due to the translational invariance of the tangential domain and the one-to-one mapping between the closed sequence contour and the tangential domain, C will be_iConversion to the tangential domain for expression and feature extraction.

Further, the sequence contour of each projection image is extracted by adopting a morphological watershed algorithm.

A system for realizing the three-dimensional image contour retrieval method comprises the following steps:

the projection graph generation module: inputting a three-dimensional image, zooming and translating the three-dimensional image to the center of a unit sphere, generating viewpoints on the unit sphere, and generating a projection drawing at the position of each viewpoint;

the characteristic view extraction module: extracting the sequence contour of each projection drawing, and obtaining a characteristic view of the projection drawing after processing;

arc length equal interval sampling module: counting the number of pixels of the characteristic view as the total arc length, and performing resampling on the arc length at equal intervals to obtain a sequence profile;

a convolution module: convolving the obtained characteristic view sequence outline with a Gaussian kernel function;

a tangential domain feature extraction module: obtaining a feature vector of the projection drawing;

the retrieval module: definition D_pqAnd according to the sequence from small to large, taking the maximum value in the first four-dimensional minimum values of a certain class as a threshold value for searching the image.

A storage medium having stored thereon a computer program that executes the three-dimensional image contour detection method.

The invention has the beneficial effects that:

(1) the invention analyzes the projection of the three-dimensional image under different visual angles by the three-dimensional image contour retrieval method based on the tangential domain, improves the retrieval efficiency of the three-dimensional image, and has simple process and low calculation complexity.

(2) The method has stronger robustness, and can achieve good retrieval results for the property change of the objects in the class.

(3) The method has strong applicability, not only can be used for searching three-dimensional images, but also can be popularized to searching two-dimensional images, and the classifier can be designed according to the extracted characteristics for identifying objects.

Drawings

FIG. 1 is a flow chart of the operation of the present invention.

Detailed Description

The present invention will be described in further detail with reference to examples and drawings, but the present invention is not limited to these examples.

Example 1

As shown in fig. 1, a tangential domain-based three-dimensional image contour retrieval method of the present invention includes the following steps:

(1) projection view generation: inputting a three-dimensional image I, zooming and translating the three-dimensional image I to the center of a unit sphere, generating 200 viewpoints on the unit sphere by Gauss Seidel iteration, and generating a projection diagram at the position of each viewpoint.

The viewpoints generated by the method are more uniform than longitude and latitude coordinates, and 200 viewpoints not only can contain rich detail information, but also do not occupy more calculation time.

(2) Extracting a characteristic view: the sequence contour of each projection graph is extracted by the morphology watershed algorithm and is marked as C_iI 1,2 …,200, using one-dimensional equation C_iExpressed as x (t) + jy (t), for C_iAnd performing Fourier series expansion, selecting the first 20 Fourier coefficients for description, randomly selecting 4 clustering centers, clustering by using a K-means algorithm, and taking the image closest to the clustering centers as a characteristic diagram of the image.

The four characteristic views generated in the step have no difference between the optimal view and the worst view, the subsequent matching process is to synthesize the matching results of the four views, and the characteristic views are generated according to K-means clustering, so that the redundancy of projection views is reduced, the most representative projection view is extracted, and the retrieval rate and the retrieval speed are improved.

In this embodiment, four selected empirical values are selected for The clustering center, and other numerical values may be selected according to specific situations, and The Princeton Shape Benchmark dataset is generally used.

(3) Arc length equal resampling: counting the number of pixels as the total arc length according to

Carrying out arc length parameterization, wherein m represents the arc length of the position of the sequence contour point, carrying out cubic B-spline interpolation, carrying out arc length and the likeAnd re-sampling at intervals, and selecting 400 re-sampling points to obtain the sequence profile.

The step can ensure the second-order conductibility of the contour and the scale invariance of subsequent feature extraction, the number of the resampling points is 400, on one hand, the number can ensure that the error of the contour after sampling is small, and on the other hand, the algorithm has higher retrieval speed.

(4) Gaussian kernel function convolution: defining a Gaussian kernel function of

The window width can be changed by adjusting the standard deviation σ, and the sequence contour obtained in the previous step is convolved with the kernel function, x (s, σ) ═ x(s) · Gauss (σ), y (s, σ) ═ y(s) · Gauss (σ).

(5) Extracting features of a tangential domain: t ═ x (s, σ)_max)′，y(s，σ_max) '), mapping T onto a unit circle, dividing the unit circle by a certain angular interval margin (e.g., margin pi/18), counting the number of points n (k) in each bin, k being 1,2_maxAnd greater than 0.8n_maxThe angles are respectively used as a main direction and an auxiliary direction, and the characteristic vector is converted to the main direction f^main(i) And an auxiliary direction f^au(i) Note that the secondary direction is not unique, and then the features are mirrored f^mir(i) Then C is obtained_iIs noted as f^j(j＝1，2，3，4；)，f^j＝{f^main(i)，f^au(i)，f^mir(i) J denotes the number of feature contours.

Tangential domain transformation, C, due to translational invariance of the tangential domain and one-to-one mapping between the closed sequence contours and the tangential domain_iConversion to the tangential domain for expression and feature extraction.

Because it is greater than 0.8n_maxMay be more than one, and thus the secondary directions are not unique, it is necessary to calculate the feature vector in each secondary direction, with f^auTo indicate.

(6) Definition D_pq：

[·]Are represented and are combined into a vector.

D_pqA distance matrix representing the three-dimensional images p and q.

(7) Defining a threshold and retrieving: will D_pqAnd according to the sequence from small to large, taking the maximum value in the first four-dimensional minimum values of a certain class as a threshold value for searching the image.

Taking The Princeton Shape Benchmark dataset as an example, The data set has 90 classes in total, and The number of images in each class is not The same.

Because each three-dimensional image has four characteristic views, the existing matching algorithm can not be directly matched, and D is defined_pqTo perform a search for a three-dimensional image.

Search is conducted if D_pqIf the current graph is smaller than the threshold value, the current graph is output, otherwise, the next graph is input for retrieval again.

One application example of this embodiment is as follows:

a three-dimensional image is input, the data format of which is a point and a surface, the surface is composed of three points, and therefore the three-dimensional image is a connection relationship between a three-dimensional coordinate point and a point.

200 projection views are obtained, and 4 characteristic view outlines after clustering are obtained, wherein the number of the outline points is 2689 pixels in the example.

After the arc length equal resampling, the number of each feature profile point is 400, the sequence profile is convolved with the gaussian kernel function, and the standard deviation σ is set to 15.

Extracting tangential domain features, taking an input picture as an example, dividing a unit circle into 36 parts to obtain normalized feature vectors, visualizing the normalized feature vectors by using bar graphs, and respectively representing a main direction, an auxiliary direction and a mirror image feature vector.

Five images are obtained from the retrieval result, the retrieval image is removed, at most four correct images are obtained, the algorithm retrieves two correct images, and the retrieval result is displayed in a three-dimensional image mode.

The method can eliminate the influence of the starting point on the contour retrieval, has the invariance of rotation, translation, scale and mirror image, has low feature dimension and high extraction speed, and can obtain higher retrieval rate and better universality compared with other contour retrieval algorithms.

Example 2

A computer-readable storage medium having stored thereon a computer program that executes a method of implementing the tangential domain-based three-dimensional image contour retrieval method described above.

The above embodiments are preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims

1. A three-dimensional image contour retrieval method based on a tangential domain is characterized by comprising the following steps:

Carrying out arc length parameterization, wherein m represents the arc length of the position of the sequence contour point, carrying out cubic B-spline interpolation and carrying out arc length equal interval resampling to obtain the sequence contour；

s6 definition D_pq：dist(f_q ¹，f_p ¹)＝min[sum(f_p ¹-f_q ¹)²](p≠q)，

[·]Representing and merging into a vector;

2. The method for retrieving a contour of a three-dimensional image according to claim 1, wherein in S1, a viewpoint is generated on a unit sphere by using a gaussian seidel iteration.

3. The method for retrieving a contour of a three-dimensional image according to claim 1 or 2, wherein the number of the viewpoints is 200.

4. The method for retrieving a contour of a three-dimensional image according to claim 1, wherein in said S2, four feature views are generated.

5. The method for retrieving a three-dimensional image contour according to claim 1, wherein in said S3, the number of resample points is 400.

6. The method of claim 1, wherein C is assigned due to translational invariance in the tangential domain and a one-to-one mapping between the closed sequence contours and the tangential domain_iConversion to the tangential domain for expression and feature extraction.

7. The method of claim 1, wherein the extracting the sequence contour of each projection image is performed by using a morphological watershed algorithm.

8. A system for implementing the three-dimensional image contour retrieval method according to any one of claims 1 to 7, comprising:

9. A storage medium having stored thereon a computer program for executing the three-dimensional image contour detection method according to any one of claims 1 to 7.