CN106709997B - Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder - Google Patents

Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder Download PDF

Info

Publication number
CN106709997B
CN106709997B CN201610279232.4A CN201610279232A CN106709997B CN 106709997 B CN106709997 B CN 106709997B CN 201610279232 A CN201610279232 A CN 201610279232A CN 106709997 B CN106709997 B CN 106709997B
Authority
CN
China
Prior art keywords
point
dimensional
grid model
dimensional grid
encoding encoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201610279232.4A
Other languages
Chinese (zh)
Other versions
CN106709997A (en
Inventor
朱策
林薪雨
张倩
王征韬
刘翼鹏
夏志强
虢齐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201610279232.4A priority Critical patent/CN106709997B/en
Publication of CN106709997A publication Critical patent/CN106709997A/en
Application granted granted Critical
Publication of CN106709997B publication Critical patent/CN106709997B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • G06T17/30Polynomial surface description
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Physics (AREA)
  • Mathematical Analysis (AREA)
  • Algebra (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

The invention belongs to 3D computer vision technical fields, and in particular to a kind of three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder.This method includes the sparse self-encoding encoder of training and deep neural network stage and detects the three-dimensional key point stage using trained deep neural network as regression model.Part and global information of the three-dimensional grid model in multiscale space are fully utilized to detect whether tested point is key point.The correlation between these parts and global information can effectively be found and form the advanced features representation of these information by introducing the sparse self-encoding encoder of multilayer, to return to it.It finally can effectively, robustly and steadily detect the key point in three-dimensional grid model.

Description

Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder
Technical field
The invention belongs to 3D computer vision technical fields, and in particular to one kind based on deep neural network and it is sparse from The three-dimensional critical point detection method of encoder.
Background technique
Three-dimensional critical point detection is the important content in 3D computer vision, be widely used in as target registered with Match, 3D shape retrieval, among the various applications such as mesh segmentation and simplification.Researchers propose more in the past few decades The method that kind detects three-dimensional key point, wherein being mostly the method based on geometry.Godila and Wagan are to two-dimensional Scale Invariant Feature Transform (SIFT) algorithm is extended, and proposes that three-dimensional S IFT critical point detection is calculated Method.Holte carries out three-dimensional critical point detection using Difference-of-Normals (DoN) operator.Castellani is according to three The vision significance principle for tieing up grid model, proposes a kind of three-dimensional critical point detection algorithm that can detect robust.In addition, also There is some algorithm to carry out three-dimensional critical point detection in Laplce's spectral domain rather than real domain using the method for Laplce's spectrum.
Three-dimensional critical point detection method based on geometry lacks enough flexibilities, and therefore, it is difficult to meet the need of vast application It asks.It is three-dimensional key point that such method, which usually defines on three-dimensional grid model surface the point of acute variation in all directions, still In some scenes, these points may be some unessential little details in noise either three-dimensional grid model.In addition, working as When needs in view of three-dimensional grid model semantic information, such issues that method based on geometry can hardly be handled.It is based on The above reason, more and more researchers start to be dedicated to finding a kind of new three-dimensional critical point detection of frame progress.
In recent years, some researchers proposed to carry out three-dimensional critical point detection using the method for machine learning, such method It can solve the deficiency of the three-dimensional critical point detection method based on geometry to a certain extent.Teran and Mordohai is using at random Forest carries out three-dimensional critical point detection (Teran, L., Mordohai, P.:3d interest point as classifier detection via discriminative learning.In:Proceedings of the 13th European Conference on Computer Vision Conference on Computer Vision,Zurich, Switzerland(Sept 2014)).In the method, several three-dimensional critical point detection methods based on geometry are used to generate The characteristic attribute of training sample and test sample.Creusot using linear discriminant analysis (LDA) and AdaBoost two ways from Three-dimensional key point is detected in three-dimensional face model.Three-dimensional critical point detection problem is attributed to a two dimension by Salti and Tombari Classification problem, classification standard be a point whether can correctly with a predefined sub- successful match of three-dimensional description (Salti,S.,Tombari,F.,Spezialetti,R.,Stefano,L.D.:Learning a descriptor- specific 3d keypoint detector.In:Computer Vision(ICCV),2015IEEE International Conference on,Dec 2015,2318-2326)。
However, these above-mentioned algorithms mostly only generate characteristic attribute using only local message, lacks and be similar to La Pula Global information as this spectrum.
Summary of the invention
For above-mentioned there are problem or deficiency, can more effectively, robustly and steadily to detect three-dimensional grid mould Key point in type, the present invention provides a kind of three-dimensional critical point detection side based on deep neural network and sparse self-encoding encoder Method.
Specific technical solution is as follows:
Step 1 chooses training set and test set from three-dimensional grid model database, and chooses positive and negative sample from training set This point:
The training set and test set of selection do not overlap, and three-dimensional grid model has the three-dimensional pass generated by handmarking Key point.For each of training set three-dimensional grid model, chooses and be all positive by the three-dimensional key point that handmarking generates Sample point, remaining sample point that is negative.
Step 2, the negative sample point that is positive form characteristic attribute, construction feature property set:
The feature of sample point is formed using local message of the three-dimensional grid model in multiscale space and global information Attribute.
The local message includes three parts: the 1) Euclidean distance of the surrounding neighbors point of measured point to measured point tangent plane fd, 2) and measured point and the angle f around it between normal vector of field pointθ, 3) and four kinds of curvature of curved surface fc: maximum principal curvatures, minimum Principal curvatures, Gaussian curvature and mean value curvature.The global information is Laplce's spectral information fls
For any point v in three-dimensional grid model M (x, y, z), enabling f is its characteristic attribute, then:
F=[f0,f1,f2,...,fΩ]T (1)
fi=[fd,fθ,fc,fls], i=0,1,2 ..., Ω (2)
Wherein fi, i=0,1,2 ..., Ω indicates three-dimensional grid model M (x, y, z) characteristic attribute corresponding to scale i Information, fiInclude three classes local message fd、fθ、fcWith global information fls.Three-dimensional grid model M (x, y, z) is in dimensional space Developing indicates are as follows:
Mδ(x, y, z)=M (x, y, z) * G (x, y, z, δ) (3)
Wherein δ ∈ { 0, ε, 2 ε ..., Ω ε } is the standard deviation of three-dimensional Gaussian filter, and ε is to surround three-dimensional grid mould completely 0.3%, the δ=0 of the leading diagonal length of the minimum cube of type indicate the evolutionary model be initial three-dimensional grid model M (x, Y, z), * is convolution operator.
Three-dimensional grid model is made of series of points and its connection relationship.As shown in figure 3, in three-dimensional grid model Any point v, enables Vk(v), k=1,2,3,4,5 be surrounding k- ring neighborhood point, and n is its normal vector.Enable vkjFor Vk(v) in J-th point, nkjFor its normal vector.Then point vkjTo the Euclidean distance d of tangent plane corresponding to point vkjAre as follows:
Point vkjWith the angle between point v normal vector are as follows:
Wherein, (xv,yv,zv) be point v coordinate, (xkj,ykj,zkj) it is point vkjCoordinate.It enables Wherein NkFor Vk(v) number at midpoint.Then fdAnd fθAre as follows:
fd=[max (dk),min(dk),max(dk)-min(dk),mean(dk),var(dk),harmmean(dk)] (7)
fθ=[max (θk),min(θk),max(θk)-min(θk),mean(θk),var(θk),harmmean(θk)] (8)
Wherein, mean (), var () and harmmean () respectively indicate arithmetic average, variance and harmonic average.fc It is made of four kinds of curvature:
Wherein c1For minimum principal curvatures, c2For maximum principal curvatures, (c1+c2)/2 are mean value curvature, c1c2For Gaussian curvature.
The Laplacian Matrix of three-dimensional grid model is a symmetrical matrix and can decompose are as follows:
L=B Λ BT (10)
Wherein Λ=Diag { λf, 1≤f≤Ψ } and it be the element of a diagonal matrix and the inside is arranged according to ascending order, λfIt is The characteristic value of the Laplacian Matrix of three-dimensional grid model.The column vector of orthogonal matrix B is corresponding characteristic vector, and Ψ is three-dimensional The sum at grid model midpoint.Laplce's spectrum is defined as:
H (f)={ λf,1≤f≤Ψ} (11)
Global information is obtained using logarithm-Laplce's spectrum.Logarithm-Laplce spectrum L (f) is defined as:
L (f)=log (H (f)) (12)
The scrambling R (f) of spectrum is used to obtain grid conspicuousness:
R (f)=| L (f)-JΓ(f)*L(f)| (13)
Wherein,It is the vector of a 1 × Γ.Pass through following formula:
The scrambling of spectrum is transformed into real domain from spectral domain.Wherein R1=Diag { exp (R (f)): 1≤f≤Ψ } is Diagonal matrix,For Hadamard product, W is weight matrix, wherein
It enablesS is the element of S, then:
fls=[max (sk),min(sk),max(sk)-min(sk),mean(sk),var(sk),harmmean(sk)] (17)
Step 3, the characteristic attribute collection built using step 2 and corresponding tally set train sparse self-encoding encoder and depth Neural network:
Sparse self-encoding encoder is a variant of self-encoding encoder, by adding sparsity limit in the hidden layer part of self-encoding encoder It makes and obtains.Fig. 2 (a) illustrates the basic structure of a self-encoding encoder.
Three sparse self-encoding encoders are trained first, and the coded portion of these three sparse self-encoding encoders is then extracted grade It is linked togather, forms the sparse self-encoding encoder of depth, then train first level logical to return layer sparse from coding to handle depth The feature exported after device coding.
Deep neural network regression model is made of the sparse self-encoding encoder of depth and above-mentioned logistic regression level connection, Fig. 2 (b) Illustrate the basic structure of the deep neural network regression model in the method for the present invention.
Finally the effect that deep neural network regression model realizes accurate adjustment is acted on using back-propagation algorithm.
Step 4, the deep neural network regression model obtained using step 3 are predicted and are obtained to three-dimensional grid model Corresponding conspicuousness response diagram:
Characteristic attribute is formed to each point of three-dimensional grid model in test set using method same in step 2, and The deep neural network regression model obtained with step 3 predicts the point, obtains a regressand value.Three-dimensional grid is obtained again The regressand value of all the points in model is constituted the conspicuousness response diagram of the three-dimensional grid model with it.
Step 5, the conspicuousness response diagram obtained according to step 4 obtain three-dimensional key point:
The point in conspicuousness response diagram with local maximum is chosen as three-dimensional key point.For in three-dimensional grid model Each point, if the conspicuousness response of the point is all bigger than the conspicuousness response put in 5- ring neighborhood around it, should Point is three-dimensional key point.Otherwise, which is not just three-dimensional key point.
The present invention passes through: 1, deep neural network being used to carry out three-dimensional pass as regression model in conjunction with sparse self-encoding encoder The detection of key point;2, the part using three-dimensional grid model in multiscale space and global information form characteristic attribute, use up Information more than possible is utilized to detect three-dimensional key point;3, these offices can effectively be found by introducing the sparse self-encoding encoder of multilayer Correlation between portion and global information and the advanced features representation for forming these information, to be returned to it.Energy Enough key points effectively, robustly and steadily detected in three-dimensional grid model.
In conclusion relatively existing three-dimensional critical point detection method, method of the invention can effectively, robustly and surely Surely the key point in three-dimensional grid model is detected.
Detailed description of the invention
Fig. 1 is the flow chart of three-dimensional critical point detection method in the present invention;
Fig. 2 (a) is the structure chart of self-encoding encoder, and Fig. 2 (b) is deep neural network regression model of the invention;
Fig. 3 is aircraft three-dimensional grid model and its partial enlargement diagram;
Fig. 4 is the three-dimensional key point that chair three-dimensional grid model is detected using the method for the present invention;
Fig. 5 is using present invention detection three-dimensional video sequence key point obtained in different frame;
Fig. 6 is the performance comparison figure of the present invention with remaining 5 kinds three-dimensional critical point detection methods;(a) is schemed for database A test Concentrate the performance chart about IOU evaluation index;It is bent about the performance of IOU evaluation index in database B test set for scheming (b) Line chart.
Appended drawing reference: tested point v;1- ring neighborhood point V1;2- ring neighborhood point V2;3- ring neighborhood point V3;4- ring neighborhood point V4; 5- ring neighborhood point V5
Specific embodiment
The method of the present invention is described in further detail with specific example with reference to the accompanying drawing, the target of example is to pass through The validity of three-dimensional grid model critical point detection result verification the method for the invention.
In implementation process, we are with document (Dutagaci, H., Cheung, C.P., Godil, A.:Evaluation of 3d interest point detection techniques via human-generated ground Truth.The Visual Computer 28 (9) (2012) 901-917) in three-dimensional grid model database as training and Test data set.
The specific embodiment in training deep neural network stage:
Step 1 chooses training set and test set from three-dimensional grid model database, and chooses positive and negative sample from training set This point:
The three-dimensional grid model database is divided into two parts database A and database B, and database A includes 24 three-dimensionals Grid model carries out calibration by 23 people and generates three-dimensional key point Ground truth.Database B includes 43 three-dimensional grid moulds Type carries out calibration by 16 people and generates Ground truth.It is selected respectively 2/3rds of database A and database B as instruction Practice collection, remaining is as test set.
For each of database A training set three-dimensional grid model, Selecting All Parameters σ ∈ 0.01,0.02 ..., 0.1 } and Ground truth corresponding to parameter n ∈ { 11,12 ..., 22 } is positive sample, remaining point is negative sample.Positive sample This sum is 17115, and negative sample sum is 148565.For each of database B training set three-dimensional grid model, choose Ground truth corresponding to parameter σ ∈ { 0.01,0.02 ..., 0.1 } and parameter n ∈ { 8,9 ..., 15 } is positive sample, Remaining point is negative sample.Positive sample sum is 18427, and negative sample sum is 222034.
Step 2, the negative sample point that is positive form characteristic attribute:
For each of training set three-dimensional grid model, expression of the model in scale space is calculated first, so After calculate information of each of model sample point in scale i=0,1,2 ..., Ω, the value of Ω is 6 in the present invention. Local message fd、fθWith fcIt can be calculated by the following formula respectively:
fd=[max (dk),min(dk),max(dk)-min(dk),mean(dk),var(dk),harmmean(dk)]
fθ=[max (θk),min(θk),max(θk)-min(θk),mean(θk),var(θk),harmmean(θk)]
Global information flsIt is obtained by calculating logarithm-Laplce's spectrum in the scrambling of spectral domain:
R (f)=| L (f)-JΓ(f)*L(f)|
WhereinIn Γ be 9.Global information (fls) are as follows:
fls=[max (sk),min(sk),max(sk)-min(sk),mean(sk),var(sk),harmmean(sk)]
The characteristic attribute of each of final three-dimensional grid model sample point are as follows:
F=[f0,f1,f2,...,f6]T
fi=[fd,fθ,fc,fls], i=0,1,2 ..., 6
Sample dimension size is 665.
Step 3 utilizes the characteristic attribute collection built and the sparse self-encoding encoder of corresponding tally set training and depth nerve Network:
The characteristic attribute collection obtained using step 1 and step 2 and the sparse self-encoding encoder of corresponding tally set training and depth Neural network, relative parameters setting are as shown in the table:
Wherein ρ is the sparsity parameter of sparse self-encoding encoder, and β controls sparse punishment in the cost function of sparse self-encoding encoder The weight of item.
Detect the three-dimensional key point stage:
Step 4 predicts three-dimensional grid model using trained deep neural network regression model and obtains it Conspicuousness response diagram:
For each of test set three-dimensional grid model, by taking chair three-dimensional grid model as an example.First, in accordance with Method in step 2 obtains its expression in scale space, then puts for each and calculates characteristic attribute.Using trained Deep neural network regression model predicts each point on chair three-dimensional grid model.For each point, The output of deep neural network is that a value is regressand value between 0 to 1, and output valve indicates that the value more may be three closer to 1 Key point is tieed up, vice versa.All these output valves together constitute the conspicuousness response diagram of chair three-dimensional grid model.
Step 5 obtains three-dimensional key point according to conspicuousness response diagram:
The point in the conspicuousness response diagram of chair three-dimensional grid model with local maximum is chosen as chair three dimensional network The three-dimensional key point of lattice model.For each of chair three-dimensional grid model point, if the conspicuousness response ratio of the point The conspicuousness response put in 5- ring neighborhood around it is all big, then the point is three-dimensional key point.Otherwise, which is not just three-dimensional pass Key point.Fig. 4 illustrates the three-dimensional key point of the chair three-dimensional grid model detected using method of the invention.
Fig. 5 illustrates the three-dimensional key point in the pedestrian's three-dimensional test sequence detected using the method for the present invention under different frame Distribution.
Fig. 6 illustrates the method for the present invention with the performance comparison figure of remaining 5 kinds three-dimensional critical point detection methods, and evaluation index is IOU criterion (L.Teran and P.Mordohai, " 3d interest point detection via discriminativ learning,”in European Conference on Computer Vision.Zurich,Switzerland,Sept 2014), Fig. 6 (a) illustrates performance chart of 6 kinds of algorithms in database A test set, and Fig. 6 (b) illustrates 6 kinds of algorithms and exists Performance chart in database B test set.

Claims (1)

1. the three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder, comprising the following steps:
Step 1 chooses training set and test set from three-dimensional grid model database, and chooses positive negative sample from training set Point:
The training set and test set of selection do not overlap, and three-dimensional grid model has the three-dimensional generated by handmarking crucial Point;For each of training set three-dimensional grid model, chooses and be all positive sample by the three-dimensional key point of handmarking's generation This point, remaining sample point that is negative;
Step 2, the negative sample point that is positive form characteristic attribute, construction feature property set:
The characteristic attribute of sample point is formed using local message of the three-dimensional grid model in multiscale space and global information;
The local message includes three parts: 1) the Euclidean distance f of the surrounding neighbors point of measured point to measured point tangent planed, 2) and quilt Measuring point and the angle f around it between normal vector of field pointθ, 3) and four kinds of curvature of curved surface fc: maximum principal curvatures, minimum principal curvatures, Gaussian curvature and mean value curvature;The global information is Laplce's spectral information fls
For any point v in three-dimensional grid model M (x, y, z), enabling f is its characteristic attribute, then:
F=[f0,f1,f2,...,fΩ]T (1)
fi=[fd,fθ,fc,fls], i=0,1,2 ..., Ω (2)
Wherein fi, i=0,1,2 ..., Ω expression three-dimensional grid model M (x, y, z) characteristic attribute information corresponding to scale i, fiInclude fd、fθ、fcAnd fls;Evolution of the three-dimensional grid model M (x, y, z) in dimensional space indicates are as follows:
Mδ(x, y, z)=M (x, y, z) * G (x, y, z, δ) (3)
Wherein δ ∈ { 0, ε, 2 ε ..., Ω ε } is the standard deviation of three-dimensional Gaussian filter, and ε is to surround three-dimensional grid model completely 0.3%, the δ=0 of the leading diagonal length of minimum cube indicate the evolutionary model be initial three-dimensional grid model M (x, y, Z), * is convolution operator;
Three-dimensional grid model is made of series of points and its connection relationship, to any point v in three-dimensional grid model, enables Vk (v), k=1,2,3,4,5 be surrounding k- ring neighborhood point, and n is its normal vector;Enable vkjFor Vk(v) j-th point in, nkjFor Its normal vector, then point vkjTo the Euclidean distance d of tangent plane corresponding to point vkjAre as follows:
Point vkjWith the angle between point v normal vector are as follows:
Wherein, (xv,yv,zv) be point v coordinate, (xkj,ykj,zkj) it is point vkjCoordinate;It enables Wherein NkFor Vk(v) number at midpoint, then fdAnd fθAre as follows:
fd=[max (dk),min(dk),max(dk)-min(dk),mean(dk),var(dk),harmmean(dk)] (7)
fθ=[max (θk),min(θk),max(θk)-min(θk),mean(θk),var(θk),harmmean(θk)] (8)
Wherein, mean (), var () and harmmean () respectively indicate arithmetic average, variance and harmonic average;fcBy four Kind curvature is constituted:
Wherein c1For minimum principal curvatures, c2For maximum principal curvatures, (c1+c2)/2 are mean value curvature, c1c2For Gaussian curvature;
The Laplacian Matrix of three-dimensional grid model is a symmetrical matrix and can decompose are as follows:
L=B Λ BT (10)
Wherein Λ=Diag { λf, 1≤f≤Ψ } and it be the element of a diagonal matrix and the inside is arranged according to ascending order, λfIt is three-dimensional The characteristic value of the Laplacian Matrix of grid model;The column vector of orthogonal matrix B is corresponding characteristic vector, and Ψ is three-dimensional grid The sum at model midpoint, Laplce's spectrum is defined as:
H (f)={ λf,1≤f≤Ψ} (11)
Global information, logarithm-Laplce spectrum L (f) are obtained using logarithm-Laplce's spectrum is defined as:
L (f)=log (H (f)) (12)
The scrambling R (f) of spectrum is used to obtain grid conspicuousness:
R (f)=| L (f)-JΓ(f)*L(f)| (13)
Wherein,It is the vector of a 1 × Γ, passes through following formula:
The scrambling of spectrum is transformed into real domain from spectral domain, wherein R1=Diag { exp (R (f)): 1≤f≤Ψ } is diagonal Matrix,For Hadamard product, W is weight matrix, wherein
It enablesS is the element of S, then:
fls=[max (sk),min(sk),max(sk)-min(sk),mean(sk),var(sk),harmmean(sk)] (17)
Step 3, the characteristic attribute collection and corresponding tally set built using step 2, the sparse self-encoding encoder of training and depth mind Through network:
Sparse self-encoding encoder is a variant of self-encoding encoder, and adding sparsity limitation in the hidden layer part of self-encoding encoder It obtains;Three sparse self-encoding encoders are trained first, and the coded portion of these three sparse self-encoding encoders is then extracted cascade Together, the sparse self-encoding encoder of depth is formed, first level logical is then trained to return layer to handle the sparse self-encoding encoder of depth The feature exported after coding;
Deep neural network regression model is made of the sparse self-encoding encoder of depth and above-mentioned logistic regression level connection;
Finally the effect that deep neural network regression model realizes accurate adjustment is acted on using back-propagation algorithm;
Step 4, the deep neural network regression model obtained using step 3 are predicted three-dimensional grid model and are obtained corresponding Conspicuousness response diagram:
Characteristic attribute is formed to each point of three-dimensional grid model in test set using method same in step 2, and with step Rapid 3 obtained deep neural network regression models predict the point, obtain a regressand value;Three-dimensional grid model is obtained again The regressand value of middle all the points is constituted the conspicuousness response diagram of the three-dimensional grid model with it;
Step 5, the conspicuousness response diagram obtained according to step 4 obtain three-dimensional key point:
The point in conspicuousness response diagram with local maximum is chosen as three-dimensional key point;For every in three-dimensional grid model One point, if the conspicuousness response of the point is all bigger than the conspicuousness response put in 5- ring neighborhood around it, which is Three-dimensional key point;Otherwise, which is not just three-dimensional key point.
CN201610279232.4A 2016-04-29 2016-04-29 Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder Expired - Fee Related CN106709997B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610279232.4A CN106709997B (en) 2016-04-29 2016-04-29 Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610279232.4A CN106709997B (en) 2016-04-29 2016-04-29 Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder

Publications (2)

Publication Number Publication Date
CN106709997A CN106709997A (en) 2017-05-24
CN106709997B true CN106709997B (en) 2019-07-19

Family

ID=58939671

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610279232.4A Expired - Fee Related CN106709997B (en) 2016-04-29 2016-04-29 Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder

Country Status (1)

Country Link
CN (1) CN106709997B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107346430A (en) * 2017-06-29 2017-11-14 济南浪潮高新科技投资发展有限公司 A kind of nuclear magnetic resonance image auto-screening method
CN107862315B (en) * 2017-11-02 2019-09-17 腾讯科技(深圳)有限公司 Subtitle extraction method, video searching method, subtitle sharing method and device
CN108596152B (en) * 2018-05-10 2021-07-20 湖北大学 Method for acquiring 3D structure from sequence image
CN109147430A (en) * 2018-10-19 2019-01-04 渭南师范学院 A kind of teleeducation system based on cloud platform
CN109448007B (en) * 2018-11-02 2020-10-09 北京迈格威科技有限公司 Image processing method, image processing apparatus, and storage medium
CN109543745B (en) * 2018-11-20 2021-08-24 江南大学 Feature learning method and image identification method based on conditional countermeasure self-coding network
CN109616104B (en) * 2019-01-31 2022-12-30 天津大学 Environment sound identification method based on key point coding and multi-pulse learning

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104112263A (en) * 2014-06-28 2014-10-22 南京理工大学 Method for fusing full-color image and multispectral image based on deep neural network
CN105205453A (en) * 2015-08-28 2015-12-30 中国科学院自动化研究所 Depth-auto-encoder-based human eye detection and positioning method
CN105512680A (en) * 2015-12-02 2016-04-20 北京航空航天大学 Multi-view SAR image target recognition method based on depth neural network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104112263A (en) * 2014-06-28 2014-10-22 南京理工大学 Method for fusing full-color image and multispectral image based on deep neural network
CN105205453A (en) * 2015-08-28 2015-12-30 中国科学院自动化研究所 Depth-auto-encoder-based human eye detection and positioning method
CN105512680A (en) * 2015-12-02 2016-04-20 北京航空航天大学 Multi-view SAR image target recognition method based on depth neural network

Also Published As

Publication number Publication date
CN106709997A (en) 2017-05-24

Similar Documents

Publication Publication Date Title
CN106709997B (en) Three-dimensional critical point detection method based on deep neural network and sparse self-encoding encoder
CN107766933B (en) Visualization method for explaining convolutional neural network
Smith et al. Measuring texture classification algorithms
CN108038846A (en) Transmission line equipment image defect detection method and system based on multilayer convolutional neural networks
CN111241989A (en) Image recognition method and device and electronic equipment
Xie et al. Combination of dominant color descriptor and Hu moments in consistent zone for content based image retrieval
CN106354735A (en) Image target searching method and device
Zhang et al. Fabric defect classification based on LBP and GLCM
CN105930796B (en) A kind of single sample facial image recognition method based on depth self-encoding encoder
CN108897791A (en) A kind of image search method based on depth convolution feature and semantic similarity amount
CN106250918B (en) A kind of mixed Gauss model matching process based on improved soil-shifting distance
Garro et al. Scale space graph representation and kernel matching for non rigid and textured 3D shape retrieval
CN107704840A (en) A kind of remote sensing images Approach for road detection based on deep learning
Dong et al. Fusing multilevel deep features for fabric defect detection based NTV-RPCA
Merzoug et al. Lesions detection of multiple sclerosis in 3D brian MR images by using artificial immune systems and support vector machines
CN109902692A (en) A kind of image classification method based on regional area depth characteristic coding
Srinidhi et al. Pothole detection using CNN and AlexNet
CN117011274A (en) Automatic glass bottle detection system and method thereof
Blackledge et al. A surface inspection machine vision system that includes fractal texture analysis
Gao et al. A novel patterned fabric defect detection algorithm based on GHOG and low-rank recovery
Kulkarni et al. Comparison of methods for detection of copy-move forgery in digital images
CN109697257A (en) It is a kind of based on the network information retrieval method presorted with feature learning anti-noise
Patel et al. Surface defect detection using SVM‐based machine vision system with optimized feature
JP6355372B2 (en) 3D model feature extraction method and 3D model annotation system
Prasomphan Cultural heritage content management system by deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190719

CF01 Termination of patent right due to non-payment of annual fee