CN108154066A

CN108154066A - A kind of Three-dimensional target recognition method based on curvature feature recurrent neural network

Info

Publication number: CN108154066A
Application number: CN201611096314.1A
Authority: CN
Inventors: 梁炜; 李杨; 郑萌; 谈金东; 彭士伟
Original assignee: Shenyang Institute of Automation of CAS
Current assignee: Shenyang Institute of Automation of CAS
Priority date: 2016-12-02
Filing date: 2016-12-02
Publication date: 2018-06-12
Anticipated expiration: 2036-12-02
Also published as: CN108154066B

Abstract

The present invention relates to image recognition technology, in order to effectively portray feature of the objective under different visual angles, for picture noise problem existing during Three-dimensional target recognition, it is proposed that a kind of Three-dimensional target recognition method based on curvature feature recurrent neural network.First, the present invention obtains the joint curvature of target three-dimensional by the local average Gaussian curvature and average mean curvature that calculate target three-dimensional, and pass through the curvature sketch that extraction joint curvature local maximum forms threedimensional model, convert input of the 360 ° of two-dimensional image sequences of generation as training recurrent neural network by the use of transmission projection；Secondly, by the use of forward-backward recutrnce neural network (BRNN) as threedimensional model various visual angles sequence signature learning method, the identification classification of correct probability maximum is acquired using softmax functions at softmax layers.The present invention can automatically extract the common trait of objective and two dimensional image, and preferable robustness and higher object recognition rate can be kept under the conditions of picture noise.

Description

A kind of Three-dimensional target recognition method based on curvature feature recurrent neural network

Technical field

The present invention relates to image identification technical field, it is specifically a kind of based on curvature feature recurrent neural network three Tie up target identification method.

Background technology

Three-dimensional target recognition refers to detect automatically from any given two dimensional image scene, positions, identifies specified mesh The process of mark pattern is one of critical issue of computer vision research.It is three-dimensional with the continuous development of computer vision technique Target identification is applied to the fields such as industrial detection, augmented reality and medical image more and more widely.But due to by illumination The influence of the factors such as variation, picture noise and target occlusion, it is difficult to extract objective and its two dimensional image under different visual angles Common trait, become Three-dimensional target recognition urgent problem to be solved.

The key of Three-dimensional target recognition is to find the two dimension expression of three dimensional object model, extracts objective and two dimensional image Common trait.Existing Three-dimensional target recognition method mainly includes the method based on handmarking's point, based on geometric properties Method and method based on deep learning etc..Method based on handmarking's point needs the feature in artificial initialization two dimensional image Point, due to needing man-machine interactively, so such method does not have repeatability；Based on the method for geometric properties by extracting target Center line skeleton, the information realizations target identification such as contour shape, but such method is identified in the case where image is there are noise Effect is poor；Method based on deep learning is believed low-level multi-features into semanteme using deep neural network The high-level feature of breath can well solve the picture noise problem of two dimensional image during Three-dimensional target recognition, but logical The depth convolutional neural networks often used are beyond expression sequence properties, it is impossible to effectively portray objective under different visual angles Feature.Therefore, there is an urgent need for propose it is a kind of in different visual angles image to the automatized three-dimensional target identification of picture noise problem robust Method.

Invention content

The present invention seeks to can more effectively portray feature of the objective under different visual angles, feature extraction is reduced Journey improves Three-dimensional target recognition accuracy rate, the present invention proposes a kind of based on curvature feature recurrence to the sensitivity of picture noise The Three-dimensional target recognition method of neural network.

Present invention technical solution used for the above purpose is：It is a kind of based on curvature feature recurrent neural network Three-dimensional target recognition method, includes the following steps：

Step 1：Calculate the joint curvature of target three-dimensionalExtract joint curvatureLocal maximum form three The curvature sketch R of dimension module_Sketch；Again to the curvature sketch R of threedimensional model_Sketch360 ° of two dimensions of generation are converted using transmission projection Image P_m, wherein m=1,2 ..., 360；

Step 2：360 ° of two dimensional images are inputted into BRNN, carrying out study using multi-angle feature calculates it under various visual angles Sequence properties；Identification classification in the softmax layers of correct probability maximum for acquiring using softmax functions sequence properties；Institute BRNN is stated as forward-backward recutrnce neural network.

The joint curvature for calculating target three-dimensionalInclude the following steps：

IfIt is the normal vector that a bit (x, y, z) is given on target three-dimensional R；It enablesThen p_x,p_y,q_x,q_yIt is defined as

Calculate the mean-Gaussian curvature around the normal vector of every bit in 3 × 3 neighborhoods on threedimensional model RAveragely It is worth curvature

Wherein,For average curvature matrix, Trace () is the mark of matrix,Respectively p, q, p_x,p_y,q_x,q_yAverage value in 3 × 3 neighborhoods；

Define the joint curvature of target three-dimensional RFor：

It is described that 360 ° of two dimensional images are inputted into BRNN, it carries out study using multi-angle feature and calculates it under various visual angles Sequence properties include the following steps：

Obtain the one-dimensional characteristic sequence T of 360 ° of two dimensional images_S, s=1,2 ..., 360, then characteristic sequence T_SIn BRNN i-th The output of layer is divided into positive outputWith reversed outputAnd it is exported respectively with the forward direction of the upper sequences of this layer of BRNN The reversed output of the lower sequences of this layer of BRNNAnd the positive output of last layer BRNNWith reversed outputJust like ShiShimonoseki System：

Wherein,For the weight matrix of each outlet chamber, b is biasing, and tanh is Neuron activation functions；

Then characteristic sequence T_SIn total output O of BRNN^s, the input I of as full articulamentum fc_fcFor：

Wherein,Respectively positive output and reversely output are in the connection weight of full articulamentum；

Therefore, characteristic sequence T_SIt is in the cumulative output of full articulamentum fcAs sequence properties.

The identification classification in the softmax layers of correct probability maximum for acquiring using softmax functions sequence properties, Include the following steps：

Correct probability p (the C that recognition result is kth class are calculated using softmax functions at softmax layers_k)

Wherein, C is total for identification classification, A_kSequence properties for kth class objective are in the cumulative defeated of full articulamentum fc Go out result；

When then acquiring loss function minimum value using maximum Likelihood, i.e. correct probability p (C_k) it is maximum when Identify classification k：

Wherein, δ () is Kronecker functionR represents characteristic sequence T_SJust Really identification classification.

The invention has the advantages that and advantage：

1. the joint curvature sketch feature extracting method that the present invention designs, can automatically extract threedimensional model and two dimensional image Common trait, and combine local average Gaussian curvature used in curvature and local average mean curvature and can effectively solve Certainly picture noise problem.

2. present invention design multi-angle feature learning forward-backward recutrnce neural network, can consider threedimensional model polygonal simultaneously Characteristic sequence under degree, can accurately identify objective in two dimensional image at any angle.

Description of the drawings

Fig. 1 is the method for the present invention flow chart；

Fig. 2 is the multi-angle feature learning forward-backward recutrnce neural network framework figure in the method for the present invention.

Specific embodiment

The present invention is described in further detail with reference to the accompanying drawings and embodiments.

The present invention is broadly divided into two parts, is as shown in Figure 1 the method for the present invention flow chart, the specific implementation process is as follows institute It states.

Step 1：The joint curvature of target three-dimensional is calculated, and passes through extraction joint curvature local maximum and forms three-dimensional The curvature sketch of model converts input of the 360 ° of two dimensional images of generation as training recurrent neural network by the use of transmission projection；

Step 1.1：IfIt is the normal vector that a bit (x, y, z) is given on threedimensional model.It enablesThen p_x,p_y,q_x,q_yIt is defined asThen threedimensional model Gaussian curvature G_KFor

G_K=| C |,

Its curvature matrixThreedimensional model Mean value curvature M_KForTrace () is the mark of matrix.In order to eliminate influence of noise, the present invention calculates Mean-Gaussian curvature on threedimensional model around the normal vector of every bit in its 3 × 3 neighborhoodWith average mean curvature

WhereinFor average curvature matrix,Respectively p, q, p_x,p_y,q_x,q_yAverage value in 3 × 3 neighborhoods.We define threedimensional model as a result, Joint curvatureFor

Step 1.2：Extract joint curvatureLocal maximum point form the curvature sketch R of threedimensional model R_Sketch.It is logical Cross perspective projection transformation, generation three dimensional curvature sketch R_Sketch360 ° of two-dimensional projection image P_m, m=1,2 ..., 360, as The input of BRNN.

Step 2：The present invention is using a kind of depth recurrent neural network (DRNN) as curvature feature recognition methods, DRNN frames Frame is as shown in Figure 2.Sequence properties of the threedimensional model under various visual angles are portrayed using multi-angle feature learning BRNN, in softmax Layer acquires the identification classification of correct probability maximum using softmax functions.

Step 2.1：In order to portray the sequentiality of threedimensional model feature under different visual angles, threedimensional model is defined in various visual angles Under one-dimensional characteristic sequence be T_S, s=1,2 ..., 360, then characteristic sequence T_SIt is divided into positive output in i-th layer of outputs of BRNNWith reversed outputIt is exported respectively with the forward direction of the upper sequences of this layer of BRNNThe lower sequences of this layer of BRNN it is reversed defeated Go outAnd the positive output of last layer BRNNWith reversed outputThere is following relationship：

WhereinFor the weight matrix of each outlet chamber, b is bigoted, and tanh is god Through first activation primitive；Then characteristic sequence T_SIn total output O of BRNN^s, the input I of as full articulamentum fc_fcFor

Wherein,Respectively positive output and reversely output are in the connection weight of full articulamentum.

Step 2.2：Characteristic sequence T_SIt is in the cumulative output of full articulamentum fcAs sequence properties. Softmax layers calculate the correct probability p (C that recognition result is kth class using softmax functions_k)

Wherein C is total for identification classification, A_kFor kth class objective sequence properties full articulamentum fc cumulative output As a result.When then acquiring loss function minimum value using maximum Likelihood, i.e. correct probability p (C_k) it is maximum when identification Classification k：

Wherein δ () is Kronecker functionR represents characteristic sequence T_SCorrect knowledge Other classification.

Claims

A kind of 1. Three-dimensional target recognition method based on curvature feature recurrent neural network, which is characterized in that include the following steps：

Step 1：Calculate the joint curvature of target three-dimensionalExtract joint curvatureLocal maximum form three-dimensional mould The curvature sketch R of type_Sketch；Again to the curvature sketch R of threedimensional model_Sketch360 ° of two dimensional images of generation are converted using transmission projection P_m, wherein m=1,2 ..., 360；

Step 2：360 ° of two dimensional images are inputted into BRNN, carrying out study using multi-angle feature calculates its sequence under various visual angles Attribute；Identification classification in the softmax layers of correct probability maximum for acquiring using softmax functions sequence properties；It is described BRNN is forward-backward recutrnce neural network.
2. according to the Three-dimensional target recognition method based on curvature feature recurrent neural network a kind of described in claim 1, feature It is, the joint curvature for calculating target three-dimensionalInclude the following steps：

IfIt is the normal vector that a bit (x, y, z) is given on target three-dimensional R；It enablesThen p_x,p_y,q_x,q_yIt is defined as

Calculate the mean-Gaussian curvature around the normal vector of every bit in 3 × 3 neighborhoods on threedimensional model RWith average mean curvature

Wherein,For average curvature matrix, trace () It is the mark of matrix,Respectively p, q, p_x,p_y,q_x,q_yAverage value in 3 × 3 neighborhoods；

Define the joint curvature of target three-dimensional RFor：
3. according to the Three-dimensional target recognition method based on curvature feature recurrent neural network a kind of described in claim 1, feature It is, it is described that 360 ° of two dimensional images are inputted into BRNN, it carries out study using multi-angle feature and calculates its sequence under various visual angles Column Properties include the following steps：

Obtain the one-dimensional characteristic sequence T of 360 ° of two dimensional images_S, s=1,2 ..., 360, then characteristic sequence T_SI-th layer of BRNN's Output is divided into positive outputWith reversed outputAnd it is exported respectively with the forward direction of the upper sequences of this layer of BRNNThis layer The reversed output of the lower sequences of BRNNAnd the positive output of last layer BRNNWith reversed outputThere is following relationship：

Wherein,For the weight matrix of each outlet chamber, b is biasing, and tanh is nerve First activation primitive；

Then characteristic sequence T_SIn total output O of BRNN^s, the input I of as full articulamentum fc_fcFor：

Wherein,Respectively positive output and reversely output are in the connection weight of full articulamentum；

Therefore, characteristic sequence T_SIt is in the cumulative output of full articulamentum fcAs sequence properties.
4. according to the Three-dimensional target recognition method based on curvature feature recurrent neural network a kind of described in claim 1, feature It is, the identification classification in the softmax layers of correct probability maximum for acquiring using softmax functions sequence properties, packet Include following steps：

Correct probability p (the C that recognition result is kth class are calculated using softmax functions at softmax layers_k)

Wherein, C is total for identification classification, A_kFor kth class objective sequence properties full articulamentum fc cumulative output knot Fruit；

When then acquiring loss function minimum value using maximum Likelihood, i.e. correct probability p (C_k) it is maximum when identification class Other k：

Wherein, δ () is Kronecker functionR represents characteristic sequence T_SCorrect identification Classification.