CN109785389A

CN109785389A - Three-dimensional object detection method based on hash description and iterative closest point

Info

Publication number: CN109785389A
Application number: CN201910049505.XA
Authority: CN
Inventors: 杨厚易
Original assignee: Sichuan Changhong Electric Co Ltd
Current assignee: Sichuan Changhong Electric Co Ltd
Priority date: 2019-01-18
Filing date: 2019-01-18
Publication date: 2019-05-21

Abstract

The invention discloses a three-dimensional object detection method based on hash description and iterative closest point. By detecting the pose data of a target object in three-dimensional information, the method allows working tools such as robots to complete richer and more complex tasks, so that production can shift from structured to unstructured settings. A hash description is used to describe the three-dimensional data uniquely, which reduces matching against invalid data and speeds up feature matching. A point-to-tangent-plane iterative closest point method is used, which makes the matching more reliable and more accurate.

Description

A three-dimensional object detection method based on hash description and iterative closest point

Technical field

The present invention relates to the field of three-dimensional object recognition in machine vision, and more particularly to a three-dimensional object detection method based on hash description and iterative closest point.

Background art

Methods that perceive a working scene through machine vision are widely used in fields such as express parcel sorting, object ranging and automatic assembly. Recognition and localization in two-dimensional images in particular are highly mature and can meet most current automated production requirements. With the gradual advance of the national Industry 4.0 initiative, the manufacturing industry demands ever higher degrees of automation, and recognition and localization based on two-dimensional images are becoming inadequate. As three-dimensional vision sensors mature and fall in price, more and more applications are starting to use them to acquire three-dimensional information of the working scene and to detect the position and attitude of targets in the scene from that information, that is, pose recognition.

Prior-art pose recognition methods for three-dimensional objects need to extract and match a large amount of feature information from the three-dimensional data. Recognition is slow and localization accuracy is low, so fast and accurate three-dimensional pose recognition cannot be achieved.

Summary of the invention

The object of the present invention is to provide a three-dimensional object detection method based on hash description and iterative closest point, which has the advantages of reducing the time complexity of three-dimensional feature search and improving recognition and localization accuracy.

The above object of the present invention is achieved by the following technical solution:

A three-dimensional object detection method based on hash description and iterative closest point, which uses a depth camera and a PC and comprises the following steps:

S1, the depth camera acquires scene data of the object to be detected;

S2, the depth camera emits infrared rays into the scene, and the infrared receiver in the depth camera receives the near-infrared reflections from the scene and generates the three-dimensional point data of the scene, that is, the XYZ information of each reflection point in the scene relative to the depth camera coordinate system;

S3, the depth camera transfers the acquired scene data to the PC, where it is saved in PLY format;

S4, the target model M of the object to be detected is extracted from the PLY-format scene data obtained in S3;

S5, the PC computes the covariance matrix for each point of the three-dimensional model point cloud and finds the eigenvectors of the covariance matrix to obtain the normal vector of each point;

S6, the point pair features of the target model M are generated from the normal vectors obtained in S5;

S7, all point pair features of the target model M are converted into hash values to generate the hash description of the target model M;

S8, the scene S is processed using the depth camera and the PC to obtain the hash values of the scene S; the hash description of the target model M is accessed to quickly find matching similar point pairs, and the matching results are converted into Hough votes;

S9, the Hough votes are counted by a global Hough voting accumulator; the highest-scoring vote is taken as the optimal Hough vote of the scene S, and the initial detection result for the target model M is generated from the optimal Hough vote;

S10, the initial detection result is corrected a second time by the iterative closest point algorithm to obtain an accurate target pose detection result.

Further, in step S4, the coordinates of the scene data are defined in the depth sensor frame, and each data point has coordinates (x, y, z). The number of occurrences of each z value over all data is counted, and the data having the most frequent z value are removed, thereby obtaining the target model M of the object to be detected.
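The depth filtering of step S4 can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `remove_dominant_depth` and the bin width are assumptions, and z values are grouped into bins because real depth readings are noisy and rarely repeat exactly.

```python
import numpy as np

def remove_dominant_depth(points, bin_width=0.01):
    """Drop the points whose z value falls in the most populated depth bin.

    bin_width is an assumed tolerance (same unit as the point cloud); the
    text only says that the most frequent z value is removed.
    """
    points = np.asarray(points, dtype=float)
    bins = np.floor(points[:, 2] / bin_width).astype(np.int64)
    values, counts = np.unique(bins, return_counts=True)
    dominant = values[np.argmax(counts)]
    return points[bins != dominant]

# Four points on a flat background near z = 1.0 and two points on an object.
scene = np.array([
    [0.0, 0.0, 1.001], [0.1, 0.0, 1.002], [0.2, 0.0, 1.003], [0.3, 0.0, 1.004],
    [0.1, 0.1, 0.803], [0.2, 0.1, 0.857],
])
model = remove_dominant_depth(scene)
```

With the sample data the four background points share one depth bin and are removed, leaving the two object points.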

Further, step S5 obtains the normal vector of each three-dimensional data point in the target model M. The target model is M = {m₀, m₁, m₂, ..., m_(k-1), m_k}. The normal vector of each three-dimensional data point can be computed from the neighboring points around it. To compute the normal vector of a point, the covariance matrix between that point and its neighbors must first be obtained. Suppose the normal vector of point m_i is sought; the covariance matrix C is computed as shown in formula (1), where R denotes the radius of the spherical region centered at m_i and the distance values d_i (i ∈ {1, 2, ..., k}) denote the Euclidean distance between a neighboring point m_j and the center point m_i. Formula (1) gives the covariance matrix C of point m_i and all points within radius R:

The normal vector of point m_i can be computed from the covariance matrix. Eigenvalue decomposition of the covariance matrix C yields its eigenvalues and eigenvectors. For point cloud data sampled from an object surface, the spatial distribution varies least along the normal direction, so the normal vector of the center point m_i is the eigenvector corresponding to the smallest eigenvalue of C. The normal vectors of all points are obtained in the same way.
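The normal estimation described above can be sketched in a few lines. Formula (1) is not reproduced in this text, so the distance weighting `R - d` used below is an assumption suggested by the symbols R and d_i; the function name `estimate_normal` is hypothetical as well. The eigenvector selection follows the text: the normal is the eigenvector of the smallest eigenvalue.

```python
import numpy as np

def estimate_normal(points, i, radius):
    """Estimate the normal at points[i] from its neighbors within `radius`."""
    center = points[i]
    dist = np.linalg.norm(points - center, axis=1)
    mask = (dist > 0) & (dist <= radius)
    diffs = points[mask] - center
    weights = radius - dist[mask]          # assumed weighting, see lead-in
    cov = (weights[:, None] * diffs).T @ diffs / weights.sum()
    # eigh returns eigenvalues in ascending order, so column 0 belongs to
    # the smallest eigenvalue, i.e. the direction of least variation.
    _, eigvecs = np.linalg.eigh(cov)
    return eigvecs[:, 0]

# A flat 5 x 5 patch in the plane z = 0; its normal should be +/- (0, 0, 1).
xs, ys = np.meshgrid(np.arange(5) * 0.1, np.arange(5) * 0.1)
patch = np.column_stack([xs.ravel(), ys.ravel(), np.zeros(25)])
normal = estimate_normal(patch, 12, radius=0.25)
```

The sign of the returned vector is arbitrary; a full pipeline would usually orient it, for example toward the sensor.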

Further, in step S6, the normal vector of each point is used to compute the point pair features. For a point p_i in the three-dimensional point cloud and any other point p_j, the two points form a point pair, and the normal vectors and the Euclidean distance of the two points form the point pair feature of that pair, computed as shown in formula (2):

The feature is built from the direction of the line connecting the two points, the Euclidean length of that line, the angle between the normal vector of p_i and the line direction, and the angle between the normal vectors of the two points. The angle between two vectors is computed as shown in formula (3):
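As a hedged illustration of the point pair feature, the sketch below computes four components for one pair. Formulas (2) and (3) are not reproduced in this text, so the component order (distance, angle of each normal to the connecting line, angle between the normals), the direction convention `p_j - p_i` and the arccos-based angle are assumptions that follow the common four-component point pair feature; the function names are hypothetical.

```python
import numpy as np

def angle_between(u, v):
    """Angle in [0, pi] between two vectors, via the normalized dot product."""
    cos = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))

def point_pair_feature(p_i, n_i, p_j, n_j):
    """Return (distance, angle(n_i, d), angle(n_j, d), angle(n_i, n_j))."""
    d = p_j - p_i                          # direction of the connecting line
    return (float(np.linalg.norm(d)),
            angle_between(n_i, d),
            angle_between(n_j, d),
            angle_between(n_i, n_j))

# Two points one unit apart on a flat surface with parallel normals.
feature = point_pair_feature(np.array([0.0, 0.0, 0.0]), np.array([0.0, 0.0, 1.0]),
                             np.array([1.0, 0.0, 0.0]), np.array([0.0, 0.0, 1.0]))
```

For this pair both normals are perpendicular to the connecting line and parallel to each other, so the feature is (1, π/2, π/2, 0).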

Further, in step S7, all point pair features of the target model M are converted into hash values to generate the hash description of the target model M. For any point pair (p_i, p_j) ∈ p in a point set p, the point pair feature is expressed as in formula (2), and formula (4) converts the point pair feature into a hash value:

Here d_dist denotes the distance sampling step and d_angle the angle sampling step, with d_angle = 2π/N_angle, where N_angle is the angle granularity. The user can adjust N_angle according to the actual results; it is usually set to 30. The point pair features are discretized by d_dist and d_angle so that similar features are converted into the same feature value. After the point pair feature F is discretized, the hash value of the point pair feature F(p₁, p₂)_discretized = F(f₁, f₂, f₃, f₄) is given by formula (5):

Index=P₁*f₁+P₂*f₂+P₃*f₃+f₄ (5)

P₁, P₂ and P₃ in formula (5) are three distinct prime numbers, chosen so that the hash values obtained after conversion are unique. The hash values of all point pairs are computed, and point pairs with the same hash value are stored together in a hash table for fast access. All hash values together form the hash table, which is called the hash description of the point cloud data.
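The discretization and formula (5) can be sketched as below. Formula (4) is not reproduced in this text, so dividing each component by its step and rounding down is an assumption; the step `D_DIST` and the primes 10007, 1009 and 101 are hypothetical values picked only for illustration, and `build_hash_description` is a hypothetical helper name.

```python
import math
from collections import defaultdict

N_ANGLE = 30                        # angle granularity suggested in the text
D_ANGLE = 2 * math.pi / N_ANGLE     # angle sampling step
D_DIST = 0.01                       # assumed distance sampling step
P1, P2, P3 = 10007, 1009, 101       # hypothetical distinct primes

def discretize(feature, d_dist=D_DIST, d_angle=D_ANGLE):
    """Map (distance, angle, angle, angle) to integer bins (f1, f2, f3, f4)."""
    dist, a1, a2, a3 = feature
    return (int(dist // d_dist), int(a1 // d_angle),
            int(a2 // d_angle), int(a3 // d_angle))

def hash_index(f):
    """Formula (5): Index = P1*f1 + P2*f2 + P3*f3 + f4."""
    f1, f2, f3, f4 = f
    return P1 * f1 + P2 * f2 + P3 * f3 + f4

def build_hash_description(features_by_pair):
    """Group point pairs (keyed by index pairs) under their hash value."""
    table = defaultdict(list)
    for pair, feature in features_by_pair.items():
        table[hash_index(discretize(feature))].append(pair)
    return table

# Two slightly different features fall into the same bins and share a bucket.
table = build_hash_description({
    (0, 1): (0.025, 0.50, 1.00, 0.10),
    (2, 3): (0.026, 0.51, 1.01, 0.11),
})
```

A dictionary of lists is one simple way to store pairs with equal hash values together; whether a given choice of primes keeps distinct discretized features apart depends on the range of the bin indices.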

Further, in step S8, Hough votes are generated using the hash description. Let the target model be M and the three-dimensional point cloud data set of the scene be S. The hash description of the target model point cloud is generated as in step S7. The point pairs of the scene data set S are converted into hash values and used to access the hash description generated from the target model M, which gives fast matching of similar features. The matching results are then converted into Hough votes for global Hough voting.

Further, in step S9, the Hough votes are counted by the global Hough voting accumulator, the highest-scoring vote is taken as the optimal Hough vote of the scene S, and the initial detection result for the target model M is generated from it. In a Hough vote (m_r, α), m_r is obtained directly from (m_r, m_i), while the angle α is computed by formula (6). A world coordinate system W is defined; the two transforms in formula (6) describe the translation that moves the points s_r and m_r to the origin of the world unit coordinate system and the rotation that aligns the normal vectors of the two points with the principal axis x of the world unit coordinate system:

For point pairs (m_r, m_i) ∈ M and (s_r, s_i) ∈ S with the same hash value, the normal vectors of points s_r and m_r are first rotated so that they are parallel to, and point in the same direction as, the principal axis x of the world unit coordinate system W, and the two points are then moved to coincide with the origin of the world unit coordinate system. The point pair (m_r, m_i) now only needs to be rotated by the angle α about the x axis of the world unit coordinate system to coincide with the point pair (s_r, s_i). Once the two transforms have been computed with respect to the world unit coordinate system, the angle α can be solved from formula (6), which completes the Hough vote (m_r, α) of the scene point s_r;
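The alignment described above can be sketched as follows. Formulas (6) and (7) are not reproduced in this text, so this is an assumed reading of the prose: each reference point is moved to the origin with its normal turned onto +x, α is the difference of the two in-plane angles about x, and the pose is composed as inverse scene alignment, rotation about x, model alignment. All function names are hypothetical.

```python
import numpy as np

def align_to_x(p, n):
    """4x4 transform that moves p to the origin and turns normal n onto +x."""
    n = np.asarray(n, dtype=float)
    n = n / np.linalg.norm(n)
    x_axis = np.array([1.0, 0.0, 0.0])
    axis = np.cross(n, x_axis)
    s = np.linalg.norm(axis)       # sine of the angle between n and +x
    c = float(np.dot(n, x_axis))   # cosine of the same angle
    if s < 1e-12:
        # n already lies along +x, or is exactly opposite (half turn about z)
        R = np.eye(3) if c > 0 else np.diag([-1.0, -1.0, 1.0])
    else:
        k = axis / s
        K = np.array([[0.0, -k[2], k[1]], [k[2], 0.0, -k[0]], [-k[1], k[0], 0.0]])
        R = np.eye(3) + s * K + (1.0 - c) * (K @ K)   # Rodrigues formula
    T = np.eye(4)
    T[:3, :3] = R
    T[:3, 3] = -R @ np.asarray(p, dtype=float)
    return T

def rot_x(alpha):
    """Homogeneous rotation about the x axis."""
    c, s = np.cos(alpha), np.sin(alpha)
    return np.array([[1, 0, 0, 0], [0, c, -s, 0], [0, s, c, 0], [0, 0, 0, 1]], dtype=float)

def yz_angle(T, q):
    """Angle of the transformed point q around the x axis."""
    v = T[:3, :3] @ np.asarray(q, dtype=float) + T[:3, 3]
    return np.arctan2(v[2], v[1])

def hough_vote_angle(m_r, n_m, m_i, s_r, n_s, s_i):
    """Return alpha (wrapped to [-pi, pi)) and the two alignment transforms."""
    T_m, T_s = align_to_x(m_r, n_m), align_to_x(s_r, n_s)
    alpha = yz_angle(T_s, s_i) - yz_angle(T_m, m_i)
    return (alpha + np.pi) % (2 * np.pi) - np.pi, T_m, T_s

def pose_from_vote(alpha, T_m, T_s):
    """Assumed composition of the model-to-scene pose from one vote."""
    return np.linalg.inv(T_s) @ rot_x(alpha) @ T_m
```

When the scene pair is an exact rigid copy of the model pair, `pose_from_vote` recovers that rigid motion, because a reference point, its normal and one further point off the normal line fix the pose completely.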

The optimal Hough vote is then selected by voting. An n×m global Hough voting accumulator is established to collect the number of votes for each Hough vote. The rows of the two-dimensional accumulator correspond to the reference points selected from the model point cloud, so the number of rows n equals the number of reference points. The number of columns m depends on the angle granularity N_angle; when N_angle = 30 the voting accumulator has 30 columns;

A scene reference point s_r can form point pairs with the other scene points s_i. Each scene point pair may find several similar model point pairs (m_r, m_i), which indicate where (s_r, s_i) may lie in the target model M. Computing the angle α between the scene point pair (s_r, s_i) and each similar pair (m_r, m_i) gives multiple Hough votes (m_r, α) for the scene point s_r, and the corresponding cell of the voting accumulator is incremented by one for each. After the Hough votes have been computed for all point pairs formed by the current scene reference point and the other scene points, the peak of the voting accumulator is taken as the optimal Hough vote of the current scene point. Substituting the optimal Hough vote into formula (7) gives the initial pose detection result RigidTrans between the target model M and the scene S:
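The voting scheme can be illustrated with a small accumulator. This is a sketch under stated assumptions: votes arrive as (model reference index, α) tuples, α is binned into N_angle columns, and the centre of the winning bin is returned as the angle estimate; `cast_votes` is a hypothetical name.

```python
import numpy as np

N_ANGLE = 30   # angle granularity; gives 30 accumulator columns

def cast_votes(votes, n_model_refs, n_angle=N_ANGLE):
    """Accumulate (model reference index, alpha) votes and return the peak."""
    acc = np.zeros((n_model_refs, n_angle), dtype=np.int64)
    d_angle = 2 * np.pi / n_angle
    for m_r_index, alpha in votes:
        # Wrap alpha into [0, 2*pi) and guard the upper edge of the last bin.
        col = int((alpha % (2 * np.pi)) // d_angle) % n_angle
        acc[m_r_index, col] += 1
    row, col = np.unravel_index(np.argmax(acc), acc.shape)
    best_alpha = (col + 0.5) * d_angle   # centre of the winning angle bin
    return acc, int(row), float(best_alpha)

# Three consistent votes for model reference point 2 and two stray votes.
votes = [(2, 0.50), (2, 0.52), (2, 0.55), (0, 3.00), (1, 0.50)]
acc, best_row, best_alpha = cast_votes(votes, n_model_refs=3)
```

The peak cell identifies both the model reference point (row) and the rotation angle bin (column) of the optimal Hough vote.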

Further, in step S10, the initial detection result is corrected a second time using the iterative closest point algorithm. The initial detection result reflects the pose relationship between the target model M and the scene S. Because the initial detection result already satisfies the condition that the poses are close, the iterative closest point algorithm can be linearized, so that it runs faster while maintaining accuracy;

The second correction of the initial detection result uses the point-to-tangent-plane iterative closest point algorithm. For this algorithm, the optimization objective of the error function is to minimize the mean squared distance from the model points to the tangent planes of the corresponding scene points;

The error function is shown in formula (8):

In formula (8), m_i = (m_ix, m_iy, m_iz, 1)^T denotes a point in the model point set, s_i = (s_ix, s_iy, s_iz, 1)^T denotes the point corresponding to m_i in the target point cloud, and n_i = (n_ix, n_iy, n_iz, 0)^T is the normal vector of point s_i;

M is a 4×4 transformation matrix. As formula (8) shows, after the target model is moved by the transformation matrix M, the difference between the moved model point m_i and the scene point s_i gives a vector describing the displacement. The dot product of this vector with the normal vector of point s_i estimates the distance from the moved point to the tangent plane at s_i. When all model points m_i have been moved by the matrix M and the sum in formula (8) reaches its minimum, that M is M_opt;

Because the pose of the model point set is close to that of the scene point set, the point-to-tangent-plane iterative closest point algorithm can approximate the nonlinear problem by a linear one, which makes the mathematical form of the algorithm simpler and the algorithm faster to run;

The transformation matrix M, which describes translation and rotation, is composed of a rotation part R(α, β, γ) and a translation part T(t_x, t_y, t_z), as shown in formula (9):
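The error described above can be evaluated directly. The sketch assumes the sum-of-squares form implied by the prose (formula (8) itself is not reproduced in this text), with points given as N×3 arrays and M as a 4×4 homogeneous matrix; `point_to_plane_error` is a hypothetical name.

```python
import numpy as np

def point_to_plane_error(M, model_pts, scene_pts, scene_normals):
    """Sum over i of ((M m_i - s_i) . n_i)^2 for corresponding points."""
    homogeneous = np.hstack([model_pts, np.ones((len(model_pts), 1))])
    moved = (M @ homogeneous.T).T[:, :3]
    # Row-wise dot product of the displacement with the scene normal.
    residuals = np.einsum("ij,ij->i", moved - scene_pts, scene_normals)
    return float(np.sum(residuals ** 2))

scene_pts = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
normals = np.tile([0.0, 0.0, 1.0], (3, 1))
lifted = scene_pts + np.array([0.0, 0.0, 0.1])    # offset along the normal
slid = scene_pts + np.array([0.2, 0.0, 0.0])      # offset inside the plane
error_lifted = point_to_plane_error(np.eye(4), lifted, scene_pts, normals)
error_slid = point_to_plane_error(np.eye(4), slid, scene_pts, normals)
```

Only the offset along the normal contributes to the error; sliding within the tangent plane costs nothing, which is the characteristic difference from the point-to-point error.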

M=T (t_x,t_y,t_z)·R(α,β,γ) (9)

Wherein:

The entries in formula (11) are:

r₁₁=cos γ cos β,

r₁₂=-sin γ cos α+cos γ sin β sin α,

r₁₃=sin γ sin α+cos γ sin β cos α,

r₂₁=sin γ cos β,

r₂₂=cos γ cos α+sin γ sin β sin α,

r₂₃=-cos γ sin α+sin γ sin β cos α,

r₃₁=-sin β

r₃₂=cos β sin α

r₃₃=cos β cos α
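As a quick numerical check, the nine entries listed above can be compared with a product of elementary rotations. The entries agree with the composition Rz(γ)·Ry(β)·Rx(α); the function names below are hypothetical.

```python
import numpy as np

def rot_x(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]])

def rot_y(b):
    c, s = np.cos(b), np.sin(b)
    return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])

def rot_z(g):
    c, s = np.cos(g), np.sin(g)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def rotation_from_entries(alpha, beta, gamma):
    """Rotation matrix assembled from the entries r11 ... r33 listed above."""
    ca, sa = np.cos(alpha), np.sin(alpha)
    cb, sb = np.cos(beta), np.sin(beta)
    cg, sg = np.cos(gamma), np.sin(gamma)
    return np.array([
        [cg * cb, -sg * ca + cg * sb * sa, sg * sa + cg * sb * ca],
        [sg * cb, cg * ca + sg * sb * sa, -cg * sa + sg * sb * ca],
        [-sb, cb * sa, cb * ca],
    ])

R_entries = rotation_from_entries(0.3, -0.5, 1.1)
R_product = rot_z(1.1) @ rot_y(-0.5) @ rot_x(0.3)
```

The two matrices coincide, so the listed entries describe a rotation about x applied first, then y, then z.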

Rx(α), Ry(β) and Rz(γ) denote rotations about the x, y and z axes respectively. If the pose of the model point set is close to that of the scene point set, the rotation angles are small, so the trigonometric functions in R can be replaced by their equivalent approximations, and the rotation part R of M can be written in the form of formula (12):

Therefore, the transformation matrix M can be expressed as:

Formula (8) can then be rewritten as:

Wherein:

For N pairs of corresponding points, formula (15) yields N linear equations, which can be written in the form Ax - b, where:

The problem of solving for M_opt is thus converted into the problem of solving for x_opt, which is a standard linear least-squares problem:

Formula (18) is solved by SVD. The singular value decomposition of A gives A = U Σ V^T, from which the pseudo-inverse of A is computed as A⁺ = V Σ⁺ U^T; the linear least-squares solution of formula (18) is then:

x_opt=A⁺·b (19)
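The linearized solve can be sketched as below. Formulas (12) to (18) are not reproduced in this text, so the layout of A and b is an assumption based on the usual small-angle linearization (sin θ ≈ θ, cos θ ≈ 1): each row of A is the cross product of the model point with the scene normal followed by the normal, and b holds the current point-to-plane offsets. The unknown is x = (α, β, γ, t_x, t_y, t_z), and all function names are hypothetical.

```python
import numpy as np

def small_angle_transform(x):
    """Assumed linearized form of M for small rotation angles."""
    a, b, g, tx, ty, tz = x
    return np.array([[1.0, -g, b, tx],
                     [g, 1.0, -a, ty],
                     [-b, a, 1.0, tz],
                     [0.0, 0.0, 0.0, 1.0]])

def linearized_point_to_plane_step(model_pts, scene_pts, scene_normals):
    """Least-squares x minimizing ||A x - b|| via the SVD pseudo-inverse."""
    A = np.hstack([np.cross(model_pts, scene_normals), scene_normals])
    b = np.einsum("ij,ij->i", scene_pts - model_pts, scene_normals)
    U, sigma, Vt = np.linalg.svd(A, full_matrices=False)   # A = U diag(sigma) Vt
    tol = sigma.max() * max(A.shape) * np.finfo(float).eps
    sigma_inv = np.zeros_like(sigma)
    keep = sigma > tol
    sigma_inv[keep] = 1.0 / sigma[keep]                    # Sigma^+
    return Vt.T @ (sigma_inv * (U.T @ b))                  # x = V Sigma^+ U^T b

# Synthetic check: the model is the scene moved by a known small transform.
rng = np.random.default_rng(0)
scene_pts = rng.uniform(-1.0, 1.0, size=(50, 3))
normals = rng.normal(size=(50, 3))
normals /= np.linalg.norm(normals, axis=1, keepdims=True)
true_x = np.array([0.002, -0.003, 0.001, 0.01, 0.02, -0.01])
M_true = small_angle_transform(true_x)
model_pts = (scene_pts - M_true[:3, 3]) @ np.linalg.inv(M_true[:3, :3]).T
x_opt = linearized_point_to_plane_step(model_pts, scene_pts, normals)
```

The synthetic data is generated with the linearized matrix itself, so `x_opt` reproduces `true_x` up to rounding. With a true rotation the recovered parameters are only approximate, and the step is repeated after updating the model points, which is why the method relies on a good initial pose.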

This completes the second correction of the initial estimate of the spatial pose of the three-dimensional object; x_opt is the final recognition result of the spatial pose.

In conclusion, the invention has the following beneficial effects:

1) Comparison is done by table lookup in the hash description, which is very fast;

2) The respective advantages of the hash description and the point-to-tangent-plane iterative closest point method are combined and the shortcomings of each are compensated, so that the originally slow point-to-tangent-plane method is accelerated through linearization.

Brief description of the drawings

In order to explain the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed in the description of the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from them without creative effort.

Fig. 1 is a schematic diagram of the measured three-dimensional scene data and the extracted target point cloud model according to an embodiment of the present invention;

Fig. 2 is a schematic diagram of the point pair feature of a three-dimensional point cloud according to an embodiment of the present invention;

Fig. 3 is a flowchart of generating the hash description from point pair features according to an embodiment of the present invention;

Fig. 4 is a schematic diagram of the world unit coordinate system W according to an embodiment of the present invention;

Fig. 5 is a flowchart of selecting the optimal global Hough vote according to an embodiment of the present invention;

Fig. 6 is a model diagram of the point-to-tangent-plane iterative closest point algorithm according to an embodiment of the present invention;

Fig. 7 shows the object to be detected in an example of an embodiment of the present invention;

Fig. 8 shows the result of coarse localization by the hash description in an example of an embodiment of the present invention;

Fig. 9 shows the result after correction by iterative closest point in an example of an embodiment of the present invention.

Detailed description of the embodiments

In the following detailed description, many specific details are set forth to provide a thorough understanding of the present invention. It will be apparent to those skilled in the art, however, that the present invention can be practiced without some of these details. The following description of the embodiments is intended only to provide a better understanding of the invention by showing examples of it.

The technical solutions of the embodiments of the present invention are described below with reference to the drawings.

Embodiment:

In step S4, the coordinates of the scene data are defined in the depth sensor frame, as shown in Fig. 1, and each data point has coordinates (x, y, z). The number of occurrences of each z value over all data is counted, and the data having the most frequent z value are removed, thereby obtaining the target model M of the object to be detected.

Step S5 obtains the normal vector of each three-dimensional data point in the target model M. The target model is M = {m₀, m₁, m₂, ..., m_(k-1), m_k}. The normal vector of each three-dimensional data point can be computed from the neighboring points around it. To compute the normal vector of a point, the covariance matrix between that point and its neighbors must first be obtained. Suppose the normal vector of point m_i is sought; the covariance matrix C is computed as shown in formula (1), where R denotes the radius of the spherical region centered at m_i and the distance values d_i (i ∈ {1, 2, ..., k}) denote the Euclidean distance between a neighboring point m_j and the center point m_i. Formula (1) gives the covariance matrix C of point m_i and all points within radius R:

As shown in Fig. 2, in step S6 the normal vector of each point is used to compute the point pair features. For a point p_i in the three-dimensional point cloud and any other point p_j, the two points form a point pair, and the normal vectors and the Euclidean distance of the two points form the point pair feature of that pair, computed as shown in formula (2):

The feature is built from the direction of the line connecting the two points, the Euclidean length of that line, the angle between the normal vector of p_i and the line direction, and the angle between the normal vectors of the two points. The angle between two vectors is computed as shown in formula (3):

In step S7, all point pair features of the target model M are converted into hash values to generate the hash description of the target model M. For any point pair (p_i, p_j) ∈ p in a point set p, the point pair feature is expressed as in formula (2), and formula (4) converts the point pair feature into a hash value:

Here d_dist denotes the distance sampling step and d_angle the angle sampling step, with d_angle = 2π/N_angle, where N_angle is the angle granularity, a value set by the user. The larger N_angle is, the smaller d_angle becomes, that is, the finer the discrimination between two angles. N_angle is usually set to 30 and can be adjusted by the user according to the actual results. If the sampling angle d_angle is 12°, angles that fall within the interval from 0° to 12° are considered similar. The point pair features are discretized by d_dist and d_angle so that similar features are converted into the same feature value. After the point pair feature F is discretized, the hash value of the point pair feature F(p₁, p₂)_discretized = F(f₁, f₂, f₃, f₄) is given by formula (5):

Index=P₁*f₁+P₂*f₂+P₃*f₃+f₄ (5)

P₁, P₂ and P₃ in formula (5) are three distinct prime numbers, chosen so that the hash values obtained after conversion are unique, for example so that F(1,2,3,4) and F(4,3,2,1) yield different hash values. The hash values of all point pairs are computed, and point pairs with the same hash value are stored together in a hash table for fast access. All hash values together form the hash table, which is called the hash description of the point cloud data, as shown in Fig. 3.

In step S8, Hough votes are generated using the hash description. Let the target model be M and the three-dimensional point cloud data set of the scene be S. The hash description of the target model point cloud is generated as in step S7. The point pairs of the scene data set S are converted into hash values and used to access the hash description generated from the target model M, which gives fast matching of similar features. The matching results are then converted into Hough votes for global Hough voting. The Hough vote of a matching result is defined as follows:

Some of the data points in S are chosen at random as reference points. Assume that the target to be detected is present in the scene and that a reference point s_r ∈ S lies exactly on its surface. Then there should be a point m_r ∈ M in the model point cloud that corresponds to s_r ∈ S. Suppose the model point cloud can translate with point m_r and rotate about the normal vector of m_r. If m_r is moved to the position of s_r and the normal vectors of m_r and s_r are made to coincide, the model point cloud only needs to be rotated by the angle α about the normal vector of s_r for the point cloud template to coincide with the scene target. In this way, by aligning the positions and normal vectors of the scene point s_r and the model point m_r, the translation and rotation that transform the point set M to the scene S can be represented by the point m_r in M and the angle α. (m_r, α) is then called the Hough vote of the scene reference point s_r in the point set M.

Suppose a point s_r in the scene lies exactly on the surface of the target object and forms a point pair with another point s_i on the target object surface. The point pair (s_r, s_i) is converted into a hash value and used to index the hash description of the target model M, which returns the point pairs (m_r, m_i) similar to (s_r, s_i). These point pairs (m_r, m_i) can then be used to compute the Hough votes of point s_r.

In step S9, the Hough votes are counted by the global Hough voting accumulator, the highest-scoring vote is taken as the optimal Hough vote of the scene S, and the initial detection result for the target model M is generated from it. In a Hough vote (m_r, α), m_r is obtained directly from (m_r, m_i), while the angle α is computed by formula (6). A world coordinate system W is defined, as shown in Fig. 4; the two transforms in formula (6) describe the translation that moves the points s_r and m_r to the origin of the world unit coordinate system and the rotation that aligns the normal vectors of the two points with the principal axis x of the world unit coordinate system:

The optimal Hough vote is then selected by voting, as shown in Fig. 5. An n×m global Hough voting accumulator is established to collect the number of votes for each Hough vote. The rows of the two-dimensional accumulator correspond to the reference points selected from the model point cloud, so the number of rows n equals the number of reference points. The number of columns m depends on the angle granularity N_angle; when N_angle = 30 the voting accumulator has 30 columns;

In step S10, the initial detection result is corrected a second time using the iterative closest point algorithm. The initial detection result reflects the pose relationship between the target model M and the scene S. Because the initial detection result already satisfies the condition that the poses are close, the iterative closest point algorithm can be linearized, so that it runs faster while maintaining accuracy;

As shown in Fig. 6, the second correction of the initial detection result uses the point-to-tangent-plane iterative closest point algorithm. For this algorithm, the optimization objective of the error function is to minimize the mean squared distance from the model points to the tangent planes of the corresponding scene points;

The error function is shown in formula (8):

M=T (t_x,t_y,t_z)·R(α,β,γ) (9)

Wherein:

The entries in formula (11) are:

r₁₁=cos γ cos β,

r₁₂=-sin γ cos α+cos γ sin β sin α,

r₁₃=sin γ sin α+cos γ sin β cos α,

r₂₁=sin γ cos β,

r₂₂=cos γ cos α+sin γ sin β sin α,

r₂₃=-cos γ sin α+sin γ sin β cos α,

r₃₁=-sin β

r₃₂=cos β sin α

r₃₃=cos β cos α

Therefore, the transformation matrix M can be expressed as:

Formula (8) can then be rewritten as:

Wherein:

Formula (18) is solved by SVD. The singular value decomposition of A gives A = U Σ V^T, from which the pseudo-inverse of A is computed as A⁺ = V Σ⁺ U^T; the linear least-squares solution of formula (18) is then:

x_opt=A⁺·b (19)

The positioning accuracy is high because the algorithm uses the hash description method. The hash table of the target model M is built before the algorithm runs; during the run phase only the hash features of the scene need to be computed, and the hash table of the target model M is then looked up directly to find similar features quickly and thereby locate the object.

In practice, Fig. 7 shows a scene to be measured and Fig. 8 shows the result of matching by the hash description. A certain error is visible, because d_dist and N_angle are used for discretization when the hash description is computed, which loses precision. Looking up the hash table nevertheless achieves fast coarse localization, that is, the result quickly reaches the neighborhood of the target position. A second correction is then performed by iterative closest point; Fig. 9 shows the result after this correction. Moreover, since the hash description method is known to locate the approximate position of the target as in Fig. 8, the point-to-tangent-plane iterative closest point algorithm can be used.

It should be noted that there is also a point-to-point iterative closest point method, which is the most common variant. In general, the point-to-point method is faster than the point-to-tangent-plane method. This is because, before localization, the target model M is usually placed at some position in the scene that is a certain distance away from the position to be located, and the point-to-point method omits the step of computing the tangent plane of each point that the point-to-tangent-plane method requires.

Because this patent uses the hash description, the model can already be located near the target position, so the nonlinear part of the point-to-tangent-plane iterative closest point method can be linearized. The possibility of linearization is a distinctive property of the point-to-tangent-plane method. In effect, this patent merges the two algorithms so that the advantages of both are used and each compensates for the other's shortcomings.

The advantage of the hash description is that localization is very fast; its disadvantage is that localization accuracy is limited, because the discretization step loses precision.

The advantage of the point-to-tangent-plane iterative closest point method is that when the initial position is close to the true position, as shown in Fig. 8, the nonlinear part of the algorithm can be linearized; its disadvantage is that localization is very slow if the initial position is far from the true position.

It can be seen that the high positioning accuracy comes from the second correction, by iterative closest point, of the localization result given by the hash description. The high speed comes from two points:

1) Comparison is done by table lookup in the hash description, which is very fast;

2) The hash description and the point-to-tangent-plane iterative closest point method are used together, combining the advantages of the two methods and compensating for the shortcomings of each, so that the originally slow point-to-tangent-plane method is accelerated through linearization.

The above embodiments are intended only to illustrate the technical solutions of the present invention and not to limit its scope of protection. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained from these embodiments by those of ordinary skill in the art without creative effort fall within the scope of protection of the present invention.

Although the present invention has been described in detail with reference to the above embodiments, those of ordinary skill in the art can still, where no conflict arises and without creative effort, combine, add, remove or otherwise adjust the features of the embodiments according to circumstances to obtain other technical solutions that differ in form but do not depart in substance from the concept of the present invention. Such technical solutions likewise fall within the scope of protection of the present invention.

Claims

1. a kind of three-dimension object detection method based on Hash description and iteration closest approach, which is characterized in that including depth camera And PC machine, it is further comprising the steps of:

S2, the depth camera emits infrared rays into the scene, and the infrared receiver in the depth camera receives the near-infrared reflection from the scene and generates the three-dimensional point data of the scene, i.e., the XYZ information of each reflection point in the scene relative to the depth camera coordinate system;

S8, the scene S is processed using the depth camera and the PC to obtain the hash values of scene S; the hash description of the target model M is accessed to quickly find matching similar point pairs, and the matching results are converted into Hough vote values;

S9, the Hough values are tallied by a global Hough voting accumulator, the Hough value with the highest score is taken as the optimal Hough value of scene S, and the first detection result for the target model M is generated from the optimal Hough value;

S10, a secondary correction is applied to the first detection result by the iterative closest point algorithm to obtain an accurate object pose detection result.

2. The three-dimensional object detection method based on hash description and iterative closest point according to claim 1, characterized in that, in step S4, the coordinates of the scene data are defined in the depth sensor frame, the coordinate value of each data point being (x, y, z); the number of occurrences of each z value over all data is counted, and the data with the most frequent z value are removed, thereby obtaining the target model M of the object to be detected.
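The filtering in step S4 can be sketched as follows. This is a minimal illustration in Python, not the patented implementation: the function name `remove_dominant_depth` and the rounding of z to a fixed number of decimals are assumptions (raw floating-point depths rarely repeat exactly, so some binning is needed before counting).

```python
from collections import Counter

def remove_dominant_depth(points, decimals=3):
    """Drop every point whose (rounded) z value is the most frequent one."""
    # Assumption: z values are binned by rounding before they are counted.
    z_keys = [round(p[2], decimals) for p in points]
    dominant_z, _ = Counter(z_keys).most_common(1)[0]
    return [p for p, z in zip(points, z_keys) if z != dominant_z]

# Toy scene: four points on a flat support plane at z = 1.0, two object points.
scene = [
    (0.0, 0.0, 1.0), (0.1, 0.0, 1.0), (0.0, 0.1, 1.0), (0.1, 0.1, 1.0),
    (0.05, 0.05, 0.8), (0.06, 0.05, 0.82),
]
model_points = remove_dominant_depth(scene)
```

In this toy scene the support plane at z = 1.0 is the most frequent depth, so only the two object points remain.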

3. The three-dimensional object detection method based on hash description and iterative closest point according to claim 1, characterized in that step S5 obtains the normal vector n_i of each three-dimensional data point in the target model M. For the target model M = {m₀, m₁, m₂, ..., m_k-1, m_k}, the normal vector of each three-dimensional data point can be computed from the neighboring points around it. To compute the normal vector of a point, the covariance matrix between the point and its neighbors must first be obtained. Suppose the normal vector n_i of point m_i is required; the covariance matrix C is computed as shown in formula (1), where R is the radius of the spherical volume centered on m_i and the distance value d_i (i ∈ {1, 2, ..., k}) is the Euclidean distance between a neighboring point m_j and the center point m_i. Formula (1) gives the covariance matrix C of m_i and all points within radius R:

The normal vector of m_i can be computed from the covariance matrix: eigenvalue decomposition of C yields its eigenvalues and eigenvectors. For point cloud data on an object surface, the spatial distribution varies least along the normal direction, so the normal vector n_i of the center point m_i is the eigenvector corresponding to the smallest eigenvalue of C. Proceeding in the same way yields the normal vectors of all points.
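A minimal Python sketch of this normal estimation, assuming NumPy is available. Formula (1) is not reproduced in this text, so the weighting w = R - d used below (closer neighbors count more) is an assumption, as are the function and variable names.

```python
import numpy as np

def estimate_normal(center, neighbors, radius):
    """Unit normal at `center` from the neighbors lying within `radius`."""
    center = np.asarray(center, dtype=float)
    diffs = np.asarray(neighbors, dtype=float) - center
    dists = np.linalg.norm(diffs, axis=1)
    keep = dists <= radius
    diffs, dists = diffs[keep], dists[keep]
    # Assumed weighting (not taken from formula (1)): w = R - d.
    weights = radius - dists
    # Weighted covariance: sum_j w_j * (m_j - m_i)(m_j - m_i)^T / sum_j w_j
    cov = np.einsum("i,ij,ik->jk", weights, diffs, diffs) / weights.sum()
    # eigh returns eigenvalues in ascending order for symmetric matrices,
    # so column 0 is the eigenvector of the smallest eigenvalue.
    _, eigvecs = np.linalg.eigh(cov)
    return eigvecs[:, 0]

# Neighbors lying in the plane z = 0: the normal should be +/- the z axis.
plane = [(0.1, 0, 0), (0, 0.1, 0), (-0.1, 0, 0), (0, -0.1, 0), (0.05, 0.05, 0)]
normal = estimate_normal((0, 0, 0), plane, radius=0.5)
```

The sign of the returned normal is arbitrary; a real pipeline would usually orient it toward the sensor.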

4. The three-dimensional object detection method based on hash description and iterative closest point according to claim 1, characterized in that, in step S6, the normal vector of each point is used to compute point pair features. A point p_i in the three-dimensional point cloud data and any other point p_j form a point pair, and the normal vectors and the Euclidean distance of the two points form the point pair feature of that pair, computed as shown in formula (2):

where d denotes the direction of the line connecting the two points, ‖d‖ the Euclidean distance along that line, ∠(n_i, d) the angle between the normal vector n_i of point p_i and the line direction d, and ∠(n_i, n_j) the angle between the normal vectors of the two points. The angle between two vectors is computed as shown in formula (3):
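The point pair feature described above can be sketched in Python as follows. Formulas (2) and (3) are not reproduced in this text, so three things are assumptions: the direction d = p_j - p_i, the third component ∠(n_j, d) (implied by the four-component feature used in formula (5)), and the arccos form of the vector angle. Function names are illustrative.

```python
import math

def angle_between(a, b):
    """Angle in [0, pi] between two 3D vectors via the normalized dot product."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    # Clamp to guard against rounding slightly outside [-1, 1].
    return math.acos(max(-1.0, min(1.0, dot / norm)))

def point_pair_feature(p_i, n_i, p_j, n_j):
    """Four-component feature: (distance, angle(n_i, d), angle(n_j, d), angle(n_i, n_j))."""
    d = tuple(b - a for a, b in zip(p_i, p_j))  # assumed direction: p_j - p_i
    dist = math.sqrt(sum(x * x for x in d))
    return (dist, angle_between(n_i, d), angle_between(n_j, d), angle_between(n_i, n_j))

# Two points one unit apart on a flat surface whose normals both point along z.
feature = point_pair_feature((0, 0, 0), (0, 0, 1), (1, 0, 0), (0, 0, 1))
```

For this pair the distance is 1, both normals are perpendicular to the connecting line, and the normals are parallel to each other.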

5. The three-dimensional object detection method based on hash description and iterative closest point according to claim 1, characterized in that, in step S7, all point pair features of the target model M are converted into hash values to generate the hash description of the target model M. For any point pair (p_i, p_j) ∈ P in the point set P, the point pair feature is expressed as shown in formula (2), and formula (4) is used to convert the point pair feature into a hash value:

where d_dist denotes the distance sampling step and d_angle the angle sampling step, d_angle = 2π/N_angle, with N_angle the angular granularity. The user can adjust N_angle according to actual results; it can usually be set to 30. The point pair features are discretized by d_dist and d_angle so that similar features are converted into identical feature values. After the point pair feature F is discretized, the hash value of F(p₁,p₂)_discretized = F(f₁,f₂,f₃,f₄) is obtained by formula (5):

Index=P₁*f₁+P₂*f₂+P₃*f₃+f₄ (5)

P₁, P₂, P₃ in formula (5) are three different prime numbers, chosen to make the converted hash values unique. The hash values of all point pairs are computed; point pairs with the same hash value can then be accessed together through the hash table. All hash values form a hash table, which is called the hash description of the point cloud data.
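Building the hash description can be sketched as below. Formula (4) is not reproduced in this text, so floor division by d_dist and d_angle is an assumed discretization; the three primes are arbitrary placeholders, and the function names and the toy feature values are hypothetical. Only the index expression follows formula (5) directly.

```python
import math
from collections import defaultdict

def discretize(feature, d_dist, n_angle=30):
    """Quantize (distance, angle, angle, angle) into integer bins (assumed form of formula (4))."""
    d_angle = 2 * math.pi / n_angle
    dist, a1, a2, a3 = feature
    return (int(dist // d_dist), int(a1 // d_angle), int(a2 // d_angle), int(a3 // d_angle))

def hash_index(f, primes=(7919, 104729, 1299709)):
    """Formula (5): Index = P1*f1 + P2*f2 + P3*f3 + f4 (the primes here are placeholders)."""
    p1, p2, p3 = primes
    return p1 * f[0] + p2 * f[1] + p3 * f[2] + f[3]

def build_hash_description(pair_features, d_dist, n_angle=30):
    """Map each hash value to the list of point pairs that share it."""
    table = defaultdict(list)
    for pair_id, feature in pair_features.items():
        table[hash_index(discretize(feature, d_dist, n_angle))].append(pair_id)
    return table

# Hypothetical features: the first two pairs are nearly identical, the third is not.
pair_features = {
    (0, 1): (0.105, 1.60, 1.55, 0.02),
    (2, 3): (0.108, 1.62, 1.57, 0.03),
    (0, 2): (0.300, 0.50, 2.00, 1.00),
}
table = build_hash_description(pair_features, d_dist=0.01)
```

The two similar pairs fall into the same bins and therefore share one table entry, which is what makes lookup of similar features a single dictionary access.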

6. The three-dimensional object detection method based on hash description and iterative closest point according to claim 1, characterized in that, in step S8, Hough vote values are generated using the hash description. Let the target model be M and the three-dimensional point cloud data set of the scene be S. Using the hash description of the target model point cloud data set generated according to step S5, the point pairs of the scene data set S are converted into hash values and used to access the hash description generated for the target model M, achieving fast matching of similar features; the matching results are then converted into Hough values for global Hough voting.

7. The three-dimensional object detection method based on hash description and iterative closest point according to claim 1, characterized in that, in step S9, the Hough values are tallied by the global Hough voting accumulator, and the Hough value with the highest score is taken as the optimal Hough value of scene S, from which the first detection result for the target model M is generated. In a Hough vote value (m_r, α), m_r is obtained directly from (m_r, m_i), while the angle α is computed by formula (6). Define the world coordinate system W; the two transformations in formula (6) describe, for s_r and m_r respectively, the translation that moves the point to the origin of the world coordinate system and the rotation that aligns its normal vector with the principal axis x of the world coordinate system:

Next, the optimal Hough vote value is selected by voting. An n×m global Hough voting accumulator is established to collect the votes for each Hough vote value. The rows of the two-dimensional accumulator correspond to the reference points selected from the model point cloud, so the number of rows n equals the number of reference points; the number of columns m depends on the angular granularity N_angle, and when N_angle = 30 the voting accumulator has 30 columns;

A scene reference point s_r can form point pairs with the other scene points s_i, and each scene point pair may find several similar model point pairs (m_r, m_i), which indicate where (s_r, s_i) may lie in the target model M. Computing the angle α between the scene point pair (s_r, s_i) and a similar model point pair (m_r, m_i) yields one of the Hough vote values (m_r, α) of scene point s_r, and the corresponding position in the voting accumulator is incremented by one. After the Hough vote values have been computed for all point pairs formed by the current scene reference point and the other scene points, the peak in the voting accumulator is taken as the optimal Hough vote value of the current scene point. Substituting the optimal Hough voting result into formula (7) gives the first pose detection result RigidTrans between the target model M and the scene S:
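The voting procedure above can be sketched as follows. Formula (6) for α is not reproduced in this text, so the sketch takes already computed (model reference index, α) votes as input; the function name `hough_vote`, the binning of α into N_angle columns of width 2π/N_angle, and the sample votes are assumptions made for illustration.

```python
import math

def hough_vote(n_refs, n_angle, votes):
    """Tally (model_ref_index, alpha) votes in an n_refs x n_angle accumulator."""
    acc = [[0] * n_angle for _ in range(n_refs)]
    d_angle = 2 * math.pi / n_angle
    for ref_idx, alpha in votes:
        # Wrap alpha into [0, 2*pi) and map it to its angle bin.
        col = int((alpha % (2 * math.pi)) / d_angle) % n_angle
        acc[ref_idx][col] += 1
    # The peak cell is the optimal Hough vote value (first one wins on ties).
    cells = ((r, c) for r in range(n_refs) for c in range(n_angle))
    best = max(cells, key=lambda rc: acc[rc[0]][rc[1]])
    return best, acc[best[0]][best[1]]

# Hypothetical votes for one scene reference point: three agree on model ref 2.
votes = [(2, 0.50), (2, 0.52), (0, 3.0), (2, 0.55), (1, 6.0)]
best_cell, best_count = hough_vote(n_refs=3, n_angle=30, votes=votes)
```

The row of the peak cell identifies the model reference point m_r and the column identifies the quantized angle α that would be passed on to formula (7).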

8. The three-dimensional object detection method based on hash description and iterative closest point according to claim 1, characterized in that, in step S10, a secondary correction is applied to the first detection result using the iterative closest point algorithm. The first detection result reflects the pose relationship between the target model M and the scene S. Because the first detection result already satisfies the condition that the reference poses are close, the iterative closest point algorithm can be linearized, so that it gains speed while maintaining accuracy;

The secondary correction of the first detection result uses the point-to-tangent-plane iterative closest point algorithm, whose optimization target is to minimize the mean squared error between the model points and the tangent planes at the corresponding scene points;

The error function is shown in formula (8):

In formula (8), m_i = (m_ix, m_iy, m_iz, 1)^T denotes a point in the model point set, s_i = (s_ix, s_iy, s_iz, 1)^T denotes the point corresponding to m_i in the target point cloud, and n_i = (n_ix, n_iy, n_iz, 0)^T is the normal vector of point s_i;

M is a 4×4 transformation matrix. As formula (8) shows, after the target model M is moved according to the transformation matrix M, the difference between the moved model point m_i and the scene point s_i gives a vector describing the displacement between them. This vector is then dotted with the normal vector n_i of point s_i, and the result of the dot product estimates the distance from one point to the tangent plane at the other. When all model points m_i have been moved by the matrix M such that the sum in formula (8) reaches its minimum, the M at that moment is M_opt;

Because the pose of the model point set is close to that of the scene point set, the point-to-tangent-plane iterative closest point algorithm can approximate the nonlinear problem by a linear one, which simplifies the mathematical form of the algorithm and makes it run faster;

The transformation matrix M, which describes displacement and rotation, consists of a rotation part R(α, β, γ) and a displacement part T(t_x, t_y, t_z), as shown in formula (9):

M=T (t_x,t_y,t_z)·R(α,β,γ) (9)

Wherein:

The entries in formula (11) are:

r₁₁=cos γ cos β,

r₁₂=-sin γ cos α+cos γ sin β sin α,

r₁₃=sin γ sin α+cos γ sin β cos α,

r₂₁=sin γ cos β,

r₂₂=cos γ cos α+sin γ sin β sin α,

r₂₃=-cos γ sin α+sin γ sin β cos α,

r₃₁=-sin β

r₃₂=cos β sin α

r₃₃=cos β cos α

R_x(α), R_y(β), R_z(γ) denote rotations about the x, y, and z axes respectively. By the small-angle equivalences of the trigonometric functions (sin θ ≈ θ and cos θ ≈ 1 as θ → 0), if the pose of the model point set is close to that of the scene point set, the trigonometric functions in R can be replaced by their equivalents, and the rotation part R of M can then be written in the form of formula (12):
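As a hedged reading of this step (the original formula (12) is not reproduced in this text), applying sin θ ≈ θ and cos θ ≈ 1 to the entries r₁₁ through r₃₃ listed above, and dropping products of two small angles such as βα, gives the following first-order form. For example, r₁₂ = -sin γ cos α + cos γ sin β sin α ≈ -γ + βα ≈ -γ.

```latex
\sin\theta \approx \theta,\quad \cos\theta \approx 1,\quad
\theta \in \{\alpha,\beta,\gamma\}
\;\Longrightarrow\;
R(\alpha,\beta,\gamma) \approx
\begin{pmatrix}
1 & -\gamma & \beta & 0\\
\gamma & 1 & -\alpha & 0\\
-\beta & \alpha & 1 & 0\\
0 & 0 & 0 & 1
\end{pmatrix}
```

With this approximation the unknowns α, β, γ enter the transformation linearly, which is what allows the error function to be reduced to a linear least-squares problem.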

Therefore, the transformation matrix M can be expressed as:

At this point, formula (8) can also be rewritten as:

Wherein:

For N corresponding point pairs, formula (15) yields N linear equations, which can be written in the form Ax - b, where:

Formula (18) is solved by SVD: decomposing A gives A = U Σ V^T, from which the pseudoinverse A⁺ = V Σ⁺ U^T is computed, and the linear least-squares solution of formula (18) is then:

x_opt=A⁺·b (19)

This completes the secondary correction of the initial estimate of the three-dimensional object's spatial pose; x_opt is the final recognition result of the three-dimensional object's spatial pose.
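One linearized point-to-tangent-plane step can be sketched in Python as below, assuming NumPy is available. Formulas (14) to (17) are not reproduced in this text, so the construction of A and b is an assumption based on the usual small-angle linearization: the residual (m_i + ω × m_i + t - s_i) · n_i with ω = (α, β, γ) gives the row [m_i × n_i, n_i] and the right-hand side n_i · (s_i - m_i). The ordering x = (α, β, γ, t_x, t_y, t_z), the function name, and the test data are likewise illustrative.

```python
import numpy as np

def linearized_point_to_plane_step(model_pts, scene_pts, scene_normals):
    """Solve min ||Ax - b||^2 for x = (alpha, beta, gamma, t_x, t_y, t_z)."""
    m = np.asarray(model_pts, dtype=float)
    s = np.asarray(scene_pts, dtype=float)
    n = np.asarray(scene_normals, dtype=float)
    # Assumed row layout: [(m_i x n_i), n_i] . x = n_i . (s_i - m_i)
    A = np.hstack([np.cross(m, n), n])
    b = np.einsum("ij,ij->i", n, s - m)
    # Pseudoinverse via SVD: A = U Sigma V^T, A+ = V Sigma+ U^T.
    U, sigma, Vt = np.linalg.svd(A, full_matrices=False)
    sigma_inv = np.zeros_like(sigma)
    nonzero = sigma > 1e-12 * sigma.max()
    sigma_inv[nonzero] = 1.0 / sigma[nonzero]
    return Vt.T @ (sigma_inv * (U.T @ b))  # x_opt = A+ . b, as in formula (19)

# Synthetic check: the scene is the model shifted by a small known translation.
model = np.array([(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 0), (0, 1, 1), (1, 0, 1)], float)
normals = np.array([(0, 0, 1), (1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 0, 0), (0, 1, 0)], float)
t_true = np.array([0.01, -0.02, 0.03])
x_opt = linearized_point_to_plane_step(model, model + t_true, normals)
```

In a full iterative closest point loop, this step would be repeated: the model is moved by the recovered transformation, correspondences are recomputed, and the system is solved again until the error stops decreasing.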