CN110276371A

CN110276371A - A kind of container angle recognition methods based on deep learning

Info

Publication number: CN110276371A
Application number: CN201910367932.2A
Authority: CN
Inventors: 高明煜; 叶健; 杨宇翔; 黄继业; 何志伟
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Dianzi University
Priority date: 2019-05-05
Filing date: 2019-05-05
Publication date: 2019-09-24
Anticipated expiration: 2039-05-05
Also published as: CN110276371B

Abstract

The container angle recognition methods based on deep learning that the present invention relates to a kind of.It needs that container angle is fixed using fork truck in actual industrial environment, needs to obtain the spatial positional information of corner fittings first.The system deployment that color camera is combined with depth camera is convenient and efficient, and precision is high, so it is most extensive to carry out corner fittings detection using visual information.It carries out Container inspection system using traditional vision algorithm to be difficult to extract effective feature, precision is lower, and speed is slower.The present invention carries out feature extraction by YOLO neural network, a kind of method for carrying out container angle detection using neural network and Feature Correspondence Algorithm is proposed, the deviation angle of container angle coordinate and container angle plane and camera plane can be obtained under different workplaces.

Description

A kind of container angle recognition methods based on deep learning

Technical field

The invention belongs to computer vision fields, and in particular to a kind of container angle identification side based on deep learning Method.

Background technique

The detection of target object is carried out by visual information, obtaining position of the target object in picture is computer vision An important topic.In the identification of container angle, primary is exactly the position for identifying container in picture, so Coordinate further according to container in space afterwards judges the deviation angle of container front side and camera plane, and on this basis The coordinate of container angle in space is judged, so the accuracy of target detection has significant effect subsequent step.Currently, The method of target detection mainly has: 1) based on the target detection of conventional method；2) based on the object detection method of study.

Traditional object detection method is all rapidly to carry out feature under the premise of guaranteeing abundant extraction, accurate feature It calculates and predicts.But the feature of traditional method for extracting is essentially all the feature of low level, artificial selection.These features are opposite It is more intuitive, it is easy by human intelligible, but a large amount of, multi-class targets cannot be expressed well, and accuracy rate is lower.

Object detection method based on study refers to using deep learning, from the high level of data focusing study target object Feature, these features have better ability to express.These methods are usually the information extracted in whole picture, are not merely used A part of feature of object, so the object detection method based on study not only makes the essence of target detection relative to conventional method Degree and speed are all greatly improved, and have better noiseproof feature, that is, robustness is stronger.

Summary of the invention

The container angle recognition methods based on deep learning that in view of the deficiencies of the prior art, the present invention proposes a kind of.

The present invention combine with color camera under scene high-resolution color picture and the collected depth map of depth camera, Intensity map, using YOLO-v3 neural network and Principal Component Analysis detect container angle in coordinate in the real world, with And the angle of plane where plane where corner fittings and camera.It comprises the concrete steps that:

Step (1) is denoted as C using the high-resolution colour picture that color camera obtains a container region, uses The depth map that depth camera obtains in container region is denoted as D, intensity map is denoted as I；

Step (2) uses the container and corner fittings in YOLO-v3 neural network sense colors picture；It is specific as follows:

1. color image is inputted in trained YOLO-v3 neural network；Neural network can automatically to input picture into Then row feature extraction is predicted coordinate of the target object in color image using the feature extracted, is finally passed through Multiple possible coordinates are screened, the maximum coordinate of possibility is exported, is accurate to pixel；The wherein detection mesh of YOLO-v3 It is designated as container and corner fittings, can finally export three as a result, being container and two corner fittings respectively；

2. judging whether network output is effective；Judge by the way that whether the coordinate to two corner fittings is located in container coordinate The correctness of web results is only correct just to enter in next step；Concrete mode is as follows:

Wherein, flag indicates whether result is effective, x_{box_min},x_{box_max},y_{box_min},y_{box_max}It is container respectively in coloured silk Transverse and longitudinal coordinate range in chromatic graph, i, j are the transverse and longitudinal coordinate of corner fittings respectively；

Container region in color image is converted to grayscale image by step (3), is denoted as G；It is detected using SURF operator Key point in grayscale image G and intensity map I, and both the description operator of two picture of FLANN algorithmic match is used, and is calculated pass The distance between key point；Threshold value is selected according to the codomain of distance, matched key point is then used as less than threshold value；To matched key Point carries out perspective transform, obtains transformation matrix H, and outline all match points in intensity map, depth is regarded as in this region Spend camera in container coordinate, and using be located at G in corner fittings position key point in I corresponding key point as the seat of corner fittings Mark；

Position of the step (4) according to container in depth map, extracts the three-dimensional coordinate data of container plane；Because Plane where the depth areas of the container of previous step not just corner fittings, it is therefore desirable to container angle plane location It is screened in domain；Specific screening step is as follows:

If the depth map coordinate of two corner fittings is respectively (x₁,y₁) and (x₂,y₂), then point (m, the n) Ying Fuhe in region with Lower condition:

The point for meeting conditions above is modeled, to obtain the angle of container angle plane and camera plane；Specifically Steps are as follows:

Areal model is established to plane:

Ax+By+Cz+D=0

Wherein A, B, C, D are the parameter of plane, and x, y, z is the coordinate value of Plane-point；

According to the confidence level figure of depth camera, the corresponding confidence level of each three-dimensional coordinate is compared with threshold value, when setting Reliability reaches requirement, is just included in calculating point and concentrated, on the contrary then abandon；

The parameter of plane normal vector is solved using Principal Component Analysis Algorithm, calculation method is as follows:

(a) covariance matrix of all the points in plane is solved；Formula are as follows:

Σ=E (hh^T)-E(h)E(hh^T)

(b) feature vector and characteristic value of covariance matrix are solved, wherein the corresponding feature vector of minimal eigenvalue is exactly The corresponding normal vector of plane, that is, parameter A, B, C in plane equation；

(c) all valid data in the three-dimensional coordinate data of container plane are substituted into equation, multiple D values is acquired, to D Value is averaging, and obtains an accurate D；

(d) corner fittings plane is being obtained after the equation of depth camera coordinate system, using the method in solid geometry, calculating The angle of vertical plane and corner fittings plane where depth camera out；It is as follows using formula:

Wherein, angle of the θ between two planes,WithRespectively normal vector and corner fittings of the camera perpendicular to floor The normal vector of plane；

Utilize step (1)-(4), the three-dimensional coordinate of the angle and corner fittings of plane and camera plane where obtaining corner fittings.

Beneficial effects of the present invention: the powerful capability of fitting of neural network is utilized in the method for the present invention, detects object The specific location of body in the picture, then using the color image of target object as sample, to find the correspondence of object in depth map Position avoids depth camera and the cumbersome conversion process of color camera, and obtained container angle position is accurate, plane included angle Estimation is accurate.

Specific implementation step

Step (1) obtains a high-resolution colour picture using color camera and is denoted as C, obtains packaging using depth camera Depth map in case region is denoted as D, intensity map is denoted as I.

Step (2) uses the container and corner fittings in YOLO-v3 neural network sense colors picture.It is specific as follows:

1. color image is inputted in trained YOLO neural network.Neural network can automatically carry out input picture special Sign is extracted, and is then predicted using the feature extracted coordinate of the target object in color image, finally by more A possible coordinate is screened, and is exported the maximum coordinate of possibility, is accurate to pixel.In the present invention, the detection target of YOLO For container and corner fittings, three results (being container and two corner fittings respectively) can be finally exported.

2. judging whether network output is effective.Judge by the way that whether the coordinate to two corner fittings is located in container coordinate The correctness of web results is only correct just to enter in next step.Concrete mode is as follows:

Wherein, flag indicates whether result is effective, x_{box_min},x_{box_max},y_{box_min},y_{box_max}It is container respectively in coloured silk Transverse and longitudinal coordinate range in chromatic graph, i, j are the transverse and longitudinal coordinate of corner fittings respectively.

Container region in color image is converted to grayscale image by step (3), is denoted as G.It is detected using SURF operator Key point in grayscale image G and intensity map I, and both the description operator of two picture of FLANN algorithmic match is used, and is calculated pass The distance between key point.Suitable threshold value is selected according to the codomain of distance, threshold value is used to screen matched key point.To key point Perspective transform is carried out, transformation matrix H, and all match points in intensity map center is obtained, depth is regarded as into this region The coordinate of container in camera, and using be located at G in corner fittings position key point in I corresponding key point as the seat of corner fittings Mark.

Position of the step (4) according to container in depth map extracts the three-dimensional coordinate data (point of container plane Cloud).Because of plane where the depth areas of the container of previous step not just corner fittings, it is therefore desirable to flat to container angle It is screened face region.Specific screening step is as follows:

The point for meeting conditions above is modeled, to obtain the angle of container angle plane and camera plane.Specifically Steps are as follows:

Areal model is established to plane:

Ax+By+Cz+D=0

Wherein A, B, C, D are the parameter of plane, and x, y, z is the coordinate value of Plane-point.

(a) covariance matrix of all the points in plane is solved.Formula are as follows:

Σ=E (xx^T)-E(x)E(x^T)

(b) feature vector and characteristic value of covariance matrix are solved, wherein the corresponding feature vector of minimal eigenvalue is exactly The corresponding normal vector of plane, that is, parameter A, B, C in plane equation.

(c) by cloud all valid data substitute into equation, acquire multiple D values, to D value be averaging, obtain one compared with For accurate D.

By Principal Component Analysis Algorithm, an available accurate plane equation, still, because depth camera is adopted There are much noises for the data collected, to plane equation accuracy there are larger impact, need to pre-process initial data. Method particularly includes:

According to the confidence level figure of depth camera, the corresponding confidence level of each three-dimensional coordinate is compared with threshold value, when setting Reliability reaches requirement, is just included in calculating point and concentrated, on the contrary then abandon.

(d) plane is being obtained after the equation of depth camera coordinate system, using the method in solid geometry, calculate depth The angle of vertical plane and corner fittings plane where spending camera.It is as follows using formula:

Wherein, angle of the θ between two planes,WithRespectively normal vector and corner fittings of the camera perpendicular to floor The normal vector of plane.

Utilize above 4 steps, the three-dimensional coordinate of the angle and corner fittings of plane and camera plane where corner fittings can be obtained.

Claims

1. a kind of container angle recognition methods based on deep learning, which is characterized in that this method comprises the concrete steps that:

Step (1) is denoted as C using the high-resolution colour picture that color camera obtains a container region, uses depth The depth map that camera obtains in container region is denoted as D, intensity map is denoted as I；

1. color image is inputted in trained YOLO-v3 neural network；Neural network can automatically carry out input picture special Sign is extracted, and is then predicted using the feature extracted coordinate of the target object in color image, finally by more A possible coordinate is screened, and is exported the maximum coordinate of possibility, is accurate to pixel；Wherein the detection target of YOLO-v3 is Container and corner fittings can finally export three as a result, being container and two corner fittings respectively；

2. judging whether network output is effective；Network is judged by the way that whether the coordinate to two corner fittings is located in container coordinate As a result correctness is only correct just to enter in next step；Concrete mode is as follows:

Wherein, flag indicates whether result is effective, x_{box_min},x_{box_max},y_{box_min},y_{box_max}It is container respectively in cromogram Transverse and longitudinal coordinate range, i, j are the transverse and longitudinal coordinate of corner fittings respectively；

Container region in color image is converted to grayscale image by step (3), is denoted as G；Gray scale is detected using SURF operator Scheme the key point in G and intensity map I, and use the description operator of two picture of FLANN algorithmic match, and calculates the two key point The distance between；Threshold value is selected according to the codomain of distance, matched key point is then used as less than threshold value；To matched crucial click-through Row perspective transform obtains transformation matrix H, and all match points are outlined in intensity map, and depth phase is regarded as in this region The coordinate of container in machine, and using be located at G in corner fittings position key point in I corresponding key point as the coordinate of corner fittings；

Position of the step (4) according to container in depth map, extracts the three-dimensional coordinate data of container plane；Because upper one The depth areas of the container of step plane not just where corner fittings, it is therefore desirable to container angle plane region into Row screening；Specific screening step is as follows:

If the depth map coordinate of two corner fittings is respectively (x₁,y₁) and (x₂,y₂), then the point (m, n) in region should meet following item Part:

The point for meeting conditions above is modeled, to obtain the angle of container angle plane and camera plane；Specific steps It is as follows:

Areal model is established to plane:

Ax+By+Cz+D=0

According to the confidence level figure of depth camera, the corresponding confidence level of each three-dimensional coordinate is compared with threshold value, works as confidence level Reach requirement, is just included in calculating point and concentrated, it is on the contrary then abandon；

Σ=E (hh^T)-E(h)E(hh^T)

(b) feature vector and characteristic value of covariance matrix are solved, wherein the corresponding feature vector of minimal eigenvalue is exactly plane Corresponding normal vector, that is, parameter A, B, C in plane equation；

(c) all valid data in the three-dimensional coordinate data of container plane are substituted into equation, acquires multiple D values, D value is asked It is average, obtain an accurate D；

(d) corner fittings plane is being obtained after the equation of depth camera coordinate system, using the method in solid geometry, calculate depth The angle of vertical plane and corner fittings plane where spending camera；It is as follows using formula:

Wherein, angle of the θ between two planes,WithRespectively normal vector and corner fittings plane of the camera perpendicular to floor Normal vector；