CN108550168A

CN108550168A - The multiple target quick high accuracy of Intelligentized mechanical arm view-based access control model identifies and localization method

Info

Publication number: CN108550168A
Application number: CN201810348592.4A
Authority: CN
Inventors: 谢美华; 邓立新
Original assignee: Hunan Niu Shun Technology Co Ltd
Current assignee: Hunan Niu Shun Technology Co Ltd
Priority date: 2018-04-18
Filing date: 2018-04-18
Publication date: 2018-09-18

Abstract

Identification and localization method of the Intelligentized mechanical arm to multiple target quick high accuracy that the present invention relates to a kind of for connecting firmly visual unit.It is realized in source images to the identification and positioning of the quick high accuracy of template image by template image and source images in the present invention.With the down-sampled method restored with laplacian pyramid of gaussian pyramid in the present invention, in the high-precision for improving identification with ensure that identification and positioning while locating speed.In addition the present invention proposes a kind of plane multi-targets recognition and localization method being combined with non-maxima suppression based on normalized crosscorrelation matching algorithm, this method may be implemented to identification and positioning while multiple targets in source images, and substantially increase robustness compared to tional identification and location algorithm.

Description

The multiple target quick high accuracy of Intelligentized mechanical arm view-based access control model identifies and localization method

Technical field

The invention belongs to computer vision fields, and in particular to for connecting firmly the Intelligentized mechanical arm of visual unit to more mesh Mark identification and the localization method of quick high accuracy.

Background technology

With the research and development of field of machine vision, more and more industrial robots are equipped with vision system, make machine Device people is accurately positioned technical aspect in autonomous and mechanical arm and is obviously improved.At the same time, panoramic vision and The successful application of image algorithm also considerably enhances the level of intelligence of mechanical arm, and the reality that mechanical arm captures object The standard that Shi Xing, Stability and veracity and mechanical arm carry out the level of intelligence of intelligent grabbing using vision guide embodies.It wants The level of intelligence for improving a series of actions made by mechanical arm, makes it have the function of Intelligent mobile and crawl, it is important to make Robot has environmental goals certain cognitive ability, determines the action of itself by perceiving environmental goals, it is made to be not required to transport Dynamic contact can realize the actions such as crawl to environmental goals, and therefore, the research of vision system has become raising robot automtion A more popular direction, even more emphasis studies a question among these for the identification of target and localization method.

Currently, in the research field, have some comparative maturities and the identification for putting into application and location algorithm, but work as Lower existing method is more for a certain concrete application, and there are larger limitations.Such as " the plane based on region division Workpiece identification and localization method " is only applicable in the workpiece with outline of straight line, is unable to get just when workpiece is irregular shape True recognition result；" statistical matrix marked region algorithm " reaches identifying purpose using the geometric properties of target itself, chooses matter Heart labelling method realizes the positioning of target object, this method some shaped objects in mixed and disorderly object for identification, and is not suitable for In identical multi-targets recognition and positioning；" feature based template simultaneously carries out template matches using principal component analysis (PCA) method Algorithm " realizes the identification and positioning of plurality of classes target, has certain robustness and improves efficiency, but is run in algorithm It is required for longer Offtime to learn template or feature before, cannot be satisfied the variability of engineering duty demand.Always For knot, existing target identification is primarily present problems with location algorithm：(1) algorithm based on gray scale or characteristic matching is only It can identify single goal；(2) recognition methods of multiple target needs longer Offtime to learn template or feature, is not suitable with The changeable actual conditions of engineering duty；(3) most of algorithms can ensure the accuracy of identification positioning, but speed is slower, nothing Method meets the requirement of real-time.

Invention content

It is an object of the invention to provide a kind of target identification of versatility to the Intelligentized mechanical arm comprising visual unit With localization method, quick to multiple target, high-accuracy, high robust identification and positioning are realized, be that Intelligentized mechanical arm is related The crawl of function, including but not limited to target, target movement, target are picked, and information support is provided.

Need to use the Intelligentized mechanical arm comprising visual unit, wherein visual unit (one camera) and machinery in the present invention Arm connects firmly installation, and mechanical arm can obtain the relative position of mechanical arm and target according to camera to the location information of target, to Target is implemented the functions such as to capture, move, pick.The relative position that installation refers to visual unit and mechanical arm is connected firmly to fix.It is real The multiple target quick high accuracy identification for applying view-based access control model needs to use template image and source images, wherein Prototype drawing with localization method Seem the image for the target that previously known needs identify, source images are to connect firmly visual unit by Intelligentized mechanical arm to shoot The image arrived.

High-precision quickly identification and positioning are carried out to multiple target in image with the method in the present invention, realized in source images Quick high accuracy identification and positioning to template image, include the following steps：

The first step, gaussian pyramid are down-sampled

Multiple target quick high accuracy identification in the present invention and location algorithm need template image and the camera unit shooting of target The source images arrived.Template image and source images are reduced into image size to improve image by the way that gaussian pyramid is down-sampled first Matching speed, gaussian pyramid obtain a series of down-sampled images by Gaussian smoothing and sub-sampling, and each layer is all in accordance under To upper serial number, layer G_i+1Indicate that size is less than layer G_iAdjacent layer.A given integer variable n and one-dimensional sampling sequence f (n), then the up-sampling sequence definition that base is 2 is as follows：

The definition of down-sampling that base is 2 operation is：

f_2↓(n)=f (2n) (2)

In general, the whole process time can be shortened to one third by carrying out a down-sampling, but it is down-sampled after The detailed information of some images can be lost.

Second step, normalized crosscorrelation matching

Obtain template image and source images it is down-sampled after, cross-correlation operation is normalized in the two.Normalized crosscorrelation Matching algorithm is a kind of template matching algorithm of classics, can be true by the cross correlation value of calculation template image and matching image Fixed matched degree, search window position when cross correlation value maximum determine position of the template image in image to be matched. NCC algorithms have very high accuracy and adaptability, and have robustness to the linear transformation of gray value of image, not by gray scale The influence of the linear transformation of value.Mathematical expression forms of the NCC algorithm response function R in the position coordinate (i, j) is shown below：

I (i+m-1, j+n-1) indicates source images in the pixel value of the position coordinate (i+m-1, j+n-1), T (m, n) expression Prototype drawings As the position coordinate (m, n) pixel value,Indicate template image and source images in corresponding template size region respectively The average value of interior pixel value, (M, N) are the size of template image.Source images and template image can be calculated by above formula A cross correlation value, it may be determined that matched degree, search window position when cross correlation value maximum determine that template image is waiting for With the position in image, but that be only limitted to identification single goal the case where.For the multi-targets recognition problem based on single mode plate, by In a series of influence of interference such as illumination, noise, need to find multiple targets just need to find it is all more than certain response Position, and remove and be presented as the pixel being sticked together in response image.

Third walks, non-maxima suppression

In a window, if there is the response of multiple pixels meets threshold condition, then retain response it is maximum that Pixel excludes other pixels.The pixel for meeting response lag condition is arranged according to response descending first in the present invention Row, then achieve the purpose that detect maximum using the dilation operation of image, are in addition provided with distance threshold in the process to full The pixel of sufficient condition is constrained, and to have the function that remove error hiding, and then avoids the uncertainty of single constraint.Hair The bright middle two pixel (x used₁,y₁) and (x₂,y₂) between the expression formula of distance be：

Rule of thumb it is found that when NCC receptance function values are more than 0.9, match condition is good.Expansive working window 18*18 is set, Distance threshold is chosen for that 0.9* template short sides are long, obtains response position of the template in source images, but the position is only one Point target will obtain complete target and need to restore it.

4th step, laplacian pyramid restore

Laplacian pyramid is to subtract first to reduce a series of images of the image amplified afterwards and constitute by source images, Ke Yili Solution is the inverse form of gaussian pyramid, the mathematical definition that i-th layer of laplacian pyramid：

G in formula_iIndicate the i-th tomographic image, UP operations are by pixel-map that position in source images is (x, y) to target image The position (2x+1,2y+1), symbolIndicate convolution algorithm, g_5×5Indicate the Gaussian kernel of 5*5.

Advantages of the present invention：

1. by the down-sampled size for reducing source images and template image of gaussian pyramid in the present invention, to substantially reduce The matched calculation amount of normalized crosscorrelation, again by laplacian pyramid pair after finally obtaining object recognition and detection result It detects obtained result to be restored, to improve identification and ensure that the high-precision of identification and positioning while locating speed Degree.

2. the present invention proposes a kind of more mesh of plane being combined with non-maxima suppression based on normalized crosscorrelation matching algorithm Mark not may be implemented with localization method, this method to identification and positioning while multiple targets in source images, and compared to Tional identification and location algorithm substantially increase robustness so that false-alarm, false retrieval, missing inspection probability all substantially reduce.

Specific implementation mode

It needs to obtain template image first in the present invention, and shoots and obtain source images, just by template image and source images Can through the invention in the identification of multiple target quick high accuracy identified in source images with localization method and orient all moulds Plate image object.It obtains providing Information base for Intelligentized mechanical arm subsequent operation after targeting information.More mesh in the present invention Mark quick high accuracy identification is as follows with localization method：

The first step, gaussian pyramid are down-sampled

Obtained template image and source images progress gaussian pyramid is down-sampled, and the specific down-sampled parameter of gaussian pyramid needs It is selected according to the demand in Practical Project situation；

Second step, normalized crosscorrelation matching

Cross Correlation Matching is normalized in the down-sampled result of template image and source images that the first step obtains, is obtained corresponding Cross correlation value；

Third walks, non-maxima suppression

For carrying out non-maxima suppression operation by the cross correlation value that normalized crosscorrelation matches in second step, weight is rejected Reinspection, which is surveyed, improves efficiency of algorithm, while constraints removal error hiding is arranged；

4th step, laplacian pyramid restore

It walks to have obtained the response position of the template image in source images by third, then is answered by laplacian pyramid Original finally obtains template image identification and positioning result in source images.

Claims

1. the multiple target quick high accuracy of Intelligentized mechanical arm view-based access control model identifies and localization method, realize in source images to mould The identification and positioning of the quick high accuracy of plate image, wherein template image are the images for the target that previously known needs identify, Source images are the images that the visual unit that Intelligentized mechanical arm connects firmly is shot, which is characterized in that are included the following steps：

The first step, gaussian pyramid are down-sampled

It is first that template image and source images is down-sampled fast to improve images match to reduce image size by gaussian pyramid Degree, gaussian pyramid obtain a series of down-sampled images by Gaussian smoothing and sub-sampling, and each layer is all in accordance with from top to bottom Serial number, layer G_i+1Indicate that size is less than layer G_iAdjacent layer.An integer variable n and one-dimensional sampling sequence f (n) are given, then The up-sampling sequence definition that base is 2 is as follows：

The definition of down-sampling that base is 2 operation is：

f_2↓(n)=f (2n) (2)

Second step, normalized crosscorrelation matching

To obtain it is down-sampled after template image and source images be normalized, the template image T (m, n) after being normalized With source images I (m, n), calculate the cross correlation value of T (m, n) and I (m, n) by cross correlation algorithm, receptance function R coordinate (i, J) the mathematical expression form of position is shown below：

I (i+m-1, j+n-1) indicates source images in the pixel value of the position coordinate (i+m-1, j+n-1), T (m, n) expression Prototype drawings As the position coordinate (m, n) pixel value,Indicate template image and source images in corresponding template size region respectively The average value of interior pixel value, (M, N) are the size of template image；

Third walks, non-maxima suppression

Pixel for meeting response lag condition in the response image that is obtained by cross correlation algorithm is dropped according to response Sequence arranges, and then achievees the purpose that detect maximum using the dilation operation of image, is in addition provided with distance threshold in the process Pixel to meeting condition constrains, and to have the function that remove error hiding, and then avoids the uncertain of single constraint Property, two pixel (x₁,y₁) and (x₂,y₂) between the expression formula of distance be：

4th step, laplacian pyramid restore

The response image obtained in being walked to third carries out laplacian pyramid and restores to obtain Prototype drawing in final source images The testing result of picture, the mathematical definition that i-th layer of laplacian pyramid：

G in formula_iIndicate the i-th tomographic image, UP operations are by (the 2x of pixel-map that position in source images is (x, y) to target image + 1,2y+1) position, symbolIndicate convolution algorithm, g_5×5Indicate the Gaussian kernel of 5*5.