CN106886216B - Robot automatic tracking method and system based on RGBD face detection - Google Patents

Robot automatic tracking method and system based on RGBD face detection

Info

Publication number
CN106886216B
CN106886216B (granted from application CN201710028570.5A)
Authority
CN
China
Prior art keywords
image
classifier
face
human body
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710028570.5A
Other languages
Chinese (zh)
Other versions
CN106886216A (en)
Inventor
陈东伟 (Chen Dongwei)
Current Assignee
Shenzhen Qianhai Yongyida Robot Co ltd
Original Assignee
Shenzhen Qianhai Yongyida Robot Co ltd
Priority date
Filing date
Publication date
Application filed by Shenzhen Qianhai Yongyida Robot Co ltd filed Critical Shenzhen Qianhai Yongyida Robot Co ltd
Priority to CN201710028570.5A priority Critical patent/CN106886216B/en
Publication of CN106886216A publication Critical patent/CN106886216A/en
Application granted granted Critical
Publication of CN106886216B publication Critical patent/CN106886216B/en

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05D SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00 Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/02 Control of position or course in two dimensions
    • G05D1/021 Control of position or course in two dimensions specially adapted to land vehicles
    • G05D1/0231 Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means
    • G05D1/0246 Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means using a video camera in combination with image processing means
    • G05D1/0253 Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means using a video camera in combination with image processing means extracting relative motion information from a plurality of images taken successively, e.g. visual odometry, optical flow
    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05D SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00 Control of position, course or altitude of land, water, air, or space vehicles, e.g. automatic pilot
    • G05D1/02 Control of position or course in two dimensions
    • G05D1/021 Control of position or course in two dimensions specially adapted to land vehicles
    • G05D1/0212 Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory
    • G05D1/0225 Control of position or course in two dimensions specially adapted to land vehicles with means for defining a desired trajectory involving docking at a fixed facility, e.g. base station or loading bay

Abstract

The invention relates to a robot automatic tracking method and system based on RGBD face detection. The system comprises an RGBD camera, an image algorithm processing platform, and a robot motion control unit. The RGBD camera collects RGB images and depth images of a human body, which are used to recognize the upper body and the face. The image algorithm processing platform receives the images acquired by the RGBD camera, performs face recognition through algorithmic processing, and calculates the distance between the robot and the face. The robot motion control unit obtains the longitudinal (forward/backward) distance of the face from the image algorithm processing platform and, according to the change in this distance, controls the robot to move as the face moves. The invention solves the problems of face-tracking accuracy and accurate longitudinal distance measurement of the face, so that the robot follows the face more accurately and behaves in a more human-like way.

Description

Robot automatic tracking method and system based on RGBD face detection
Technical Field
The invention relates to the technical field of robot control, in particular to an automatic robot tracking method and system based on RGBD face detection.
Background
With the development of computer science and automatic control technology, more and more intelligent robots appear in production and daily life, and the vision system, as an important subsystem of an intelligent robot, is receiving increasing attention. An intelligent robot vision system takes human intelligent behavior as its blueprint and supports the robot in environment perception, adaptation to complex scenes, interaction and cooperation, concept formation and integration, knowledge acquisition and reasoning, autonomous cognition and high-level decision making, and human-like intelligent behavior, advancing the intelligent development of robots. Face detection is an integral part of this intelligence: it enables fast and accurate detection and tracking of the human face, giving people a better experience when communicating with the robot.
The Chinese patent CN103093212A discloses a method and device for capturing face images based on face detection and tracking. The system comprises a detection module, a tracking module, a judgment module and a capture module. A cascade classifier performs face detection on the image to be detected; when a face target is detected, a mean-shift tracking algorithm tracks it. When the face target leaves the detection area, the method judges, frame by frame and according to target position, whether face detection and face tracking in the same frame correspond to the same face target, and selects the frames in which they do. For each selected frame it calculates the overlap between the face-detection window and the face-tracking window, and takes the face image detected in the frame with the maximum overlap as the captured face image. The disadvantages of that invention are that the cascade classifier is used alone, so its false-detection rate is high and it performs poorly in complex environments, and that the 2D camera used provides no image depth information, so when the face moves forward or backward only a rough estimate of scale is possible and the actual distance moved cannot be measured.
The Chinese patent CN101477616 discloses a face detection and tracking method, executed by a computer or a microprocessor with computing capability, which identifies faces and their positions in an image. Face detection is applied to track the face in each frame and record its position; after several frames, face detection is run once more on the image, using the previously found face positions, so that any newly appearing faces can be found quickly.
Accordingly, there is a need in the art for improvements.
Disclosure of Invention
The invention discloses a robot automatic tracking method and system based on RGBD face detection, which enable a robot to follow a human face with higher accuracy and more human-like behavior.
In one aspect, the robot automatic tracking method based on RGBD face detection provided by an embodiment of the present invention includes:
acquiring human body sample image data, and evaluating the technical indicators of a classifier on the human body sample images;
using a cascade classifier to detect the upper body of the human, performing numerical analysis with the depth image data, and confirming the detected human body image;
using the cascade classifier, together with the image depth information, to perform face detection on the image in which the upper body was detected, until a face is detected, and confirming the detected face image, wherein the cascade classifier is formed by connecting several strong classifiers in series, each strong classifier being a weighted combination of several weak classifiers;
after the face area is detected, using the control module to extract the corresponding area from the depth image and perform the distance calculation, obtaining the accurate distance between the person and the robot.
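The four steps above can be sketched as a single tracking iteration. This is a minimal illustration only: the helper names (`detect_upper_body`, `detect_face`, `region_distance`) are hypothetical stand-ins, not the patent's implementation, and the depth map is assumed to hold raw values in millimeters.

```python
def region_distance(depth, region):
    """Average depth (in meters) over a rectangular region of a depth map
    whose raw pixel values are assumed to be millimeters."""
    x, y, w, h = region
    values = [depth[r][c] for r in range(y, y + h) for c in range(x, x + w)]
    return sum(values) / len(values) / 1000.0

def track_once(rgb, depth, detect_upper_body, detect_face):
    """One iteration: upper-body detection -> face detection -> distance.
    The detector callables return a (x, y, w, h) rectangle or None."""
    body = detect_upper_body(rgb)
    if body is None:
        return None                    # no person in view
    face = detect_face(rgb, body)
    if face is None:
        return None                    # upper body found, but no face yet
    return region_distance(depth, face)
```

The returned distance is what the motion control unit would consume to follow the person.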
In another embodiment of the robot automatic tracking method based on RGBD face detection according to the present invention, acquiring the human body sample image data comprises the following steps:
preparing the training data and making the human body image training samples, which comprise negative samples and positive samples; a negative sample is an image that does not contain the target object, and a positive sample is an image of the object to be detected and must not contain any negative-sample content;
testing the classifier with images of the finished training samples, and analyzing the classifier's technical indicators from the test results, the indicators comprising: the detection success rate and false-detection rate for upper-body images, and the success rate and false-detection rate of face detection;
adjusting the technical parameters of the classifier, the adjustable parameters comprising: the number of positive samples used during training, the number of negative samples used to train each stage of the classifier, the number of stages, the haar feature type, and the minimum detection rate required of each stage.
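The adjustable parameters listed above map naturally onto OpenCV's cascade-training tool. The patent does not name its training toolchain, so this is only a hedged sketch: the flag names below are OpenCV's `opencv_traincascade` flags, and the numeric values and file names are purely illustrative.

```python
# Illustrative mapping of the patent's tunable parameters onto
# opencv_traincascade command-line flags (values are examples only).
params = {
    "-numPos": 2000,         # positive samples used during training
    "-numNeg": 4000,         # negative samples used per classifier stage
    "-numStages": 10,        # number of cascade stages
    "-featureType": "HAAR",  # haar feature type
    "-minHitRate": 0.995,    # minimum detection rate required of each stage
}

# model_dir, pos.vec and neg.txt are placeholder paths.
cmd = ["opencv_traincascade", "-data", "model_dir",
       "-vec", "pos.vec", "-bg", "neg.txt"]
for flag, value in params.items():
    cmd += [flag, str(value)]
```

The resulting `cmd` list could be handed to `subprocess.run` once real sample files exist.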
In another embodiment of the robot automatic tracking method based on RGBD face detection according to the present invention, detecting the upper body of the human with the classifier, performing numerical analysis with the depth image data, and confirming the detected human body image comprises:
first, converting the RGB image to gray scale, turning the color image into a gray-scale image with levels 0 to 255;
enhancing the brightness of the gray-scale image by histogram equalization;
selecting an image detection area and computing an integral image over the selected area; when rectangle features are computed with the integral method, the sum of pixel gray values inside a rectangle need not be recounted each time; only the integral values at the rectangle's corner points are required to obtain the rectangle feature value, so the computation time does not change with the size of the rectangle;
scanning the whole image detection area once to obtain the detection factor coefficient;
multiplying the first-stage detection result by the detection factor coefficient to obtain the weak-classifier result, and cascading the weak classifiers to obtain the strong-classifier detection result;
scaling the original image down by half, and scanning the image to be detected with a fixed-size haar window as the basic window;
computing the mean and variance within windows of different sizes while traversing the image to be detected, and considering a window to contain an upper-body image when the set conditions are met;
computing Haar-like features in the candidate region, and passing them to the cascaded adaboost classifier for further judgment;
if three windows simultaneously detect an upper-body image, the output of the cascade classifier is determined to be true and the upper-body coordinates are output; if no such windows exist, the output of the cascade classifier is determined to be false;
calculating distance(x, y) = Pix(x, y)/1000 for all pixel values in the depth map of the upper-body region;
and performing a consistency analysis on the measured distance data; if the data are consistent, the target detected in the RGB image is confirmed to exist in the depth image, otherwise it is confirmed not to exist in the depth image.
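The constant-time rectangle-sum property claimed for the integral method above can be demonstrated in a few lines. This is a generic summed-area-table sketch, not the patent's code:

```python
def integral_image(img):
    """Summed-area table: ii[r][c] = sum of img[0..r-1][0..c-1]."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for r in range(h):
        row_sum = 0
        for c in range(w):
            row_sum += img[r][c]
            ii[r + 1][c + 1] = ii[r][c + 1] + row_sum
    return ii

def rect_sum(ii, x, y, w, h):
    """Sum of pixels in the w-by-h rectangle at (x, y), via four corner
    lookups: constant time regardless of rectangle size, as the text notes."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]
```

Haar rectangle features are then just differences of `rect_sum` values over adjacent rectangles.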
In another embodiment of the robot automatic tracking method based on RGBD face detection according to the present invention, using the classifier together with the image depth information to perform face detection on the image in which the upper body was detected, until a face is found, and confirming the detected face image comprises:
pre-loading the face-detection classifier data and setting the size of the scanning window; the window size is not limited and is determined by the requirements of the actual project;
and scanning the image with windows of the set scale until a face is detected.
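The scanning pattern described above, a window swept across the image at increasing scales, can be sketched as a generator. The base size, scale factor and step fraction below are illustrative choices, not values taken from the patent:

```python
def scan_windows(width, height, base=24, scale=1.25, step_frac=0.1):
    """Yield (x, y, size) sliding windows over a width-by-height image,
    growing the window by `scale` each pass until it no longer fits.
    Each window would be handed to the face classifier for evaluation."""
    size = base
    while size <= min(width, height):
        step = max(1, int(size * step_frac))   # stride proportional to size
        for y in range(0, height - size + 1, step):
            for x in range(0, width - size + 1, step):
                yield x, y, size
        size = int(size * scale)
```

Scanning stops as soon as the classifier accepts a window, which matches the "until a face is detected" condition.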
In another embodiment of the robot automatic tracking method based on RGBD face detection according to the present invention, performing the distance calculation on the region extracted from the depth image comprises:
obtaining the distance of each pixel in the face region of the depth map as faceDis(x, y) = image(x, y)/1000;
and summing and averaging the distances of all pixels, finally obtaining the actual distance between the robot and the person.
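The two steps above reduce to one small function. One caveat: skipping zero-valued (invalid) depth pixels is our own defensive addition, as the patent text simply averages every pixel in the region:

```python
def face_distance_m(depth_mm, face_rect):
    """Mean of faceDis(x, y) = image(x, y)/1000 over the face rectangle,
    i.e. the average distance in meters between the robot and the face.
    Zero depth values are treated as invalid and ignored (an assumption)."""
    x, y, w, h = face_rect
    ds = [depth_mm[r][c] / 1000.0
          for r in range(y, y + h)
          for c in range(x, x + w)
          if depth_mm[r][c] > 0]
    return sum(ds) / len(ds) if ds else None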
An embodiment of the present invention further provides an RGBD face detection-based robot automatic tracking system, including: the system comprises an RGBD camera, an image algorithm processing platform and a robot motion control unit;
the RGBD camera collects RGB images and depth images of the human body, which are used to recognize the upper body and the face; the image algorithm processing platform receives the images acquired by the RGBD camera, performs face recognition through algorithmic processing, and calculates the distance between the robot and the face; the robot motion control unit obtains the longitudinal (forward/backward) distance of the face from the image algorithm processing platform and, according to the change in this distance, controls the robot to move as the face moves;
the RGBD camera comprises an RGB camera and a depth camera; the RGB camera captures RGB images of the human body and the depth camera captures depth images of the human body, and after the two cameras undergo camera calibration, the image captured by the depth camera coincides with the image captured by the RGB camera;
the RGBD camera is an Intel depth-processing camera.
In another embodiment of the robot automatic tracking system based on RGBD face detection according to the present invention, the RGBD camera further comprises a cascade classifier configured to detect the upper-body image of the human in the image captured by the RGB camera; when an upper-body image is detected, the cascade classifier is used again to perform face detection on the upper-body region of the image.
In another embodiment of the above robot automatic tracking system based on RGBD face detection according to the present invention, the cascade classifier comprises a plurality of strong classifiers connected in series for operation, each strong classifier being a weighted combination of a plurality of weak classifiers.
In another embodiment of the robot automatic tracking system based on RGBD face detection according to the present invention, the image algorithm processing platform performs image algorithm processing by combining the haar and adaboost algorithms with a target depth-information analysis method, comprising:
first, converting the RGB image to gray scale, turning the color image into a gray-scale image with levels 0 to 255;
enhancing the brightness of the gray-scale image by histogram equalization;
selecting an image detection area and computing an integral image over the selected area; when rectangle features are computed with the integral method, the sum of pixel gray values inside a rectangle need not be recounted each time; only the integral values at the rectangle's corner points are required to obtain the rectangle feature value, so the computation time does not change with the size of the rectangle;
scanning the whole image detection area once to obtain the detection factor coefficient;
multiplying the first-stage detection result by the detection factor coefficient to obtain the weak-classifier result, and cascading the weak classifiers to obtain the strong-classifier detection result;
scaling the original image down by half, and scanning the image to be detected with a fixed-size haar window as the basic window;
computing the mean and variance within windows of different sizes while traversing the image to be detected, and considering a window to contain an upper-body image when the set conditions are met;
computing Haar-like features in the candidate region, and passing them to the cascaded adaboost classifier for further judgment;
if three windows simultaneously detect an upper-body image, the output of the cascade classifier is determined to be true and the upper-body coordinates are output; if no such windows exist, the output of the cascade classifier is determined to be false;
calculating distance(x, y) = Pix(x, y)/1000 for all pixel values in the depth map of the upper-body region;
and performing a consistency analysis on the measured distance data; if the data are consistent, the target detected in the RGB image is confirmed to exist in the depth image, otherwise it is confirmed not to exist in the depth image.
In another embodiment of the robot automatic tracking system based on RGBD face detection according to the present invention, the robot motion control unit is configured to perform the distance calculation on the region extracted from the depth image, the distance calculation comprising:
obtaining the distance of each pixel in the face region of the depth map as faceDis(x, y) = image(x, y)/1000;
summing and averaging the distances of all pixels to finally obtain the actual distance between the robot and the person;
and controlling the robot's behavior according to the calculated actual distance between the robot and the person.
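The patent only says the robot moves as the face's longitudinal distance changes; a hypothetical proportional follow controller shows one way the motion control unit could act on that distance. The target distance, deadband and gain values are illustrative assumptions, not values from the patent:

```python
def follow_command(distance_m, target_m=1.0, deadband_m=0.1, gain=0.5):
    """Forward velocity command from the measured face distance:
    positive means move forward (person walked away), negative means
    move backward (person approached), zero means hold position."""
    error = distance_m - target_m
    if abs(error) <= deadband_m:
        return 0.0            # within the deadband: stay put
    return gain * error       # simple proportional response
```

A real controller would also clamp the command to the robot's velocity limits.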
Compared with the prior art, the invention has the following advantages:
the invention relates to a robot automatic tracking method and a system based on RGBD face detection, which comprises the steps of firstly collecting RGB images through an RGB camera of the RGBD camera, carrying out image detection on the upper half of a human body by using a cascade classifier, detecting the upper half of the human body by using an image algorithm processing platform, then carrying out face detection on the upper half of the image by using the cascade classifier again, determining the region of the depth information of the face in the image formed by the depth camera according to the output coordinate information after detecting the face, extracting the face depth information, carrying out distance operation to obtain the longitudinal forward and backward distance of the face, and controlling a robot to move along with the movement of the face according to the distance change information The human-like function is stronger.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, a brief description of the drawings needed in the description of the embodiments or the prior art is given below.
Fig. 1 is a schematic structural diagram of an embodiment of the robot automatic tracking system based on RGBD face detection of the present invention.
Fig. 2 is a flowchart of an embodiment of the robot automatic tracking method based on RGBD face detection of the present invention.
Fig. 3 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection of the present invention.
Fig. 4 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection of the present invention.
Fig. 5 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection of the present invention.
Fig. 6 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection of the present invention.
In the figures: 1, RGBD camera; 11, RGB camera; 12, depth camera; 13, cascade classifier; 2, image algorithm processing platform; 3, robot motion control unit.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a schematic structural diagram of an embodiment of the robot automatic tracking system based on RGBD face detection of the present invention, as shown in fig. 1, the robot automatic tracking system based on RGBD face detection includes:
the robot comprises an RGBD camera 1, an image algorithm processing platform 2 and a robot motion control unit 3;
The RGBD camera 1 collects RGB images and depth images of the human body, which are used to recognize the upper body and the face; the image algorithm processing platform 2 receives the images acquired by the RGBD camera 1, performs face recognition through algorithmic processing, and calculates the distance between the robot and the face; the robot motion control unit 3 obtains the longitudinal (forward/backward) distance of the face from the image algorithm processing platform 2 and, according to the change in this distance, controls the robot to move as the face moves.
The RGBD camera 1 comprises an RGB camera 11 and a depth camera 12;
the RGB camera 11 is used for shooting RGB images of a human body;
the depth camera 12 is used for shooting a depth image of a human body;
After the RGB camera 11 and the depth camera 12 undergo camera calibration, the image captured by the depth camera 12 coincides with the image captured by the RGB camera 11.
The RGBD camera 1 further comprises a cascade classifier 13 configured to detect the upper-body image of the human in the image captured by the RGB camera 11; when an upper-body image is detected, the cascade classifier 13 is used again to perform face detection on the upper-body region of the image.
The RGBD camera 1 is an Intel depth-processing camera.
The cascade classifier 13 comprises a plurality of strong classifiers connected in series for operation, each strong classifier being a weighted combination of a plurality of weak classifiers.
The image algorithm processing platform 2 performs image algorithm processing by combining the haar and adaboost algorithms with a target depth-information analysis method, comprising the following steps:
first, converting the RGB image to gray scale, turning the color image into a gray-scale image with levels 0 to 255;
enhancing the brightness of the gray-scale image by histogram equalization;
selecting an image detection area and computing an integral image over the selected area; when rectangle features are computed with the integral method, the sum of pixel gray values inside a rectangle need not be recounted each time; only the integral values at the rectangle's corner points are required to obtain the rectangle feature value, so the computation time does not change with the size of the rectangle;
scanning the whole image detection area once to obtain the detection factor coefficient;
multiplying the first-stage detection result by the detection factor coefficient to obtain the weak-classifier result, and cascading the weak classifiers to obtain the strong-classifier detection result;
scaling the original image down by half, and scanning the image to be detected with a fixed-size haar window as the basic window;
computing the mean and variance within windows of different sizes while traversing the image to be detected, and considering a window to contain an upper-body image when the set conditions are met;
computing Haar-like features in the candidate region, and passing them to the cascaded adaboost classifier for further judgment;
if three windows simultaneously detect an upper-body image, the output of the cascade classifier 13 is determined to be true and the upper-body coordinates are output; if no such windows exist, the output of the cascade classifier 13 is determined to be false;
calculating distance(x, y) = Pix(x, y)/1000 for all pixel values in the depth map of the upper-body region;
and performing a consistency analysis on the measured distance data; if the data are consistent, the target detected in the RGB image is confirmed to exist in the depth image, otherwise it is confirmed not to exist in the depth image.
The robot motion control unit 3 is configured to perform the distance calculation on the region extracted from the depth image, the distance calculation comprising:
obtaining the distance of each pixel in the face region of the depth map as faceDis(x, y) = image(x, y)/1000;
summing and averaging the distances of all pixels to finally obtain the actual distance between the robot and the person;
and controlling the robot's behavior according to the calculated actual distance between the robot and the person.
Fig. 2 is a flowchart of an embodiment of the robot automatic tracking method based on RGBD face detection of the present invention, and as shown in fig. 2, the robot automatic tracking method based on RGBD face detection includes:
10, acquiring human body sample image data, and evaluating the technical indicators of the classifier on the human body sample images; the human body sample images are captured by the RGBD camera 1, and the classifier's technical indicators include the detection success rate and false-detection rate for the upper body and the success rate and false-detection rate of face detection;
20, using the cascade classifier 13 to detect the upper body of the human, performing numerical analysis with the depth image data, and confirming the detected human body image, wherein the cascade classifier 13 is formed by connecting several strong classifiers in series, each a weighted combination of several weak classifiers;
30, using the cascade classifier 13, together with the image depth information, to perform face detection on the image in which the upper body was detected, until a face is detected, and confirming the detected face image;
and 40, after the face area is detected, using the control module to extract the corresponding area from the depth image and perform the distance calculation, obtaining the accurate distance between the person and the robot.
The method of detecting the human upper body with the classifier and performing numerical analysis with the depth image data is a cascade classification method, in which several strong classifiers are connected in series for operation, each strong classifier being a weighted combination of several weak classifiers.
For example, upper-body image detection uses 10 weak classifiers cascaded together into a cascaded strong classifier. Because each strong classifier distinguishes negative samples with very high accuracy, once the detected target is found to be a negative sample, the subsequent strong classifiers are not invoked, which reduces detection time. Since most areas to be examined in an image are negative samples, the cascade classifier 13 discards many negative samples in its early stages before any complex processing, which makes it very fast. Only positive candidates are sent to the next strong classifier for re-examination, so the probability that a finally output positive sample is a false positive is very low. The cascade-structured classifier consists of multiple stages, each more complex than the previous one; each stage lets all positive samples through and filters out most negative samples, so each stage has fewer candidates to examine than the one before, a large number of non-target regions are eliminated, and the detection speed is greatly improved.
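The early-rejection behavior described above can be made concrete in a few lines. This is a generic attentional-cascade sketch over abstract feature values, not the patent's classifier:

```python
def cascade_predict(window_features, stages):
    """Early-rejection cascade: `stages` is a list of
    (weak_classifiers, threshold) pairs, and each weak classifier is a
    (feature_index, cutoff, weight) triple.  A window is rejected at the
    first stage whose weighted vote falls below the stage threshold, so
    later (more complex) stages never run on it; only windows passing
    every stage are reported positive."""
    for weak_classifiers, threshold in stages:
        score = sum(weight
                    for feature_index, cutoff, weight in weak_classifiers
                    if window_features[feature_index] > cutoff)
        if score < threshold:
            return False      # rejected early
    return True
```

Because typical images contain far more negative windows than positive ones, most windows exit at the cheap early stages, which is exactly why the cascade is fast.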
Fig. 3 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection of the present invention, and as shown in fig. 3, the acquiring human body sample image data includes the following technical indicators of a human body sample image detection classifier:
preparing training data, and making a human body image training sample, wherein the human body image training sample comprises: a negative sample and a positive sample, the negative sample referring to an image not including an object; the positive sample is an image of an object to be detected and cannot contain any negative sample; the negative samples are more diversified, cannot be related to the positive samples, and are better to be taken from daily life;
12, using an image test classifier of the finished training sample, and analyzing technical indexes of the classifier according to a test result, wherein the technical indexes comprise: the detection success rate and the false detection rate of the upper half body image and the success rate and the false detection rate of the human face detection;
and 13, adjusting the technical parameters of the classifier, wherein the adjustable parameters comprise: the number of positive samples used during training, the number of negative samples used during training of each stage of the classifier, the number of stages of the classifier, the haar feature type, and the minimum detection rate of each stage of the classifier.
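The tunable parameters listed in step 13 correspond closely to the flags of OpenCV's `opencv_traincascade` training tool. A hedged sketch of assembling such a training command is shown below; all paths and numeric values are illustrative assumptions, not values from the patent.

```python
# Map the patent's tuning parameters onto opencv_traincascade flags.
# All paths and numbers below are illustrative assumptions.
params = {
    "-data": "classifier/",   # output directory for trained stage files
    "-vec": "pos.vec",        # positive samples (from opencv_createsamples)
    "-bg": "negatives.txt",   # list of negative background images
    "-numPos": 1000,          # positive samples used per stage
    "-numNeg": 2000,          # negative samples used per stage
    "-numStages": 10,         # number of cascade stages
    "-featureType": "HAAR",   # haar feature type
    "-minHitRate": 0.995,     # minimum detection rate of each stage
    "-w": 40,                 # window width matching the 40 x 80 scan window
    "-h": 80,                 # window height
}
cmd = ["opencv_traincascade"] + [str(x) for kv in params.items() for x in kv]
print(" ".join(cmd))
```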
The preparing of the training data and the making of the human body image training sample comprise:
using the RGBD camera 1 to acquire training sample images, wherein the training samples are divided into positive samples and negative samples; a negative sample is an image that does not contain the target; a positive sample is an image of the target to be detected and cannot contain any negative sample content; the negative samples should be diverse, must be unrelated to the positive samples, and are preferably taken from daily life;
calculating an integral image of the sample image, and constructing a characteristic model;
calculating a characteristic value of the characteristic model to obtain a characteristic set;
determining a threshold value, and generating corresponding weak classifiers according to the feature set to obtain a weak classifier set;
training a strong classifier by using an adaboost algorithm;
adding part of the negative sample set again, and calculating the integral image of the whole sample image;
and finishing the acquisition of the human body sample by the cascade classifier.
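The core computations in the training flow above, namely the integral image, the rectangle-sum lookup used by Haar features, and the AdaBoost weighting of weak classifiers, can be sketched with NumPy. The single-rectangle sum and the standard AdaBoost weight formula are simplified illustrations, not the patent's exact implementation.

```python
import numpy as np

def integral_image(img):
    """Cumulative-sum table: ii[y, x] = sum of img[:y+1, :x+1]."""
    return img.cumsum(axis=0).cumsum(axis=1)

def rect_sum(ii, x0, y0, x1, y1):
    """Sum of img[y0:y1+1, x0:x1+1] from at most 4 integral-image lookups,
    so the cost is independent of the rectangle's size."""
    s = ii[y1, x1]
    if x0 > 0:
        s -= ii[y1, x0 - 1]
    if y0 > 0:
        s -= ii[y0 - 1, x1]
    if x0 > 0 and y0 > 0:
        s += ii[y0 - 1, x0 - 1]
    return s

def adaboost_alpha(weighted_error):
    """Standard AdaBoost weight of a weak classifier with the given
    weighted training error (must be in (0, 0.5) to be useful)."""
    return 0.5 * np.log((1 - weighted_error) / weighted_error)

img = np.ones((4, 4))
ii = integral_image(img)
print(rect_sum(ii, 1, 1, 3, 3))           # 3x3 block of ones -> 9.0
print(round(adaboost_alpha(0.2), 3))      # better than chance -> 0.693
```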
Fig. 4 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection according to the present invention, and as shown in fig. 4, the detecting the upper body of the human body by using the classifier and performing numerical analysis by using the depth image data includes:
201, firstly performing gray-scale conversion on the RGB image to change the color image into a gray-scale image with gray levels 0 to 255;
202, enhancing the brightness of the gray-scale image by using a histogram equalization method;
203, selecting an image detection area and performing integral processing on the selected area; when the rectangular features are calculated by the integral method, the sum of the pixel gray values inside a rectangle does not need to be recounted each time; only the integral values at a few corresponding corner points of the rectangle need to be read to obtain the rectangular feature value, so the calculation time does not change with the size of the rectangle;
204, scanning the image detection area once to obtain a detection factor coefficient;
205, multiplying the first-stage detection result by the detection factor coefficient to obtain the weak classifier results, and cascading the weak classifiers to obtain the strong classifier detection result; specifically, a 40 × 80 window is set, the image is scanned once to obtain the detection factor coefficient, the first-stage detection result is multiplied by the detection factor coefficient to obtain the weak classifier results, and the weak classifiers are cascaded to obtain the final detection result.
206, scaling the original image down by half, and scanning the image to be detected with a fixed-size haar window, the fixed-size haar window being the basic window;
207, calculating the mean and variance within windows of different sizes while traversing the image to be detected, and regarding a window as containing the upper half of a human body when the set conditions are met;
208, calculating Haar-like characteristics in the candidate region, and transmitting the characteristics to a cascade adaboost classifier for further judgment;
209, if there exist three windows that simultaneously detect the upper half body image of the human body, determining that the output result of the cascade classifier is a true value and outputting the upper-body coordinates; otherwise, determining that the output result of the cascade classifier is not a true value;
210, calculating distance(x, y) = Pix(x, y)/1000 for all pixel values in the depth map of the upper-body region;
and 211, performing consistency analysis on the measured distance data; if the data are consistent, confirming that the target detected in the RGB image exists in the depth image, and otherwise confirming that the target does not exist in the depth image.
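The depth-side checks of steps 210 and 211 (converting raw depth values to metres and testing their consistency) can be sketched as follows. The consistency criterion used here, a standard-deviation threshold, is an assumption for illustration, since the patent does not specify how consistency is measured.

```python
import numpy as np

def region_distances_m(depth_region):
    """Step 210: distance(x, y) = Pix(x, y) / 1000, converting raw depth
    values (assumed to be in millimetres) to metres."""
    return depth_region.astype(float) / 1000.0

def target_confirmed(depth_region, max_std_m=0.15):
    """Step 211 (assumed criterion): accept the candidate upper-body
    region when its depth values cluster tightly, i.e. they describe
    one surface at a consistent distance, rather than scattered clutter."""
    d = region_distances_m(depth_region)
    return float(np.std(d)) < max_std_m

# A nearly flat person-like region ~1.5 m away passes; random clutter fails.
person = np.full((10, 10), 1500) + np.arange(10)          # mm, nearly flat
clutter = np.random.default_rng(0).integers(500, 4000, (10, 10))
print(target_confirmed(person))   # True
print(target_confirmed(clutter))  # False
```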
The method is characterized in that target depth information analysis is added on the basis of the haar algorithm and the adaboost algorithm. The Haar feature is essentially a statistical characteristic, and classification by statistical methods has an inherent defect: the statistical description is fuzzy. Adding target depth information effectively compensates for this defect of the statistical principle, greatly improves the accuracy of target detection, and effectively reduces the false detection rate.
Fig. 5 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection according to the present invention. As shown in fig. 5, the performing of face detection, using the method of fusing image depth information, in the image in which the upper half of the human body has been detected by the classifier, until a face is detected, and the confirming of the detected face image comprise:
31, pre-loading the face detection classifier data and setting the size of the scanning window, wherein the size of the scanning window is not limited and is determined according to the requirements of the actual project;
and 32, scanning the image according to the window with the set scale until the human face is detected.
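Steps 31 and 32 amount to multi-scale sliding-window scanning with a pre-loaded classifier. A minimal sketch is given below, assuming a generic `classifier(patch)` callable in place of the trained cascade; the stride and scale schedule are illustrative assumptions.

```python
import numpy as np

def scan_for_face(gray, classifier, win=24, stride=8, scales=(1.0, 1.5, 2.0)):
    """Slide windows of several scales over the gray image; return the
    first window (x, y, size) the classifier accepts, or None if no
    face is detected at any scale."""
    for s in scales:
        size = int(win * s)
        for y in range(0, gray.shape[0] - size + 1, stride):
            for x in range(0, gray.shape[1] - size + 1, stride):
                if classifier(gray[y:y + size, x:x + size]):
                    return (x, y, size)
    return None

# Dummy classifier: a "face" is any bright patch; place a bright
# 24x24 square at (x=32, y=16) in an otherwise dark image.
img = np.zeros((64, 64))
img[16:40, 32:56] = 255
hit = scan_for_face(img, lambda p: p.mean() > 200)
print(hit)  # -> (32, 16, 24)
```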
Fig. 6 is a flowchart of another embodiment of the robot automatic tracking method based on RGBD face detection in the present invention, and as shown in fig. 6, the performing distance calculation on the region corresponding to the depth image extraction includes:
41, obtaining the distance faceDis(x, y) = image(x, y)/1000 of each pixel point of the face area in the depth map;
and 42, summing and averaging the distances of all the pixel points to finally obtain the actual distance between the robot and the person.
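Steps 41 and 42 reduce to averaging the metre-converted depth values inside the detected face rectangle. A sketch, with variable names that are assumptions:

```python
import numpy as np

def face_distance_m(depth_face_region):
    """Step 41: faceDis(x, y) = image(x, y) / 1000 converts each raw
    depth pixel (assumed millimetres) to metres; step 42: averaging
    over the face region gives the robot-to-person distance."""
    return float(np.mean(depth_face_region)) / 1000.0

# Face region reported at a uniform 1200 mm -> 1.2 m
print(face_distance_m(np.full((20, 20), 1200)))  # 1.2
```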
The robot automatic tracking method and system based on RGBD face detection provided by the present invention are described in detail above. Specific examples are used herein to explain the principle and implementation of the present invention, and the description of the above embodiments is only intended to help in understanding the method and core idea of the present invention. Meanwhile, for a person skilled in the art, there may be variations in the specific embodiments and the application scope according to the idea of the present invention. In summary, the content of this specification should not be construed as limiting the present invention.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that modifications may be made to the embodiments or portions thereof without departing from the spirit and scope of the invention.

Claims (8)

1. An automatic robot tracking method based on RGBD face detection is characterized by comprising the following steps:
acquiring human body sample image data, and detecting technical indexes of a classifier through the human body sample image;
using a cascade classifier to detect the upper half of the body of the human body, using depth image data to perform numerical analysis, and confirming a detected human body image;
using a cascade classifier to detect a human face, by a method of fusing image depth information, in the image in which the upper half of the human body has been detected, until the human face is detected, and confirming the detected human face image, wherein the cascade classifier is formed by connecting a plurality of strong classifiers, each strong classifier being weighted from a plurality of weak classifiers;
after the face area is detected, a control module is used for extracting a corresponding area from the depth image to perform distance operation to obtain an accurate distance between the robot and the person;
using the classifier to carry out human upper body detection to using the depth image data to carry out numerical analysis, confirming detecting human images includes:
firstly performing gray-scale conversion on the RGB image to change the color image into a gray-scale image with gray levels 0 to 255;
enhancing the brightness of the gray scale image by using a histogram equalization method;
selecting an image detection area and performing integral processing on the selected area; when the rectangular features are calculated by the integral method, the sum of the pixel gray values inside a rectangle does not need to be recounted each time; only the integral values at a few corresponding corner points of the rectangle need to be read to obtain the rectangular feature value, so the calculation time does not change with the size of the rectangle;
scanning the entire image detection area once to obtain a detection factor coefficient;
multiplying the first-stage detection result by the detection factor coefficient to obtain a weak classifier result, and cascading weak classifiers to obtain a strong classifier detection result;
zooming an original image by half, and scanning an image to be detected by using a haar window with a fixed size as a basic window;
calculating the mean value and the variance in windows of different sizes, traversing the image to be detected, and considering the window as the image of the upper half of the human body when the set conditions are met;
calculating Haar-like characteristics in the candidate region, and transmitting the characteristics to a cascade adaboost classifier for further judgment;
if there exist three windows that can simultaneously detect the upper half body image of the human body, the output result of the cascade classifier is determined to be a true value and the upper-body coordinates are output; otherwise, the output result of the cascade classifier is determined not to be a true value;
calculating distance(x, y) = Pix(x, y)/1000 for all pixel values in the depth map of the upper-body region;
and performing consistency analysis on the measured distance data, and if the measured distance data is consistent, confirming that the target detected in the RGB image exists in the depth image, otherwise, confirming that the target does not exist in the depth image.
2. The method of claim 1, wherein the obtaining of the human body sample image data and the detecting of the technical index of the classifier through the human body sample image comprises:
preparing training data and making a human body image training sample, wherein the human body image training sample comprises negative samples and positive samples, a negative sample being an image that does not contain the target, and a positive sample being an image of the target to be detected that cannot contain any negative sample content;
testing the classifier with images of the finished training samples, and analyzing the technical indexes of the classifier according to the test results, wherein the technical indexes comprise: the detection success rate and false detection rate of the upper half body image, and the success rate and false detection rate of face detection;
adjusting technical parameters of a classifier, wherein the parameters adjusted by the classifier comprise: the number of positive samples used during training, the number of negative samples used during training of each stage of classifier, the stage number of the classifier, the haar feature type and the minimum detection rate of each stage of the classifier.
3. The method according to claim 1, wherein the using the classifier to perform face detection using a method of merging image depth information into an image in which the upper body of the human body is detected until the face is detected, and the confirming the detected face image comprises:
pre-loading face detection classifier data, setting the size of a scanning window, wherein the size of the scanning window is not limited and is specifically determined according to the requirements in actual projects;
and scanning the image according to the window with the set scale until the human face is detected.
4. The method of claim 1, wherein the distance operation on the corresponding region extracted from the depth image comprises:
obtaining the distance faceDis(x, y) = image(x, y)/1000 of each pixel point of the face area in the depth map;
and summing and averaging the distances of all the pixel points to finally obtain the actual distance between the robot and the person.
5. An automatic robot tracking system based on RGBD face detection is characterized by comprising: the system comprises an RGBD camera, an image algorithm processing platform and a robot motion control unit;
the RGBD camera is used for collecting RGB images and depth images of a human body and is used for identifying the upper half body and the face images of the human body; the image algorithm processing platform is used for receiving a human body image acquired by the RGBD camera, realizing human face recognition of a human body through algorithm processing, and calculating the distance between the robot and the human face; the robot motion control unit obtains the longitudinal forward and backward distance of the human face through the image algorithm processing platform and controls the robot to move along with the movement of the human face according to the distance change information;
the RGBD camera comprises an RGB camera and a depth camera, the RGB camera is used for shooting RGB images of a human body, the depth camera is used for shooting depth images of the human body, and after the RGB camera and the depth camera are calibrated by a camera, images shot by the depth camera are overlapped with images shot by the RGB camera;
the RGBD camera is an Intel depth processing camera;
the image algorithm processing platform performs image algorithm processing by adding a target depth information analysis method on the basis of the haar algorithm and the adaboost algorithm, comprising:
firstly performing gray-scale conversion on the RGB image to change the color image into a gray-scale image with gray levels 0 to 255;
enhancing the brightness of the gray scale image by using a histogram equalization method;
selecting an image detection area and performing integral processing on the selected area; when the rectangular features are calculated by the integral method, the sum of the pixel gray values inside a rectangle does not need to be recounted each time; only the integral values at a few corresponding corner points of the rectangle need to be read to obtain the rectangular feature value, so the calculation time does not change with the size of the rectangle;
scanning the entire image detection area once to obtain a detection factor coefficient;
multiplying the first-stage detection result by the detection factor coefficient to obtain a weak classifier result, and cascading weak classifiers to obtain a strong classifier detection result;
zooming an original image by half, and scanning an image to be detected by using a haar window with a fixed size as a basic window;
calculating the mean value and the variance in windows of different sizes, traversing the image to be detected, and considering the window as the image of the upper half of the human body when the set conditions are met;
calculating Haar-like characteristics in the candidate region, and transmitting the characteristics to a cascade adaboost classifier for further judgment;
if there exist three windows that can simultaneously detect the upper half body image of the human body, the output result of the cascade classifier is determined to be a true value and the upper-body coordinates are output; otherwise, the output result of the cascade classifier is determined not to be a true value;
calculating distance(x, y) = Pix(x, y)/1000 for all pixel values in the depth map of the upper-body region;
and performing consistency analysis on the measured distance data, and if the measured distance data is consistent, confirming that the target detected in the RGB image exists in the depth image, otherwise, confirming that the target does not exist in the depth image.
6. The system of claim 5, wherein the RGBD camera further comprises a cascade classifier, and the cascade classifier is configured to detect an upper body image of a human body in the image captured by the RGB camera, and when the upper body image is detected, perform face detection on the upper body region of the image again using the cascade classifier.
7. The system of claim 6, wherein the cascade classifier comprises a plurality of strong classifiers operatively connected together, each strong classifier being formed by weighting a plurality of weak classifiers.
8. The system of claim 5, wherein the robot motion control unit is configured to perform a distance operation on the corresponding region extracted from the depth image, and the distance operation comprises:
obtaining the distance faceDis(x, y) = image(x, y)/1000 of each pixel point of the face area in the depth map;
summing and averaging the distances of all the pixel points to finally obtain the actual distance between the robot and the person;
and controlling the robot behavior according to the calculated actual distance between the robot and the person.
CN201710028570.5A 2017-01-16 2017-01-16 Robot automatic tracking method and system based on RGBD face detection Active CN106886216B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710028570.5A CN106886216B (en) 2017-01-16 2017-01-16 Robot automatic tracking method and system based on RGBD face detection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710028570.5A CN106886216B (en) 2017-01-16 2017-01-16 Robot automatic tracking method and system based on RGBD face detection

Publications (2)

Publication Number Publication Date
CN106886216A CN106886216A (en) 2017-06-23
CN106886216B true CN106886216B (en) 2020-08-14

Family

ID=59176339

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710028570.5A Active CN106886216B (en) 2017-01-16 2017-01-16 Robot automatic tracking method and system based on RGBD face detection

Country Status (1)

Country Link
CN (1) CN106886216B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108647555A (en) * 2017-11-21 2018-10-12 江苏鸿山鸿锦物联技术有限公司 Whether there is or not the detection methods of personnel in a kind of car based on video image
CN109895140A (en) * 2017-12-10 2019-06-18 湘潭宏远电子科技有限公司 A kind of robotically-driven trigger device
CN107977636B (en) * 2017-12-11 2021-11-30 北京小米移动软件有限公司 Face detection method and device, terminal and storage medium
CN108197507A (en) * 2017-12-30 2018-06-22 刘智 A kind of privacy real-time protection method and system
CN108537843A (en) * 2018-03-12 2018-09-14 北京华凯汇信息科技有限公司 The method and device of depth of field distance is obtained according to depth image
CN108734083B (en) * 2018-03-21 2023-04-25 北京猎户星空科技有限公司 Control method, device, equipment and storage medium of intelligent equipment
CN108527366B (en) * 2018-03-22 2020-12-04 北京理工华汇智能科技有限公司 Robot following method and device based on depth of field distance
CN108647662A (en) * 2018-05-17 2018-10-12 四川斐讯信息技术有限公司 A kind of method and system of automatic detection face
CN109299595A (en) * 2018-09-08 2019-02-01 太若科技(北京)有限公司 Method, apparatus and AR equipment based on hand skin texture information unlock AR equipment
CN111259684B (en) * 2018-11-30 2023-07-28 Tcl科技集团股份有限公司 Method and device for determining distance from camera to face
CN110276271A (en) * 2019-05-30 2019-09-24 福建工程学院 Merge the non-contact heart rate estimation technique of IPPG and depth information anti-noise jamming
CN112115913B (en) * 2020-09-28 2023-08-25 杭州海康威视数字技术股份有限公司 Image processing method, device and equipment and storage medium
CN117519488B (en) * 2024-01-05 2024-03-29 四川中电启明星信息技术有限公司 Dialogue method and dialogue system of dialogue robot

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101477625B (en) * 2009-01-07 2013-04-24 北京中星微电子有限公司 Upper half of human body detection method and system
CN102214291B (en) * 2010-04-12 2013-01-16 云南清眸科技有限公司 Method for quickly and accurately detecting and tracking human face based on video sequence
US20160191995A1 (en) * 2011-09-30 2016-06-30 Affectiva, Inc. Image analysis for attendance query evaluation
US8861800B2 (en) * 2010-07-19 2014-10-14 Carnegie Mellon University Rapid 3D face reconstruction from a 2D image and methods using such rapid 3D face reconstruction
CN103093212B (en) * 2013-01-28 2015-11-18 北京信息科技大学 The method and apparatus of facial image is intercepted based on Face detection and tracking
CN103995747B (en) * 2014-05-12 2017-05-31 上海大学 Distributed pedestrian detecting system and method based on mobile robot platform
CN105187721B (en) * 2015-08-31 2018-09-21 广州市幸福网络技术有限公司 A kind of the license camera and method of rapid extraction portrait feature
CN105182983A (en) * 2015-10-22 2015-12-23 深圳创想未来机器人有限公司 Face real-time tracking method and face real-time tracking system based on mobile robot
CN106250850B (en) * 2016-07-29 2020-02-21 深圳市优必选科技有限公司 Face detection tracking method and device, and robot head rotation control method and system

Also Published As

Publication number Publication date
CN106886216A (en) 2017-06-23

Similar Documents

Publication Publication Date Title
CN106886216B (en) Robot automatic tracking method and system based on RGBD face detection
KR100612858B1 (en) Method and apparatus for tracking human using robot
CN104616002A (en) Facial recognition equipment used for judging age groups
JP2016015045A (en) Image recognition device, image recognition method, and program
CN110084830B (en) Video moving object detection and tracking method
CN110287907B (en) Object detection method and device
CN114693661A (en) Rapid sorting method based on deep learning
CN110942450A (en) Multi-production-line real-time defect detection method based on deep learning
CN111091057A (en) Information processing method and device and computer readable storage medium
CN114721403B (en) Automatic driving control method and device based on OpenCV and storage medium
CN113822352A (en) Infrared dim target detection method based on multi-feature fusion
CN107045630B (en) RGBD-based pedestrian detection and identity recognition method and system
CN101320477B (en) Human body tracing method and equipment thereof
CN106529441A (en) Fuzzy boundary fragmentation-based depth motion map human body action recognition method
CN108563997B (en) Method and device for establishing face detection model and face recognition
CN111178405A (en) Similar object identification method fusing multiple neural networks
CN113021355B (en) Agricultural robot operation method for predicting sheltered crop picking point
CN112949451B (en) Cross-modal target tracking method and system through modal perception feature learning
US20220405527A1 (en) Target Detection Methods, Apparatuses, Electronic Devices and Computer-Readable Storage Media
CN110334703B (en) Ship detection and identification method in day and night image
WO2022263904A1 (en) Target detection methods, apparatuses, electronic devices and computer-readable storage media
CN115908886A (en) Image classification method, image processing apparatus, and storage device
CN112926676B (en) False target identification method and device and computer equipment
CN117523428B (en) Ground target detection method and device based on aircraft platform
CN114155475B (en) Method, device and medium for identifying end-to-end personnel actions under view angle of unmanned aerial vehicle

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant