CN110738665B

CN110738665B - Object contact identification method based on depth image information

Info

Publication number: CN110738665B
Application number: CN201910875363.2A
Authority: CN
Inventors: 陈衍; 俞云松; 许杭; 袁玉华; 周建仓; 王海萍; 孙璐; 姜胜男; 朱飞腾
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2019-09-17
Filing date: 2019-09-17
Publication date: 2021-10-29
Anticipated expiration: 2039-09-17
Also published as: CN110738665A

Abstract

The invention provides an object contact identification method based on depth image information. And analyzing the connected domain and depth information of different moving objects or between the moving object and a static object. Thereby definitely determining whether a contact relation between the objects occurs. The method overcomes the defects that the two-dimensional information of the common camera shooting information is limited and the contact relation of the object cannot be accurately judged; the method provided by the invention is suitable for contact recognition between moving and static objects, and particularly provides a quick and practical discrimination method under the condition of partial and complete shielding. By using less operation cost, whether the contact relation exists between the moving object and the static object can be accurately judged under the shielding condition.

Description

Object contact identification method based on depth image information

Technical Field

The invention relates to an object contact identification method in an image, in particular to an object contact identification method based on depth image information.

Background

In the field of video surveillance artificial intelligence applications, core problems that often need to be dealt with include segmentation, tracking, recognition, and behavior analysis of objects. Among the behavioral analysis requirements, the most fundamental analysis requirement is to determine the direct relative position and contact relationship between an object and an object. Contact relationships are often considered more central recognition decision points. Many common intelligent application requirements, such as limb contact discrimination, intrusion into alert zones, strict inhibition of touch object monitoring, etc., require contact relationship discrimination as a basis.

The common single camera monitoring video is planar two-dimensional data without depth information, although advanced artificial intelligence technology can perform object segmentation and position relation judgment in a certain range. However, in practical applications, due to the natural two-dimensional information limitation of a single camera, even if the plane relations between the object objects are completely adjacent, it is difficult to determine whether the object objects are at the same depth position, so that it is impossible to accurately distinguish whether the contact relation between the objects actually exists. Meanwhile, when the judged object is completely shielded, whether the judged object has a real contact relation with the shielding object is more difficult to judge. Even if the problem can be partially solved by applying a more complex algorithm for a specific scene, the high algorithm design cost is required. The root cause problem is the lack of depth information.

Disclosure of Invention

The invention aims to provide an object contact identification method based on depth image information, aiming at the defects of the prior art.

The purpose of the invention is realized by the following technical scheme:

an object contact identification method based on depth image information comprises the following steps:

(1) completing depth image information acquisition: and acquiring original depth image information by a depth image camera.

(2) Object segmentation: dividing the object into a moving object and a static object and respectively segmenting the moving object and the static object, wherein the method comprises the following substeps:

(2.1) moving object segmentation: and (4) extracting a foreground moving object by using a background removal algorithm.

(2.2) segmentation of stationary objects: and (4) performing segmentation extraction on the static object by using a watershed algorithm of the depth image.

(3) Identifying an object to be detected: and (3) identifying the object to be detected from the objects segmented in the step (2) through artificial intelligence.

(4) The contact identification of the object to be detected is divided into the following two conditions:

and (4.1) when the object to be detected is partially shielded or not shielded, directly utilizing a connected domain analysis method to carry out contact identification on the object to be detected and other objects.

(4.2) when the object to be detected is completely shielded, judging the contact relation of the moving object to be detected when the moving object to be detected is completely shielded by utilizing the contact identification result of the partially shielded or unshielded state of the object before the object is completely shielded; and the depth value of the static object to be detected is subtracted from the depth value of the moving object for shielding the static object to be detected by the static object to be detected when the static object to be detected is not shielded, so that the depth deviation value is obtained. And when the depth deviation amount is smaller than the shielding judgment threshold value, judging that the static object to be detected and a moving object shielding the static object to be detected have a contact relation. The shielding judgment threshold value is about 10 times of the minimum resolution unit of the depth camera.

Further, in the step (1), the depth image camera includes a binocular camera and a structured light camera.

Further, in the step (2.1), the background removal algorithm is a gaussian mixture model algorithm, a kernel density estimation algorithm, a ViBe algorithm, a background subtraction method, or the like.

Further, in the step (3), since the position of the stationary object in the stationary scene is fixed, the stationary object can also be identified and located by an active pre-marking method.

The invention has the beneficial effects that: the method overcomes the defects that the two-dimensional information of the common camera shooting information is limited and the contact relation of the object cannot be accurately judged; the method provided by the invention is suitable for contact recognition between moving and static objects, and particularly provides a quick and practical discrimination method under the condition of partial and complete shielding. By using less operation cost, whether the contact relation exists between the moving object and the static object can be accurately judged under the shielding condition.

Drawings

FIG. 1 is a flow chart of the method of the present invention;

FIG. 2 is a schematic diagram of a LeNet neural network architecture;

fig. 3 is a stationary object occlusion determination flow.

Detailed Description

The invention is described in further detail below with reference to the figures and examples.

As shown in fig. 1, an object contact recognition method based on depth image information includes the following steps:

(3) Identifying an object to be detected: in step (2), the objects in the scene are segmented one by one. But they are not necessarily the objects to be measured which we need, so the objects need to be identified one by one. Identification of the object to be tested is typically accomplished using artificial intelligence. The mainstream intelligent object recognition algorithm at present takes a convolutional neural network algorithm (CNN) as a core. Taking conventional LeNet as an example, as shown in fig. 2, the network structure is composed of a convolutional layer, a pooling layer, and a full connection layer. The connections between local pixels in an image are relatively close, while pixels at greater distances are relatively weak. Therefore, each neuron does not need to sense the image overall situation, only needs to sense local information, and then integrates the local information at a higher layer to obtain the overall information. The convolution layer operation is to realize local perception field, and the convolution operation can share weight value, so the parameter quantity is reduced. The pooling layer is used for reducing the size of an input image, reducing pixel information and only keeping important information, and is mainly used for reducing the calculation amount. Mainly comprises maximum pooling and mean pooling. The full-connection layer plays a role of a classifier in the whole convolutional neural network to obtain a final object classification result. While the layers incorporate non-linearity using an activation function. Common activation functions are sigmod, tanh, relu, which are commonly used in fully-connected layers, and relu is commonly used in convolutional layers.

(4.1) when the object to be detected is partially shielded or not shielded: according to the characteristics of the depth image information, when partial shielding or non-shielding exists between two objects, a connected point of the depth information inevitably exists between the two objects, so that the contact identification of the object to be detected and other objects is directly carried out by using a connected domain analysis method.

And (4.2) when the object to be detected is completely shielded, the object to be detected does not appear in the scene. Judging the contact relation of the moving object to be detected when the moving object to be detected is completely shielded by utilizing the contact identification result of the partially shielded or unshielded state of the object before the object is completely shielded; the depth information of the static object to be measured is theoretically fixed, so when the static object to be measured is completely shielded, the depth value of the moving object shielding the static object to be measured can be used for subtracting the depth value of the position where the static object to be measured is located when the static object to be measured is not shielded, and the depth deviation value is obtained, as shown in fig. 2. And when the depth deviation amount is smaller than the shielding judgment threshold value, judging that the static object to be detected and a moving object shielding the static object to be detected have a contact relation. The occlusion determination threshold is a preset small depth deviation amount, and is generally set according to an actual detection scene determination scale, and is generally set to be about 10 times of a minimum resolution unit of a depth camera.

In addition, in the step (1), the depth image camera comprises a binocular camera and a structured light camera. In the step (2.1), the background removing algorithm is a Gaussian mixture model algorithm, a kernel density estimation algorithm, a ViBe algorithm, a background difference method and the like.

The method overcomes the defects that the two-dimensional information of the common camera shooting information is limited and the contact relation of the object cannot be accurately judged; the method provided by the invention is suitable for contact recognition between moving and static objects, and particularly provides a quick and practical discrimination method under the condition of partial and complete shielding. By using less operation cost, whether the contact relation exists between the moving object and the static object can be accurately judged under the shielding condition.

Claims

1. An object contact identification method based on depth image information is characterized by comprising the following steps:

(1) completing depth image information acquisition: acquiring original depth image information by a depth image camera;

(2.1) moving object segmentation: extracting foreground moving objects by using a background removal algorithm;

(2.2) segmentation of stationary objects: performing segmentation extraction on the static object by using a watershed algorithm of the depth image;

(3) identifying an object to be detected: identifying an object to be detected from the objects segmented in the step (2) through artificial intelligence;

(4.1) when the object to be detected is partially shielded or not shielded, directly utilizing a connected domain analysis method to carry out contact identification on the object to be detected and other objects;

(4.2) when the object to be detected is completely shielded, judging the contact relation of the moving object to be detected when the moving object to be detected is completely shielded by utilizing the contact identification result of the partially shielded or unshielded state of the object before the object is completely shielded; subtracting the depth value of the position of the static object to be detected when the static object to be detected is not shielded from the depth value of the moving object shielding the static object to be detected by the static object to be detected to obtain a depth deviation value; when the depth deviation amount is smaller than the shielding judgment threshold value, judging that the static object to be detected and a moving object shielding the static object to be detected have a contact relation; the shielding judgment threshold value is 10 times of the minimum resolution unit of the depth image camera.

2. The method of claim 1, wherein in step (1), the depth image camera comprises a binocular camera, a structured light camera.

3. The method according to claim 1, wherein in step (2.1), the background removal algorithm is a Gaussian mixture model algorithm, a kernel density estimation algorithm, a ViBe algorithm or a background subtraction method.

4. The method according to claim 1, wherein in the step (3), the stationary object is further identified and located by an active pre-marking method.