CN110765877A

CN110765877A - Pedestrian detection method and system based on thermal imager and binocular camera

Info

Publication number: CN110765877A
Application number: CN201910894646.1A
Authority: CN
Inventors: 周竹萍; 杨旭; 梅亚岚; 裘梦琪
Original assignee: Nanjing Tech University
Current assignee: Nanjing Tech University
Priority date: 2019-09-20
Filing date: 2019-09-20
Publication date: 2020-02-07
Anticipated expiration: 2039-09-20
Also published as: CN110765877B

Abstract

The invention provides a pedestrian detection method and system based on a thermal imager and a binocular camera, and the method comprises the following steps: acquiring an image in front of the vehicle by using a thermal imager and a binocular camera; eliminating parallax between images collected by a binocular camera; graying the images of the binocular camera and the thermal imager; respectively scanning the grayed binocular camera image and thermal imager image in a raster scanning mode according to set step lengths by using a sliding window, and simultaneously extracting HOG characteristics of the binocular camera image and the thermal imager image in each step length to obtain HOG characteristic description values; judging whether the window has the pedestrian or not by utilizing the classification result of the SVM; and comparing the detection results of the binocular camera image and the thermal imager image to obtain a final detection result. The invention has small pedestrian detection calculation amount, continuously updates each parameter and state quantity, can perform early warning uninterruptedly and improves the timeliness and continuity of early warning.

Description

Pedestrian detection method and system based on thermal imager and binocular camera

Technical Field

The invention belongs to the field of automatic driving of automobiles, and particularly relates to a pedestrian detection method and system based on a thermal imager and a binocular camera.

Background

Automatic and reliable detection of pedestrians is an important function of autonomous vehicles or Advanced Driver Assistance Systems (ADAS). Research efforts regarding pedestrian detection are largely dependent on the data, as different data and methods may yield different assessment results. In the current data collection device of the pedestrian detection device, the most common sensor in data acquisition is a conventional color video camera, but the color video camera still has many limitations. For example, color cameras are sensitive to lighting conditions. If the image quality suffers under light deficient conditions. Thermal imagers can be used to overcome some of the limitations of color cameras because they are not affected by lighting conditions. But a single thermal imager detection can be affected by the ambient temperature. For example, in the case of high outdoor temperatures, pedestrians cannot be well distinguished from surrounding objects.

Disclosure of Invention

The invention aims to provide a pedestrian detection method based on a thermal imager and a binocular camera.

The technical solution for realizing the purpose of the invention is as follows: a pedestrian detection method based on a thermal imager and a binocular camera comprises the following steps:

step 1, arranging a thermal imager and a binocular camera on a vehicle, and collecting an image in front of the vehicle;

step 2, eliminating parallax between images collected by a binocular camera;

step 3, graying the binocular camera image and the thermal imager image after the parallax is eliminated in the step 2;

step 4, scanning the grayed binocular camera image and the thermal imager image by using a sliding window in a raster scanning mode according to set step lengths, and simultaneously extracting HOG characteristics of the binocular camera image and the thermal imager image in each step length to obtain HOG characteristic description values;

step 5, training the SVM by adopting a pedestrian database to obtain a coefficient w^TThe HOG characteristic description value X corresponding to each sliding window and the coefficient w are compared^TPerforming multiply-accumulate to obtain the result and setting the threshold

Adding, if the sum is larger than 0, judging that a pedestrian exists in the window;

and 6, comparing the detection results of the binocular camera image and the thermal imager image, outputting a unified result if the detection results of the two images are consistent, and judging that the two images are not consistent.

Preferably, the step 4 of scanning the grayed binocular camera image and the thermal imager image by using the sliding window in a raster scanning manner according to the set step length comprises the following specific steps:

setting an interested area of a scene to be detected;

setting the size of the sliding window and the horizontal and longitudinal moving step length;

and moving a sliding window to traverse the whole picture to be detected according to the rules from left to right and from top to bottom in the region of interest from the upper left corner of the picture to be detected by a set transverse and longitudinal step length.

Preferably, the specific method for performing HOG feature extraction on the binocular camera image and the thermal imager image is as follows:

calculating the pixel gradient of the gray image in the sliding window, and determining the mode and the direction of the pixel gradient;

dividing the gray level image in the sliding window into n pixel blocks, and performing dot multiplication on a modulus of each pixel point in each pixel block and a Gaussian matrix;

performing three-bit linear interpolation on each pixel block and arbitrating accumulation to form a 36-dimensional vector histogram according to the coordinates of the pixel block where the pixel point is located and the gradient vector angle of the pixel point;

normalizing the 36-dimensional vector histogram obtained by each pixel block;

and combining the n normalized 36-dimensional vectors to obtain the 36X n-dimensional HOG feature description value X in the window.

Preferably, the specific method for calculating the gradient of the gray image pixel in the sliding window and determining the mode and direction of the gradient of the pixel point comprises the following steps:

calculating the directional gradient of the abscissa and the ordinate of the pixel by first order differential derivation:

f (x, y) respectively represents the gradient of the pixel point in the horizontal direction, the gradient in the vertical direction and the pixel value;

the mode and direction of the gradient direction of the pixel point are as follows:

in the formula, | G (x, y) | represents a modulus of a gradient vector of a pixel point, and θ (x, y) represents a direction of the gradient vector.

Preferably, the formula for normalizing the vector histogram is:

in the formula, v^*Represents the histogram normalization result, v represents a 36-dimensional vector histogram, and epsilon is a constant.

Preferably, the pedestrian database is adopted to train the SVM to obtain the coefficient w^TAnd a threshold valueThe HOG characteristic description value X corresponding to each sliding window is compared with the coefficient w^TPerforming multiply-accumulate to obtain the result and setting the threshold

Adding, if the sum is greater than 0, judging the windowThe specific method for the existence of the pedestrian comprises the following steps:

training the SVM by adopting a Daimer pedestrian database to obtain a 36 multiplied by n dimensional coefficient w^TAnd a threshold value

Coefficient w^TDividing the HOG characteristic description value X corresponding to each window into 15 parts, and matching the HOG characteristic description value X corresponding to each window with the coefficient w of the 15 parts^TPerforming multiplication accumulation, wherein the formula is as follows:

t＝0,1,…,14；

k＝0,1,…,6…,s

in the formula, X represents a characteristic description value, and s is the number of times of transverse movement of the sliding window;

calculating the value of y by the following formula, and if y is a positive number, determining that the pedestrian exists in the window

Wherein y represents the multiply-accumulate result; x represents a feature description value;

representing a threshold value.

The invention also provides a pedestrian detection system based on the thermal imager and the binocular camera, which is characterized by comprising the following components:

the thermal imager and the binocular camera are arranged on the vehicle and used for acquiring image information in front of the vehicle;

a computer for processing image information;

the computer is used for completing:

eliminating parallax between images collected by a binocular camera;

graying the binocular camera image and the thermal imager image after the parallax is eliminated;

respectively scanning the grayed binocular camera image and thermal imager image in a raster scanning mode according to set step lengths by using a sliding window, and simultaneously extracting HOG characteristics of the binocular camera image and the thermal imager image in each step length to obtain HOG characteristic description values;

training the SVM by adopting a pedestrian database to obtain a coefficient w^TAnd a threshold valueThe HOG characteristic description value X corresponding to each sliding window is compared with the coefficient w^TPerforming multiply-accumulate to obtain the result and setting the threshold

and comparing the detection results of the binocular camera image and the thermal imager image, outputting a unified result if the detection results of the two images are consistent, and judging that the two images are not consistent.

Preferably, the thermal imaging system further comprises an aluminum box for protecting the binocular camera and the thermal imager, and the binocular camera and the thermal imager are arranged in the aluminum box.

Compared with the prior art, the invention has the following remarkable advantages:

1. the invention uses the thermal imager and the binocular camera to collect real data, performs machine learning classification training, detects pedestrians according to comparison results, further can be interconnected with a motor vehicle braking system through a computer, is matched with a conflict detection system, and is applied to the fields of automatic driving of automobiles and the like;

2. the pedestrian detection calculation amount is small, each parameter and state amount are continuously updated, the early warning can be uninterruptedly carried out, and the timeliness and the continuity of the early warning are improved;

3. in the selection of the vehicle-mounted equipment, mature sensors and processors in the prior art can be adopted, so that the vehicle-mounted equipment is conveniently installed in various types of automobiles and is convenient to debug;

4. by utilizing the pedestrian detection method, the stereoscopic vision camera and the thermal imager are combined, so that the method has multiple applicable scenes and high pedestrian detection precision, and has a promoting effect on the research of an automatic driving automobile or an advanced driving auxiliary system.

The present invention is described in further detail below with reference to the attached drawings.

Drawings

Fig. 1 is a flowchart of a pedestrian detection method based on a thermal imager and a binocular camera.

Fig. 2 is a schematic view of the installation of the thermal imager and the binocular camera according to the present invention.

Fig. 3 is a schematic diagram of the operation of the sliding window for detecting pedestrians according to the present invention.

FIG. 4 is a flowchart of SVM classifier computation of the present invention.

Detailed Description

As shown in fig. 1, a pedestrian detection method based on a thermal imager and a binocular camera is characterized by comprising the following steps:

step 1, as shown in FIG. 2, arranging a thermal imager and a binocular camera on a vehicle, and collecting images in front of the vehicle;

step 2, eliminating parallax between images collected by a binocular camera;

step 4, as shown in fig. 3, respectively scanning the grayed binocular camera image and thermal imager image in a raster scanning mode according to set step lengths by using a sliding window, and simultaneously performing HOG feature extraction on the binocular camera image and the thermal imager image in each step length to obtain HOG feature description values; the method comprises the following specific steps of respectively scanning the grayed binocular camera image and the thermal imager image in a raster scanning mode according to a set step length by utilizing a sliding window:

setting an interested area of a scene to be detected;

The specific method for carrying out HOG feature extraction on the images of the binocular camera and the thermal imager comprises the following steps:

calculating the pixel gradient of the gray image in the sliding window, and determining the mode and the direction of the pixel gradient, wherein the specific method comprises the following steps:

normalizing the 36-dimensional vector histogram obtained by each pixel block, wherein the formula is as follows:

in the formula, v^*Representing a histogram normalization result, v represents a 36-dimensional vector histogram, and epsilon is a constant;

Step 5, as shown in fig. 4, training the SVM by using the pedestrian database to obtain the coefficient w^TThe HOG characteristic description value X corresponding to each sliding window and the coefficient w are compared^TPerforming multiply-accumulate to obtain the result and setting the threshold

Adding, if the sum is larger than 0, judging that the pedestrian exists in the window, and the specific method comprises the following steps:

order to

t＝0,1,…,14；

k＝0,1,…,6…,s

Wherein, X represents a feature description value,

and calculating the value of y by the following formula, and if y is a positive number, judging that the pedestrian exists in the window:

representing a threshold value.

A pedestrian detection system based on a thermal imager and a binocular camera is carried on a vehicle and comprises:

the device for collecting the thermal image information in front of the vehicle comprises at least one thermal imager, wherein the thermal imager is arranged on the top of the vehicle and below a binocular camera.

The computer for processing the image information comprises at least one notebook computer, is arranged in the vehicle, and is connected with the binocular camera and the thermal imager by a data line.

In a further embodiment, the computer performs:

eliminating parallax between images collected by a binocular camera;

respectively scanning the grayed binocular camera image and thermal imager image in a raster scanning mode according to set step lengths by using a sliding window, and simultaneously extracting HOG characteristics of the binocular camera image and the thermal imager image in each step length to obtain HOG characteristic description values; the method comprises the following specific steps of respectively scanning the grayed binocular camera image and the thermal imager image in a raster scanning mode according to a set step length by utilizing a sliding window:

setting an interested area of a scene to be detected;

in the formula, v^*Representing the histogram normalization result, v represents a 36-dimensional vector histogram, and ε is a constantCounting;

Training the SVM by adopting a pedestrian database to obtain a coefficient w^TThe HOG characteristic description value X corresponding to each sliding window and the coefficient w are compared^TPerforming multiply-accumulate to obtain the result and setting the threshold

order to

t＝0,1,…,14；

k＝0,1,…,6…,s

(s is the number of times the sliding window is moved laterally-1)

Wherein, X represents a feature description value,

representing a threshold value.

The device for protecting the binocular camera and the thermal imager comprises at least one aluminum box, wherein the aluminum box is installed at the top of a vehicle, and the binocular camera and the thermal imager are installed inside the aluminum box.

Example 1

The pedestrian detection method based on the thermal imager and the binocular camera comprises the following steps:

step 2, eliminating parallax between images collected by a binocular camera; in this embodiment, the disparity value of the images acquired by the left and right cameras is calculated by using the following formula:

x_{left side of}＝x_{Right side}-parallax (x)_{Right side},y)

Wherein (x)_{Left side of}Y) is the position of the midpoint in the image captured by the left camera, (x)_{Right side}Y) is the position of the point in the right camera captured image, and disparity () is the disparity value at a given position;

step 3, graying the binocular camera image and the thermal imager image after the parallax is eliminated in the step 2, wherein the specific method comprises the following steps:

extracting color information (numerical values of three color channels of red (R), green (G) and blue (B)) in the picture to be detected, carrying out gray processing,

r after gradation (R + before processing, G + before processing, B)/3

G after graying is (R + G before processing + B before processing)/3;

b after graying (R + G before processing + B before processing)/3

R, G, B in the formula represent the values of three color channels of red, green and blue, respectively.

Step 4, in this embodiment, a 64 × 128 pixel window is used to scan a frame of image after graying in a raster scanning manner, and the scanning steps in the horizontal direction and the vertical direction are both 8 pixels;

HOG feature extraction is carried out on the grayed binocular camera image and thermal imager image in each step length, and the specific method comprises the following steps:

determining the mode and the direction of the gradient direction of the pixel point:

and

respectively representing the distance between two pixel points in the horizontal direction and the vertical direction, | G (x, y) | and theta (x, y) represent the modulus of the gradient vector of the pixel point and the direction of the gradient vector;

dividing a window by units of 8 × 8 pixels to form 8 × 16-128 units, regarding 4 adjacent units of upper, lower, left and right sides as a pixel block, and regarding each window to include 15 × 7-105 pixel blocks;

performing dot multiplication on a modulus of each pixel point in a 16 × 16 pixel block and a Gaussian matrix to darken the edge of the pixel block;

according to the coordinates (x, y) of the pixel points relative to the pixel blocks and the gradient vector angle theta of the pixel points, performing three-bit linear interpolation on each pixel block and arbitrating and accumulating to form a 36-dimensional vector histogram:

v(x₁，y₁，z₁)←v(x₁，y₁，z₁)+w[1-(x-x₁)/b_x]×[1-(y-y₁)/b_y]×[1-(z-z₁)/b_z]

v(x₁，y₁，z₂)←v(x₁，y₁，z₂)+w[1-(x-x₁)/b_x]×[1-(y-y₁)/b_y]×[1-(z-z₂)/b_z]

v(x₁，y₂，z₁)←v(x₁，y₂，z₁)+w[1-(x-x₁)/b_x]×[1-(y-y₂)/b_y]×[1-(z-z₁)/b_z]

v(x₂，y₁，z₁)←v(x₂，y₁，z₁)+w[1-(x-x₂)/b_x]×[1-(y-y₁)/b_y]×[1-(z-z₁)/b_z]

v(x₂，y₁，z₂)←v(x₂，y₁，z₂)+w[1-(x-x₂)/b_x]×[1-(y-y₁)/b_y]×[1-(z-z₂)/b_z]

v(x₁，y₂，z₂)←v(x₁，y₂，z₂)+w[1-(x-x₁)/b_x]×[1-(y-y₂)/b_y]×[1-(z-z₂)/b_z]

v(x₂，y₂，z₁)←v(x₂，y₂，z₁)+w[1-(x-x₂)/b_x]×[1-(y-y₂)/b_y]×[1-(z-z₁)/b_z]

v(x₂，y₂，z₂)←v(x₂，y₂，z₂)+w[1-(x-x₂)/b_x]×[1-(y-y₂)/b_y]×[1-(z-z-z₁)/b_z]

in the formula, x and y represent the spatial position of a pixel point; z represents the gradient direction of the point, for the pixel point (x, y), its gradient is setDegree amplitude is omega, gradient direction is z, z₁And z₂Is the midpoint coordinate of the 2 most adjacent histograms, and the histogram bandwidth of the gradient histogram v along the x, y and z directions is b ═ b [, b [ ]_x，b_y，b_z]，b_x＝b_y＝8，b_z＝180°/9，

Finally, an L2-Hys function is selected to normalize the obtained 36-dimensional vector histogram, and the formula is as follows:

wherein v is^*Histogram normalization result (36-dimensional vector) representing one pixel block; v represents a 36-dimensional vector histogram; ε is 0.005.

Thus, in a window comprising 105 pixel blocks, v for 105 pixel blocks is determined^*Merging to generate 3780-dimensional HOG characteristic description value X;

step 5, training the SVM by adopting a pedestrian database to obtain a vector 3780-dimensional coefficient w^TAnd a threshold value

Will w^TDividing into 15 parts, and performing multiplication and accumulation of the 36-dimensional characteristic value of the current pixel block and the 36-dimensional coefficient corresponding to the horizontal 7 windows simultaneously by the 15 parts, and calculating as follows:

order to

n＝0,1,…,14；

k＝0,1,…,6

Wherein X represents a feature description value,

Wherein y represents the multiply-accumulate result; x represents a feature drawingThe value is as described;

represents a threshold value;

and 6, comparing the detection results of the binocular camera image and the thermal imager image, outputting the result if the detection results of the two images are consistent, and judging that the two images are not consistent.

Claims

1. A pedestrian detection method based on a thermal imager and a binocular camera is characterized by comprising the following steps:

step 2, eliminating parallax between images collected by a binocular camera;

2. The pedestrian detection method based on the thermal imager and the binocular camera according to claim 1, wherein the step 4 of scanning the grayed image of the binocular camera and the image of the thermal imager by using the sliding window in a raster scanning manner according to a set step length comprises the specific steps of:

setting an interested area of a scene to be detected;

3. The pedestrian detection method based on the thermal imager and the binocular camera according to claim 1, wherein the specific method for performing HOG feature extraction on the image of the binocular camera and the image of the thermal imager is as follows:

normalizing the 36-dimensional vector histogram obtained by each pixel block;

4. The pedestrian detection method based on the thermal imager and the binocular camera according to claim 3, wherein the specific method for calculating the gradient of the gray image pixels in the sliding window and determining the mode and the direction of the gradient of the pixel points comprises the following steps:

G_x(x,y)＝f(x+1,y)-f(x-1,y)

G_y(x,y)＝f(x,y+1)-f(x,y-1)

G_x(x,y)、G_y(x,y)、f(x,y) respectively representing the gradient of the pixel point in the horizontal direction, the gradient in the vertical direction and the pixel value;

5. The thermal imager and binocular camera based pedestrian detection method of claim 3, wherein the formula for normalizing the vector histogram is:

6. The thermal imager and binocular camera-based pedestrian detection method according to claim 1, wherein a pedestrian database is used to train the SVM to obtain the coefficient w^TAnd a threshold valueThe HOG characteristic description value X corresponding to each sliding window is compared with the coefficient w^TPerforming multiply-accumulate to obtain the result and setting the threshold

And if the sum is greater than 0, the specific method for judging that the pedestrian exists in the window is as follows:

training the SVM by adopting a Daimer pedestrian database to obtainA 36 × n dimensional coefficient w^TAnd a threshold value

t＝0,1,…,14；

k＝0,1,…,6…,s

Wherein, y represents the result of multiply-accumulate,

representing a threshold value.

7. The utility model provides a pedestrian detection system based on thermal imager and binocular camera which characterized in that includes:

a computer for processing image information;

the computer is used for executing the steps 2 to 6 of any claim from 1 to 6.

8. The pedestrian detection system based on the thermal imager and the binocular camera according to claim 1, further comprising an aluminum box for protecting the binocular camera and the thermal imager, the binocular camera and the thermal imager being disposed in the aluminum box.