CN110647813A

CN110647813A - Human face real-time detection and identification method based on unmanned aerial vehicle aerial photography

Info

Publication number: CN110647813A
Application number: CN201910771567.1A
Authority: CN
Inventors: 刘洋; 冉欢欢; 李毅捷; 谢雨峰; 时翔; 唐柯
Original assignee: CHENGDU SHINE TECHNOLOGY Co Ltd
Current assignee: CHENGDU SHINE TECHNOLOGY Co Ltd
Priority date: 2019-08-21
Filing date: 2019-08-21
Publication date: 2020-01-03

Abstract

The invention discloses a human face real-time detection and identification method based on unmanned aerial vehicle aerial photography, which comprises the following steps: loading a data processing model, acquiring data by a camera, and writing image data; preprocessing data; compressing the converted image data and storing the image data in a shared memory; the processor reads the image data; carrying out convolution processing on the data; the pooling layer performs dimensionality reduction on the data; carrying out nonlinear mapping on the data; carrying out classification processing on the weighted sum of the data; performing non-maximum suppression on the data; face matching is carried out on adjacent video frames, and a tracking template is detected and updated through a fixed frame number; calculating the correlation degree of the obtained target and the detected target to enhance the matching accuracy; superposing the face frames and counting the faces; and displaying the plug flow. The invention can realize real-time face detection and identification and greatly reduce the false detection rate.

Description

Human face real-time detection and identification method based on unmanned aerial vehicle aerial photography

Technical Field

The invention relates to the technical field of unmanned aerial vehicle monitoring, in particular to a human face real-time detection and identification method based on unmanned aerial vehicle aerial photography.

Background

The traditional face recognition is mainly applied to recognition scenes based on fixed cameras, the face recognition effect is poor, the flexibility is poor, and once personnel move or the cameras move, face recognition data are inaccurate; in the relative movement of the camera and the personnel, the real-time performance of the face recognition is poor.

Disclosure of Invention

The invention aims to solve the problems and provides a human face real-time detection and identification method based on unmanned aerial vehicle aerial photography, which comprises the following steps:

s1: the detection module loads a data processing model and waits for receiving image data;

s2, the camera collects data and writes the data into the memory;

s3, preprocessing data: converting the format of image data, normalizing the image data, and converting the data into RGB data;

s4, compressing the converted image data and storing the image data in a shared memory;

s5, the detection module reads the image data and processes the data;

s6, carrying out convolution processing on the convolution layer neural network through convolution check data;

s7, the pooling layer performs dimensionality reduction on the data, and selects the maximum pixel value in the convolution kernel as a dimensionality reduced value;

s8, the excitation layer carries out nonlinear mapping on the output data of the convolution processing: let alpha be the slope coefficient, x be the abscissa value, and g (x) be the result of the function, then

g(x)＝max(αx，x)；

S9, classifying the extracted feature data by the full connection layer through weighted summation: y is Wx, wherein W is a weight vector and x is a feature vector;

s10, performing non-maximum value inhibition on the detection result, and removing repeated detection of the same target;

s11, reading each frame data in the video, and setting a reference value of the frame spacing frame number according to the scene; judging whether the current frame number is a multiple of a reference value; if yes, carrying out face detection on the next frame, and updating a tracking template; if not, matching with the face detected in the front so as to track the face;

s12, judging the correlation degree of the targets in the front frame and the rear frame by a normalized cross-correlation matching algorithm, if the correlation degree is within a certain threshold value range, determining the targets as the same face targets, otherwise, determining the targets as different face targets, and achieving the purpose of face tracking;

s13, superposing the face frames and counting the faces; and displayed by plug flow.

The invention has the beneficial effects that: based on the view angle of the unmanned aerial vehicle, the invention can realize mobile shooting, carry out real-time deep learning network, carry out network optimization and training aiming at the data format of the camera, further reduce the network operation amount, realize real-time face detection and recognition, and simultaneously greatly reduce the false detection rate.

Drawings

FIG. 1 is a flow chart of the present invention.

Detailed Description

The invention will be further described with reference to the accompanying drawings in which:

as shown in fig. 1, the invention relates to a human face real-time detection and identification method based on unmanned aerial vehicle aerial photography, which comprises the following steps:

s2, the camera collects data and writes the data into the memory;

s5, the detection module reads the image data and processes the data;

g(x)＝max(αx，x)；

Further, the normalizing the data in S3 specifically includes:

and setting x as the value of the current pixel value normalization, x as the current pixel value, and max as the maximum value of all pixels in the current frame, then: x is the number of^*＝log₁₀(x)/log₁₀(max)。

Further, the step S2 of converting the data format specifically includes: conversion of 1920 × 1080 × 2 uyvy data into 1920 × 1080 × 3 RGB data:

wherein: y is brightness, u and v denote chromaticity, R denotes a red channel, G denotes a green channel, and B denotes a blue channel.

Further, the convolution processing in S6 specifically includes:

where R (i, j) is the convolution value at the image (i, j), k is the convolution kernel of n × n, n is 2a, and f (i, j) is the pixel value at the image (i, j).

Further, the specific process of S7 is as follows:

y(u，v)＝max{f(u+i，v+j)，i∈(0，n)，j∈(0，n)}；

wherein: n denotes the size of the kernel and y (u, v) denotes the maximum pixel value in the neighborhood n x n in the image after max-firing.

Further, the specific process of S10 is as follows:

wherein, the interration is the intersection of two target frames, the Union is the Union of the two target frames, and if IoU exceeds a set threshold, the two target frames are the same target frame;

further, the performing face matching on adjacent frames in S11 specifically includes:

wherein: t is the matching template, S is the image matrix, and D is the matching result at (i, j).

Further, the normalization product correlation algorithm in S12 specifically processes:

wherein: e (S)^i，j) The average gray scale value of the subgraph at (i, j), e (T) the average gray scale value of the template, M × N the matching template size, and T the matching template.

The specific process is as follows:

the detection module loads a data processing model and waits for receiving image data; the camera collects data and writes the data into the memory.

Pretreatment: the data that airborne camera gathered are fixed format, fixed resolution, and the data of gathering are the uyvy format, 1920x 1080's two-channel data, need carry out the preliminary treatment to data, including data format conversion and normalization, specifically include: and setting x as the value of the current pixel value normalization, x as the current pixel value, and max as the maximum value of all pixels in the current frame, then: x is the number of^*＝log₁₀(x)/log₁₀(max)。

After normalization, 1920 × 1080 × 2 uyvy data was converted into 1920 × 1080 × 3 rgb data:

To reduce the amount of computation and maintain the accuracy of target detection, a 1920x1080 resolution is compressed to 640 x 640: let Q₁₁、Q₁₂、Q₂₁And Q₂₂The pixel coordinate positions of adjacent pixel points in the image are respectively (x)₁，y₁)、(x₁，y₂)、(x₂，y₁) And (x)₂，y₂) P is the point to be estimated, R₁Is Q₁₁And Q₂₁Approximate evaluation point of, R₂Is Q₁₂And Q₂₂P is R₁And R₂Then:

through the normalization processing and bilinear interpolation calculation, the original data is processed into a format suitable for a face detection algorithm and stored in a shared memory, as shown in a flow chart shown in fig. 1.

And the processor reads the image data, loads the data processing model and processes the data.

Convolution processing: carrying out convolution processing on data through convolution check in the neural network: and setting x as the value of the current pixel value normalization, x as the current pixel value, and max as the maximum value of all pixels in the current frame, then: x is the number of^*＝log₁₀(x)/log₁₀(max)。

Performing pooling treatment: the pooling layer performs dimensionality reduction on the data, a max-posing mode is selected for dimensionality reduction, the calculated amount can be reduced, and the maximum value of the data in the convolution kernel is selected as the value after dimensionality reduction: setting: n represents the size of the kernel, y (u, v) represents the maximum pixel value in the neighborhood n x n in the image (u, v) as a result after max-posing, then: y (u, v) ═ max { f (u + i, v + j), i ∈ (0, n), j ∈ (0, n) }.

Excitation: non-linear mapping of the convolution processed output: assuming α is the slope coefficient, x is the abscissa parameter, and g (x) is the result of the function, then: g (x) max (α x, x).

The full connection layer classifies the extracted feature data through weighted summation: and y is Wx, wherein W is a weight vector and x is a feature vector.

By the above-mentioned feature extraction of the data, and the target detection and classification, the same target may be detected many times in the result, and in order to eliminate the phenomenon, the result is subjected to non-maximum suppression, and the process is as follows:

the intersections is the intersection of two target frames, the Union is the Union of the two target frames, the larger the IoU is, the higher the overlapping degree is, if the overlapping degree exceeds a certain threshold value, the same target frame is considered, and the selection is preferred.

Reading frame data, judging whether the frame number is a multiple of a reference value according to the reference value of the scene set frame number, if so, performing face matching on adjacent frames, updating a tracking template, otherwise, performing face matching on the adjacent frames, and performing face tracking;

further judging the absolute error and the target obtained by the algorithm and the detected target by normalizing the algorithm correlation coefficientAnd the correlation degree greatly reduces the false detection rate. The specific process is as follows: setting: e (S)^i，j) (ii) the average gray value of the subgraph at (i, j), e (T) the average gray value of the template, M × N the size of the matching template, and T the matching template, then:

based on the view angle of the unmanned aerial vehicle, the invention can realize mobile shooting, carry out real-time deep learning network, carry out network optimization and training aiming at the data format of the camera, further reduce the network operation amount, realize real-time face detection and recognition, and simultaneously greatly reduce the false detection rate.

The technical solution of the present invention is not limited to the limitations of the above specific embodiments, and all technical modifications made according to the technical solution of the present invention fall within the protection scope of the present invention.

Claims

1. A real-time face detection and identification method based on unmanned aerial vehicle aerial photography is characterized by comprising the following steps:

s2, the camera collects data and writes the data into the memory;

s5, the detection module reads the image data and processes the data;

g(x)＝max(αx，x)；

2. The real-time human face detection and identification method based on unmanned aerial vehicle aerial photography according to claim 1, wherein the normalization processing of data in S3 specifically comprises:

3. The real-time human face detection and identification method based on unmanned aerial vehicle aerial photography according to claim 1, wherein the step S3 of converting the data format specifically comprises: converting the 1920 × 1080 × 2 UYVY-formatted data into 1920 × 1080 × 3 RGB-formatted data:

wherein: y is brightness, U and V denote chromaticities, R denotes a red channel, G denotes a green channel, and B denotes a blue channel.

4. The real-time human face detection and identification method based on unmanned aerial vehicle aerial photography according to claim 1, wherein the convolution processing in S6 specifically comprises:

5. The real-time human face detection and identification method based on unmanned aerial vehicle aerial photography according to claim 1, wherein the S7 specific process is as follows:

y(u，v)＝max{f(u+i，v+j)，i∈(0，n)，j∈(0，n)}；

6. The real-time human face detection and identification method based on unmanned aerial vehicle aerial photography according to claim 1, wherein the S10 specific process is as follows:

wherein, the interration is the intersection of two target frames, the Union is the Union of the two target frames, and if IoU exceeds the set threshold, the same target frame is obtained.

7. The real-time human face detection and identification method based on unmanned aerial vehicle aerial photography according to claim 1, wherein the performing of human face matching of adjacent frames in S11 specifically comprises:

8. The real-time human face detection and identification method based on unmanned aerial vehicle aerial photography according to claim 1, wherein the normalization product correlation algorithm in the step S12 is used for processing the specific process: