CN108681704A - A kind of face identification system based on deep learning - Google Patents

A kind of face identification system based on deep learning Download PDF

Info

Publication number
CN108681704A
CN108681704A CN201810460351.9A CN201810460351A CN108681704A CN 108681704 A CN108681704 A CN 108681704A CN 201810460351 A CN201810460351 A CN 201810460351A CN 108681704 A CN108681704 A CN 108681704A
Authority
CN
China
Prior art keywords
face
module
network
recognition
deep learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810460351.9A
Other languages
Chinese (zh)
Inventor
曲秀杰
魏天博
彭程
杜鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Original Assignee
Beijing Institute of Technology BIT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT filed Critical Beijing Institute of Technology BIT
Priority to CN201810460351.9A priority Critical patent/CN108681704A/en
Publication of CN108681704A publication Critical patent/CN108681704A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • G06V40/166Detection; Localisation; Normalisation using acquisition arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

A kind of face identification system based on deep learning, belongs to deep learning and technical field of image processing.Including the ends PC network training module, system on chip and periphery peripheral module:The ends PC network training module includes data set processing, CNN training, initialization, gradient passback, backpropagation and propagated forward unit;Network structure is built in the network module of the ends PC first, while acquiring enough face training sets, trains an available deep learning recognition of face network;System on chip includes image capture module, fifo control modules, deep learning face recognition module, personnel information storage module and image display.400FPS can be reached in recognition speed under 50MHz clocks;Accuracy of identification is up to 99.25% than quasi- eye recognition precision;Training set covers various illumination conditions, can complete identification function under most illumination condition, possess good robustness.

Description

A kind of face identification system based on deep learning
Technical field
The present invention relates to a kind of face identification systems based on deep learning, belong to digital image processing techniques field.
Background technology
Face has the characteristics that height is nonrigid, and there is a large amount of details for embodying individual difference.Recognition of face is to pass through The facial image detected from still image or dynamic video is compared with the facial image in database, finds therewith The process of matched face is commonly used in the purpose of identification and identification, is the project for belonging to living things feature recognition field.
There are many traditional face identification methods, can be broadly divided into two classes, and one kind is the method based on part, is such as utilized Local description Gabor, local binary patterns etc. are identified;Another kind of is based on global method, includes the face of classics The streams such as recognizer, such as eigenface method, Fisher face sub-space learning algorithm and locality preserving projections algorithm Row learning algorithm.
Principal Component Analysis is mainly still concentrated in the research of field of face identification now both at home and abroad, LBP algorithms, and On deep learning algorithm, these three methods can be better than other algorithms in the success rate and realization difficulty of recognition of face.
Deep learning possesses very strong robustness and high discrimination in field of face identification, and recognition effect is far super Other algorithms.But recognition of face is realized in FPGA platform at this stage mainly or uses Principal Component Analysis and LBP algorithms.
Although both algorithms compared with deep learning algorithm, are more easy to realize in FPGA platform, due to algorithm The limitation (poor robustness, discrimination are low, and pretreatment consumes when increasing) of itself, in the speed and precision of recognition of face, this two Kind algorithm cannot reach the technology requirement for identifying and adapting in real time frame by frame various illumination conditions.Document is shown, at present in FPGA The scientific achievement detection speed of realization recognition of face is mostly within 10FPS on platform, and most fast achievement can only also reach at this stage To 25FPS.And deep learning algorithm is mostly to utilize CPU, GPU, TPU are realized that the research in FPGA platform is also relatively It is few, it is designed for the even more opposite blank of the deep learning network of recognition of face using FPGA, also there is some scientific research personnel It makes some attempts in this respect, but never ripe scientific achievement.
The present invention realizes a kind of face identification system of deep learning in FPGA platform, the system 50MHz when Clock operation at frequencies can reach 400FPS in recognition speed, some remote super existing achievements.Accuracy of identification is than anthropomorphic outlook Other precision, up to 99.25%.Training set covers various illumination conditions, and identification work(can be completed under most illumination condition Can, possess good robustness.
Invention content
It is an object of the invention to be directed in FPGA platform face recognition technology have that real-time is poor, accuracy of detection is low and A kind of defect of poor robustness, it is proposed that face identification system based on deep learning.
It is a kind of based on the face identification system of deep learning by the ends PC network training module, system on chip and periphery peripheral hardware mould Block forms:
Wherein, the ends PC network training module includes:Data set processing unit, CNN training units, initialization unit, gradient Back propagation unit, backpropagation unit and propagated forward unit;According to actual needs, net is built in the network module of the ends PC first Network structure, while enough face training sets are acquired, an available deep learning recognition of face network is trained, wherein;People Face identifies that the activation primitive that network uses is sigmoid functions;
System on chip includes that image capture module, fifo control modules, deep learning face recognition module, personal information are deposited Store up module and image display;
Periphery peripheral module includes camera submodule, VGA D/A converter modules and local display;
Wherein, include mainly multi-path camera and camera connecting line in camera submodule;
The connection relation of each module is as follows in a kind of face identification system based on deep learning:
Multichannel data channel is connected to image by camera connecting line and adopted by camera submodule multi-path camera in the block Collecting module, image capture module is connected to fifo control modules, and fifo control modules are connected to deep learning face recognition module, It is connected to image display simultaneously, deep learning face module is connected to personnel information storage module, and personal information stores mould Block is integrated in fifo control modules, is controlled by fifo control modules, and result is output to image display;Image is aobvious Show that module again by digital data transmission to VGA D/A converter modules, is output to local display after digital-to-analogue conversion;
Modules function is as follows:
Multi-path camera completes the acquisition function of image, and is transferred to image capture module;
The function of image capture module is to receive the input data of video flowing, and be transferred to fifo control modules;
Fifo control modules cache the vision signal transmitted, to ensure the integrality of output image data, and it is right The picture progress transmitted is down-sampled in real time, and the pixel size for being downsampled to suitable Network Recognition is stored in fifo;
The function of deep learning face recognition module is to realize the acceleration processing of face identification functions, and input is by drop The facial image of sampling after module completes recognition of face, exports face result;
The information of personnel is deposited in personnel information storage module, it receives the knot from deep learning face recognition module Fruit, and personnel's specifying information is output to image display, which is integrated in fifo control modules;It is controlled by fifo Module controls, and is output to image display;
The function of image display be receive by caching video stream data, generate specification video flowing sequential and With corresponding pixel data, the VGA D/A converter modules being output to outside piece, in addition the module is also with data automatic aligning work( Can, after local sequential can be overcome to generate problem of misalignment is shown caused by input video flow data lag;
The modules function of periphery peripheral module is as follows:
The function of camera is acquisition outer scene, generates stable digital video frequency flow;
VGA D/A converter module functions are the digital signal and row field sync signal of reception image display outflow, and Digital signal is subjected to digital-to-analogue conversion, is transferred to local display;The function of local display is acquisition image and identification knot Fruit real-time display;
A kind of course of work of the face identification system based on deep learning, including build recognition of face network, training of human Face identification network builds four parts of FPGA platform and network parameter update, is as follows:
Step 1 builds the convolutional neural networks for recognition of face in the ends PC network training module;
Step 2 is trained recognition of face network with the face database being collected in advance, obtains trained network ginseng Number;
Step 3 builds recognition of face network based on FPGA platform;
In step 4, the recognition of face network for putting up the network parameter steps for importing 3 of step 2 training output.
Advantageous effect
A kind of face identification system based on deep learning has the advantages that compared with prior art:
The design that 1. deep learning is not combined with recognition of face in the prior art and is realized in FPGA platform, And the present invention is successfully realized in FPGA platform based on deep learning recognition of face network;
2. compared with existing face recognition technology, recognition speed is improved:The place of existing correlation face identification system Reason speed is generally below 25FPS, the present invention by using the CNN models in depth learning technology field to the image collected into Row training, substantially increases image processing speed;
3. compared with existing recognition of face, identification accuracy is improved, and is embodied as:Based on FPGA platform reality The method of existing recognition of face is mostly Principal Component Analysis and LBP algorithms, and the precision of identification can only achieve 80% or so;And this The deep learning algorithm discrimination that invention uses can reach 99% or more, than the other precision of personification outlook.
Description of the drawings
Fig. 1 is a kind of structure diagram of the face identification system based on deep learning of the present invention;
Fig. 2 is the schematic network structure in a kind of face identification system embodiment based on deep learning of the present invention.
Specific implementation mode
The present invention will be further described with reference to the accompanying drawings and examples and detailed description.
Embodiment 1
The present embodiment describes the structure of a kind of face identification system based on deep learning of the present invention and specific Implement.
As shown in Figure 1, being a kind of structure diagram of the face identification system based on deep learning.According to the structure diagram, The present embodiment is made.
The present embodiment is realized that whole system processing image size is using company of purple light Tontru PGT180H devices 540X480 pixels.
First, one 6 layers of CNN convolutional neural networks of matlab project trainings are used in the network training module of the ends PC, Here we use the CMU_PIE face databases of Carnegie Mellon University and the face database of part oneself acquisition.By test The test of collection, 6 layer network can reach 99.25% to 20 people's discriminations in face database.
6 layers of convolutional neural networks are by one layer of input layer, two layers of convolutional layer, two layers of pond layer and one layer of full articulamentum group At.Input layer is that the image of 32x32 pixel sizes is connected to the full articulamentum of last layer, and export knot after 4 hidden layers Fruit.Network uses activation primitive for sigmoid functions;
Wherein, 4 hidden layers include first layer convolutional layer, first layer pond layer, second layer convolutional layer and second layer pond layer;
6 layer network structures are put up in FPGA platform, and network architecture parameters are trained as shown in Fig. 2, importing.
For embodiment system camera using MT9V034 gray level image sensors, this is a kind of global exposure CMOS biographies Sensor, when both full-pixel (752Hx480V) is exported, speed is 60FPS, and SoC systems on chip output it in initialization and are configured to Continuous data (including field/row synchronize) then device automatic collection and is input to the Image Acquisition mould of system by 540x480 sizes Block;
The example has built one 6 layers of convolutional neural networks in FPGA platform.The network structure and the training of the ends PC Network structure is identical, by one layer of input layer, two layers of convolutional layer, two layers of pond layer and one layer of full articulamentum composition, wherein Sigmoid activation primitives are realized using 6 sections of piecewise fittings.Under the clock frequency of 50MHz, the system is complete in 2.4ms At the identification of face, the number of identification is 20 people.After completing algorithm process, the result of identification is transferred to fifo control modules, The output of personal information is carried out by fifo control module controllers information storage modules;
The real-time cache image that camera acquires and personal information are transferred to image display by fifo control modules, Image display generates corresponding row field sync signal, and peripheral hardware VGA D/A converter modules are given in output.VGA D/A converter modules Conversion of the digital signal to analog signal is completed, and connects VGA display screens, completes real-time acquisition display and the recognition result of image Display.
The operating process of the present invention:System electrification loads bit files, completes the configuration of system.Identification target enters image Pickup area, target identification result carry out real-time display in local display.
System described in this example builds the recognition of face net of 20 people in the case where Image Acquisition speed is sufficiently fast Network can complete recognition of face using the clock frequency of 50MHz as system clock in 2.4ms, recognition speed can reach 400FPS realizes the real-time face identification of 60FPS since the frame number of MT9V034 gray level image sensors limits in this example, Non-delay, compared to the processing speed of existing design (being less than 25FPS), real-time is improved.And it is high on recognition accuracy Up to 99.25%, eye recognition accuracy rate of comparing, remote super some current designs based on principal component analysis and LBP algorithms.Not This example is tested under same light environment, recognition of face can be accurately completed, to the strong robustness of illumination.
The above is presently preferred embodiments of the present invention, and the present invention should not be limited to disclosed in the embodiment and attached drawing Content.It is every not depart from the lower equivalent or modification completed of spirit disclosed in this invention, both fall within the scope of protection of the invention.

Claims (1)

1. a kind of face identification system based on deep learning, it is characterised in that:Including the ends PC network training module, system on chip With periphery peripheral module;
Wherein, the ends PC network training module includes:Data set processing unit, CNN training units, initialization unit, gradient passback Unit, backpropagation unit and propagated forward unit;According to actual needs, network knot is built first in the network module of the ends PC Structure, while enough face training sets are acquired, an available deep learning recognition of face network is trained, wherein;Face is known The activation primitive that other network uses is sigmoid functions;
System on chip includes image capture module, fifo control modules, deep learning face recognition module, personal information storage mould Block and image display;
Periphery peripheral module includes camera submodule, VGA D/A converter modules and local display;
Wherein, include mainly multi-path camera and camera connecting line in camera submodule;
The connection relation of each module is as follows in a kind of face identification system based on deep learning:
Multichannel data channel is connected to Image Acquisition mould by camera submodule multi-path camera in the block by camera connecting line Block, image capture module are connected to fifo control modules, and fifo control modules are connected to deep learning face recognition module, simultaneously It is connected to image display, deep learning face module is connected to personnel information storage module, personnel information storage module collection It in fifo control modules, is controlled by fifo control modules, result is output to image display;Image shows mould Block is output to local display again by digital data transmission to VGA D/A converter modules after digital-to-analogue conversion;
Modules function is as follows:
Multi-path camera completes the acquisition function of image, and is transferred to image capture module;
The function of image capture module is to receive the input data of video flowing, and be transferred to fifo control modules;
Fifo control modules cache the vision signal transmitted, to ensure the integrality of output image data, and to transmitting Picture carry out in real time it is down-sampled, the pixel size for being downsampled to suitable Network Recognition is stored in fifo;
The function of deep learning face recognition module is to realize the acceleration processing of face identification functions, and input is by down-sampled Facial image, module complete recognition of face after, export face result;
The information of personnel is deposited in personnel information storage module, it receive from deep learning face recognition module as a result, simultaneously Personnel's specifying information is output to image display, which is integrated in fifo control modules;By fifo control modules Control, is output to image display;
The function of image display is the video stream data received by caching, generates the video flowing sequential of specification and matches phase The pixel data answered, the VGA D/A converter modules being output to outside piece, in addition the module also have the function of data automatic aligning, can To overcome the caused display problem of misalignment of input video flow data lag after local sequential generation;
The modules function of periphery peripheral module is as follows:
The function of camera is acquisition outer scene, generates stable digital video frequency flow;
VGA D/A converter module functions are to receive the digital signal and row field sync signal of image display outflow, and will count Word signal carries out digital-to-analogue conversion, is transferred to local display;The function of local display is that acquisition image and recognition result are real When show;
A kind of course of work of the face identification system based on deep learning, including build recognition of face network, training face knowledge Other network builds four parts of FPGA platform and network parameter update, is as follows:
Step 1 builds the convolutional neural networks for recognition of face in the ends PC network training module;
Step 2 is trained recognition of face network with the face database being collected in advance, obtains trained network parameter;
Step 3 builds recognition of face network based on FPGA platform;
In step 4, the recognition of face network for putting up the network parameter steps for importing 3 of step 2 training output.
CN201810460351.9A 2018-05-15 2018-05-15 A kind of face identification system based on deep learning Pending CN108681704A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810460351.9A CN108681704A (en) 2018-05-15 2018-05-15 A kind of face identification system based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810460351.9A CN108681704A (en) 2018-05-15 2018-05-15 A kind of face identification system based on deep learning

Publications (1)

Publication Number Publication Date
CN108681704A true CN108681704A (en) 2018-10-19

Family

ID=63806132

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810460351.9A Pending CN108681704A (en) 2018-05-15 2018-05-15 A kind of face identification system based on deep learning

Country Status (1)

Country Link
CN (1) CN108681704A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109274941A (en) * 2018-10-23 2019-01-25 合肥博焱智能科技有限公司 Multichannel video decoding Face datection recognition methods based on FPGA
CN109446996A (en) * 2018-10-31 2019-03-08 北京智慧眼科技股份有限公司 Facial recognition data processing unit and processing method based on FPGA
CN109815929A (en) * 2019-01-31 2019-05-28 青岛科技大学 Face identification method based on convolutional neural networks
CN111079717A (en) * 2020-01-09 2020-04-28 西安理工大学 Face recognition method based on reinforcement learning

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103093175A (en) * 2011-10-27 2013-05-08 上海德致伦电子科技有限公司 Face identification system on chip based on field programmable gate array (FPGA)
CN105589938A (en) * 2015-12-13 2016-05-18 公安部第三研究所 Image retrieval system and retrieval method based on FPGA
CN105956660A (en) * 2016-05-16 2016-09-21 浪潮集团有限公司 Neural network chip realization method used for real-time image identification
CN106845380A (en) * 2017-01-13 2017-06-13 江苏信息职业技术学院 Face identification system based on mechanical arm
CN107423680A (en) * 2017-06-06 2017-12-01 广西师范大学 Face identification device based on FPGA

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103093175A (en) * 2011-10-27 2013-05-08 上海德致伦电子科技有限公司 Face identification system on chip based on field programmable gate array (FPGA)
CN105589938A (en) * 2015-12-13 2016-05-18 公安部第三研究所 Image retrieval system and retrieval method based on FPGA
CN105956660A (en) * 2016-05-16 2016-09-21 浪潮集团有限公司 Neural network chip realization method used for real-time image identification
CN106845380A (en) * 2017-01-13 2017-06-13 江苏信息职业技术学院 Face identification system based on mechanical arm
CN107423680A (en) * 2017-06-06 2017-12-01 广西师范大学 Face identification device based on FPGA

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109274941A (en) * 2018-10-23 2019-01-25 合肥博焱智能科技有限公司 Multichannel video decoding Face datection recognition methods based on FPGA
CN109274941B (en) * 2018-10-23 2020-12-18 合肥博焱智能科技有限公司 Multi-channel video decoding face detection and identification method based on FPGA
CN109446996A (en) * 2018-10-31 2019-03-08 北京智慧眼科技股份有限公司 Facial recognition data processing unit and processing method based on FPGA
CN109446996B (en) * 2018-10-31 2021-01-22 智慧眼科技股份有限公司 Face recognition data processing device and method based on FPGA
CN109815929A (en) * 2019-01-31 2019-05-28 青岛科技大学 Face identification method based on convolutional neural networks
CN111079717A (en) * 2020-01-09 2020-04-28 西安理工大学 Face recognition method based on reinforcement learning

Similar Documents

Publication Publication Date Title
CN108681704A (en) A kind of face identification system based on deep learning
He Research of a sign language translation system based on deep learning
Deep et al. Hybrid model featuring CNN and LSTM architecture for human activity recognition on smartphone sensor data
CN107480178B (en) Pedestrian re-identification method based on cross-modal comparison of image and video
CN107092894A (en) A kind of motor behavior recognition methods based on LSTM models
CN109886225A (en) A kind of image gesture motion on-line checking and recognition methods based on deep learning
US11386598B2 (en) Method and system for sensor data recognition using data enrichment for the learning process
US11526704B2 (en) Method and system of neural network object recognition for image processing
CN110458046B (en) Human motion trajectory analysis method based on joint point extraction
CN107609525A (en) Remote Sensing Target detection method based on Pruning strategy structure convolutional neural networks
CN104851099A (en) Method for image fusion based on representation learning
CN112001347B (en) Action recognition method based on human skeleton morphology and detection target
CN113158727A (en) Bimodal fusion emotion recognition method based on video and voice information
CN109409219A (en) Indoor occupant locating and tracking algorithm based on depth convolutional network
WO2022156654A1 (en) Text data processing method and apparatus
CN108171734A (en) A kind of method and device of ORB feature extracting and matchings
CN111680660B (en) Human behavior detection method based on multi-source heterogeneous data stream
CN108921037A (en) A kind of Emotion identification method based on BN-inception binary-flow network
Chalasani et al. Egocentric gesture recognition for head-mounted ar devices
Makarov et al. Russian sign language dactyl recognition
Kasukurthi et al. American sign language alphabet recognition using deep learning
CN109583334A (en) A kind of action identification method and its system based on space time correlation neural network
CN116895098A (en) Video human body action recognition system and method based on deep learning and privacy protection
CN105608662A (en) FPGA-based dynamic target identification system and identification method thereof
Özbay et al. 3D Human Activity Classification with 3D Zernike Moment Based Convolutional, LSTM-Deep Neural Networks.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20181019

WD01 Invention patent application deemed withdrawn after publication