CN107194380A - The depth convolutional network and learning method of a kind of complex scene human face identification - Google Patents
The depth convolutional network and learning method of a kind of complex scene human face identification Download PDFInfo
- Publication number
- CN107194380A CN107194380A CN201710531386.2A CN201710531386A CN107194380A CN 107194380 A CN107194380 A CN 107194380A CN 201710531386 A CN201710531386 A CN 201710531386A CN 107194380 A CN107194380 A CN 107194380A
- Authority
- CN
- China
- Prior art keywords
- max
- convolutional network
- depth convolutional
- layer
- human face
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 13
- 238000005286 illumination Methods 0.000 claims abstract description 7
- 239000000203 mixture Substances 0.000 claims abstract description 5
- 238000005070 sampling Methods 0.000 claims description 12
- 230000006870 function Effects 0.000 claims description 6
- 230000004913 activation Effects 0.000 claims description 4
- 238000013434 data augmentation Methods 0.000 claims description 4
- 230000003416 augmentation Effects 0.000 claims description 3
- 238000001914 filtration Methods 0.000 claims description 3
- 230000000007 visual effect Effects 0.000 abstract description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000001815 facial effect Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000012806 monitoring device Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/10—Image enhancement or restoration using non-spatial domain filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/30—Noise filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/19—Recognition using electronic means
- G06V30/192—Recognition using electronic means using simultaneous comparisons or correlations of the image signals with a plurality of references
- G06V30/194—References adjustable by an adaptive method, e.g. learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20016—Hierarchical, coarse-to-fine, multiscale or multiresolution image processing; Pyramid transform
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Health & Medical Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Databases & Information Systems (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
Abstract
The present invention proposes the depth convolutional network and learning method of a kind of complex scene human face identification, and the network structure is optimized due to four full convolution modules and three full link block compositions using random tonsure descent algorithm to the parameter of depth convolutional network.The recognition of face depth convolutional network of design is expected to obtain in true application scenarios, for complex illumination, all has stable recognition performance under various visual angles, in high-performance GPU support, the demand of engineer applied is met under higher calculating degree.
Description
Technical field
The present invention relates to a kind of depth convolutional network of complex scene human face identification and learning method, belong to computer skill
Art field.
Background technology
The face recognition algorithms of early stage are mainly included to image preprocessing, propose such as LBP conventional faces feature and answer
The feature extracted is learnt and classified with vector machine.Wherein Image Pretreatment Algorithm is particularly critical, specifically includes pin
To the Preprocessing Algorithm under different illumination conditions, for the Denoising Algorithm of picture noise, and for different angle faces to picture
Alignment algorithm.After the optimization specifically designed for some face recognition database, such as classical LFW facial recognition datas
Storehouse, has been achieved for the result close with mankind's identification level.
And in recent years, with the quick increase of miscellaneous capture apparatus, the facial image exponentially increased is a large amount of
Appear in all kinds of shooting images and video, especially from the shooting of cell phone apparatus and monitoring device.The thing followed is then
It is to occur in that wilderness demand for face recognition application, including the authentication based on face, the figure based on recognition of face
As retrieval, the application such as the Given Face identification based on monitor video.Face recognition technology of today has been taken leave of in small-scale data
The period of algorithm optimization is carried out on storehouse, then entering needs accurately to know the facial image shot under a large amount of various environment
Other application period.However, rapidly increasing with the digital picture including face shot from various environment, and photographer
Under level and very different environment, recognition of face essence of the foregoing early stage face recognition technology in such true environment
Degree is rapid to be declined, far from the application demand for meeting reality.
The content of the invention
In order to solve the above technical problems, one aspect of the present invention is:A kind of complex scene human face identification
Depth convolutional network, it is characterised in that:Including multiple convolution modules and full link block, each convolution module or complete
Convolution kernel, activation primitive and pond function of the link block under different scale are constituted,.
Further, the convolution module has 4 altogether, and full link block has 3 altogether, wherein, preceding 3 convolution modules difference
By convolutional layer, relu is once constituted with maximum pond layer, and maximum pond layer is responsible for the operation of down-sampling, with multilayer pyramid down-sampling
Structure to image carry out depth multi-layer filtering operation.
Further, specific algorithm of the convolutional network from first layer to last layer and hidden layer input and output with
Equation below is realized:
1)Wherein X0∈R224×224×3,w1∈R3×3×32。
2)X2=max { 0, X1, wherein X1∈R224×224×32。
3)Wherein X2∈R224×224×32, w3∈R3×3×32×64。
4)X4=max { 0, X3, wherein X3∈R224×224×64。
5)Wherein X4∈R224×224×64, w5∈R3×3×64×128
6)X6=max { 0, X5, wherein X5∈R224×224×128
7)X7=maxpool (X6), wherein X6∈R224×224×128
8)Wherein X7∈R112×112×128, w8∈R3×3×128×128
9)X9=max { 0, X8, wherein X8∈R112×112×128
10)Wherein X9∈R112×112×128, w10∈R3×3×128×256
11)X11=max { 0, X10, wherein X10∈R112×112×256
12)Wherein X11∈R112×112×256, w12∈R3×3×256×256
13)X13=max { 0, X12, wherein X12∈R112×112×256
14)X14=maxpool (X13), wherein X13∈R112×112×256
15)Wherein X14∈R56×56×256, w15∈R3×3×256×256
16)X16=max { 0, X15, wherein X15∈R56×56×256
17)Wherein X16∈R56×56×256, w17∈R3×3×256×512
18)X18=max { 0, X17, wherein X17∈R56×56×512
19)Wherein X18∈R56×56×512, w19∈R3×3×512×512
20)X20=max { 0, X19, wherein X19∈R56×56×512
21)X21=maxpool (X20), wherein X20∈R56×56×512
22)Wherein X21∈R28×28×512, w22∈R3×3×512×512
23)X23=max { 0, X22, wherein X19∈R28×28×512
24)Wherein X23∈R28×28×512, w24∈R3×3×512×512
25)X25=max { 0, X24, wherein X24∈R28×28×512
26)Wherein X25∈R28×28×512, w26∈R3×3×512×512
27)X27=max { 0, X26, wherein X26∈R28×28×512
28)Wherein X27∈R28×28×512, w28∈R3×3×512×512
29)X29=max { 0, X28, wherein X26∈R14×14×512
30)X30=maxpool (X29), wherein X29∈R7×7×512
31)Wherein X30∈R7×7×512, w31∈R7×7×512×4096
32)X32=max { 0, X31, wherein X31∈R1×1×4096
33)Wherein X32∈R1×1×4096, w33∈R1×1×4096×4096
34)X34=max { 0, X33, wherein X33∈R1×1×4096
35)Wherein X34∈R1×1×4096, w35∈R1×4096×K,
Wherein, Xi represents i-th layer of output, wiI-th layer of filter coefficient is represented, K represents the class for the face that need to be recognized
Shuo not.
A kind of learning method of the depth convolutional network of complex scene human face identification is also provided, it is characterised in that:It is described
Learning method is achieved by the steps of:
Step 1:Stochastical sampling, the number generated using stochastical sampling are carried out to the face picture collected in real scene
According to initialization network data, manually by offset parameter zero setting;
Step 2:Data augmentation operation is carried out to existing face training image, the size of all training images is uniformly returned
One changes to 224*224, is inputting into before network, is subtracting average image, is faster being restrained with reaching.
Specifically, the augmentation operation includes following concrete mode:
(1) 70%-90% of the face picture recognized is cut out at random, then carries out placed in the middle and zoom to 224*224, can be with
It is effective to improve the discrimination for working as face to this;
(2) for the inclined situation of picture, original image manually rotates to 5 at random, 110,15 degree, then is trained;
(3) the random progress contrast stretching of original face training figure or contrast are compressed, it is complicated in illumination to improve
In the environment of recognition of face performance.
Further, its learning process is optimized using stochastic gradient descent algorithm to the parameter of depth convolutional network.
The situation of prior art is different from, the beneficial effects of the invention are as follows:
The recognition of face depth convolutional network that the present invention is designed is expected to obtain in true application scenarios, for complex illumination,
All there is stable recognition performance under various visual angles.In high-performance GPU support, engineer applied is met under higher calculating degree
Demand.It is embodied in:There is powerful model table specifically designed for the depth convolutional network of complex scene human face identification mission
Danone power, to the recognition of face object of complicated illumination condition and various visual angles.
Brief description of the drawings
Fig. 1 is the structure chart of the depth convolutional network of complex scene human face identification of the present invention.
Fig. 2 is the recognition of face depth convolutional network tactic pattern figure that the present invention is used under complex scene.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation is described, it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments.It is based on
Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under the premise of creative work is not made
Embodiment, belongs to the scope of protection of the invention.
A kind of depth convolutional network of complex scene human face identification, as shown in figure 1, connecting including multiple convolution modules and entirely
Connection module, convolution kernel, activation primitive and the pond function of each convolution module or full link block under different scale
Composition, wherein first three convolution module are respectively by convolutional layer, and relu layers and maximum pond layer (max pooling) composition are maximum
Pond layer is responsible for the operation of down-sampling, carries out the operation of depth multi-layer filtering to image with the structure of multilayer pyramid down-sampling.
In this application, in order to which aspect describes network structure, j-th volume in i-th of convolution module is represented with convi_j
Product kernel function, tanhi_j represents j-th of tanh activation primitive in i-th of convolution module, and pooli represents i-th of maximum pond
Function (max pooling function), and fci represent i-th of full-mesh layer, and in depth convolutional network, we use Xi generations
The output that i-th layer of table, wiThe data relationship between i-th layer of filter coefficient, its each layer is represented as shown in Fig. 2 by as follows
Formula, calculates the tensor number from concrete operations of the first layer of convolutional network to last layer and middle level hidden layer input and output
According to:
1)Wherein X0∈R224×224×3,w1∈R3×3×32。
2)X2=max { 0, X1, wherein X1∈R224×224×32。
3)Wherein X2∈R224×224×32, w3∈R3×3×32×64。
4)X4=max { 0, X3, wherein X3∈R224×224×64。
5)Wherein X4∈R224×224×64, w5∈R3×3×64×128
6)X6=max { 0, X5, wherein X5∈R224×224×128
7)X7=maxpool (X6), wherein X6∈R224×224×128
8)Wherein X7∈R112×112×128, w8∈R3×3×128×128
9)X9=max { 0, X8, wherein X8∈R112×112×128
10)Wherein X9∈R112×112×128, w10∈R3×3×128×256
11)X11=max { 0, X10, wherein X10∈R112×112×256
12)Wherein X11∈R112×112×256, w12∈R3×3×256×256
13)X13=max { 0, X12, wherein X12∈R112×112×256
14)X14=maxpool (X13), wherein X13∈R112×112×256
15)Wherein X14∈R56×56×256, w15∈R3×3×256×256
16)X16=max { 0, X15, wherein X15∈R56×56×256
17)Wherein X16∈R56×56×256, w17∈R3×3×256×512
18)X18=max { 0, X17, wherein X17∈R56×56×512
19)Wherein X18∈R56×56×512, w19∈R3×3×512×512
20)X20=max { 0, X19, wherein X19∈R56×56×512
21)X21=maxpool (X20), wherein X20∈R56×56×512
22)Wherein X21∈R28×28×512, w22∈R3×3×512×512
23)X23=max { 0, X22, wherein X19∈R28×28×512
24)Wherein X23∈R28×28×512, w24∈R3×3×512×512
25)X25=max { 0, X24, wherein X24∈R28×28×512
26)Wherein X25∈R28×28×512, w26∈R3×3×512×512
27)X27=max { 0, X26, wherein X26∈R28×28×512
28)Wherein X27∈R28×28×512, w28∈R3×3×512×512
29)X29=max { 0, X28, wherein X26∈R14×14×512
30)X30=maxpool (X29), wherein X29∈R7×7×512
31)Wherein X30∈R7×7×512, w31∈R7×7×512×4096
32)X32=max { 0, X31, wherein X31∈R1×1×4096
33)Wherein X32∈R1×1×4096, w33∈R1×1×4096×4096
34)X34=max { 0, X33, wherein X33∈R1×1×4096
35)Wherein X34∈R1×1×4096, w35∈R1×4096×K,
Wherein, K represents the classification number for the face that need to be recognized.
A kind of learning method of the depth convolutional network of complex scene human face identification is also provided, it is real as follows
It is existing, step 1:At the beginning of stochastical sampling, the data generated using stochastical sampling being carried out to the face picture collected in real scene
Beginningization network data, manually by offset parameter zero setting;
Step 2:Data augmentation operation is carried out to existing face training image, the size of all training images is uniformly returned
One changes to 224*224, is inputting into before network, is subtracting average image, is faster being restrained with reaching.
Wherein, augmentation operation mainly includes some specific contents:
(1) in true application scenarios, the situation that part is blocked often occurs in face picture.To efficiently identify this
Class face picture, the 70% of our random cropping original pictures, 80%, 90%, then carry out placed in the middle, 224*224 is zoomed to, the data
Augmentation algorithm can effectively improve the discrimination to blocking face.
(2) inclined situation often occurs in face picture, for such case, we by original training picture it is artificial with
Machine rotates 5 degree, 10 degree or 15 degree, then is trained.
(3) to improve the performance recognized in illumination complex environment human face, in view of different light conditions can cause not
Same contrast, it is proposed that original face training image is carried out into contrast stretching or contrast compression at random, then is carried out
Training.
Embodiments of the invention are the foregoing is only, are not intended to limit the scope of the invention, it is every to utilize this hair
Equivalent structure or equivalent flow conversion that bright specification and accompanying drawing content are made, or directly or indirectly it is used in other related skills
Art field, is included within the scope of the present invention.
Claims (6)
1. a kind of depth convolutional network of complex scene human face identification, it is characterised in that:Including the multiple convolution being sequentially connected
Module and full link block, convolution kernel under different scale of each convolution module and full link block, activation primitive and
Pond function composition.
2. a kind of depth convolutional network of complex scene human face identification according to claim 1, it is characterised in that:It is described
Convolution module has 4 altogether, and full link block has 3 altogether, wherein, preceding 3 convolution modules are respectively by convolutional layer, and relu is once and maximum
Pond layer composition, maximum pond layer is responsible for the operation of down-sampling, depth is carried out to image with the structure of multilayer pyramid down-sampling
The operation of multi-layer filtering.
3. a kind of depth convolutional network of complex scene human face identification according to claim 1, it is characterised in that:It is described
Specific algorithm and hidden layer input and output of the convolutional network from first layer to last layer are realized with equation below:
1)Wherein X0∈R224×224×3,w1∈R3×3×32。
2)X2=max { 0, X1, wherein X1∈R224×224×32。
3)Wherein X2∈R224×224×32, w3∈R3×3×32×64。
4)X4=max { 0, X3, wherein X3∈R224×224×64。
5)Wherein X4∈R224×224×64, w5∈R3×3×64×128
6)X6=max { 0, X5, wherein X5∈R224×224×128
7)X7=maxpool (X6), wherein X6∈R224×224×128
8)Wherein X7∈R112×112×128, w8∈R3×3×128×128
9)X9=max { 0, X8, wherein X8∈R112×112×128
10)Wherein X9∈R112×112×128, w10∈R3×3×128×256
11)X11=max { 0, X10, wherein X10∈R112×112×256
12)Wherein X11∈R112×112×256, w12∈R3×3×256×256
13)X13=max { 0, X12, wherein X12∈R112×112×256
14)X14=maxpool (X13), wherein X13∈R112×112×256
15)Wherein X14∈R56×56×256, w15∈R3×3×256×256
16)X16=max { 0, X15, wherein X15∈R56×56×256
17)Wherein X16∈R56×56×256, w17∈R3×3×256×512
18)X18=max { 0, X17, wherein X17∈R56×56×512
19)Wherein X18∈R56×56×512, w19∈R3×3×512×512
20)X20=max { 0, X19, wherein X19∈R56×56×512
21)X21=maxpool (X20), wherein X20∈R56×56×512
22)Wherein X21∈R28×28×512, w22∈R3×3×512×512
23)X23=max { 0, X22, wherein X19∈R28×28×512
24)Wherein X23∈R28×28×512, w24∈R3×3×512×512
25)X25=max { 0, X24, wherein X24∈R28×28×512
26)Wherein X25∈R28×28×512, w26∈R3×3×512×512
27)X27=max { 0, X26, wherein X26∈R28×28×512
28)Wherein X27∈R28×28×512, w28∈R3×3×512×512
29)X29=max { 0, X28, wherein X26∈R14×14×512
30)X30=maxpool (X29), wherein X29∈R7×7×512
31)Wherein X30∈R7×7×512, w31∈R7×7×512×4096
32)X32=max { 0, X31, wherein X31∈R1×1×4096
33)Wherein X32∈R1×1×4096, w33∈R1×1×4096×4096
34)X34=max { 0, X33, wherein X33∈R1×1×4096
35)Wherein X34∈R1×1×4096, w35∈R1×4096×K,
Wherein, Xi represents i-th layer of output, wiI-th layer of filter coefficient is represented, K represents the classification number for the face that need to be recognized.
4. a kind of learning method of the depth convolutional network of complex scene human face identification, it is characterised in that:The learning method
It is achieved by the steps of:
Step 1:At the beginning of stochastical sampling, the data generated using stochastical sampling being carried out to the face picture collected in real scene
Beginningization network data, manually by offset parameter zero setting;
Step 2:Data augmentation operation is carried out to existing face training image, the size of all training images is unified into normalizing
Change, inputting into before network, subtracting average image, faster restrained with reaching.
5. a kind of learning method of the depth convolutional network of complex scene human face identification according to claim 4, it is special
Levy and be:The augmentation operation includes following concrete mode:
(1) 70%-90% of the face picture recognized is cut out at random, then carries out placed in the middle and zoom to 224*224, can be effective
Raising to this work as face discrimination;
(2) for the inclined situation of picture, original image manually rotates to 5 at random, 110,15 degree, then is trained;
(3) the random progress contrast stretching of original face training figure or contrast are compressed, to improve in the complicated ring of illumination
The performance of border human face identification.
6. a kind of learning method of the depth convolutional network of complex scene human face identification according to claim 4, it is special
Levy and be:Its learning process is optimized using stochastic gradient descent algorithm to the parameter of depth convolutional network.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710531386.2A CN107194380A (en) | 2017-07-03 | 2017-07-03 | The depth convolutional network and learning method of a kind of complex scene human face identification |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710531386.2A CN107194380A (en) | 2017-07-03 | 2017-07-03 | The depth convolutional network and learning method of a kind of complex scene human face identification |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107194380A true CN107194380A (en) | 2017-09-22 |
Family
ID=59880314
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710531386.2A Pending CN107194380A (en) | 2017-07-03 | 2017-07-03 | The depth convolutional network and learning method of a kind of complex scene human face identification |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107194380A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108334847A (en) * | 2018-02-06 | 2018-07-27 | 哈尔滨工业大学 | A kind of face identification method based on deep learning under real scene |
CN108399631A (en) * | 2018-03-01 | 2018-08-14 | 北京中测智绘科技有限公司 | A kind of inclination image of scale invariability regards dense Stereo Matching method more |
CN108596120A (en) * | 2018-04-28 | 2018-09-28 | 北京京东尚科信息技术有限公司 | A kind of object detection method and device based on deep learning |
CN108596256A (en) * | 2018-04-26 | 2018-09-28 | 北京航空航天大学青岛研究院 | One kind being based on RGB-D object identification grader building methods |
WO2019223254A1 (en) * | 2018-05-21 | 2019-11-28 | 北京亮亮视野科技有限公司 | Construction method for multi-scale lightweight face detection model and face detection method based on model |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101075291A (en) * | 2006-05-18 | 2007-11-21 | 中国科学院自动化研究所 | Efficient promoting exercising method for discriminating human face |
CN103268653A (en) * | 2013-05-30 | 2013-08-28 | 苏州福丰科技有限公司 | Face identification method for access control system |
CN105701507A (en) * | 2016-01-13 | 2016-06-22 | 吉林大学 | Image classification method based on dynamic random pooling convolution neural network |
-
2017
- 2017-07-03 CN CN201710531386.2A patent/CN107194380A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101075291A (en) * | 2006-05-18 | 2007-11-21 | 中国科学院自动化研究所 | Efficient promoting exercising method for discriminating human face |
CN103268653A (en) * | 2013-05-30 | 2013-08-28 | 苏州福丰科技有限公司 | Face identification method for access control system |
CN105701507A (en) * | 2016-01-13 | 2016-06-22 | 吉林大学 | Image classification method based on dynamic random pooling convolution neural network |
Non-Patent Citations (6)
Title |
---|
万士宁: "基于卷积神经网络的人脸识别研究与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
叶浪: "基于卷积神经网络的人脸识别研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
尹晓燕: "基于深度学习的人脸识别研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
蒋树强等: "面向智能交互的图像识别技术综述与展望", 《计算机研究与发展》 * |
颜志国等: "《多摄像机协同关注目标检测跟踪技术》", 30 June 2017 * |
黄攀: "基于深度学习的自然场景文字识别", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108334847A (en) * | 2018-02-06 | 2018-07-27 | 哈尔滨工业大学 | A kind of face identification method based on deep learning under real scene |
CN108334847B (en) * | 2018-02-06 | 2019-10-22 | 哈尔滨工业大学 | A kind of face identification method based on deep learning under real scene |
CN108399631A (en) * | 2018-03-01 | 2018-08-14 | 北京中测智绘科技有限公司 | A kind of inclination image of scale invariability regards dense Stereo Matching method more |
CN108399631B (en) * | 2018-03-01 | 2022-02-11 | 北京中测智绘科技有限公司 | Scale invariance oblique image multi-view dense matching method |
CN108596256A (en) * | 2018-04-26 | 2018-09-28 | 北京航空航天大学青岛研究院 | One kind being based on RGB-D object identification grader building methods |
CN108596256B (en) * | 2018-04-26 | 2022-04-01 | 北京航空航天大学青岛研究院 | Object recognition classifier construction method based on RGB-D |
CN108596120A (en) * | 2018-04-28 | 2018-09-28 | 北京京东尚科信息技术有限公司 | A kind of object detection method and device based on deep learning |
CN108596120B (en) * | 2018-04-28 | 2021-01-26 | 北京京东尚科信息技术有限公司 | Target detection method and device based on deep learning |
WO2019223254A1 (en) * | 2018-05-21 | 2019-11-28 | 北京亮亮视野科技有限公司 | Construction method for multi-scale lightweight face detection model and face detection method based on model |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112766160B (en) | Face replacement method based on multi-stage attribute encoder and attention mechanism | |
CN107194380A (en) | The depth convolutional network and learning method of a kind of complex scene human face identification | |
CN109785258B (en) | Face image restoration method based on multi-discriminator generated countermeasure network | |
CN108647639B (en) | Real-time human body skeleton joint point detection method | |
CN111127308B (en) | Mirror image feature rearrangement restoration method for single sample face recognition under partial shielding | |
WO2021164234A1 (en) | Image processing method and image processing device | |
CN108717524B (en) | Gesture recognition system based on double-camera mobile phone and artificial intelligence system | |
CN107123091B (en) | Near-infrared face image super-resolution reconstruction method based on deep learning | |
CN110532897A (en) | The method and apparatus of components image recognition | |
CN106570474A (en) | Micro expression recognition method based on 3D convolution neural network | |
CN109886153B (en) | Real-time face detection method based on deep convolutional neural network | |
WO2019019086A1 (en) | Image contrast enhancement method and device, and storage medium | |
CN107993238A (en) | A kind of head-and-shoulder area image partition method and device based on attention model | |
CN108573243A (en) | A kind of comparison method of the low quality face based on depth convolutional neural networks | |
CN113627504B (en) | Multi-mode multi-scale feature fusion target detection method based on generation of countermeasure network | |
CN109255758A (en) | Image enchancing method based on full 1*1 convolutional neural networks | |
CN110097106A (en) | The low-light-level imaging algorithm and device of U-net network based on deep learning | |
CN108734677B (en) | Blind deblurring method and system based on deep learning | |
CN109753864A (en) | A kind of face identification method based on caffe deep learning frame | |
CN110222718A (en) | The method and device of image procossing | |
CN112861659A (en) | Image model training method and device, electronic equipment and storage medium | |
CN111341462A (en) | Mobile phone terminal diabetic retinopathy screening APP based on deep learning | |
CN110503608B (en) | Image denoising method based on multi-view convolutional neural network | |
Peng et al. | Litchi detection in the field using an improved YOLOv3 model | |
CN111414988A (en) | Remote sensing image super-resolution method based on multi-scale feature self-adaptive fusion network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170922 |
|
RJ01 | Rejection of invention patent application after publication |