WO2021258588A1 - 一种人脸图像识别方法、装置、设备及存储介质 - Google Patents
一种人脸图像识别方法、装置、设备及存储介质 Download PDFInfo
- Publication number
- WO2021258588A1 WO2021258588A1 PCT/CN2020/123588 CN2020123588W WO2021258588A1 WO 2021258588 A1 WO2021258588 A1 WO 2021258588A1 CN 2020123588 W CN2020123588 W CN 2020123588W WO 2021258588 A1 WO2021258588 A1 WO 2021258588A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- face
- network
- image
- occlusion image
- face occlusion
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 50
- 230000009466 transformation Effects 0.000 claims abstract description 132
- 230000006870 function Effects 0.000 claims description 54
- 238000012549 training Methods 0.000 claims description 35
- 238000006243 chemical reaction Methods 0.000 claims description 23
- 230000015654 memory Effects 0.000 claims description 21
- 238000010276 construction Methods 0.000 claims description 4
- 238000013473 artificial intelligence Methods 0.000 abstract description 2
- 238000013135 deep learning Methods 0.000 abstract description 2
- 239000000284 extract Substances 0.000 description 11
- 238000005516 engineering process Methods 0.000 description 9
- 238000010586 diagram Methods 0.000 description 8
- 238000012545 processing Methods 0.000 description 8
- 238000000605 extraction Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 230000004913 activation Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 230000001815 facial effect Effects 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 238000013519 translation Methods 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 210000004709 eyebrow Anatomy 0.000 description 1
- 210000003128 head Anatomy 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
- G06V40/166—Detection; Localisation; Normalisation using acquisition arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
- G06V40/171—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/7715—Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Multimedia (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Image Analysis (AREA)
- Collating Specific Patterns (AREA)
Abstract
Description
Claims (14)
- 一种人脸图像识别方法,包括:基于预先获取的人脸遮挡图像的空间网络特征,对所述人脸遮挡图像进行空间变换,得到正面人脸遮挡图像;将所述正面人脸遮挡图像输入到人脸识别网络,得到身份识别结果。
- 根据权利要求1所述的方法,其中,基于预先获取的人脸遮挡图像的空间网络特征,对所述人脸遮挡图像进行空间变换,得到正面人脸遮挡图像,包括:将预先获取的人脸遮挡图像输入空间变换网络中的卷积网络,得到所述人脸遮挡图像的特征图像;将所述特征图像输入所述空间变换网络中的定位网络,得到所述人脸遮挡图像的空间网络特征;将所述空间网络特征和所述特征图像输入所述空间变换网络中的变换网络,得到所述特征图像的像素点转换数据;将所述像素点转换数据和所述特征图像输入到所述空间变换网络中的插值网络,得到正面人脸遮挡图像。
- 根据权利要求1所述的方法,其中,在基于预先获取的人脸遮挡图像的空间网络特征,对所述人脸遮挡图像进行空间变换之前,还包括:基于预先获取的人脸遮挡图像的人脸关键点特征,对所述人脸遮挡图像进行对齐。
- 根据权利要求1所述的方法,其中,所述正面人脸遮挡图像基于空间变换网络得到,所述方法还包括:在模型训练阶段,对所述空间变换网络和所述人脸识别网络进行联合训练。
- 根据权利要求4所述的方法,其中,对所述空间变换网络和所述人脸识别网络进行联合训练,包括:将样本人脸遮挡图像输入到所述空间变换网络,得到样本正面人脸遮挡图像;将所述样本正面人脸遮挡图像输入到所述人脸识别网络,得到 样本身份识别结果;根据所述样本正面人脸遮挡图像、所述样本身份识别结果、所述样本人脸遮挡图像中标注的人脸关键点和真实身份,构建联合损失函数;基于所述联合损失函数,对所述空间变换网络和所述人脸识别网络进行监督训练。
- 根据权利要求5所述的方法,其中,根据所述样本正面人脸遮挡图像、所述样本身份识别结果、所述样本人脸遮挡图像中标注的人脸关键点和真实身份,构建联合损失函数,包括:根据所述样本人脸遮挡图像中标注的人脸关键点和所述样本正面人脸遮挡图像,确定空间变换损失函数;根据所述样本人脸遮挡图像中标注的真实身份和所述样本身份识别结果,确定识别损失函数;根据所述空间变换损失函数和所述识别损失函数,构建联合损失函数。
- 一种人脸图像识别装置,包括:空间变换模块,设置为基于预先获取的人脸遮挡图像的空间网络特征,对所述人脸遮挡图像进行空间变换,得到正面人脸遮挡图像;身份识别模块,设置为将所述正面人脸遮挡图像输入到人脸识别网络,得到身份识别结果。
- 根据权利要求7所述的装置,其中,所述空间变换模块包括:特征图像确定单元,设置为将预先获取的人脸遮挡图像输入空间变换网络中的卷积网络,得到所述人脸遮挡图像的特征图像;网络特征确定单元,设置为将所述特征图像输入所述空间变换网络中的定位网络,得到所述人脸遮挡图像的空间网络特征;数据转换单元,设置为将所述空间网络特征和所述特征图像输入所述空间变换网络中的变换网络,得到所述特征图像的像素点转换数据;数据差值单元,设置为将所述像素点转换数据和所述特征图像 输入到所述空间变换网络中的插值网络,得到正面人脸遮挡图像。
- 根据权利要求7所述的装置,其中,还包括:图像对齐模块,设置为基于预先获取的人脸遮挡图像的人脸关键点特征,对所述人脸遮挡图像进行对齐。
- 根据权利要求7所述的装置,其中,所述正面人脸遮挡图像基于空间变换网络得到,所述装置还包括:模型训练模块,设置为在模型训练阶段,对所述空间变换网络和所述人脸识别网络进行联合训练。
- 根据权利要求10所述的装置,其中,所述模型训练模块包括:第一数据输入模块,设置为将样本人脸遮挡图像输入到所述空间变换网络,得到样本正面人脸遮挡图像;第二数据输入模块,设置为将所述样本正面人脸遮挡图像输入到所述人脸识别网络,得到样本身份识别结果;损失函数构建单元,设置为根据所述样本正面人脸遮挡图像、所述样本身份识别结果、所述样本人脸遮挡图像中标注的人脸关键点和真实身份,构建联合损失函数;监督训练单元,设置为基于所述联合损失函数,对所述空间变换网络和所述人脸识别网络进行监督训练。
- 根据权利要求11所述的装置,其中,所述损失函数构建单元是设置为:根据所述样本人脸遮挡图像中标注的人脸关键点和所述样本正面人脸遮挡图像,确定空间变换损失函数;根据所述样本人脸遮挡图像中标注的真实身份和所述样本身份识别结果,确定识别损失函数;根据所述空间变换损失函数和所述识别损失函数,构建联合损失函数。
- 一种电子设备,包括:至少一个处理器;以及与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行权利要求1-6中任一项所述的人脸图像识别方法。
- 一种存储有计算机指令的非瞬时计算机可读存储介质,所述计算机指令用于使所述计算机执行权利要求1-6中任一项所述的人脸图像识别方法。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020227036111A KR20220154227A (ko) | 2020-06-24 | 2020-10-26 | 얼굴 이미지 식별 방법, 장치, 설비 및 저장매체 |
JP2022577076A JP2023529225A (ja) | 2020-06-24 | 2020-10-26 | 顔画像認識方法、装置、機器および記憶媒体 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010592663.2 | 2020-06-24 | ||
CN202010592663.2A CN111783605A (zh) | 2020-06-24 | 2020-06-24 | 一种人脸图像识别方法、装置、设备及存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021258588A1 true WO2021258588A1 (zh) | 2021-12-30 |
Family
ID=72759827
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/123588 WO2021258588A1 (zh) | 2020-06-24 | 2020-10-26 | 一种人脸图像识别方法、装置、设备及存储介质 |
Country Status (4)
Country | Link |
---|---|
JP (1) | JP2023529225A (zh) |
KR (1) | KR20220154227A (zh) |
CN (1) | CN111783605A (zh) |
WO (1) | WO2021258588A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114549369A (zh) * | 2022-04-24 | 2022-05-27 | 腾讯科技(深圳)有限公司 | 数据修复方法、装置、计算机及可读存储介质 |
CN116453201A (zh) * | 2023-06-19 | 2023-07-18 | 南昌大学 | 基于相邻边缘损失的人脸识别方法及系统 |
WO2023158408A1 (en) * | 2022-02-16 | 2023-08-24 | Bahcesehir Universitesi | Face recognition method |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111783605A (zh) * | 2020-06-24 | 2020-10-16 | 北京百度网讯科技有限公司 | 一种人脸图像识别方法、装置、设备及存储介质 |
CN112364827B (zh) * | 2020-11-30 | 2023-11-10 | 腾讯科技(深圳)有限公司 | 人脸识别方法、装置、计算机设备和存储介质 |
CN112507833A (zh) * | 2020-11-30 | 2021-03-16 | 北京百度网讯科技有限公司 | 人脸识别及模型训练的方法、装置、设备和存储介质 |
CN112418190B (zh) * | 2021-01-21 | 2021-04-02 | 成都点泽智能科技有限公司 | 移动端医学防护遮蔽人脸识别方法、装置、系统及服务器 |
CN113963426B (zh) * | 2021-12-22 | 2022-08-26 | 合肥的卢深视科技有限公司 | 模型训练、戴口罩人脸识别方法、电子设备及存储介质 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010157073A (ja) * | 2008-12-26 | 2010-07-15 | Fujitsu Ltd | 顔認識装置、顔認識方法及び顔認識プログラム |
CN104992148A (zh) * | 2015-06-18 | 2015-10-21 | 江南大学 | 基于随机森林的atm终端部分遮挡人脸关键点检测方法 |
CN109886121A (zh) * | 2019-01-23 | 2019-06-14 | 浙江大学 | 一种遮挡鲁棒的人脸关键点定位方法 |
CN109948573A (zh) * | 2019-03-27 | 2019-06-28 | 厦门大学 | 一种基于级联深度卷积神经网络的噪声鲁棒人脸识别方法 |
CN109960975A (zh) * | 2017-12-23 | 2019-07-02 | 四川大学 | 一种基于人眼的人脸生成及其人脸识别方法 |
CN110232369A (zh) * | 2019-06-20 | 2019-09-13 | 深圳和而泰家居在线网络科技有限公司 | 一种人脸识别方法和电子设备 |
CN111783605A (zh) * | 2020-06-24 | 2020-10-16 | 北京百度网讯科技有限公司 | 一种人脸图像识别方法、装置、设备及存储介质 |
-
2020
- 2020-06-24 CN CN202010592663.2A patent/CN111783605A/zh active Pending
- 2020-10-26 WO PCT/CN2020/123588 patent/WO2021258588A1/zh active Application Filing
- 2020-10-26 KR KR1020227036111A patent/KR20220154227A/ko not_active Application Discontinuation
- 2020-10-26 JP JP2022577076A patent/JP2023529225A/ja active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010157073A (ja) * | 2008-12-26 | 2010-07-15 | Fujitsu Ltd | 顔認識装置、顔認識方法及び顔認識プログラム |
CN104992148A (zh) * | 2015-06-18 | 2015-10-21 | 江南大学 | 基于随机森林的atm终端部分遮挡人脸关键点检测方法 |
CN109960975A (zh) * | 2017-12-23 | 2019-07-02 | 四川大学 | 一种基于人眼的人脸生成及其人脸识别方法 |
CN109886121A (zh) * | 2019-01-23 | 2019-06-14 | 浙江大学 | 一种遮挡鲁棒的人脸关键点定位方法 |
CN109948573A (zh) * | 2019-03-27 | 2019-06-28 | 厦门大学 | 一种基于级联深度卷积神经网络的噪声鲁棒人脸识别方法 |
CN110232369A (zh) * | 2019-06-20 | 2019-09-13 | 深圳和而泰家居在线网络科技有限公司 | 一种人脸识别方法和电子设备 |
CN111783605A (zh) * | 2020-06-24 | 2020-10-16 | 北京百度网讯科技有限公司 | 一种人脸图像识别方法、装置、设备及存储介质 |
Non-Patent Citations (1)
Title |
---|
YAN, XIANG: "Face Alignment Based on Deep Learning", DOCTORAL DISSERTATION , 31 July 2019 (2019-07-31), CN, XP009533230 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023158408A1 (en) * | 2022-02-16 | 2023-08-24 | Bahcesehir Universitesi | Face recognition method |
CN114549369A (zh) * | 2022-04-24 | 2022-05-27 | 腾讯科技(深圳)有限公司 | 数据修复方法、装置、计算机及可读存储介质 |
CN116453201A (zh) * | 2023-06-19 | 2023-07-18 | 南昌大学 | 基于相邻边缘损失的人脸识别方法及系统 |
CN116453201B (zh) * | 2023-06-19 | 2023-09-01 | 南昌大学 | 基于相邻边缘损失的人脸识别方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN111783605A (zh) | 2020-10-16 |
JP2023529225A (ja) | 2023-07-07 |
KR20220154227A (ko) | 2022-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021258588A1 (zh) | 一种人脸图像识别方法、装置、设备及存储介质 | |
US11854118B2 (en) | Method for training generative network, method for generating near-infrared image and device | |
KR102597377B1 (ko) | 이미지 인식방법, 장치, 기기, 컴퓨터 저장매체 및 컴퓨터 프로그램 | |
US11430265B2 (en) | Video-based human behavior recognition method, apparatus, device and storage medium | |
US11887388B2 (en) | Object pose obtaining method, and electronic device | |
US11403799B2 (en) | Method and apparatus for recognizing face-swap, device and computer readable storage medium | |
WO2021175180A1 (zh) | 视线确定方法、装置、电子设备和计算机可读存储介质 | |
US11568590B2 (en) | Cartoonlization processing method for image, electronic device, and storage medium | |
US20220189189A1 (en) | Method of training cycle generative networks model, and method of building character library | |
KR102551835B1 (ko) | 능동적 인터랙션 방법, 장치, 전자 기기 및 판독 가능 기록 매체 | |
US20220215507A1 (en) | Image stitching | |
WO2022237481A1 (zh) | 举手识别方法、装置、电子设备及存储介质 | |
WO2022247343A1 (zh) | 识别模型训练方法、识别方法、装置、设备及存储介质 | |
US20210201441A1 (en) | Method, apparatus, device and storage medium for transforming hairstyle | |
WO2022227765A1 (zh) | 生成图像修复模型的方法、设备、介质及程序产品 | |
EP4080470A2 (en) | Method and apparatus for detecting living face | |
JP7242812B2 (ja) | 画像認識方法、装置及び電子機器 | |
JP2021136028A (ja) | エッジベースの拡張現実3次元追跡登録方法、装置及び電子機器 | |
KR20210095817A (ko) | 얼굴 합성 이미지의 검출방법, 검출장치, 전자기기 및 저장매체 | |
CN114067394A (zh) | 人脸活体检测方法、装置、电子设备及存储介质 | |
WO2023029702A1 (zh) | 用于验证图像的方法和装置 | |
KR20220104110A (ko) | 안면 생체 검출 방법, 장치, 전자 기기 및 저장 매체 | |
KR20220152378A (ko) | 음성 엔드포인트 검출 방법, 장치, 전자 기기 및 기록 매체 | |
CN116704620A (zh) | 一种活体检测方法、装置、电子设备和存储介质 | |
CN117746502A (zh) | 图像标注方法、动作识别方法、装置和电子设备 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20942085 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 20227036111 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2022577076 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 19.05.2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20942085 Country of ref document: EP Kind code of ref document: A1 |