CN111819568A - 人脸旋转图像的生成方法及装置 - Google Patents

人脸旋转图像的生成方法及装置 Download PDF

Info

Publication number
CN111819568A
CN111819568A CN201880090767.4A CN201880090767A CN111819568A CN 111819568 A CN111819568 A CN 111819568A CN 201880090767 A CN201880090767 A CN 201880090767A CN 111819568 A CN111819568 A CN 111819568A
Authority
CN
China
Prior art keywords
image
face
network
loss
rotation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201880090767.4A
Other languages
English (en)
Inventor
饶强
遇冰
冯柏岚
胡一博
吴翔
赫然
孙哲南
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Institute of Automation of Chinese Academy of Science
Original Assignee
Huawei Technologies Co Ltd
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd, Institute of Automation of Chinese Academy of Science filed Critical Huawei Technologies Co Ltd
Publication of CN111819568A publication Critical patent/CN111819568A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/18Image warping, e.g. rearranging pixels individually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/70Denoising; Smoothing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/002Image coding using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • G06V40/165Detection; Localisation; Normalisation using facial parts and geometric relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • G06V40/171Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person
    • G06T2207/30201Face

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Artificial Intelligence (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Geometry (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

本申请提供一种人脸旋转图像的生成方法及装置,涉及人工智能领域,具体涉及计算机视觉领域。本方法包括:根据获取的人脸图像中的两个或两个以上关键点对所述人脸图像进行姿态编码以获得姿态编码图;从训练数据集中获取多张包含人脸的训练图片,且所述多张训练图片中包含的人脸呈现的旋转角度均为同一角度;采用前述类似方式根据目标人脸图像中的两个或两个以上关键点对所述目标人脸图像进行姿态编码以获得姿态编码图;其中,所述目标人脸图像是根据所述多张训练图片得到的;根据所述人脸图像和前述两种姿态编码图生成待输入信号;将所述待输入信号输入人脸旋转图像生成模型得到人脸旋转图像。通过本方法,可以提高姿态编码的连续性和准确性,从而提高人脸旋转图像的生成效率。

Description

PCT国内申请,说明书已公开。

Claims (36)

  1. PCT国内申请,权利要求书已公开。
CN201880090767.4A 2018-06-01 2018-06-01 人脸旋转图像的生成方法及装置 Pending CN111819568A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/089611 WO2019227479A1 (zh) 2018-06-01 2018-06-01 人脸旋转图像的生成方法及装置

Publications (1)

Publication Number Publication Date
CN111819568A true CN111819568A (zh) 2020-10-23

Family

ID=68697775

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880090767.4A Pending CN111819568A (zh) 2018-06-01 2018-06-01 人脸旋转图像的生成方法及装置

Country Status (3)

Country Link
US (1) US11232286B2 (zh)
CN (1) CN111819568A (zh)
WO (1) WO2019227479A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112837211A (zh) * 2021-01-28 2021-05-25 北京奇艺世纪科技有限公司 一种图片处理方法、装置、电子设备及可读存储介质

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020044556A1 (en) * 2018-08-31 2020-03-05 Nec Corporation Information processing apparatus, method, and program
CN109800294B (zh) * 2019-01-08 2020-10-13 中国科学院自动化研究所 基于物理环境博弈的自主进化智能对话方法、系统、装置
US11188740B2 (en) * 2019-12-18 2021-11-30 Qualcomm Incorporated Two-pass omni-directional object detection
CN111583099A (zh) * 2020-04-14 2020-08-25 上海联影智能医疗科技有限公司 图像摆正方法、计算机设备和存储介质
CN111847147B (zh) * 2020-06-18 2023-04-18 闽江学院 一种无接触眼动式电梯楼层输入方法及装置
CN112070888B (zh) * 2020-09-08 2024-04-05 抖音视界有限公司 图像生成方法、装置、设备和计算机可读介质
CN112418344B (zh) * 2020-12-07 2023-11-21 汇纳科技股份有限公司 一种训练方法、目标检测方法、介质及电子设备
CN112800898A (zh) * 2021-01-18 2021-05-14 深圳市网联安瑞网络科技有限公司 行人重识别数据集增强方法、系统、终端、摄像头及介质
CN112669240B (zh) * 2021-01-22 2024-05-10 深圳市格灵人工智能与机器人研究院有限公司 高清图像修复方法、装置、电子设备和存储介质
TWI768913B (zh) * 2021-05-20 2022-06-21 國立中正大學 眼睛中心定位方法及其定位系統
CN113222144B (zh) * 2021-05-31 2022-12-27 北京有竹居网络技术有限公司 图像修复模型的训练方法及图像修复方法、装置及设备
CN113326934B (zh) * 2021-05-31 2024-03-29 上海哔哩哔哩科技有限公司 神经网络的训练方法、生成图像及视频的方法和装置
US11900534B2 (en) * 2021-07-30 2024-02-13 The Boeing Company Systems and methods for synthetic image generation
CN116310659B (zh) * 2023-05-17 2023-08-08 中数元宇数字科技(上海)有限公司 训练数据集的生成方法及设备

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015090126A1 (zh) * 2013-12-16 2015-06-25 北京天诚盛业科技有限公司 人脸特征的提取、认证方法及装置
CN106251294A (zh) * 2016-08-11 2016-12-21 西安理工大学 一种单幅正视人脸图像的虚拟多姿态生成方法
CN107122705A (zh) * 2017-03-17 2017-09-01 中国科学院自动化研究所 基于三维人脸模型的人脸关键点检测方法
CN107292813A (zh) * 2017-05-17 2017-10-24 浙江大学 一种基于生成对抗网络的多姿态人脸生成方法
CN107437077A (zh) * 2017-08-04 2017-12-05 深圳市唯特视科技有限公司 一种基于生成对抗网络的旋转面部表示学习的方法

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8090160B2 (en) * 2007-10-12 2012-01-03 The University Of Houston System Automated method for human face modeling and relighting with application to face recognition
CN103065360B (zh) * 2013-01-16 2016-08-24 中国科学院重庆绿色智能技术研究院 一种发型效果图的生成方法及系统
CN105740758A (zh) 2015-12-31 2016-07-06 上海极链网络科技有限公司 基于深度学习的互联网视频人脸识别方法
CN107871107A (zh) * 2016-09-26 2018-04-03 北京眼神科技有限公司 人脸认证方法和装置
US10474880B2 (en) * 2017-03-15 2019-11-12 Nec Corporation Face recognition using larger pose face frontalization
US10878612B2 (en) * 2017-04-04 2020-12-29 Intel Corporation Facial image replacement using 3-dimensional modelling techniques
CN107506717B (zh) * 2017-08-17 2020-11-27 南京东方网信网络科技有限公司 无约束场景中基于深度变换学习的人脸识别方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015090126A1 (zh) * 2013-12-16 2015-06-25 北京天诚盛业科技有限公司 人脸特征的提取、认证方法及装置
CN106251294A (zh) * 2016-08-11 2016-12-21 西安理工大学 一种单幅正视人脸图像的虚拟多姿态生成方法
CN107122705A (zh) * 2017-03-17 2017-09-01 中国科学院自动化研究所 基于三维人脸模型的人脸关键点检测方法
CN107292813A (zh) * 2017-05-17 2017-10-24 浙江大学 一种基于生成对抗网络的多姿态人脸生成方法
CN107437077A (zh) * 2017-08-04 2017-12-05 深圳市唯特视科技有限公司 一种基于生成对抗网络的旋转面部表示学习的方法

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
AI 科技评论: "如何旋转图像中的人脸? | CVPR 2018", 《HTTPS://ZHUANLAN.ZHIHU.COM/P/37305160》, pages 1 - 9 *
LUAN TRAN等: "Disentangled Representation Learning GAN for Pose-Invariant Face Recognition", 《2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》, pages 1415 - 1424 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112837211A (zh) * 2021-01-28 2021-05-25 北京奇艺世纪科技有限公司 一种图片处理方法、装置、电子设备及可读存储介质
CN112837211B (zh) * 2021-01-28 2023-07-18 北京奇艺世纪科技有限公司 一种图片处理方法、装置、电子设备及可读存储介质

Also Published As

Publication number Publication date
US20210012093A1 (en) 2021-01-14
WO2019227479A1 (zh) 2019-12-05
US11232286B2 (en) 2022-01-25

Similar Documents

Publication Publication Date Title
CN111819568A (zh) 人脸旋转图像的生成方法及装置
CN110532871B (zh) 图像处理的方法和装置
CN112446270B (zh) 行人再识别网络的训练方法、行人再识别方法和装置
CN108921893B (zh) 一种基于在线深度学习slam的图像云计算方法及系统
CN111274916B (zh) 人脸识别方法和人脸识别装置
CN112236779A (zh) 基于卷积神经网络的图像处理方法和图像处理装置
CN110222717B (zh) 图像处理方法和装置
CN109993707B (zh) 图像去噪方法和装置
CN112446476A (zh) 神经网络模型压缩的方法、装置、存储介质和芯片
CN111783748B (zh) 人脸识别方法、装置、电子设备及存储介质
CN112639828A (zh) 数据处理的方法、训练神经网络模型的方法及设备
CN111914997B (zh) 训练神经网络的方法、图像处理方法及装置
CN111832592B (zh) Rgbd显著性检测方法以及相关装置
WO2021218238A1 (zh) 图像处理方法和图像处理装置
CN110222718A (zh) 图像处理的方法及装置
CN113191489B (zh) 二值神经网络模型的训练方法、图像处理方法和装置
WO2022165722A1 (zh) 单目深度估计方法、装置及设备
CN113807183A (zh) 模型训练方法及相关设备
WO2022052782A1 (zh) 图像的处理方法及相关设备
CN111797881A (zh) 图像分类方法及装置
US11138812B1 (en) Image processing for updating a model of an environment
CN113536970A (zh) 一种视频分类模型的训练方法及相关装置
CN110705564B (zh) 图像识别的方法和装置
CN112329662B (zh) 基于无监督学习的多视角显著性估计方法
WO2021057091A1 (zh) 视点图像处理方法及相关设备

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination