WO2021204037A1 - Detection method and apparatus for facial key point, and storage medium and electronic device - Google Patents

Detection method and apparatus for facial key point, and storage medium and electronic device Download PDF

Info

Publication number
WO2021204037A1
WO2021204037A1 PCT/CN2021/084220 CN2021084220W WO2021204037A1 WO 2021204037 A1 WO2021204037 A1 WO 2021204037A1 CN 2021084220 W CN2021084220 W CN 2021084220W WO 2021204037 A1 WO2021204037 A1 WO 2021204037A1
Authority
WO
WIPO (PCT)
Prior art keywords
key point
occlusion
image
point
confidence
Prior art date
Application number
PCT/CN2021/084220
Other languages
French (fr)
Chinese (zh)
Inventor
蔡中印
赵晓辉
陈斌
宋晨
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021204037A1 publication Critical patent/WO2021204037A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation

Definitions

  • the acquisition module is used to acquire the to-be-labeled image containing the human face
  • a determining module configured to determine the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence
  • step S120 the image to be annotated is input to the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, the predicted key point coordinates, and the to-be-annotated image The occlusion confidence of each point in the image.
  • the first key point coordinates and the second key point coordinates are respectively selected from the predicted key point coordinates and the heat map as the target key point coordinates. Therefore, the coordinates with higher reliability can be selected from the predicted key point coordinates and the heat map respectively as the target key point coordinates, thereby ensuring the accuracy of the target key point coordinates.
  • step S320 the occlusion sample images in the occlusion training sample set are input into the key point annotation model, so that the key point annotation model outputs the occlusion confidence of each point in the occlusion sample image.
  • step S610 the training sample set is input to the key point labeling models to be trained with different learning rates, so that each key point labeling model outputs training data respectively, and the training data includes the heat map corresponding to each sample image , Predict the coordinates of key points and the occlusion confidence of each point in the sample image.
  • a determining unit configured to compare the occlusion confidence of each point in the to-be-labeled image with a preset occlusion threshold, and determine the point to be processed whose occlusion confidence is less than the occlusion threshold;
  • the electronic device 500 according to this embodiment of the present application will be described below with reference to FIG. 8.
  • the electronic device 500 shown in FIG. 8 is only an example, and should not bring any limitation to the function and scope of use of the embodiments of the present application.
  • the electronic device 500 may also communicate with one or more external devices 700 (such as keyboards, pointing devices, Bluetooth devices, etc.), and may also communicate with one or more devices that enable a user to interact with the electronic device 500, and/or communicate with Any device (such as a router, modem, etc.) that enables the electronic device 500 to communicate with one or more other computing devices. This communication can be performed through an input/output (I/O) interface 550.
  • the electronic device 500 may also communicate with one or more networks (for example, a local area network (LAN), a wide area network (WAN), and/or a public network, such as the Internet) through the network adapter 560.
  • networks for example, a local area network (LAN), a wide area network (WAN), and/or a public network, such as the Internet

Abstract

A detection method and apparatus for a facial key point, and a storage medium and an electronic device, which belong to the technical field of facial recognition. The method comprises: acquiring an image to be labeled, which includes a face (S110); inputting said image into a pre-trained key point labeling model, so that the key point labeling model outputs a thermodynamic diagram and predicted key point coordinates that correspond to said image, and an occlusion confidence of each point location in said image (S120); determining target key point coordinates of said image according to the thermodynamic diagram, the predicted key point coordinates and the occlusion confidence (S130); and performing key point labeling on said image according to the target key point coordinates (S140). By means of the method, the recognition efficiency of a facial key point can be improved, and the recognition accuracy of the facial key point is ensured.

Description

人脸关键点的检测方法、装置、存储介质及电子设备Method, device, storage medium and electronic equipment for detecting key points of human face
本申请要求于2020年11月12日在中国专利局提交的、申请号为202011264438.2、发明名称为“人脸关键点的检测方法、装置、存储介质及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed at the Chinese Patent Office on November 12, 2020 with the application number 202011264438.2 and the invention title "Methods, devices, storage media and electronic equipment for detecting key points of human faces". The entire content is incorporated into this application by reference.
技术领域Technical field
本申请涉及人脸识别技术领域,具体而言,涉及一种人脸关键点的检测方法、人脸关键点的检测装置、计算机可读存储介质以及电子设备。This application relates to the technical field of face recognition, and specifically, to a method for detecting key points of a face, a device for detecting key points of a face, a computer-readable storage medium, and an electronic device.
背景技术Background technique
人脸关键点检测是指在人脸图像中检测出人脸上如眼睛、鼻子、脸部边缘等关键点的技术。其可应用于定位人脸局部、识别表情、智能驾考判定、辅助驾驶等场景中。在目前的技术方案中,通过对图像进行多次标注取平均值的方式,以消除在人脸关键点标注时的误差。然而发明人发现,多次标注耗费时间较长,成本较高。因此,如何提高人脸关键点的识别效率,并保证人脸关键点的识别的准确度成为了亟待解决的技术问题。 Face key point detection refers to a technology that detects key points on the face, such as eyes, nose, and face edges, in a face image. It can be used in scenes such as locating parts of human faces, recognizing facial expressions, intelligent driving test judgments, and assisted driving. In the current technical solution, an image is marked for multiple times and the average value is taken to eliminate errors in marking the key points of the face. However, the inventor found that multiple labeling takes a long time and the cost is high. Therefore, how to improve the recognition efficiency of the key points of the face and ensure the accuracy of the recognition of the key points of the face has become an urgent technical problem to be solved.
需要说明的是,在上述背景技术部分公开的信息仅用于加强对本申请的背景的理解,因此可以包括不构成对本领域普通技术人员已知的现有技术的信息。It should be noted that the information disclosed in the background art section above is only used to enhance the understanding of the background of the application, and therefore may include information that does not constitute the prior art known to those of ordinary skill in the art.
技术问题technical problem
本申请实施例的目的之一在于提供一种人脸关键点的检测方法、人脸关键点的检测装置、计算机可读存储介质以及电子设备,以解决多次标注耗费时间较长,成本较高的问题,提高人脸关键点的识别效率,并保证人脸关键点的识别的准确度。One of the objectives of the embodiments of the present application is to provide a method for detecting key points of a human face, a device for detecting key points of a human face, a computer-readable storage medium, and electronic equipment, so as to solve the problem that multiple annotations are time-consuming and costly. To improve the recognition efficiency of the key points of the face, and ensure the accuracy of the recognition of the key points of the face.
技术解决方案Technical solutions
为解决上述技术问题,本申请实施例采用的技术方案是:In order to solve the above technical problems, the technical solutions adopted in the embodiments of this application are:
本申请实施例的第一方面提供了一种人脸关键点的检测方法,其中,包括:The first aspect of the embodiments of the present application provides a method for detecting key points of a human face, which includes:
获取包含人脸的待标注图像;Obtain an image to be annotated containing a human face;
将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;Input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and each point in the image to be annotated Occlusion confidence level;
根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;Determining the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
根据所述目标关键点坐标,对所述待标注图像进行关键点标注。According to the coordinate of the target key point, the key point is marked on the image to be marked.
本申请实施例的第二方面提供了一种人脸关键点的检测装置,其中,包括:The second aspect of the embodiments of the present application provides an apparatus for detecting key points of a human face, which includes:
获取模块,用于获取包含人脸的待标注图像;The acquisition module is used to acquire the to-be-labeled image containing the human face;
处理模块,用于将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;The processing module is used to input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and the to be annotated The occlusion confidence of each point in the image;
确定模块,用于根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;A determining module, configured to determine the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
标注模块,用于根据所述目标关键点坐标,对所述待标注图像进行关键点标注。The marking module is used to mark the key points of the image to be marked according to the coordinates of the target key points.
本申请实施例的第三方面提供了一种计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现步骤包括:A third aspect of the embodiments of the present application provides a computer-readable storage medium on which a computer program is stored, wherein the steps of implementing the computer program when the computer program is executed by a processor include:
获取包含人脸的待标注图像;Obtain an image to be annotated containing a human face;
将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;Input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and each point in the image to be annotated Occlusion confidence level;
根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;Determining the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
根据所述目标关键点坐标,对所述待标注图像进行关键点标注。According to the coordinate of the target key point, the key point is marked on the image to be marked.
本申请实施例的第四方面提供了一种电子设备,其中,包括:The fourth aspect of the embodiments of the present application provides an electronic device, which includes:
处理器;以及Processor; and
存储器,其上存储有计算机程序;A memory on which a computer program is stored;
其中,所述处理器配置为经由执行所述计算机程序来实现的步骤包括:Wherein, the steps that the processor is configured to be implemented by executing the computer program include:
获取包含人脸的待标注图像;Obtain an image to be annotated containing a human face;
将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;Input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and each point in the image to be annotated Occlusion confidence level;
根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;Determining the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
根据所述目标关键点坐标,对所述待标注图像进行关键点标注。According to the coordinate of the target key point, the key point is marked on the image to be marked.
有益效果Beneficial effect
本申请的有益效果在于:The beneficial effects of this application are:
基于本申请的各实施例,通过获取包含人脸的待标注图像,将待标注图像输入至预先训练完成的关键点标注模型,以使关键点标注模型输出待标注图像对应的热力图、预测关键点坐标以及待标注图像中各点位的遮挡置信度,再根据热力图、预测关键点坐标以及遮挡置信度,确定待标注图像的目标关键点坐标,以对待标注图像进行关键点标注,由此,通过热力图、预测关键点坐标以及遮挡置信度,确定目标关键点坐标,可以保证目标关键点坐标的准确度,同时无需多次标注,进而提高了人脸关键点的识别效率。Based on the embodiments of the present application, by acquiring the image to be annotated containing the face, the image to be annotated is input to the pre-trained key point annotation model, so that the key point annotation model outputs the heat map and prediction key corresponding to the image to be annotated Point coordinates and the occlusion confidence of each point in the image to be annotated, and then determine the target key point coordinates of the image to be annotated according to the heat map, predicted key point coordinates, and occlusion confidence, so as to mark the key points of the image to be annotated. , Determine the target key point coordinates through the heat map, predict the key point coordinates and the occlusion confidence, which can ensure the accuracy of the target key point coordinates, and at the same time, there is no need to mark multiple times, thereby improving the recognition efficiency of the face key points.
附图说明Description of the drawings
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本申请的实施例,并与说明书一起用于解释本申请的原理。显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。The drawings herein are incorporated into the specification and constitute a part of the specification, show embodiments that conform to the application, and are used together with the specification to explain the principle of the application. Obviously, the drawings in the following description are only some embodiments of the application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative work.
图1示出了根据本申请的一个实施例的人脸关键点的检测方法的流程示意图。Fig. 1 shows a schematic flowchart of a method for detecting key points of a human face according to an embodiment of the present application.
图2示出了根据本申请的一个实施例的图1的人脸关键点的检测方法中步骤S130的流程示意图。FIG. 2 shows a schematic flowchart of step S130 in the method for detecting key points of a human face in FIG. 1 according to an embodiment of the present application.
图3示出了根据本申请的一个实施例的人脸关键点的检测方法还包括的确定遮挡阈值的流程示意图。Fig. 3 shows a schematic flow chart of determining an occlusion threshold further included in the method for detecting key points of a human face according to an embodiment of the present application.
图4示出了根据本申请的一个实施例的图3的人脸关键点的检测方法中步骤S330的流程示意图。FIG. 4 shows a schematic flowchart of step S330 in the method for detecting key points of a human face in FIG. 3 according to an embodiment of the present application.
图5示出了根据本申请的一个实施例的人脸关键点的检测方法中还包括的训练关键点标注模型的流程示意图。FIG. 5 shows a schematic flowchart of training a key point annotation model further included in the method for detecting key points of a face according to an embodiment of the present application.
图6示出了根据本申请的一个实施例的图5的人脸关键点的检测方法中步骤S540的流程示意图。FIG. 6 shows a schematic flowchart of step S540 in the method for detecting key points of a human face in FIG. 5 according to an embodiment of the present application.
图7示出了根据本申请一个实施例的人脸关键点的检测装置的示意组成框图。Fig. 7 shows a schematic composition block diagram of a device for detecting key points of a human face according to an embodiment of the present application.
图8示出了根据本申请一个实施例的电子设备的示意组成框图。Fig. 8 shows a schematic block diagram of an electronic device according to an embodiment of the present application.
图9示出了根据本申请一个实施例的一种计算机可读存储介质的示意图。Fig. 9 shows a schematic diagram of a computer-readable storage medium according to an embodiment of the present application.
本发明的实施方式Embodiments of the present invention
现在将参考附图更全面地描述示例实施方式。然而,示例实施方式能够以多种形式实施,且不应被理解为限于在此阐述的范例;相反,提供这些实施方式使得本申请将更加全面和完整,并将示例实施方式的构思全面地传达给本领域的技术人员。所描述的特征、结构或特性可以以任何合适的方式结合在一个或更多实施方式中。在下面的描述中,提供许多具体细节从而给出对本申请的实施方式的充分理解。然而,本领域技术人员将意识到,可以实践本申请的技术方案而省略所述特定细节中的一个或更多,或者可以采用其它的方法、组元、装置、步骤等。在其它情况下,不详细示出或描述公知技术方案以避免喧宾夺主而使得本申请的各方面变得模糊。Example embodiments will now be described more fully with reference to the accompanying drawings. However, the example embodiments can be implemented in various forms, and should not be construed as being limited to the examples set forth herein; on the contrary, the provision of these embodiments makes this application more comprehensive and complete, and fully conveys the concept of the example embodiments To those skilled in the art. The described features, structures or characteristics can be combined in one or more embodiments in any suitable way. In the following description, many specific details are provided to give a sufficient understanding of the embodiments of the present application. However, those skilled in the art will realize that the technical solutions of the present application can be practiced without one or more of the specific details, or other methods, components, devices, steps, etc. can be used. In other cases, the well-known technical solutions are not shown or described in detail to avoid overwhelming the crowd and obscure all aspects of the present application.
此外,附图仅为本申请的示意性图解,并非一定是按比例绘制。图中相同的附图标记表示相同或类似的部分,因而将省略对它们的重复描述。附图中所示的一些方框图是功能实体,不一定必须与物理或逻辑上独立的实体相对应。可以采用软件形式来实现这些功能实体,或在一个或多个硬件模块或集成电路中实现这些功能实体,或在不同网络和/或处理器装置和/或微控制器装置中实现这些功能实体。In addition, the drawings are only schematic illustrations of the application and are not necessarily drawn to scale. The same reference numerals in the figures denote the same or similar parts, and thus their repeated description will be omitted. Some of the block diagrams shown in the drawings are functional entities and do not necessarily correspond to physically or logically independent entities. These functional entities may be implemented in the form of software, or implemented in one or more hardware modules or integrated circuits, or implemented in different networks and/or processor devices and/or microcontroller devices.
图1示出了根据本申请的一个实施例的人脸关键点的检测方法的流程示意图。该人脸关键点的检测方法可以应用于终端设备中,例如智能手机、平板电脑或者便携式电脑等,在其他实施例中,该人脸关键点的检测方法也可以应用于服务器中,本申请对此不做特殊限定。Fig. 1 shows a schematic flowchart of a method for detecting key points of a human face according to an embodiment of the present application. The method for detecting key points of a face can be applied to terminal devices, such as smart phones, tablets, or portable computers. In other embodiments, the method for detecting key points of a face can also be applied to a server. There are no special restrictions.
参照图1所示,该人脸关键点的检测方法至少包括步骤S110至步骤S140,详细介绍如下:Referring to FIG. 1, the method for detecting key points of a human face at least includes steps S110 to S140, which are described in detail as follows:
在步骤S110中,获取包含人脸的待标注图像。In step S110, an image to be labeled containing a human face is acquired.
在本申请一实施例中,终端设备可以从本地的存储位置中获取待标注图像,其中,该待标注图像包含未标注的人脸部分。需要说明的,所获取的待标注图像的数量可以是一个,也可以是两个或者两个以上的任意数量,本申请对此不做特殊限定。In an embodiment of the present application, the terminal device may obtain an image to be annotated from a local storage location, where the image to be annotated includes an unlabeled face part. It should be noted that the number of acquired images to be labeled may be one, or any number of two or more, which is not specifically limited in this application.
在本申请一实施例中,当终端设备接收到标注指令时,则可以打开其配置有的拍照装置如摄像头等,用户可以将该拍照装置对准欲标注的对象,从而获取待标注图像。In an embodiment of the present application, when the terminal device receives the labeling instruction, it can turn on its configured photographing device such as a camera, and the user can aim the photographing device at the object to be labelled to obtain the image to be labelled.
在步骤S120中,将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度。In step S120, the image to be annotated is input to the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, the predicted key point coordinates, and the to-be-annotated image The occlusion confidence of each point in the image.
在该实施例中,关键点标注模型可以是采用卷积神经网络训练而成,通过预先训练,以使该关键点标注模型可以输出与输入相对应的热力图、预测关键点坐标以及输入中各点位的遮挡置信度。In this embodiment, the key point labeling model can be trained by using a convolutional neural network, and through pre-training, the key point labeling model can output a heat map corresponding to the input, predict the key point coordinates, and each of the inputs. The occlusion confidence of the point.
需要说明的,该热力图可以通过各点位的亮度从而表征出各点位为人脸关键点的可能性大小,即亮度越高,则表示该点位对应关键点的可能性越大,相反的,亮度越低,则表示该点位对应关键点的可能性越小。It should be noted that the heat map can use the brightness of each point to characterize the possibility that each point is a key point of the face, that is, the higher the brightness, the greater the probability that the point corresponds to the key point, and the opposite , The lower the brightness, the less likely that the point corresponds to the key point.
然而,热力图对存在遮挡的人脸的待标注图像的识别效果不佳,其对应于遮挡部分的可信度较低。因此,通过训练一组全链接层以使该全链接层可以输出对待标注图像的关键点位置的预测关键点坐标。再训练关键点标注模型以使该关键点标注模型可以输出待标注图像中各点位的遮挡置信度。However, the recognition effect of the heat map on the to-be-labeled image with the occluded face is not good, and its reliability corresponding to the occluded part is low. Therefore, a set of fully-linked layers is trained so that the fully-linked layer can output the predicted key point coordinates of the key point positions of the image to be labeled. Retrain the key point annotation model so that the key point annotation model can output the occlusion confidence of each point in the image to be annotated.
需要说明的,该遮挡置信度可以用于描述待标注图像中各点位是否被遮挡的可能性大小。在实际使用中,遮挡置信度可以是介于0至1之间的数值,若遮挡置信度越大,则表示对应点位未被遮挡的可能性越大,若遮挡置信度越小,则表示对应点位被遮挡的可能性越大。It should be noted that the occlusion confidence can be used to describe the likelihood that each point in the image to be labeled is occluded. In actual use, the occlusion confidence can be a value between 0 and 1. If the occlusion confidence is greater, it means that the corresponding point is more likely to be unoccluded; if the occlusion confidence is smaller, it means The more likely the corresponding point is to be blocked.
在步骤S130中,根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标。In step S130, the target key point coordinates of the image to be marked are determined according to the heat map, the predicted key point coordinates, and the occlusion confidence.
在该步骤中,通过考虑遮挡置信度,从热力图的高亮位置的坐标以及预测关键点坐标中识别出目标关键点坐标,由此可以保证目标关键点坐标的准确度。In this step, by considering the occlusion confidence, the target key point coordinates are identified from the coordinates of the highlighted position of the heat map and the predicted key point coordinates, thereby ensuring the accuracy of the target key point coordinates.
在本申请一实施例中,可以将热力图中的高亮位置的坐标与待标注图像中各点位的遮挡置信度进行比对。若热力图中的高亮位置的坐标所对应的遮挡置信度较低,处于被遮挡的数值范围内,则表示该高亮位置被遮挡的可能性较大,因此,可以将该高亮位置的亮度调至低亮度范围内。In an embodiment of the present application, the coordinates of the highlighted position in the heat map can be compared with the occlusion confidence of each point in the image to be annotated. If the occlusion confidence level corresponding to the coordinates of the highlighted position in the heat map is low and is within the occluded numerical range, it means that the highlight position is more likely to be occluded. Therefore, the highlighted position can be Adjust the brightness to the low brightness range.
若热力图中的高亮位置的坐标所对应的遮挡置信度较高,处于未被遮挡的数值范围内,则表示该高亮位置未被遮挡的可能性较大,因此该高亮位置的可信度较高,对该高亮位置可以不做处理。由此,比对之后可以得到更新后的热力图。If the occlusion confidence level corresponding to the coordinates of the highlighted position in the heat map is high and is within the range of unoccluded values, it means that the highlight position is more likely to be unoccluded, so the highlight position can be The reliability is high, and the highlighted position may not be processed. As a result, the updated heat map can be obtained after the comparison.
结合更新后的热力图与关键点标注模型所输出的预测关键点坐标,将更新后的热力图的高亮位置的坐标以及预测关键点坐标进行去重,以得到目标关键点坐标。由此,通过遮挡置信度,可以对热力图中的高亮位置的坐标进行筛选,再结合预测关键点坐标,由此,可以保证目标关键点坐标的准确性。Combining the updated heat map with the predicted key point coordinates output by the key point annotation model, the coordinates of the highlighted position of the updated heat map and the predicted key point coordinates are deduplicated to obtain the target key point coordinates. Thus, by occluding the confidence level, the coordinates of the highlighted position in the heat map can be screened, combined with the prediction of the key point coordinates, thereby ensuring the accuracy of the target key point coordinates.
在步骤S140中,根据所述目标关键点坐标,对所述待标注图像进行关键点标注。In step S140, the key points of the image to be marked are marked according to the coordinates of the target key points.
在该步骤中,根据所确定的目标关键点坐标,可以对待标注图像进行关键点标注。在一示例中,对待标注图像进行关键点标注可以是将待标注图像中的目标关键点坐标进行高亮显示,例如采用红色或者黄色的预定颜色进行显示等。在另一示例中,对待标注图像进行关键点标注可以是采用批注的方式,具体地,批注框的一端可以指示目标关键点坐标所对应的位置,另一端可以包含该关键点的所属区域信息,例如嘴唇关键点、眼睛关键点或者鼻部关键点等等。In this step, based on the determined target key point coordinates, the image to be marked can be marked with key points. In an example, the key point annotation of the image to be annotated may be to highlight the coordinates of the target key point in the image to be annotated, for example, to display in a predetermined color of red or yellow. In another example, the key point labeling of the image to be labeled may be in the form of annotation. Specifically, one end of the annotation box may indicate the position corresponding to the target key point coordinates, and the other end may contain the region information of the key point. For example, key points of lips, key points of eyes or key points of nose and so on.
在图1所示的实施例中,通过将待标注图像输入至关键点标注模型中,以使该关键点标注模型输出待标注图像对应的热力图、预测关键点坐标以及待标注图像中各点位的遮挡置信度,由此,根据该热力图、预测关键点坐标以及遮挡置信度,从而确定待标注图像的目标关键点坐标,可以保证目标关键点坐标的准确度,同时也无需多次进行标注,提高了标注效率。In the embodiment shown in FIG. 1, the image to be annotated is input into the key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, the predicted key point coordinates, and each point in the image to be annotated According to the heat map, the predicted key point coordinates and the occlusion confidence degree, the target key point coordinates of the image to be labeled can be determined. The accuracy of the target key point coordinates can be ensured, and there is no need to perform multiple times. Marking improves the marking efficiency.
基于图1所示的实施例,图2示出了根据本申请的一个实施例的图1的人脸关键点的检测方法中步骤S130的流程示意图。参照图2所示,步骤S130至少包括步骤S210至步骤S240,详细介绍如下:Based on the embodiment shown in FIG. 1, FIG. 2 shows a schematic flowchart of step S130 in the method for detecting key points of a human face in FIG. 1 according to an embodiment of the present application. Referring to FIG. 2, step S130 includes at least step S210 to step S240, which are described in detail as follows:
在步骤S210中,将所述待标注图像中各点位的遮挡置信度与预先设定的遮挡阈值进行比较,确定遮挡置信度小于所述遮挡阈值的待处理点位。In step S210, the occlusion confidence of each point in the to-be-labeled image is compared with a preset occlusion threshold to determine the point to be processed whose occlusion confidence is less than the occlusion threshold.
其中,遮挡阈值可以是用于确定该点位是否被遮挡的阈值,若某一点位的遮挡置信度小于遮挡阈值,则表示该点位被遮挡的可能性较大;若某一点位的遮挡置信度大于或等于遮挡阈值,则表示该点位未被遮挡的可能性较大。Among them, the occlusion threshold can be a threshold used to determine whether the point is occluded. If the occlusion confidence of a certain point is less than the occlusion threshold, it means that the point is more likely to be occluded; if the occlusion of a certain point is confident If the degree is greater than or equal to the occlusion threshold, it means that the point is more likely to be unoccluded.
在该实施例中,将待标注图像中各点位对应的遮挡置信度与预先设定的遮挡阈值进行比较,可以得到待标注图像中具有较大可能性的点位即遮挡置信度小于遮挡阈值的待处理点位。In this embodiment, the occlusion confidence level corresponding to each point in the image to be labeled is compared with the preset occlusion threshold, and it can be obtained that the point in the image to be labeled has a higher probability, that is, the occlusion confidence is less than the occlusion threshold. To be processed.
在步骤S220中,根据所述待处理点位,从所述预测关键点坐标中选取与所述待处理点位相对应的预测关键点坐标作为第一关键点坐标。In step S220, according to the points to be processed, the predicted key point coordinates corresponding to the points to be processed are selected from the predicted key point coordinates as the first key point coordinates.
在该实施例中,由于待处理点位为具有较大可能性被遮挡的点位,因此,相比于热力图中对应于该待处理点位的高亮位置,与该待处理点位相对应的预测关键点坐标的可信度较高,因此将预测关键点坐标中与待处理点位相对应的预测关键点坐标作为第一关键点坐标。In this embodiment, since the point to be processed is a point that has a greater possibility of being occluded, compared to the highlighted position corresponding to the point to be processed in the heat map, it corresponds to the point to be processed The predicted key point coordinates of the predicted key point coordinates are highly reliable, so the predicted key point coordinates corresponding to the point to be processed in the predicted key point coordinates are taken as the first key point coordinates.
在步骤S230中,根据所述待处理点位,从所述热力图中选取除与所述待处理点位相对应的位置之外的高亮点位的坐标作为第二关键点坐标。In step S230, according to the point to be processed, the coordinates of the highlight point except for the position corresponding to the point to be processed are selected from the heat map as the second key point coordinate.
在该实施例中,由于待处理点位为具有较大可能性被遮挡的点位,因此,热力图中除与待处理点位相对应的位置之外的高亮点位即为具有较大可能性未被遮挡的点位,可信度较高,所以可以将其作为第二关键点坐标。In this embodiment, since the point to be processed is a point that has a greater possibility of being occluded, the highlight in the heat map except for the position corresponding to the point to be processed is a point that has a greater possibility The unobstructed point has high credibility, so it can be used as the second key point coordinate.
在步骤S240中,将所述第一关键点坐标与所述第二关键点坐标进行整合,确定所述待标注图像的目标关键点坐标。In step S240, the first key point coordinates and the second key point coordinates are integrated to determine the target key point coordinates of the image to be annotated.
在该实施例中,根据所识别得到的第一关键点坐标以及第二关键点坐标,将第一关键点坐标和第二关键点坐标进行整合,以将第一关键点坐标和第二关键点坐标作为目标关键点坐标。In this embodiment, according to the identified first key point coordinates and the second key point coordinates, the first key point coordinates and the second key point coordinates are integrated to combine the first key point coordinates and the second key point coordinates. The coordinates are used as the target key point coordinates.
在图2所示的实施例中,通过遮挡置信度的设置,分别从预测关键点坐标以及热力图中分别选取第一关键点坐标和第二关键点坐标,以作为目标关键点坐标。由此可以分别从预测关键点坐标以及热力图中选取可信度较高的坐标作为目标关键点坐标,从而保证了目标关键点坐标的准确度。In the embodiment shown in FIG. 2, by setting the occlusion confidence, the first key point coordinates and the second key point coordinates are respectively selected from the predicted key point coordinates and the heat map as the target key point coordinates. Therefore, the coordinates with higher reliability can be selected from the predicted key point coordinates and the heat map respectively as the target key point coordinates, thereby ensuring the accuracy of the target key point coordinates.
基于图1所示的实施例,图3示出了根据本申请的一个实施例的人脸关键点的检测方法还包括的确定遮挡阈值的流程示意图。参照图3所示,确定遮挡阈值至少包括步骤S310至步骤S330,详细介绍如下:Based on the embodiment shown in FIG. 1, FIG. 3 shows a schematic flowchart of determining the occlusion threshold further included in the method for detecting key points of a human face according to an embodiment of the present application. Referring to FIG. 3, determining the occlusion threshold includes at least step S310 to step S330, which are described in detail as follows:
在步骤S310中,获取遮挡训练样本集合,所述遮挡训练样本集合包括多个遮挡样本图像,所述遮挡样本图像中的人脸包括部分遮挡。In step S310, an occlusion training sample set is obtained, the occlusion training sample set includes a plurality of occlusion sample images, and the face in the occlusion sample image includes partial occlusion.
在该实施例中,遮挡训练样本集合中可以包含多个遮挡样本图像,该遮挡样本图像中的人脸存在部分遮挡,例如背景、帽子、口罩、刘海、眼镜、胡子、手指、笔或者麦克风等常见人脸遮挡物。对应于每一个遮挡样本图像,可以对应存储该遮挡样本图像对应的遮挡位置的坐标,以在后续进行比对。In this embodiment, the occlusion training sample set may contain multiple occlusion sample images, and the face in the occlusion sample image is partially occluded, such as background, hat, mask, bangs, glasses, mustache, fingers, pen, or microphone, etc. Common face occluders. Corresponding to each occlusion sample image, the coordinates of the occlusion position corresponding to the occlusion sample image can be correspondingly stored for subsequent comparison.
在步骤S320中,将所述遮挡训练样本集合中的遮挡样本图像输入至所述关键点标注模型中,以使所述关键点标注模型输出所述遮挡样本图像中各点位的遮挡置信度。In step S320, the occlusion sample images in the occlusion training sample set are input into the key point annotation model, so that the key point annotation model outputs the occlusion confidence of each point in the occlusion sample image.
在该实施例中,通过对关键点标注模型进行构建,以使该关键点标注模型能够输出其输入所对应的各点位的遮挡置信度。将遮挡训练样本集合中的遮挡样本图像输入至待训练的关键点标注模型,以使该关键点标注模型能够输出各遮挡样本图像中各点位的遮挡置信度。In this embodiment, the key point annotation model is constructed so that the key point annotation model can output the occlusion confidence of each point corresponding to its input. The occlusion sample images in the occlusion training sample set are input to the key point annotation model to be trained, so that the key point annotation model can output the occlusion confidence of each point in the occlusion sample image.
在步骤S330中,根据多个所述遮挡样本图像中各点位的遮挡置信度,确定遮挡阈值。In step S330, an occlusion threshold is determined according to the occlusion confidence of each point in the multiple occlusion sample images.
在本申请一实施例中,根据关键点标注模型所输出的遮挡样本图像中各点位的遮挡置信度,将其与各遮挡样本图像中对应于遮挡位置的坐标进行比对,得到各遮挡样本图像中无遮挡位置的点位的遮挡置信度,即各遮挡样本图像中除对应于遮挡位置之外的点位的遮挡置信度。以根据对应于无遮挡位置的点位的遮挡置信度确定遮挡阈值,用于后续判断。In an embodiment of the present application, according to the occlusion confidence of each point in the occlusion sample image output by the key point annotation model, it is compared with the coordinates corresponding to the occlusion position in each occlusion sample image to obtain each occlusion sample The occlusion confidence of the points in the image without occlusion, that is, the occlusion confidence of the points in each occlusion sample image other than the occlusion position. The occlusion threshold is determined according to the occlusion confidence of the point corresponding to the unoccluded position for subsequent judgment.
在本申请一示例中,可以从各遮挡样本图像中无遮挡位置的点位的遮挡置信度中,选取最小值以作为遮挡阈值,以在后续比对中能够保证该遮挡阈值尽可能的识别出待标注图像中被遮挡的位置。In an example of the present application, the minimum value can be selected as the occlusion threshold from the occlusion confidence of the points in the unoccluded position in each occlusion sample image, so as to ensure that the occlusion threshold can be identified as much as possible in the subsequent comparison The occluded position in the image to be annotated.
在图3所示的实施例中,通过设置遮挡训练样本集合,作为该关键点标注模型的输入,从而能够使关键点标注模型输出遮挡训练样本集合中各遮挡样本图像中各点位的遮挡置信度,进而根据该遮挡置信度确定遮挡阈值,保证了遮挡阈值设置的有效性,使得该遮挡阈值具有参考价值。In the embodiment shown in FIG. 3, by setting the occlusion training sample set as the input of the key point labeling model, the key point labeling model can output the occlusion confidence of each point in the occlusion sample image in the training sample set. Therefore, the occlusion threshold is determined according to the occlusion confidence, which ensures the validity of the occlusion threshold setting, so that the occlusion threshold has reference value.
基于图1和图3所示的实施例,图4示出了根据本申请的一个实施例的图3的人脸关键点的检测方法中步骤S330的流程示意图。参照图4所示,步骤S330至少包括步骤S410至步骤S420,详细介绍如下:Based on the embodiments shown in FIG. 1 and FIG. 3, FIG. 4 shows a schematic flowchart of step S330 in the method for detecting key points of a human face in FIG. 3 according to an embodiment of the present application. Referring to FIG. 4, step S330 includes at least step S410 to step S420, which are described in detail as follows:
在步骤S410中,从多个所述遮挡样本图像中各点位的遮挡置信度中,选取对应于遮挡样本图像中无遮挡位置的点位的遮挡置信度作为待选置信度。In step S410, from the occlusion confidence of each point in the multiple occlusion sample images, the occlusion confidence of the point corresponding to the unoccluded position in the occlusion sample image is selected as the confidence to be selected.
在该实施例中,将关键点标注模型所输出的各遮挡样本图像中的各点位的遮挡置信度与各遮挡样本图像中对应于被遮挡位置的点位的坐标进行匹配,可以得到各遮挡样本图像中除被遮挡位置以外的点位的遮挡置信度,即对应于各遮挡样本图像中无遮挡位置的点位的遮挡置信度,并将对应于各遮挡样本图像中无遮挡位置的点位的遮挡置信度作为待选置信度,以从中选取之一作为遮挡阈值。In this embodiment, the occlusion confidence of each point in each occlusion sample image output by the key point annotation model is matched with the coordinates of the point corresponding to the occluded position in each occlusion sample image, and each occlusion can be obtained. The occlusion confidence of points in the sample image except the occluded position, that is, the occlusion confidence of the points corresponding to the unoccluded positions in each occluded sample image, and will correspond to the points of the unoccluded positions in each occluded sample image The occlusion confidence of is used as the candidate confidence, and one of them is selected as the occlusion threshold.
在步骤S420中,从所述待选置信度中按照从大到小的顺序,选取排列在预定比例位置的遮挡置信度作为遮挡阈值。In step S420, the occlusion confidence levels arranged in predetermined proportions are selected from the to-be-selected confidence levels in descending order as the occlusion threshold.
在该实施例中,预定比例可以是由本领域技术人员预先设定、用以确定遮挡阈值的比例,例如预定比例可以是98%、99%或者99.5%等等。例如待选置信度的数量为1000,若预定比例为99.5%,则将按照从大到小的顺序进行排列的待选置信度中选取排列在第995位(即1000*99.5%)的待选置信度作为遮挡阈值。In this embodiment, the predetermined ratio may be a ratio that is preset by a person skilled in the art to determine the occlusion threshold. For example, the predetermined ratio may be 98%, 99%, or 99.5%, and so on. For example, the number of confidence levels to be selected is 1000. If the predetermined ratio is 99.5%, the candidate ranked 995th (ie 1000*99.5%) will be selected from the candidate confidence levels arranged in descending order The confidence is used as the occlusion threshold.
需要说明的,由于关键点标注模型的识别存在一定误差,因此对应于遮挡位置以及无遮挡位置的遮挡置信度也存在一定的误差,具有交集,因此本领域技术人员可以根据在先经验设定预定比例,以消除该误差,从而保证遮挡阈值的有效性,避免后续的误识别的情况发生。It should be noted that because there is a certain error in the recognition of the key point annotation model, there is also a certain error in the occlusion confidence corresponding to the occluded position and the unoccluded position, and there is an intersection. Therefore, those skilled in the art can set the preset according to prior experience. Ratio to eliminate the error, thereby ensuring the effectiveness of the occlusion threshold and avoiding subsequent misidentification.
基于图1所示的实施例,图5示出了根据本申请的一个实施例的人脸关键点的检测方法中还包括的训练关键点标注模型的流程示意图。参照图5所示,训练关键点标注模型至少包括步骤S510至步骤S530,详细介绍如下:Based on the embodiment shown in FIG. 1, FIG. 5 shows a schematic flowchart of training a key point labeling model further included in the method for detecting key points of a face according to an embodiment of the present application. Referring to FIG. 5, training the key point annotation model includes at least step S510 to step S530, which are described in detail as follows:
在步骤S510中,获取训练样本集合,所述训练样本集合中包含多个包含人脸的样本图像,所述样本图像中包含关键点信息。In step S510, a training sample set is obtained. The training sample set includes a plurality of sample images including human faces, and the sample images include key point information.
在该实施例中,训练样本集合可以是用于训练关键点标注模型的样本集合,其中可以包含多个包含人脸的样本图像,且每一样本图像中可以包含有自身的关键点信息,该关键点信息可以是预先进行标定的待标注图像的关键点坐标。In this embodiment, the training sample set may be a sample set used to train the key point annotation model, which may include multiple sample images containing human faces, and each sample image may contain its own key point information. The key point information may be the coordinates of the key point of the image to be labeled that has been calibrated in advance.
在本申请的一示例中,可以从本地的存储位置中获取该训练样本集合例如图像数据库等。具体地,当接收到对关键点标注模型的训练请求时,可以从图像数据库中随机选取预定数量的样本图像进行随机排列以得到训练样本集合。在其他示例中,也可以通过网络从第三方机构中获取训练样本集合,本申请对此不作特殊限定。In an example of the present application, the training sample set, such as an image database, can be obtained from a local storage location. Specifically, when a training request for the key point annotation model is received, a predetermined number of sample images can be randomly selected from the image database and randomly arranged to obtain a training sample set. In other examples, the training sample set can also be obtained from a third-party organization through the network, which is not specifically limited in this application.
在步骤S520中,将所述样本图像输入至待训练的关键点标注模型中,以使所述关键点标注模型输出与所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度。In step S520, the sample image is input into the key point annotation model to be trained, so that the key point annotation model outputs a heat map corresponding to the sample image, predicted key point coordinates, and the sample image The occlusion confidence of each point.
在该实施例中,通过对关键点标注模型进行构建,使其具有三个分支,一是热力图分支,该热力图分支能够输出样本图像对应的热力图,热力图中的高亮点位可以用于表示该样本图像的关键点位置;二是预测关键点坐标分支,其能够对样本图像进行分析,从而输出对样本图像的关键点进行预测的预测关键点坐标;三是遮挡置信度输出分支,该分支能够输出样本图像中各点位对应的遮挡置信度。In this embodiment, the key point annotation model is constructed to have three branches, one is the heat map branch, which can output the heat map corresponding to the sample image, and the highlights of the heat map can be used Yu represents the key point position of the sample image; the second is the prediction key point coordinate branch, which can analyze the sample image to output the predicted key point coordinates for predicting the key point of the sample image; the third is the occlusion confidence output branch, This branch can output the occlusion confidence level corresponding to each point in the sample image.
由此,将训练样本集合中的每一样本图像输入至关键点标注模型中,以使该关键点标注模型的三个分支分别输出各样本图像对应的热力图、预测关键点坐标以及样本图像中各点位的遮挡置信度。Therefore, each sample image in the training sample set is input to the keypoint annotation model, so that the three branches of the keypoint annotation model output the heat map corresponding to each sample image, the predicted keypoint coordinates, and the sample image. The occlusion confidence of each point.
在步骤S530中,根据所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度,确定所述样本图像中的目标关键点坐标。In step S530, the target key point coordinates in the sample image are determined according to the heat map corresponding to the sample image, the predicted key point coordinates, and the occlusion confidence of each point in the sample image.
在该实施例中,可以根据各样本图像所对应的热力图、预测关键点坐标以及各点位的遮挡置信度,参照如上实施例所述的选取方法,确定各样本图像对应的目标关键点坐标,本申请在此不再赘述。In this embodiment, the target key point coordinates corresponding to each sample image can be determined by referring to the selection method described in the above embodiment according to the heat map corresponding to each sample image, the predicted key point coordinates, and the occlusion confidence of each point. , This application will not repeat it here.
在步骤S540中,调整待训练的关键点标注模型中的参数,以使所述样本图像中的目标关键点坐标与所述关键点信息相匹配。In step S540, the parameters in the key point labeling model to be trained are adjusted so that the target key point coordinates in the sample image match the key point information.
在该实施例中,根据所确定的样本图像的目标关键点坐标,将其与各样本图像预先标定的关键点信息进行比对,从而确定目标关键点坐标是否与关键点信息相匹配,若不匹配,则表示关键点标注模型识别有误,因此,可以通过调整关键点标注模型的参数,从而使得根据关键点标注模型的输出所确定的目标关键点坐标能够与样本图像的关键点信息进行匹配,从而保证关键点标注模型的识别的准确性。In this embodiment, according to the determined target key point coordinates of the sample image, it is compared with the pre-calibrated key point information of each sample image to determine whether the target key point coordinates match the key point information. Matching means that the key point labeling model is incorrectly recognized. Therefore, the parameters of the key point labeling model can be adjusted so that the target key point coordinates determined according to the output of the key point labeling model can be matched with the key point information of the sample image , So as to ensure the accuracy of the recognition of the key point annotation model.
基于图1和图5所示的实施例,图6示出了根据本申请的一个实施例的图5的人脸关键点的检测方法中步骤S540的流程示意图。参照图6所示,步骤S540至少包括步骤S610至步骤S640,详细介绍如下:Based on the embodiments shown in FIGS. 1 and 5, FIG. 6 shows a schematic flowchart of step S540 in the method for detecting key points of a human face in FIG. 5 according to an embodiment of the present application. Referring to FIG. 6, step S540 includes at least step S610 to step S640, which are described in detail as follows:
在步骤S610中,将所述训练样本集合输入至不同学习率的待训练的关键点标注模型中,以使各关键点标注模型分别输出训练数据,所述训练数据包括各样本图像对应的热力图、预测关键点坐标以及样本图像中各点位的遮挡置信度。In step S610, the training sample set is input to the key point labeling models to be trained with different learning rates, so that each key point labeling model outputs training data respectively, and the training data includes the heat map corresponding to each sample image , Predict the coordinates of key points and the occlusion confidence of each point in the sample image.
在该实施例中,将训练样本集合输入至不同学习率的待训练的关键点标注模型中,以使不同学习率的关键点标注模型输出多组训练数据,该训练数据包括各样本图像对应的热力图、预测关键点坐标以及样本图像中各点位的遮挡置信度。In this embodiment, the training sample set is input to the key point labeling model to be trained with different learning rates, so that the key point labeling model with different learning rates outputs multiple sets of training data. The training data includes the corresponding training data for each sample image. Heat map, predicted key point coordinates, and occlusion confidence of each point in the sample image.
在本申请的一个实施例中,可以对第一次训练时的关键点标注模型设置一个较大的学习率,每基于训练样本集合进行训练完成100次后,将该学习率降低10倍,并根据更新后的学习率继续训练,直到关键点标注模型接近收敛,且损失函数不再下降的时候,存储不同学习率下的关键点标注模型所输出的多组训练数据。In an embodiment of the present application, a larger learning rate can be set for the key point labeling model during the first training. After 100 times of training based on the training sample set are completed, the learning rate is reduced by 10 times, and Continue training according to the updated learning rate until the key point labeling model is close to convergence and the loss function no longer drops, storing multiple sets of training data output by the key point labeling model under different learning rates.
在步骤S620中,根据多个所述训练数据进行统计,从多个所述训练数据中识别出目标训练数据。In step S620, statistics are performed based on the plurality of training data, and target training data is identified from the plurality of training data.
在本申请的一个实施例中,根据多组训练数据,可以对应确定每一组训练数据中各样本图像对应的目标关键点坐标数据。由此,可以得到每一样本图像对应的多组目标关键点坐标数据,再根据各样本图像对应的多组目标关键点坐标数据,可以将出现次数在预定次数以上目标关键点坐标作为该样本图像的真实的关键点坐标,例如在某一样本图像对应的多组目标关键点坐标数据中,坐标A出现的次数为50次,坐标B出现的次数为10次,预定次数为40次,则可以将坐标A确定为该样本图像的真实的关键点坐标,坐标B则不予采纳,等等。需要说明的,以上数字仅为示例性举例,本申请对此不作特殊限定。In an embodiment of the present application, according to multiple sets of training data, the target key point coordinate data corresponding to each sample image in each set of training data can be correspondingly determined. As a result, multiple sets of target key point coordinate data corresponding to each sample image can be obtained, and then according to the multiple sets of target key point coordinate data corresponding to each sample image, the target key point coordinates can be used as the sample image with the number of occurrences above a predetermined number of times. For example, in multiple sets of target key point coordinate data corresponding to a sample image, the number of times that coordinate A appears is 50 times, the number of times that coordinate B appears is 10 times, and the predetermined number is 40 times. The coordinate A is determined as the true key point coordinate of the sample image, the coordinate B is not adopted, and so on. It should be noted that the above figures are only illustrative examples, and this application does not specifically limit this.
应该理解的,在各个样本图像对应的多组目标关键点样本数据中,出现次数越多的目标关键点坐标,则越有可能为该样本图像的真实的关键点坐标,因此,可以将出现次数大于预定次数的目标关键点坐标作为该样本图像对应的真实关键点坐标并进行整合,以得到该样本图像对应的目标训练数据。It should be understood that in the multiple sets of target key point sample data corresponding to each sample image, the more frequent the target key point coordinates are, the more likely they are the true key point coordinates of the sample image. Therefore, the number of occurrences can be reduced The target key point coordinates greater than a predetermined number of times are used as the real key point coordinates corresponding to the sample image and integrated to obtain the target training data corresponding to the sample image.
在步骤S630中,根据所述目标训练数据对所述样本图像包含的关键点信息进行更新,得到所述样本图像更新后的关键点信息。In step S630, the key point information contained in the sample image is updated according to the target training data to obtain updated key point information of the sample image.
在该实施例中,根据所得到的目标训练数据,对样本图像原先包含的关键点信息进行替换,从而得到样本图像更新后的关键点信息。In this embodiment, the key point information originally contained in the sample image is replaced according to the obtained target training data, so as to obtain the updated key point information of the sample image.
在步骤S640中,调整待训练的所述关键点标注模型中的参数,以使所述目标关键点坐标与所述更新后的关键点信息相匹配。In step S640, the parameters in the key point labeling model to be trained are adjusted to make the target key point coordinates match the updated key point information.
在该实施例中,在后续训练关键点标注模型时,通过调整关键点标注模型的参数,使得该关键点标注模型的输出能够与各样本图像的更新后的关键点信息相匹配。由此,通过更新后的关键点信息对关键点标注模型的训练进行指导,以消除原本预先标定的关键点信息的误差,从而能够保证关键点标注模型的训练效果,以保证关键点标注模型的输出的准确性。In this embodiment, when the key point annotation model is subsequently trained, the parameters of the key point annotation model are adjusted so that the output of the key point annotation model can match the updated key point information of each sample image. As a result, the updated key point information is used to guide the training of the key point annotation model to eliminate the error of the original pre-calibrated key point information, so as to ensure the training effect of the key point annotation model to ensure the key point annotation model. The accuracy of the output.
本申请还提供了一种人脸关键点的检测装置。参考图7所示,该装置可以包括:This application also provides a device for detecting key points of a human face. Referring to FIG. 7, the device may include:
获取模块710,用于获取包含人脸的待标注图像;The obtaining module 710 is configured to obtain an image to be annotated containing a human face;
处理模块720,用于将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;The processing module 720 is configured to input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and the to-be-annotated image. Annotate the occlusion confidence of each point in the image;
确定模块730,用于根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;The determining module 730 is configured to determine the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence level;
标注模块740,用于根据所述目标关键点坐标,对所述待标注图像进行关键点标注。The marking module 740 is configured to mark the key points of the image to be marked according to the coordinates of the target key points.
在本申请的一实施例中,所述确定模块730包括:In an embodiment of the present application, the determining module 730 includes:
确定单元,用于将所述待标注图像中各点位的遮挡置信度与预先设定的遮挡阈值进行比较,确定遮挡置信度小于所述遮挡阈值的待处理点位;A determining unit, configured to compare the occlusion confidence of each point in the to-be-labeled image with a preset occlusion threshold, and determine the point to be processed whose occlusion confidence is less than the occlusion threshold;
第一选取单元,用于根据所述待处理点位,从所述预测关键点坐标中选取与所述待处理点位相对应的预测关键点坐标作为第一关键点坐标;The first selection unit is configured to select, from the predicted key point coordinates, the predicted key point coordinates corresponding to the point to be processed as the first key point coordinates according to the point to be processed;
第二选取单元,用于根据所述待处理点位,从所述热力图中选取除与所述待处理点位相对应的位置之外的高亮点位的坐标作为第二关键点坐标;The second selection unit is configured to select, from the heat map, the coordinates of the highlight point except for the position corresponding to the point to be processed as the second key point coordinate according to the point to be processed;
整合单元,用于将所述第一关键点坐标与所述第二关键点坐标进行整合,确定所述待标注图像的目标关键点坐标。The integration unit is configured to integrate the first key point coordinates and the second key point coordinates to determine the target key point coordinates of the image to be annotated.
上述人脸关键点的检测装置中各模块的具体细节已经在对应的人脸关键点的检测方法中进行了详细的描述,因此此处不再赘述。The specific details of each module in the above-mentioned face key point detection device have been described in detail in the corresponding face key point detection method, so it will not be repeated here.
应当注意,尽管在上文详细描述中提及了用于动作执行的设备的若干模块或者单元,但是这种划分并非强制性的。实际上,根据本申请的实施方式,上文描述的两个或更多模块或者单元的特征和功能可以在一个模块或者单元中具体化。反之,上文描述的一个模块或者单元的特征和功能可以进一步划分为由多个模块或者单元来具体化。It should be noted that although several modules or units of the device for action execution are mentioned in the above detailed description, this division is not mandatory. In fact, according to the embodiments of the present application, the features and functions of two or more modules or units described above may be embodied in one module or unit. Conversely, the features and functions of a module or unit described above can be further divided into multiple modules or units to be embodied.
此外,尽管在附图中以特定顺序描述了本申请中方法的各个步骤,但是,这并非要求或者暗示必须按照该特定顺序来执行这些步骤,或是必须执行全部所示的步骤才能实现期望的结果。附加的或备选的,可以省略某些步骤,将多个步骤合并为一个步骤执行,以及/或者将一个步骤分解为多个步骤执行等。In addition, although the various steps of the method in the present application are described in a specific order in the drawings, this does not require or imply that these steps must be performed in the specific order, or that all the steps shown must be performed to achieve the desired result. Additionally or alternatively, some steps may be omitted, multiple steps may be combined into one step for execution, and/or one step may be decomposed into multiple steps for execution, etc.
通过以上的实施方式的描述,本领域的技术人员易于理解,这里描述的示例实施方式可以通过软件实现,也可以通过软件结合必要的硬件的方式来实现。因此,根据本申请实施方式的技术方案可以以软件产品的形式体现出来,该软件产品可以存储在一个非易失性存储介质(可以是CD-ROM,U盘,移动硬盘等)中或网络上,包括若干指令以使得一台计算设备(可以是个人计算机、服务器、移动终端、或者网络设备等)执行根据本申请实施方式的方法。Through the description of the above embodiments, those skilled in the art can easily understand that the example embodiments described here can be implemented by software, or can be implemented by combining software with necessary hardware. Therefore, the technical solution according to the embodiments of the present application can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, U disk, mobile hard disk, etc.) or on the network , Including several instructions to make a computing device (which can be a personal computer, a server, a mobile terminal, or a network device, etc.) execute the method according to the embodiment of the present application.
在本申请的示例性实施例中,还提供了一种能够实现上述方法的电子设备。In an exemplary embodiment of the present application, an electronic device capable of implementing the above method is also provided.
所属技术领域的技术人员能够理解,本申请的各个方面可以实现为系统、方法或程序产品。因此,本申请的各个方面可以具体实现为以下形式,即:完全的硬件实施方式、完全的软件实施方式(包括固件、微代码等),或硬件和软件方面结合的实施方式,这里可以统称为“电路”、“模块”或“系统”。Those skilled in the art can understand that various aspects of the present application can be implemented as a system, a method, or a program product. Therefore, each aspect of the present application can be specifically implemented in the following forms, namely: complete hardware implementation, complete software implementation (including firmware, microcode, etc.), or a combination of hardware and software implementations, which can be collectively referred to herein as "Circuit", "Module" or "System".
下面参照图8来描述根据本申请的这种实施方式的电子设备500。图8显示的电子设备500仅仅是一个示例,不应对本申请实施例的功能和使用范围带来任何限制。The electronic device 500 according to this embodiment of the present application will be described below with reference to FIG. 8. The electronic device 500 shown in FIG. 8 is only an example, and should not bring any limitation to the function and scope of use of the embodiments of the present application.
如图8所示,电子设备500以通用计算设备的形式表现。电子设备500的组件可以包括但不限于:上述至少一个处理单元510、上述至少一个存储单元520、连接不同系统组件(包括存储单元520和处理单元510)的总线530。As shown in FIG. 8, the electronic device 500 is represented in the form of a general-purpose computing device. The components of the electronic device 500 may include, but are not limited to: the aforementioned at least one processing unit 510, the aforementioned at least one storage unit 520, and a bus 530 connecting different system components (including the storage unit 520 and the processing unit 510).
其中,所述存储单元存储有程序代码,所述程序代码可以被所述处理单元510执行,使得所述处理单元510执行本说明书上述“示例性方法”部分中描述的根据本申请各种示例性实施方式的步骤。例如,所述处理单元510可以执行如图1中所示的步骤110:获取包含人脸的待标注图像;步骤S120:将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;步骤S130,根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;步骤S140,根据所述目标关键点坐标,对所述待标注图像进行关键点标注。Wherein, the storage unit stores program code, and the program code can be executed by the processing unit 510, so that the processing unit 510 executes the various exemplary methods described in the “Exemplary Method” section of this specification. Steps of implementation. For example, the processing unit 510 may perform step 110 as shown in FIG. 1: obtain an image to be annotated containing a human face; step S120: input the image to be annotated into the pre-trained key point annotation model, so that The key point annotation model outputs the heat map corresponding to the image to be annotated, the predicted key point coordinates, and the occlusion confidence of each point in the image to be annotated; step S130, according to the heat map and the predicted key point The coordinates and the occlusion confidence level determine the target key point coordinates of the image to be annotated; step S140, according to the target key point coordinates, mark the key points of the image to be annotated.
存储单元520可以包括易失性存储单元形式的可读介质,例如随机存取存储单元(RAM)5201和/或高速缓存存储单元5202,还可以进一步包括只读存储单元(ROM)5203。The storage unit 520 may include a readable medium in the form of a volatile storage unit, such as a random access storage unit (RAM) 5201 and/or a cache storage unit 5202, and may further include a read-only storage unit (ROM) 5203.
存储单元520还可以包括具有一组(至少一个)程序模块5205的程序/实用工具5204,这样的程序模块5205包括但不限于:操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。The storage unit 520 may also include a program/utility tool 5204 having a set of (at least one) program module 5205. Such program module 5205 includes but is not limited to: an operating system, one or more application programs, other program modules, and program data, Each of these examples or some combination may include the implementation of a network environment.
总线530可以为表示几类总线结构中的一种或多种,包括存储单元总线或者存储单元控制器、外围总线、图形加速端口、处理单元或者使用多种总线结构中的任意总线结构的局域总线。The bus 530 may represent one or more of several types of bus structures, including a storage unit bus or a storage unit controller, a peripheral bus, a graphics acceleration port, a processing unit, or a local area using any bus structure among multiple bus structures. bus.
电子设备500也可以与一个或多个外部设备700(例如键盘、指向设备、蓝牙设备等)通信,还可与一个或者多个使得用户能与该电子设备500交互的设备通信,和/或与使得该电子设备500能与一个或多个其它计算设备进行通信的任何设备(例如路由器、调制解调器等等)通信。这种通信可以通过输入/输出(I/O)接口550进行。并且,电子设备500还可以通过网络适配器560与一个或者多个网络(例如局域网(LAN),广域网(WAN)和/或公共网络,例如因特网)通信。如图所示,网络适配器560通过总线530与电子设备500的其它模块通信。应当明白,尽管图中未示出,可以结合电子设备500使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、RAID系统、磁带驱动器以及数据备份存储系统等。The electronic device 500 may also communicate with one or more external devices 700 (such as keyboards, pointing devices, Bluetooth devices, etc.), and may also communicate with one or more devices that enable a user to interact with the electronic device 500, and/or communicate with Any device (such as a router, modem, etc.) that enables the electronic device 500 to communicate with one or more other computing devices. This communication can be performed through an input/output (I/O) interface 550. In addition, the electronic device 500 may also communicate with one or more networks (for example, a local area network (LAN), a wide area network (WAN), and/or a public network, such as the Internet) through the network adapter 560. As shown in the figure, the network adapter 560 communicates with other modules of the electronic device 500 through the bus 530. It should be understood that although not shown in the figure, other hardware and/or software modules can be used in conjunction with the electronic device 500, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives And data backup storage system, etc.
通过以上的实施方式的描述,本领域的技术人员易于理解,这里描述的示例实施方式可以通过软件实现,也可以通过软件结合必要的硬件的方式来实现。因此,根据本申请实施方式的技术方案可以以软件产品的形式体现出来,该软件产品可以存储在一个非易失性存储介质(可以是CD-ROM,U盘,移动硬盘等)中或网络上,包括若干指令以使得一台计算设备(可以是个人计算机、服务器、终端装置、或者网络设备等)执行根据本申请实施方式的方法。Through the description of the above embodiments, those skilled in the art can easily understand that the example embodiments described here can be implemented by software, or can be implemented by combining software with necessary hardware. Therefore, the technical solution according to the embodiments of the present application can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, U disk, mobile hard disk, etc.) or on the network , Including several instructions to make a computing device (which can be a personal computer, a server, a terminal device, or a network device, etc.) execute the method according to the embodiment of the present application.
在本申请的示例性实施例中,还提供了一种计算机可读存储介质,其上存储有能够实现本说明书上述方法的程序产品。在一些可能的实施方式中,本申请的各个方面还可以实现为一种程序产品的形式,其包括程序代码,当所述程序产品在终端设备上运行时,所述程序代码用于使所述终端设备执行本说明书上述“示例性方法”部分中描述的根据本申请各种示例性实施方式的步骤。In the exemplary embodiment of the present application, a computer-readable storage medium is also provided, on which a program product capable of implementing the above method of this specification is stored. In some possible implementation manners, various aspects of the present application can also be implemented in the form of a program product, which includes program code. When the program product runs on a terminal device, the program code is used to make the The terminal device executes the steps according to various exemplary embodiments of the present application described in the above-mentioned "Exemplary Method" section of this specification.
参考图9所示,描述了根据本申请的实施方式的用于实现上述方法的程序产品600,其可以采用便携式紧凑盘只读存储器(CD-ROM)并包括程序代码,并可以在终端设备,例如个人电脑上运行。然而,本申请的程序产品不限于此,在本文件中,可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而且,可读存储介质可以是非易失性的,也可以是易失性的。Referring to FIG. 9, a program product 600 for implementing the above method according to an embodiment of the present application is described. It can adopt a portable compact disk read-only memory (CD-ROM) and include program code, and can be installed in a terminal device, For example, running on a personal computer. However, the program product of this application is not limited to this. In this document, the readable storage medium can be any tangible medium that contains or stores a program, and the program can be used by or in combination with an instruction execution system, device, or device. Moreover, the readable storage medium may be non-volatile or volatile.
所述程序产品可以采用一个或多个可读介质的任意组合。可读介质可以是可读信号介质或者可读存储介质。可读存储介质例如可以为但不限于电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。可读存储介质的更具体的例子(非穷举的列表)包括:具有一个或多个导线的电连接、便携式盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。The program product can use any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. The readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or a combination of any of the above. More specific examples (non-exhaustive list) of readable storage media include: electrical connections with one or more wires, portable disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Type programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了可读程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。可读信号介质还可以是可读存储介质以外的任何可读介质,该可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。The computer-readable signal medium may include a data signal propagated in baseband or as a part of a carrier wave, and readable program code is carried therein. This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. The readable signal medium may also be any readable medium other than a readable storage medium, and the readable medium may send, propagate, or transmit a program for use by or in combination with the instruction execution system, apparatus, or device.
可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于无线、有线、光缆、RF等等,或者上述的任意合适的组合。The program code contained on the readable medium can be transmitted by any suitable medium, including but not limited to wireless, wired, optical cable, RF, etc., or any suitable combination of the foregoing.
可以以一种或多种程序设计语言的任意组合来编写用于执行本申请操作的程序代码,所述程序设计语言包括面向对象的程序设计语言—诸如Java、C++等,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算设备上执行、部分地在用户设备上执行、作为一个独立的软件包执行、部分在用户计算设备上部分在远程计算设备上执行、或者完全在远程计算设备或服务器上执行。在涉及远程计算设备的情形中,远程计算设备可以通过任意种类的网络,包括局域网(LAN)或广域网(WAN),连接到用户计算设备,或者,可以连接到外部计算设备(例如利用因特网服务提供商来通过因特网连接)。The program code used to perform the operations of the present application can be written in any combination of one or more programming languages. The programming languages include object-oriented programming languages—such as Java, C++, etc., as well as conventional procedural programming languages. Programming language-such as "C" language or similar programming language. The program code can be executed entirely on the user's computing device, partly on the user's device, executed as an independent software package, partly on the user's computing device and partly executed on the remote computing device, or entirely on the remote computing device or server Executed on. In the case of a remote computing device, the remote computing device can be connected to a user computing device through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computing device (for example, using Internet service providers). Business to connect via the Internet).
此外,上述附图仅是根据本申请示例性实施例的方法所包括的处理的示意性说明,而不是限制目的。易于理解,上述附图所示的处理并不表明或限制这些处理的时间顺序。另外,也易于理解,这些处理可以是例如在多个模块中同步或异步执行的。In addition, the above-mentioned drawings are merely schematic illustrations of the processing included in the method according to the exemplary embodiments of the present application, and are not intended for limitation. It is easy to understand that the processing shown in the above drawings does not indicate or limit the time sequence of these processings. In addition, it is easy to understand that these processes can be executed synchronously or asynchronously in multiple modules, for example.
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本申请的其他实施例。本申请旨在涵盖本申请的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本申请未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本申请的真正范围和精神由权利要求指出。After considering the specification and practicing the invention disclosed herein, those skilled in the art will easily think of other embodiments of the present application. This application is intended to cover any variations, uses, or adaptive changes of this application. These variations, uses, or adaptive changes follow the general principles of this application and include common knowledge or customary technical means in the technical field that are not disclosed in this application. . The description and the embodiments are only regarded as exemplary, and the true scope and spirit of the application are pointed out by the claims.

Claims (20)

  1. 一种人脸关键点的检测方法,其中,包括:A method for detecting key points of a human face, which includes:
    获取包含人脸的待标注图像;Obtain an image to be annotated containing a human face;
    将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;Input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and each point in the image to be annotated Occlusion confidence level;
    根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;Determining the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
    根据所述目标关键点坐标,对所述待标注图像进行关键点标注。According to the coordinate of the target key point, the key point is marked on the image to be marked.
  2. 根据权利要求1所述的检测方法,其中,根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标,包括:The detection method according to claim 1, wherein, according to the heat map, the predicted key point coordinates, and the occlusion confidence, determining the target key point coordinates of the image to be annotated comprises:
    将所述待标注图像中各点位的遮挡置信度与预先设定的遮挡阈值进行比较,确定遮挡置信度小于所述遮挡阈值的待处理点位;Comparing the occlusion confidence of each point in the image to be labeled with a preset occlusion threshold, and determine the point to be processed with the occlusion confidence less than the occlusion threshold;
    根据所述待处理点位,从所述预测关键点坐标中选取与所述待处理点位相对应的预测关键点坐标作为第一关键点坐标;According to the point to be processed, the predicted key point coordinate corresponding to the point to be processed is selected from the predicted key point coordinate as the first key point coordinate;
    根据所述待处理点位,从所述热力图中选取除与所述待处理点位相对应的位置之外的高亮点位的坐标作为第二关键点坐标;According to the point to be processed, the coordinates of the highlight point other than the position corresponding to the point to be processed are selected from the heat map as the second key point coordinate;
    将所述第一关键点坐标与所述第二关键点坐标进行整合,确定所述待标注图像的目标关键点坐标。The first key point coordinates and the second key point coordinates are integrated to determine the target key point coordinates of the image to be annotated.
  3. 根据权利要求2所述的检测方法,其中,所述检测方法还包括:The detection method according to claim 2, wherein the detection method further comprises:
    获取遮挡训练样本集合,所述遮挡训练样本集合包括多个遮挡样本图像,所述遮挡样本图像中的人脸包括部分遮挡;Acquiring an occlusion training sample set, the occlusion training sample set includes a plurality of occlusion sample images, and the face in the occlusion sample image includes partial occlusion;
    将所述遮挡训练样本集合中的遮挡样本图像输入至所述关键点标注模型中,以使所述关键点标注模型输出所述遮挡样本图像中各点位的遮挡置信度;Inputting the occlusion sample images in the occlusion training sample set into the key point annotation model, so that the key point annotation model outputs the occlusion confidence of each point in the occlusion sample image;
    根据多个所述遮挡样本图像中各点位的遮挡置信度,确定遮挡阈值。The occlusion threshold is determined according to the occlusion confidence of each point in the multiple occlusion sample images.
  4. 根据权利要求3所述的检测方法,其中,根据多个所述遮挡样本图像中各点位的遮挡置信度,确定遮挡阈值,包括:The detection method according to claim 3, wherein determining the occlusion threshold according to the occlusion confidence of each point in the plurality of occlusion sample images includes:
    从多个所述遮挡样本图像中各点位的遮挡置信度中,选取对应于所述遮挡样本图像中无遮挡位置的点位的遮挡置信度作为待选置信度;From the occlusion confidence levels of each point in the multiple occlusion sample images, select the occlusion confidence level of the point corresponding to the unoccluded position in the occlusion sample image as the confidence level to be selected;
    从所述待选置信度中按照从大到小的顺序,选取排列在预定比例位置的遮挡置信度作为遮挡阈值。From the to-be-selected confidences, in descending order, select the occlusion confidences arranged at a predetermined ratio position as the occlusion threshold.
  5. 根据权利要求1所述的检测方法,其中,所述检测方法还包括:The detection method according to claim 1, wherein the detection method further comprises:
    获取训练样本集合,所述训练样本集合中包含多个包含人脸的样本图像,所述样本图像中包含关键点信息;Acquiring a training sample set, where the training sample set includes a plurality of sample images including human faces, and the sample images include key point information;
    将所述样本图像输入至待训练的关键点标注模型中,以使所述关键点标注模型输出与所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度;The sample image is input into the key point annotation model to be trained, so that the key point annotation model outputs the heat map corresponding to the sample image, the predicted key point coordinates, and the occlusion of each point in the sample image Confidence;
    根据所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度,确定所述样本图像中的目标关键点坐标;Determine the target key point coordinates in the sample image according to the heat map corresponding to the sample image, the predicted key point coordinates, and the occlusion confidence of each point in the sample image;
    调整待训练的所述关键点标注模型中的参数,以使所述样本图像中的目标关键点坐标与所述关键点信息相匹配。Adjust the parameters in the key point labeling model to be trained so that the target key point coordinates in the sample image match the key point information.
  6. 根据权利要求5所述的检测方法,其中,调整待训练的所述关键点标注模型中的参数,以使所述样本图像中的目标关键点坐标与所述关键点信息相匹配,包括:The detection method according to claim 5, wherein adjusting the parameters in the key point labeling model to be trained so that the target key point coordinates in the sample image match the key point information comprises:
    将所述训练样本集合输入至不同学习率的待训练的关键点标注模型中,以使各关键点标注模型分别输出训练数据,所述训练数据包括各样本图像对应的热力图、预测关键点坐标以及样本图像中各点位的遮挡置信度;The training sample set is input into the key point labeling models to be trained with different learning rates, so that each key point labeling model outputs training data respectively, and the training data includes the heat map corresponding to each sample image and the predicted key point coordinates And the occlusion confidence of each point in the sample image;
    根据多组所述训练数据进行统计,从多个所述训练数据中识别出目标训练数据;Perform statistics based on multiple sets of the training data, and identify target training data from the multiple sets of training data;
    根据所述目标训练数据对所述样本图像包含的关键点信息进行更新,得到所述样本图像更新后的关键点信息;Updating the key point information contained in the sample image according to the target training data to obtain updated key point information of the sample image;
    调整待训练的所述关键点标注模型中的参数,以使所述目标关键点坐标与所述更新后的关键点信息相匹配。The parameters in the key point labeling model to be trained are adjusted to make the target key point coordinates match the updated key point information.
  7. 一种人脸关键点的检测装置,其中,包括:A detection device for key points of a human face, which includes:
    获取模块,用于获取包含人脸的待标注图像;The acquisition module is used to acquire the to-be-labeled image containing the human face;
    处理模块,用于将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;The processing module is used to input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and the to be annotated The occlusion confidence of each point in the image;
    确定模块,用于根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;A determining module, configured to determine the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
    标注模块,用于根据所述目标关键点坐标,对所述待标注图像进行关键点标注。The marking module is used to mark the key points of the image to be marked according to the coordinates of the target key points.
  8. 根据权利要求7所述的检测装置,其中,所述确定模块包括:The detection device according to claim 7, wherein the determining module comprises:
    确定单元,用于将所述待标注图像中各点位的遮挡置信度与预先设定的遮挡阈值进行比较,确定遮挡置信度小于所述遮挡阈值的待处理点位;A determining unit, configured to compare the occlusion confidence of each point in the to-be-labeled image with a preset occlusion threshold, and determine the point to be processed whose occlusion confidence is less than the occlusion threshold;
    第一选取单元,用于根据所述待处理点位,从所述预测关键点坐标中选取与所述待处理点位相对应的预测关键点坐标作为第一关键点坐标;The first selection unit is configured to select, from the predicted key point coordinates, the predicted key point coordinates corresponding to the point to be processed as the first key point coordinates according to the point to be processed;
    第二选取单元,用于根据所述待处理点位,从所述热力图中选取除与所述待处理点位相对应的位置之外的高亮点位的坐标作为第二关键点坐标;The second selection unit is configured to select, from the heat map, the coordinates of the highlight point except for the position corresponding to the point to be processed as the second key point coordinate according to the point to be processed;
    整合单元,用于将所述第一关键点坐标与所述第二关键点坐标进行整合,确定所述待标注图像的目标关键点坐标。The integration unit is configured to integrate the first key point coordinates and the second key point coordinates to determine the target key point coordinates of the image to be annotated.
  9. 一种计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现步骤包括:A computer-readable storage medium having a computer program stored thereon, wherein the implementation steps when the computer program is executed by a processor include:
    获取包含人脸的待标注图像;Obtain an image to be annotated containing a human face;
    将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;Input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and each point in the image to be annotated Occlusion confidence level;
    根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;Determining the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
    根据所述目标关键点坐标,对所述待标注图像进行关键点标注。According to the coordinate of the target key point, the key point is marked on the image to be marked.
  10. 根据权利要求9所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现步骤还包括:The computer-readable storage medium according to claim 9, wherein the step of implementing the computer program when the computer program is executed by the processor further comprises:
    将所述待标注图像中各点位的遮挡置信度与预先设定的遮挡阈值进行比较,确定遮挡置信度小于所述遮挡阈值的待处理点位;Comparing the occlusion confidence of each point in the image to be labeled with a preset occlusion threshold, and determine the point to be processed with the occlusion confidence less than the occlusion threshold;
    根据所述待处理点位,从所述预测关键点坐标中选取与所述待处理点位相对应的预测关键点坐标作为第一关键点坐标;According to the point to be processed, the predicted key point coordinate corresponding to the point to be processed is selected from the predicted key point coordinate as the first key point coordinate;
    根据所述待处理点位,从所述热力图中选取除与所述待处理点位相对应的位置之外的高亮点位的坐标作为第二关键点坐标;According to the point to be processed, the coordinates of the highlight point other than the position corresponding to the point to be processed are selected from the heat map as the second key point coordinate;
    将所述第一关键点坐标与所述第二关键点坐标进行整合,确定所述待标注图像的目标关键点坐标。The first key point coordinates and the second key point coordinates are integrated to determine the target key point coordinates of the image to be annotated.
  11. 根据权利要求10所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现步骤还包括:The computer-readable storage medium according to claim 10, wherein the step of implementing the computer program when the computer program is executed by the processor further comprises:
    获取遮挡训练样本集合,所述遮挡训练样本集合包括多个遮挡样本图像,所述遮挡样本图像中的人脸包括部分遮挡;Acquiring an occlusion training sample set, the occlusion training sample set includes a plurality of occlusion sample images, and the face in the occlusion sample image includes partial occlusion;
    将所述遮挡训练样本集合中的遮挡样本图像输入至所述关键点标注模型中,以使所述关键点标注模型输出所述遮挡样本图像中各点位的遮挡置信度;Inputting the occlusion sample images in the occlusion training sample set into the key point annotation model, so that the key point annotation model outputs the occlusion confidence of each point in the occlusion sample image;
    根据多个所述遮挡样本图像中各点位的遮挡置信度,确定遮挡阈值。The occlusion threshold is determined according to the occlusion confidence of each point in the multiple occlusion sample images.
  12. 根据权利要求11所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现步骤还包括:The computer-readable storage medium according to claim 11, wherein the step of implementing the computer program when the computer program is executed by the processor further comprises:
    从多个所述遮挡样本图像中各点位的遮挡置信度中,选取对应于所述遮挡样本图像中无遮挡位置的点位的遮挡置信度作为待选置信度;From the occlusion confidence levels of each point in the multiple occlusion sample images, select the occlusion confidence level of the point corresponding to the unoccluded position in the occlusion sample image as the confidence level to be selected;
    从所述待选置信度中按照从大到小的顺序,选取排列在预定比例位置的遮挡置信度作为遮挡阈值。From the to-be-selected confidences, in descending order, select the occlusion confidences arranged at a predetermined ratio position as the occlusion threshold.
  13. 根据权利要求9所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现步骤还包括:The computer-readable storage medium according to claim 9, wherein the step of implementing the computer program when the computer program is executed by the processor further comprises:
    获取训练样本集合,所述训练样本集合中包含多个包含人脸的样本图像,所述样本图像中包含关键点信息;Acquiring a training sample set, where the training sample set includes a plurality of sample images including human faces, and the sample images include key point information;
    将所述样本图像输入至待训练的关键点标注模型中,以使所述关键点标注模型输出与所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度;The sample image is input into the key point annotation model to be trained, so that the key point annotation model outputs the heat map corresponding to the sample image, the predicted key point coordinates, and the occlusion of each point in the sample image Confidence;
    根据所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度,确定所述样本图像中的目标关键点坐标;Determine the target key point coordinates in the sample image according to the heat map corresponding to the sample image, the predicted key point coordinates, and the occlusion confidence of each point in the sample image;
    调整待训练的所述关键点标注模型中的参数,以使所述样本图像中的目标关键点坐标与所述关键点信息相匹配。Adjust the parameters in the key point labeling model to be trained so that the target key point coordinates in the sample image match the key point information.
  14. 根据权利要求13所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现步骤还包括:The computer-readable storage medium according to claim 13, wherein the step of implementing when the computer program is executed by the processor further comprises:
    将所述训练样本集合输入至不同学习率的待训练的关键点标注模型中,以使各关键点标注模型分别输出训练数据,所述训练数据包括各样本图像对应的热力图、预测关键点坐标以及样本图像中各点位的遮挡置信度;The training sample set is input into the key point labeling models to be trained with different learning rates, so that each key point labeling model outputs training data respectively, and the training data includes the heat map corresponding to each sample image and the predicted key point coordinates And the occlusion confidence of each point in the sample image;
    根据多组所述训练数据进行统计,从多个所述训练数据中识别出目标训练数据;Perform statistics based on multiple sets of the training data, and identify target training data from the multiple sets of training data;
    根据所述目标训练数据对所述样本图像包含的关键点信息进行更新,得到所述样本图像更新后的关键点信息;Updating the key point information contained in the sample image according to the target training data to obtain updated key point information of the sample image;
    调整待训练的所述关键点标注模型中的参数,以使所述目标关键点坐标与所述更新后的关键点信息相匹配。The parameters in the key point labeling model to be trained are adjusted to make the target key point coordinates match the updated key point information.
  15. 一种电子设备,其中,包括:An electronic device, including:
    处理器;以及Processor; and
    存储器,其上存储有计算机程序;A memory on which a computer program is stored;
    其中,所述处理器配置为经由执行所述计算机程序来实现的步骤包括:Wherein, the steps that the processor is configured to be implemented by executing the computer program include:
    获取包含人脸的待标注图像;Obtain an image to be annotated containing a human face;
    将所述待标注图像输入至预先训练完成的关键点标注模型,以使所述关键点标注模型输出所述待标注图像对应的热力图、预测关键点坐标以及所述待标注图像中各点位的遮挡置信度;Input the image to be annotated into the pre-trained key point annotation model, so that the key point annotation model outputs the heat map corresponding to the image to be annotated, predicted key point coordinates, and each point in the image to be annotated Occlusion confidence level;
    根据所述热力图、所述预测关键点坐标以及所述遮挡置信度,确定所述待标注图像的目标关键点坐标;Determining the target key point coordinates of the image to be annotated according to the heat map, the predicted key point coordinates, and the occlusion confidence;
    根据所述目标关键点坐标,对所述待标注图像进行关键点标注。According to the coordinate of the target key point, the key point is marked on the image to be marked.
  16. 根据权利要求15所述电子设备,其中,所述处理器配置为经由执行所述计算机程序来实现的步骤还包括:The electronic device according to claim 15, wherein the step of configuring the processor to be implemented via execution of the computer program further comprises:
    将所述待标注图像中各点位的遮挡置信度与预先设定的遮挡阈值进行比较,确定遮挡置信度小于所述遮挡阈值的待处理点位;Comparing the occlusion confidence of each point in the to-be-annotated image with a preset occlusion threshold, and determine the point to be processed whose occlusion confidence is less than the occlusion threshold;
    根据所述待处理点位,从所述预测关键点坐标中选取与所述待处理点位相对应的预测关键点坐标作为第一关键点坐标;According to the point to be processed, the predicted key point coordinate corresponding to the point to be processed is selected from the predicted key point coordinate as the first key point coordinate;
    根据所述待处理点位,从所述热力图中选取除与所述待处理点位相对应的位置之外的高亮点位的坐标作为第二关键点坐标;According to the point to be processed, the coordinates of the highlight point other than the position corresponding to the point to be processed are selected from the heat map as the second key point coordinate;
    将所述第一关键点坐标与所述第二关键点坐标进行整合,确定所述待标注图像的目标关键点坐标。The first key point coordinates and the second key point coordinates are integrated to determine the target key point coordinates of the image to be annotated.
  17. 根据权利要求16所述电子设备,其中,所述处理器配置为经由执行所述计算机程序来实现的步骤还包括:The electronic device according to claim 16, wherein the step of configuring the processor to be implemented via execution of the computer program further comprises:
    获取遮挡训练样本集合,所述遮挡训练样本集合包括多个遮挡样本图像,所述遮挡样本图像中的人脸包括部分遮挡;Acquiring an occlusion training sample set, the occlusion training sample set includes a plurality of occlusion sample images, and the face in the occlusion sample image includes partial occlusion;
    将所述遮挡训练样本集合中的遮挡样本图像输入至所述关键点标注模型中,以使所述关键点标注模型输出所述遮挡样本图像中各点位的遮挡置信度;Inputting the occlusion sample images in the occlusion training sample set into the key point annotation model, so that the key point annotation model outputs the occlusion confidence of each point in the occlusion sample image;
    根据多个所述遮挡样本图像中各点位的遮挡置信度,确定遮挡阈值。The occlusion threshold is determined according to the occlusion confidence of each point in the multiple occlusion sample images.
  18. 根据权利要求17所述电子设备,其中,所述处理器配置为经由执行所述计算机程序来实现的步骤还包括:The electronic device according to claim 17, wherein the step of configuring the processor to be implemented via execution of the computer program further comprises:
    从多个所述遮挡样本图像中各点位的遮挡置信度中,选取对应于所述遮挡样本图像中无遮挡位置的点位的遮挡置信度作为待选置信度;From the occlusion confidence levels of each point in the multiple occlusion sample images, select the occlusion confidence level of the point corresponding to the unoccluded position in the occlusion sample image as the confidence level to be selected;
    从所述待选置信度中按照从大到小的顺序,选取排列在预定比例位置的遮挡置信度作为遮挡阈值。From the to-be-selected confidences, in descending order, select the occlusion confidences arranged at a predetermined ratio position as the occlusion threshold.
  19. 根据权利要求15所述电子设备,其中,所述处理器配置为经由执行所述计算机程序来实现的步骤还包括:The electronic device according to claim 15, wherein the step of configuring the processor to be implemented via execution of the computer program further comprises:
    获取训练样本集合,所述训练样本集合中包含多个包含人脸的样本图像,所述样本图像中包含关键点信息;Acquiring a training sample set, where the training sample set includes a plurality of sample images including human faces, and the sample images include key point information;
    将所述样本图像输入至待训练的关键点标注模型中,以使所述关键点标注模型输出与所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度;The sample image is input into the key point annotation model to be trained, so that the key point annotation model outputs the heat map corresponding to the sample image, the predicted key point coordinates, and the occlusion of each point in the sample image Confidence;
    根据所述样本图像对应的热力图、预测关键点坐标以及所述样本图像中各点位的遮挡置信度,确定所述样本图像中的目标关键点坐标;Determine the target key point coordinates in the sample image according to the heat map corresponding to the sample image, the predicted key point coordinates, and the occlusion confidence of each point in the sample image;
    调整待训练的所述关键点标注模型中的参数,以使所述样本图像中的目标关键点坐标与所述关键点信息相匹配。Adjust the parameters in the key point labeling model to be trained so that the target key point coordinates in the sample image match the key point information.
  20. 根据权利要求19所述电子设备,其中,所述处理器配置为经由执行所述计算机程序来实现的步骤还包括:The electronic device according to claim 19, wherein the step of configuring the processor to be implemented via execution of the computer program further comprises:
    将所述训练样本集合输入至不同学习率的待训练的关键点标注模型中,以使各关键点标注模型分别输出训练数据,所述训练数据包括各样本图像对应的热力图、预测关键点坐标以及样本图像中各点位的遮挡置信度;The training sample set is input into the key point labeling models to be trained with different learning rates, so that each key point labeling model outputs training data respectively, and the training data includes the heat map corresponding to each sample image and the predicted key point coordinates And the occlusion confidence of each point in the sample image;
    根据多组所述训练数据进行统计,从多个所述训练数据中识别出目标训练数据;Perform statistics based on multiple sets of the training data, and identify target training data from the multiple sets of training data;
    根据所述目标训练数据对所述样本图像包含的关键点信息进行更新,得到所述样本图像更新后的关键点信息;Updating the key point information contained in the sample image according to the target training data to obtain updated key point information of the sample image;
    调整待训练的所述关键点标注模型中的参数,以使所述目标关键点坐标与所述更新后的关键点信息相匹配。The parameters in the key point labeling model to be trained are adjusted to make the target key point coordinates match the updated key point information.
PCT/CN2021/084220 2020-11-12 2021-03-31 Detection method and apparatus for facial key point, and storage medium and electronic device WO2021204037A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011264438.2A CN112380981A (en) 2020-11-12 2020-11-12 Face key point detection method and device, storage medium and electronic equipment
CN202011264438.2 2020-11-12

Publications (1)

Publication Number Publication Date
WO2021204037A1 true WO2021204037A1 (en) 2021-10-14

Family

ID=74583598

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/084220 WO2021204037A1 (en) 2020-11-12 2021-03-31 Detection method and apparatus for facial key point, and storage medium and electronic device

Country Status (2)

Country Link
CN (1) CN112380981A (en)
WO (1) WO2021204037A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115578451A (en) * 2022-09-30 2023-01-06 北京百度网讯科技有限公司 Image processing method, and training method and device of image processing model

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112380981A (en) * 2020-11-12 2021-02-19 平安科技(深圳)有限公司 Face key point detection method and device, storage medium and electronic equipment
CN113361334B (en) * 2021-05-18 2022-07-22 山东师范大学 Convolutional pedestrian re-identification method and system based on key point optimization and multi-hop intention
CN113313189A (en) * 2021-06-11 2021-08-27 上海高德威智能交通系统有限公司 Shielding detection method and device and electronic equipment
CN113505763B (en) * 2021-09-09 2022-02-01 北京爱笔科技有限公司 Key point detection method and device, electronic equipment and storage medium
CN114462495B (en) * 2021-12-30 2023-04-07 浙江大华技术股份有限公司 Training method of face shielding detection model and related device
CN114937300A (en) * 2022-05-20 2022-08-23 北京数美时代科技有限公司 Method and system for identifying shielded face

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170083752A1 (en) * 2015-09-18 2017-03-23 Yahoo! Inc. Face detection
CN109657675A (en) * 2018-12-06 2019-04-19 广州景骐科技有限公司 Image labeling method, device, computer equipment and readable storage medium storing program for executing
CN110046600A (en) * 2019-04-24 2019-07-23 北京京东尚科信息技术有限公司 Method and apparatus for human testing
CN110532981A (en) * 2019-09-03 2019-12-03 北京字节跳动网络技术有限公司 Human body key point extracting method, device, readable storage medium storing program for executing and equipment
CN111027504A (en) * 2019-12-18 2020-04-17 上海眼控科技股份有限公司 Face key point detection method, device, equipment and storage medium
CN112380981A (en) * 2020-11-12 2021-02-19 平安科技(深圳)有限公司 Face key point detection method and device, storage medium and electronic equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170083752A1 (en) * 2015-09-18 2017-03-23 Yahoo! Inc. Face detection
CN109657675A (en) * 2018-12-06 2019-04-19 广州景骐科技有限公司 Image labeling method, device, computer equipment and readable storage medium storing program for executing
CN110046600A (en) * 2019-04-24 2019-07-23 北京京东尚科信息技术有限公司 Method and apparatus for human testing
CN110532981A (en) * 2019-09-03 2019-12-03 北京字节跳动网络技术有限公司 Human body key point extracting method, device, readable storage medium storing program for executing and equipment
CN111027504A (en) * 2019-12-18 2020-04-17 上海眼控科技股份有限公司 Face key point detection method, device, equipment and storage medium
CN112380981A (en) * 2020-11-12 2021-02-19 平安科技(深圳)有限公司 Face key point detection method and device, storage medium and electronic equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115578451A (en) * 2022-09-30 2023-01-06 北京百度网讯科技有限公司 Image processing method, and training method and device of image processing model
CN115578451B (en) * 2022-09-30 2024-01-23 北京百度网讯科技有限公司 Image processing method, training method and device of image processing model

Also Published As

Publication number Publication date
CN112380981A (en) 2021-02-19

Similar Documents

Publication Publication Date Title
WO2021204037A1 (en) Detection method and apparatus for facial key point, and storage medium and electronic device
US11782981B2 (en) Method, apparatus, server, and storage medium for incorporating structured entity
US20200152174A1 (en) Method, Apparatus, and System for Conflict Detection and Resolution for Competing Intent Classifiers in Modular Conversation System
WO2019233421A1 (en) Image processing method and device, electronic apparatus, and storage medium
US20200301514A1 (en) Gesture recognition method, device, electronic device, and storage medium
WO2021249528A1 (en) Intelligent dialogue method and apparatus and electronic device
WO2021189889A1 (en) Text detection method and apparatus in scene image, computer device, and storage medium
US20220130146A1 (en) Method for recognizing video action, and device and storage medium thereof
WO2021017286A1 (en) Facial recognition method and apparatus, electronic device and non-volatile computer readable storage medium
WO2022105118A1 (en) Image-based health status identification method and apparatus, device and storage medium
CN109409241A (en) Video checking method, device, equipment and readable storage medium storing program for executing
WO2021174812A1 (en) Data cleaning method and apparatus for profile, and medium and electronic device
JP2022531620A (en) A method of learning a deep learning network by AI and a learning device using this
WO2021151321A1 (en) Method and apparatus for identifying lane line, and device and storage medium
WO2022227218A1 (en) Drug name recognition method and apparatus, and computer device and storage medium
CN111881683A (en) Method and device for generating relation triples, storage medium and electronic equipment
CN110827799A (en) Method, apparatus, device and medium for processing voice signal
CN110850982A (en) AR-based human-computer interaction learning method, system, device and storage medium
WO2019214019A1 (en) Online teaching method and apparatus based on convolutional neural network
WO2024051823A1 (en) Method for managing reception information and back-end device
WO2021174814A1 (en) Answer verification method and apparatus for crowdsourcing task, computer device, and storage medium
CN114067790A (en) Voice information processing method, device, equipment and storage medium
CN115497458B (en) Continuous learning method and device of intelligent voice assistant, electronic equipment and medium
WO2020252880A1 (en) Reverse turing verification method and apparatus, storage medium, and electronic device
WO2021151354A1 (en) Word recognition method and apparatus, computer device, and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21783853

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21783853

Country of ref document: EP

Kind code of ref document: A1