CN114511909A - Face brushing payment intention identification method, device and equipment - Google Patents
- Publication number
- CN114511909A (application number CN202210180422.6A)
- Authority
- CN
- China
- Prior art keywords
- face
- image
- brushing
- candidate
- face brushing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q20/00—Payment architectures, schemes or protocols
- G06Q20/38—Payment protocols; Details thereof
- G06Q20/40—Authorisation, e.g. identification of payer or payee, verification of customer or shop credentials; Review and approval of payers, e.g. check credit lines or negative lists
- G06Q20/401—Transaction verification
- G06Q20/4014—Identity check for transactions
- G06Q20/40145—Biometric identity checks
Abstract
The embodiment of the specification discloses a method, an apparatus, and a device for recognizing face-brushing payment willingness. The scheme comprises the following steps: acquiring a face brushing image, and determining candidates to be identified in the face brushing image; generating, according to the region of each candidate in the face brushing image, a corresponding mask map, so as to distinguish that region from other regions in the face brushing image; extracting features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask map; and identifying, according to the fusion features, whether each candidate has the face-brushing payment willingness. The security of face-brushing payment can thereby be improved.
Description
Technical Field
The specification relates to the technical field of computers, in particular to a method, a device and equipment for recognizing willingness to pay by swiping a face.
Background
With the development of computer and internet technologies, many services can be performed online, which has promoted the development of various online service platforms. Face-brushing payment is a new payment mode based on technologies such as artificial intelligence, machine vision, 3D sensing, and big data. By adopting face recognition for identity authentication, it brings great convenience to users and is widely welcomed by them.
At present, in a face-brushing payment scenario, after face-brushing payment is started, the user who is to pay needs to stand in front of a device with the face-brushing payment function to undergo face recognition. However, during the face-brushing process, several users may be standing in front of the device, so that multiple users appear in the face brushing image captured by the device. When the device then performs face recognition on the face brushing image, it is difficult to determine which user is the one currently paying, that is, which user has the face-brushing payment willingness. In other words, only the user currently paying has that willingness, and the other users do not.
Face-brushing payment willingness recognition is therefore an important link in safeguarding face-brushing security in the payment system and helps improve the safety of the face-brushing experience. However, if the device recognizes another user instead, a mistaken face-brushing payment occurs, which reduces the security of face-brushing payment.
Based on this, a more secure identification scheme is required for face-brushing payments.
Disclosure of Invention
One or more embodiments of the present specification provide a method, an apparatus, a device, and a storage medium for recognizing a willingness to pay by swiping a face, so as to solve the following technical problems: a more secure identification scheme is needed for face-brushing payments.
To solve the above technical problem, one or more embodiments of the present specification are implemented as follows:
one or more embodiments of the present specification provide a method for recognizing a willingness to pay by swiping a face, including:
acquiring a face brushing image, and determining a candidate to be identified in the face brushing image;
according to the located area of each candidate in the face brushing image, respectively generating corresponding mask images to distinguish the located area from other areas in the face brushing image;
extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask image;
and identifying whether each candidate has a face brushing willingness to pay according to the fusion characteristics.
One or more embodiments of the present specification provide a device for recognizing a willingness to pay by swiping a face, including:
the acquisition module acquires a face brushing image and determines a candidate to be identified in the face brushing image;
the generating module is used for respectively generating corresponding mask images according to the areas of the candidate persons in the face brushing image so as to distinguish the areas from other areas in the face brushing image;
the extraction module is used for extracting the features of the face brushing image and obtaining fusion features according to the features of the face brushing image and the mask image;
and the identification module identifies whether each candidate has a face brushing willingness to pay according to the fusion characteristics.
One or more embodiments of the present specification provide a device for recognizing a willingness to pay by swiping a face, including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to:
acquiring a face brushing image, and determining a candidate to be identified in the face brushing image;
according to the located area of each candidate in the face brushing image, respectively generating corresponding mask images to distinguish the located area from other areas in the face brushing image;
extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask image;
and identifying whether each candidate has a face brushing willingness to pay according to the fusion characteristics.
One or more embodiments of the present specification provide a non-volatile computer storage medium having stored thereon computer-executable instructions configured to:
acquiring a face brushing image, and determining a candidate to be identified in the face brushing image;
according to the located area of each candidate in the face brushing image, respectively generating corresponding mask images to distinguish the located area from other areas in the face brushing image;
extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask image;
and identifying whether each candidate has a face brushing willingness to pay according to the fusion characteristics.
At least one technical scheme adopted by one or more embodiments of the specification can achieve the following beneficial effects:
By generating a corresponding mask map for the region of each candidate in the face brushing image, the feature information of each candidate is made clearer and the difference between having and not having the face-brushing payment willingness is increased. Identifying, through the fusion features, whether each candidate has the face-brushing payment willingness achieves an image contrast enhancement effect and focuses attention on the candidate who has the willingness, so that candidates with and without the face-brushing willingness in the face brushing image can be accurately distinguished, the face-brushing payment willingness of the candidates in the face brushing image is recognized in a more targeted manner, the face-brushing security experience is enhanced, and the availability and security of the face-brushing system are effectively guaranteed.
Drawings
In order to more clearly illustrate the embodiments of the present specification or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, it is obvious that the drawings in the following description are only some embodiments described in the present specification, and for those skilled in the art, other drawings can be obtained according to the drawings without any creative effort.
Fig. 1 is a schematic flow chart of a method for recognizing a willingness to pay by swiping a face according to one or more embodiments of the present disclosure;
fig. 2 is a schematic diagram of a framework of a system for recognizing a willingness to swipe payment provided in one or more embodiments of the present specification;
fig. 3 is a schematic flowchart of a method for recognizing a willingness to pay by swiping based on end-to-end learning of a deep convolutional neural network, according to one or more embodiments of the present disclosure;
fig. 4 is a schematic structural diagram of a device for recognizing a willingness to pay by brushing face according to one or more embodiments of the present disclosure;
fig. 5 is a schematic structural diagram of a device for recognizing a willingness to swipe payment according to one or more embodiments of the present disclosure.
Detailed Description
The embodiment of the specification provides a method, a device, equipment and a storage medium for recognizing a brushing willingness-to-pay.
In order to make those skilled in the art better understand the technical solutions in the present specification, the technical solutions in the embodiments of the present specification will be clearly and completely described below with reference to the drawings in the embodiments of the present specification, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without making any inventive step based on the embodiments of the present disclosure, shall fall within the scope of protection of the present application.
Fig. 1 is a schematic flowchart of a method for recognizing a willingness to pay by swiping a face according to one or more embodiments of the present disclosure. The process can be executed by an electronic device with a face-brushing payment function, the electronic device can be a terminal with an image data processing function, for example, the electronic device can be a mobile terminal such as a mobile phone, a tablet, a notebook, and the like, and can also be a fixed terminal such as a desktop, or a server, and some input parameters or intermediate results in the process allow manual intervention and adjustment to help improve accuracy.
The process in fig. 1 may include the following steps:
s102: the face brushing method comprises the steps of obtaining a face brushing image, and determining a candidate to be identified in the face brushing image.
In one or more embodiments of the present description, after receiving the face brushing payment instruction, the electronic device may acquire the face brushing image through a pre-installed image capturing device, or the electronic device may generate the face brushing payment instruction according to a payment order and acquire the face brushing image through the image capturing device. The brushing image may be a single frame image obtained from a video or an image.
A candidate to be identified is a user who needs to pay the relevant fee. It should be noted that, in order to perform face-brushing payment, a candidate needs to register identity information on the corresponding client and enroll face information in advance, so that when the candidate starts face-brushing payment and is recognized as having the face-brushing payment willingness, identity authentication is performed on the candidate through the pre-registered face information.
That is, the face information of the candidate is included in the face brushing image, whether the candidate has a willingness to brush the face can be obtained by identifying the face information, and then the candidate can be authenticated through the face information.
The number of candidates in front of the image pickup apparatus may be one or more, and when the number of candidates is plural, the face brushing image includes plural candidates, and when the number of candidates is one, the face brushing image includes one candidate. Meanwhile, the face brushing image not only includes face information of the candidate, but also includes other characteristic information of the candidate, such as trunk information and limb information, and also includes other objects which do not need to be identified, such as tables and chairs, hanging objects and the like included in the environment where the candidate is located.
In addition, in a normal case, when the electronic device executes a single face-brushing payment instruction, the specific candidate currently starting face-brushing payment is subjected to identity authentication, and the specific candidate generally has a face-brushing willingness, that is, when the single face-brushing payment instruction is executed, even if a plurality of candidates are included in a face-brushing image, the plurality of candidates do not all have the face-brushing willingness, only the specific candidate has the face-brushing willingness, and the specific candidate can be considered as safe in the willingness to pay, and other candidates are not safe in the willingness to pay.
For example, in public places, offline Internet of Things (IoT) face-brushing machines are often used for face-brushing payments. An IoT face-brushing machine in a public place is a machine with a face-brushing function deployed in public consumption scenarios such as supermarkets, convenience stores, restaurants, hotels, campuses, education, and medical care.
If candidate A clicks face-brushing payment and thereby starts it, the IoT face-brushing machine receives a face-brushing payment instruction and acquires a face brushing image through its camera. Because the IoT face-brushing machine is located in an open public place where multiple candidates may be queuing to pay, the acquired face brushing image may contain multiple candidates. Among these candidates, however, only candidate A actually has the face-brushing payment willingness. In this case, identity authentication should be performed on candidate A through candidate A's face information only on the premise that candidate A is recognized as having the face-brushing payment willingness; the other candidates do not actually have it.
Further, if candidate B is queued behind candidate A, then even though candidate B has no face-brushing willingness, the camera may capture candidate B while capturing candidate A during the face-brushing authentication, so that the captured face brushing image contains both candidate A and candidate B. If, when recognizing the face brushing image, the electronic device does not take candidate A as the face-brushing user but recognizes candidate B instead, it directly authenticates candidate B without identifying whether candidate B has the face-brushing payment willingness; after the authentication passes, the payment is made through candidate B's account, so candidate B's assets are mistakenly charged and lost.
S104: and respectively generating corresponding mask images according to the located areas of the candidate persons in the face brushing image so as to distinguish the located areas from other areas in the face brushing image.
In one or more embodiments of the present specification, the region may include appearance feature information of the candidate, such as face information, torso information, and limb information, but in order to increase accuracy of the recognition result, the region mainly includes the face information of the candidate. Meanwhile, the located area can be determined according to the position information of the candidate in the face brushing image.
The masking operation on an image recalculates the value of each pixel in the image through a mask kernel: the mask kernel describes how strongly the pixels in the neighborhood influence the new pixel value, and the pixels are weighted and averaged according to the weighting factors in the mask operator. Image masking operations are commonly used in areas such as image smoothing, edge detection, and feature analysis, so the region of a candidate in the face brushing image can be distinguished from other regions of the face brushing image through the masking operation.
It should be noted that, a single candidate corresponds to a single mask map in a region of the face brushing image, that is, if there are multiple candidates in the face brushing image, a corresponding mask map is generated for each candidate, and multiple mask maps are finally obtained.
That is, in the single mask map, the located region can be distinguished from other regions, for example, the located region filling value is 1, and the other region filling value is 0 (other different filling values with higher distinguishing degrees may also be adopted). That is, by generating a mask map corresponding to each candidate, the feature information of the candidate can be made clearer, and the difference between the willingness to swipe a face and the willingness to swipe no face can be increased.
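Purely as an illustrative sketch (not part of the original disclosure), and assuming NumPy arrays with each candidate's region given as a rectangular box, the per-candidate mask maps could be generated roughly as follows:

```python
import numpy as np

def make_candidate_masks(image_h, image_w, candidate_regions):
    """Build one binary mask map per candidate.

    candidate_regions: list of (x1, y1, x2, y2) boxes, one per candidate,
    giving the region of that candidate in the face brushing image.
    """
    masks = []
    for (x1, y1, x2, y2) in candidate_regions:
        mask = np.zeros((image_h, image_w), dtype=np.float32)  # other regions filled with 0
        mask[y1:y2, x1:x2] = 1.0                               # candidate's region filled with 1
        masks.append(mask)
    return masks
```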
S106: and extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask image.
In one or more embodiments of the present specification, how to extract features of a brush face image is not limited herein, for example, features of the brush face image are extracted by a feature extraction model. The features may include face features, torso features, and extremity features of each candidate. The face features can be global features of faces of the candidate persons, and the accuracy of the recognition result can be improved by recognizing the faces through the global features.
Of course, after the mask map is obtained, the fused features may be obtained by inputting the features of the brush face image and the mask map into the fused feature extraction model.
By extracting the features of the face brushing image and combining them with the mask map, a channel is in effect newly added to the features of the face brushing image, that is, the number of feature channels is increased, so as to obtain the fusion features. In the fusion features, more attention is paid to the face features of the corresponding candidate in the face brushing image, so candidates with and without a face-brushing willingness in the face brushing image can be distinguished more accurately.
S108: and identifying whether each candidate has a face brushing willingness to pay according to the fusion characteristics.
In one or more embodiments of the present specification, face brushing image information is determined according to the fusion features, and the face brushing image information includes the feature information of the candidates. In the fusion features, more attention is paid to the feature information of the candidate corresponding to the mask map, mainly the face information of that candidate. That is, each time one mask map is generated, a recognition process is performed to recognize whether the candidate corresponding to that mask map has the face-brushing payment willingness.
It should be noted that a preset rule may also be used to identify whether the corresponding candidate has the face-brushing willingness. For example, if the face region of the candidate is identified as being located in the middle region, the candidate is considered to have the face-brushing willingness; or, if the face region of the candidate is identified as occupying a large part of the area of the face brushing image and the face angle meets a preset angle threshold, the candidate is considered to have the face-brushing willingness.
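For illustration only, a preset rule of the kind just described could be sketched as follows; the box format, the yaw-angle input, and the threshold values are assumptions rather than values from the disclosure:

```python
def willing_by_preset_rule(face_box, face_yaw_deg, image_w, image_h,
                           min_area_ratio=0.08, max_yaw_deg=20.0):
    """Toy preset rule: face roughly centered, large enough, and close to frontal."""
    x1, y1, x2, y2 = face_box
    center_x = (x1 + x2) / 2.0
    in_middle = image_w / 3.0 <= center_x <= 2.0 * image_w / 3.0
    area_ratio = ((x2 - x1) * (y2 - y1)) / float(image_w * image_h)
    return in_middle and area_ratio >= min_area_ratio and abs(face_yaw_deg) <= max_yaw_deg
```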
Further, the fusion features can be input into a willingness-to-pay recognition model; the model recognizes the face brushing image information and outputs a processing result according to the preset rule. A face-brushing payment willingness probability value is then generated from the processing result, and whether the candidate has the face-brushing payment willingness can be judged from that probability value.
The processing result can be a vector which is generated by the willingness-to-pay recognition model and used for representing the probability value of the willingness-to-pay by brushing the face.
For example, if the probability value is greater than a preset probability threshold, the candidate may be considered to have the face-brushing payment willingness, that is, the face-brushing payment instruction of the electronic device was generated after this candidate started face-brushing payment. If the probability value is less than or equal to the preset probability threshold, the candidate may be considered not to have the willingness; that is, the face-brushing payment instruction was generated not after this candidate started face-brushing payment, but after another candidate did.
Further, if the number of the candidate persons with the probability value larger than the preset probability is multiple, it indicates that the result of the face brushing payment intention identification is not credible, and the authentication failure is prompted. If the candidate with the probability value larger than the preset probability does not exist, the result of the face brushing payment intention recognition is not credible, and authentication failure is prompted.
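A minimal sketch of this decision logic, assuming each candidate has already been assigned a willingness probability value; the threshold and the function name are hypothetical:

```python
def decide_paying_candidate(probabilities, threshold=0.5):
    """probabilities: per-candidate face-brushing payment willingness probability values.

    Returns the index of the single candidate judged to have the willingness, or
    None (prompting authentication failure) when zero candidates, or more than one
    candidate, exceed the threshold, since the result is then untrustworthy.
    """
    above = [i for i, p in enumerate(probabilities) if p > threshold]
    return above[0] if len(above) == 1 else None
```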
By the method of fig. 1, a corresponding mask map is generated for the region of each candidate in the face brushing image, which makes the feature information of each candidate clearer and increases the difference between having and not having the face-brushing payment willingness. Identifying, through the fusion features, whether each candidate has the willingness enhances the image contrast effect and focuses attention on the candidate with the face-brushing willingness, so that candidates with and without the willingness in the face brushing image can be accurately distinguished, the face-brushing payment willingness of the candidates is recognized in a more targeted manner, and the face-brushing security experience can be enhanced.
Based on the process of fig. 1, some specific embodiments and embodiments of the process are also provided in the present specification, and the description is continued below.
In one or more embodiments of the present specification, the corresponding mask map may be generated as follows.
specifically, after the candidate is determined, the face area of the candidate in the face brushing image is extracted, for example, the face of the candidate in the face brushing image is first extracted through a face extraction model, and then the face area of the candidate is determined through the position information of the face. Then, the face area is processed, and a face area selection frame is determined. The face region selection frame may have a plurality of display modes, for example, a circular frame, a rectangular frame, an irregular polygonal frame, etc., but there is a precondition that the face region selection frame must completely surround the face region of the candidate in order to ensure the accuracy of the recognition result.
After the face region selection frame is obtained, a first filling region of a mask image corresponding to the candidate is determined according to the face region selection frame of the candidate. The shape of the first filling area may have various display manners, which are not limited herein, such as a circular area, a rectangular area, an irregular polygonal area, and the like. However, there is a precondition that, in order to ensure the accuracy of the recognition result, the face region selection box is combined to be as close to the actual face region as possible when determining the first filling region.
After the first filling area is determined, a second filling area other than the first filling area is continuously determined in the brush face image, and different filling values are given to the first filling area and the second filling area.
In order to make the first filled region in the mask map coincide with the face region in the brush face image as much as possible, the mask map having a resolution equal to that of the brush face image is generated after different filling values are assigned to the first filled region and the second filled region.
Further, since the face area is mostly a circular area or an elliptical area, in order to make the first filling area more fit to the face area of the candidate, the first filling area is taken as a circular area.
Specifically, when the face region is processed and the face region selection frame is determined, the face region selection frame is determined as a rectangular frame. After the face area of the rectangular frame is obtained, the face frame width and the face frame height are calculated according to the position of the rectangular frame in the face brushing image, and the radius of the circular area is calculated according to the face frame width and the face frame height.
When the radius of the circular area is calculated, because the face area is a circular area or an elliptical area, when a rectangular frame is initially generated, the face area is similar to an inscribed circle of the rectangular frame, so that the face area is restored to the maximum extent, and the first filling area is ensured to include all the face area as much as possible, and the maximum value between the half length of the width of the rectangular frame and the half length of the height of the rectangular frame is taken as the radius of the circular area.
Therefore, the center of the rectangular frame is used as the center of circle, the half length of the longest side of the rectangular frame is determined as the radius, and then the first filling area of the mask map corresponding to the candidate is determined based on the circular area formed by the center of circle and the radius.
For example, assume that the position of the candidate's rectangular face frame in the face brushing image is (x1, y1, x2, y2), where x1 and x2 are the position coordinates of the rectangle's width on the x axis, and y1 and y2 are the position coordinates of the rectangle's height on the y axis.

The face frame width is then computed as w = x2 - x1, where w is the face frame width; note that the coordinate x1 is smaller than the coordinate x2.

The face frame height is computed as h = y2 - y1, where h is the face frame height; note that the coordinate y1 is smaller than the coordinate y2.

The radius of the circular area is determined as R = max(w/2, h/2), where R is the radius of the circular area.
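The computation above could be written as the following sketch; NumPy and the coordinate convention are assumptions:

```python
import numpy as np

def circular_mask_from_face_box(image_h, image_w, box):
    """box = (x1, y1, x2, y2): the candidate's rectangular face frame."""
    x1, y1, x2, y2 = box
    w = x2 - x1                                   # face frame width
    h = y2 - y1                                   # face frame height
    radius = max(w / 2.0, h / 2.0)                # R = max(w/2, h/2)
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0     # circle center = rectangle center
    ys, xs = np.ogrid[:image_h, :image_w]
    inside = (xs - cx) ** 2 + (ys - cy) ** 2 <= radius ** 2
    return np.where(inside, 1.0, 0.0).astype(np.float32)  # first fill region 1, second 0
```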
In one or more embodiments of the present disclosure, the present solution is more intuitively described in more detail in conjunction with fig. 2-3.
Fig. 2 is a schematic diagram of a framework of a system for recognizing a willingness to swipe face according to one or more embodiments of the present disclosure.
In one or more embodiments of the present specification, in order to identify the candidates' face-brushing payment willingness more accurately, end-to-end learning with a deep convolutional neural network is applied to the face brushing image obtained during the face-brushing payment process, which enables security detection of the candidates' willingness. A candidate region attention mechanism is introduced so that the mask map is integrated into the network learning, allowing the willingness of the candidates in the face brushing image to be recognized in a more targeted way and thus enhancing the face-brushing security experience.
As shown in fig. 2, the face-brushing payment willingness recognition system is implemented through end-to-end learning with a deep convolutional neural network, and comprises the face brushing image, the mask map generated from the region of each candidate in the face brushing image, a first convolutional network module, a candidate region attention mechanism implementation module, a third convolutional network module, and the network output.

It should be noted that the first convolutional network and the third convolutional network have a specific corresponding relationship, that is, they are of related network types; this is not limited herein, but the two are not independent of each other. For example, the first convolutional network and the third convolutional network belong to different parts of the same recognition network.
The face brushing image and the generated mask image are used as input data of the deep convolutional neural network, and network output is face brushing payment intention probability values, namely intention safety probability and intention non-safety probability.
Based on the above, in the process of identifying the willingness to swipe, firstly, the features of the swipe image are extracted through the first convolution network, the features of the swipe image and the mask map are input into the region attention mechanism implementation module where the candidate is located, and the region attention mechanism implementation module where the candidate is located processes the features of the swipe image and the mask map and outputs the fusion features. And finally, inputting the fusion features into a third convolution network, and processing the fusion features through the third convolution network to obtain a processing result.
The following description is continued on how the candidate region attention mechanism implementation module specifically processes the features of the face brushing image and the mask image and outputs the fusion features.
As shown in fig. 2, the candidate region attention mechanism implementation module includes a feature of the face brushing image, a mask image after resolution reduction processing, a second convolution network module, and a fusion feature.
Specifically, in the process of recognizing the brushing willingness to pay, the brushing image is input into a first convolution network in a first convolution network module, the characteristics of the brushing image are extracted through the first convolution network, and the mask image is subjected to resolution reduction processing to obtain the mask image subjected to resolution reduction processing so as to adapt to the characteristics of the brushing image.
And then, inputting the features of the face brushing image and the mask image after resolution reduction processing into a second convolution neural network module, and fusing the features of the face brushing image and the mask image after resolution reduction processing through a second convolution network to obtain fused features.
The recognition of the candidates' face-brushing payment willingness is thus implemented through end-to-end learning with a deep convolutional neural network, and the candidate region attention mechanism implementation module enables the network to learn and judge, in a more targeted way, the face-brushing payment willingness of the candidates in the face brushing image. Particularly in public places, this can effectively prevent the situation in which candidate A starts the face brushing but candidate B's assets are mistakenly charged.
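Purely as an illustrative sketch of the Fig. 2 pipeline (PyTorch and all module names are assumptions, not the patent's fixed implementation), the forward pass could look like this:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WillingnessNet(nn.Module):
    """Sketch of Fig. 2: first network -> candidate region attention -> third network."""

    def __init__(self, first_net, second_net, third_net):
        super().__init__()
        self.first_net = first_net    # extracts features of the face brushing image
        self.second_net = second_net  # fuses the features with the reduced-resolution mask map
        self.third_net = third_net    # maps the fusion features to the network output

    def forward(self, face_image, mask_map):
        feats = self.first_net(face_image)                          # (N, C, H', W')
        mask_small = F.interpolate(mask_map, size=feats.shape[-2:],
                                   mode='nearest')                  # resolution reduction
        fused = self.second_net(torch.cat([feats, mask_small], dim=1))
        logits = self.third_net(fused)                              # (N, 2)
        return logits  # softmax over these gives the willingness-safe / not-safe probabilities
```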
More intuitively, fig. 3 is a schematic flow chart of a method for recognizing a willingness to pay by swiping based on end-to-end learning of a deep convolutional neural network, according to one or more embodiments of the present disclosure.
The flow in fig. 3 may include the following steps:
s302: the face brushing method comprises the steps of obtaining a face brushing image, and determining a candidate to be identified in the face brushing image.
S304: and respectively generating corresponding mask images according to the located areas of the candidate persons in the face brushing image so as to distinguish the located areas from other areas in the face brushing image.
S306: and extracting the characteristics of the brushing image through a first convolution network.
It should be noted that the first convolutional network is obtained through supervised training in advance.
S308: and performing resolution reduction processing on the mask image so as to adapt to the characteristics of the brushing face image.
For example, the resolution of the mask map is reduced by nearest-neighbor sampling to generate a mask map with the same resolution as the features of the face brushing image, so that the mask map is adapted to those features and the second convolutional network can process them.
S310: and fusing the characteristics of the face brushing image and the mask image after resolution reduction processing through a second convolution network to obtain fused characteristics. It should be noted that the second convolutional network is obtained through supervised training in advance.
In one or more embodiments of the present specification, in the process of obtaining the fusion feature, the feature of the face brushing image and the mask image after the resolution reduction processing are connected according to the channel dimension, and the feature obtained by the connection is input to the second convolution network for processing, so as to obtain the fusion feature.
The number of convolutional layers in the second convolutional network is not specifically limited herein; that is, the second convolutional network may consist of one or more convolutional layers. Meanwhile, the resolution of the fusion features is the same as that of the features of the face brushing image, and the number of feature channels of the fusion features is also the same as that of the features of the face brushing image.
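As one possible instantiation, assumed rather than specified by the disclosure, the second convolutional network could be a single convolutional layer that maps the C+1 concatenated channels back to C channels at the same resolution:

```python
import torch.nn as nn

def make_second_net(feature_channels):
    # Input: image features (C channels) concatenated with the reduced mask (1 channel).
    # Output: fusion features with the same resolution and the same C channels as the features.
    return nn.Sequential(
        nn.Conv2d(feature_channels + 1, feature_channels, kernel_size=3, padding=1),
        nn.ReLU(inplace=True),
    )
```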
S312: and inputting the fusion features into a third convolution network corresponding to the first convolution network for processing to obtain a processing result, wherein the first convolution network and the third convolution network are obtained by splitting the same convolution network in advance. It should be noted that the third convolutional network is obtained through supervised training in advance.
In one or more embodiments of the present specification, the first convolutional network and the third convolutional network are obtained by splitting the same convolutional network in advance, for example, ResNet, ShuffleNetV2, or the like.
In the splitting process, the first convolutional network and the third convolutional network can be respectively used as the front part and the rear part of the same convolutional network.
The split position may be determined by the resolution of the features of the face brushing image. Even if extraction of the features of the face brushing image has not yet started (assuming the model has not been constructed at this time), there may already be a desired resolution for those features (referred to as the target resolution, for example, a resolution determined relative to that of the face brushing image), so that the convolutional network can be split accordingly and the model construction completed, with the front part of the convolutional network (the first convolutional network) outputting features of exactly that resolution.
The convolutional layer matching the target resolution is then determined among the convolutional layers of the same convolutional network. Finally, with the matched convolutional layer as the splitting point, the same convolutional network is split into a front part and a rear part, the front part serving as the first convolutional network and the rear part as the third convolutional network.
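A hedged sketch of such a split; the use of torchvision's ResNet-18 and the particular split point are assumptions chosen only for illustration:

```python
import torch.nn as nn
from torchvision.models import resnet18

def split_backbone(num_classes=2):
    backbone = resnet18(weights=None)
    # Front part (first convolutional network): layers up to the one whose output
    # resolution matches the target resolution chosen for the image features.
    first_net = nn.Sequential(backbone.conv1, backbone.bn1, backbone.relu,
                              backbone.maxpool, backbone.layer1, backbone.layer2)
    # Rear part (third convolutional network): the remaining layers plus a 2-way head
    # (willingness safe / not safe).
    third_net = nn.Sequential(backbone.layer3, backbone.layer4, backbone.avgpool,
                              nn.Flatten(),
                              nn.Linear(backbone.fc.in_features, num_classes))
    return first_net, third_net
```

With this split the image features have 128 channels, so the hypothetical make_second_net above would be built as make_second_net(128).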
S314: and generating a probability value according to the processing result to indicate whether the corresponding candidate has the willingness to pay by brushing the face.
For example, the probability value is compared with a set threshold probability, and if the probability value is greater than the set threshold probability, it is determined that the will of the candidate is safe, that is, the candidate has a will of face brushing payment. And if the probability value is smaller than or equal to the set threshold probability, determining that the willingness of the candidate is unsafe, namely the candidate does not have the willingness to pay by brushing the face.
In light of the foregoing, the following continues to describe how supervised training can be performed for the first convolutional network, the second convolutional network, and the third convolutional network.
In one or more embodiments of the present description, a training data set needs to be established first, and then network training is performed through the training data set.
Specifically, when establishing the training data set, a face brushing sample image containing a user confirmed as the face-brushing user is obtained first; that is, a candidate enables face-brushing payment and the face brushing image is collected through the camera device. Regardless of whether multiple candidates to be identified appear in the face brushing image, that candidate is taken as the face-brushing user and the face brushing image is taken as a face brushing sample image. For example, a face brushing image is collected by the camera on an offline IoT face-brushing machine; if candidate A enables face-brushing payment, candidate A is taken as the face-brushing user in that image, and the image is taken as a face brushing sample image.
It should be noted that, each time the face brushing payment is enabled, the corresponding face brushing image is collected once. For example, if the candidate a enables face brushing payment, the IoT face brushing tool acquires an image for a nearby user, so as to obtain a single face brushing image, and if the candidate B enables face brushing payment, the IoT face brushing tool acquires an image for a nearby user again, so as to obtain a single face brushing image again.
After the face brushing sample image is obtained, the face-brushing user is marked as having the face-brushing payment willingness, and the corresponding mask map is generated, so that a positive sample is obtained. When the corresponding mask map is generated, the position of the face-brushing user's face selection frame in the face brushing image is determined, and the mask map is then generated from that position. The face-brushing user can be marked as having the willingness through a willingness label, for example a label in {0, 1}, where 1 represents having the face-brushing willingness and 0 represents not having it.
After the face brushing sample image is obtained, because a plurality of candidate persons to be identified may exist in the face brushing sample image, if other users who are photographed by the way are also included in the face brushing sample image, the other users are marked as not having the intention of face brushing payment, and a corresponding mask map is generated, so that a negative sample is obtained. For example, if the candidate a enables a face-brushing payment, the candidate B is marked in the face-brushing image as not having a willingness to brush the face.
And finally, performing supervised training on the first convolution network, the second convolution network and the third convolution network according to the obtained positive sample and the negative sample.
When supervised training is performed, firstly, random sampling is performed on positive samples and negative samples in a training data set, and training batch and corresponding label tags are generated. The training batch and its corresponding label tag are then input into the initial deep convolutional neural network. The initial deep convolutional neural network comprises an untrained first convolutional network, a second convolutional network and a third convolutional network.
Then, the initial deep convolutional neural network outputs a probability value, a loss function is computed from the probability value and the corresponding label, and the loss function is continuously optimized through a gradient descent method to train the network, thereby completing the supervised training and obtaining the deep convolutional neural network. The rule for identifying whether a candidate has the face-brushing payment willingness can be learned through this network training. For example, if the deep convolutional neural network recognizes that the face region of a candidate is located in the middle region, the candidate is considered to have the face-brushing payment willingness.
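A minimal supervised-training sketch under the same assumptions (PyTorch, cross-entropy loss, stochastic gradient descent); the data loader and hyperparameters are hypothetical:

```python
import torch
import torch.nn as nn

def supervised_train(model, loader, epochs=10, lr=1e-3, device='cpu'):
    """loader yields (face_image, mask_map, label) batches randomly sampled from the
    positive and negative samples; label is 1 (has willingness) or 0 (has not)."""
    model.to(device).train()
    criterion = nn.CrossEntropyLoss()                 # loss between output and willingness label
    optimizer = torch.optim.SGD(model.parameters(), lr=lr, momentum=0.9)  # gradient descent
    for _ in range(epochs):
        for image, mask, label in loader:
            image, mask, label = image.to(device), mask.to(device), label.to(device)
            logits = model(image, mask)
            loss = criterion(logits, label)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
```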
Based on the same idea, one or more embodiments of the present specification further provide apparatuses and devices corresponding to the above-described method, as shown in fig. 4 and 5.
Fig. 4 is a schematic structural diagram of a device for recognizing a willingness to pay by brushing face according to one or more embodiments of the present disclosure, where the device includes:
an obtaining module 402, configured to obtain a face brushing image, and determine a candidate to be identified in the face brushing image;
a generating module 404, configured to generate corresponding mask maps according to the located regions of the candidate persons in the face brushing image, so as to distinguish the located regions from other regions in the face brushing image;
the extraction module 406 is configured to extract the features of the brushed face image, and obtain a fusion feature according to the features of the brushed face image and the mask image;
and the identifying module 408 is used for identifying whether each candidate has the willingness to pay by brushing the face according to the fusion characteristics.
Optionally, the generating module 404 is configured to, for each determined candidate, perform:
determining a first filling area of a mask image corresponding to the candidate and a second filling area outside the first filling area according to the face area selection box of the candidate;
generating the mask map having a resolution identical to that of the brush face image by assigning different fill values to the first fill region and the second fill region.
Optionally, the face region selection frame is a rectangular frame;
the generating module 404 determines the center of the rectangular frame as a circle center, and determines a half length of the longest side of the rectangular frame as a radius;
and determining a circular area formed based on the circle center and the radius as a first filling area of the mask map corresponding to the candidate.
Optionally, the extracting module 406 is configured to extract features of the brushed face image through a first convolution network;
performing resolution reduction processing on the mask image to adapt to the characteristics of the face brushing image;
and fusing the characteristics of the face brushing image and the mask image after resolution reduction processing through a second convolution network to obtain fused characteristics.
Optionally, the extracting module 406 connects the features of the brushed face image and the mask image after the resolution reduction processing according to a channel dimension;
and inputting the connected features into the second convolution network for processing to obtain fusion features.
Optionally, the identifying module 408 inputs the fusion feature into a third convolutional network corresponding to the first convolutional network for processing, so as to obtain a processing result, where the first convolutional network and the third convolutional network are obtained by splitting the same convolutional network in advance;
and generating a probability value according to the processing result to indicate whether the corresponding candidate has the willingness to pay by brushing the face.
Optionally, the recognition module 408 determines a target resolution as a resolution of a feature of the brushed face image;
determining a convolutional layer matched with the target resolution in convolutional layers in the same convolutional network;
and taking the matched convolutional layer as a splitting point, splitting the same convolutional network into a former part and a latter part, wherein the former part is taken as the first convolutional network, and the latter part is taken as the third convolutional network.
Optionally, the device further includes a supervised training module, where the supervised training module acquires a face brushing sample image including a face brushing user confirmed to be a face brushing user;
marking the face brushing user as having a face brushing willingness to pay, and generating a corresponding mask map to obtain a positive sample;
if the face brushing sample image also contains other users shot by the way, marking the other users as not having the intention of face brushing payment, and generating a corresponding mask map to obtain a negative sample;
and carrying out supervised training on the first convolutional network, the second convolutional network and the third convolutional network according to the obtained samples.
Optionally, the face brushing image includes at least two human faces.
Optionally, the device is applied to an offline IoT face brushing tool, and the face brushing image is acquired by the IoT face brushing tool for nearby users.
Fig. 5 is a schematic structural diagram of a device for recognizing a willingness to pay by brushing face according to one or more embodiments of the present specification, where the device includes:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to:
acquiring a face brushing image, and determining a candidate to be identified in the face brushing image;
according to the located area of each candidate in the face brushing image, respectively generating corresponding mask images to distinguish the located area from other areas in the face brushing image;
extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask image;
and identifying whether each candidate has a face brushing willingness to pay according to the fusion characteristics.
Based on the same idea, one or more embodiments of the present specification further provide a non-volatile computer storage medium for identifying willingness-to-swipe payment, corresponding to the above method, and storing computer-executable instructions configured to:
acquiring a face brushing image, and determining a candidate to be identified in the face brushing image;
according to the located area of each candidate in the face brushing image, respectively generating corresponding mask images to distinguish the located area from other areas in the face brushing image;
extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask image;
and identifying whether each candidate has a face brushing willingness to pay according to the fusion characteristics.
In the 1990s, improvements in a technology could be clearly distinguished as improvements in hardware (e.g., improvements in circuit structures such as diodes, transistors, and switches) or improvements in software (improvements in a process flow). However, as technology advances, many of today's process flow improvements can be regarded as direct improvements in hardware circuit architecture. Designers almost always obtain the corresponding hardware circuit structure by programming an improved method flow into a hardware circuit. Thus, it cannot be said that an improvement in a process flow cannot be realized by hardware physical modules. For example, a Programmable Logic Device (PLD), such as a Field Programmable Gate Array (FPGA), is an integrated circuit whose logic functions are determined by the user's programming of the device. A designer "integrates" a digital system onto a PLD by programming it, without requiring the chip manufacturer to design and fabricate an application-specific integrated circuit chip. Furthermore, nowadays, instead of manually making an integrated circuit chip, such programming is mostly implemented with "logic compiler" software, which is similar to the software compiler used in program development and writing, and the original code before compiling is also written in a specific programming language, called a Hardware Description Language (HDL). There is not just one HDL but many, such as ABEL (Advanced Boolean Expression Language), AHDL (Altera Hardware Description Language), Confluence, CUPL (Cornell University Programming Language), HDCal, JHDL (Java Hardware Description Language), Lava, Lola, MyHDL, PALASM, and RHDL (Ruby Hardware Description Language); at present, VHDL (Very-High-Speed Integrated Circuit Hardware Description Language) and Verilog are the most commonly used. It will also be apparent to those skilled in the art that a hardware circuit implementing the logical method flow can be readily obtained by merely slightly programming the method flow into an integrated circuit using the hardware description languages described above.
The controller may be implemented in any suitable manner. For example, the controller may take the form of a microprocessor or processor together with a computer-readable medium storing computer-readable program code (e.g., software or firmware) executable by the (micro)processor, logic gates, switches, an Application Specific Integrated Circuit (ASIC), a programmable logic controller, or an embedded microcontroller. Examples of such controllers include, but are not limited to, the following microcontrollers: ARC 625D, Atmel AT91SAM, Microchip PIC18F26K20, and Silicon Labs C8051F320; a memory controller may also be implemented as part of the control logic of a memory. Those skilled in the art will also appreciate that, in addition to implementing the controller purely as computer-readable program code, the same functionality can be achieved by logically programming the method steps so that the controller takes the form of logic gates, switches, application-specific integrated circuits, programmable logic controllers, embedded microcontrollers, and the like. Such a controller may therefore be regarded as a hardware component, and the means included therein for performing the various functions may also be regarded as structures within the hardware component. Or even the means for performing the functions may be regarded both as software modules for performing the method and as structures within the hardware component.
The systems, devices, modules or units illustrated in the above embodiments may be implemented by a computer chip or an entity, or by a product with certain functions. One typical implementation device is a computer. In particular, the computer may be, for example, a personal computer, a laptop computer, a cellular telephone, a camera phone, a smartphone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or a combination of any of these devices.
For convenience of description, the above devices are described as being divided into various units by function, and are described separately. Of course, the functions of the various elements may be implemented in the same one or more software and/or hardware implementations of the present description.
As will be appreciated by one skilled in the art, the present specification embodiments may be provided as a method, system, or computer program product. Accordingly, embodiments of the present description may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present description may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and so forth) having computer-usable program code embodied therein.
The description has been presented with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the description. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media, including both permanent and non-permanent, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer readable media do not include transitory computer readable media such as modulated data signals and carrier waves.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
This description may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. The specification may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices.
The embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for the embodiments of the apparatus, the device, and the nonvolatile computer storage medium, since they are substantially similar to the embodiments of the method, the description is simple, and for the relevant points, reference may be made to the partial description of the embodiments of the method.
The foregoing description has been directed to specific embodiments of this disclosure. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims may be performed in a different order than in the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
The above description is merely one or more embodiments of the present disclosure and is not intended to limit the present disclosure. Various modifications and alterations to one or more embodiments of the present description will be apparent to those skilled in the art. Any modification, equivalent replacement, improvement or the like made within the spirit and principle of one or more embodiments of the present specification should be included in the scope of the claims of the present specification.
Claims (21)
1. A face brushing willingness-to-pay recognition method comprises the following steps:
acquiring a face brushing image, and determining the candidates to be identified in the face brushing image;
generating, for each candidate, a corresponding mask map according to the area where the candidate is located in the face brushing image, so as to distinguish that area from other areas in the face brushing image;
extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask map;
and identifying, according to the fusion features, whether each candidate has a face brushing willingness to pay.
2. The method according to claim 1, wherein generating the corresponding mask maps according to the area where each candidate is located in the face brushing image specifically comprises:
performing, for each determined candidate:
determining, according to the face area selection box of the candidate, a first filling area of the mask map corresponding to the candidate and a second filling area outside the first filling area;
generating the mask map with a resolution identical to that of the face brushing image by assigning different fill values to the first filling area and the second filling area.
3. The method of claim 2, wherein the face area selection box is a rectangular box;
determining, according to the face area selection box of the candidate, the first filling area of the mask map corresponding to the candidate specifically comprises:
determining the center of the rectangular box as the circle center, and determining half the length of the longest edge of the rectangular box as the radius; and determining the circular area formed from the circle center and the radius as the first filling area of the mask map corresponding to the candidate.
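For illustration, a mask map of the kind recited in claims 2 and 3 could be generated as follows; this NumPy sketch assumes a face area selection box given as (x, y, w, h) in pixels and uses fill values of 1.0 and 0.0, neither of which is required by the claims.

```python
import numpy as np

def make_circular_mask(image_hw, face_box, inside=1.0, outside=0.0):
    """Mask map at the resolution of the face brushing image: a circular first
    filling area around the candidate's face box, a second filling area elsewhere.
    face_box is assumed to be (x, y, w, h) in pixels; fill values are arbitrary."""
    h, w = image_hw
    x, y, bw, bh = face_box
    cx, cy = x + bw / 2.0, y + bh / 2.0          # circle center = center of the rectangular box
    radius = max(bw, bh) / 2.0                   # radius = half the length of the longest edge
    ys, xs = np.ogrid[:h, :w]
    inside_circle = (xs - cx) ** 2 + (ys - cy) ** 2 <= radius ** 2
    mask = np.full((h, w), outside, dtype=np.float32)   # second filling area
    mask[inside_circle] = inside                         # first filling area
    return mask
```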
4. The method according to claim 1, wherein extracting the features of the face brushing image and obtaining the fusion features according to the features of the face brushing image and the mask map specifically comprises:
extracting the features of the face brushing image through a first convolutional network;
performing resolution reduction processing on the mask map to adapt it to the features of the face brushing image;
and fusing, through a second convolutional network, the features of the face brushing image and the mask map after the resolution reduction processing, to obtain the fusion features.
5. The method according to claim 4, wherein fusing, through the second convolutional network, the features of the face brushing image and the mask map after the resolution reduction processing to obtain the fusion features specifically comprises:
connecting, according to the channel dimension, the features of the face brushing image with the mask map after the resolution reduction processing; and inputting the connected features into the second convolutional network for processing to obtain the fusion features.
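A minimal PyTorch sketch of the fusion step recited in claims 4 and 5, assuming the mask map is supplied as a single-channel tensor and the second convolutional network is a small convolution block whose input has one extra channel for the mask; the channel counts are assumptions of this sketch.

```python
import torch
import torch.nn.functional as F

def fuse(image_feats, mask, fusion_cnn):
    """image_feats: [N, C, h, w] features of the face brushing image;
    mask: [N, 1, H, W] mask map at the original image resolution."""
    mask_small = F.interpolate(mask, size=image_feats.shape[-2:],
                               mode="nearest")             # resolution reduction processing
    stacked = torch.cat([image_feats, mask_small], dim=1)  # connect along the channel dimension
    return fusion_cnn(stacked)                             # second convolutional network

# An assumed second convolutional network: one extra input channel for the mask map.
fusion_cnn = torch.nn.Conv2d(in_channels=256 + 1, out_channels=256,
                             kernel_size=3, padding=1)
```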
6. The method of claim 4, wherein identifying whether each candidate has a face brushing willingness to pay according to the fusion features comprises:
inputting the fusion features into a third convolutional network corresponding to the first convolutional network for processing to obtain a processing result, wherein the first convolutional network and the third convolutional network are obtained by splitting the same convolutional network in advance; and generating a probability value according to the processing result to indicate whether the corresponding candidate has a face brushing willingness to pay.
7. The method according to claim 6, wherein the splitting specifically comprises:
determining a target resolution as the resolution of the features of the face brushing image;
determining, among the convolutional layers of the same convolutional network, a convolutional layer matching the target resolution;
and taking the matched convolutional layer as the splitting point, splitting the same convolutional network into a former part and a latter part, wherein the former part serves as the first convolutional network and the latter part serves as the third convolutional network.
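One possible reading of this splitting step, sketched in PyTorch under the assumption that the shared backbone is an nn.Sequential and that a dummy forward pass is used to locate the layer whose output matches the target resolution:

```python
import torch
import torch.nn as nn

def split_backbone(backbone: nn.Sequential, dummy_input, target_hw):
    """Split one convolutional network into a first part (up to and including the
    layer whose output resolution matches target_hw) and a third part (the rest)."""
    x = dummy_input
    split_at = None
    for i, layer in enumerate(backbone):
        x = layer(x)
        if tuple(x.shape[-2:]) == tuple(target_hw):   # layer matching the target resolution
            split_at = i + 1
            break
    if split_at is None:
        raise ValueError("no convolutional layer matches the target resolution")
    first = nn.Sequential(*list(backbone)[:split_at])   # first convolutional network
    third = nn.Sequential(*list(backbone)[split_at:])   # third convolutional network
    return first, third
```

For example, split_backbone(backbone, torch.zeros(1, 3, 224, 224), (14, 14)) would cut the backbone after the first layer whose output resolution is 14×14; the concrete input size and target resolution are assumptions of this sketch.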
8. The method of claim 6, wherein, before identifying whether each candidate has a face brushing willingness to pay according to the fusion features, the method further comprises:
acquiring a face brushing sample image containing a confirmed face brushing user;
marking the face brushing user as having a face brushing willingness to pay, and generating a corresponding mask map to obtain a positive sample;
if the face brushing sample image also contains other users incidentally captured in the shot, marking the other users as not having a face brushing willingness to pay, and generating corresponding mask maps to obtain negative samples;
and carrying out supervised training on the first convolutional network, the second convolutional network and the third convolutional network according to the obtained samples.
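For illustration, the sample construction and supervised training recited in claim 8 might look as follows in PyTorch; make_mask, the binary cross-entropy loss, and the per-image scalar logit are assumptions of this sketch rather than limitations of the claim.

```python
import torch
import torch.nn.functional as F

def make_samples(sample_image, paying_user_box, bystander_boxes, make_mask):
    """One positive sample for the confirmed face brushing user and one negative
    sample per incidentally captured user; make_mask returns an [H, W] tensor."""
    hw = sample_image.shape[-2:]
    samples = [(sample_image, make_mask(hw, paying_user_box), 1.0)]
    samples += [(sample_image, make_mask(hw, box), 0.0) for box in bystander_boxes]
    return samples

def train_step(samples, first_cnn, fusion_cnn, third_cnn, optimizer):
    """One supervised step over the three networks with a binary intent label."""
    loss_fn = torch.nn.BCEWithLogitsLoss()
    optimizer.zero_grad()
    total = 0.0
    for image, mask, label in samples:                      # image: [3, H, W]
        feats = first_cnn(image[None])                      # [1, C, h, w]
        mask_small = F.interpolate(mask[None, None], size=feats.shape[-2:],
                                   mode="nearest")
        logit = third_cnn(fusion_cnn(torch.cat([feats, mask_small], dim=1)))
        total = total + loss_fn(logit.view(1), torch.tensor([label]))
    total.backward()
    optimizer.step()
    return float(total)
```

The optimizer is assumed to cover the parameters of all three networks, for example torch.optim.SGD(list(first_cnn.parameters()) + list(fusion_cnn.parameters()) + list(third_cnn.parameters()), lr=1e-3).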
9. The method according to any one of claims 1 to 8, wherein the face brushing image comprises at least two human faces.
10. The method according to any one of claims 1 to 8, applied to an offline IoT face brushing machine, wherein the face brushing image is captured by the IoT face brushing machine for nearby users.
11. A face brushing willingness-to-pay recognition apparatus, comprising:
an acquisition module that acquires a face brushing image, and determines the candidates to be identified in the face brushing image;
a generating module that generates, for each candidate, a corresponding mask map according to the area where the candidate is located in the face brushing image, so as to distinguish that area from other areas in the face brushing image;
an extraction module that extracts the features of the face brushing image, and obtains fusion features according to the features of the face brushing image and the mask map;
and an identification module that identifies, according to the fusion features, whether each candidate has a face brushing willingness to pay.
12. The apparatus of claim 11, wherein the generating module performs, for each of the determined candidates:
determining, according to the face area selection box of the candidate, a first filling area of the mask map corresponding to the candidate and a second filling area outside the first filling area;
generating the mask map with a resolution identical to that of the face brushing image by assigning different fill values to the first filling area and the second filling area.
13. The apparatus of claim 12, wherein the face area selection box is a rectangular box;
the generating module determines the center of the rectangular box as the circle center, and determines half the length of the longest edge of the rectangular box as the radius;
and determines the circular area formed from the circle center and the radius as the first filling area of the mask map corresponding to the candidate.
14. The apparatus of claim 11, wherein the extraction module extracts the features of the face brushing image through a first convolutional network;
performs resolution reduction processing on the mask map to adapt it to the features of the face brushing image;
and fuses, through a second convolutional network, the features of the face brushing image and the mask map after the resolution reduction processing, to obtain the fusion features.
15. The apparatus of claim 14, wherein the extraction module connects, according to the channel dimension, the features of the face brushing image with the mask map after the resolution reduction processing;
and inputs the connected features into the second convolutional network for processing to obtain the fusion features.
16. The apparatus of claim 14, wherein the identification module inputs the fusion features into a third convolutional network corresponding to the first convolutional network for processing to obtain a processing result, the first convolutional network and the third convolutional network being obtained by splitting the same convolutional network in advance;
and generates a probability value according to the processing result to indicate whether the corresponding candidate has a face brushing willingness to pay.
17. The apparatus of claim 16, wherein the identification module determines a target resolution as the resolution of the features of the face brushing image;
determines, among the convolutional layers of the same convolutional network, a convolutional layer matching the target resolution;
and, taking the matched convolutional layer as the splitting point, splits the same convolutional network into a former part and a latter part, wherein the former part serves as the first convolutional network and the latter part serves as the third convolutional network.
18. The apparatus of claim 16, further comprising a supervised training module that acquires a face brushing sample image containing a confirmed face brushing user;
marking the face brushing user as having a face brushing willingness to pay, and generating a corresponding mask map to obtain a positive sample;
if the face brushing sample image also contains other users incidentally captured in the shot, marking the other users as not having a face brushing willingness to pay, and generating corresponding mask maps to obtain negative samples;
and carrying out supervised training on the first convolutional network, the second convolutional network and the third convolutional network according to the obtained samples.
19. The apparatus according to any one of claims 11 to 16, wherein the face brushing image comprises at least two human faces.
20. The apparatus according to any one of claims 11 to 16, applied to an offline IoT face brushing machine, wherein the face brushing image is captured by the IoT face brushing machine for nearby users.
21. A face brushing willingness-to-pay recognition device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to:
acquiring a face brushing image, and determining the candidates to be identified in the face brushing image;
generating, for each candidate, a corresponding mask map according to the area where the candidate is located in the face brushing image, so as to distinguish that area from other areas in the face brushing image;
extracting the features of the face brushing image, and obtaining fusion features according to the features of the face brushing image and the mask map;
and identifying, according to the fusion features, whether each candidate has a face brushing willingness to pay.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210180422.6A CN114511909A (en) | 2022-02-25 | 2022-02-25 | Face brushing payment intention identification method, device and equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210180422.6A CN114511909A (en) | 2022-02-25 | 2022-02-25 | Face brushing payment intention identification method, device and equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114511909A true CN114511909A (en) | 2022-05-17 |
Family
ID=81552717
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210180422.6A Pending CN114511909A (en) | 2022-02-25 | 2022-02-25 | Face brushing payment intention identification method, device and equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114511909A (en) |
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016158748A1 (en) * | 2015-03-31 | 2016-10-06 | 日本電気株式会社 | Payment system, payment device, program, and payment method |
US20200026917A1 (en) * | 2017-03-30 | 2020-01-23 | Beijing 7Invensun Technology Co., Ltd. | Authentication method, apparatus and system |
CN112258193A (en) * | 2019-08-16 | 2021-01-22 | 创新先进技术有限公司 | Payment method and device |
CN111292092A (en) * | 2020-05-09 | 2020-06-16 | 支付宝(杭州)信息技术有限公司 | Face brushing payment method and device and electronic equipment |
CN112418243A (en) * | 2020-10-28 | 2021-02-26 | 北京迈格威科技有限公司 | Feature extraction method and device and electronic equipment |
CN112766176A (en) * | 2021-01-21 | 2021-05-07 | 深圳市安软科技股份有限公司 | Training method of lightweight convolutional neural network and face attribute recognition method |
CN113553961A (en) * | 2021-07-27 | 2021-10-26 | 北京京东尚科信息技术有限公司 | Training method and device of face recognition model, electronic equipment and storage medium |
Non-Patent Citations (1)
Title |
---|
才华, 肖普山: "生物识别技术在金融支付领域应用探索" [Exploring the application of biometric technology in financial payment], 计算机应用与软件 (Computer Applications and Software), vol. 38, no. 04, 12 April 2021 (2021-04-12) *
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110570200B (en) | Payment method and device | |
US11463631B2 (en) | Method and apparatus for generating face image | |
TWI753271B (en) | Resource transfer method, device and system | |
CN111292092B (en) | Face brushing payment method and device and electronic equipment | |
US20180068198A1 (en) | Methods and Software for Detecting Objects in an Image Using Contextual Multiscale Fast Region-Based Convolutional Neural Network | |
KR102173123B1 (en) | Method and apparatus for recognizing object of image in electronic device | |
KR20190129826A (en) | Biometrics methods and apparatus, systems, electronic devices, storage media | |
US11263634B2 (en) | Payment method and device | |
KR20190028349A (en) | Electronic device and method for human segmentation in image | |
CN109670444B (en) | Attitude detection model generation method, attitude detection device, attitude detection equipment and attitude detection medium | |
CN111523413A (en) | Method and device for generating face image | |
CN114238904B (en) | Identity recognition method, and training method and device of dual-channel hyper-resolution model | |
CN113012054A (en) | Sample enhancement method and training method based on sectional drawing, system and electronic equipment thereof | |
CN115115959A (en) | Image processing method and device | |
CN108596070A (en) | Character recognition method, device, storage medium, program product and electronic equipment | |
CN115311178A (en) | Image splicing method, device, equipment and medium | |
CN111199169A (en) | Image processing method and device | |
CN111259757A (en) | Image-based living body identification method, device and equipment | |
CN113392763B (en) | Face recognition method, device and equipment | |
CN118194230A (en) | Multi-mode video question-answering method and device and computer equipment | |
CN113610884A (en) | Image processing method, image processing device, electronic equipment and computer readable storage medium | |
CN110321821B (en) | Human face alignment initialization method and device based on three-dimensional projection and storage medium | |
CN110059576A (en) | Screening technique, device and the electronic equipment of picture | |
CN115546908A (en) | Living body detection method, device and equipment | |
CN114511909A (en) | Face brushing payment intention identification method, device and equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||