WO2021072998A1 - 一种亚洲人脸数据自动收集及清理的方法和系统 - Google Patents
一种亚洲人脸数据自动收集及清理的方法和系统 Download PDFInfo
- Publication number
- WO2021072998A1 WO2021072998A1 PCT/CN2020/070658 CN2020070658W WO2021072998A1 WO 2021072998 A1 WO2021072998 A1 WO 2021072998A1 CN 2020070658 W CN2020070658 W CN 2020070658W WO 2021072998 A1 WO2021072998 A1 WO 2021072998A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- asian
- face
- person
- target
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/51—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/53—Querying
- G06F16/535—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/54—Browsing; Visualisation therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
Definitions
- the invention relates to the technical field of image processing and recognition, in particular to a method and system for automatic collection and cleaning of Asian face data.
- the source of face photos is mainly It is downloaded and collected from the Internet through crawlers, and then it is necessary to mark and clean up the photos through complicated manual operations. This series of tasks not only requires very high computing and storage equipment, but also requires a lot of manpower and time costs. Due to their unique picture resources and operating capital advantages, Internet giants have private large-scale face data sets, but so far, there are very few large-scale public face data sets that ordinary users can obtain for free.
- the face data sets mainly include Youtube Face, CASIA-WebFace and MS-1M-Celeb.
- the present invention provides a method and system for automatically collecting and cleaning Asian face data, which automatically cleans the collected Asian face photo data, achieving low time cost and per capita workload. It has fewer effects and can build an Asian face database with a higher recall rate.
- the present invention provides a method for automatically collecting and cleaning Asian face data, which includes the following steps:
- Preset multiple Asian target person identifications obtain official photo links of Asian target persons, and construct a data list, the data list including key information of multiple Asian target person identifications;
- search for the Asian target person identification and the person identification after adding keywords to obtain reference person data search for the Asian target person identification and the person identification after adding keywords to obtain reference person data.
- the specific steps include:
- the Asian target person identification adopts the names of different Asian target persons or digital numbers for distinguishing different Asian target persons
- the official photo link of the Asian target person adopts a URL link
- each row of the data list corresponds to An Asian target person logo and the corresponding official photo URL link.
- the searched reference person data is associated and stored with the corresponding Asian target person identification and keywords, and the specific steps are as follows:
- main folder is named using the Asian target person logo, and multiple sub-folders are created in the main folder, each named using keywords;
- the reference character data obtained through different search methods are associated and saved in the corresponding subfolders.
- Preliminary cleaning use face detection algorithm to perform face detection on reference person data, and obtain reference face data after face detection processing;
- In-depth cleaning use face recognition algorithm to detect the reference face data processed by face detection, update the official photo list of Asian target person identification, check whether it matches the official photo list of Asian target person identification, and delete the official photo if it does not match
- the reference face data processed by the face detection process if it matches, the reference face data processed by the face detection process is retained as the target face data associated with the target person in Asia.
- the specific steps for deleting the obtained repeated downloading reference character data are:
- the repeated download adopts whether the file naming of the reference person data is the same as the standard. If there is a repeated download, keep the data in the repeated download reference person data One of them, and the rest of the duplicate reference character data for deletion processing.
- the specific steps of using a face detection algorithm to perform face detection on reference person data are as follows:
- each reference face data in the multiple face frames cut out each reference face data in the multiple face frames, while keeping the reference person data, extract the official photos corresponding to the Asian target person and the face features of each reference face data, and calculate them separately
- the degree of matching between the facial features of each reference face data and the facial features of the official photos of the corresponding Asian target person, and the reference person data corresponding to the reference face data with the highest matching degree is used as the reference person after face detection processing Face data is retained.
- the specific steps of the deep cleaning are:
- the first preset threshold is greater than the second preset threshold.
- the present invention also provides a system for automatically collecting and cleaning Asian face data, including: a data list building module, a reference person data acquisition module, an associated storage module, and a reference person data cleaning module;
- the data list building module is used to obtain the official photo link of the Asian target person by presetting a plurality of Asian target person identifications to construct a data list;
- the reference person data obtaining module is configured to obtain reference person data associated with the Asian target person identification and the person identification keywords according to the content of the data list;
- the associative storage module is used for associative storage of reference character data with corresponding Asian target character identifiers and keywords;
- the reference person data cleaning module is used to clean up the stored reference person data to obtain target face data associated with the target person in Asia.
- the reference person data cleaning module includes a preliminary cleaning submodule and a deep cleaning submodule.
- the preliminary cleaning submodule is used to perform face detection on the reference person data using a face detection algorithm to obtain a The processed reference face data is detected, and the deep cleaning sub-module is used to detect the reference face data processed by face detection using a face recognition algorithm, update the official photo list of the Asian target person identification, and check whether it matches the Asian target person Identify the official photo list matching, and use the matched reference face data as the target face data associated with the target person in Asia.
- the present invention has the following advantages and beneficial effects:
- the present invention uses a picture processing tool to check the readability and unify the format of all reference character data, and delete the repeatedly downloaded reference character data, so as to improve the fluency and processing efficiency of the subsequent cleaning process.
- the present invention uses multiple search methods to obtain reference person data, which increases the diversity and accuracy of obtaining reference person data.
- the present invention uses preliminary cleaning and in-depth cleaning for data cleaning to obtain the target face data associated with the target person identification, and in the deep cleaning, the official photo list of the Asian target person identification is updated, that is, the reference database is updated to improve the person Accuracy of facial feature comparison.
- FIG. 1 is a schematic flowchart of the method for automatically collecting and cleaning Asian face data according to this embodiment
- FIG. 2 is a schematic diagram of a data list of the method for automatically collecting and cleaning Asian face data according to this embodiment
- FIG. 3 is a schematic diagram of the effect when the cleaning work is not performed in the method for automatically collecting and cleaning Asian face data according to this embodiment
- FIG. 4 is a schematic diagram of the preliminary cleaning effect in the method for automatic collection and cleaning of Asian face data according to this embodiment
- FIG. 5 is a schematic diagram of the effect of Asian face data collection in the method for automatically collecting and cleaning Asian face data according to this embodiment.
- a method for automatically collecting and cleaning Asian face data includes the following steps:
- the multiple Asian target person identifications in step S1 are the names of different Asian target persons or preset numerical numbers used to distinguish different Asian target persons; for example, Baidu Company provides a "Baidu Baike Star Popularity List”
- the list contains sub-modules such as "venue China Male Celebrities List”, “venue China Female Celebrities List”, “Hong Kong and Taiwan Southeast Asia Male Celebrities List”, “Hong Kong and Taiwan Southeast Asia Female Celebrities List” and other sub-modules, which can be automatically Obtain the selected list, record the names of the stars in the Asian stars sub-module on the list in a data list, and optionally use 0 as the starting and gradually increasing integer as the number label to distinguish the Asian stars; this
- the embodiment uses a public figure as an Asian target person identification has two advantages. In addition to easily obtaining a large number of photos of a specified target person through the identification on the search engine, it can also avoid any privacy and infringement issues caused by the use of these photos;
- the official photo link in step S1 is the URL link of the official photo of the target person in Asia that is provided by Baidu Baike for download in this embodiment.
- "Baidu Encyclopedia Star Popularity List” will display the celebrity’s official photo and name in turn according to the star’s real-time popularity. Clicking on the celebrity’s official photo or name will enter the corresponding Baidu Encyclopedia introduction page.
- the star can be obtained through crawler technology. Display the URL link of the official photo on the page and record the link on the data list.
- the reference person data contains multiple key information of Asian target person identification, and each row corresponds to an Asian target person identification and its corresponding official photo URL link, from the left To the right are the number label, name, and URL link, with a tab character' ⁇ t' as the interval;
- step S2 The specific steps of step S2 are:
- the computer technology is optional but not limited to web crawlers, downloader tools, etc.;
- the Asian target person ID Search for the Asian target person ID, and obtain at least one reference person data associated with a single ID; specifically, if the Asian target person ID is the name of the target person, then the reference person data is related photos of the Asian target person, for example
- a Python script can be used to simulate the process of manually searching and downloading pictures, and a large amount of reference character data can be easily obtained by searching for the Asian target person identification on the Baidu picture search engine;
- This embodiment also performs a search on the target person’s identity plus keywords, and respectively obtains at least one associated reference character data corresponding to different keywords on a single identity combination.
- the keywords are selectable but not limited to glasses, hats, actors, Singer (professional), etc., can obtain various reference data such as "target person name + glasses”, “target person name + hat”, “target person name + occupation”, which can increase the diversity and accuracy of obtaining reference person data Sex
- At least one reference person data is associated and stored with the corresponding Asian target person identifier, and the data with keywords added, until all the reference person data are associated and stored;
- the photos are collected and stored for each Asian target person identification in the data list, for example:
- step S4 The specific steps of step S4 include:
- the image processing tool uses the image processing tool to check the readability and unify the format of all reference character data, and remove the small part of the reference character data that cannot be read and written due to download errors, format errors, etc., among which, the image processing tool is optional but It is not limited to image processing software or programming languages such as MATLAB, Python, OpenCV, Photoshop, etc.; for example, when downloading images in batches from the Internet through crawler methods, they are often affected by network fluctuations and anti-crawler mechanisms, resulting in incomplete image content, Download errors and other problems. Such pictures cannot be read and written normally by the software, which will seriously affect the fluency of data cleaning. They should be removed to improve the efficiency of data processing. In addition, for the convenience of subsequent processing and data management, the data cleaning work Before starting, you can choose MATLAB to unify all reference character data into the common JEPG format;
- repeated downloading is based on the same standard as the file name of the reference character data; for example, to search for a public figure, first create a home folder, and the search methods are "someone's name, somebody's name+hat, somebody's name+glasses, some Person name + singer" etc., so several subfolders name, hat, glass, job, etc. are created in the main folder. The official photos are saved in the standard subfolder.
- the reference character data can be obtained in the search engine through different combinations. The engine will inevitably return part of the reference character data with the same naming and content.
- This repeated data can easily lead to over-fitting of the neural network and seriously affect the face recognition performance of the network, so the purpose of this step is to target the sub-folders For all the files with the same name, delete the duplicate and keep only one; the official photos in the standard subfolder of this embodiment are retained, and the official photos downloaded from the URL can be named standard.jpg, and other batch search downloads can be used.
- the photo retains the image naming of the source network, so that the official photo naming in the standard subfolder does not repeat with other pictures; or it is not necessary to change the official photo naming in the standard subfolder, adding a judgment link. If one of the pictures is repeated If the picture is located in the standard sub-folder, the pictures in the standard sub-file are retained.
- step S42 The specific steps of step S42 include:
- the program uses the program to record the file naming of the reference character data in the subfolder associated with the Asian target character in turn.
- the program is optional but not limited to MATLAB, Python and other languages;
- step S43 The specific steps of step S43 include:
- the face detection algorithm can use, but is not limited to, deep learning methods such as MTCNN, and deep learning methods such as MTCNN. Including a series of steps such as face detection, face correction, and face alignment. Among them, face detection can eliminate some non-face data, such as only glasses, hats and other pictures. Face correction and face alignment can The lateral face is corrected and aligned to improve the processing efficiency of subsequent face feature matching.
- the reference person data when no cleaning work is performed, the reference person data includes reference face data and reference non-face data.
- the preliminary cleaning work performed in this embodiment is to remove the reference non-face data in the reference person data.
- the methods include the MTCNN deep learning method and the face detection toolkit that comes with the OpenCV software.
- the specific principle is to detect and locate the five key points (eyes, nose, and mouth corners) of the face in the photo through the algorithm and return to the face Frame, you can judge whether the reference person data is reference face data according to whether the face frame is returned; if the face frame is not returned, the photo is deleted; if a face frame is returned, a face frame is cropped out
- the reference face data of the image is retained, and the photo is retained; if more than one face frame is returned, the reference face data in more than one face frame are respectively cropped, and the reference person data processed by face detection is retained at the same time, and then based on
- the face recognition algorithm selects the reference face data with the highest matching degree with the official photo of the target person in Asia, and deletes the rest of the reference face data;
- step S44 The specific steps of step S44 include:
- the official photo list of the target person is not only the official photo, but also includes the target person’s photo with high matching degree obtained by the target person’s keyword search, because the person of the target person in Asia obtained through different search methods
- the face data of the Asian target person obtained through keyword search is compared with the official photo of the target person obtained only through the URL link of the official photo of the Asian target person in step S2. It is easy to have a low matching degree.
- the official photo is a frontal face photo, it is not decorated with hats, glasses and other accessories, and the photos searched by adding keywords (such as hats, glasses) (after face recognition screening) are not decorated with hats, Comparing official photos of accessories such as glasses, it is prone to matching deviations. Therefore, it is necessary to add the highly matched photos of the target person to the target person's official photo list by adding the keyword search to the target person, and update the target person's official photo list. List of photos to increase the accuracy of facial feature comparison;
- the remaining preliminary cleaned reference face data is matched with the data in the updated official photo list of the target person one by one, and the remaining preliminary cleaned reference face data whose matching degree is greater than or equal to the second preset threshold is retained , And the remaining remaining reference face data after preliminary cleaning is deleted.
- a face recognition algorithm is used to sequentially extract the feature vector of the Asian target person corresponding to the reference face data after preliminary cleaning, and at the same time set the first preset threshold to 0.9, and then the feature vector of the reference face data is compared with The feature vectors of official photos are matched one by one.
- the reference face data whose matching degree is greater than or equal to 0.9 can be filtered out, and these photos can be considered as official photos of the target person; then, the second preset threshold Set it to 0.7, and then match the feature vector of the remaining reference face data with the feature vector of the photo in the official photo queue selected in the first round, and match the feature vector of any photo in the official photo queue.
- the first preset threshold and the second preset threshold used in the embodiment can be adjusted according to actual conditions.
- This embodiment also provides a system for automatically collecting and cleaning Asian face data, including: a data list building module, a reference person data acquisition module, an associated storage module, and a reference person data cleaning module;
- a data list construction module a reference character data acquisition module, an associated storage module, and a reference character data cleaning module;
- the data list building module is used to obtain the official photo link of the Asian target person by presetting multiple Asian target person identifications, and build a data list;
- the reference person data acquisition module is used to obtain the data list and the Asian target person Identification and reference person data associated with person identification keywords;
- the associated storage module is used to associate the reference person data with the corresponding Asian target person identification and keywords;
- the reference person data cleaning module is used to clean up the stored references Person data, to obtain target face data associated with the target person in Asia.
- the reference person data cleaning module includes a preliminary cleaning submodule and a deep cleaning submodule.
- the preliminary cleaning submodule is used to perform face detection on the reference person data using a face detection algorithm to obtain the face detection processed
- the deep cleaning sub-module is used to use face recognition algorithms to detect the reference face data processed by face detection, update the official photo list of the Asian target person identification, and check whether it matches the official photo list of the Asian target person identification.
- the matched reference face data is used as the target face data associated with the target person in Asia.
- the entire process from the collection of Asian face data to the cleaning up replaces the traditional manual labeling, classification and other heavy procedures through automatic processing, which greatly reduces the time cost of establishing an Asian face database and also solves the problem. Issues such as imbalance of categories in the face database promote the development and progress of corresponding technologies.
Abstract
Description
Claims (9)
- 一种亚洲人脸数据自动收集及清理的方法,其特征在于,包括下述步骤:预设多个亚洲目标人物标识,获取亚洲目标人物官方照片链接,构建数据列表,所述数据列表包括多个亚洲目标人物标识关键信息;根据数据列表内容,对所述亚洲目标人物标识、以及人物标识增加关键词后进行搜索,获取参考人物数据,具体步骤包括:根据所述亚洲目标人物官方照片链接,获得亚洲目标人物的官方照片;对所述亚洲目标人物标识进行搜索,获取单一标识情况下的相关联参考人物数据;对所述亚洲目标人物标识加上多种关键词进行搜索,分别获取单一标识组合上不同关键词对应的相关联参考人物数据;将搜索后的参考人物数据与对应的亚洲目标人物标识、以及关键词进行关联存储,直到所有参考人物数据均关联存储完毕;清理存储完毕的参考人物数据,得到与亚洲目标人物关联的目标人脸数据。
- 根据权利要求1所述的亚洲人脸数据自动收集及清理的方法,其特征在于,所述亚洲目标人物标识采用不同亚洲目标人物的名字或者用于区分不同亚洲目标人物的数字编号,所述亚洲目标人物官方照片链接采用URL链接,所述数据列表每一行对应一个亚洲目标人物标识及对应的官方照片URL链接。
- 根据权利要求1所述的亚洲人脸数据自动收集及清理的方法,其特征在于,所述将搜索后的参考人物数据与对应的亚洲目标人物标识、以及关键词进行关联存储,具体步骤为:创建主文件夹,所述主文件夹采用亚洲目标人物标识进行命名,在主文件夹内创建多个子文件夹,分别采用关键词进行命名;将通过不同搜索方式获取的参考人物数据关联地保存在相应的子文件夹中。
- 根据权利要求1所述的亚洲人脸数据自动收集及清理的方法,其特征在 于,所述清理存储完毕的参考人物数据,具体步骤为:采用图片处理工具对所有参考人物数据进行可读性检验和格式统一,剔除无法正常读写的参考人物数据;删除获取到的重复下载参考人物数据,所述亚洲目标人物的官方照片保留;初步清理:采用人脸检测算法对参考人物数据进行人脸检测,得到经人脸检测处理后的参考人脸数据;深度清理:采用人脸识别算法检测经人脸检测处理后的参考人脸数据,更新亚洲目标人物标识官方照片列表,检验是否与亚洲目标人物标识官方照片列表匹配,若不匹配,则删除经人脸检测处理后的参考人脸数据,若匹配,则保留经人脸检测处理后的参考人脸数据,作为与亚洲目标人物关联的目标人脸数据。
- 根据权利要求4所述的亚洲人脸数据自动收集及清理的方法,其特征在于,所述删除获取到的重复下载参考人物数据,具体步骤为:参考人物数据与对应的亚洲目标人物标识、以及关键词关联存储到文件夹后,所述重复下载采用参考人物数据的文件命名是否相同为标准,若存在重复下载,保留重复下载参考人物数据中的其中一个,其余重复参考人物数据作删除处理。
- 根据权利要求4所述的亚洲人脸数据自动收集及清理的方法,其特征在于,所述采用人脸检测算法对参考人物数据进行人脸检测,具体步骤为:通过人脸检测算法定位参考人物数据中人脸关键点的位置并检测人脸框;若不存在人脸框,则删除参考人物数据;若存在一个人脸框,则裁剪出人脸框内的参考人脸数据,保留参考人物数据;若存在多个人脸框,则分别裁剪出多个人脸框内的各个参考人脸数据,同 时保留参考人物数据,提取对应亚洲目标人物的官方照片和各个参考人脸数据的人脸特征,分别计算各个参考人脸数据的人脸特征与对应亚洲目标人物的官方照片的人脸特征的匹配度,将匹配度最高的参考人脸数据所对应的参考人物数据作为经人脸检测处理后的参考人脸数据保留。
- 根据权利要求4所述的亚洲人脸数据自动收集及清理的方法,其特征在于,所述深度清理的具体步骤为:基于人脸识别算法提取亚洲目标人物官方照片和对应初步清理后的参考人脸数据的人脸特征;分别计算出亚洲目标人物官方照片的人脸特征与对应初步清理后的参考人脸数据的人脸特征的匹配度,将匹配度大于或等于第一预设阈值的参考人脸数据归入目标人物官方照片列表,更新目标人物官方照片列表;将剩余初步清理后参考人脸数据的人脸特征与更新后的亚洲目标人物官方照片列表中的人脸特征进行逐一匹配,保留匹配度大于或等于第二预设阈值的参考人脸数据,删除其余的参考人脸数据;所述第一预设阈值大于第二预设阈值。
- 一种亚洲人脸数据自动收集及清理的系统,其特征在于,包括:数据列表构建模块、参考人物数据获取模块、关联存储模块和参考人物数据清理模块;所述数据列表构建模块用于通过预设多个亚洲目标人物标识,获取亚洲目标人物官方照片链接,构建数据列表;所述参考人物数据获取模块用于根据数据列表内容获取与亚洲目标人物标识、以及人物标识关键词相关联的参考人物数据;所述关联存储模块用于将参考人物数据与对应的亚洲目标人物标识、以及关键词进行关联存储;所述参考人物数据清理模块用于清理存储完毕的参考人物数据,得到与亚 洲目标人物关联的目标人脸数据。
- 根据权利要求8所述的亚洲人脸数据自动收集及清理的系统,其特征在于,所述参考人物数据清理模块包括初步清理子模块和深度清理子模块,所述初步清理子模块用于采用人脸检测算法对参考人物数据进行人脸检测,得到经人脸检测处理后的参考人脸数据,所述深度清理子模块用于采用人脸识别算法检测经人脸检测处理后的参考人脸数据,更新亚洲目标人物标识官方照片列表,检验是否与亚洲目标人物标识官方照片列表匹配,将匹配的参考人脸数据作为与亚洲目标人物关联的目标人脸数据。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910977959.3 | 2019-10-15 | ||
CN201910977959.3A CN110807108A (zh) | 2019-10-15 | 2019-10-15 | 一种亚洲人脸数据自动收集及清洗的方法和系统 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021072998A1 true WO2021072998A1 (zh) | 2021-04-22 |
Family
ID=69488429
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/070658 WO2021072998A1 (zh) | 2019-10-15 | 2020-01-07 | 一种亚洲人脸数据自动收集及清理的方法和系统 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN110807108A (zh) |
WO (1) | WO2021072998A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115083572A (zh) * | 2022-07-25 | 2022-09-20 | 广州思德医疗科技有限公司 | 图片存储和提取方法、系统、设备及介质 |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111680202B (zh) * | 2020-04-24 | 2022-04-26 | 烽火通信科技股份有限公司 | 一种基于本体的人脸图像数据收集方法和装置 |
CN112418169A (zh) * | 2020-12-10 | 2021-02-26 | 上海芯翌智能科技有限公司 | 一种对人体属性数据进行处理的方法及设备 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102938065A (zh) * | 2012-11-28 | 2013-02-20 | 北京旷视科技有限公司 | 基于大规模图像数据的人脸特征提取方法及人脸识别方法 |
CN106844412A (zh) * | 2016-11-02 | 2017-06-13 | 厦门中控生物识别信息技术有限公司 | 一种人脸数据收集方法及装置 |
CN106874898A (zh) * | 2017-04-08 | 2017-06-20 | 复旦大学 | 基于深度卷积神经网络模型的大规模人脸识别方法 |
CN109034106A (zh) * | 2018-08-15 | 2018-12-18 | 北京小米移动软件有限公司 | 人脸数据清洗方法及装置 |
CN109241310A (zh) * | 2018-07-25 | 2019-01-18 | 南京甄视智能科技有限公司 | 人脸图像数据库的数据去重方法与系统 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103984738B (zh) * | 2014-05-22 | 2017-05-24 | 中国科学院自动化研究所 | 一种基于搜索匹配的角色标注方法 |
JP6345276B2 (ja) * | 2014-06-16 | 2018-06-20 | ベイジン センスタイム テクノロジー デベロップメント カンパニー, リミテッド | 顔認証方法およびシステム |
US10943096B2 (en) * | 2017-12-31 | 2021-03-09 | Altumview Systems Inc. | High-quality training data preparation for high-performance face recognition systems |
CN109063784B (zh) * | 2018-08-23 | 2021-03-05 | 深圳码隆科技有限公司 | 一种人物服饰图像数据筛选方法及其装置 |
-
2019
- 2019-10-15 CN CN201910977959.3A patent/CN110807108A/zh active Pending
-
2020
- 2020-01-07 WO PCT/CN2020/070658 patent/WO2021072998A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102938065A (zh) * | 2012-11-28 | 2013-02-20 | 北京旷视科技有限公司 | 基于大规模图像数据的人脸特征提取方法及人脸识别方法 |
CN106844412A (zh) * | 2016-11-02 | 2017-06-13 | 厦门中控生物识别信息技术有限公司 | 一种人脸数据收集方法及装置 |
CN106874898A (zh) * | 2017-04-08 | 2017-06-20 | 复旦大学 | 基于深度卷积神经网络模型的大规模人脸识别方法 |
CN109241310A (zh) * | 2018-07-25 | 2019-01-18 | 南京甄视智能科技有限公司 | 人脸图像数据库的数据去重方法与系统 |
CN109034106A (zh) * | 2018-08-15 | 2018-12-18 | 北京小米移动软件有限公司 | 人脸数据清洗方法及装置 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115083572A (zh) * | 2022-07-25 | 2022-09-20 | 广州思德医疗科技有限公司 | 图片存储和提取方法、系统、设备及介质 |
Also Published As
Publication number | Publication date |
---|---|
CN110807108A (zh) | 2020-02-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021072998A1 (zh) | 一种亚洲人脸数据自动收集及清理的方法和系统 | |
US9430719B2 (en) | System and method for providing objectified image renderings using recognition information from images | |
US8649572B2 (en) | System and method for enabling the use of captured images through recognition | |
US9069795B2 (en) | System and process for building a catalog using visual objects | |
US7809722B2 (en) | System and method for enabling search and retrieval from image files based on recognized information | |
WO2015101289A1 (zh) | 图片管理方法、装置及系统 | |
US20170024384A1 (en) | System and method for analyzing and searching imagery | |
US20060251292A1 (en) | System and method for recognizing objects from images and identifying relevancy amongst images and information | |
CN103793697A (zh) | 一种人脸图像的身份标注方法及人脸身份识别方法 | |
CN103824053A (zh) | 一种人脸图像的性别标注方法及人脸性别检测方法 | |
Bharati et al. | Beyond pixels: Image provenance analysis leveraging metadata | |
WO2021012793A1 (zh) | 基于大数据分析的律师推荐方法及相关设备 | |
CN105183950B (zh) | 一种基于移动终端查阅工程图纸的方法及系统 | |
CN106713859A (zh) | 一种图片视觉监控搜索系统及其搜索方法 | |
WO2013034050A1 (zh) | 一种社区网站页面的图片检索方法和系统 | |
CN113761242A (zh) | 一种基于人工智能的大数据图像识别系统及方法 | |
CN109359090A (zh) | 基于卷积神经网络的文件碎片分类方法及系统 | |
US10198664B2 (en) | Auxiliary observing method and auxiliary observing apparatus | |
CN103093213B (zh) | 视频文件分类方法及终端 | |
US10824811B2 (en) | Machine learning data extraction algorithms | |
CN114611618A (zh) | 一种面向跨模态检索的数据采集处理方法及系统 | |
CN113361395A (zh) | 一种基于多任务学习模型的ai换脸视频检测方法 | |
WO2020191706A1 (zh) | 主动学习自动图像标注系统及方法 | |
Hu et al. | Semi-automatic annotation of distorted image based on neighborhood rough set | |
Rúnarsson | A face recognition plug-in for the PhotoCube browser |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20876846 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20876846 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 10/10/2022 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20876846 Country of ref document: EP Kind code of ref document: A1 |