WO2020155939A1 - Procédé et dispositif de reconnaissance d'image, support de stockage et processeur - Google Patents

Procédé et dispositif de reconnaissance d'image, support de stockage et processeur Download PDF

Info

Publication number
WO2020155939A1
WO2020155939A1 PCT/CN2019/127817 CN2019127817W WO2020155939A1 WO 2020155939 A1 WO2020155939 A1 WO 2020155939A1 CN 2019127817 W CN2019127817 W CN 2019127817W WO 2020155939 A1 WO2020155939 A1 WO 2020155939A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
model
training
sets
accuracy
Prior art date
Application number
PCT/CN2019/127817
Other languages
English (en)
Chinese (zh)
Inventor
张玉兵
Original Assignee
广州视源电子科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州视源电子科技股份有限公司 filed Critical 广州视源电子科技股份有限公司
Publication of WO2020155939A1 publication Critical patent/WO2020155939A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition

Definitions

  • This application relates to the field of image recognition, for example, to an image recognition method, device, storage medium, and processor.
  • image recognition models are all trained based on deep learning algorithm models.
  • the quality of deep learning model training has an impact on recognition accuracy.
  • the data set used for training is the top priority, which will have a decisive impact on the final algorithm performance of the deep learning model.
  • Deep learning models are basically performed on a single training data set.
  • the training data set can be face data collected in a certain scene or a public face database downloaded from the Internet. Because different data sets may cover the same person, and because the naming rules between different data sets are not uniform, it is difficult to merge face pictures of the same person according to their file names. In the face recognition classification training, the face pictures of the same person must be required to share the same label category number, so it is impossible to use multiple face data sets that may have intersections at the same time.
  • a deep learning model trained only on a single training data set has low accuracy in image recognition and cannot meet the needs of different applications.
  • the embodiments of the present application provide an image recognition method, device, storage medium, and processor to at least solve the problem of low recognition accuracy of the image recognition method in the related art.
  • an image recognition method which includes: obtaining an image to be recognized; obtaining a pre-established image recognition model, wherein the image recognition model is obtained by training the initial model through multiple training sets Yes, the initial model is a recognition model based on the branch training algorithm, the same training set is extracted from the same data set, and different training sets are extracted from different data sets; the image recognition model is used to recognize the image to be recognized, Get the recognition result.
  • the above method further includes: acquiring multiple data sets; classifying each image in the multiple data sets to obtain a label of each image, wherein the label is used to represent the classification result of each image, and The labels of at least two images contained in each data set are the same; sample images are extracted from each data set after classification to obtain multiple training sets.
  • the above method before extracting sample images from each classified data set to obtain multiple training sets, the above method further includes: extracting preset features of each image in each classified data set; The preset features of images are aligned for each image; sample images are extracted from each data set after the operation to obtain multiple training sets.
  • the preset feature when each image is a face image, includes at least one of the following: eyes, eyebrows, nose tip, and mouth corners.
  • extracting sample images from each data set after the operation to obtain multiple training sets includes: randomly extracting sample images from each data set after the operation; obtaining the storage path and labels of the sample images to obtain multiple training sets. Training sets.
  • acquiring multiple data sets includes: acquiring a video image and a preset data set collected by an acquisition device; and detecting the video image and the preset data set to obtain multiple data sets.
  • the above method further includes: establishing an initial model based on a branch training algorithm, where the initial model at least includes: multiple loss functions, multiple loss functions corresponding to multiple training sets one-to-one; multiple training sets Set parallel input into the initial model to train the initial model; determine whether the trained model meets the preset conditions; if the trained model meets the preset conditions, the trained model is determined to be an image recognition model.
  • inputting multiple training sets into the initial model in parallel to train the initial model includes: inputting multiple training sets into the initial model in parallel to obtain function values of multiple loss functions; according to the multiple loss functions The function value of and the chain derivation algorithm to obtain the gradient value of each parameter in the initial model; the gradient value of each parameter is updated according to the stochastic gradient descent algorithm to obtain the trained model.
  • judging whether the model obtained by training satisfies a preset condition includes: obtaining a verification set; verifying the model obtained by using the verification set to obtain the accuracy of the trained model; judging the accuracy of the trained model Whether the historical accuracy is the same, where the historical accuracy is the accuracy obtained by the trained model in the last verification process; if the accuracy of the trained model is the same as the historical accuracy, it is determined that the trained model meets the preset conditions.
  • the accuracy of the trained model is determined to be the historical accuracy, and the initial model is continuously trained.
  • accuracy is used to characterize the ratio of the sum of the verification results of all verification samples in the verification set to the total number of all verification samples.
  • obtaining the verification set includes: obtaining images other than the sample images in the multiple data sets; randomly extracting image verification pairs from other images to obtain the verification set.
  • the image verification pair includes: a positive sample pair and a negative sample pair, the positive sample pair includes two images with the same label, and the negative sample pair includes two images with different labels.
  • the loss function is a square loss function.
  • an image recognition device including: a first acquisition module for acquiring an image to be recognized; a second acquisition module for acquiring a pre-established image recognition model, wherein The image recognition model is obtained by training the initial model through multiple training sets.
  • the initial model is a recognition model established based on the branch training algorithm.
  • the same training set is extracted from the same data set, and different training sets are derived from different It is extracted from the data set; the recognition module is used to recognize the image to be recognized by using the image recognition model to obtain the recognition result.
  • a storage medium includes a stored program, wherein the device where the storage medium is located is controlled to execute the above-mentioned image recognition method when the program runs.
  • a processor is also provided, which is configured to run a program, wherein the image recognition method described above is executed when the program is running.
  • an initial model can be established based on a branch training algorithm, and the initial model can be trained through multiple training sets generated from different data sets to obtain an image recognition model.
  • the image recognition model is used to perform the image recognition input by the user. Recognize, get the final recognition result.
  • the image recognition model that combines branch training with multiple data sets has a higher accuracy rate than the image recognition model trained based on a single data set, and achieves the technical effect of improving the recognition accuracy, thereby solving related technologies.
  • Fig. 1 is a flowchart of an image recognition method according to an embodiment of the present application
  • Fig. 2 is a schematic diagram of an optional face picture according to an embodiment of the present application.
  • Fig. 3 is a schematic diagram of an optional aligned face picture according to an embodiment of the present application.
  • FIG. 4 is a schematic diagram of an optional face recognition deep neural network model based on a single data set input according to an embodiment of the present application
  • Fig. 5 is a schematic diagram of an optional deep neural network model for face recognition based on input of multiple data sets according to an embodiment of the present application
  • Fig. 6 is a flowchart of an optional image recognition method according to an embodiment of the present application.
  • Fig. 7 is a schematic diagram of an image recognition device according to an embodiment of the present application.
  • an embodiment of an image recognition method is provided.
  • the steps shown in the flowchart of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions, and although shown in the flowchart The logical order is shown, but in some cases, the steps shown or described can be performed in a different order than here.
  • Fig. 1 is a flowchart of an image recognition method according to an embodiment of the present application. As shown in Fig. 1, the method includes the following steps:
  • Step S102 Obtain an image to be recognized.
  • the above-mentioned image to be recognized may be an image that needs to be recognized.
  • a face image is taken as an example for description.
  • Step S104 Obtain a pre-established image recognition model.
  • the image recognition model is obtained by training the initial model through multiple training sets.
  • the initial model is a recognition model established based on the branch training algorithm.
  • the same training set is from the same Extracted from one data set, and different training sets are extracted from different data sets.
  • multiple training sets may be constructed through multiple different data sets in advance, and the initial model may be trained through the training sets, so as to obtain the final image recognition model.
  • the branch training method can be combined to build a deep neural network model to obtain the initial model. By separating different data sets for branch training, a trained image recognition model can be obtained, and the trained image recognition model can be deployed to application scenarios.
  • Step S106 using the image recognition model to recognize the image to be recognized, and obtain the recognition result.
  • the face recognition process can be performed by comparing the facial feature feat-ID (using Euclidean distance).
  • an initial model can be established based on a branch training algorithm, and the initial model can be trained through multiple training sets generated from different data sets to obtain an image recognition model.
  • the image recognition model is used to perform the image recognition input by the user. Recognize, get the final recognition result.
  • the image recognition model that combines branch training with multiple data sets has a higher accuracy rate than the image recognition model trained based on a single data set, and achieves the technical effect of improving the recognition accuracy, thereby solving related technologies.
  • the technical problem of the low recognition accuracy of the image recognition method is based on a branch training algorithm, and the initial model can be trained through multiple training sets generated from different data sets to obtain an image recognition model.
  • the image recognition model is used to perform the image recognition input by the user. Recognize, get the final recognition result.
  • the image recognition model that combines branch training with multiple data sets has a higher accuracy rate than the image recognition model trained based on a single data set, and achieves the technical effect of improving the recognition accuracy, thereby solving related technologies.
  • the method further includes: acquiring multiple data sets; classifying each image in the multiple data sets to obtain a label of each image, wherein the label is used to characterize each image
  • the labels of at least two images contained in multiple data sets are the same; sample images are extracted from each data set after classification to obtain multiple training sets.
  • face pictures in different application scenarios may be obtained in advance to obtain multiple data sets. Since public face data sets downloaded from the Internet are generally already labeled, for unlabeled data sets, face images can be manually detected and extracted, classified and labeled, and face images belonging to the same person are placed Put them together and label them, and get a label for each photo. Suppose the total number of people is N, and each person has M face pictures. A certain number of face images can be randomly selected from each data set that has been labeled to obtain each training set.
  • the method before extracting sample images from each classified data set to obtain multiple training sets, the method further includes: extracting a preview of each image in each classified data set. Set features; based on the preset features of each image, perform alignment operations on each image; extract sample images from each data set after the operation to obtain multiple training sets.
  • the preset feature includes at least one of the following: eyes, eyebrows, nose tip, and mouth corners.
  • the angle of the face and the position of the face in the face picture are inconsistent.
  • the image is aligned to remove the influence of face angle on face recognition.
  • the key points include the positions of the eyes, nose tip, and mouth corners, as shown in Figure 2.
  • the aligned face is shown in Figure 3.
  • extracting sample images from each data set after the operation to obtain multiple training sets includes: randomly extracting sample images from each data set after the operation; and obtaining the storage path of the sample images And labels to get multiple training sets.
  • a face image containing both face identity information and verification information may be randomly selected from face images that have been annotated and face aligned to obtain sample images.
  • Each training sample extracted is as follows: Face picture img_1, identity information (category number) of img_1, ..., face picture img_N, identity information (category number) of img_N.
  • the face picture img_1 refers to the storage path of the first face picture
  • the category number refers to the pre-labeled label for the person
  • the category number generally starts from 0.
  • Different labels represent the numerical codes for different people in the same data set. For example, if there are 100 people in the first data set, the category numbers are 1-0, 1-1, 1-2,..., 1-99; the second data set or scene covers 50 people, then the category numbers are respectively It is 2-0, 2-1, 2-2,..., 2-49.
  • the two groups of category numbers are not the same, they come from different data sets.
  • acquiring multiple data sets includes: acquiring a video image and a preset data set collected by an acquisition device; and detecting the video image and the preset data set to obtain multiple data sets.
  • the collection device in the field of face recognition, can be a camera installed in different application scenarios.
  • the camera is used to collect video pictures and stored in a computer system through network transmission and data lines.
  • the application scenario can be engineering Use scenarios corresponding to the project, such as bank remote teller machine (Video Teller Machine, VTM) verification, jewelry store VIP identification, etc.
  • the aforementioned preset data set may be a public face data set downloaded from the Internet.
  • the face data sets obtained by the above methods may cover the same people.
  • the photos of customers captured by cameras in banks and jewelry stores may also appear on the Internet and be sorted into public face data sets.
  • the public face data sets A and B on the Internet may also contain face pictures of the same person.
  • face detection is performed on the collected video pictures, and the face pictures are extracted and stored in the hard disk of the computer system.
  • the method further includes: establishing an initial model based on a branch training algorithm, where the initial model at least includes: multiple loss functions, and multiple loss functions have a one-to-one correspondence with multiple training sets ; Input multiple training sets into the initial model in parallel to train the initial model; determine whether the trained model meets the preset conditions; if the trained model meets the preset conditions, determine that the trained model is an image recognition model.
  • Softmax loss loss function SoftmaxLoss 1.
  • Different data sets can be divided for branch training and input into the same image recognition model in parallel.
  • the aligned face images in the i-th data set are forward-propagated to obtain features, they are connected to the corresponding loss function SoftmaxLoss i, Optimize as an independent objective function.
  • the image recognition model shown in FIG. 4 and FIG. 5 shows a schematic diagram of a simplified general residual network.
  • the loss function is a square loss function.
  • multiple loss functions in the initial model may be square loss functions.
  • the aforementioned preset condition may be a training end judgment condition.
  • the model obtained by training satisfies the preset condition, it is determined that the training ends, and the final trained model is a trained image recognition model.
  • inputting multiple training sets into the initial model in parallel to train the initial model includes: inputting multiple training sets into the initial model in parallel to obtain function values of multiple loss functions; According to the function values of multiple loss functions and the chain derivation algorithm, the gradient value of each parameter in the initial model is obtained; the gradient value of each parameter is updated according to the stochastic gradient descent algorithm to obtain the trained model.
  • the function value Loss of the loss function can be obtained through branch training, and the image recognition model shown in Figure 5 can be obtained according to Loss and the chain derivation algorithm.
  • the gradient value of each parameter updates the model parameters according to the stochastic gradient descent algorithm to obtain a trained model. After the trained model meets the training end judgment condition, the trained model can be determined as the final image recognition model.
  • judging whether the model obtained by training satisfies preset conditions includes: obtaining a verification set; verifying the model obtained by using the verification set to obtain the accuracy of the trained model; Whether the accuracy of the model is the same as the historical accuracy, where the historical accuracy is the accuracy of the trained model in the last verification process; if the accuracy of the trained model is the same as the historical accuracy, it is determined that the trained model meets the preset condition.
  • the currently trained model can be tested on the validation set every fixed number of iterations. As the model is trained, the trained model will be tested on the validation set. The accuracy will continue to improve, but as the model continues to be trained, when the model tends to converge or overfitting occurs, the accuracy of the model on the validation set will no longer increase steadily, indicating that the model training can be stopped.
  • accuracy is used to characterize the ratio of the sum of the verification results of all verification samples in the verification set to the total number of all verification samples.
  • the verification set is composed of a verification pair of randomly selected face pictures.
  • the number of face image verification pairs in the verification set is 6000 pairs.
  • the aforementioned historical accuracy may be the accuracy of the trained model obtained when the trained model was verified last time. If during this verification process, the accuracy of the trained model is the same as the historical accuracy, that is, the accuracy of the trained model is no longer steadily improving, the training can be determined to end, and the trained model will be used as the final image recognition model.
  • the accuracy of the trained model is determined to be the historical accuracy, and the initial model is continuously trained.
  • the accuracy of the trained model is different from the historical accuracy, that is, the trained model does not meet the preset conditions, it is determined that the training has not ended and the training needs to be continued. As the historical accuracy during the next model verification. It is judged again whether the accuracy of the trained model is the same as the historical accuracy, so as to determine whether the trained model meets the preset conditions.
  • obtaining the verification set includes: obtaining images other than the sample images in the multiple data sets; and randomly extracting image verification pairs from other images to obtain the verification set.
  • the image verification pair includes: a positive sample pair and a negative sample pair, the positive sample pair includes two images with the same label, and the negative sample pair includes two images with different labels.
  • the face pictures of the remaining N-K individuals can be used for the preparation of the verification set.
  • the verification set consists of randomly selected face photo verification pairs. Positive sample pairs and negative sample pairs are drawn. The number of positive and negative sample pairs is the same. For a verification set containing 6000 pairs of face image verification pairs, each positive and negative sample pair Take 3000 pairs. Among them, the positive sample pair is the ath picture of the nth person, and the bth picture of the nth person; the negative sample pair is the cth picture of the i-th person, and the dth picture of the jth person.
  • the verification result can be determined to be correct; when the image recognition model judges the two face pictures in the negative sample pair as not the same person, it can confirm the verification The result is correct; otherwise, the verification result is wrong.
  • Fig. 6 is a flowchart of an optional image recognition method according to an embodiment of the present application, taking the field of face recognition as an example for description.
  • the method includes: collecting face pictures in multiple scenes ; Perform face detection on the collected face pictures, extract the face pictures and store them in the computer hard disk; manually classify and label the detected and extracted face pictures, and place the face pictures belonging to the same person Mark them together and mark them together; perform key point alignment operations on face images to remove the impact of face angles on face recognition; randomly select photos that have been marked and aligned to contain face identity information and verification
  • the face image pair of the information is trained, that is, the face identity-verification training set is extracted; combined with the branch training algorithm to build a face recognition deep neural network model, the model contains multiple loss functions; face recognition based on multiple data sets
  • the deep neural network model is trained to obtain a trained network model; to determine whether the test accuracy of the trained network model on the verification set is continuously improving, that is, to determine whether the training end condition is reached; if it is not met,
  • the solution provided by the above embodiments can be used in the bank VIP recognition project to collect face pictures in real application scenarios, and at the same time download some public face data sets from the Internet; detect the face pictures in these data sets Align the operation, and make the corresponding face identity-verification training set; use the method described above to train the face recognition algorithm model, so as to obtain the face recognition algorithm with high recognition rate and recognition effect in the bank VIP recognition scene, This method can better combine the face data information in multiple data sets, so as to obtain a face recognition model with better recognition effect.
  • the branch training facial deep neural network model that combines multiple data sets has a higher accuracy than the general deep learning network based on a single data set training (including successive fine-tuning on multiple data sets).
  • an embodiment of an image recognition device is provided.
  • Fig. 7 is a schematic diagram of an image recognition device according to an embodiment of the present application. As shown in Fig. 7, the device includes:
  • the first obtaining module 72 is used to obtain the image to be recognized.
  • the above-mentioned image to be recognized may be an image that needs to be recognized.
  • a face image is taken as an example for description.
  • the second acquisition module 74 is used to acquire a pre-built image recognition model.
  • the image recognition model is obtained by training the initial model through multiple training sets.
  • the initial model is a recognition model established based on the branch training algorithm.
  • the training set is extracted from the same data set, and different training sets are extracted from different data sets.
  • multiple training sets may be constructed through multiple different data sets in advance, and the initial model may be trained through the training sets, so as to obtain the final image recognition model.
  • the branch training method can be combined to build a deep neural network model to obtain the initial model. By separating different data sets for branch training, a trained image recognition model can be obtained, and the trained image recognition model can be deployed to application scenarios.
  • the recognition module 76 is configured to recognize the image to be recognized by using the image recognition model to obtain the recognition result.
  • the face recognition process can be performed by comparing the facial feature feat-ID (using Euclidean distance).
  • an initial model can be established based on a branch training algorithm, and the initial model can be trained through multiple training sets generated from different data sets to obtain an image recognition model.
  • the image recognition model is used to perform the image recognition Recognize, get the final recognition result.
  • the image recognition model that combines branch training with multiple data sets has a higher accuracy rate than the image recognition model trained based on a single data set, and achieves the technical effect of improving the recognition accuracy, thereby solving related technologies.
  • the technical problem of the low recognition accuracy of the image recognition method is based on a branch training algorithm, and the initial model can be trained through multiple training sets generated from different data sets to obtain an image recognition model.
  • the image recognition model is used to perform the image recognition Recognize, get the final recognition result.
  • the image recognition model that combines branch training with multiple data sets has a higher accuracy rate than the image recognition model trained based on a single data set, and achieves the technical effect of improving the recognition accuracy, thereby solving related technologies.
  • the device further includes: a third acquisition module for acquiring multiple data sets; a classification module for classifying each image in the multiple data sets to obtain each image The label is used to characterize the classification result of each image, and the labels of at least two images contained in multiple data sets are the same; the first extraction module is used to extract sample images from each data set after classification to obtain Multiple training sets.
  • the device further includes: a second extraction module, configured to extract preset features of each image in each data set after classification; an alignment module, configured based on each image Preset features to perform alignment operations on each image; the third extraction module is used to extract sample images from each data set after the operation to obtain multiple training sets.
  • a second extraction module configured to extract preset features of each image in each data set after classification
  • an alignment module configured based on each image Preset features to perform alignment operations on each image
  • the third extraction module is used to extract sample images from each data set after the operation to obtain multiple training sets.
  • the preset feature includes at least one of the following: eyes, eyebrows, nose tip, and mouth corners.
  • the third extraction module includes: an extraction unit for randomly extracting sample images from each data set after the operation; a first acquisition unit for acquiring storage paths and labels of the sample images , Get multiple training sets.
  • the third acquisition module includes: a second acquisition unit, configured to acquire a video image and a preset data set collected by the collection device; a detection unit, configured to compare the video image and preset data Collect multiple data sets for detection.
  • the device further includes: a building module for building an initial model based on a branch training algorithm, where the initial model at least includes: multiple loss functions, multiple loss functions, and multiple training sets There is a one-to-one correspondence; the training module is used to input multiple training sets into the initial model in parallel to train the initial model; the judgment module is used to judge whether the trained model meets the preset conditions; the determination module is used to if The model obtained by training satisfies the preset conditions, and the model obtained by training is determined to be an image recognition model.
  • the loss function is a square loss function.
  • the training module includes: an input unit for inputting multiple training sets into the initial model in parallel to obtain function values of multiple loss functions; The function value of and the chain derivation algorithm to obtain the gradient value of each parameter in the initial model; the update unit is used to update the gradient value of each parameter according to the stochastic gradient descent algorithm to obtain the trained model.
  • the judgment module includes: a third acquisition unit, configured to acquire a verification set; and a verification unit, configured to verify the model obtained by training using the verification set to obtain the accuracy of the model obtained by training;
  • the judging unit is used to judge whether the accuracy of the trained model is the same as the historical accuracy, where the historical accuracy is the accuracy of the trained model in the last verification process;
  • the determining unit is used to determine if the accuracy of the trained model is the same as If the historical accuracy is the same, it is determined that the trained model meets the preset conditions.
  • accuracy is used to characterize the ratio of the sum of the verification results of all verification samples in the verification set to the total number of all verification samples.
  • the training module is further configured to determine that the accuracy of the trained model is the historical accuracy if the accuracy of the trained model is different from the historical accuracy, and continue to train the initial model.
  • the third acquiring unit is configured to acquire images other than the sample images in the multiple data sets, and randomly extract image verification pairs from the other images to obtain a verification set.
  • the image verification pair includes: a positive sample pair and a negative sample pair, the positive sample pair includes two images with the same label, and the negative sample pair includes two images with different labels.
  • an embodiment of a storage medium includes a stored program, wherein the device where the storage medium is located is controlled to execute the image recognition method in Embodiment 1 when the program is running.
  • an embodiment of a processor is provided, and the processor is used to run a program, where the image recognition method in the foregoing embodiment 1 is executed when the program is running.
  • the disclosed technical content can be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the units may be a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual couplings or direct couplings or communication connections may be indirect couplings or communication connections through some interfaces, units or modules, and may be in electrical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • Part of the technical solution of this application or all or part of the technical solution can be embodied in the form of a software product.
  • the computer software product is stored in a storage medium and includes several instructions to enable a computer device (which can be a personal computer, A server or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other media that can store program code .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Image Analysis (AREA)

Abstract

La présente invention concerne un procédé et un dispositif de reconnaissance d'image, un support de stockage et un processeur. Le procédé consiste à : acquérir une image à reconnaître ; acquérir un modèle de reconnaissance d'image préétabli, le modèle de reconnaissance d'image étant obtenu par apprentissage d'un modèle initial au moyen d'une pluralité d'ensembles d'apprentissage, le modèle initial étant un modèle de reconnaissance établi sur la base d'un algorithme d'apprentissage de branche, le même ensemble d'apprentissage étant extrait du même ensemble de données et différents ensembles d'apprentissage étant extraits de différents ensembles de données ; et reconnaître une image à reconnaître au moyen d'un modèle de reconnaissance d'image afin d'obtenir un résultat de reconnaissance.
PCT/CN2019/127817 2019-01-31 2019-12-24 Procédé et dispositif de reconnaissance d'image, support de stockage et processeur WO2020155939A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910101257.9A CN109766872B (zh) 2019-01-31 2019-01-31 图像识别方法和装置
CN201910101257.9 2019-01-31

Publications (1)

Publication Number Publication Date
WO2020155939A1 true WO2020155939A1 (fr) 2020-08-06

Family

ID=66455816

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/127817 WO2020155939A1 (fr) 2019-01-31 2019-12-24 Procédé et dispositif de reconnaissance d'image, support de stockage et processeur

Country Status (2)

Country Link
CN (1) CN109766872B (fr)
WO (1) WO2020155939A1 (fr)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112070060A (zh) * 2020-09-21 2020-12-11 北京金山云网络技术有限公司 识别年龄的方法、年龄识别模型的训练方法和装置
CN112149741A (zh) * 2020-09-25 2020-12-29 北京百度网讯科技有限公司 图像识别模型的训练方法、装置、电子设备及存储介质
CN112395439A (zh) * 2020-11-17 2021-02-23 林铭 一种图像数据库实现方法及其系统和网络通信设备
CN112434714A (zh) * 2020-12-03 2021-03-02 北京小米松果电子有限公司 多媒体识别的方法、装置、存储介质及电子设备
CN112529008A (zh) * 2020-11-03 2021-03-19 浙江大华技术股份有限公司 图像识别和图像特征处理方法、电子设备及存储介质
CN112733958A (zh) * 2021-01-22 2021-04-30 北京农业信息技术研究中心 一种温室臭氧浓度控制方法及系统
CN112766162A (zh) * 2021-01-20 2021-05-07 北京市商汤科技开发有限公司 活体检测方法、装置、电子设备及计算机可读存储介质
CN112766052A (zh) * 2020-12-29 2021-05-07 有米科技股份有限公司 基于ctc的图像文字识别方法及装置
CN112818865A (zh) * 2021-02-02 2021-05-18 北京嘀嘀无限科技发展有限公司 车载领域图像识别方法、识别模型建立方法、装置、电子设备和可读存储介质
CN113657406A (zh) * 2021-07-13 2021-11-16 北京旷视科技有限公司 模型训练和特征提取方法、装置、电子设备及存储介质
CN113743176A (zh) * 2021-01-29 2021-12-03 北京沃东天骏信息技术有限公司 一种图像识别方法、设备和计算机可读存储介质
CN113743499A (zh) * 2021-09-02 2021-12-03 广东工业大学 一种基于对比学习的视角无关特征解离方法及系统
CN114264361A (zh) * 2021-12-07 2022-04-01 深圳市博悠半导体科技有限公司 结合雷达和摄像头的物体识别方法、装置及智能电子秤
CN114792426A (zh) * 2021-10-25 2022-07-26 北京中电兴发科技有限公司 一种行人属性识别中的图像数据均衡方法
CN116612358A (zh) * 2023-07-20 2023-08-18 腾讯科技(深圳)有限公司 一种数据处理的方法、相关装置、设备以及存储介质
CN117237705A (zh) * 2023-08-29 2023-12-15 广州粤建三和软件股份有限公司 一种关于自建房屋的安全隐患管理系统

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109766872B (zh) * 2019-01-31 2021-07-09 广州视源电子科技股份有限公司 图像识别方法和装置
CN110569911B (zh) * 2019-09-11 2022-06-07 深圳绿米联创科技有限公司 图像识别方法、装置、系统、电子设备及存储介质
CN110674720A (zh) * 2019-09-18 2020-01-10 深圳市网心科技有限公司 图片识别方法、装置、电子设备及存储介质
CN110784465B (zh) * 2019-10-25 2023-04-07 新华三信息安全技术有限公司 一种数据流检测方法、装置及电子设备
CN111141412A (zh) * 2019-12-25 2020-05-12 深圳供电局有限公司 电缆温度和防盗的双监测方法、系统和可读存储介质
CN111814810A (zh) * 2020-08-11 2020-10-23 Oppo广东移动通信有限公司 图像识别方法、装置、电子设备及存储介质
CN112116021A (zh) * 2020-09-27 2020-12-22 广州华多网络科技有限公司 一种宝石相似性度量数据处理方法及相关设备
CN113052561A (zh) * 2021-04-01 2021-06-29 苏州惟信易量智能科技有限公司 一种基于可穿戴设备的流程控制系统及方法
CN114782757A (zh) * 2022-06-21 2022-07-22 北京远舢智能科技有限公司 香烟缺陷检测模型训练方法、装置、电子设备及存储介质
CN115019218B (zh) * 2022-08-08 2022-11-15 阿里巴巴(中国)有限公司 图像处理方法和处理器

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6456991B1 (en) * 1999-09-01 2002-09-24 Hrl Laboratories, Llc Classification method and apparatus based on boosting and pruning of multiple classifiers
CN104715227A (zh) * 2013-12-13 2015-06-17 北京三星通信技术研究有限公司 人脸关键点的定位方法和装置
CN105975959A (zh) * 2016-06-14 2016-09-28 广州视源电子科技股份有限公司 基于神经网络的人脸特征提取建模、人脸识别方法及装置
CN106503669A (zh) * 2016-11-02 2017-03-15 重庆中科云丛科技有限公司 一种基于多任务深度学习网络的训练、识别方法及系统
CN106778684A (zh) * 2017-01-12 2017-05-31 易视腾科技股份有限公司 深度神经网络训练方法及人脸识别方法
CN109766872A (zh) * 2019-01-31 2019-05-17 广州视源电子科技股份有限公司 图像识别方法和装置

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7813538B2 (en) * 2007-04-17 2010-10-12 University Of Washington Shadowing pipe mosaicing algorithms with application to esophageal endoscopy
US8442330B2 (en) * 2009-03-31 2013-05-14 Nbcuniversal Media, Llc System and method for automatic landmark labeling with minimal supervision
CN105404877A (zh) * 2015-12-08 2016-03-16 商汤集团有限公司 基于深度学习和多任务学习的人脸属性预测方法及装置
WO2017177371A1 (fr) * 2016-04-12 2017-10-19 Xiaogang Wang Procédé et système de réidentification d'objet
CN107025443A (zh) * 2017-04-06 2017-08-08 江南大学 基于深度卷积神经网络的堆场烟雾监测及在线模型更新方法
CN107247947B (zh) * 2017-07-07 2021-02-09 智慧眼科技股份有限公司 人脸属性识别方法及装置
CN107392164A (zh) * 2017-07-28 2017-11-24 深圳市唯特视科技有限公司 一种基于面部动作单元强度估计的表情分析方法
CN107633242A (zh) * 2017-10-23 2018-01-26 广州视源电子科技股份有限公司 网络模型的训练方法、装置、设备和存储介质
CN107844784A (zh) * 2017-12-08 2018-03-27 广东美的智能机器人有限公司 人脸识别方法、装置、计算机设备和可读存储介质
CN108509860A (zh) * 2018-03-09 2018-09-07 西安电子科技大学 基于卷积神经网络的可可西里藏羚羊检测方法
CN108921092B (zh) * 2018-07-02 2021-12-17 浙江工业大学 一种基于卷积神经网络模型二次集成的黑色素瘤分类方法

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6456991B1 (en) * 1999-09-01 2002-09-24 Hrl Laboratories, Llc Classification method and apparatus based on boosting and pruning of multiple classifiers
CN104715227A (zh) * 2013-12-13 2015-06-17 北京三星通信技术研究有限公司 人脸关键点的定位方法和装置
CN105975959A (zh) * 2016-06-14 2016-09-28 广州视源电子科技股份有限公司 基于神经网络的人脸特征提取建模、人脸识别方法及装置
CN106503669A (zh) * 2016-11-02 2017-03-15 重庆中科云丛科技有限公司 一种基于多任务深度学习网络的训练、识别方法及系统
CN106778684A (zh) * 2017-01-12 2017-05-31 易视腾科技股份有限公司 深度神经网络训练方法及人脸识别方法
CN109766872A (zh) * 2019-01-31 2019-05-17 广州视源电子科技股份有限公司 图像识别方法和装置

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112070060A (zh) * 2020-09-21 2020-12-11 北京金山云网络技术有限公司 识别年龄的方法、年龄识别模型的训练方法和装置
CN112149741A (zh) * 2020-09-25 2020-12-29 北京百度网讯科技有限公司 图像识别模型的训练方法、装置、电子设备及存储介质
CN112149741B (zh) * 2020-09-25 2024-04-16 北京百度网讯科技有限公司 图像识别模型的训练方法、装置、电子设备及存储介质
CN112529008A (zh) * 2020-11-03 2021-03-19 浙江大华技术股份有限公司 图像识别和图像特征处理方法、电子设备及存储介质
CN112395439A (zh) * 2020-11-17 2021-02-23 林铭 一种图像数据库实现方法及其系统和网络通信设备
CN112395439B (zh) * 2020-11-17 2024-03-01 林铭 一种图像数据库实现方法及其系统和网络通信设备
CN112434714A (zh) * 2020-12-03 2021-03-02 北京小米松果电子有限公司 多媒体识别的方法、装置、存储介质及电子设备
CN112766052A (zh) * 2020-12-29 2021-05-07 有米科技股份有限公司 基于ctc的图像文字识别方法及装置
CN112766162B (zh) * 2021-01-20 2023-12-22 北京市商汤科技开发有限公司 活体检测方法、装置、电子设备及计算机可读存储介质
CN112766162A (zh) * 2021-01-20 2021-05-07 北京市商汤科技开发有限公司 活体检测方法、装置、电子设备及计算机可读存储介质
CN112733958B (zh) * 2021-01-22 2024-06-07 北京农业信息技术研究中心 一种温室臭氧浓度控制方法及系统
CN112733958A (zh) * 2021-01-22 2021-04-30 北京农业信息技术研究中心 一种温室臭氧浓度控制方法及系统
CN113743176A (zh) * 2021-01-29 2021-12-03 北京沃东天骏信息技术有限公司 一种图像识别方法、设备和计算机可读存储介质
CN112818865A (zh) * 2021-02-02 2021-05-18 北京嘀嘀无限科技发展有限公司 车载领域图像识别方法、识别模型建立方法、装置、电子设备和可读存储介质
CN113657406B (zh) * 2021-07-13 2024-04-23 北京旷视科技有限公司 模型训练和特征提取方法、装置、电子设备及存储介质
CN113657406A (zh) * 2021-07-13 2021-11-16 北京旷视科技有限公司 模型训练和特征提取方法、装置、电子设备及存储介质
CN113743499B (zh) * 2021-09-02 2023-09-05 广东工业大学 一种基于对比学习的视角无关特征解离方法及系统
CN113743499A (zh) * 2021-09-02 2021-12-03 广东工业大学 一种基于对比学习的视角无关特征解离方法及系统
CN114792426A (zh) * 2021-10-25 2022-07-26 北京中电兴发科技有限公司 一种行人属性识别中的图像数据均衡方法
CN114792426B (zh) * 2021-10-25 2024-05-28 北京中电兴发科技有限公司 一种行人属性识别中的图像数据均衡方法
CN114264361A (zh) * 2021-12-07 2022-04-01 深圳市博悠半导体科技有限公司 结合雷达和摄像头的物体识别方法、装置及智能电子秤
CN116612358B (zh) * 2023-07-20 2023-10-03 腾讯科技(深圳)有限公司 一种数据处理的方法、相关装置、设备以及存储介质
CN116612358A (zh) * 2023-07-20 2023-08-18 腾讯科技(深圳)有限公司 一种数据处理的方法、相关装置、设备以及存储介质
CN117237705A (zh) * 2023-08-29 2023-12-15 广州粤建三和软件股份有限公司 一种关于自建房屋的安全隐患管理系统

Also Published As

Publication number Publication date
CN109766872B (zh) 2021-07-09
CN109766872A (zh) 2019-05-17

Similar Documents

Publication Publication Date Title
WO2020155939A1 (fr) Procédé et dispositif de reconnaissance d'image, support de stockage et processeur
WO2019120115A1 (fr) Procédé et appareil de reconnaissance faciale et dispositif informatique
CN109284729B (zh) 基于视频获取人脸识别模型训练数据的方法、装置和介质
CN105844206A (zh) 身份认证方法及设备
CN106529414A (zh) 一种通过图像比对实现结果认证的方法
WO2020134527A1 (fr) Procédé et appareil de reconnaissance faciale
CN109271915B (zh) 防伪检测方法和装置、电子设备、存储介质
CN111861240A (zh) 可疑用户识别方法、装置、设备及可读存储介质
CN106803289A (zh) 一种智能移动防伪签到方法与系统
WO2016084072A1 (fr) Système anti-usurpation et procédés utilisés conjointement
CN110688901A (zh) 一种人脸识别方法及装置
CN109063649B (zh) 基于孪生行人对齐残差网络的行人重识别方法
CN108009482A (zh) 一种提高人脸识别效率方法
JP6969663B2 (ja) ユーザの撮影装置を識別する装置及び方法
CN110502694A (zh) 基于大数据分析的律师推荐方法及相关设备
CN103415859A (zh) 用于人脸注册的方法
CN109919060A (zh) 一种基于特征匹配的身份证内容识别系统及方法
CN109902223A (zh) 一种基于多模态信息特征的不良内容过滤方法
CN108108711A (zh) 人脸布控方法、电子设备及存储介质
CN108446687A (zh) 一种基于移动端和后台互联的自适应人脸视觉认证方法
CN110503099A (zh) 基于深度学习的信息识别方法及相关设备
CN113591603A (zh) 证件的验证方法、装置、电子设备及存储介质
CN111079687A (zh) 证件伪装识别方法、装置、设备及存储介质
CN109558798B (zh) 一种基于卷积特征图匹配的人脸识别方法及系统
CN109359689A (zh) 一种数据识别方法及装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19914046

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19914046

Country of ref document: EP

Kind code of ref document: A1