CN112668529A - Dish sample image enhancement identification method - Google Patents

Dish sample image enhancement identification method Download PDF

Info

Publication number
CN112668529A
CN112668529A CN202011643563.4A CN202011643563A CN112668529A CN 112668529 A CN112668529 A CN 112668529A CN 202011643563 A CN202011643563 A CN 202011643563A CN 112668529 A CN112668529 A CN 112668529A
Authority
CN
China
Prior art keywords
dish
image
training
training set
generator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011643563.4A
Other languages
Chinese (zh)
Inventor
瞿晨非
井焜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Synthesis Electronic Technology Co Ltd
Original Assignee
Synthesis Electronic Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Synthesis Electronic Technology Co Ltd filed Critical Synthesis Electronic Technology Co Ltd
Priority to CN202011643563.4A priority Critical patent/CN112668529A/en
Publication of CN112668529A publication Critical patent/CN112668529A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention provides a dish sample image enhancement and identification method, which comprises the steps of obtaining a complete image of a single tableware-contained dish, and constructing an original dish training set; constructing an antagonistic network comprising a generator G and a discriminator D, and generating a standardized dish training set by using an original dish training set; constructing a dish identification network, utilizing a standardized dish training set for training to obtain a dish identification model, and ending the training process; collecting dishes to be identified on the same day, constructing a dish identification base, generating dish characteristics, standardizing base images to train a dish identification model, and obtaining and storing dish characteristic vectors; respectively obtaining the standardized representation and the characteristic representation of the dish by using steps 4, 5 and 6 when the dish is to be identified on the day; and comparing the dish features with all the features, and considering the current dish to be detected as the corresponding dish in the bottom warehouse by the two groups of the dish features closest to each other. The invention improves the recognition possibility of the recognition model, reduces the difference among the same type of samples and can reduce the model training cost.

Description

Dish sample image enhancement identification method
Technical Field
The invention relates to a dish sample image enhancement identification method, in particular to a dish identification method based on image data enhancement.
Background
At present, a dish identification mode based on deep learning mainly aims at identifying dish images in tableware, dish areas are preferentially extracted in the identification process, tableware information is introduced by a method of extracting a tableware outer edge matrix, and the accuracy of dish identification is greatly influenced by the condition of the space occupied by dishes in the tableware due to insufficient sample space coverage of training samples. In addition, the background modeling method is used for removing redundant tableware information, so that the information redundancy is large, the generated result is not standard, and the deep learning training and recognition process is not facilitated. In the existing deep learning dish identification application, dish patterns in training data are complex and are unevenly distributed, so that the space of a training sample is uneven, and the accuracy rate is low after model training is finished. Therefore, a method for directly enhancing the dish image area is designed, and the accuracy during training and use is improved.
Disclosure of Invention
The invention aims to provide an image recognition method for enhancing a dish image, which improves the recognition possibility of a recognition model and reduces the difference among samples of the same type, thereby reducing the model training cost.
In order to achieve the purpose, the invention is realized by the following technical scheme:
step 1: obtaining a complete image of the dishes contained in a single tableware through an image acquisition unit, and constructing an original dish training set according to the complete image;
step 2: constructing an antagonistic network comprising a generator G and a discriminator D, wherein the antagonistic network is formed by using a convolutional neural network, and generating a standardized dish training set by using an original dish training set;
and step 3: constructing a dish identification network, training by using resnet-50, generating a standardized dish training set by using the step 2, training to obtain a dish identification model, and finishing the training process;
and 4, step 4: in the application process, firstly, dishes to be identified on the day are collected, and the collection process of a single picture is the same as that in the step 1;
and 5: constructing a dish identification base library, and generating standardized dishes by using the countermeasure network used in the step 2 for the dish images obtained in the step 4;
step 6: generating dish features, and acquiring and storing dish feature vectors by using the normalized bottom library image generated in the step 5 through the dish recognition model trained in the step 3;
and 7: respectively obtaining the standardized representation and the characteristic representation of the dish by using steps 4, 5 and 6 when the dish is to be identified on the day;
step 8, calculating and comparing the dish features generated in the step 7 with all the features in the step 6 by adopting Euclidean distances, and considering the dish to be detected as the corresponding dish in the bottom library if the two groups of features are closer;
preferably, the step 2 of generating the normalized dish training set by the training set comprises the following steps:
1) the method comprises the following steps of training a generator and a discriminator by taking an obtained original dish training set as a real sample, so that a dish generation sample acceptable to human eyes can be generated finally;
2) in the training process, a generator G is used for generating an image of a specified category of dishes, and a discriminator G is used for judging whether the image of the category of dishes is the image of the specified category of dishes;
3) the training process is alternately finished in pairs, the accuracy of the generated image of the generator G is optimized through the judgment result of the discriminator G, and the generator G is used for generating the image;
4) judging ability of result optimization discriminator G on image truth and final optimization function
Figure RE-GDA0002972993050000021
Figure RE-GDA0002972993050000022
Wherein x represents an input real picture, z represents noise information, and c represents an artificial control condition; d (x, c) is a judgment result of the original real picture under certain artificial parameters through a discriminator D, (G (Z)), c) represents picture information generated by a generator G under the condition of artificial control of noise Z, and D (G (Z)), c) represents a judgment result of authenticity through the discriminator D;
5) the control condition c specifies the area, shape and proportion of the dish in the tableware, the generator is modified by the control parameter to generate a dish result, the dish image generated by the generator G has the same expression form on the non-dish characteristics, the space position, the proportion and the shape of the generated dish result in the tableware are nearly the same, and the internal structure of the dish is not changed;
6) after the training is finished, the obtained original dish training set is input into a generator G by controlling artificial conditions c, and each dish image information in the sample generates a corresponding standard dish to form a standardized dish training set.
The invention has the advantages that:
1. and (3) using a countermeasure generation network, enhancing the original image by using the characteristics of the dishes, increasing the characteristics of the image and improving the recognition possibility of the recognition model.
2. The original sample set is uniformly enhanced by using the countermeasure generation network, model parameters are controlled, the dish images integrally have a uniform pattern, and the difference among samples of the same type is reduced, so that the model training cost can be reduced.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention.
FIG. 1 is a schematic diagram of the training process of the present invention.
FIG. 2 is a schematic diagram of the process of constructing a dish library according to the present invention.
FIG. 3 is a flow chart illustrating an identification process according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The scheme mainly comprises the following steps
Step 1: and obtaining a complete image of the dishes contained in the single tableware through an image acquisition unit, and constructing an original dish training set according to the complete image.
Step 2: and constructing an antagonistic network and generating a standardized dish training set by using the original dish training set.
Furthermore, the confrontation generation network comprises a generator G and a discriminator D which are both formed by a convolutional neural network;
further, the original dish training set obtained in the step 1 is used as a real sample, and a generator and a discriminator are trained, so that a dish generation sample acceptable to human eyes can be generated finally;
furthermore, in the training process, a generator G is used for generating an image of the dish of the specified category, and a discriminator G is used for judging whether the image of the dish of the specified category is the image of the dish of the specified category.
Further, the training process is alternately finished in pairs, the accuracy of the generated image is optimized through the judgment result of the discriminator G, and the judgment capability of the discriminator G on the image truth is optimized through the generation result of the generator G.
Further, a final optimization function
Figure RE-GDA0002972993050000041
Figure RE-GDA0002972993050000042
Wherein x represents an input real picture, z represents noise information, and c represents an artificial control condition; d (x, c) is the result of judging the original real picture by the discriminator D under some artificial parameters, (G (Z)), c) represents the picture information generated by the generator G under the artificial control condition of the noise Z, D (G (Z)), c) represents the result of judging the authenticity of (G (Z), c) by the discriminator D.
Furthermore, the artificial control condition c specifies the area, shape and proportion of the dish in the tableware, and the dish result is generated by controlling the parameter modification generator, so that the dish image generated by the generator G has the same expression form on the non-dish characteristics, the space position, the proportion and the shape of the generated dish result in the tableware are nearly the same, and the internal structure of the dish is not changed.
Further, after the training is finished, inputting the original dish training set obtained in the step 1 into a generator G by controlling artificial conditions c, so that each dish image information in the sample generates a corresponding standard dish to form a standardized dish training set.
And step 3: and (3) constructing a dish identification network, and generating a standardized dish training set for training by utilizing the step (2) to obtain a dish identification model. The training process is ended.
Further, the dish identification network may be trained using resnet-50.
And 4, step 4: in the application process, dishes to be identified on the day are collected firstly, and the process of collecting single picture is the same as the step 1
And 5: and (4) constructing a dish identification base library, and generating the standardized dishes by using the countermeasure network used in the step 2 for the dish images obtained in the step 4.
Step 6: and (4) generating dish features, and acquiring and storing dish feature vectors by using the normalized bottom library image generated in the step (5) through the dish identification model trained in the step (3).
And 7: when the dish is to be identified, the normalized representation and the characteristic representation of the dish are respectively obtained by using the steps 4, 5 and 6.
And 8, comparing the dish features generated in the step 7 with all the features in the step 6, and determining that the dish to be detected is the corresponding dish in the bottom library if the two groups of features are closer.
Further, the Euclidean distance can be adopted for calculation during feature comparison.

Claims (2)

1. A dish sample image enhancement identification method is characterized by comprising the following steps:
step 1: obtaining a complete image of the dishes contained in a single tableware through an image acquisition unit, and constructing an original dish training set according to the complete image;
step 2: constructing an antagonistic network comprising a generator G and a discriminator D, wherein the antagonistic network is formed by using a convolutional neural network, and generating a standardized dish training set by using an original dish training set;
and step 3: constructing a dish identification network, training by using resnet-50, generating a standardized dish training set by using the step 2, training to obtain a dish identification model, and finishing the training process;
and 4, step 4: collecting dishes to be identified on the same day, wherein the process of collecting single pictures is the same as that in the step 1;
and 5: establishing a dish identification base, generating a normalized dish by using the picture through an confrontation network, and generating a normalized base image;
step 6: generating dish features, and acquiring and storing dish feature vectors by utilizing the dish identification model trained in the step 3 on the normalized bottom library image;
and 7: respectively obtaining the standardized representation and the characteristic representation of the dish by using steps 4, 5 and 6 when the dish is to be identified on the day;
and 8: calculating and comparing the dish features generated in the step 7 with all the features in the step 6 by adopting Euclidean distances, and considering the dish to be detected as the corresponding dish in the bottom library if the two groups of features are closer;
2. the method for enhancing and identifying the image of the dish sample according to claim 1, wherein the step 2 of generating the normalized dish training set by the training set comprises the following steps:
1) the method comprises the following steps of training a generator and a discriminator by taking an obtained original dish training set as a real sample, so that a dish generation sample acceptable to human eyes can be generated finally;
2) in the training process, a generator G is used for generating an image of a specified category of dishes, and a discriminator G is used for judging whether the image of the category of dishes is the image of the specified category of dishes;
3) the training process is alternately finished in pairs, the accuracy of the generated image of the generator G is optimized through the judgment result of the discriminator G, and the generator G is used for generating the image;
4) judging ability of result optimization discriminator G on image truth and final optimization function
Figure FDA0002874475060000021
Figure FDA0002874475060000022
Wherein x represents an input real picture, z represents noise information, and c represents an artificial control condition; d (x, c) is a judgment result of the original real picture under certain artificial parameters through a discriminator D, (G (Z)), c) represents picture information generated by a generator G under the condition of artificial control of noise Z, and D (G (Z)), c) represents a judgment result of authenticity through the discriminator D;
5) the control condition c specifies the area, shape and proportion of the dish in the tableware, the generator is modified by the control parameter to generate a dish result, the dish image generated by the generator G has the same expression form on the non-dish characteristics, the space position, the proportion and the shape of the generated dish result in the tableware are nearly the same, and the internal structure of the dish is not changed;
6) after the training is finished, the obtained original dish training set is input into a generator G by controlling artificial conditions c, and each dish image information in the sample generates a corresponding standard dish to form a standardized dish training set.
CN202011643563.4A 2020-12-31 2020-12-31 Dish sample image enhancement identification method Pending CN112668529A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011643563.4A CN112668529A (en) 2020-12-31 2020-12-31 Dish sample image enhancement identification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011643563.4A CN112668529A (en) 2020-12-31 2020-12-31 Dish sample image enhancement identification method

Publications (1)

Publication Number Publication Date
CN112668529A true CN112668529A (en) 2021-04-16

Family

ID=75412340

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011643563.4A Pending CN112668529A (en) 2020-12-31 2020-12-31 Dish sample image enhancement identification method

Country Status (1)

Country Link
CN (1) CN112668529A (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108985464A (en) * 2018-07-17 2018-12-11 重庆科技学院 The continuous feature generation method of face for generating confrontation network is maximized based on information
CN109508669A (en) * 2018-11-09 2019-03-22 厦门大学 A kind of facial expression recognizing method based on production confrontation network
CN110136063A (en) * 2019-05-13 2019-08-16 南京信息工程大学 A kind of single image super resolution ratio reconstruction method generating confrontation network based on condition
CN110610174A (en) * 2019-07-16 2019-12-24 北京工业大学 Bank card number identification method under complex conditions
CN111161295A (en) * 2019-12-30 2020-05-15 神思电子技术股份有限公司 Background stripping method for dish image
CN111275115A (en) * 2020-01-20 2020-06-12 星汉智能科技股份有限公司 Method for generating counterattack sample based on generation counternetwork
CN111667491A (en) * 2020-05-09 2020-09-15 中山大学 Breast mass image generation method with marginal landmark annotation information based on depth countermeasure network
CN111680603A (en) * 2020-05-28 2020-09-18 浙江师范大学 Dish detection and identification method
CN111754596A (en) * 2020-06-19 2020-10-09 北京灵汐科技有限公司 Editing model generation method, editing model generation device, editing method, editing device, editing equipment and editing medium
CN111767861A (en) * 2020-06-30 2020-10-13 苏州兴钊防务研究院有限公司 SAR image target identification method based on multi-discriminator generation countermeasure network
CN111783545A (en) * 2020-06-02 2020-10-16 山西潞安环保能源开发股份有限公司五阳煤矿 Coal mine unmanned aerial vehicle image enhancement method based on generation of countermeasure network
CN112115966A (en) * 2020-08-05 2020-12-22 西安交通大学 Dish and attribute information identification system and method based on fine-grained identification

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108985464A (en) * 2018-07-17 2018-12-11 重庆科技学院 The continuous feature generation method of face for generating confrontation network is maximized based on information
CN109508669A (en) * 2018-11-09 2019-03-22 厦门大学 A kind of facial expression recognizing method based on production confrontation network
CN110136063A (en) * 2019-05-13 2019-08-16 南京信息工程大学 A kind of single image super resolution ratio reconstruction method generating confrontation network based on condition
CN110610174A (en) * 2019-07-16 2019-12-24 北京工业大学 Bank card number identification method under complex conditions
CN111161295A (en) * 2019-12-30 2020-05-15 神思电子技术股份有限公司 Background stripping method for dish image
CN111275115A (en) * 2020-01-20 2020-06-12 星汉智能科技股份有限公司 Method for generating counterattack sample based on generation counternetwork
CN111667491A (en) * 2020-05-09 2020-09-15 中山大学 Breast mass image generation method with marginal landmark annotation information based on depth countermeasure network
CN111680603A (en) * 2020-05-28 2020-09-18 浙江师范大学 Dish detection and identification method
CN111783545A (en) * 2020-06-02 2020-10-16 山西潞安环保能源开发股份有限公司五阳煤矿 Coal mine unmanned aerial vehicle image enhancement method based on generation of countermeasure network
CN111754596A (en) * 2020-06-19 2020-10-09 北京灵汐科技有限公司 Editing model generation method, editing model generation device, editing method, editing device, editing equipment and editing medium
CN111767861A (en) * 2020-06-30 2020-10-13 苏州兴钊防务研究院有限公司 SAR image target identification method based on multi-discriminator generation countermeasure network
CN112115966A (en) * 2020-08-05 2020-12-22 西安交通大学 Dish and attribute information identification system and method based on fine-grained identification

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
姚哲维等: "改进型循环生成对抗网络的血管内超声图像增强", 《计算机科学》 *
姚哲维等: "改进型循环生成对抗网络的血管内超声图像增强", 《计算机科学》, no. 05, 15 May 2019 (2019-05-15) *
张澎等: "基于深度卷积生成对抗网络的植物图像识别方法的研究", 《西南民族大学学报(自然科学版)》 *
张澎等: "基于深度卷积生成对抗网络的植物图像识别方法的研究", 《西南民族大学学报(自然科学版)》, no. 02, 25 March 2019 (2019-03-25) *
徐一峰: "生成对抗网络理论模型和应用综述", 《金华职业技术学院学报》 *
徐一峰: "生成对抗网络理论模型和应用综述", 《金华职业技术学院学报》, no. 03, 1 May 2017 (2017-05-01) *

Similar Documents

Publication Publication Date Title
CN111199550B (en) Training method, segmentation method, device and storage medium of image segmentation network
CN110070935B (en) Medical image synthesis method, classification method and device based on antagonistic neural network
CN111339990B (en) Face recognition system and method based on dynamic update of face features
CN112734775B (en) Image labeling, image semantic segmentation and model training methods and devices
US20030021448A1 (en) Method for detecting eye and mouth positions in a digital image
CN108764358A (en) A kind of Terahertz image-recognizing method, device, equipment and readable storage medium storing program for executing
CN113762138B (en) Identification method, device, computer equipment and storage medium for fake face pictures
CN106778489A (en) The method for building up and equipment of face 3D characteristic identity information banks
CN110287813A (en) Personal identification method and system
CN108564120A (en) Feature Points Extraction based on deep neural network
CN112199530B (en) Multi-dimensional face library picture automatic updating method, system, equipment and medium
CN109285147B (en) Image processing method and device for breast molybdenum target calcification detection and server
CN112257328A (en) Furniture layout method and electronic equipment
CN112001215A (en) Method for identifying identity of text-independent speaker based on three-dimensional lip movement
CN105956570A (en) Lip characteristic and deep learning based smiling face recognition method
CN114332938A (en) Pet nose print recognition management method and device, intelligent equipment and storage medium
CN112560710A (en) Method for constructing finger vein recognition system and finger vein recognition system
CN110633689B (en) Face recognition model based on semi-supervised attention network
CN113592893A (en) Image foreground segmentation method combining determined main body and refined edge
CN114331473A (en) Method and device for identifying telecommunication fraud event and computer-readable storage medium
CN110443790B (en) Cartilage identification method and system in medical image
CN112668529A (en) Dish sample image enhancement identification method
CN115311691B (en) Joint identification method based on wrist vein and wrist texture
CN206363347U (en) Based on Corner Detection and the medicine identifying system that matches
CN114821681A (en) Fingerprint augmentation method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination