CN112487909A - Fruit variety identification method based on parallel convolutional neural network

Fruit variety identification method based on parallel convolutional neural network

Info

Publication number
CN112487909A
Authority
CN
China
Prior art keywords
neural network
fruit
convolutional neural
parallel
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011327435.9A
Other languages
Chinese (zh)
Inventor
李锋
李超
黄炜嘉
张勇停
汪平
张慧慧
孙晗笑
叶童玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu University of Science and Technology
Original Assignee
Jiangsu University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu University of Science and Technology filed Critical Jiangsu University of Science and Technology
Priority to CN202011327435.9A
Publication of CN112487909A
Legal status: Pending (current)


Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 20/00 - Scenes; Scene-specific elements
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 - Pattern recognition
    • G06F 18/20 - Analysing
    • G06F 18/21 - Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F 18/214 - Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 - Pattern recognition
    • G06F 18/20 - Analysing
    • G06F 18/24 - Classification techniques
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/04 - Architecture, e.g. interconnection topology
    • G06N 3/045 - Combinations of networks
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/08 - Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a fruit variety identification method based on a parallel convolutional neural network, comprising the following steps. Step 1: convert each fruit image carrying a category label into a 3-channel picture of 128 × 128 pixels. Step 2: apply translation, rotation and mirror-flip operations to the result of step 1 to generate fruit image data sets at multiple scales. Step 3: feed the result of step 2 into a generative adversarial network model for data augmentation. Step 4: build a parallel convolutional neural network model and perform multi-scale feature extraction on the result of step 3. Step 5: using the features extracted in step 4, the parallel convolutional neural network model predicts the category of each fruit image, the prediction is compared with the category label, and the model is trained according to the comparison result. The method improves the accuracy of image-based fruit variety identification and is of practical significance for mechanized and intelligent applications in the fruit industry.

Description

Fruit variety identification method based on parallel convolutional neural network
Technical Field
The invention relates to image recognition, and in particular to a fruit variety identification method based on a parallel convolutional neural network.
Background
At present, the degree of mechanization in China's fruit industry is low, and most production links, especially fruit picking, still rely mainly on manual labor, which is time-consuming and labor-intensive. Fruit production as a whole covers picking, storage, transportation, processing, sales and other links, so developing agricultural robots for fruit production is an inevitable trend for improving production efficiency and saving labor costs. In the picking and sorting robots and the fruit quality and variety detection systems used in fruit production, normal operation depends on the image processing module correctly identifying the fruit; for example, a picking robot can provide motion parameters to its mechanical arm, and thus complete the picking operation, only after it has identified the fruit on the tree and obtained its exact position.
In recent years deep learning has developed rapidly; it performs well on a wide range of computer vision tasks and is gradually being applied in agriculture. A deep learning model trained on a large amount of data can automatically learn the feature information of different objects and capture the differences between categories. Through training, such a model converts raw data into more abstract, higher-level representations and then completes tasks such as image classification and detection. However, the accuracy of existing image-based deep learning methods for fruit variety identification is still low and cannot fully meet the requirements of practical applications.
Disclosure of Invention
Purpose of the invention: the invention aims to provide a fruit variety identification method based on a parallel convolutional neural network that addresses the scarcity of existing data sets and the low recognition rate of conventional parallel convolutional neural networks, and achieves fast and accurate identification of similar fruit varieties.
Technical scheme: the invention provides a fruit variety identification method based on a parallel convolutional neural network, comprising the following steps:
Step 1: convert each fruit image carrying a category label into a 3-channel picture of 128 × 128 pixels;
Step 2: apply translation, rotation and mirror-flip operations to the result of step 1 to generate fruit image data sets at multiple scales;
Step 3: feed the result of step 2 into a generative adversarial network model for data augmentation;
Step 4: build a parallel convolutional neural network model and perform multi-scale feature extraction on the result of step 3;
Step 5: using the features extracted in step 4, the parallel convolutional neural network model predicts the category of each fruit image, the prediction is compared with the category label, and the model is trained according to the comparison result so that the training accuracy is maximized and the loss is minimized.
Preferably, step 2 rotates each picture by 30°, 60° and 90°, translates it by 10%, 20% and 30%, and applies mirror flips at 30°, 60° and 90°.
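A minimal sketch of this kind of conventional augmentation, assuming Pillow is available; the function name and the interpretation of a "mirror flip at an angle" as mirroring followed by rotation are illustrative assumptions, not specifics from the patent:

    from PIL import Image, ImageOps

    def augment(img: Image.Image):
        """Return augmented copies of one fruit image (step 1 resize included)."""
        img = img.convert("RGB").resize((128, 128))       # 3-channel, 128 x 128
        w, h = img.size
        out = []
        for angle in (30, 60, 90):                        # rotations
            out.append(img.rotate(angle))
        for frac in (0.10, 0.20, 0.30):                   # translations by a fraction of the width
            out.append(img.rotate(0, translate=(int(w * frac), 0)))
        mirrored = ImageOps.mirror(img)                   # mirror flip ...
        for angle in (30, 60, 90):                        # ... then rotate the mirrored copy
            out.append(mirrored.rotate(angle))
        return out

Under these assumptions each original picture yields nine additional samples, which matches the idea of enlarging the data set before the GAN-based augmentation of step 3.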
The generative adversarial network of step 3 comprises a generator and a discriminator. Noise is fed into the generator, random samples are drawn from it, and a 5-layer network produces 3-channel data samples of 128 × 128. The discriminator compares the generated data samples with real samples and judges whether the generator's output is genuine; with the discriminator's parameters fixed, the generator is updated so that it produces pictures that are harder for the discriminator to tell apart from real ones.
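A minimal PyTorch sketch of such a generator/discriminator pair, assuming a DCGAN-style design; the latent dimension, channel widths and activation choices are illustrative, the only constraints taken from the text being a 5-layer generator that outputs 3-channel 128 × 128 samples:

    import torch.nn as nn

    class Generator(nn.Module):
        """Maps a noise vector to a 3-channel 128x128 image via 5 upsampling layers."""
        def __init__(self, z_dim: int = 100):
            super().__init__()
            self.fc = nn.Linear(z_dim, 512 * 4 * 4)
            def up(cin, cout):
                return nn.Sequential(
                    nn.ConvTranspose2d(cin, cout, 4, stride=2, padding=1),
                    nn.BatchNorm2d(cout), nn.ReLU(inplace=True))
            self.net = nn.Sequential(                     # 4 -> 8 -> 16 -> 32 -> 64 -> 128
                up(512, 256), up(256, 128), up(128, 64), up(64, 32),
                nn.ConvTranspose2d(32, 3, 4, stride=2, padding=1), nn.Tanh())

        def forward(self, z):
            return self.net(self.fc(z).view(-1, 512, 4, 4))

    class Discriminator(nn.Module):
        """Scores an image as real (close to 1) or generated (close to 0)."""
        def __init__(self):
            super().__init__()
            def down(cin, cout):
                return nn.Sequential(
                    nn.Conv2d(cin, cout, 4, stride=2, padding=1),
                    nn.LeakyReLU(0.2, inplace=True))
            self.net = nn.Sequential(                     # 128 -> 64 -> 32 -> 16 -> 8 -> 4
                down(3, 32), down(32, 64), down(64, 128), down(128, 256), down(256, 512),
                nn.Flatten(), nn.Linear(512 * 4 * 4, 1), nn.Sigmoid())

        def forward(self, x):
            return self.net(x)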
The parallel convolutional neural network used in step 4 comprises 8 convolutional layers, 6 max-pooling layers and, finally, a fully connected layer; the fully connected layer integrates the local information of the convolutional and max-pooling layers with the classification information.
Step 5 optimizes the convolutional neural network model with a combination of a maximum class-spacing loss function and the SoftmaxWithLoss loss function.
The fruit image to be classified is taken as the target image, the operations of steps 1 to 3 are performed, and the result is fed into the trained parallel convolutional neural network model for fruit identification.
Conventional data augmentation and data augmentation with the generative adversarial network are both performed, and the fruit images to be classified are then passed through the trained model.
Beneficial effects: compared with the prior art, the invention has the following notable advantages. A large number of high-quality data sets are generated by combining a generative adversarial network with conventional data augmentation. The parallel convolutional neural network extracts features of different scales simultaneously, so the feature expression is richer and the network captures more feature information. Combining a maximum class-spacing loss function with SoftmaxWithLoss increases the distance between similar varieties, thereby improving the identification accuracy between them. On the public Fruits-360 data set the accuracy of the method reaches 98.85%.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a schematic diagram of the generative adversarial network of the present invention;
FIG. 3 is a schematic diagram of a parallel convolutional neural network of the present invention.
Detailed Description
The technical scheme of the invention is further explained below with reference to the accompanying drawings. The invention provides a method for training a fruit variety classification model, comprising the following steps:
Step 1: conventional data augmentation is applied to the fruit images carrying category labels to form an enlarged fruit image data set.
Step 2: a generative adversarial network model is constructed and used to generate more high-quality data. The network comprises a discriminator and a generator, which train the adversarial model through a game between them. The discriminator adjusts its own parameters according to the data passed to it by the generator, so as to judge whether incoming data are genuine; with the discriminator's parameters fixed, the generator updates its own parameters so as to produce more data that the discriminator finds hard to judge. Finally the discriminator and the generator reach an equilibrium point and training ends.
When the generator is trained, the parameters of the discriminator are fixed and the generator updates its own parameters against the fixed discriminator, so that the generated samples are difficult for the discriminator to tell apart from real data.
When the discriminator is trained, the parameters of the generator are fixed and the data produced by the generator are used as negative samples.
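A minimal sketch of this alternating (game-style) training in PyTorch, reusing the Generator and Discriminator sketched above; the optimizer settings, batch handling and epoch count are illustrative assumptions:

    import torch
    import torch.nn as nn

    def train_gan(G, D, real_loader, epochs=50, z_dim=100, device="cpu"):
        bce = nn.BCELoss()
        opt_g = torch.optim.Adam(G.parameters(), lr=2e-4, betas=(0.5, 0.999))
        opt_d = torch.optim.Adam(D.parameters(), lr=2e-4, betas=(0.5, 0.999))
        for _ in range(epochs):
            for real, _labels in real_loader:             # loader assumed to yield (image, label)
                real = real.to(device)
                n = real.size(0)
                ones = torch.ones(n, 1, device=device)
                zeros = torch.zeros(n, 1, device=device)

                # Train the discriminator: generator fixed, generated images are negative samples.
                fake = G(torch.randn(n, z_dim, device=device)).detach()
                loss_d = bce(D(real), ones) + bce(D(fake), zeros)
                opt_d.zero_grad(); loss_d.backward(); opt_d.step()

                # Train the generator: the discriminator's parameters are not updated here;
                # the generator is pushed to make the discriminator label its samples as real.
                fake = G(torch.randn(n, z_dim, device=device))
                loss_g = bce(D(fake), ones)
                opt_g.zero_grad(); loss_g.backward(); opt_g.step()
        return G, D

Training would stop once the two networks approach the equilibrium described above, e.g. when the discriminator can no longer reliably separate generated samples from real ones.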
Step 3: the fruit images carrying category labels are used as training data and fed into the parallel convolutional neural network model for feature extraction. C1 and C2 each consist of 16 convolution kernels of size 3 × 3 with all-zero padding and stride 1, and their output is 128 × 128 × 16. S1 is a 2 × 2 max-pooling layer with stride 2 and output 64 × 64 × 16. C3 consists of 16 convolution kernels of size 3 × 3 with all-zero padding and stride 1, and its output is 64 × 64 × 16. C4 consists of 16 convolution kernels of size 3 × 3 with all-zero padding and stride 1, and its output is 64 × 64 × 16; S2 is a 2 × 2 max-pooling layer with stride 2 and output 32 × 32 × 16. The resulting feature maps are fed into two parallel channels a and b. Channel a comprises 32 convolution kernels of size 3 × 3 and one 2 × 2 max-pooling layer; the convolution uses all-zero padding with stride 1, and the pooling stride is 4. Channel b comprises 32 convolution kernels of size 5 × 5 and two 2 × 2 max-pooling layers; the convolution uses all-zero padding with stride 1, and the max-pooling stride is 2. Channels a and b each produce 32 feature maps of size 8 × 8, and the 64 feature maps of size 8 × 8 from the two channels are used as the input of C5. C5 and C6 each consist of 64 convolution kernels of size 3 × 3 with all-zero padding and stride 1; S3 is a 2 × 2 max-pooling layer with stride 2. Finally, the fully connected layer integrates the local information of the convolutional and pooling layers with the classification information.
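A PyTorch sketch assembled from the layer sizes described above; the ReLU activations, the number of output classes and the use of a single fully connected layer are assumptions where the text is silent, so this is a sketch rather than the patent's reference implementation:

    import torch
    import torch.nn as nn

    class ParallelCNN(nn.Module):
        """8 convolutional layers, 6 max-pooling layers and a final fully connected layer."""
        def __init__(self, num_classes: int = 10):        # num_classes is illustrative
            super().__init__()
            self.stem = nn.Sequential(                    # C1, C2, S1, C3, C4, S2
                nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(inplace=True),
                nn.Conv2d(16, 16, 3, padding=1), nn.ReLU(inplace=True),
                nn.MaxPool2d(2, stride=2),                # 128 -> 64
                nn.Conv2d(16, 16, 3, padding=1), nn.ReLU(inplace=True),
                nn.Conv2d(16, 16, 3, padding=1), nn.ReLU(inplace=True),
                nn.MaxPool2d(2, stride=2))                # 64 -> 32
            self.branch_a = nn.Sequential(                # channel a: 3x3 conv + one pool, stride 4
                nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(inplace=True),
                nn.MaxPool2d(2, stride=4))                # 32 -> 8
            self.branch_b = nn.Sequential(                # channel b: 5x5 conv + two pools, stride 2
                nn.Conv2d(16, 32, 5, padding=2), nn.ReLU(inplace=True),
                nn.MaxPool2d(2, stride=2),                # 32 -> 16
                nn.MaxPool2d(2, stride=2))                # 16 -> 8
            self.head = nn.Sequential(                    # C5, C6, S3
                nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(inplace=True),
                nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(inplace=True),
                nn.MaxPool2d(2, stride=2))                # 8 -> 4
            self.fc = nn.Linear(64 * 4 * 4, num_classes)

        def forward(self, x):                             # x: (N, 3, 128, 128)
            x = self.stem(x)
            x = torch.cat([self.branch_a(x), self.branch_b(x)], dim=1)  # 32 + 32 = 64 maps of 8x8
            x = self.head(x)
            return self.fc(torch.flatten(x, 1))

With 128 × 128 inputs the two branches each emit 32 feature maps of size 8 × 8, which is the concatenation the text describes before C5.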
Step 4: the extracted feature information is used to predict the variety of each fruit image; the prediction is compared with the corresponding label, and the parallel convolutional neural network model is trained based on the comparison result.
In step 4, a new loss function, composed of a maximum class-spacing loss function and a SoftmaxWithLoss function for distinguishing different varieties of the same class, can be used to optimize the convolutional neural network. The maximum class-spacing loss function is given by:
[Equations (1) and (2), which define the maximum class-spacing loss L, are given only as images in the original document and are not reproduced here.]
where i denotes the i-th fruit class, j the j-th fruit class, M the total number of fruit classes, M(i) the mean of the i-th class, M(j) the mean of the j-th class, n the number of samples of the i-th class, x(i,e) the e-th sample of the i-th class, and h_w,b(x(i,e)) denotes w·x(i,e-1)+b, where w is the corresponding weight and b is the bias term. An identified variety does not need to be compared with every other fruit one by one, only with similar varieties, and the maximum class spacing increases the distance between similar varieties as the number of training iterations grows. Combining the maximum class spacing with SoftmaxWithLoss gives:
J=S-λL (3)
where S denotes SoftmaxWithLoss, L denotes the maximum class-spacing function and λ is a hyper-parameter; J makes the classification between similar classes increasingly accurate. Differentiating L yields:
[Equation (4), the derivative of L, is given only as an image in the original document and is not reproduced here.]
where Z denotes f(h_w,b(x(i,e))), and f, l and n denote the activation function, the number of convolutional layers and the number of samples, respectively.
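A hedged PyTorch sketch of a combined objective J = S - λL of this kind. Because equations (1), (2) and (4) are only available as images in the source, the class-spacing term below is an assumed form (average squared distance between per-class feature means); only the combination S - λL and the definitions given in the text are taken from the document:

    import torch
    import torch.nn.functional as F

    def class_spacing(features, labels):
        """Assumed maximum class-spacing term L: average squared distance
        between the feature means M(i) of the classes present in the batch."""
        means = torch.stack([features[labels == c].mean(dim=0) for c in labels.unique()])
        k = means.size(0)
        if k < 2:
            return features.new_zeros(())
        diffs = means.unsqueeze(0) - means.unsqueeze(1)   # pairwise M(i) - M(j)
        return diffs.pow(2).sum() / (k * (k - 1))

    def combined_loss(logits, features, labels, lam=0.1):
        """J = S - lam * L: cross-entropy (the SoftmaxWithLoss equivalent) minus the
        spacing term, so minimizing J also pushes class means apart; lam is assumed."""
        s = F.cross_entropy(logits, labels)
        return s - lam * class_spacing(features, labels)

In a training loop, features would typically be the flattened output of the last pooling layer and logits the output of the fully connected layer.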
In step 3, the features of the last pooling layer and of the last fully connected layer can be extracted for each picture; alternatively, a regularization operation can be applied to the original image, max pooling applied to the cropped image, and the regularization operation applied again.
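A small sketch of how the last pooling-layer features and the fully connected output could be pulled out of the ParallelCNN sketched earlier, using a forward hook; the hook-based approach is an implementation choice, not something specified in the text:

    import torch

    def extract_features(model, images):
        """Return (last pooling-layer features, fully connected output) for a batch."""
        captured = {}
        def save_pool(_module, _inputs, output):
            captured["pool"] = output.detach()
        handle = model.head[-1].register_forward_hook(save_pool)   # S3, the last max-pool
        with torch.no_grad():
            fc_out = model(images)
        handle.remove()
        return captured["pool"].flatten(1), fc_out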
In step 4, a Softmax classifier can also be used for variety prediction.
The method of the invention is equally effective for fruit images that carry no labels inside the image itself; in-image labels here refer to bounding-box labels, contour labels and the like.

Claims (6)

1. A fruit variety identification method based on a parallel convolutional neural network, characterized by comprising the following steps:
step 1: converting each fruit image carrying a category label into a 3-channel picture of 128 × 128 pixels;
step 2: applying translation, rotation and mirror-flip operations to the result of step 1 to generate fruit image data sets at multiple scales;
step 3: feeding the result of step 2 into a generative adversarial network model for data augmentation;
step 4: building a parallel convolutional neural network model and performing multi-scale feature extraction on the result of step 3;
step 5: using the features extracted in step 4, predicting the category of each fruit image with the parallel convolutional neural network model, comparing the prediction with the category label, and training the parallel convolutional neural network model according to the comparison result to improve the identification accuracy.
2. The fruit variety identification method based on a parallel convolutional neural network according to claim 1, wherein step 2 comprises rotating the picture by 30°, 60° and 90°, translating it by 10%, 20% and 30%, and mirror-flipping it at 30°, 60° and 90°.
3. The fruit variety identification method based on a parallel convolutional neural network according to claim 1, wherein the generative adversarial network of step 3 comprises a generator and a discriminator; noise is fed into the generator and randomly sampled, and a 5-layer network generates 3-channel data samples of 128 × 128; the discriminator compares the generated data samples with real samples and judges whether the samples produced by the generator are genuine, and the generator is updated with the discriminator's parameters fixed so as to generate pictures that are harder for the discriminator to distinguish from real ones.
4. The fruit variety identification method based on a parallel convolutional neural network according to claim 1, wherein the parallel convolutional neural network used in step 4 comprises 8 convolutional layers, 6 max-pooling layers and a fully connected layer, the fully connected layer integrating the local information of the convolutional and max-pooling layers with the classification information.
5. The fruit variety identification method based on a parallel convolutional neural network according to claim 1, wherein step 5 further comprises optimizing the convolutional neural network model with a combination of a maximum class-spacing loss function and the SoftmaxWithLoss loss function.
6. The fruit variety identification method based on a parallel convolutional neural network according to any one of claims 1 to 5, further comprising taking the fruit image to be classified as the target image, performing the operations of steps 1 to 3, and feeding the result into the trained parallel convolutional neural network model to perform fruit variety identification.
CN202011327435.9A 2020-11-24 2020-11-24 Fruit variety identification method based on parallel convolutional neural network Pending CN112487909A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011327435.9A CN112487909A (en) 2020-11-24 2020-11-24 Fruit variety identification method based on parallel convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011327435.9A CN112487909A (en) 2020-11-24 2020-11-24 Fruit variety identification method based on parallel convolutional neural network

Publications (1)

Publication Number Publication Date
CN112487909A 2021-03-12

Family

ID=74933434

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011327435.9A Pending CN112487909A (en) 2020-11-24 2020-11-24 Fruit variety identification method based on parallel convolutional neural network

Country Status (1)

Country Link
CN (1) CN112487909A (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106651830A (en) * 2016-09-28 2017-05-10 华南理工大学 Image quality test method based on parallel convolutional neural network
CN109740627A (en) * 2018-11-27 2019-05-10 南京邮电大学 A kind of insect image identification identifying system and its method based on parallel-convolution neural network
CN110414371A (en) * 2019-07-08 2019-11-05 西南科技大学 A kind of real-time face expression recognition method based on multiple dimensioned nuclear convolution neural network
CN110516561A (en) * 2019-08-05 2019-11-29 西安电子科技大学 SAR image target recognition method based on DCGAN and CNN
CN111401156A (en) * 2020-03-02 2020-07-10 东南大学 Image identification method based on Gabor convolution neural network

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7488218B2 (en) 2021-03-29 2024-05-21 Kddi株式会社 Information processing device, information processing method, and program


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination