CN114882497A

CN114882497A - Method for realizing fruit classification and identification based on deep learning algorithm

Info

Publication number: CN114882497A
Application number: CN202210486150.2A
Authority: CN
Inventors: 高宇飞; 张学健; 温家璇
Original assignee: Northeast Petroleum University
Current assignee: Northeast Petroleum University
Priority date: 2022-05-06
Filing date: 2022-05-06
Publication date: 2022-08-09

Abstract

The invention belongs to the field of fruit picture classification, and particularly relates to a fruit picture classification algorithm of a convolution neural network of AlexNet. 1, constructing a reasonable prediction model and establishing a sample data set; 2. searching related fruit image samples through various search engines and databases, and downloading the fruit image samples to the local; 3. the convolution neural network model AlexNet is referred to, some simplification and optimization are carried out, and after a large amount of training and debugging, the optimal parameters are obtained; 4. returning to Pycharm, and performing classification test on the test set by using a TensorFlow calling model.

Description

Method for realizing fruit classification and identification based on deep learning algorithm

The technical field is as follows:

the invention belongs to the field of fruit picture classification, and particularly relates to a fruit picture classification algorithm based on AlexNet convolutional neural network.

Background art:

in recent years, with the rapid development of electronic technology and the rapid popularization of the internet, the threshold for using electronic products such as smart phones and digital cameras is becoming lower and lower. The image is used as a high-efficiency information propagation medium, and is not increased all the time no matter locally or on various social platforms. Therefore, it is very important to scan, analyze, label, and sort and file various images.

Meanwhile, the continuous development of computer science and information technology gradually guides artificial intelligence into our lives, and brings great convenience to our lives. The recognition, image identification and classification are important components in the field of artificial intelligence, and are also popular subjects in deep learning at present.

Fruits are common and universal in our lives, have various varieties, complex styles and extremely strong diversity, and have important significance in researching fruit classification and identification algorithms.

The invention content is as follows:

the invention aims to overcome the defect that the existing different classifiers in the automatic fruit identification technology have unbalanced classification effects on different fruit types, and provides a method for realizing fruit classification and identification based on a deep learning algorithm, so that the accuracy of fruit image identification is improved.

The technical scheme adopted by the invention is as follows: a method for realizing fruit classification and identification based on a deep learning algorithm comprises the following steps:

the method comprises the following steps: constructing a reasonable prediction model, and establishing a sample data set, which respectively comprises the following steps: training set, validation set and test set.

Step two: and searching relevant fruit image samples through various search engines and databases, and downloading the fruit image samples to the local.

Step three: and (3) obtaining optimal parameters by referring to a convolutional neural network model AlexNet and performing simplification and optimization, and training and debugging a large amount.

Step four: returning to Pycharm, and performing classification test on the test set by using a TensorFlow call model.

The invention has the beneficial effects that:

(1) the algorithm model is provided with 8 network layers in total, and the ReLU is used as an excitation function, so that the ReLU excitation function can play a role in the deeper layer of the network, and the phenomenon of gradient disappearance is avoided;

(2) the maximum pooling is used at the rear part of each convolutional layer, and the pooling proportion is larger, so that the size of a design network is greatly reduced while a significant part in a characteristic image is kept, and the memory occupation is reduced;

(3) the model integrally uses an Adam optimizer, and the dual advantages of Adam enable gradient reduction in back propagation to be faster, more accurate and more stable.

Description of the drawings:

FIG. 1 is a flow chart of image recognition according to the present invention;

FIG. 2 is a diagram of an algorithm model structure of the present invention

FIG. 3 is an algorithm model training diagram of the present invention;

fig. 4 is a model predictive thermodynamic diagram.

The specific implementation mode is as follows:

example one

A method for realizing fruit classification and identification based on a deep learning algorithm comprises the following steps:

the method comprises the following steps: and establishing a reasonable sample data set.

Step two: fruit image samples are collected and downloaded locally.

Step three: after a large amount of training and debugging, the optimal parameters are obtained.

Step four: and carrying out classification test on the test set.

Specifically, the intelligent identification method comprises the following steps:

the method comprises the following steps: constructing a reasonable prediction model from a large amount of data; at different stages of model creation, the sample can be divided into 3 data sets, respectively: training set, validation set and test set.

(1) Training set

A training set refers to a data set used at a learner. In a classification task for fruit recognition, a training set is used to fit parameters, such as weights, in a model. The selection of the training set needs to prepare a large-data-size and diversified training set so as to obtain a good prediction model, and certainly, all samples are not divided into the training set, so that the detected samples cannot be identified, which is called overfitting. The figure shows that the sample is divided into verification sets, so that the overfitting is prevented.

(2) Verification set

The data set used to optimize the learning model is then called the validation set. The validation set refers to a sample set used to evaluate the performance of the model after the model training is completed. It can verify whether the current model is in the best state, and adjust each parameter according to the difference, make the model in a stable and optimal state. In use, the training of the model is prevented from falling into a wrong area by a single training set.

(3) Test set

The test set is the final evaluation of the model, and intuitively shows the accuracy of the model prediction and whether the model is over-fit or under-fit. For the results of the test set, we need to retrain the model and adjust the parameters.

With respect to the partitioning of the training set validation set and the test set, a decision needs to be made by aggregating specific questions and the amount of data available. Generally, the distribution ratios that are commonly used are: 60% training set, 20% validation set, 20% test set.

Step two: the data set is divided into 5 fruit types, 5 ten thousand pictures are randomly extracted, and the pictures are classified and stored in corresponding positions.

(1) Collecting

The data set sample can be from a network and a picture of a sample object in life. The method has the characteristics of wide sample source and diversity of fruit types, and is suitable for training. A total of approximately 5000 samples were collected in this experiment.

(2) Screening

The screening is to perform manual screening once after the detection sample is collected, and the method is to remove fuzzy, blank and wrong samples and prevent the errors generated by the samples in the learning process.

(3) Classification

The screened samples are firstly classified into 5 categories according to the labels of fruits, and then the data under the labels of the fruits are randomly grouped according to the proportion of 60% of training set, 20% of verification set and 20% of testing set.

Step three: algorithm training

(1) Algorithm model design parameters

The design of the convolutional neural network model refers to AlexNet, and is simplified and optimized on the basis of the AlexNet, the parameters are the optimal parameters obtained after a large amount of training and debugging, the concept of maximum pooling is used, the more obvious characteristics can be optimally reserved, the recognition capability of a deep network is enhanced, and the classification accuracy of the algorithm can reach a higher level.

(2) Algorithm model training algorithm

After Debug, the program has no error, the training iteration number is set to 300, the training process is started, the total parameter is 2,674,845, and the trainable parameter is 2,674,845.

Before data is input into a neural network, an initialization process including resampling, normalization, layer splitting and the like is needed. Initialization is a system to facilitate feature images in post-processing

Secondly, feature extraction is the most important link, the convolution layer is required to split the features of the image, and the process of splitting the features applies the principle of local perception.

③ Down-sampling

The down sampling is completed through a pooling link, and parameters in the neural network are simplified through removing unimportant neurons in the neural network, so that the operation efficiency is improved.

Fourthly, classified calculation

Convolved and pooled image features. The probability distribution is input into a Softmax function, and the probability distribution is calculated for different types.

Loss calculation

The obtained distribution probability is input into a loss function-a cross entropy function, and compared with the existing verification set, the result of comparison is the current loss value.

Sixth, backward propagation

The current loss value is propagated forward in a reverse direction through the gradient, and the weight of the front end of the neural network is redistributed.

Seventhly, multiple iterations

After a number of iterations. The weight distribution in the network tends to be optimal, the accuracy of the classifier is improved, and the loss function tends to be stable after being reduced.

Step four: algorithm validation

(1) Model accuracy testing

Returning to Pycharm, and performing classification test on the test set by using a TensorFlow calling model. The test set contains about 1000 images, and the test program outputs the prediction results of different fruits in a form of weight values through classification tests on the images.

(2) Probability distribution visualization

The probability distribution obtained by testing is a very complex matrix, and the probability distribution and the cross entropy of different classes can be more obviously seen by drawing the matrix into a thermodynamic diagram of the probability distribution by using NumPy and Matplotlib.

(3) Optimization space of model

If the model learning space is further increased, more samples with differences need to be obtained from a wider source. The method has the advantages that a more diversified and large-scale sample library is provided, and the training efficiency and accuracy of the model are obviously improved.

The foregoing is a more detailed description of the present invention that is presented in conjunction with specific embodiments, which are not to be construed as limiting the invention to the specific embodiments described above. Numerous other simplifications or substitutions may be made without departing from the spirit of the invention as defined in the claims and the general concept thereof, which shall be construed to be within the scope of the invention.

Claims

1. A method for realizing fruit classification and identification based on a deep learning algorithm comprises the following steps:

(1) Training set

(2) Verification set

The data set used to optimize the learning model is then called the validation set. The validation set refers to a sample set used to evaluate the performance of the model after the model training is completed. It can verify whether the current model is in the best state, and adjust each parameter according to the difference, make the model in a stable and optimal state. In use, the training of the model is prevented from falling into a false zone by a single training set.

(3) Test set

(1) Collecting

(2) Screening

(3) Classification

Step three: algorithm training

(1) Algorithm model design parameters

(2) Algorithm model training algorithm

③ Down-sampling

Fourthly, classified calculation

Loss calculation

Sixth, backward propagation

Seventhly, multiple iterations

Step four: algorithm validation

(1) Model accuracy testing

(2) Probability distribution visualization

The probability distribution obtained by testing is a very complex matrix, and NumPy and Matplotlib are used for drawing the matrix into a thermodynamic diagram of the probability distribution, so that the direct probability distribution and cross entropy of different classes can be more obviously seen.

(3) Optimization space of model