CN114677553A - Image recognition method for solving unbalanced problem of crop disease and insect pest samples - Google Patents
- Publication number
- CN114677553A (application CN202111676323.9A)
- Authority
- CN
- China
- Prior art keywords
- tail
- sample
- head
- data set
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G06F18/2155—Generating training patterns; Bootstrap methods characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
- G06F18/2113—Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
- G06F18/24—Classification techniques
- G06N3/08—Neural networks; Learning methods
- G06N5/04—Inference or reasoning models
Abstract
The invention relates to the field of pest and disease identification, and in particular to an image identification method for solving the problem of imbalance in crop pest and disease samples. The method trains a model on the current labeled data set and selects the current optimal model through model verification. Each picture in the unlabeled data set is image-enhanced several times to obtain enhanced images, which are run through inference and screening to obtain a recognition result for the unlabeled image. The recognition result is fed into a sample selection strategy, which judges whether the result is retained; if retained, a pseudo label is generated and moved into the current labeled data set, training continues on the new labeled data set, and this process is iterated until accuracy no longer improves. The method reduces the influence of the long-tail distribution and improves the recall and accuracy of tail categories through iterative learning without affecting the recognition of head categories; only a single model is used for inference, no additional network layer is introduced, and inference speed is unaffected.
Description
Technical Field
The invention relates to the field of pest and disease identification, and in particular to an image identification method for solving the problem of unbalanced crop pest and disease samples.
Background
Crop diseases and insect pests are among the main agricultural disasters in the world; if they are not discovered and prevented in time, they cause great losses to agricultural production and threaten national food security and the quality and safety of agricultural products. Crop diseases and pests are characterized by many varieties, wide impact, and frequent outbreaks, which pose great challenges to their monitoring.
With the rapid development of computer vision and artificial intelligence, image-based pest and disease identification technology, with its low cost and high efficiency, has been applied to pest and disease monitoring for various crops. Current image-based identification methods generally use deep learning for model training and inference, and deep learning relies on massive data to reach its maximum recognition performance. Crop pest and disease image data, however, are unbalanced: common categories have very large data volumes while uncommon categories have little data, so the data follow a long-tail distribution in which the head is very large, the middle gradually shrinks, and the tail has very few or even no samples. Because there are many crop pest and disease categories, the tail is stretched very long.
The unbalanced-sample problem greatly affects the performance of crop pest and disease models: a model easily overfits head categories with more data and underfits tail categories with less data. Several general methods exist for addressing sample imbalance. Resampling algorithms undersample the head classes and oversample the tail classes to balance the training samples, but this can cause the model to underfit the head classes and overfit the tail classes. Re-weighting algorithms give low loss weights to head categories and high weights to tail categories, but the resulting improvement is limited. Multi-stage training methods for long-tail crop disease image recognition adjust the sample distribution through staged enhancement training on labeled data, but they do not make full use of massive unlabeled data, and the richness of tail-category data remains insufficient.
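The re-weighting idea described above can be illustrated with inverse-frequency class weights. This is a generic scheme, not the patent's method; the function name and normalization are illustrative.

```python
from collections import Counter

def inverse_frequency_weights(labels):
    """Per-class loss weights: rare (tail) classes get high weight, common
    (head) classes low weight, as in the re-weighting approach described.
    Weights are normalized so they average to 1 across classes."""
    counts = Counter(labels)
    raw = {c: 1.0 / n for c, n in counts.items()}
    scale = len(raw) / sum(raw.values())
    return {c: w * scale for c, w in raw.items()}

# 90 head-class labels vs. 10 tail-class labels
weights = inverse_frequency_weights(["head"] * 90 + ["tail"] * 10)
```

With a 9:1 imbalance the tail class ends up weighted 9 times more heavily than the head class, which is exactly why, as the text notes, the head class risks being under-emphasized during training.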
Disclosure of Invention
Aiming at the defects in the background technology, the invention provides an image recognition method for solving the problem of imbalance of crop pest samples, and the specific technical scheme is as follows:
an image recognition method for solving the problem of imbalance of crop pest samples comprises the following steps:
step S1, creating a labeled data set: collecting crop pest and disease picture data, and marking the positions of the pests and diseases by using a rectangular frame to form a marked data set; dividing the labeled data set into a training set, a verification set and a test set according to a certain proportion;
Step S2, model training: constructing a target detection model, training the training set in the data set of the step S1 by adopting the constructed target detection model, and outputting an intermediate target detection model after each training;
step S3, model verification: inputting the verification set images in the step S1 into the intermediate model trained in the step S2 for model verification, and selecting the intermediate target detection model with the highest recognition accuracy as the current optimal target detection model;
step S4, creating a label-free data set: collecting mass crop disease and insect pest picture data as a label-free data set;
step S5, image enhancement: performing data enhancement on each original picture without the labeled data set in the step S4 to obtain enhanced N pictures, and merging the enhanced N pictures with the corresponding original pictures to obtain N +1 combined pictures as a group of data to be processed;
step S6, reasoning without a label data model: inputting each group of data to be processed in the step S5 into the current optimal target detection model in the step S3 respectively for reasoning to obtain N +1 recognition results, performing post-processing on each recognition result respectively, overlapping the post-processed recognition results, screening the overlapped results through a non-maximum suppression algorithm, and finally obtaining the recognition result without labeled data;
Step S7, sample selection: judging the identification result of the non-labeled data in the step S6 according to a sample selection strategy, determining whether to retain the identification result, and if so, selecting the original picture corresponding to the identification result from the non-labeled data set in the step S4 as a new sample;
step S8, new data generation: generating a pseudo label of the non-artificial annotation for the new sample in the step S7 in a rectangular frame annotation manner with an annotated data set in the step S1, taking the pseudo label and the original picture corresponding to the unmarked data set in the step S4 as new data, putting all the new data into the training set, the verification set and the test set in the annotated data set in the step S1 according to a certain proportion, and removing the original picture corresponding to the unmarked data set in the step S4;
step S9, after the newly generated data of step S8 is added into the labeled data set in step S1, the iterative learning is continued according to the flow of steps S1-S8, if the accuracy of the optimal target detection model is not improved any more in step S3, the iterative learning is ended, and the final target detection model is obtained;
step S10, labeled data model reasoning: and (4) inputting the test set with the labeled data set in the step (S1) into the final target detection model obtained in the step (S9) for model reasoning to obtain an identification result of the test set after iterative learning optimization.
Preferably, in step S1, the labeled data set is divided into a training set, a validation set, and a test set at a ratio of 0.8 : 0.1 : 0.1.
Preferably, the target detection model in step S2 is a YOLOv5l6 network structure model using a YOLOv5 target detection algorithm.
Preferably, the data enhancement in step S5 includes 4 ways: random horizontal flipping, random vertical flipping, random rotation, and random brightness increase, so that N = 4.
Preferably, the sample selection strategy in step S7 includes the following steps:
step S71, head-tail division: performing sample quantity statistics on the training set of the labeled data set of step S1, wherein the labeled data set contains C pest and disease categories in total; for each category c ∈ {1, 2, …, C}, the number of labels N_c is computed, and the total number of labels N_total and average number of labels N_m are then:
N_total = N_1 + N_2 + … + N_C, N_m = N_total / C;
a category whose number of labels is greater than N_m is classified as a head category, and a category whose number of labels is less than or equal to N_m is classified as a tail category; counting the total number of head-category labels N_h and the total number of tail-category labels N_t gives:
N_h + N_t = N_total;
step S72, head-tail determination: classifying the category of each rectangular box in the recognition result of the unlabeled data of step S6 as head or tail to obtain the respective head and tail counts; if the head count is greater than the tail count, the sample is a head sample, otherwise it is a tail sample;
step S73, new sample candidates: for a sample judged to be a head sample, the confidence mean of the head categories in its recognition result is calculated; if this mean is greater than the head confidence threshold T_h, the sample is added to the head new-sample candidate queue Q_h. For a sample judged to be a tail sample, the confidence mean of the tail categories is calculated; if this mean is greater than the tail confidence threshold T_t, the sample is added to the tail new-sample candidate queue Q_t;
step S74, new sample selection: the head new-sample candidate queue Q_h is sorted in descending order of confidence to obtain the sorted queue Q_h', and the top proportion P_h of samples is selected from Q_h' as head new samples; the tail new-sample candidate queue Q_t is sorted in descending order of confidence to obtain the sorted queue Q_t', and the top proportion P_t of samples is selected from Q_t' as tail new samples; the head new samples and the tail new samples are combined into the current new samples.
Preferably, the head confidence threshold T_h has a value range of 0.9 ≤ T_h < 1.
Preferably, the tail confidence threshold T_t has a value range of 0.9 ≤ T_t < 1.
The beneficial effects of the invention are as follows. The invention provides an image recognition method for solving the problem of imbalance in crop pest and disease samples: model training is performed with the current labeled data set, and the current optimal model is selected through model verification; pictures in the unlabeled data set are image-enhanced several times, the enhanced images are run through inference, and the overlaid results are screened by a non-maximum suppression algorithm to obtain recognition results for the unlabeled images. Each recognition result is fed into a sample selection strategy, which judges whether it is retained; if retained, a pseudo label is generated and moved into the current labeled data set, training continues on the new labeled data set, and this flow is iterated until accuracy no longer improves. The invention makes full use of massive unlabeled crop pest and disease data for semi-supervised learning, designs a sample selection strategy targeted at the sample imbalance problem, continuously adjusts the data distribution, and reduces the influence of the long-tail distribution. Iterative learning improves the recall and accuracy of tail categories without affecting the recognition of head categories; only a single model is used for inference, no additional network layer is introduced, and inference speed is unaffected.
Drawings
In order to more clearly illustrate the detailed description of the invention or the technical solutions in the prior art, the drawings that are needed in the detailed description of the invention or the prior art will be briefly described below. Throughout the drawings, like elements or portions are generally identified by like reference numerals. In the drawings, elements or portions are not necessarily drawn to scale.
FIG. 1 is a schematic flow chart of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described clearly and completely with reference to the accompanying drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without inventive step, are within the scope of protection of the present invention.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in this specification and the appended claims, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in this specification and the appended claims refers to and includes any and all possible combinations of one or more of the associated listed items.
As shown in fig. 1, the specific embodiment of the present invention provides an image recognition method for solving the unbalanced problem of crop pest samples, comprising the following steps:
step S1, creating a labeled data set: collecting crop pest and disease picture data, and marking the positions of the pests and diseases with rectangular boxes to form a labeled data set; the labeled data set is divided into a training set, a verification set, and a test set at a ratio of 0.8 : 0.1 : 0.1;
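The split in step S1 can be sketched by applying the fixed 0.8 : 0.1 : 0.1 proportions to a shuffled sample list. The function name and seed below are illustrative, not from the patent.

```python
import random

def split_dataset(samples, ratios=(0.8, 0.1, 0.1), seed=42):
    """Shuffle and split labeled samples into train/val/test subsets,
    following the 0.8 : 0.1 : 0.1 proportion of step S1 (sketch only)."""
    items = list(samples)
    random.Random(seed).shuffle(items)
    n = len(items)
    n_train = int(n * ratios[0])
    n_val = int(n * ratios[1])
    train = items[:n_train]
    val = items[n_train:n_train + n_val]
    test = items[n_train + n_val:]  # remainder becomes the test set
    return train, val, test

train, val, test = split_dataset(range(100))
```

The remainder-based test split keeps every sample assigned to exactly one subset even when the total count does not divide evenly.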
step S2, model training: constructing a target detection model, training the training set in the data set of the step S1 by adopting the constructed target detection model, and outputting an intermediate target detection model after each training; the target detection model is a YOLOv5l6 network structure model adopting a YOLOv5 target detection algorithm.
Step S3, model verification: inputting the verification set images in the step S1 into the intermediate model trained in the step S2 for model verification, and selecting the intermediate target detection model with the highest recognition accuracy as the current optimal target detection model;
step S4, creating a label-free data set: collecting mass crop disease and insect pest picture data as a label-free data set;
step S5, image enhancement: performing data enhancement on each original picture in the unlabeled data set of step S4 to obtain N enhanced pictures, and merging the N enhanced pictures with the corresponding original picture to obtain N + 1 combined pictures as a group of data to be processed; data enhancement includes 4 ways: random horizontal flipping, random vertical flipping, random rotation, and random brightness increase, so that N = 4.
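A minimal pure-Python sketch of the four enhancement ways in step S5, with a grayscale image as a nested list; the flips are deterministic, a fixed 90° rotation stands in for random rotation, and the brightness delta is fixed. A real pipeline would use an image library such as OpenCV or PIL.

```python
def hflip(img):
    """Horizontal flip (the 'random' choice is omitted for determinism)."""
    return [row[::-1] for row in img]

def vflip(img):
    """Vertical flip."""
    return img[::-1]

def rot90(img):
    """90-degree rotation, standing in for random rotation."""
    return [list(row) for row in zip(*img[::-1])]

def brighten(img, delta=30):
    """Brightness increase, clipped to the 8-bit range [0, 255]."""
    return [[min(255, p + delta) for p in row] for row in img]

def augment_group(img):
    """Step S5: N = 4 enhanced copies merged with the original -> N + 1 images."""
    enhanced = [hflip(img), vflip(img), rot90(img), brighten(img)]
    return [img] + enhanced

group = augment_group([[0, 50], [100, 200]])
```

The returned group of N + 1 = 5 images is exactly what step S6 feeds to the detector as one unit of data to be processed.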
Step S6, unlabeled data model inference: inputting each group of data to be processed from step S5 into the current optimal target detection model of step S3 for inference, obtaining N + 1 recognition results; post-processing each recognition result, which includes restoring the result of the randomly horizontally flipped picture according to the horizontal flip parameter, restoring the result of the randomly vertically flipped picture according to the vertical flip parameter, and restoring the result of the randomly rotated picture according to the rotation parameter; overlaying the post-processed recognition results, and screening the overlaid results with a non-maximum suppression algorithm to obtain the final recognition result for the unlabeled data;
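The post-processing and screening of step S6 can be sketched as follows, assuming detections are (x1, y1, x2, y2, score) tuples. `unflip_h` illustrates restoring boxes from a horizontally flipped copy back to original coordinates, and `nms` is plain greedy non-maximum suppression over the overlaid result sets; both helpers are illustrative, not the patent's code.

```python
def unflip_h(box, img_w):
    """Map a box detected on a horizontally flipped image back to the
    original image's coordinate frame (one of the S6 restorations)."""
    x1, y1, x2, y2, s = box
    return (img_w - x2, y1, img_w - x1, y2, s)

def iou(a, b):
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union else 0.0

def nms(boxes, thr=0.5):
    """Greedy non-maximum suppression: keep the highest-scoring box and
    drop any later box that overlaps a kept one by more than `thr`."""
    kept = []
    for b in sorted(boxes, key=lambda r: r[4], reverse=True):
        if all(iou(b, k) <= thr for k in kept):
            kept.append(b)
    return kept

# Overlaid detections from the N + 1 inference passes (toy values):
merged = [(10, 10, 50, 50, 0.9), (12, 11, 51, 49, 0.8), (200, 200, 240, 240, 0.7)]
final = nms(merged)
```

Here the two near-duplicate detections of the same lesion collapse into the single higher-confidence box, while the distant detection survives.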
Step S7, sample selection: and judging the identification result of the non-labeled data in the step S6 according to a sample selection strategy, determining whether to retain the identification result, and if so, selecting the original picture corresponding to the identification result from the non-labeled data set in the step S4 as a new sample. The sample selection strategy comprises the following steps:
step S71, head and tail division: carrying out sample quantity statistics on the training set with the labeled data set in the step S1, wherein the labeled data set has C pest categories in total, and calculating the labeled quantity N of each pest category CcC is equal to {1,2, …, C }, and the total number of labels is NtotalAverage number of labels NmAnd then:
the number of labels is larger than NmIs divided into a head category, otherwise the number of labels is less than or equal to NmClassifying into a tail category; counting the total number N of the head category labelshTotal number of tail class labels NtAnd then:
Nh+Nt=Ntotal。
Assume the training set of the labeled data set has 100 pest and disease categories, so C = 100. The 1st category is ulcer disease with 20000 labels, so N_1 = 20000; the 2nd category is Huanglongbing with 20 labels, so N_2 = 20. Counting the total number of labels over all categories gives N_total = 100000, and therefore N_m = N_total / C = 100000 / 100 = 1000.
Step S72, head-tail determination: classify the category of each rectangular box in the recognition result of the unlabeled data of step S6 as head or tail to obtain the respective head and tail counts; if the head count is greater than the tail count, the sample is a head sample, otherwise it is a tail sample.
Head-tail division is performed over the 100 pest and disease categories: the number of ulcer disease labels, 20000, is greater than the average number of labels, 1000, so ulcer disease belongs to the head categories; the number of Huanglongbing labels, 20, is less than 1000, so Huanglongbing belongs to the tail categories. Assuming 20 categories are head categories and 80 are tail categories, counting the labels of the 20 head categories gives N_h = 95000, and counting the labels of the 80 tail categories gives N_t = 5000, so N_h + N_t = 95000 + 5000 = 100000 = N_total, where 100000 is the total number of labels over all categories.
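Step S71's head/tail split can be sketched with toy label counts; the category names echo the example above, but the third category and all counts here are illustrative, not the patent's data.

```python
def split_head_tail(label_counts):
    """Step S71: categories with more labels than the mean N_m are head
    categories, the rest are tail categories.
    `label_counts` maps category name -> number of labels."""
    n_total = sum(label_counts.values())
    n_m = n_total / len(label_counts)  # average number of labels
    head = {c for c, n in label_counts.items() if n > n_m}
    tail = set(label_counts) - head
    return head, tail, n_m

# Illustrative counts at the example's scale of imbalance:
counts = {"ulcer": 20000, "huanglongbing": 20, "other": 1000}
head, tail, n_m = split_head_tail(counts)
```

With these counts only the heavily labeled category clears the mean, so everything else, including the mid-sized category, falls into the tail, mirroring how a long-tail distribution concentrates labels in a few head classes.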
Assuming that there are 200000 picture samples in the unlabeled data set, sequentially performing head and tail determination on each sample, wherein the identification result of the 1 st sample contains 2 detection frames, 2 of which are ulcer diseases, dividing according to the head and tail categories in step S71, and determining that the 1 st sample is a head sample if the number of heads is 2, the number of tails is 0, and the number of heads is greater than the number of tails; the identification result of the 2 nd sample contains 3 detection frames, wherein 1 is ulcer disease and 2 are huanglongbing disease, the 2 nd sample is judged to be a tail sample according to the head and tail classification in the step S71, the number of heads is 1, the number of tails is 2, and the number of heads is less than the number of tails.
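The per-sample head/tail determination of step S72 is a majority vote over the detection boxes. The sketch below assumes detections are (category, confidence) tuples and reuses the two example samples from the text.

```python
def judge_sample(detections, head_categories):
    """Step S72: a sample is a head sample when its recognition result
    contains more head-category boxes than tail-category boxes,
    otherwise it is a tail sample."""
    n_head = sum(1 for cat, _conf in detections if cat in head_categories)
    n_tail = len(detections) - n_head
    return "head" if n_head > n_tail else "tail"

head_cats = {"ulcer"}
# 1st sample: 2 ulcer disease boxes -> 2 head, 0 tail
sample1 = [("ulcer", 0.95), ("ulcer", 0.91)]
# 2nd sample: 1 ulcer disease box, 2 Huanglongbing boxes -> 1 head, 2 tail
sample2 = [("ulcer", 0.92), ("huanglongbing", 0.91), ("huanglongbing", 0.98)]
result1 = judge_sample(sample1, head_cats)
result2 = judge_sample(sample2, head_cats)
```

Note that ties (equal head and tail counts) fall to the tail side here, which matches the text's "otherwise it is a tail sample" wording.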
Step S73, new sample candidates: for a sample judged to be a head sample, the confidences of the head-category labels in its recognition result are summed and divided by the number of head-category labels to obtain the head confidence mean; if this mean is greater than the head confidence threshold T_h, the sample is added to the head new-sample candidate queue Q_h. For a sample judged to be a tail sample, the confidences of the tail-category labels are summed and divided by the number of tail-category labels to obtain the tail confidence mean; if this mean is greater than the tail confidence threshold T_t, the sample is added to the tail new-sample candidate queue Q_t. The head confidence threshold satisfies 0.9 ≤ T_h < 1, and the tail confidence threshold satisfies 0.9 ≤ T_t < 1.
For the sample determined to be a head sample in step S72 (the 1st sample, whose 2 ulcer disease detections have confidences 0.95 and 0.91), the confidence mean is (0.95 + 0.91) / 2 = 0.93. With the head confidence threshold set to T_h = 0.90, since 0.93 > 0.90 the 1st sample is added to the head new-sample candidate queue, giving Q_h = {1}; the other head samples are judged in the same way. For the sample determined to be a tail sample in step S72 (the 2nd sample, with an ulcer disease confidence of 0.92 and 2 Huanglongbing confidences of 0.91 and 0.98), the confidence mean is (0.92 + 0.91 + 0.98) / 3 ≈ 0.937. With the tail confidence threshold set to T_t = 0.92, since 0.937 > 0.92 the 2nd sample is added to the tail new-sample candidate queue, giving Q_t = {2}; the other tail samples are judged in the same way.
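A sketch of step S73, assuming detections are (category, confidence) tuples. The numeric example averages the confidences of all detections in a sample (the tail sample's mean 0.937 includes the ulcer disease box), so this sketch does the same; the threshold values are the ones from the example.

```python
def enqueue_candidates(samples, head_cats, t_h=0.90, t_t=0.92):
    """Step S73: route each sample into the head or tail candidate queue
    when its mean detection confidence clears the relevant threshold.
    Following the numeric example, the mean is over all detections."""
    q_h, q_t = [], []
    for sample_id, detections in samples:
        mean_conf = sum(conf for _cat, conf in detections) / len(detections)
        n_head = sum(1 for cat, _ in detections if cat in head_cats)
        if n_head > len(detections) - n_head:   # head sample (step S72)
            if mean_conf > t_h:
                q_h.append((sample_id, mean_conf))
        elif mean_conf > t_t:                   # tail sample
            q_t.append((sample_id, mean_conf))
    return q_h, q_t

samples = [
    (1, [("ulcer", 0.95), ("ulcer", 0.91)]),
    (2, [("ulcer", 0.92), ("huanglongbing", 0.91), ("huanglongbing", 0.98)]),
]
q_h, q_t = enqueue_candidates(samples, head_cats={"ulcer"})
```

Both example samples clear their thresholds, reproducing Q_h = {1} and Q_t = {2} along with the 0.93 and roughly 0.937 confidence means.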
Step S74, new sample selection: sort the head new-sample candidate queue Q_h in descending order of confidence to obtain the sorted queue Q_h', and select from Q_h' the top proportion P_h of samples as head new samples; sort the tail new-sample candidate queue Q_t in descending order of confidence to obtain the sorted queue Q_t', and select from Q_t' the top proportion P_t of samples as tail new samples; the head new samples and the tail new samples are combined into the current new samples. The head ratio P_h is calculated as …, and the tail ratio P_t is calculated as ….
For the head new-sample candidate queue Q_h = {1, 3, 4, …} with confidence means {0.93, 0.90, 0.92, …}, sorting Q_h in descending order of confidence gives Q_h' = {1, 4, 3, …}, and the top proportion P_h of samples is selected from Q_h' as head new samples. For the tail new-sample candidate queue Q_t = {2, 5, 6, …} with confidence means {0.937, 0.92, 0.93, …}, sorting in descending order gives Q_t' = {2, 6, 5, …}, and the top proportion P_t of samples is selected from Q_t' as tail new samples. The head new samples and the tail new samples are combined into the current new samples; the proportion of new tail data is far larger than that of the head, which improves the richness of the tail-category data while ensuring that the head-category count grows only slowly.
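Step S74's selection can be sketched as below. The proportions `p_h` and `p_t` are placeholders here, since the patent computes them from the label statistics with the tail proportion much larger than the head proportion; the candidate queues reuse the example's sample ids and confidence means.

```python
import math

def select_new_samples(q_h, q_t, p_h=0.2, p_t=0.8):
    """Step S74: sort each candidate queue by confidence (descending)
    and keep the top proportion of each.
    p_h and p_t are illustrative placeholder values."""
    def top(queue, p):
        ranked = sorted(queue, key=lambda x: x[1], reverse=True)
        return [sid for sid, _conf in ranked[:math.ceil(len(ranked) * p)]]
    return top(q_h, p_h) + top(q_t, p_t)

q_h = [(1, 0.93), (3, 0.90), (4, 0.92)]   # head candidates with means
q_t = [(2, 0.937), (5, 0.92), (6, 0.93)]  # tail candidates with means
new_samples = select_new_samples(q_h, q_t)
```

Sorting reproduces the example's Q_h' = {1, 4, 3} and Q_t' = {2, 6, 5}; with the placeholder proportions, far more tail candidates than head candidates survive, matching the stated goal of enriching tail-category data.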
Step S8, new data generation: generating a pseudo label of the non-artificial label for the new sample in the step S7 in a manner of labeling the rectangular frame with the labeled data set in the step S1, taking the pseudo label and the original picture corresponding to the label-free data set in the step S4 as new data, putting all the new data into the training set, the verification set and the test set in the labeled data set in the step S1 according to a certain proportion, and removing the original picture corresponding to the label-free data set in the step S4;
step S9, after the newly generated data of step S8 is added into the labeled data set in step S1, the iterative learning is continued according to the flow of steps S1-S8, if the accuracy of the optimal target detection model is not improved any more in step S3, the iterative learning is ended, and the final target detection model is obtained;
Step S10, the annotated data model inference: and (4) inputting the test set with the labeled data set in the step (S1) into the final target detection model obtained in the step (S9) for model reasoning to obtain an identification result of the test set after iterative learning optimization.
Those of ordinary skill in the art will appreciate that the elements of the various embodiments described in connection with the embodiments disclosed herein can be embodied in electronic hardware, computer software, or combinations of both, and that the compositions of the various embodiments have been described above generally in terms of their functionality in order to clearly illustrate the interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the embodiments provided in the present application, it should be understood that the division of the unit is only one division of logical functions, and other division manners may be used in actual implementation, for example, multiple units may be combined into one unit, one unit may be split into multiple units, or some features may be omitted.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; the modifications and the substitutions do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention, and the corresponding technical solutions are all covered in the claims and the specification of the present invention.
Claims (9)
1. An image recognition method for solving the problem of unbalanced crop disease and insect pest samples, characterized in that the method comprises the following steps:
step S1, creating a labeled data set: collecting crop disease and insect pest picture data, and marking the positions of the disease and insect pests by using a rectangular frame to form a marked data set; dividing the labeled data set into a training set, a verification set and a test set according to a certain proportion;
step S2, model training: constructing a target detection model, training it on the training set of the data set in step S1, and outputting an intermediate target detection model after each round of training;
Step S3, model verification: inputting the verification set images in the step S1 into the intermediate model trained in the step S2 for model verification, and selecting the intermediate target detection model with the highest recognition accuracy as the current optimal target detection model;
step S4, creating an unlabeled data set: collecting a large quantity of crop disease and insect pest picture data as an unlabeled data set;
step S5, image enhancement: performing data enhancement on each original picture in the unlabeled data set of step S4 to obtain N enhanced pictures, and combining the N enhanced pictures with the corresponding original picture into a group of N + 1 pictures as a group of data to be processed;
step S6, unlabeled-data model inference: inputting each group of data to be processed from step S5 into the current optimal target detection model of step S3 for inference to obtain N + 1 recognition results, post-processing each recognition result, superimposing the post-processed recognition results, and screening the superimposed results with a non-maximum suppression algorithm to obtain the final recognition result of the unlabeled data;
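The superimpose-and-screen stage of step S6 can be sketched as follows, assuming the per-view detections have already been post-processed (mapped back to the original picture's coordinates). The dictionary layout ("box", "score") and function names are illustrative assumptions; the claim only requires that overlapping results be screened by non-maximum suppression.

```python
def box_iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def fuse_tta_results(per_view_results, iou_thresh=0.5):
    """Pool the N + 1 per-view detections and suppress duplicates greedily."""
    dets = sorted((d for view in per_view_results for d in view),
                  key=lambda d: d["score"], reverse=True)
    kept = []
    for d in dets:
        # Keep a detection only if it does not heavily overlap a kept one.
        if all(box_iou(d["box"], k["box"]) < iou_thresh for k in kept):
            kept.append(d)
    return kept
```

In practice a library NMS routine would be used instead of this greedy loop; the sketch only shows why overlapping boxes from different augmented views collapse into one detection.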
step S7, sample selection: judging the identification result of the non-labeled data in the step S6 according to a sample selection strategy, determining whether to retain the identification result, and if so, selecting the original picture corresponding to the identification result from the non-labeled data set in the step S4 as a new sample;
step S8, new data generation: for the new samples selected in step S7, generating pseudo labels without manual annotation, in the same rectangular-frame annotation format as the labeled data set of step S1; taking each pseudo label together with its corresponding original picture from the unlabeled data set of step S4 as new data; distributing all the new data among the training set, the verification set and the test set of the labeled data set of step S1 in a certain proportion; and removing the corresponding original pictures from the unlabeled data set of step S4;
step S9, iterative learning: after the data newly generated in step S8 has been added to the labeled data set of step S1, continuing iterative learning according to the flow of steps S1 to S8; when the accuracy of the optimal target detection model in step S3 no longer improves, ending the iterative learning to obtain the final target detection model;
step S10, labeled-data model inference: inputting the test set of the labeled data set of step S1 into the final target detection model obtained in step S9 for model inference to obtain the recognition result of the test set after iterative learning optimization.
2. The image recognition method for solving the problem of unbalanced crop disease and insect pest samples according to claim 1, characterized in that: in step S1, the labeled data set is divided into a training set, a validation set and a test set at a ratio of 0.8 : 0.1 : 0.1.
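The 0.8 : 0.1 : 0.1 split of claim 2 amounts to the following routine, shown here as an illustrative sketch (the shuffle, seed and function name are assumptions; the claim only fixes the proportions):

```python
import random

def split_dataset(samples, ratios=(0.8, 0.1, 0.1), seed=0):
    """Shuffle and split a labeled data set into train/validation/test."""
    s = samples[:]
    random.Random(seed).shuffle(s)   # fixed seed for a reproducible split
    n = len(s)
    n_train = int(ratios[0] * n)
    n_val = int(ratios[1] * n)
    return s[:n_train], s[n_train:n_train + n_val], s[n_train + n_val:]
```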
3. The image recognition method for solving the unbalance problem of the crop pest and disease damage samples according to claim 1, characterized in that: the target detection model in step S2 is a YOLOv5l6 network structure model using a YOLOv5 target detection algorithm.
4. The image recognition method for solving the problem of unbalanced crop disease and insect pest samples according to claim 1, characterized in that: the data enhancement in step S5 comprises 4 ways: random horizontal flipping, random vertical flipping, random rotation and random brightness increase, wherein N = 4.
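The N + 1 = 5 image group of claims 1 and 4 can be sketched as below. NumPy arrays stand in for real pictures, and the rotation is restricted to multiples of 90 degrees for simplicity; a production pipeline would use a proper image-augmentation library, so this is an assumption-laden illustration, not the patented implementation.

```python
import numpy as np

def enhance_group(img, rng=None):
    """Return the original picture plus the 4 enhanced copies of claim 4."""
    rng = rng if rng is not None else np.random.default_rng(0)
    return [
        img,                                            # original picture
        img[:, ::-1],                                   # random horizontal flip
        img[::-1, :],                                   # random vertical flip
        np.rot90(img, k=int(rng.integers(1, 4))),       # random rotation
        np.clip(img * rng.uniform(1.0, 1.5), 0, 255),   # brightness increase
    ]
```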
5. The image recognition method for solving the unbalance problem of the crop pest and disease damage samples according to claim 1, characterized in that: the sample selection strategy in step S7 includes the following steps:
step S71, head and tail division: carrying out sample quantity statistics on the training set of the labeled data set in step S1, wherein the labeled data set has C pest and disease categories in total; counting the number of annotations Nc of each category c, c ∈ {1, 2, …, C}, and the total number of annotations Ntotal, the average number of annotations Nm being:
Nm = Ntotal / C;
dividing the categories whose number of annotations is greater than Nm into the head categories, and the categories whose number of annotations is less than or equal to Nm into the tail categories; counting the total number of head category annotations Nh and the total number of tail category annotations Nt, such that:
Nh + Nt = Ntotal;
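Step S71 reduces to a simple frequency threshold at the mean. A minimal sketch, assuming labels are given as one class name per annotated rectangular frame:

```python
from collections import Counter

def split_head_tail(labels):
    """Divide classes into head and tail around the mean Nm = Ntotal / C."""
    counts = Counter(labels)            # Nc for each class c
    n_total = sum(counts.values())      # Ntotal
    n_m = n_total / len(counts)         # Nm, the mean annotation count
    head = {c for c, n in counts.items() if n > n_m}
    tail = set(counts) - head           # classes with at most Nm annotations
    # Sanity check: Nh + Nt == Ntotal by construction.
    assert sum(counts[c] for c in head) + sum(counts[c] for c in tail) == n_total
    return head, tail
```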
step S72, head and tail determination: for each rectangular frame in the recognition result of the unlabeled data in step S6, determining whether its category belongs to the head or the tail, and counting the numbers of head frames and tail frames respectively; if the number of head frames is greater than the number of tail frames, the sample belongs to the head samples, otherwise the sample belongs to the tail samples;
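The per-sample vote of step S72 is a box count, sketched below (the "cls" key is an assumed detection layout). Note that a tie counts as a tail sample, since the claim only assigns head status when head frames strictly outnumber tail frames:

```python
def classify_sample(detections, head_classes):
    """Vote one image's detections into head or tail (step S72)."""
    n_head = sum(1 for d in detections if d["cls"] in head_classes)
    n_tail = len(detections) - n_head
    return "head" if n_head > n_tail else "tail"
```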
step S73, new sample candidacy: for a sample judged as a head sample, calculating the mean confidence of the head categories in its recognition result, and if this mean confidence is greater than the head confidence threshold Th, adding the sample to the head new-sample candidate queue Qh; for a sample judged as a tail sample, calculating the mean confidence of the tail categories, and if this mean confidence is greater than the tail confidence threshold Tt, adding the sample to the tail new-sample candidate queue Qt;
step S74, new sample selection: sorting the head new-sample candidate queue Qh in descending order of confidence to obtain the sorted head new-sample candidate queue Qh', and selecting the top proportion Ph of samples from Qh' as the new head samples; sorting the tail new-sample candidate queue Qt in descending order of confidence to obtain the sorted tail new-sample candidate queue Qt', and selecting the top proportion Pt of samples from Qt' as the new tail samples; the new head samples and the new tail samples together constituting the current new samples.
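Steps S73 and S74 together can be sketched as one selection routine. The tuple layout (kind, mean confidence, image id) and the default values of Ph and Pt are illustrative assumptions; the claims fix only the thresholds' role and leave the proportions to the implementer. Allowing Pt > Ph is one natural way to feed more tail samples back into training and so counter the class imbalance.

```python
def select_new_samples(samples, t_h=0.9, t_t=0.9, p_h=0.5, p_t=0.5):
    """Queue confident samples (S73), then keep top fractions per queue (S74)."""
    q_h, q_t = [], []
    for kind, conf, img_id in samples:
        if kind == "head" and conf > t_h:          # head threshold Th
            q_h.append((kind, conf, img_id))
        elif kind == "tail" and conf > t_t:        # tail threshold Tt
            q_t.append((kind, conf, img_id))
    q_h.sort(key=lambda s: s[1], reverse=True)     # Qh' sorted descending
    q_t.sort(key=lambda s: s[1], reverse=True)     # Qt' sorted descending
    new_h = q_h[:int(p_h * len(q_h))]              # top Ph of the head queue
    new_t = q_t[:int(p_t * len(q_t))]              # top Pt of the tail queue
    return new_h + new_t
```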
6. The image recognition method for solving the problem of unbalanced crop disease and insect pest samples according to claim 5, characterized in that: the head confidence threshold Th has a value range of 0.9 ≤ Th < 1.
7. The image recognition method for solving the problem of unbalanced crop disease and insect pest samples according to claim 5, characterized in that: the tail confidence threshold Tt has a value range of 0.9 ≤ Tt < 1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111676323.9A CN114677553B (en) | 2021-12-31 | 2021-12-31 | Image recognition method for solving imbalance problem of crop disease and pest samples |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114677553A true CN114677553A (en) | 2022-06-28 |
CN114677553B CN114677553B (en) | 2024-05-14 |
Family
ID=82070802
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111676323.9A Active CN114677553B (en) | 2021-12-31 | 2021-12-31 | Image recognition method for solving imbalance problem of crop disease and pest samples |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114677553B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110188824A (en) * | 2019-05-31 | 2019-08-30 | 重庆大学 | A kind of small sample plant disease recognition methods and system |
CN112668490A (en) * | 2020-12-30 | 2021-04-16 | 浙江托普云农科技股份有限公司 | Yolov 4-based pest detection method, system, device and readable storage medium |
CN112686152A (en) * | 2020-12-30 | 2021-04-20 | 广西慧云信息技术有限公司 | Crop pest and disease identification method with multi-size input and multi-size targets |
CN113298150A (en) * | 2021-05-25 | 2021-08-24 | 东北林业大学 | Small sample plant disease identification method based on transfer learning and self-learning |
WO2021203505A1 (en) * | 2020-04-09 | 2021-10-14 | 丰疆智能软件科技(南京)有限公司 | Method for constructing pest detection model |
CN113657294A (en) * | 2021-08-19 | 2021-11-16 | 中化现代农业有限公司 | Crop disease and insect pest detection method and system based on computer vision |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117523565A (en) * | 2023-11-13 | 2024-02-06 | 拓元(广州)智慧科技有限公司 | Tail class sample labeling method, device, electronic equipment and storage medium |
CN117523565B (en) * | 2023-11-13 | 2024-05-17 | 拓元(广州)智慧科技有限公司 | Tail class sample labeling method, device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN114677553B (en) | 2024-05-14 |
Legal Events

Date | Code | Title | Description
---|---|---|---
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| GR01 | Patent grant | |