CN111369540B - Plant leaf disease identification method based on mask convolutional neural network - Google Patents

Plant leaf disease identification method based on mask convolutional neural network

Info

Publication number
CN111369540B
CN111369540B · CN202010150980.9A
Authority
CN
China
Prior art keywords
mask
disease
target
network
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010150980.9A
Other languages
Chinese (zh)
Other versions
CN111369540A (en)
Inventor
王勇
刘雪月
胥克翔
靳伟昭
杨琦
朱文涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN202010150980.9A priority Critical patent/CN111369540B/en
Publication of CN111369540A publication Critical patent/CN111369540A/en
Application granted granted Critical
Publication of CN111369540B publication Critical patent/CN111369540B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06T5/73
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/136Segmentation; Edge detection involving thresholding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30181Earth observation
    • G06T2207/30188Vegetation; Agriculture
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a plant leaf disease identification method based on a mask convolutional neural network (Mask R-CNN), which mainly addresses the low accuracy of prior-art plant leaf disease identification. The scheme is as follows: enhance and expand the original data set to obtain a training set and a test set; semantically segment the training set and the test set to obtain the corresponding mask sets; add a disease feature screening module between the full convolution layer of the model and the mask branch, and input the training set and mask set into the network for training to obtain target classification and target detection results; take the feature maps classified as diseased leaves as the input of the mask branch, and obtain a trained model after multiple iterations; input the test set into the model to classify and detect the leaves and segment those belonging to a disease category. The invention improves the accuracy of leaf disease identification over traditional methods, and can be used to identify and segment plant leaf diseases in agricultural planting.

Description

Plant leaf disease identification method based on mask convolutional neural network
Technical Field
The invention belongs to the technical field of image processing, and particularly relates to a plant leaf disease identification method which can be used for segmenting and identifying plant leaf diseases in agricultural planting.
Background
In modern intelligent agriculture, crop diseases are a great threat to food security: plant diseases cause serious damage to crops by significantly reducing yield. Early blight is a typical disease that can severely reduce yield; similarly, in humid climates, late blight is another very damaging disease that affects the leaves, stems and fruits of plants. Protecting plants from disease is critical to ensuring the quality and quantity of the crop, and protection should begin with early disease discovery so that the appropriate treatment can be selected at the right time to prevent disease transmission. The disease types of greenhouse plants mainly include bacterial spot, early blight, late blight, leaf mold, black spot and other diseases. In practice, however, it is difficult to determine the disease type accurately, because the diseases are numerous and appear similar on the leaves.
Currently, disease studies on plant leaves mainly involve detection and classification using image processing or deep learning methods. In plant disease control, the PlantVillage team proposed a deep learning method in 2016 for detecting and classifying diseases from plant leaves, classifying specific plant diseases via smartphone mainly against simple backgrounds, and recognizing color, grayscale and segmented pictures under different data-set proportions and different networks. However, that method can only classify diseased plant leaves; it cannot segment the positions of the diseased leaves and the diseases.
In the same year, Mads Dyrmann et al. proposed a convolutional neural network to classify plant species in color images. The network was constructed, trained and tested from scratch. The images came from six different local data sets, acquired under varying illumination, resolution and soil type at different growth stages, but the accuracy was low and plant diseases were neither detected nor classified.
In 2018, Liu Na et al. applied image processing and artificial neural network techniques to detect cucumber leaf diseases and classify the degree of infection, mainly studying cucumber downy mildew, powdery mildew and virus diseases with high incidence and serious harm. However, the number of training samples was small, few diseases were identified, the test accuracy was low, and the model was prone to overfitting.
2017, he Kaiming proposes a Mask convolutional neural network Mask R-CNN architecture, in which a branch is added to two branches of a fast R-CNN, namely classification and coordinate regression, so as to perform semantic segmentation, image features are extracted mainly by using a residual network res net101/50 or a pyramid network FPN as a main network, foreground and background of a target area are obtained by using an area recommendation network RPN, classification and example segmentation results are obtained for the obtained target area and image features by using a full convolutional layer, and then semantic segmentation results are obtained by using convolutional network identification. The network is mainly used for target detection of COCO data sets, and is not used in the field of plant disease identification at present.
In summary, current plant disease research mainly covers classification and identification of a single plant disease category, as with cucumber, where the samples contain few disease categories and few examples, and identification accuracy is low. Existing deep learning methods for classifying diseased plant leaves cannot segment the positions of the diseased leaves and the diseases, and the recognition rate is low.
Disclosure of Invention
The invention aims to overcome the defects of the prior art by providing a plant leaf disease identification method based on the mask convolutional neural network Mask R-CNN, so as to segment the positions of plant leaf diseases and improve the accuracy and efficiency of identification.
In order to achieve the above purpose, the technical scheme of the invention comprises the following steps:
(1) Sequentially enhancing, expanding and semantically segmenting the original data set to obtain training and test image sets and mask sets;
(2) Optimizing the Mask R-CNN network: namely, adding a disease feature screening module between the full convolution layer after the Mask R-CNN ROI Align and the mask branch;
(3) Training the optimized Mask R-CNN network:
(3a) Setting main network parameters:
selecting the backbone network from the two residual networks ResNet50 and ResNet101;
setting the number of epochs and the number of training steps per epoch;
setting the acceptance threshold T_0 for diseased leaves in the disease feature screening module, the other parameters keeping the Mask R-CNN default values;
(3b) Determining the optimized Mask R-CNN network loss function from the known classification error L_cls, detection error L_box and segmentation error L_mask as: L_loss = L_cls + L_box + L_mask;
(3c) Inputting the training image set and the training Mask set into the optimized Mask R-CNN network for training to obtain a trained model;
(4) Inputting the test images into the trained model for testing.
Compared with the prior art, the invention has the following advantages:
First, compared with the GoogleNet and VGG methods, the method of the invention improves the identification accuracy of diseased leaves when identifying plant leaf diseases.
Second, since mask images of the target area are generated through the mask branch of the Mask R-CNN network model, the invention can accurately extract diseased plant leaves and their disease positions.
Third, since a disease feature screening module is added to the Mask R-CNN network structure, the mask branch is trained and tested only on unhealthy leaves, which reduces the burden on the mask branch and improves network identification efficiency while leaving the accuracy of the Mask R-CNN network unchanged.
Fourth, the invention uses image transformations to increase the number of samples in the data set and an adaptive contrast enhancement algorithm to enhance the data set, improving the blurred images it contains.
Drawings
FIG. 1 is a general flow diagram of an implementation of the present invention;
FIG. 2 is a schematic diagram of the overall structure of the optimized Mask R-CNN network according to the present invention;
FIG. 3 is a block diagram of a training sub-process for an optimized Mask R-CNN network in accordance with the present invention;
FIG. 4 is a training image and training binary mask image acquired in the present invention;
FIG. 5 is a test image of healthy and diseased leaves obtained in the present invention;
FIG. 6 is a resulting image of a healthy leaf identified by the simulation of the present invention;
FIG. 7 is a resulting image of a disease blade identified by the simulation of the present invention.
Detailed Description
Specific embodiments and effects of the present invention are described in further detail below with reference to the accompanying drawings:
the application environment of this example is the farming scene, and the purpose is to detect and discern the vegetation that has the disease in the farming, provides this kind of disease information for the planting personnel is more accurate.
Referring to fig. 1, the implementation steps of this example are as follows:
step 1, image enhanced dataset D 1
(1.1) from public item PlaDownloading ntVillage-Dataset to obtain plant disease leaf data set D 0 Pair D using adaptive contrast enhancement algorithm 0 Image enhancement is carried out to improve blurred images in the database, and a database D after image enhancement is obtained 1
(1.1a) Splitting the image x(i,j) into a low-frequency part m_x(i,j) and a high-frequency part h_x(i,j), where the low-frequency part is obtained by mean filtering:

m_x(i,j) = (1/(2n+1)^2) · Σ_{k=i-n}^{i+n} Σ_{l=j-n}^{j+n} x(k,l)

h_x(i,j) = x(i,j) - m_x(i,j)

where (2n+1)^2 represents the size of the window whose center point has coordinates (i,j);
(1.1b) Multiplying the high-frequency part of the image by the gain value G(i,j) to obtain the amplified high-frequency part I(i,j):

I(i,j) = G(i,j) · h_x(i,j)

where G(i,j) can be taken as a constant C greater than 1, giving the amplified high-frequency part I_c(i,j):

I_c(i,j) = C · h_x(i,j)

In this example the gain value G(i,j) is taken to vary inversely with the local mean square error σ_x(i,j):

G(i,j) = D / σ_x(i,j)

where D is a constant and the local mean square error of the image is:

σ_x^2(i,j) = (1/(2n+1)^2) · Σ_{k=i-n}^{i+n} Σ_{l=j-n}^{j+n} (x(k,l) - m_x(k,l))^2

so the amplified high-frequency part I_σ(i,j) is:

I_σ(i,j) = (D / σ_x(i,j)) · h_x(i,j)
(1.1c) Recombining the high-frequency and low-frequency parts to obtain the enhanced image f(i,j):

f(i,j) = m_x(i,j) + I_σ(i,j);
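To make the enhancement step concrete, the following is a minimal single-channel Python/NumPy sketch of the adaptive contrast enhancement above; the window half-width n, the constant D and the gain clipping bound are illustrative assumptions, not values fixed by the invention.

import numpy as np
from scipy.ndimage import uniform_filter

def adaptive_contrast_enhance(x, n=3, D=30.0, max_gain=5.0):
    # Enhance a grayscale image per steps (1.1a)-(1.1c).
    x = x.astype(np.float32)
    win = 2 * n + 1
    # (1.1a) low-frequency part by mean filtering; high-frequency residual
    m = uniform_filter(x, size=win)
    h = x - m
    # local mean square error over the same (2n+1)^2 window
    var = uniform_filter((x - m) ** 2, size=win)
    sigma = np.sqrt(np.maximum(var, 1e-6))
    # (1.1b) gain D/sigma, inversely proportional to the local deviation;
    # clipped to [1, max_gain] so flat regions are not over-amplified
    G = np.clip(D / sigma, 1.0, max_gain)
    # (1.1c) recombine the low- and amplified high-frequency parts
    return np.clip(m + G * h, 0.0, 255.0)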
step 2, acquiring a training image set D 3 And test image set D 4 And training mask set D 5 And test mask set D 6
(2.1) Using the semantic segmentation labeling tool labelme to outline the targets in each image of the enhanced data set D_1 and generate target masks, obtaining a mask set D_2 containing mask information and label information; the resulting training binary mask image is shown in Fig. 4b.
(2.2) Applying translation, rotation and flipping in turn to the enhanced data set D_1 and mask set D_2 via image transformations to increase the amount of sample data, obtaining the expanded data set D_3 and mask set D_4;
(2.3) Splitting the expanded data set D_3 and mask set D_4 in the ratio 8:2 into a training image set D_5 and test image set D_6, and a training mask set D_7 and test mask set D_8; a training set image is shown in Fig. 4a, and test set images are shown in Figs. 5a and 5b;
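A minimal sketch of the expansion and 8:2 split in (2.2)-(2.3), assuming images and masks are kept as paired NumPy arrays; the particular transform set here (90-degree rotations, horizontal flip, a fixed 10-pixel translation) is an illustrative assumption.

import numpy as np

def expand_and_split(images, masks, seed=0):
    # Apply translation, rotation and flipping to each image-mask pair,
    # then split the expanded sample 8:2 into train and test subsets.
    aug_x, aug_y = [], []
    for img, msk in zip(images, masks):
        for k in range(4):  # rotations by 0/90/180/270 degrees
            aug_x.append(np.rot90(img, k)); aug_y.append(np.rot90(msk, k))
        aug_x.append(np.fliplr(img)); aug_y.append(np.fliplr(msk))
        aug_x.append(np.roll(img, 10, axis=0)); aug_y.append(np.roll(msk, 10, axis=0))
    idx = np.random.RandomState(seed).permutation(len(aug_x))
    cut = int(0.8 * len(idx))  # the 8:2 ratio
    pick = lambda lst, ids: [lst[i] for i in ids]
    # returns (D5 train images, D6 test images, D7 train masks, D8 test masks)
    return (pick(aug_x, idx[:cut]), pick(aug_x, idx[cut:]),
            pick(aug_y, idx[:cut]), pick(aug_y, idx[cut:]))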
Step 3, optimizing the Mask R-CNN network structure.
The existing Mask R-CNN network comprises a backbone network, a region proposal network RPN, a full convolution layer and a mask branch, namely a full convolution layer and a fully connected layer; a disease feature screening module is added between the full convolution layer and the mask branch to obtain the optimized Mask R-CNN network structure shown in Fig. 2.
Referring to Fig. 2, the optimized Mask R-CNN network structure in this example is: backbone network → region proposal network RPN → full convolution layer → disease feature screening module → full convolution layer → fully connected layer.
The disease feature screening module compares the diseased-leaf confidence T_1 output by the full convolution layer with the diseased-leaf acceptance threshold T_0 given by the network initialization parameters, screens out the feature maps of diseased leaves in the batch, and inputs the screening result into the mask branch, namely the full convolution layer and fully connected layer, for processing.
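In code, the screening logic of the module reduces to a confidence filter over the batch of ROI feature maps. The sketch below is an assumption-laden illustration: the set of disease class ids, the threshold value and the (N, h, w, c) tensor layout are placeholders; the invention only prescribes comparing T_1 with T_0 and forwarding the selected feature maps to the mask branch.

import numpy as np

DISEASE_CLASS_IDS = {1, 2, 3}  # hypothetical ids of the disease categories

def disease_feature_screening(feature_maps, class_ids, confidences, T0=0.7):
    # feature_maps: (N, h, w, c) ROI feature maps F3 from the full conv layer
    # class_ids:    (N,) predicted class of each ROI
    # confidences:  (N,) classification confidence T1 of each ROI
    # Keeps only ROIs classified as diseased leaves with T1 > T0; the
    # result F4 is what the mask branch trains and tests on.
    keep = [i for i in range(len(class_ids))
            if class_ids[i] in DISEASE_CLASS_IDS and confidences[i] > T0]
    return feature_maps[np.asarray(keep, dtype=int)]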
Step 4, training the optimized Mask R-CNN network to obtain a trained network model:
referring to fig. 3, the specific implementation of this step is as follows:
(4.1) setting main network parameters:
selecting the backbone network from the two residual networks ResNet50 and ResNet101; in this example ResNet101 is selected as the backbone;
setting the number of labels to 11 according to the image categories in the database, comprising 1 background label and 10 image labels;
setting the number of epochs over all samples to 100, the number of iterations per epoch to 100, the learning rate to 0.001 and the weight decay coefficient to 0.0001; setting the number of GPUs to 1, the number of images processed per GPU to 2, and the diseased-leaf acceptance threshold T_0; the other parameters keep the Mask R-CNN default values;
(4.2) Determining the optimized Mask R-CNN network loss function from the known classification error L_cls, detection error L_box and segmentation error L_mask as: L_loss = L_cls + L_box + L_mask;
(4.3) training the optimized Mask R-CNN network:
(4.3a) Initializing the network parameters in (4.1), and inputting the training image set D_5 and training mask set D_7 into the optimized Mask R-CNN network;
(4.3b) Extracting the feature map F_0 of the training images through the residual network;
(4.3c) Inputting the feature map F_0 into the region proposal network RPN to obtain the foreground F_1 and background F_2 of the target region;
(4.3d) Using the ROI Align method to map the target region foreground F_1 onto the feature map F_0 and generate a fixed-size feature map F_3:
First, the feature layer k to which the target region foreground F_1 belongs is calculated:

k = ⌊k_0 + log2(√(w_0·h_0)/224)⌋

where w_0 and h_0 respectively represent the width and height of the target region, and k_0 takes the value 4;
Second, after the feature layer k corresponding to the target region foreground F_1 is found, the step size s corresponding to that feature layer is obtained;
Then, the width w_1 = w_0/s and height h_1 = h_0/s of the target region mapped onto the feature map are calculated, and the target region Z on the feature map is obtained from these two parameters:

Z = w_1 * h_1

Next, the target region Z on the feature map is divided into n^2 parts to obtain the divided target regions Z_i:

Z_i = w_2 * h_2, i = 1, 2, …, n^2

where w_2 and h_2 represent the width and height of Z_i, of sizes w_2 = w_1/n and h_2 = h_1/n;
Finally, each target region Z_i is divided into four parts, the pixel value at the center point of each part is taken, and the maximum of the four pixel values is taken as the pixel value of each target region Z_i, so that the n^2 pixel values obtained in the target region Z form a feature map of size n × n;
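The arithmetic of (4.3d) can be checked with the short sketch below; it follows the max-of-four-quadrant-centers sampling described above (the original ROI Align paper instead averages bilinear samples), uses nearest-neighbor lookup for simplicity, and assumes a single-channel feature map.

import math
import numpy as np

def roi_feature_level(w0, h0, k0=4):
    # k = floor(k0 + log2(sqrt(w0*h0)/224)): feature layer of the ROI
    return int(math.floor(k0 + math.log2(math.sqrt(w0 * h0) / 224.0)))

def roi_align_max(feature, x0, y0, w0, h0, s, n=7):
    # Pool an ROI (top-left (x0, y0), size w0 x h0 in image coordinates,
    # feature stride s) into an n x n map: each sub-region Z_i contributes
    # the maximum of the pixel values sampled at its four quadrant centers.
    w2, h2 = (w0 / s) / n, (h0 / s) / n   # size of each sub-region Z_i
    H, W = feature.shape
    out = np.zeros((n, n), dtype=np.float32)
    for i in range(n):
        for j in range(n):
            vals = []
            for fy in (0.25, 0.75):       # quadrant centers of Z_i
                for fx in (0.25, 0.75):
                    py = min(int(y0 / s + (i + fy) * h2), H - 1)
                    px = min(int(x0 / s + (j + fx) * w2), W - 1)
                    vals.append(feature[py, px])
            out[i, j] = max(vals)
    return out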
(4.3e) Passing the feature map F_3 through the full convolution layer to obtain the target classification result and the target detection result, and calculating the classification error L_cls and detection error L_box:
From the probability p_u corresponding to the target classification result u, the classification error is obtained as: L_cls = -log p_u;
Let t^u = {t_x, t_y, t_w, t_h} be the 4 parameterized coordinates of the target detection result and v = {v_x, v_y, v_w, v_h} be the target translation and scaling parameters; the detection error L_box is calculated by the following formula:

L_box = Σ_{i∈{x,y,w,h}} smooth_L1(t_i^u - v_i)

where smooth_L1 is the smoothed L1 norm loss function:

smooth_L1(x) = 0.5·x^2 if |x| < 1, and |x| - 0.5 otherwise;
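The two errors of (4.3e) transcribe directly into NumPy; this is a sketch of the standard Fast R-CNN-style losses, with the four box coordinates passed as plain tuples.

import numpy as np

def classification_error(p_u):
    # L_cls = -log p_u for the probability of the true class u
    return float(-np.log(p_u))

def smooth_l1(x):
    # smooth_L1(x) = 0.5*x^2 if |x| < 1, else |x| - 0.5 (elementwise)
    a = np.abs(x)
    return np.where(a < 1.0, 0.5 * a ** 2, a - 0.5)

def detection_error(t_u, v):
    # L_box = sum over i in {x, y, w, h} of smooth_L1(t_i^u - v_i)
    return float(np.sum(smooth_l1(np.asarray(t_u) - np.asarray(v))))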
(4.3f) Judging whether the classification result belongs to a diseased leaf:
if the target classification does not belong to a diseased leaf, continuing to judge the target classification of the next leaf;
if the classification result belongs to a diseased leaf, comparing its confidence T_1 with the diseased-leaf acceptance threshold T_0:
when T_1 > T_0, selecting from the feature map F_3 the feature map F_4 whose confidence is T_1;
when T_1 <= T_0, continuing to judge the target classification of the next leaf;
(4.3g) Inputting the feature map F_4 selected in (4.3f) and the training mask set D_7 into the mask branch for training to obtain the binary mask of the target region, namely the segmentation result of the diseased leaf and its disease position:
First, the feature map F_3 is enlarged by deconvolution to obtain the binary mask regions M_k of all classes;
Then, the binary mask regions M_k of all classes are traversed and the Sigmoid activation function

y = 1/(1 + e^(-x))

is applied to the binary mask region M_i of each class; after this classification operation, the maximum probability y in the classification probability vector H is taken to obtain the binary mask of the target region corresponding to the target classification;
(4.3h) Calculating the segmentation error L_mask of the binary mask of the target region obtained in (4.3g):

L_mask = -[ŷ·log y + (1-ŷ)·log(1-y)]

where y is the predicted probability of the binary mask of the target region and ŷ is the true label of the binary mask of the target region;
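Steps (4.3g)-(4.3h) amount to a per-pixel Sigmoid on the selected class's mask region followed by binary cross-entropy against the ground-truth mask; the sketch below averages the loss over the mask pixels, which is an assumption about how the per-pixel terms are aggregated.

import numpy as np

def sigmoid(x):
    # y = 1 / (1 + e^(-x))
    return 1.0 / (1.0 + np.exp(-x))

def mask_error(mask_logits, true_mask):
    # mask_logits: (m, m) logits of the target class's binary mask region
    # true_mask:   (m, m) ground-truth binary mask (values in {0, 1})
    y = np.clip(sigmoid(mask_logits), 1e-7, 1.0 - 1e-7)  # numerical stability
    y_hat = true_mask.astype(np.float32)
    # L_mask = -[y_hat*log(y) + (1 - y_hat)*log(1 - y)], averaged per pixel
    return float(-np.mean(y_hat * np.log(y) + (1.0 - y_hat) * np.log(1.0 - y)))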
(4.3i) Calculating the network loss value L_loss = L_cls + L_box + L_mask and back-propagating the loss value after each iteration to update the network weights;
(4.3j) Judging whether the number of iterations over all samples is greater than the set 100:
if the number of iterations over all samples is greater than 100, stopping the network training to obtain the trained network model;
if the number of iterations over all samples is less than or equal to 100, repeating (4.3c) to (4.3j) until it is greater than 100.
Step 5, obtaining the recognition result for plant leaf diseases.
(5.1) Inputting the test image set D_6 obtained in (2.3) (its healthy leaves are shown in Fig. 5a and its diseased leaves in Fig. 5b) into the trained network model and extracting feature vectors; calculating the cosine similarity cos_i(θ) between the feature vector A_i of each real target classification among the n real leaf categories and the feature vector B of the predicted target classification to obtain the classification probability vector P:

P = {cos_1(θ), cos_2(θ), …, cos_i(θ), …, cos_n(θ)}, i = 1, 2, …, n

where the cosine similarity cos_i(θ) is calculated as:

cos_i(θ) = (A_i · B) / (||A_i|| · ||B||)

where ||A_i|| represents the 2-norm of the feature vector of each real target classification, and ||B|| represents the 2-norm of the feature vector of the predicted target classification;
(5.2) Selecting the maximum probability value q in the probability vector P and taking the category corresponding to q as the target classification result;
(5.3) Judging whether the category corresponding to q belongs to a diseased leaf:
if the category corresponding to q is a healthy leaf, directly outputting the target detection result and the target classification result;
if the category corresponding to q belongs to a diseased leaf, comparing q with the diseased-leaf acceptance threshold T_0:
if q > T_0, outputting the target detection, target classification and disease-position segmentation results;
if q <= T_0, directly outputting the target detection result and the target classification result.
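Step 5 condenses into cosine-similarity scoring plus the threshold decision; in the sketch below, the feature extractor, the class feature vectors and the set of disease category names are assumed to be available from the trained model.

import numpy as np

def cosine_similarity(a, b):
    # cos(theta) = (a . b) / (||a|| * ||b||)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def classify_and_decide(B, class_vectors, disease_classes, T0=0.7):
    # B:               feature vector of the predicted target
    # class_vectors:   {category: feature vector A_i} for the n real categories
    # disease_classes: set of categories counted as diseased leaves
    P = {c: cosine_similarity(A, B) for c, A in class_vectors.items()}
    category, q = max(P.items(), key=lambda kv: kv[1])  # max probability q
    # output the disease-position segmentation only for confident disease hits
    segment = category in disease_classes and q > T0
    return category, q, segment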
The effect of the invention can be further illustrated by the following simulation experiments:
experimental conditions
The software platform for the experimental training is as follows: google colab; the hardware platform is as follows: tesla P4 GPU; development environments are keras and Tensorflow;
the software platform for the experimental test is as follows: windows10; the hardware platform is as follows: a CPU;
the test image is selected from the blade image with the size (256,256,3) shown in fig. 5, and the test set is selected from the test set D obtained in the step 2 3
Second, experimental details
Experiment 1: comparing single test images under the Mask R-CNN network model and the method of the invention.
Fig. 5 shows the test images, where Fig. 5a is a healthy leaf and Fig. 5b is a diseased leaf. The detection results of the Mask R-CNN network model are shown in Fig. 6, where Fig. 6a is the result for the healthy leaf and Fig. 6b for the diseased leaf. The detection results of the method of the invention are shown in Fig. 7, where Fig. 7a is the result for the healthy leaf and Fig. 7b for the diseased leaf. Comparing the two methods, when detecting healthy leaves the method of the invention eliminates, relative to the Mask R-CNN network, the redundant operation of segmenting leaf disease areas.
Experiment 2: comparative experiments between the method of the invention and the GoogleNet and VGG networks.
The method of the invention and the existing GoogleNet and VGG network models were each tested 1000 times on the test image set D_6 obtained in step 2; the recognition accuracy results are shown in Table 1.
table 1 recognition accuracy results
Method Average accuracy rate Time/s
VGG 0.8846 3.17
GoogleNet 0.9040 3.32
Optimized Mask R-CNN 0.9257 3.59
As can be seen from Table 1, the method of the present invention has improved recognition accuracy compared with the conventional GoogleNet and VGG network models.
The above description is only a specific example of the invention and does not constitute any limitation of the invention. It will be apparent to those skilled in the art that modifications and variations in form and detail may be made without departing from the principles and structure of the invention; such modifications and variations based on the inventive concept remain within the scope of the appended claims.

Claims (9)

1. A plant leaf disease identification method based on a mask convolutional neural network, characterized by comprising the following steps:
(1) sequentially enhancing, expanding and semantically segmenting the original data set to obtain training and test image sets and mask sets;
(2) optimizing the Mask R-CNN network: namely, adding a disease feature screening module between the full convolution layer after the Mask R-CNN ROI Align and the mask branch;
(3) Training the optimized Mask R-CNN network:
(3a) setting the main network parameters:
selecting the backbone network from the two residual networks ResNet50 and ResNet101;
setting the number of epochs and the number of training steps per epoch;
setting the acceptance threshold T_0 for diseased leaves in the disease feature screening module, the other parameters keeping the Mask R-CNN default values;
(3b) determining the optimized Mask R-CNN network loss function from the known classification error L_cls, detection error L_box and segmentation error L_mask as: L_loss = L_cls + L_box + L_mask;
(3c) inputting the training image set and training mask set into the optimized Mask R-CNN network for training to obtain a trained model, implemented as follows:
(3c1) initializing the network parameters in (3a), and inputting the training image set D_5 and training mask set D_7 into the optimized Mask R-CNN network;
(3c2) extracting the features of D_5 through the residual network to obtain the feature map F_0;
(3c3) inputting the feature map F_0 into the region proposal network RPN to obtain the foreground F_1 and background F_2 of the target region;
(3c4) mapping the target region foreground F_1 onto the feature map F_0 using the ROI Align method to generate a fixed-size feature map F_3;
(3c5) passing the feature map F_3 through the full convolution layer to obtain the target classification result and the target detection result, and calculating the classification error L_cls and detection error L_box;
(3c6) if the classification result is a diseased leaf, comparing its confidence T_1 with the diseased-leaf acceptance threshold T_0; when T_1 > T_0, the disease feature screening module selects from the feature map F_3 the feature map F_4 whose confidence is T_1;
(3c7) inputting the feature map F_4 selected in (3c6) and the training mask set D_7 into the mask branch for training to obtain the binary mask of the target region, namely the segmentation result of the diseased leaf and its disease position, and calculating the segmentation error L_mask;
(3c8) calculating the network loss value L_loss = L_cls + L_box + L_mask, back-propagating the loss value after each iteration to update the network weights, and stopping the network training when the number of trained epochs exceeds the initialized number of epochs, obtaining the trained network model;
(4) inputting the test images into the trained model for testing.
2. The method of claim 1, wherein in (1) the original data set is sequentially enhanced, expanded and semantically segmented to obtain training and test image sets and mask sets, implemented as follows:
(1a) downloading the plant disease leaf data set D_0 from the public PlantVillage-Dataset project, and applying an adaptive contrast enhancement algorithm to D_0 to improve the blurred images in the database, obtaining the image-enhanced data set D_1;
(1b) using a semantic segmentation labeling tool to outline the targets in each image of the enhanced data set D_1 and generate target masks, obtaining a mask set D_2 containing mask information and label information;
(1c) applying translation, rotation and flipping in turn to the enhanced data set D_1 and mask set D_2 via image transformations to increase the amount of sample data, obtaining the expanded data set D_3 and mask set D_4;
(1d) splitting the expanded data set D_3 and mask set D_4 in the ratio 8:2 into a training image set D_5 and test image set D_6, and a training mask set D_7 and test mask set D_8.
3. The method of claim 2, wherein the image enhancement in (1a) is performed using an adaptive contrast enhancement algorithm, implemented as follows:
(1a1) dividing the image into a high-frequency part h_x(i,j) and a low-frequency part m_x(i,j), where (i,j) refers to a pixel of the image;
(1a2) multiplying the high-frequency part of the image by the gain value G(i,j) to obtain the amplified high-frequency part I(i,j):

I(i,j) = G(i,j)·h_x(i,j)

(1a3) recombining the high-frequency part and the low-frequency part to obtain the enhanced image f(i,j):

f(i,j) = m_x(i,j) + I(i,j).
4. The method of claim 1, wherein the optimized Mask R-CNN network obtained in (2) has the following structure:
residual network ResNet101/50 or pyramid network FPN → region proposal network RPN → full convolution layer → disease feature screening module → mask branch, namely a full convolution layer and a fully connected layer;
the disease feature screening module compares the diseased-leaf confidence T_1 output by the full convolution layer with the given diseased-leaf acceptance threshold T_0, screens out the feature maps of diseased leaves in the batch, and inputs the screening result into the mask branch for processing.
5. The method according to claim 1, wherein (3c4) maps the target region foreground F_1 onto the feature map F_0 using the ROI Align method, implemented as follows:
(3c4a) calculating the feature layer to which the target region belongs:

k = ⌊k_0 + log2(√(w_0·h_0)/224)⌋

where w_0 and h_0 respectively represent the width and height of the target region, and k_0 takes the value 4;
(3c4b) after finding the feature layer k corresponding to the target region, obtaining the step size s corresponding to that feature layer;
(3c4c) calculating the width w_1 and height h_1 of the target region mapped onto the feature map, obtaining the target region Z on the feature map:

Z = w_1 * h_1

where w_1 = w_0/s and h_1 = h_0/s;
(3c4d) dividing the target region Z on the feature map into n^2 parts to obtain the divided target regions Z_i:

Z_i = w_2 * h_2, i = 1, 2, …, n^2

where w_2 and h_2 represent the width and height of Z_i, of sizes w_2 = w_1/n and h_2 = h_1/n;
(3c4e) dividing each target region Z_i into four parts, taking the pixel value at the center point of each part, and taking the maximum of the four pixel values as the pixel value of each target region Z_i, finally obtaining in the target region Z the n^2 pixel values that form a feature map of size n × n.
6. The method of claim 1, wherein (3c5) calculates the classification error L_cls and detection error L_box, implemented as follows:
(3c5a) from the probability p_u corresponding to the target classification result u, obtaining the classification error L_cls:

L_cls = -log p_u

(3c5b) letting t^u = {t_x, t_y, t_w, t_h} be the 4 parameterized coordinates of the target detection result and v = {v_x, v_y, v_w, v_h} be the target translation and scaling parameters, calculating the detection error L_box by the following formula:

L_box = Σ_{i∈{x,y,w,h}} smooth_L1(t_i^u - v_i)

where smooth_L1 is the smoothed L1 norm loss function:

smooth_L1(x) = 0.5·x^2 if |x| < 1, and |x| - 0.5 otherwise.
7. The method of claim 1, wherein (3c7) obtains the binary mask through the mask branch, implemented as follows:
(3c7a) enlarging the feature map F_3 by deconvolution to obtain the binary mask regions M_k of all classes;
(3c7b) traversing the binary mask regions M_k of all classes and applying the Sigmoid activation function

y = 1/(1 + e^(-x))

to the binary mask region M_i of each class; after this classification operation, taking the maximum probability y in the classification probability vector H to obtain the binary mask of the target region corresponding to the target classification;
(3c7c) calculating the segmentation error L_mask of the mask branch:

L_mask = -[ŷ·log y + (1-ŷ)·log(1-y)]

where ŷ is the true label of the binary mask of the target region.
8. The method of claim 1, wherein (4) inputs the test images into the trained model for testing, accomplished as follows:
(4a) inputting the test image set D_6 obtained in (1d) into the trained network model, extracting feature vectors, obtaining the classification probability vector P by calculating feature vector similarity, selecting the maximum probability value q in P, and taking the category corresponding to q as the target classification result;
(4b) judging whether the category corresponding to q belongs to a diseased leaf:
if the category corresponding to q is a healthy leaf, directly outputting the target detection result and the target classification result;
if the category corresponding to q belongs to a diseased leaf, comparing q with the diseased-leaf acceptance threshold T_0:
if q > T_0, outputting the target detection, target classification and disease-position segmentation results;
if q <= T_0, directly outputting the target detection result and the target classification result.
9. The method of claim 8, wherein (4a) obtains the classification probability vector by calculating the cosine similarity cos(θ) between the feature vector A of each real target classification and the feature vector B of the predicted target classification, as follows:

cos(θ) = (A · B) / (||A|| · ||B||)

where ||A|| represents the 2-norm of the feature vector of the real target classification, and ||B|| represents the 2-norm of the feature vector of the predicted target classification.
CN202010150980.9A 2020-03-06 2020-03-06 Plant leaf disease identification method based on mask convolutional neural network Active CN111369540B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010150980.9A CN111369540B (en) 2020-03-06 2020-03-06 Plant leaf disease identification method based on mask convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010150980.9A CN111369540B (en) 2020-03-06 2020-03-06 Plant leaf disease identification method based on mask convolutional neural network

Publications (2)

Publication Number Publication Date
CN111369540A CN111369540A (en) 2020-07-03
CN111369540B true CN111369540B (en) 2023-06-02

Family

Family ID: 71210311

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010150980.9A Active CN111369540B (en) 2020-03-06 2020-03-06 Plant leaf disease identification method based on mask convolutional neural network

Country Status (1)

Country Link
CN (1) CN111369540B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112115888B (en) * 2020-09-22 2022-06-03 四川大学 Plant disease diagnosis system based on disease spot correlation
CN112598031A (en) * 2020-12-08 2021-04-02 北京农业信息技术研究中心 Vegetable disease detection method and system
CN112634147B (en) * 2020-12-09 2024-03-29 上海健康医学院 PET image noise reduction method, system, device and medium for self-supervision learning
CN112560644B (en) * 2020-12-11 2021-09-28 四川大学 Crop disease and insect pest automatic identification method suitable for field
CN112699941B (en) * 2020-12-31 2023-02-14 浙江科技学院 Plant disease severity image classification method, device, equipment and storage medium
CN112884022B (en) * 2021-01-29 2021-11-12 浙江师范大学 Unsupervised depth characterization learning method and system based on image translation
CN113052799A (en) * 2021-03-09 2021-06-29 重庆大学 Osteosarcoma and osteochondroma prediction method based on Mask RCNN network
CN113112498B (en) * 2021-05-06 2024-01-19 东北农业大学 Grape leaf spot identification method based on fine-grained countermeasure generation network
CN113239788A (en) * 2021-05-11 2021-08-10 嘉兴学院 Mask R-CNN-based wireless communication modulation mode identification method
CN113191334B (en) * 2021-05-31 2022-07-01 广西师范大学 Plant canopy dense leaf counting method based on improved CenterNet
CN113762190B (en) * 2021-09-15 2024-03-29 中科微至科技股份有限公司 Method and device for detecting package stacking based on neural network
CN114239756B (en) * 2022-02-25 2022-05-17 科大天工智能装备技术(天津)有限公司 Insect pest detection method and system
CN114742204A (en) * 2022-04-08 2022-07-12 黑龙江惠达科技发展有限公司 Method and device for detecting straw coverage rate
CN114943988B (en) * 2022-06-16 2024-04-02 浙大城市学院 Planar target monitoring method based on instance segmentation and deep convolution neural network
CN117011718B (en) * 2023-10-08 2024-02-02 之江实验室 Plant leaf fine granularity identification method and system based on multiple loss fusion
CN117372790B (en) * 2023-12-08 2024-03-08 浙江托普云农科技股份有限公司 Plant leaf shape classification method, system and device


Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10282589B2 (en) * 2017-08-29 2019-05-07 Konica Minolta Laboratory U.S.A., Inc. Method and system for detection and classification of cells using convolutional neural networks

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110163207A (en) * 2019-05-20 2019-08-23 福建船政交通职业学院 One kind is based on Mask-RCNN ship target localization method and storage equipment
CN110717903A (en) * 2019-09-30 2020-01-21 天津大学 Method for detecting crop diseases by using computer vision technology

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Tomato leaf disease image classification based on transfer learning; Wang Yanling et al.; Journal of China Agricultural University, No. 06; full text *

Also Published As

Publication number Publication date
CN111369540A (en) 2020-07-03


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant