CN116258711A

CN116258711A - Rice leaf roller harmful image detection method based on inclined rectangular frame

Info

Publication number: CN116258711A
Application number: CN202310274762.XA
Authority: CN
Inventors: 王儒敬; 陈天娇; 陈红波; 杜健铭; 张洁; 李�瑞; 胡海瀛
Original assignee: Hefei Institutes of Physical Science of CAS
Current assignee: Hefei Institutes of Physical Science of CAS
Priority date: 2023-03-16
Filing date: 2023-03-16
Publication date: 2023-06-13

Abstract

The invention relates to a method for detecting harmful images of rice leaf rollers based on an inclined rectangular frame, which solves the problems that the harmful images of the rice leaf rollers in a complex field environment have the phenomena of non-directional growth, various sizes and the like and are difficult to accurately detect compared with the prior art. The invention comprises the following steps: obtaining and preprocessing a harmful image of rice leaf rollers; constructing a rice leaf roller pest detection model; training a rice leaf roller pest detection model; obtaining an image of rice leaf roller pests to be detected; and obtaining the image detection result of the rice leaf roller pest. The invention can realize accurate detection of the rice pest-shaped area in a complex field environment, improve the accuracy of detecting the rice leaf roller pest, and strengthen the visualization capability of detecting the rice leaf roller pest.

Description

Rice leaf roller harmful image detection method based on inclined rectangular frame

Technical Field

The invention relates to the technical field of insect pest image recognition, in particular to a method for detecting a pest-like image of cnaphalocrocis medinalis based on an inclined rectangular frame.

Background

As an insect harmful to agriculture, cnaphalocrocis medinalis is mainly harmful to rice. It can reduce tillering of rice, delay growth period, prevent normal growth of rice and finally reduce yield. Since the pests are hidden in curled leaves, it is difficult to count directly. Therefore, in field investigation, plant protection personnel can only visually record the number of damaged curled blades. However, this method is inefficient and labor intensive. Under the condition of lacking basic plant protection personnel and technical strength, the image detection and identification theory based on the deep learning technology supports automatic monitoring and intelligent investigation of plant diseases and insect pests.

However, the current general target detection algorithm based on deep learning uses a rectangular frame without angles for positioning, and this method may contain a lot of redundant information with low value or easy confusion, so this method cannot meet the detection of harmful situations in practical application requirements.

In particular, the field background is complex, the growth direction of rice plants is random, so that the pest damage symptoms often have the phenomena of inclination, crossing and the like. Therefore, how to detect the pest of cnaphalocrocis medinalis by adopting the inclined frame labeling method and the deep learning technology based on the inclined frame representation has become an urgent technical problem to be solved.

Disclosure of Invention

The invention aims to solve the defect that in the prior art, rice leaf rollers in a complex field environment are harmful, have the phenomena of non-directional growth, various sizes and the like, and are difficult to accurately detect, and provides an inclined rectangular frame-based rice leaf roller harmful image detection method for solving the problems.

In order to achieve the above object, the technical scheme of the present invention is as follows:

a rice leaf roller pest-like image detection method based on an inclined rectangular frame comprises the following steps:

obtaining and preprocessing harmful images of rice leaf rollers: acquiring a rice leaf roller image, setting the image as a harmful image, marking a data set by using inclined rectangular frame marking software ropylelmg, and establishing a harmful training sample set;

constructing a rice leaf roller pest detection model: constructing a rice leaf roller pest detection model based on the inclined rectangular frame;

training of a rice leaf roller pest detection model: training a rice leaf roller pest detection model based on the pest training sample set;

obtaining an image of rice leaf roller pests to be detected: acquiring an image of rice leaf rollers to be detected and preprocessing the image;

obtaining the image detection result of rice leaf roller pests: inputting the image of the rice leaf rollers to be detected into a trained pest-like detection model of the rice leaf rollers to obtain a pest-like image detection result of the rice leaf rollers.

The construction of the rice leaf roller pest detection model comprises the following steps:

the first part of the rice leaf roller pest-shape detection model is a feature extraction module using ResNet-50, the second part is a feature pyramid module using FPN, the third part is a pending inclined rectangular frame acquisition module, and the fourth part is an inclined rectangular detection frame acquisition module;

setting a pending inclined rectangular frame acquisition module:

setting up non-angle rectangular frames at all spatial positions of m layers in the feature pyramid module, wherein the length-width ratio is (0.5, 1, 2), and the areas of non-angle initial rectangular frames of m different layers are (32) ² ,64 ² ,128 ² ,256 ² ,512 ² )；

Acquiring undetermined inclined rectangular frames by using non-angle rectangular frames, selecting undetermined inclined rectangular frames with highest classification probability for each layer, performing maximum value inhibition for nms, and selecting t frames (x _p ,y _p ,w _p ,h _p θ) fine tuning by correcting the feature map;

setting an inclined rectangular detection frame acquisition module:

the inclined rectangular detection frame acquisition module comprises an inclined characteristic dimension reduction module and a detection module, wherein the local characteristic dimension reduction corresponding to t undetermined inclined rectangular frames is changed into k x 256, and then the final detection is executed by using 2 full connections;

the t pending-tilting rectangular boxes acquired in the first stage are denoted (x) _p ,y _p ,w _p ,h _p θ), corresponding to local features (x) on the feature map acquired by the feature pyramid network _f ,y _f ,w _f ,h _f θ), the local features of k x 256 are obtained through the inclined feature dimension reduction module to execute subsequent full connection detection,

s is the dimension reduction ratio from the original image to the feature image, and for the (m, n) th feature point in the C (0.ltoreq.c < C) th dimension, the value is:

wherein 0 is less than or equal to m, n is less than k, l represents the sampling number of single square lattice in k square lattice corresponding to undetermined inclined rectangular frame on the characteristic diagram, F _c (R _θ (x, y)) means the sampling position (x, y) in a single square by the tilting operationValues in c dimensions.

The training of the rice leaf roller pest detection model comprises the following steps:

inputting the pest-shaped training sample set into a pest-shaped detection model of cnaphalocrocis medinalis;

obtaining feature images of m different layers through a feature extraction module of a rice leaf roller harmful detection model;

inputting the feature graphs of m different layers into a feature pyramid module to obtain m feature graphs fused by different scales;

based on the undetermined inclined rectangular frame acquisition network of the self-adaptive selection training sample mechanism, finding the most suitable ratio threshold value of intersection and union in the initially set non-angle rectangular frame aiming at each harm,

for each harm-like gt, m feature maps fused on different scales respectively acquire N non-angle initial rectangular frames nearest to the harm-like center, m are counted for N, then a ratio set of inclined intersections and union of the m x N non-angle initial rectangular frames and the harm-like gt is calculated, finally a ratio threshold value mean (io us) +std (io us) of the intersection and the union corresponding to the harm-like gt is acquired, a positive sample setting threshold value of each harm-like is obtained in a self-adaption mode according to the statistical characteristics of the data sets, positive samples are guaranteed to exist in each harm-like state during training of the model, negative samples are selected randomly from other non-angle initial rectangular frames, N=256 positive samples are selected by one image by default, and the positive sample ratio is set as 1 by default: 1, a step of;

n training samples are obtained for one image, 256 are defaulted, wherein the number of positive samples is N ₁ ，N ₁ The positive and negative samples together complete the classification loss L related to the number and specific positions of harmful shapes in the image _cls Is used to obtain regression loss L using only positive samples _reg As shown in the following formula:

p _dt acquisition mesh for pending tilting rectangular frameBranch outcome of the classification of the collaterals, p _gt Classification labels for samples, p if the sample is a positive sample _gt 1, otherwise 0; l (L) _cls Using cross entropy loss, L _reg Using

Losses, including regression losses of center coordinates, long and short sides, and angles,

wherein the method comprises the steps of

Obtaining a regression branch result of the network for the pending inclined rectangular box, representing a gap between the pending inclined rectangular box and the initial rectangular box without angle, i _gt ,i∈{x,y,w,h, θ } is the center coordinate, long and short sides and angle of the marked inclined rectangular frame, ++>

Representing the difference between the marked inclined rectangular frame and the initial rectangular frame without angle by minimizing +.>

And->

The difference between the two is utilized to train a network by using a back propagation algorithm, and the center coordinate (x) of the undetermined inclined rectangular frame is obtained by calculating the network regression branch result and the initial rectangular frame without angles in the test process _dt ,y _dt ) Long and short edges (w) _dt ,h _dt ) And tilt angle, angle normalization operation ++>

The final result is a pending-tilting rectangular box, denoted (x) _dt ,y _dt ,w _dt ,h _dt ,θ _dt )；

Obtaining the regression branch result of the network and the initial rectangular frame without angles through the undetermined inclined rectangular frame, and performing the calculation to obtain the center coordinates, long and short sides and the inclined angle (x) _dt ,y _dt ,w _dt ,h _dt ,θ _dt ) All undetermined inclined rectangular frames obtain local characteristics k x C through inclined characteristic dimension reduction operation, and then 2 full-connection operations are carried out;

the 1024-dimensional feature vector of the final execution detection of the undetermined inclined rectangular frame is obtained, the inclined rectangular detection frame is obtained through the feature vector through 2 full-connection operations respectively, the feature vector comprises classification scores of categories and position information (x, y, w, h, theta) containing inclination angles, and a loss function of an inclined rectangular detection frame obtaining module is identical with that of an undetermined inclined rectangular frame obtaining network.

Advantageous effects

Compared with the prior art, the method for detecting the rice leaf rollers in the harmful state based on the inclined rectangular frame can realize accurate detection of the rice harmful state areas in the complex field environment, improves the accuracy of detecting the rice leaf rollers in the harmful state, and enhances the visualization capability of detecting the rice leaf rollers in the harmful state.

Drawings

FIG. 1 is a process sequence diagram of the present invention;

FIG. 2a is a graph showing the detection effect of the Faster-rcnn detection algorithm of the rectangular frame without angle in the prior art;

FIG. 2b is a graph showing the detection effect by the method of the present invention.

Detailed Description

For a further understanding and appreciation of the structural features and advantages achieved by the present invention, the following description is provided in connection with the accompanying drawings, which are presently preferred embodiments and are incorporated in the accompanying drawings, in which:

as shown in FIG. 1, the method for detecting the harmful image of the cnaphalocrocis medinalis based on the inclined rectangular frame comprises the following steps of:

firstly, obtaining and preprocessing harmful images of rice leaf rollers. And acquiring a rice leaf roller image, setting the image as a harmful image, marking the data set by using inclined rectangular frame marking software ropylelmg, and establishing a harmful training sample set.

Secondly, constructing a rice leaf roller pest detection model: and constructing a rice leaf roller pest detection model based on the inclined rectangular frame. The built model uses an inclined rectangular frame, the obtained detection frame is more concentrated in the area where pests are harmful, the detection frame can be tightly surrounded into a harmful shape under the complex field condition, and meanwhile, the detection result is conveniently checked and corrected.

(1) The method comprises the steps of setting a first part of a rice leaf roller pest-shaped detection model to be a feature extraction module using ResNet-50, a second part of the rice leaf roller pest-shaped detection model to be a feature pyramid module using FPN, a third part of the rice leaf roller pest-shaped detection model to be a pending inclined rectangular frame acquisition module, and a fourth part of the rice leaf roller pest-shaped detection model to be an inclined rectangular detection frame acquisition module. The feature extraction module and the feature pyramid module are traditional modules.

(2) Setting a pending inclined rectangular frame acquisition module:

Acquiring undetermined inclined rectangular frames by using non-angle rectangular frames, selecting undetermined inclined rectangular frames with highest classification probability for each layer, performing maximum value inhibition for nms, and selecting t frames (x _p ,y _p ,w _p ,h _p θ) is fine-tuned by correcting the feature map.

(3) Setting an inclined rectangular detection frame acquisition module:

/>

wherein 0 is less than or equal to m, n is less than k, l represents the sampling number of single square lattice in k square lattice corresponding to undetermined inclined rectangular frame on the characteristic diagram, F _c (R _θ (x, y)) represents the value in the c-th dimension of the sampling position (x, y) in a single square after the tilting operation.

Thirdly, training a rice leaf roller pest detection model: and training the rice leaf roller pest detection model based on the pest training sample set. Training the constructed model by using an inclined rectangular frame enables the acquired characteristic map to be concentrated in the pest damage area, and more effective characteristics are extracted for detection.

(1) And (5) inputting the pest-shaped training sample set into a pest-shaped detection model of the cnaphalocrocis medinalis.

(2) And obtaining feature images of m different layers through a feature extraction module of the rice leaf roller harmful detection model.

(3) And inputting the feature graphs of m different layers into a feature pyramid module to obtain m feature graphs fused by different scales.

(4) Based on the undetermined inclined rectangular frame acquisition network of the self-adaptive selection training sample mechanism, finding the most suitable ratio threshold value of intersection and union in the initially set non-angle rectangular frame aiming at each harm,

n training samples are obtained for one image, 256 are defaulted, wherein the training samples are positiveThe number of samples is N ₁ ，N ₁ The positive and negative samples together complete the classification loss L related to the number and specific positions of harmful shapes in the image _cls Is used to obtain regression loss L using only positive samples _reg As shown in the following formula:

p _dt obtaining the classified branch result of the network for the pending inclined rectangle box, p _gt Classification labels for samples, p if the sample is a positive sample _gt 1, otherwise 0; l (L) _cls Using cross entropy loss, L _reg Using

/>

wherein the method comprises the steps of

Obtaining a regression branch result of the network for the pending inclined rectangular box, representing a gap between the pending inclined rectangular box and the initial rectangular box without angle, i _gt I epsilon { x, y, w, h, θ } is the center coordinate, long and short sides and angle of the marked inclined rectangular frame, +.>

And->

The final result is a pending-tilting rectangular box, denoted (x) _dt ,y _dt ,w _dt ,h _dt ,θ _dt )。

(5) Obtaining the regression branch result of the network and the initial rectangular frame without angles through the undetermined inclined rectangular frame, and performing the calculation to obtain the center coordinates, long and short sides and the inclined angle (x) _dt ,y _dt ,w _dt ,h _dt ,θ _dt ) All undetermined inclined rectangular frames obtain local characteristics k x C through inclined characteristic dimension reduction operation, and then pass through 2Full connection operation;

Fourth, obtaining an image of rice leaf roller pests to be detected: and obtaining an image of the cnaphalocrocis medinalis to be detected and preprocessing the image.

Fifthly, obtaining an image detection result of rice leaf roller pests: inputting the image of the rice leaf rollers to be detected into a trained pest-like detection model of the rice leaf rollers to obtain a pest-like image detection result of the rice leaf rollers.

As shown in fig. 2a and fig. 2b, it can be seen that compared with a general rectangular frame without angles, the inclined rectangular frame of the invention can more accurately position pest damage under natural conditions, especially in places where the damage is dense or crossed, and is convenient for naked eyes to check the detection result of pictures.

The foregoing has shown and described the basic principles, principal features and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, and that the above embodiments and descriptions are merely illustrative of the principles of the present invention, and various changes and modifications may be made therein without departing from the spirit and scope of the invention, which is defined by the appended claims. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims

1. The method for detecting the harmful image of the cnaphalocrocis medinalis based on the inclined rectangular frame is characterized by comprising the following steps of:

11 Obtaining and preprocessing harmful images of rice leaf rollers: acquiring a rice leaf roller image, setting the image as a harmful image, marking a data set by using inclined rectangular frame marking software ropylelmg, and establishing a harmful training sample set;

12 Building a rice leaf roller pest detection model: constructing a rice leaf roller pest detection model based on the inclined rectangular frame;

13 Training of rice leaf rollers as pest detection model: training a rice leaf roller pest detection model based on the pest training sample set;

14 Obtaining an image of rice leaf roller pests to be detected: acquiring an image of rice leaf rollers to be detected and preprocessing the image;

15 Obtaining the image detection result of rice leaf roller pests: inputting the image of the rice leaf rollers to be detected into a trained pest-like detection model of the rice leaf rollers to obtain a pest-like image detection result of the rice leaf rollers.

2. The method for detecting the pest-like image of rice leaf rollers based on the inclined rectangular frame as claimed in claim 1, wherein the construction of the pest-like detection model of rice leaf rollers comprises the following steps:

21 Setting a first part of a rice leaf roller pest-shaped detection model as a feature extraction module using ResNet-50, a second part as a feature pyramid module using FPN, a third part as a pending inclined rectangular frame acquisition module and a fourth part as an inclined rectangular detection frame acquisition module;

22 A pending inclined rectangular frame acquisition module) is set:

23 A set inclined rectangular detection frame acquisition module:

3. The method for detecting the pest-like image of the cnaphalocrocis medinalis based on the inclined rectangular frame as claimed in claim 1, wherein the training of the pest-like detection model of the cnaphalocrocis medinalis comprises the following steps:

31 Inputting the harmful training sample set into a rice leaf roller detection model;

32 The feature extraction module of the harmful detection model of the cnaphalocrocis medinalis to obtain feature graphs of m different layers;

33 Inputting the feature graphs of m different layers into a feature pyramid module to obtain m feature graphs fused by different scales;

34 Based on the undetermined inclined rectangular frame acquisition network of the self-adaptive selection training sample mechanism, finding the most suitable ratio threshold value of intersection and union in the initially set non-angle rectangular frame aiming at each harm,

wherein the method comprises the steps of

Obtaining a regression branch result of the network for the pending inclined rectangular box, representing a gap between the pending inclined rectangular box and the initial rectangular box without angle, i _gt I epsilon { x, y, w, h, θ } is the labeled moment of inclinationCenter coordinates, long and short sides and angle of the frame, +.>

And->

35 Obtaining the regression branch result of the network and the initial rectangular frame without angles through the undetermined inclined rectangular frame, and obtaining the center coordinates, long and short sides and the inclined angle (x) of the undetermined inclined rectangular frame by the calculation _dt ,y _dt ,w _dt ,h _dt ,θ _dt ) All undetermined inclined rectangular frames obtain local characteristics k x C through inclined characteristic dimension reduction operation, and then 2 full-connection operations are carried out;