CN111797937B - Greenhouse environment assessment method based on PNN network - Google Patents
- Publication number: CN111797937B (application CN202010678726.6A)
- Authority: CN (China)
- Legal status: Active (the status listed is an assumption, not a legal conclusion)
Classifications
- G06F18/23213—Pattern recognition: non-hierarchical clustering using statistics or function optimisation, with a fixed number of clusters, e.g. K-means clustering
- G06F18/214—Pattern recognition: generating training patterns; bootstrap methods, e.g. bagging or boosting
- G06N3/006—Computing arrangements based on biological models: artificial life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
- G06N3/045—Neural networks: combinations of networks
Abstract
The invention provides a greenhouse environment assessment method based on a PNN network, in the technical field of facility agriculture. First, a greenhouse environment parameter sample library is established, the samples are graded, and the library is divided into training samples and test samples. Next, the training samples are clustered with an improved K-means clustering algorithm, and, according to a representative-sample selection threshold, a batch of representative samples is selected as the new training samples of the PNN network. The new training samples are normalized and used to train the PNN network; the trained network then grades the normalized test samples, and the error rate of classifying the test samples is computed. Finally, mode-layer neurons of the same class in the PNN network share the same smoothing factor while mode-layer neurons of different classes use different smoothing factors, and the classification error rate serves as the objective function of a particle swarm optimization algorithm that adjusts the smoothing factors of the PNN network, yielding an optimal PNN classification model.
Description
Technical Field
The invention relates to the technical field of facility agriculture, in particular to a greenhouse environment assessment method based on a PNN network.
Background
With the development of agricultural technology and the changing climate, greenhouse planting plays an increasingly important role in agricultural production. Its main purpose is to provide crops with a suitable growth environment despite harsh outdoor conditions. To ensure that the greenhouse environment meets the requirements of crop growth, the environment is monitored online or offline and its quality is evaluated against expert experience, which is of great significance for guiding agricultural production.
At present, the common problem of comprehensive greenhouse environment detectors on the market is that they cannot judge from the measured data whether the greenhouse environment quality is good or bad, and therefore offer little guidance to users; greenhouse environment quality assessment still depends mainly on the experience of human experts.
The probabilistic neural network (Probabilistic Neural Network, PNN) is a structurally simple and widely applied neural network proposed by Dr. D. F. Specht in 1988. Its basic idea is to separate a decision space within the multidimensional input space using the Bayesian minimum-risk criterion. The PNN is a feedforward network, grounded in statistical principles, that uses Parzen window functions as activation functions; it absorbs the advantages of radial basis function networks and of classical probability density estimation, and holds a clear advantage over traditional feedforward networks in pattern classification. The conventional PNN network, however, has two drawbacks: (1) when there are too many training samples, the PNN network places high demands on hardware storage space and computing power, which increases the difficulty and cost of hardware design; (2) the smoothing factor has a crucial influence on the classification accuracy of the network, yet its value is usually set manually, without a principled basis for the choice.
For structural and parameter optimization of PNN networks, the common approach is as follows: first, cluster the training samples with a clustering algorithm such as K-means and use the representative cluster centers as the new training samples of the PNN network, thereby simplifying the network structure; then, assume the smoothing factors of all mode-layer neurons share the same value, take the PNN classification error rate as the objective function, and optimize the smoothing factor with an algorithm such as particle swarm optimization (Particle Swarm Optimization, PSO) to improve the classification accuracy of the network. This approach has two disadvantages: (1) when the number of cluster centers is small, using the cluster centers as the new training samples leaves too few training samples and degrades the classification accuracy of the PNN network; (2) assuming identical parameters for all mode-layer neurons weakens the network's ability to account for differences between samples during classification, and it may ignore details of the test samples.
Disclosure of Invention
Aiming at the defects of the prior art, the technical problem to be solved by the invention is to provide a greenhouse environment assessment method based on a PNN network, so as to assess the greenhouse environment.
In order to solve the technical problems, the invention adopts the following technical scheme: a greenhouse environment assessment method based on PNN network comprises the following steps:
step 1: establish a greenhouse environment parameter sample library of size n, and evaluate the quality of each group of samples in the library on M grades, thereby dividing the n samples into M classes; each group of samples in the library has dimension q, each dimension representing a greenhouse environment parameter with an important influence on plant growth;
step 2: selecting m samples from a sample library as training samples, and using other l=n-m samples as test samples;
step 3: initializing parameters in an improved K-means clustering algorithm and a particle swarm optimization algorithm;
the initialized parameters are as follows. For the improved K-means clustering algorithm: the number of clusters k, the initial cluster centers c_g^(0) (g = 1, 2, …, k), the iteration stop threshold ε, the representative-sample selection threshold α, the maximum iteration number J and the current iteration number j. For the particle swarm optimization (Particle Swarm Optimization, PSO) algorithm: the number of particles N, the solution-space dimension D, the maximum iteration number max_iter, and the particles' initial position vector px and initial velocity vector pv. The position vector of particle i is denoted px_i = [px_i1, px_i2, …, px_iD], i ∈ [1, N], and its velocity vector pv_i = [pv_i1, pv_i2, …, pv_iD]; the individual best position minimizing the objective function in the current iteration is pbest_i = [pbest_i1, pbest_i2, …, pbest_iD], and the population best position is gbest = [gbest_1, gbest_2, …, gbest_D]; the minimum objective-function values experienced by the individual and by the population during the iterative process are p_fitness_i and g_fitness, respectively. All smoothing factor values in the PNN network are initialized to 0.1;
step 4: cluster the m selected training samples with the improved K-means clustering algorithm to obtain k clusters and k cluster centers, the number of samples in each cluster being m_g, g = 1, 2, …, k; then, according to the representative-sample selection threshold α, select a batch of representative samples from each cluster as the new training samples of the PNN network;
step 4.1: randomly select k sample points from the m training samples as the initial cluster centers c_g^(0), g = 1, 2, …, k, of the K-means clustering algorithm;
Step 4.2: setting the current iteration number as j, and for each sample point p in the training sample t T=1, 2, …, m are calculated to each cluster center in turnThe Euclidean distance d (t, g) of (a) is shown in the following formula:
step 4.3: for each sample point p_t, find the cluster center c_g^(j) at the smallest distance d(t, g), and assign p_t to that cluster;
step 4.4: recalculate the center of each cluster as the mean of its members:

c_g^(j+1) = (1 / m_g) Σ_{w=1}^{m_g} p_gw   (2)

where p_gw denotes the w-th sample point in the g-th cluster;
step 4.5: calculate the sum of squared distances between the samples in each cluster and the new cluster centers:

E_{j+1} = Σ_{g=1}^{k} Σ_{w=1}^{m_g} ‖ p_gw − c_g^(j+1) ‖²   (3)

where E_{j+1} denotes the sum of squared distances from the samples in each cluster to the new cluster centers;
step 4.6: if the current iteration number j equals the maximum iteration number J, or |E_{j+1} − E_j| < ε, execute step 4.7; otherwise return to step 4.2;
step 4.7: count the number of samples m_g in each cluster and, according to the representative-sample selection threshold α, select the m_g·α samples nearest to each cluster center, outputting them as the most representative training samples and yielding p = m·α new training samples;
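Steps 4.1 to 4.7 can be sketched as follows (an illustrative Python sketch; the function and variable names are my own, since the patent gives no reference implementation):

```python
import numpy as np

def kmeans_representatives(X, k, alpha, max_iter=20, eps=1e-3, init=None, seed=0):
    """Improved K-means of step 4: cluster, then keep the m_g * alpha samples
    nearest to each cluster center as the new PNN training set (step 4.7)."""
    rng = np.random.default_rng(seed)
    if init is None:
        init = rng.choice(len(X), size=k, replace=False)   # step 4.1: random centers
    centers = X[np.asarray(init)].astype(float)
    prev_sse = np.inf
    labels = np.zeros(len(X), dtype=int)
    for _ in range(max_iter):                              # steps 4.2-4.6
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)  # d(t, g)
        labels = d.argmin(axis=1)                          # step 4.3: nearest center
        for g in range(k):                                 # step 4.4: mean update
            if np.any(labels == g):
                centers[g] = X[labels == g].mean(axis=0)
        sse = sum(((X[labels == g] - centers[g]) ** 2).sum() for g in range(k))
        if abs(prev_sse - sse) < eps:                      # |E_{j+1} - E_j| < eps
            break
        prev_sse = sse
    reps = []                                              # step 4.7: representatives
    for g in range(k):
        members = np.where(labels == g)[0]
        n_keep = int(round(len(members) * alpha))
        order = np.linalg.norm(X[members] - centers[g], axis=1).argsort()
        reps.extend(members[order[:n_keep]].tolist())
    return sorted(reps)
```

With m = 900 training samples and α = 0.2, this keeps roughly 180 samples, matching p = m·α in the text; keeping the nearest-to-center samples, rather than the centers alone, is what distinguishes the improved algorithm from plain K-means here.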
step 5: carrying out normalization processing on the new training sample;
Let the new training sample matrix X be:

X = [ x_11 x_12 … x_1q
      x_21 x_22 … x_2q
      ⋮
      x_p1 x_p2 … x_pq ]   (4)

where p denotes the number of new training samples and q the dimension of the new training samples;
and carrying out normalization processing on the new training sample matrix X through a normalization factor matrix B to obtain a matrix C, wherein the expressions of the matrix B and the matrix C are shown as follows:
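The patent text does not spell out the entries of the normalization factor matrix B. One common choice in the PNN literature, which this sketch assumes, is B = diag(1/‖x_1‖, …, 1/‖x_p‖), so that C = BX rescales every sample vector to unit Euclidean length:

```python
import numpy as np

def normalize_samples(X):
    """Assumed step-5 normalization: scale each row of X to unit length,
    i.e. C = B @ X with B = diag(1 / ||x_i||). This is an assumption;
    the patent only names B a "normalization factor matrix"."""
    X = np.asarray(X, dtype=float)
    norms = np.linalg.norm(X, axis=1, keepdims=True)  # ||x_i|| for each sample row
    norms[norms == 0.0] = 1.0                         # leave all-zero rows unchanged
    return X / norms
```

Whatever normalization is chosen, the test samples must be transformed with the same rule before they are fed to the trained network.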
step 6: training a PNN network according to the normalized training sample matrix C, performing grade evaluation on the normalized test samples by utilizing the trained PNN network, and simultaneously calculating the error rate of classifying the test samples;
the PNN network structure comprises an input layer, a mode layer, a summation layer and an output layer; the input layer does not process the data and sends the data into the mode layer; the number of the neurons of the mode layer is equal to the number of training samples, and the activation function is a Gaussian function; the connection mode of the summation layer and the mode layer is sparse connection, and the neuron number of the summation layer is equal to the class number of the training samples; the output layer selects the category corresponding to the maximum posterior probability for output according to the Bayesian decision rule;
step 6.1: constructing a mode layer of the PNN network by using the normalized training sample matrix C;
After the new training sample matrix X is normalized, the training sample matrix C is obtained:

C = [ c_11 c_12 … c_1q
      c_21 c_22 … c_2q
      ⋮
      c_p1 c_p2 … c_pq ]   (7)
the training matrix C contains p training samples divided into M classes; with the per-class training sample counts denoted h_1, h_2, …, h_M, we have:

p = h_1 + h_2 + … + h_M   (8)

Assume the M classes of samples are arranged consecutively in the sample matrix C, and number the mode-layer neurons 1 through p in order: neurons 1 through h_1 correspond to class-1 training samples, neurons h_1+1 through h_1+h_2 correspond to class-2 training samples, and so on; neurons p−h_M+1 through p belong to class M;
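The class-to-neuron numbering above amounts to a running offset over the per-class counts h_1, …, h_M. A small illustrative sketch (the counts used here are hypothetical, not from the patent):

```python
import numpy as np

h = [60, 45, 40, 35]             # hypothetical per-class counts h_1..h_4, sum = p
offsets = np.cumsum([0] + h)     # [0, 60, 105, 145, 180]
# 1-based index range of the mode-layer neurons belonging to each class
neuron_ranges = [(int(offsets[b]) + 1, int(offsets[b + 1])) for b in range(len(h))]
```

For these counts the ranges are (1, 60), (61, 105), (106, 145) and (146, 180), i.e. class b owns the neurons from h_1+…+h_{b−1}+1 up to h_1+…+h_b.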
step 6.2: calculating Euclidean distance between each test sample in the test sample matrix and each training sample in the training set;
the test sample matrix T, consisting of the l = n − m normalized test samples, is:

T = [ t_11 t_12 … t_1q
      t_21 t_22 … t_2q
      ⋮
      t_l1 t_l2 … t_lq ]   (9)

The Euclidean distance matrix E_d between each test sample and each training sample is the l×p matrix:

E_d = [ d_ai ],  d_ai = ‖ t_a − c_i ‖ = sqrt( Σ_{r=1}^{q} ( t_ar − c_ir )² )   (10)

where t_a denotes the a-th test sample and c_i the i-th training sample;
step 6.3: activating the mode layer neurons by using radial basis functions;
select the Gaussian function as the activation function of the mode-layer neurons, and compute the activated probability matrix U:

U = [ u_ai ],  u_ai = exp( − d_ai² / ( 2σ_i² ) )   (11)

where σ_1, σ_2, …, σ_p denote the smoothing factors of the p mode-layer neurons; at the start all smoothing factors are set to the same value, σ_1 = σ_2 = … = σ_p = 0.1;
Step 6.4: in the summation layer, compute the initial probability sum of each test sample belonging to each class, giving the matrix S:

S_ab = (1 / h_b) Σ_{i ∈ class b} u_ai   (12)
step 6.5: from the initial probability sums, compute the probability prob_ab that the a-th test sample belongs to the b-th class:

prob_ab = S_ab / Σ_{b′=1}^{M} S_ab′   (13)

where a ∈ [1, l], b ∈ [1, M];
step 6.6: according to the Bayesian decision theorem and the probabilities of each test sample belonging to the various classes, determine the class of the a-th test sample:

y_a = arg max_b ( prob_ab )   (14)

where y_a denotes the prediction of the PNN network for the a-th test sample, i.e. the class assigned to it;
step 6.7: calculate the error rate of the PNN network in classifying the test samples:

ER = n_e / l   (15)

where ER denotes the error rate of the PNN network on the test samples, n_e the number of samples the PNN network misclassifies, and l = n − m the number of test samples;
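Steps 6.2 to 6.7 amount to a single forward pass through the network. The sketch below (helper names are my own, not the patent's) uses the Gaussian activation of step 6.3, class-wise averaging in the summation layer, and the arg-max decision of step 6.6:

```python
import numpy as np

def pnn_classify(train, train_labels, test, sigma):
    """PNN forward pass. train: (p, q) normalized training samples;
    train_labels: (p,) class index per sample; sigma: one smoothing
    factor per class (same value within a class, as in step 7)."""
    sig = np.asarray(sigma, dtype=float)[train_labels]   # sigma_i per mode neuron
    # mode layer: squared Euclidean distance of every test sample to every neuron
    d2 = ((test[:, None, :] - train[None, :, :]) ** 2).sum(axis=2)
    U = np.exp(-d2 / (2.0 * sig[None, :] ** 2))          # Gaussian activation
    classes = np.unique(train_labels)
    # summation layer: average activations per class
    S = np.stack([U[:, train_labels == b].mean(axis=1) for b in classes], axis=1)
    prob = S / S.sum(axis=1, keepdims=True)              # per-class probabilities
    return classes[prob.argmax(axis=1)]                  # output layer: arg-max

def error_rate(pred, truth):
    """ER = n_e / l, the fraction of misclassified test samples."""
    return float(np.mean(np.asarray(pred) != np.asarray(truth)))
```

Note that for well-separated classes and small smoothing factors, the activations of far-away neurons underflow to zero, so in practice the smoothing factors must be kept away from zero, which is exactly what the PSO search of step 7 regulates.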
step 7: let mode-layer neurons of the same class in the PNN network share the same smoothing factor, and let mode-layer neurons of different classes use different smoothing factors; then take the error rate ER of the PNN network on the test samples as the objective function of the PSO algorithm and adjust the smoothing-factor parameters of the PNN network with PSO, thereby optimizing the network and finally obtaining the optimal PNN classification model;
step 7.1: in the β-th iteration of the PSO optimization of the PNN parameters, first update the velocity vector pv_i and position vector px_i of each particle:

pv_i^(β+1) = ω·pv_i^(β) + c_1·μ_1·( pbest_i − px_i^(β) ) + c_2·μ_2·( gbest − px_i^(β) )   (16)

px_i^(β+1) = px_i^(β) + pv_i^(β+1)   (17)

where ω is the inertia weight, characterizing the search capability of the particle swarm optimization algorithm; c_1 and c_2 are the learning factors of the individual and global extremum points, respectively; μ_1 and μ_2 are random numbers between 0 and 1. Since the objects of the PSO optimization are the smoothing factors adopted by each class of mode-layer neurons, the particle solution-space dimension is D = M;
step 7.2: each updated particle position vector px_i represents a feasible solution for the PNN network smoothing factors. Substitute px_i for the smoothing-factor values of the PNN network, then compute the error rate ER_i with which this network classifies the test samples, and update the particle's individual best position pbest_i and individual objective minimum p_fitness_i, as well as the population best position gbest and population objective minimum g_fitness, according to the following rule:

if ER_i < p_fitness_i, then pbest_i = px_i and p_fitness_i = ER_i; if p_fitness_i < g_fitness, then gbest = pbest_i and g_fitness = p_fitness_i; otherwise gbest and g_fitness remain unchanged;
step 7.3: when the iteration number reaches the maximum iteration number max_iter, the PSO algorithm is terminated, wherein gbest represents an optimal solution of the PNN network on the smoothing factor, and g_fitness represents an error rate of classifying the test sample by using the PNN network with gbest as the optimal smoothing factor; otherwise, re-executing the step 7.1;
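Step 7 can be sketched as a standard PSO loop. The values of ω, c_1 and c_2 below are assumptions, since the patent does not fix them; in the patent the objective is the PNN test error rate ER over the M per-class smoothing factors, while the usage example substitutes a simple quadratic so that the sketch is self-contained:

```python
import numpy as np

def pso_minimize(objective, dim, n_particles=30, max_iter=100,
                 omega=0.6, c1=1.5, c2=1.5, bounds=(0.01, 2.0), seed=0):
    """Standard PSO; positions play the role of candidate smoothing-factor
    vectors and `objective` the role of the classification error rate ER."""
    rng = np.random.default_rng(seed)
    lo, hi = bounds
    px = rng.uniform(lo, hi, (n_particles, dim))          # initial positions
    pv = np.zeros((n_particles, dim))                     # initial velocities
    pbest = px.copy()                                     # individual bests
    p_fit = np.array([objective(x) for x in px])
    g = p_fit.argmin()
    gbest, g_fit = pbest[g].copy(), p_fit[g]              # population best
    for _ in range(max_iter):
        mu1, mu2 = rng.random((2, n_particles, dim))      # random factors in [0, 1)
        pv = omega * pv + c1 * mu1 * (pbest - px) + c2 * mu2 * (gbest - px)
        px = np.clip(px + pv, lo, hi)                     # keep sigmas positive
        fit = np.array([objective(x) for x in px])
        improved = fit < p_fit                            # individual-best update
        pbest[improved], p_fit[improved] = px[improved], fit[improved]
        if p_fit.min() < g_fit:                           # population-best update
            g = p_fit.argmin()
            gbest, g_fit = pbest[g].copy(), p_fit[g]
    return gbest, g_fit
```

With dim = M = 4 this searches one smoothing factor per class, which is the per-class scheme the patent uses in place of a single shared smoothing factor.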
step 8: collect the greenhouse environment data to be evaluated, and evaluate the greenhouse environment quality with the optimal PNN classification model obtained in step 7.
The beneficial effects of the above technical scheme are as follows. The greenhouse environment assessment method based on the PNN network applies a PNN classification model to greenhouse environment quality assessment, remedying the current lack of assessment capability in comprehensive greenhouse environment detectors. The improved K-means algorithm selects representative samples from the training set, making the chosen training samples more representative and better suited to the needs of the PNN network; it reduces the complexity of the PNN network structure, lowering the difficulty of hardware implementation and the storage cost, while avoiding the sharp drop in classification accuracy that too few training samples would cause. The improved PSO algorithm then performs parameter optimization on the smoothing factors, with mode-layer neurons of the same class in the PNN sharing one smoothing factor and neurons of different classes using different smoothing factors, further improving the classification accuracy of the PNN network.
Drawings
FIG. 1 is a flowchart of a greenhouse environment assessment method based on a PNN network, provided by an embodiment of the present invention;
FIG. 2 is a diagram of environmental requirements of greenhouse cucumber planting provided by the embodiment of the invention;
fig. 3 is a schematic structural diagram of a PNN network according to an embodiment of the present invention;
fig. 4 is a diagram of classification results of a PNN network on a test sample according to an embodiment of the present invention.
Detailed Description
The following describes in further detail the embodiments of the present invention with reference to the drawings and examples. The following examples are illustrative of the invention and are not intended to limit the scope of the invention.
In this embodiment, taking a greenhouse environment for planting cucumber as an example, the greenhouse environment is evaluated by adopting the greenhouse environment evaluation method based on the PNN network.
A greenhouse environment assessment method based on PNN network, as shown in figure 1, comprises the following steps:
step 1: establishing a greenhouse environment parameter sample library with the size of n, and carrying out quality evaluation on each group of samples in the sample library according to M grades, so as to divide the n samples into M classes; the dimension of each group of samples in the sample library is q, and the parameters respectively represent greenhouse environment parameters with important influence on plant growth;
in this example, each group of samples in the sample library has dimension q = 7, representing the 7 greenhouse environment parameters with an important influence on plant growth: air temperature, air humidity, carbon dioxide concentration, illumination intensity, soil temperature, soil humidity and soil salinity. Each group of sample data is evaluated and assigned one of four grades (excellent, good, medium, poor), represented by 1, 2, 3 and 4 respectively.
In this embodiment, the requirements of greenhouse cucumber planting on the 7 environmental parameters are shown in fig. 2, from which the conditions for optimum growth of greenhouse cucumbers are: carbon dioxide concentration 1000 ppm to 1500 ppm, illumination intensity 55 KLx to 60 KLx, air humidity 70.0% to 80.0%, soil humidity 80.0% to 90.0%, air temperature 25°C to 30°C, soil temperature 20°C to 24°C, and soil salinity 0.5 mS/cm to 0.8 mS/cm. In this embodiment, 1000 groups of greenhouse environment samples were measured with sensors and each group was comprehensively evaluated; part of the sample data is shown in table 1;
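As a small illustration (my own helper, not part of the patent), the optimal bands read off Fig. 2 can be encoded as a range check. Note that the grades in Table 1 come from expert evaluation of the whole sample, so this snippet only flags whether a reading falls inside every optimal band:

```python
# Optimal ranges for greenhouse cucumber from Fig. 2 (units as in the text).
OPTIMAL = {
    "co2_ppm": (1000.0, 1500.0),   # the text's "ppt" presumably means ppm
    "light_klx": (55.0, 60.0),
    "air_humidity_pct": (70.0, 80.0),
    "soil_humidity_pct": (80.0, 90.0),
    "air_temp_c": (25.0, 30.0),
    "soil_temp_c": (20.0, 24.0),
    "soil_salinity_ms_cm": (0.5, 0.8),
}

def in_optimal_bands(sample):
    """True if every one of the 7 parameters lies in its optimal band."""
    return all(lo <= sample[key] <= hi for key, (lo, hi) in OPTIMAL.items())
```

A reading that satisfies every band is a plausible grade-1 candidate, while the trained PNN model handles the graded judgments in between.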
TABLE 1 sample data for a portion of a sample library
Step 2: selecting m samples from a sample library as training samples, and using other l=n-m samples as test samples;
in the embodiment, 900 samples are selected from a sample library to serve as training samples according to the ratio of 9:1, and the rest 100 samples serve as test samples;
step 3: initializing parameters in an improved K-means clustering algorithm and a particle swarm optimization algorithm;
the initialized parameters are as follows. For the improved K-means clustering algorithm: the number of clusters k, the initial cluster centers c_g^(0) (g = 1, 2, …, k), the iteration stop threshold ε, the representative-sample selection threshold α, the maximum iteration number J and the current iteration number j. For the particle swarm optimization (Particle Swarm Optimization, PSO) algorithm: the number of particles N, the solution-space dimension D, the maximum iteration number max_iter, and the particles' initial position vector px and initial velocity vector pv. The position vector of particle i is denoted px_i = [px_i1, px_i2, …, px_iD], i ∈ [1, N], and its velocity vector pv_i = [pv_i1, pv_i2, …, pv_iD]; the individual best position minimizing the objective function in the current iteration is pbest_i = [pbest_i1, pbest_i2, …, pbest_iD], and the population best position is gbest = [gbest_1, gbest_2, …, gbest_D]; the minimum objective-function values experienced by the individual and by the population during the iterative process are p_fitness_i and g_fitness, respectively. All smoothing factor values in the PNN network are initialized to 0.1;
in this embodiment, as can be seen from steps 1 and 2, the sample library size is n = 1000, each sample has dimension q = 7, and the number of sample classes is M = 4, i.e. 4 different types of training samples; the original number of training samples is m = 900 and the number of test samples is l = 100. Meanwhile, for the improved K-means algorithm, initialize the number of cluster centers k = 8, the iteration stop threshold ε = 0.001, the representative-sample selection threshold α = 0.2, the current iteration number j = 1 and the maximum iteration number J = 20. For the improved PSO algorithm, initialize the number of particles N = 30, the solution-space dimension D = M = 4 and the maximum iteration number max_iter = 100, together with the i-th particle's initial position vector px_i = [px_i1, px_i2, …, px_iD], initial velocity vector pv_i = [pv_i1, pv_i2, …, pv_iD], individual best position pbest_i = [pbest_i1, pbest_i2, …, pbest_iD] and population best position gbest = [gbest_1, gbest_2, …, gbest_D]; the minimum objective-function values experienced by the individual and by the population during the iterative process are p_fitness_i and g_fitness, respectively. The smoothing factors of the initial PNN network all take the same value, σ = 0.1;
step 4: cluster the m selected training samples with the improved K-means clustering algorithm to obtain k clusters and k cluster centers, the number of samples in each cluster being m_g, g = 1, 2, …, k; then, according to the representative-sample selection threshold α, select a batch of representative samples from each cluster as the new training samples of the PNN network;
step 4.1: randomly select k sample points from the m training samples as the initial cluster centers c_g^(0), g = 1, 2, …, k, of the K-means clustering algorithm;
Step 4.2: setting the current iteration number as j, and for each sample point p in the training sample t T=1, 2, …, m are calculated to each cluster center in turnThe Euclidean distance d (t, g) of (a) is as followsThe formula is shown as follows:
step 4.3: for each sample point p_t, find the cluster center c_g^(j) at the smallest distance d(t, g), and assign p_t to that cluster;
step 4.4: recalculate the center of each cluster as the mean of its members:

c_g^(j+1) = (1 / m_g) Σ_{w=1}^{m_g} p_gw   (2)

where p_gw denotes the w-th sample point in the g-th cluster;
step 4.5: calculate the sum of squared distances between the samples in each cluster and the new cluster centers:

E_{j+1} = Σ_{g=1}^{k} Σ_{w=1}^{m_g} ‖ p_gw − c_g^(j+1) ‖²   (3)

where E_{j+1} denotes the sum of squared distances from the samples in each cluster to the new cluster centers; the goal of the K-means algorithm can be seen as minimizing this within-cluster sum of squares;
step 4.6: if the current iteration number j equals the maximum iteration number J = 20, or |E_{j+1} − E_j| < ε = 0.001, execute step 4.7; otherwise return to step 4.2;
step 4.7: count the number of samples m_g in each cluster and, according to the representative-sample selection threshold α, select the m_g·α samples nearest to each cluster center, outputting them as the most representative training samples and yielding p = m·α = 180 new training samples;
step 5: carrying out normalization processing on the new training sample;
Let the new training sample matrix X be:

X = [ x_11 x_12 … x_1q
      x_21 x_22 … x_2q
      ⋮
      x_p1 x_p2 … x_pq ]   (4)

where p = 180 denotes the number of new training samples and q = 7 the dimension of the new training samples;
and carrying out normalization processing on the new training sample matrix X through a normalization factor matrix B to obtain a matrix C, wherein the expressions of the matrix B and the matrix C are shown as follows:
step 6: training a PNN network according to the normalized training sample matrix C, performing grade evaluation on the normalized test samples by utilizing the trained PNN network, and simultaneously calculating the error rate of classifying the test samples;
the PNN network structure comprises an input layer, a mode layer, a summation layer and an output layer. The input layer performs no processing on the data and passes it to the mode layer. The number of mode-layer neurons equals the number of training samples, and the activation function is a Gaussian function; its main role is to compute the match between the input feature vector and each of the training samples. The summation layer is sparsely connected to the mode layer, with one neuron per training-sample class; following the Parzen window calculation, it sums the mode-layer outputs by class and averages them, giving the posterior probability of the input vector belonging to each class. The output layer selects, according to the Bayesian decision rule, the class with the maximum posterior probability as the output;
step 6.1: constructing a mode layer of the PNN network by using the normalized training sample matrix C;
after the new training sample matrix X is normalized, a training sample matrix C is obtained, and the following formula is shown:
there are p = 180 training samples in the training matrix C, and the training matrix C is divided into M = 4 classes; the numbers of training samples in the 4 classes are denoted h_1, h_2, h_3, h_4, so that:
p = h_1 + h_2 + h_3 + h_4 (8)
in a PNN network, the number of mode layer neurons is determined by the training samples, so when there are p training samples in the training set, there are also p neurons in the mode layer of the PNN; assuming that the 4 classes of samples are arranged sequentially in the sample matrix C, each neuron of the mode layer is numbered sequentially from 1 to p; neurons numbered 1 through h_1 correspond to class 1 training samples, neurons numbered h_1 + 1 through h_1 + h_2 correspond to class 2 training samples, and so on, and neurons numbered p − h_4 + 1 through p belong to the class 4 samples;
in this embodiment, the PNN network is structured as shown in fig. 3.
Step 6.2: calculating Euclidean distance between each test sample in the test sample matrix and each training sample in the training set;
the test sample matrix T consisting of l=n-m=100 test samples and normalized is shown in the following formula:
the Euclidean distance matrix E_d between each test sample and each training sample is shown in the following formula:
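The distance computation of step 6.2 can be written with array broadcasting; the matrices T and C below are random stand-ins for the normalised test and training sample matrices:

```python
import numpy as np

rng = np.random.default_rng(2)
l, p, q = 100, 180, 7                    # test samples, training samples, feature dimension
T = rng.random((l, q))                   # normalised test sample matrix (illustrative)
C = rng.random((p, q))                   # normalised training sample matrix (illustrative)

# E_d[a, t] = Euclidean distance between test sample a and training sample t
E_d = np.linalg.norm(T[:, None, :] - C[None, :, :], axis=2)
```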
step 6.3: activating the mode layer neurons by using radial basis functions;
selecting a Gaussian function as an activation function of the mode layer neuron, and calculating an activated probability matrix U, wherein the probability matrix U is shown in the following formula:
wherein σ_1, σ_2, …, σ_p respectively represent the smoothing factors of the p mode layer neurons, whose values have a critical influence on the classification accuracy of the PNN network; initially all smoothing factors are set to the same value, namely σ_1 = σ_2 = … = σ_p = 0.1;
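Step 6.3 in code: each entry of U applies the Gaussian radial basis function to the corresponding entry of E_d, with every smoothing factor initialised to 0.1 as in the text; E_d here is a random stand-in for the distance matrix of step 6.2:

```python
import numpy as np

rng = np.random.default_rng(3)
l, p = 100, 180
E_d = rng.random((l, p))                 # distance matrix from step 6.2 (illustrative)
sigma = np.full(p, 0.1)                  # all smoothing factors start at 0.1

# U[a, t] = exp(-d(a, t)^2 / (2 * sigma_t^2)) -- Gaussian activation of neuron t
U = np.exp(-E_d**2 / (2.0 * sigma**2))
```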
Step 6.4: solving the initial probability and matrix S of each class of sample to be tested by a summation layer, wherein the initial probability and matrix S are represented by the following formula:
step 6.5: calculating the probability prob_ab that the a-th sample belongs to the b-th class from the initial probability sums of the samples to be tested over the classes, as shown in the following formula:
wherein a ∈ [1, l], b ∈ [1, M];
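Steps 6.4 and 6.5 can be sketched as follows, assuming illustrative per-class counts h and a random activation matrix U; the summation layer averages the mode layer outputs class by class (the Parzen window estimate), and the averages are then normalised so that each row of prob sums to one:

```python
import numpy as np

rng = np.random.default_rng(4)
l, M = 100, 4
h = [40, 50, 50, 40]                     # per-class training counts (illustrative), sum = p
U = rng.random((l, sum(h)))              # activated probability matrix from step 6.3

# summation layer: average the mode layer outputs belonging to each class
edges = np.cumsum([0] + h)
S = np.stack([U[:, edges[b]:edges[b + 1]].mean(axis=1) for b in range(M)], axis=1)

# step 6.5: normalise so prob[a, b] is the probability that sample a belongs to class b
prob = S / S.sum(axis=1, keepdims=True)
```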
step 6.6: according to the Bayesian decision theorem and the probability that each sample to be tested belongs to various types, the class corresponding to the a-th sample to be tested is determined, and the following formula is shown:
y_a = arg max_b (prob_ab) (14)
wherein y_a represents the prediction result of the PNN network for the a-th test sample, i.e. the category corresponding to the a-th test sample;
step 6.7: calculating the error rate of the PNN network for classifying the test samples, wherein the error rate is shown in the following formula:
wherein ER represents the error rate of the PNN network in classifying the test samples, n_e represents the number of samples misclassified by the PNN network, and l = n − m represents the number of test samples;
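Steps 6.6 and 6.7 in code: the Bayes decision picks the class with maximum posterior probability, and the error rate compares the predictions against ground-truth labels (randomly generated here for illustration):

```python
import numpy as np

rng = np.random.default_rng(5)
l, M = 100, 4
prob = rng.random((l, M))                # class probabilities from step 6.5 (illustrative)
true_labels = rng.integers(0, M, size=l) # ground-truth classes (illustrative)

# step 6.6: Bayes decision -- select the class of maximum posterior probability
y = prob.argmax(axis=1)

# step 6.7: error rate ER = n_e / l
n_e = int(np.sum(y != true_labels))
ER = n_e / l
```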
step 7: the method comprises the steps of enabling mode layer neurons of the same class in a PNN network to adopt the same smoothing factors and mode layer neurons of different classes to adopt different smoothing factors, then using error rate ER of the PNN network for classifying test samples as an objective function of a PSO algorithm, modifying smoothing factor parameters in the PNN network through the PSO algorithm, optimizing the PNN network, continuously reducing the value of ER under limited iteration times, achieving the purpose of optimizing PNN network parameters, and finally obtaining an optimal PNN classification model;
step 7.1: in the β-th iteration of the PSO algorithm optimizing the PNN network parameters, first update the velocity vector pv_i and the position vector px_i of each particle, as shown in the following formulas:
pv_i^{β+1} = ω·pv_i^β + c_1·μ_1·(pbest_i^β − px_i^β) + c_2·μ_2·(gbest^β − px_i^β) (16)
px_i^{β+1} = px_i^β + pv_i^{β+1} (17)
wherein ω is the inertia weight, representing the search capability of the particle swarm optimization algorithm; c_1, c_2 are the learning factors of the individual extremum point and the global extremum point respectively; μ_1, μ_2 respectively represent random numbers between 0 and 1; since the objects optimized by the PSO algorithm are the smoothing factors adopted by each class of mode layer neurons, the particle solution space dimension D = M = 4;
in this embodiment, the inertia weight ω = 0.6, and the learning factors c_1 = c_2 = 2;
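A single application of the velocity and position updates of step 7.1, formulas (16) and (17), using this embodiment's values ω = 0.6 and c_1 = c_2 = 2; the particle states are random stand-ins:

```python
import numpy as np

rng = np.random.default_rng(6)
N, D = 20, 4                             # particles, dimension (one sigma per class)
omega, c1, c2 = 0.6, 2.0, 2.0            # values used in this embodiment
px = rng.random((N, D))                  # positions (candidate smoothing factors)
pv = np.zeros((N, D))                    # velocities
pbest = px.copy()                        # individual best positions (illustrative)
gbest = px[0].copy()                     # population best position (illustrative)

# formulas (16)-(17): velocity update, then position update
mu1, mu2 = rng.random((N, D)), rng.random((N, D))
pv = omega * pv + c1 * mu1 * (pbest - px) + c2 * mu2 * (gbest - px)
px = px + pv
```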
Step 7.2: the updated particle position vector represents a feasible solution for the PNN network smoothing factors; substitute the particle position vector px_i^{β+1} for the values of the PNN network smoothing factors, then, following the procedure of step 6 for computing the PNN classification error rate on the test samples, calculate the error rate f(px_i^{β+1}) of the PNN network classifying the test samples with px_i^{β+1}; and according to the following update rule, update the individual optimal position information pbest_i of each particle and the minimum value p_fitness_i of the objective function, simultaneously updating the optimal position information gbest experienced by the population and the minimum value g_fitness of the objective function;
the update rule is as follows:
if f(px_i^{β+1}) < p_fitness_i, then pbest_i = px_i^{β+1} and p_fitness_i = f(px_i^{β+1}); if p_fitness_i < g_fitness, then gbest = pbest_i and g_fitness = p_fitness_i; otherwise gbest and g_fitness remain unchanged;
step 7.3: when the iteration number reaches max_iter = 100, the PSO algorithm terminates; gbest then represents the optimal solution for the PNN network smoothing factors, and g_fitness represents the error rate of the PNN network classifying the test samples with gbest as the optimal smoothing factors; otherwise, step 7.1 is re-executed;
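The complete PSO loop of step 7 can be sketched as follows. The true objective, the PNN classification error rate of step 6, is replaced here by a toy quadratic stand-in named error_rate so the sketch stays self-contained; everything else (N particles, D = M = 4 per-class smoothing factors, ω = 0.6, c_1 = c_2 = 2, max_iter = 100, the pbest/gbest update rule) follows the text:

```python
import numpy as np

rng = np.random.default_rng(7)
N, D, max_iter = 20, 4, 100
omega, c1, c2 = 0.6, 2.0, 2.0

def error_rate(sigmas):
    # stand-in objective: the real one runs the step-6 PNN classification with
    # these per-class smoothing factors and returns its test error rate
    return float(np.sum((sigmas - 0.1) ** 2))

px = rng.uniform(0.01, 1.0, (N, D))      # initial positions (candidate sigmas)
pv = np.zeros((N, D))
p_fitness = np.array([error_rate(x) for x in px])
pbest = px.copy()
g_i = p_fitness.argmin()
gbest, g_fitness = pbest[g_i].copy(), p_fitness[g_i]

for _ in range(max_iter):
    mu1, mu2 = rng.random((N, D)), rng.random((N, D))
    pv = omega * pv + c1 * mu1 * (pbest - px) + c2 * mu2 * (gbest - px)
    px = px + pv
    fit = np.array([error_rate(x) for x in px])
    better = fit < p_fitness             # update individual bests
    pbest[better], p_fitness[better] = px[better], fit[better]
    if p_fitness.min() < g_fitness:      # update population best
        g_i = p_fitness.argmin()
        gbest, g_fitness = pbest[g_i].copy(), p_fitness[g_i]
```

After the loop, gbest holds the best smoothing factors found and g_fitness the corresponding objective value.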
step 8: and (3) collecting greenhouse environment data to be evaluated, and evaluating the greenhouse environment quality by adopting the optimal PNN classification model obtained in the step (7).
In this embodiment, the test set samples are evaluated with the optimal PNN classification model, and the evaluation results are shown in fig. 4: the overall evaluation accuracy on the test samples reaches 85%. The per-class accuracies are listed in table 2; the accuracy of the proposed evaluation method is as high as 95.2% for class 2 test samples but as low as 62.5% for class 4, which is caused by the different distributions of the classes within the training samples.
Table 2 comparison table of classification results of the evaluation method of the present invention
In this embodiment, the optimal PNN classification model of the present invention is compared with a conventional PNN evaluation model in terms of evaluation accuracy, network structure, training time, test time and storage space, as shown in table 3. The optimal PNN classification model requires more time for training, but outperforms the conventional PNN network in classification accuracy, network structure, test time and storage space. The proposed assessment method therefore offers a higher assessment speed together with lower hardware design difficulty and storage requirements, and provides an effective method for greenhouse environment quality assessment.
Table 3 Comparison with the conventional PNN evaluation model
Finally, it should be noted that: the above embodiments are only for illustrating the technical solution of the present invention, and are not limiting; although the invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some or all of the technical features thereof can be replaced with equivalents; such modifications and substitutions do not depart from the spirit of the corresponding technical solutions, which are defined by the scope of the appended claims.
Claims (6)
1. A greenhouse environment assessment method based on a PNN network, characterized in that the method comprises the following steps:
step 1: establishing a greenhouse environment parameter sample library with the size of n, and carrying out quality evaluation on each group of samples in the sample library according to M grades, so as to divide the n samples into M classes; the dimension of each group of samples in the sample library is q, and the parameters respectively represent greenhouse environment parameters with important influence on plant growth;
step 2: selecting m samples from a sample library as training samples, and using other l=n-m samples as test samples;
step 3: initializing parameters in an improved K-means clustering algorithm and a particle swarm optimization algorithm;
step 4: using an improved K-means clustering algorithm to cluster the m selected training samples, obtaining k clusters and k cluster centers, wherein the number of samples in each cluster is m_g, g = 1, 2, …, k; according to the representative sample selection threshold α, selecting a batch of representative samples from each cluster as the new training samples of the PNN network;
step 4.1: randomly selecting K sample points from the m training samples as the initial cluster centers of the K-means clustering algorithm;
Step 4.2: setting the current iteration number as j, and for each sample point p in the training sample t T=1, 2, …, m are calculated to each cluster center in turnThe Euclidean distance d (t, g) of (a) is shown in the following formula:
step 4.3: finding each sample point about the respective cluster centerAnd will correspond to the sample point p t Dividing into and clustering center->The cluster with the smallest distance;
step 4.4: recalculating the cluster center of each cluster as the mean value of its samples, as shown in the following formula:
wherein p_gw represents the w-th sample point in the g-th cluster;
step 4.5: the sum of squares of the distances between the samples in each cluster and the new cluster center is calculated, and the formula is shown as follows:
wherein E_{j+1} represents the sum of squares of the distances between the samples in each cluster and the new cluster centers;
step 4.6: judging whether the current iteration number j is equal to the maximum iteration number J or whether |E_{j+1} − E_j| < ε; if yes, executing step 4.7, otherwise re-executing step 4.2;
step 4.7: counting the number m_g of samples in each cluster, and selecting the m_g·α samples of each cluster nearest to its cluster center according to the sample selection threshold α; these are output as the most representative training samples, resulting in p = m·α new training samples;
step 5: carrying out normalization processing on the new training sample;
step 6: training a PNN network according to the normalized training sample matrix, performing grade evaluation on the normalized test samples by utilizing the trained PNN network, and simultaneously calculating the error rate of classifying the test samples;
step 7: the method comprises the steps of enabling mode layer neurons of the same class in a PNN network to adopt the same smoothing factors and mode layer neurons of different classes to adopt different smoothing factors, then using the error rate of the PNN network for classifying test samples as an objective function of a particle swarm optimization algorithm, modifying smoothing factor parameters in the PNN network through the particle swarm optimization algorithm, realizing optimization of the PNN network, and finally obtaining an optimal PNN classification model;
step 8: and (3) collecting greenhouse environment data to be evaluated, and evaluating the greenhouse environment quality by adopting the optimal PNN classification model obtained in the step (7).
2. The PNN network-based greenhouse environment assessment method according to claim 1, characterized in that the parameters initialized in step 3 are specifically: initializing the number K of clusters in the improved K-means clustering algorithm, the initial cluster centers, an iteration stop threshold ε, a representative sample selection threshold α, a maximum iteration number J and a current iteration number j; initializing the number N of particles, the solution space dimension D, the maximum iteration number max_iter, the initial position vector px and the initial velocity vector pv of the particles in the particle swarm optimization algorithm; the position vector of a particle is denoted px_i = [px_i1, px_i2, …, px_iD], i ∈ [1, N], and its velocity vector is denoted pv_i = [pv_i1, pv_i2, …, pv_iD]; the individual optimal position in the current iteration at which the objective function is minimized is pbest_i = [pbest_i1, pbest_i2, …, pbest_iD], and the optimal position of the population is gbest = [gbest_1, gbest_2, …, gbest_D]; the minimum values of the objective function experienced by the individuals and the population in the iterative process are p_fitness_i and g_fitness respectively; all smoothing factor values in the PNN network are initialized to 0.1.
3. A PNN network-based greenhouse environment assessment method according to claim 2, wherein: the specific method for carrying out normalization processing on the new training samples in the step 5 is as follows:
setting a new training sample matrix X as shown in the following formula
Wherein p represents the number of new training samples, and q represents the dimension of the new training samples;
and carrying out normalization processing on the new training sample matrix X through a normalization factor matrix B to obtain a matrix C, wherein the expressions of the matrix B and the matrix C are shown as follows:
4. a PNN network-based greenhouse environment assessment method according to claim 3, wherein: the PNN network structure comprises an input layer, a mode layer, a summation layer and an output layer; the input layer does not process the data and sends the data into the mode layer; the number of the neurons of the mode layer is equal to the number of training samples, and the activation function is a Gaussian function; the connection mode of the summation layer and the mode layer is sparse connection, and the neuron number of the summation layer is equal to the class number of the training samples; and the output layer selects the category corresponding to the maximum posterior probability for output according to the Bayesian decision rule.
5. A PNN network-based greenhouse environment assessment method according to claim 4, wherein: the specific method of the step 6 is as follows:
step 6.1: constructing a mode layer of the PNN network by using the normalized training sample matrix C;
after the new training sample matrix X is normalized, a training sample matrix C is obtained, and the following formula is shown:
the training matrix C has p training samples and is divided into M classes; the numbers of training samples of the M classes are denoted h_1, h_2, …, h_M, so that:
p = h_1 + h_2 + … + h_M (8)
assuming that the M classes of samples are arranged sequentially in the sample matrix C, each neuron of the mode layer is numbered sequentially from 1 to p; neurons numbered 1 through h_1 correspond to class 1 training samples, neurons numbered h_1 + 1 through h_1 + h_2 correspond to class 2 training samples, and so on, and neurons numbered p − h_M + 1 through p belong to the class M samples;
step 6.2: calculating Euclidean distance between each test sample in the test sample matrix and each training sample in the training set;
the test sample matrix T consisting of l=n-m test samples and normalized is shown in the following formula:
the Euclidean distance matrix E_d between each test sample and each training sample is shown in the following formula:
step 6.3: activating the mode layer neurons by using radial basis functions;
selecting a Gaussian function as an activation function of the mode layer neuron, and calculating an activated probability matrix U, wherein the probability matrix U is shown in the following formula:
wherein σ_1, σ_2, …, σ_p respectively represent the smoothing factors of the p mode layer neurons, and at the beginning all smoothing factors are set to the same value, namely σ_1 = σ_2 = … = σ_p = 0.1;
Step 6.4: solving the initial probability and matrix S of each class of sample to be tested by a summation layer, wherein the initial probability and matrix S are represented by the following formula:
step 6.5: calculating the probability prob_ab that the a-th sample belongs to the b-th class from the initial probability sums of the samples to be tested over the classes, as shown in the following formula:
wherein a ∈ [1, l], b ∈ [1, M];
step 6.6: according to the Bayesian decision theorem and the probability that each sample to be tested belongs to various types, the class corresponding to the a-th sample to be tested is determined, and the following formula is shown:
y_a = arg max_b (prob_ab) (14)
wherein y_a represents the prediction result of the PNN network for the a-th test sample, i.e. the category corresponding to the a-th test sample;
step 6.7: calculating the error rate of the PNN network for classifying the test samples, wherein the error rate is shown in the following formula:
wherein ER represents the error rate of the PNN network in classifying the test samples, n_e represents the number of samples misclassified by the PNN network, and l = n − m is the number of test samples.
6. A PNN network-based greenhouse environment assessment method according to claim 5, wherein:
step 7.1: in the β-th iteration of the particle swarm optimization algorithm optimizing the PNN network parameters, first update the velocity vector pv_i and the position vector px_i of each particle in the particle swarm optimization algorithm, as shown in the following formulas:
pv_i^{β+1} = ω·pv_i^β + c_1·μ_1·(pbest_i^β − px_i^β) + c_2·μ_2·(gbest^β − px_i^β) (16)
px_i^{β+1} = px_i^β + pv_i^{β+1} (17)
wherein ω is the inertia weight, representing the search capability of the particle swarm optimization algorithm; c_1, c_2 are the learning factors of the individual extremum point and the global extremum point respectively; μ_1, μ_2 respectively represent random numbers between 0 and 1; since the objects optimized by the PSO algorithm are the smoothing factors adopted by each class of mode layer neurons, the particle solution space dimension D = M;
step 7.2: the updated particle position vector represents a feasible solution for the PNN network smoothing factors; substitute the particle position vector px_i^{β+1} for the values of the PNN network smoothing factors, then calculate the error rate f(px_i^{β+1}) of the PNN network classifying the corresponding test samples; and according to the following update rule, update the individual optimal position information pbest_i of each particle and the minimum value p_fitness_i of the objective function, simultaneously updating the optimal position information gbest experienced by the population and the minimum value g_fitness of the objective function;
the update rule is as follows:
if f(px_i^{β+1}) < p_fitness_i, then pbest_i = px_i^{β+1} and p_fitness_i = f(px_i^{β+1}); if p_fitness_i < g_fitness, then gbest = pbest_i and g_fitness = p_fitness_i; otherwise gbest and g_fitness remain unchanged;
step 7.3: when the iteration number reaches the maximum iteration number max_iter, the particle swarm optimization algorithm is terminated, wherein gbest represents an optimal solution of the PNN network on the smoothing factor, and g_fitness represents an error rate of classifying the test sample by using the PNN network with gbest as the optimal smoothing factor; otherwise, step 7.1 is re-executed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010678726.6A CN111797937B (en) | 2020-07-15 | 2020-07-15 | Greenhouse environment assessment method based on PNN network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111797937A CN111797937A (en) | 2020-10-20 |
CN111797937B true CN111797937B (en) | 2023-06-13 |
Family
ID=72807092
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109120961A (en) * | 2018-07-20 | 2019-01-01 | 南京邮电大学 | The prediction technique of the QoE of IPTV unbalanced dataset based on PNN-PSO algorithm |
CN110909802A (en) * | 2019-11-26 | 2020-03-24 | 西安邮电大学 | Improved PSO (particle swarm optimization) based fault classification method for optimizing PNN (portable network) smoothing factor |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7034701B1 (en) * | 2000-06-16 | 2006-04-25 | The United States Of America As Represented By The Secretary Of The Navy | Identification of fire signatures for shipboard multi-criteria fire detection systems |
Non-Patent Citations (2)
Title |
---|
Khakzad, Hamid. "Improving performance of classification on severity of ill effects (SEV) index on fish using K-Means clustering algorithm with various distance metrics." Water Practice & Technology, vol. 14, no. 1, 2019, pp. 101-117. * |
Guo Quanyou et al. "Influence of environmental factors on the growth/no-growth interface of Vibrio alginolyticus in lightly salted large yellow croaker." Transactions of the Chinese Society of Agricultural Engineering, vol. 34, no. 3, pp. 292-299. * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||