CN108229065A - A kind of construction method and Forecasting Methodology of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature - Google Patents

A kind of construction method and Forecasting Methodology of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature Download PDF

Info

Publication number
CN108229065A
CN108229065A CN201810119071.1A CN201810119071A CN108229065A CN 108229065 A CN108229065 A CN 108229065A CN 201810119071 A CN201810119071 A CN 201810119071A CN 108229065 A CN108229065 A CN 108229065A
Authority
CN
China
Prior art keywords
leaf
sample
xylophyta
photosynthetic rate
net photosynthetic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810119071.1A
Other languages
Chinese (zh)
Other versions
CN108229065B (en
Inventor
张晓宇
吉心悦
胡梦瑶
宋跃朋
张德强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Forestry University
Original Assignee
Beijing Forestry University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Forestry University filed Critical Beijing Forestry University
Priority to CN201810119071.1A priority Critical patent/CN108229065B/en
Publication of CN108229065A publication Critical patent/CN108229065A/en
Application granted granted Critical
Publication of CN108229065B publication Critical patent/CN108229065B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/231Hierarchical techniques, i.e. dividing or merging pattern sets so as to obtain a dendrogram

Abstract

The present invention provides the construction methods and Forecasting Methodology of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature, belong to field of bioinformatics.The present invention is clustered using the linkage that will be averaged and PAM clustering procedures combine, different sub- kinds is divided into the xylophyta of diverse geographic location, the difference of different sub- kinds is eliminated more reasonably to be predicted Net Photosynthetic Rate, the present invention is utilized respectively phenotypic data of the gradient boosting algorithm based on blade sample in every sub- kind and builds Net Photosynthetic Rate prediction model simultaneously, during model is established, present invention firstly provides gradient boosting algorithm iteration stopping criterion, xylophyta leaf morphology data for different sub- kinds provide the residual error critical value of algorithm iteration, establish the populus simonii Net Photosynthetic Rate prediction model of the improvement gradient boosting algorithm based on iteration stopping criterion, in the case of known leaf morphology data, there is ideal prediction result to the Net Photosynthetic Rate of the sub- kind of different populus simoniis.

Description

A kind of structure of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature Construction method and Forecasting Methodology
Technical field
The invention belongs to biological information fields, and in particular to a kind of net photosynthesis speed based on xylophyta leaf morphology feature The construction method and Forecasting Methodology of rate prediction model.
Background technology
Net Photosynthetic Rate is to weigh the important indicator of photosynthesis of plant intensity, and weight is played the part of in plant growth and development process It acts on (ShipleyB et al.2005).19th century, Sachs propositions first measure photosynthetic rate with half leaf method, this is earliest Applied to the method for value in measuring photosynthesis, it does not need to complicated instrument and equipment, simple and easy to do.But it can be broken using half leaf method Bad measured material, it is impossible to which METHOD FOR CONTINUOUS DETERMINATION, and minute is long, environmental condition is difficult to control, and leads to that determination efficiency is low, error is big (Wei Yongsheng .2008).Sachs in 1883 initiates the photosynthetic rate that plant is measured with dry weight method, the initial stage seventies, the Chinese Academy of Sciences Shanghai plant physiology research institute prevents photosynthate using the method (being known as improved dry weight method) of scald petiole base bast It transports, makes measurement result closer to reality, so as to promote the application of dry weight method and research.But due to the difference of environmental condition, Measured result is difficult to be compared to each other (RoderiekH.1978 under the conditions of different times;Wang Tian is into .1988;Pei Baohua et al.1990)。
At present, CO2Infrared gas analysis method in absorption process is the widely used method for measuring plant Net Photosynthetic Rate (yellow refined perfume .2003), development have passed through three phases.First stage is laboratory stage, starts the beginning of the fifties to apply, match Relatively easy, only leaf chamber (or assimilation box) and the corresponding configurations such as gas circuit and air pump are put, takes artificial reading and calculating, such as U.S. The infrared CO of BACKMAN companies2Analyzer, the FQ series CO of China Foshan analytical instrument factory production2Analyzer etc., later Small-sized DC dual-purpose CO is developed again2Analyzer, such as the GXH 1 of Beijing Analytical Instrument Factory's production.Second stage is quotient The product stage, in order to increase finding speed, energy multi-site determination, CO2Multipath conversion is configured on analyzer and corresponding record fills It puts, to reach the 10 combination of channels formula of 6 channels and mono- 865 types of U.S. BAcKMAN of machine multiple spot channel measure, such as Japan's production CO2Analyzer.Later in order to improve measurement accuracy and determination condition can be controlled, it is configured with the environmental control system of leaf chamber, such as moral The MINICUVITTECO of state's production2-H2O measurement systems.Phase III is the portable stage.The generation in this stage is due to list Piece machine, integrated circuit and Sensor Technology Development as a result, being launched in the mid-80.Microcontroller is by photosynthetic continuous mode Involved in the physical quantitys such as CO2, temperature, humidity, PAR (photosynthetically active radiation) and flow carry out various operations, substantially increase Determination efficiency.The data storage of configuration can store big amount measurement data, pass through additional leaf chamber, temperature, wet, light, CO2Concentration control System processed carrys out Regulate Environment factor.The LCA-III types, the mono- IV types of LCA that have Adc Inc. more commonly used at present, U.S. LICOR The LICOR-6200 and LICOR 1 (Jiang Gaoming, 1996) of company, mono- 301PS, CI-510 and CI 1 of CI of CID companies And the BAU photosynthesis measurement systems of China Agricultural University's production.When LI-6400XT is used to measure Net Photosynthetic Rate, need upper Photosynthetic photon flux density (PPFD) is set in 1600 μm of olm by 9 points to 11 points of the period of noon-2·s-1, concentration is set as 400μmol-1, when Net Photosynthetic Rate becomes constant (± 0.1), record the value of Net Photosynthetic Rate.As it can be seen that current existing net light It is generally higher to environmental requirement to close rate determination method.
Invention content
In view of this, the purpose of the present invention is to provide a kind of sides for rapidly and accurately predicting xylophyta Net Photosynthetic Rate Method, the method are not limited by instrument and environment.
In order to achieve the above-mentioned object of the invention, the present invention provides following technical scheme:
The present invention provides the construction method of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature, packets Include following steps:
1) the xylophyta blade of the same species of China's covering entire geographical distribution of xylophyta is chosen, obtains woody plant Object blade sample;
2) the leaf morphology feature of the xylophyta blade sample is measured, obtains the blade table of each xylophyta individual Type data;
The leaf morphology feature includes leaf area, leaf length, leaf width degree, ratio of length to breadth, leaf perimeter and the leaf shape factor;
3) based on the leaf morphology data, using average linkage clustering method and PAM clustering methods to xylophyta Body carries out Clustering, obtains xylophyta individual phylogenetic group;
4) Net Photosynthetic Rate of xylophyta blade sample in the step 1) is measured, obtains the Net Photosynthetic Rate number of sample According to;
5) using gradient boosting algorithm, according to iteration stopping criterion to the blade of each xylophyta individual of sample phylogenetic group The Net Photosynthetic Rate data of phenotypic data and the sample build model, obtain Net Photosynthetic Rate prediction model;
The Net Photosynthetic Rate prediction model is shown in formula (1);
Wherein, M represents iterations;xiRepresent six-vector (x1i, x2i, x3i, x4i, x5i, x6i), it corresponds to i-th respectively The area of populus simonii sample, length, width, perimeter, length-width ratio and the leaf factor;RmjWhen representing the m times iteration, j-th of leaflet Poplar sample set;cmjWhen representing the m times iteration, in j-th of populus simonii sample set, pass through minimumObtained constant;
There is no time restriction between the step 2) and step 4).
Preferably, construction method in the step 5), includes the following steps:
A. initialization constants c so thatMinimum,To lose letter Number, N are sample number, yiNet Photosynthetic Rate value for i-th of populus simonii sample;I represents vector, is a six-vector (x1i, x2i, x3i, x4i, x5i, x6i), x1i, x2i, x3i, x4i, x5i, x6iThe leaf area of i-th of xylophyta sample is corresponded to respectively, and leaf is long Degree, leaf width degree, leaf perimeter, ratio of length to breadth and the leaf shape factor;
B. it to iterations m=1,2...M, if i=1,2...N, calculatesIt obtains The negative gradient of loss function, using obtained negative gradient as the estimation of residual error, fm(x) for the m times iteration function when, according to rmi Learn a regression tree;M is the m times iteration, and i is i-th of sample;Function when f (x) is the m-1 times iteration, first time iteration When be f0(x), i.e.,And f (xi) it is by the leaf area of i-th of xylophyta sample, leaf length, leaf width degree, leaf Perimeter, ratio of length to breadth and the leaf shape factor substitute into function f (x);
C. so thatMinimum acquires cmj, update regression tree
Wherein, λ ∈ (0.001,0.01);M be the m time iteration, j be j-th of region, the leaf of substantially several xylophytas The set of piece sample;RmjDuring for the m times iteration, all samples have been divided into J set, and j represents j-th of set;
D. the residual error critical value of iteration stopping is calculatedWherein n be sample number, s2For sample side Difference, α are significance;When residual error is less than σIt is criticalWhen, otherwise iteration stopping, carries out next iteration until residual error is less than σIt is critical, Obtaining regression tree model is
Preferably, the side of Clustering is carried out in the step 3) to xylophyta individual using averagely linkage clustering method Method includes the following steps:
1. the phenotypic characteristic data of xylophyta sample are chosen as the variable needed;
2. the variable data is normalized, normalization variable data is obtained;
3. calculate Distance matrix D using the normalization variable dataij
4. utilize hclust () function pair Distance matrix D of R languageijImplementation level clusters, and two closest classes are drawn Into one kind, wherein between class distance is defined as:Point and the average distance at another class midpoint in one class.
Preferably, the PAM clustering methods include the following steps:
A, sample is set as X { x (1), x (2) ... .. };
B. k cluster centre is randomly selected in the sample first;
C. the distance to each cluster centre and then to the sample point in addition to cluster centre is calculated, sample is referred to distance The nearest sample point of center of a sample, obtains initial cluster;
D. other sample points in each initial cluster in addition to the point at class center are calculated with the distance to other all the points The minimum value is just realized primary cluster optimization by the minimum value of sum;
E. step d is repeated, until the position of cluster centre twice no longer changes, obtains cluster result.
Preferably, the quantity of xylophyta sample is 200~500 in the step 1).
Preferably, shown in step 2) the middle period form factor calculation formula such as formula (2);
Preferably, when measure, will for 9 points to 11 points of the morning time of the measure of Net Photosynthetic Rate in the step 4) Photosynthetic photon flux density is set in 1600 μm of olm-2·s-1, concentration is set as 400 μm of ol-1
The present invention provides it is a kind of based on the prediction model that the construction method obtains to xylophyta Net Photosynthetic Rate Forecasting Methodology includes the following steps:
A. the leaf morphology feature of xylophyta to be predicted is measured, obtains leaf morphology characteristic, the leaf morphology Feature includes leaf area, leaf length, leaf width degree, ratio of length to breadth, leaf perimeter and the leaf shape factor;
B. the leaf morphology data are inputted in the prediction model obtained in said program construction method, obtains prediction wood The predicted value of this plant Net Photosynthetic Rate.
The present invention provides the construction method of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature, by Different in the geographical location that the xylophyta of same species is distributed, the growing environment in different geographical locations makes the xylophyta With different leaves phenotypic characteristic and Net Photosynthetic Rate, the present invention will be using will averagely linkage cluster and PAM clustering procedures combine, to not Xylophyta with geographical location is divided into different sub- kinds, therefore eliminates the difference of different sub- kinds so as to net photosynthesis speed Rate more reasonably predicted, at the same the present invention using the method for average linkage cluster and PAM clusters combine to blade sample into Row cluster is so that more accurate to the division of sub- kind;The present invention is utilized respectively gradient boosting algorithm in every sub- kind simultaneously Phenotypic data structure Net Photosynthetic Rate prediction model based on blade sample, during model is established, the present invention carries for the first time Gradient boosting algorithm iteration stopping criterion is gone out, the xylophyta leaf morphology data for different sub- kinds provide algorithm iteration Residual error critical value, establish the improvement gradient boosting algorithm based on iteration stopping criterion populus simonii Net Photosynthetic Rate prediction mould Type in the case of known leaf morphology data, has the Net Photosynthetic Rate of the sub- kind of different populus simoniis ideal pre- Survey result.
Description of the drawings
Fig. 1 is the cluster result figure that averagely links in embodiment 1;
Fig. 2 is PAM cluster result figures in embodiment 1.
Specific embodiment
The present invention provides the construction method of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature, packets Include following steps:
1) the xylophyta blade of the same species of China's covering entire geographical distribution of xylophyta is chosen, obtains woody plant Object blade sample;
2) the leaf morphology feature of the xylophyta blade sample is measured, obtains the blade table of each xylophyta individual Type data;
The leaf morphology feature includes leaf area, leaf length, leaf width degree, ratio of length to breadth, leaf perimeter and the leaf shape factor;
3) based on the leaf morphology data, using average linkage clustering method and PAM clustering methods to xylophyta Body carries out Clustering, obtains xylophyta individual phylogenetic group;
4) Net Photosynthetic Rate of xylophyta blade sample in the step 1) is measured, obtains the Net Photosynthetic Rate number of sample According to;
5) using gradient boosting algorithm, according to iteration stopping criterion to the blade of each xylophyta individual of sample phylogenetic group The Net Photosynthetic Rate data of phenotypic data and the sample build model, obtain Net Photosynthetic Rate prediction model;
The Net Photosynthetic Rate prediction model is shown in formula (1);
Wherein, M represents iterations;xiRepresent six-vector (x1i, x2i, x3i, x4i, x5i, x6i), it corresponds to i-th respectively The area of populus simonii sample, length, width, perimeter, length-width ratio and the leaf factor;RmjWhen representing the m times iteration, j-th of leaflet Poplar sample set;cmjWhen representing the m times iteration, in j-th of populus simonii sample set, pass through minimumObtained constant.
There is no time restriction between the step 2) and step 4).
The present invention chooses the xylophyta blade of the same species of China's covering entire geographical distribution of xylophyta, obtains wood This plant leaf blade sample.
The present invention does not have the type of the xylophyta particular/special requirement, and method provided by the invention is applicable to this field Any xylophyta kind known to technical staff.The xylophyta kind is preferably willow platymiscium, more preferably small Ye Yang.In embodiments of the present invention, it is tested for the xylophyta selection populus simonii.
In the present invention, when the xylophyta be populus simonii when, the geographical distribution be preferably Shaanxi, Qinghai, Hebei, Henan, Ningxia, Beijing and Inner Mongol.The quantity of the xylophyta blade is preferably 1000~1500.From the xylophyta Individual representative is preferably chosen in blade and obtains xylophyta blade sample.The quantity of the xylophyta blade sample is preferably 200~500.The standard that the selection individual represents is healthy and complete as blade.
After obtaining xylophyta blade sample, the present invention measures the leaf morphology feature of the xylophyta blade sample, Obtain the leaf morphology data of each xylophyta individual;The leaf morphology feature include leaf area, leaf length, leaf width degree, Ratio of length to breadth, leaf perimeter and the leaf shape factor.
In the present invention, the measurement is preferably used with instrument and utilizes portable laser leaf area device (CI-202).
In the present invention, shown in the leaf shape factor calculation formula such as formula (2);
The present invention is based on the leaf morphology data, using average linkage clustering method and PAM clustering methods to woody plant Object individual carries out Clustering, obtains xylophyta individual phylogenetic group.
In the present invention, the method for carrying out Clustering to xylophyta individual using averagely linkage clustering method, Include the following steps:
1. the phenotypic characteristic data of xylophyta sample are chosen as the variable needed;
2. the variable data is normalized, the variable data that obtains that treated;
3. treated described in, variable data calculates Distance matrix Dij
4. the hclust () using R language adjusts the distance matrix DijImplementation level clusters, and two closest classes are divided into one Class, wherein between class distance are defined as:Point and the average distance at another class midpoint in one class.
In the present invention, the PAM clustering methods include the following steps:
A, sample is set as X { x (1), x (2) ... .. };
B. k cluster centre is randomly selected in the sample first;
C. the distance to each cluster centre and then to the sample point in addition to cluster centre is calculated, sample is referred to distance The nearest sample point of center of a sample, obtains initial cluster;
D. to the distance sum of other sample points calculating in each cluster in addition to the point at class center to other all the points most Small value, using the minimum value as new cluster, center just realizes primary cluster optimization;
E. step d is repeated, until the position of cluster centre twice no longer changes, obtains cluster result.
In the present invention, the cluster result result that two kinds of clustering methods obtain unanimously is divided according to result, still The third clustering method FCM methods are used when the result that two kinds of clustering methods obtain is inconsistent, artificial contrast simultaneously chooses and FCM methods Closer Clustering Model.
The present invention measures the Net Photosynthetic Rate of the xylophyta blade sample, obtains the Net Photosynthetic Rate data of sample.
In the present invention, the time of the measure of the Net Photosynthetic Rate is 9 points to 11 points of the morning.It will be photosynthetic during the measure Photon flux density is set in 1600 μm of olm-2·s-1, concentration is set as 400 μm of ol-1.The measure of the Net Photosynthetic Rate is used Instrument is preferably LI-6400XT.When Net Photosynthetic Rate becomes constant (± 0.1), the value of Net Photosynthetic Rate is recorded.In order to obtain Accurate data measure the Net Photosynthetic Rate of each blade three times, and the measured value for the Net Photosynthetic Rate that is averaged, more accurate to provide Data.
After obtaining leaf morphology characteristic and net photosynthetic rate data, the present invention is using gradient boosting algorithm to every The leaf morphology data of a xylophyta individual of sample phylogenetic group and the Net Photosynthetic Rate data structure model of the sample, obtain Net Photosynthetic Rate prediction model.
In the present invention, using the method for gradient boosting algorithm structure prediction model, following steps are preferably included:
A. initialization constants c so thatMinimum,To lose letter Number, N are sample number, yiNet Photosynthetic Rate value for i-th of xylophyta sample;I represents vector, is a six-vector (x1i, x2i, x3i, x4i, x5i, x6i), x1i, x2i, x3i, x4i, x5i, x6iThe leaf area of i-th of xylophyta sample, leaf are corresponded to respectively Length, leaf width degree, leaf perimeter, ratio of length to breadth and the leaf shape factor;
B. it to iterations m=1,2...M, if i=1,2...N, calculatesIt obtains The negative gradient of loss function, using obtained negative gradient as the estimation of residual error, fm(x) for the m times iteration function when, according to rmi Learn a regression tree;M is the m times iteration, and i is i-th of sample;Function when f (x) is the m-1 times iteration, first time iteration When be f0(x), i.e.,And f (xi) it is by the leaf area of i-th of xylophyta sample, leaf length, leaf width degree, leaf Perimeter, ratio of length to breadth and the leaf shape factor substitute into function f (x);
C. so thatMinimum acquires cmj, update regression tree
Wherein, λ ∈ (0.001,0.01);M be the m time iteration, j be j-th of region, the leaf of substantially several xylophytas The set of piece sample;RmjDuring for the m times iteration, all samples have been divided into J set, and j represents j-th of set;
D. the residual error critical value of iteration stopping is calculatedWherein n be sample number, s2For sample side Difference, s are sample standard deviation, and α is significance;When residual error is less than σIt is criticalWhen, otherwise iteration stopping, it is straight to carry out next iteration It is less than σ to residual errorIt is critical, obtaining regression tree model is
In the present invention, iteration stopping criterion is used in the gradient boosting algorithm for the first time.The iteration stopping criterion is such as Under:
Under the hypothesis of gradient lift scheme, the residual error critical value σ of iteration stoppingIt is critical, the σIt is criticalSee formula (3),
Wherein, n is sample number, s2For sample variance, α is significance;
Card:To each predicted value, haveI=1,2..., n consider Hypothesis Testing Problem,
Choose test statistics
WhenWhen, χ22(n-1),
Then the region of rejection for the inspection that significance is α is
I.e.So
The present invention provides a kind of prediction sides of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature Method includes the following steps:
A. the leaf morphology feature of xylophyta to be predicted is measured, obtains leaf morphology characteristic;
B. the leaf morphology data are inputted in the prediction model obtained in said program construction method, obtains prediction wood The predicted value of this plant Net Photosynthetic Rate.
In the present invention, the leaf morphology feature preferably includes leaf area, leaf length, leaf width degree, ratio of length to breadth, leaf week The long and leaf shape factor.The assay method of the leaf morphology feature is with above-mentioned record, and this will not be repeated here.
With reference to embodiment to a kind of Net Photosynthetic Rate based on xylophyta leaf morphology feature provided by the invention The construction method and Forecasting Methodology of prediction model are described in detail, but they cannot be interpreted as protecting model to the present invention The restriction enclosed.
Embodiment 1
S101:In national entire populus simonii geographical distribution (Shaanxi, Qinghai, Hebei, Henan, Ningxia, Beijing, Inner Mongol), 1233 populus simonii samples are acquired, we choose 235 individuals therein and represent.Shandong Province of China province close county (36 ° of 23'N, 115 ° of 47'E) clone plant garden in, vegetative propagation is carried out to it with this 235 individual roots.In our current research, these nothings The individual of pass is used for research and utilization leaf morphology data prediction Net Photosynthetic Rate.
S102:Blade area, length, width, perimeter, length-width ratio are measured using portable laser leaf area device (CI-202) With this six leaf morphology data of the leaf factor.
S103:With reference to average linkage cluster and PAM clusters, based on the leaf morphology data that CI-202 measurements obtain, to 235 A division for representing individual and carrying out sub- kind.
The average linkage cluster:
1. the variable that choosing needs is geographical location difference;
2. a pair variable data is normalized;
3. calculate Distance matrix Dij
4. utilize hclust () function pair DijImplementation level clusters, and two closest classes is divided into one kind, wherein between class Distance definition is:Point and the average distance at another class midpoint in one class.
PAM is clustered:
1. sample is set as X { x (1), x (2) ... .. };
2. randomly select k cluster centre in the sample first;
3. and then the distance to each cluster centre is calculated to the sample point outside except cluster centre, by sample be referred to away from The sample point nearest from center of a sample, this just realizes initial cluster;
4. again to the distance sum of other sample points calculating in each class in addition to the point at class center to other all the points most Small value, using the minimum point as new cluster, center just realizes primary cluster optimization;
5. repeating step 4, until the position of cluster centre twice no longer changes, this just completes final cluster.
S104:Maximum net photosynthetic rate in order to obtain, in 9 points to 11 points of the morning, by photosynthetic photon flux density (PPFD) It is set in 1600 μm of olm-2·s-1, concentration is set as 400 μm of ol-1, Net Photosynthetic Rate is measured using LI-6400XT.When net When photosynthetic rate becomes constant (± 0.1), the value of Net Photosynthetic Rate is recorded.Accurate data in order to obtain, we measure often The Net Photosynthetic Rate of a blade three times, and the measured value for the Net Photosynthetic Rate that is averaged, to provide more accurately data.
S105:Individual specimen is represented for 235, in every sub- kind after being divided by S103, is utilized respectively ladder Spend phenotypic data structure Net Photosynthetic Rate prediction model of the boosting algorithm based on blade sample.
S1051, initialization constants c so thatMinimum,
WhereinFor loss function, N is sample number,
yiNet Photosynthetic Rate value for i-th of populus simonii sample;I represents vector, is a six-vector (x1i, x2i, x3i, x4i, x5i, x6i), x1i, x2i, x3i, x4i, x5i, x6iThe leaf area of i-th of xylophyta sample, leaf length, leaf are corresponded to respectively Width, leaf perimeter, ratio of length to breadth and the leaf shape factor;
S1052. to iterations m=1,2...M, if i=1,2...N, r is calculatedmiFunction, For the negative gradient of loss function, using the negative gradient of loss function as the estimation of residual error, fm(x) for the m times iteration function when, According to rmiLearn a regression tree;
Wherein, m is the m times iteration, and i is i-th of sample;Function when f (x) is the m-1 times iteration, during first time iteration For f0(x), i.e.,
f(xi) it is by the area of i-th of populus simonii sample, length, width, perimeter, length-width ratio and the leaf factor substitute into letter Number f (x);
S1053. pass through minimumAcquire cmj
Wherein, m be the m time iteration, j be j-th of region, the set of substantially several populus simonii samples;
RmjDuring for the m times iteration, all samples have been divided into J set, and j represents j-th of set;
S1054. the residual error critical value of iteration stopping is calculatedWherein n be sample number, s2For sample This variance, s are sample standard deviation, and α is significance.When residual error is less than σIt is criticalWhen, otherwise iteration stopping, is changed next time Generation.
S1055. final regression tree model is
S106:Leaf morphology data (area, length, width, perimeter, the length of populus simonii to be predicted are measured using CI-202 Width than with the leaf factor).
S107:Based on populus simonii leaf morphology data to be predicted, the sub- kind divided using S103, with reference to average Dynamic cluster and PAM clusters, sub- kind division is carried out to populus simonii to be predicted.
S108:By S107, the sub- kind of populus simonii to be predicted has been obtained, the ladder of S105 structures is used in this sub- kind Spend lift scheme prediction populus simonii Net Photosynthetic Rate.
Prediction result
Since the geographical location of populus simonii distribution is different, different growing environments makes populus simonii have different characteristic, can quilt It is divided into different sub- kinds.Therefore, it is more reasonably pre- to be carried out to Net Photosynthetic Rate to eliminate the difference of different sub- kinds It surveys, carries out cluster analysis using averagely linkage method and PAM methods first when establishing prediction model, as a result as depicted in figs. 1 and 2.
From the point of view of cluster result, first kind populus simonii sample is from Shaanxi (110 degree of east longitude, 34 degree of north latitude) and Henan (east Through 113 degree, 33 degree of north latitude), the two province geographical locations are close, and climate type belongs to temperate zone subtropical monsoon climate, this The populus simonii phenotypic characteristic of one subclass is that blade area is big, and the median of blade area has reached 59.95cm2, the leaf factor Median is 4.52.
Second class populus simonii sample is essentially from western part of China, including Shaanxi, Qinghai, Ningxia.This subclass populus simonii leaf Piece area median is 25.13cm2, the median of the leaf factor is maximum, and it is 4.60 to be worth.
Third class populus simonii sample covers most provinces, essentially from Hebei, these central plain areas of Henan.It is this kind of small The blade area of Ye Yang and the second class populus simonii are similar, and blade area median is 24.65cm2, but the leaf factor is also with preceding two Class populus simonii gap is larger, and leaf factor median is only 4.32.
According to algorithm iteration stopping criterion, first kind populus simonii sample is utilizing gradient boosting algorithm iteration 1534 times Afterwards, it is upper and lower 1.006 to float to error existing for the prediction of each sample Net Photosynthetic Rate.Second and third class populus simonii sample exists Respectively after iteration 916 times, 231 times, it is respectively up and down 1.275, upper and lower 1.965 that prediction error, which is floated,.Predictablity rate is three 93.3%, 91.1%, 88.5% has been respectively reached in class populus simonii.
As can be seen that in the case of known leaf morphology feature, the model that the present invention is built is effectively improved net light Close the precision of prediction of rate.
The above is only the preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, various improvements and modifications may be made without departing from the principle of the present invention, these improvements and modifications also should It is considered as protection scope of the present invention.

Claims (8)

1. the construction method of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature, includes the following steps:
1) the xylophyta blade of the same species of China's covering entire geographical distribution of xylophyta is chosen, obtains xylophyta leaf Piece sample;
2) the leaf morphology feature of the xylophyta blade sample is measured, obtains the leaf morphology number of each xylophyta individual According to;
The leaf morphology feature includes leaf area, leaf length, leaf width degree, ratio of length to breadth, leaf perimeter and the leaf shape factor;
3) based on the leaf morphology data, using average linkage clustering method and PAM clustering methods to xylophyta individual into Row Clustering obtains xylophyta individual phylogenetic group;
4) Net Photosynthetic Rate of xylophyta blade sample in the step 1) is measured, obtains the Net Photosynthetic Rate data of sample;
5) blade phenotypic data in each xylophyta individual phylogenetic group after being grouped using gradient boosting algorithm to the step 3) The Net Photosynthetic Rate data structure model of the sample obtained with the step 4), obtains Net Photosynthetic Rate prediction model;
The Net Photosynthetic Rate prediction model is shown in formula (1);
Wherein, M represents iterations;xiRepresent six-vector (x1i, x2i, x3i, x4i, x5i, x6i), i-th of populus simonii is corresponded to respectively The area of sample, length, width, perimeter, length-width ratio and the leaf factor;RmjWhen representing the m times iteration, j-th of populus simonii sample Set;cmjWhen representing the m times iteration, in j-th of populus simonii sample set, pass through minimum Obtained constant;
There is no time restriction between the step 2) and step 4).
2. construction method according to claim 1, which is characterized in that construction method in the step 5), including following step Suddenly:
A. initialization constants c so thatMinimum,For loss function, N For sample number, yiNet Photosynthetic Rate value for i-th of woody sample;I represents vector, is a six-vector (x1i, x2i, x3i, x4i, x5i, x6i), x1i, x2i, x3i, x4i, x5i, x6iThe leaf area of i-th of xylophyta sample, leaf length, leaf are corresponded to respectively Width, leaf perimeter, ratio of length to breadth and the leaf shape factor;
B. it to iterations m=1,2...M, if i=1,2...N, calculatesIt is lost The negative gradient of function, using obtained negative gradient as the estimation of residual error, fm(x) for the m times iteration function when, according to rmiStudy One regression tree;M is the m times iteration, and i is i-th of sample;Function when f (x) is the m-1 times iteration, when first time iteration be f0(x), i.e.,And f (xi) it is the leaf length by the leaf area of i-th of xylophyta sample, leaf width degree, leaf perimeter, Ratio of length to breadth and the leaf shape factor substitute into function f (x);
C. so thatMinimum acquires cmj, update regression tree
Wherein, λ ∈ (0.001,0.01);M be the m time iteration, j be j-th of region, the blade sample of substantially several xylophytas This set;RmjDuring for the m times iteration, all samples have been divided into J set, and j represents j-th of set;
D. the residual error critical value of iteration stopping is calculatedWherein n be sample number, s2For sample variance, s For sample standard deviation, α is significance;When residual error is less than σIt is criticalWhen, otherwise iteration stopping, carries out next iteration until residual Difference is less than σIt is critical, obtaining regression tree model is
3. construction method according to claim 1, which is characterized in that using the clustering method that averagely links in the step 3) The method that Clustering is carried out to xylophyta individual, includes the following steps:
1. the phenotypic characteristic data of xylophyta sample are chosen as the variable needed;
2. the variable data is normalized, normalization variable data is obtained;
3. calculate Distance matrix D using the normalization variable dataij
4. utilize hclust () function pair Distance matrix D of R languageijImplementation level clusters, and two closest classes are divided into one Class, wherein between class distance are defined as:Point and the average distance at another class midpoint in one class.
4. construction method according to claim 1, which is characterized in that the PAM clustering methods include the following steps:
A, sample is set as X { x (1), x (2) ... .. };
B. k cluster centre is randomly selected in the sample;
C. the distance to each cluster centre is calculated the sample point in addition to cluster centre, sample is referred to distance sample center Nearest sample point obtains initial cluster;
D. to the distance sum of other sample points calculating in each initial cluster in addition to the point at class center to other all the points most The minimum value is just realized primary cluster optimization by small value;
E. step d is repeated, until the position of cluster centre twice no longer changes, obtains cluster result.
5. construction method according to claim 1, which is characterized in that the quantity of xylophyta sample is in the step 1) 200~500.
6. construction method according to claim 1, which is characterized in that step 2) the middle period form factor calculation formula is such as Shown in formula (2);
7. construction method according to claim 1, which is characterized in that in the step 4) measure of Net Photosynthetic Rate when Between for 9 points to 11 points of the morning, photosynthetic photon flux density is set in 1600 μm of olm during the measure-2·s-1, concentration sets It is set to 400 μm of ol-1
8. it is a kind of based on the prediction model that construction method claim 1~7 described obtains to the pre- of xylophyta Net Photosynthetic Rate Survey method, includes the following steps:
A. the leaf morphology feature of xylophyta to be predicted is measured, obtains leaf morphology characteristic, the leaf morphology feature Including leaf area, leaf length, leaf width degree, ratio of length to breadth, leaf perimeter and the leaf shape factor;
B. the leaf morphology data are inputted in the prediction model obtained in said program construction method, obtains predicting woody plant The predicted value of object Net Photosynthetic Rate.
CN201810119071.1A 2018-02-06 2018-02-06 A kind of construction method and prediction technique of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature Expired - Fee Related CN108229065B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810119071.1A CN108229065B (en) 2018-02-06 2018-02-06 A kind of construction method and prediction technique of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810119071.1A CN108229065B (en) 2018-02-06 2018-02-06 A kind of construction method and prediction technique of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature

Publications (2)

Publication Number Publication Date
CN108229065A true CN108229065A (en) 2018-06-29
CN108229065B CN108229065B (en) 2019-03-26

Family

ID=62671095

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810119071.1A Expired - Fee Related CN108229065B (en) 2018-02-06 2018-02-06 A kind of construction method and prediction technique of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature

Country Status (1)

Country Link
CN (1) CN108229065B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109829943A (en) * 2018-11-13 2019-05-31 上海烟草集团有限责任公司 Blade construction detection method, system, medium and equipment based on machine vision
CN113159560A (en) * 2021-04-15 2021-07-23 中国科学院地理科学与资源研究所 Optimized configuration method for psammophyte industry
CN114814099A (en) * 2022-04-25 2022-07-29 南京农业大学 Photosynthesis prediction method based on grape leaf shape

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011223915A (en) * 2010-04-19 2011-11-10 Ohbayashi Corp System and method for predicting growth of lawn
CN103234916A (en) * 2013-04-06 2013-08-07 吉林大学 Prediction method for net photosynthetic rate of population
CN105678405A (en) * 2015-12-31 2016-06-15 西北农林科技大学 Modeling method for cucumber GA-RBF photosynthetic rate prediction model integrated with stomatal conductance
CN105956380A (en) * 2016-04-25 2016-09-21 东北林业大学 Construction method of leaf area prediction model
CN107037090A (en) * 2017-04-24 2017-08-11 江苏大学 A kind of photosynthesis Forecasting Methodology changed based on blade tensity

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011223915A (en) * 2010-04-19 2011-11-10 Ohbayashi Corp System and method for predicting growth of lawn
CN103234916A (en) * 2013-04-06 2013-08-07 吉林大学 Prediction method for net photosynthetic rate of population
CN105678405A (en) * 2015-12-31 2016-06-15 西北农林科技大学 Modeling method for cucumber GA-RBF photosynthetic rate prediction model integrated with stomatal conductance
CN105956380A (en) * 2016-04-25 2016-09-21 东北林业大学 Construction method of leaf area prediction model
CN107037090A (en) * 2017-04-24 2017-08-11 江苏大学 A kind of photosynthesis Forecasting Methodology changed based on blade tensity

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109829943A (en) * 2018-11-13 2019-05-31 上海烟草集团有限责任公司 Blade construction detection method, system, medium and equipment based on machine vision
CN109829943B (en) * 2018-11-13 2023-02-10 上海烟草集团有限责任公司 Blade structure detection method, system, medium and equipment based on machine vision
CN113159560A (en) * 2021-04-15 2021-07-23 中国科学院地理科学与资源研究所 Optimized configuration method for psammophyte industry
CN113159560B (en) * 2021-04-15 2023-12-05 中国科学院地理科学与资源研究所 Optimization configuration method for sandy plant industry
CN114814099A (en) * 2022-04-25 2022-07-29 南京农业大学 Photosynthesis prediction method based on grape leaf shape
CN114814099B (en) * 2022-04-25 2023-09-12 南京农业大学 Photosynthesis prediction method based on grape leaf shape

Also Published As

Publication number Publication date
CN108229065B (en) 2019-03-26

Similar Documents

Publication Publication Date Title
Sinclair et al. An analysis of errors in the calculation of energy flux densities above vegetation by a Bowen-ratio profile method
Shipley et al. Dry matter content as a measure of dry matter concentration in plants and their parts
Overman et al. Evaluation of regression models for above-ground biomass determination in Amazon rainforest
CN108229065B (en) A kind of construction method and prediction technique of the Net Photosynthetic Rate prediction model based on xylophyta leaf morphology feature
CN108319984B (en) The construction method and prediction technique of xylophyta leaf morphology feature and photosynthesis characteristics prediction model based on DNA methylation level
Luo et al. Portable canopy chamber measurements of evapotranspiration in corn, soybean, and reconstructed prairie
Bai et al. Modelling diurnal and seasonal hysteresis phenomena of canopy conductance in an oasis forest ecosystem
Biscoe et al. Barley and its environment. I. Theory and practice
Yu et al. A PSO-XGBoost Model for Estimating Daily Reference Evapotranspiration in the Solar Greenhouse.
CN109214591B (en) Method and system for predicting aboveground biomass of woody plant
CN110334404A (en) A kind of rapid dry accurate recognition methods of drought of Watershed Scale
CN102903145A (en) Method for three-dimensional reconstruction of plant population morphological structure
Thakur et al. Increase in light interception cost and metabolic mass component of leaves are coupled for efficient resource use in the high altitude vegetation
CN106885892A (en) The method and device split to forest ecosystem evapotranspiration
CN101044823A (en) Method for estimating crop energy utilization rate and predetermining the yield
Meir et al. Plant structure-function relationships and woody tissue respiration: upscaling to forests from laser-derived measurements
Sugiura et al. Concentrative nitrogen allocation to sun-lit branches and the effects on whole-plant growth under heterogeneous light environments
Evans et al. Inventory time-cost and statistical power: a case study of a Lao rattan
Zhang et al. Use of a regression method to partition sources of ecosystem respiration in an alpine meadow
CN116705144A (en) Method for measuring correlation between species distribution data and environmental factors
CN116663779A (en) Multi-depth fertility point-surface conversion method and device for cultivated land soil
Zhang et al. Comparison of the partitioning of evapotranspiration–numerical modeling with different isotopic models using various kinetic fractionation coefficients
Zhong et al. Organic carbon storage in soils of tropical and subtropical China
MATSUDA et al. Dry deposition velocity of sulfur dioxides on a red pine forest in Nagano, Japan
CN108918578A (en) A kind of Heat Tolerance of Wheat Cultivars brief appraisal method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190326

Termination date: 20210206