CN113848225B

CN113848225B - XRF element quantitative analysis method based on PCA-SVR

Info

Publication number: CN113848225B
Application number: CN202111073294.7A
Authority: CN
Inventors: 杨婉琪; 李福生; 赵彦春
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2021-09-14
Filing date: 2021-09-14
Publication date: 2022-06-03
Anticipated expiration: 2041-09-14
Also published as: CN113848225A

Abstract

The invention discloses an XRF element quantitative analysis method based on PCA-SVR, which comprises the steps of reading element peak value information and content information; determining input and output of the PCA-SVR model; calculating a correlation coefficient and a unit feature vector; calculating a principal component; constructing a classification hyperplane, and converting the optimal classification hyperplane problem into a quadratic programming model; carrying out parameter optimization training on a PCA-SVR model, and quantitatively predicting the element content; selecting the optimal number of the main components; and calculating a decision coefficient and evaluating the prediction effect of the PCA-SVR model. The method has the advantages of simple operation process, high prediction accuracy, intuitive result, easy operation, capability of solving the problems of X fluorescence spectrum peak value overlapping interference, inaccuracy of the traditional instrument measurement method and the like, reduces the influence of the environment background, reduces the error caused by the statistical fluctuation, and can effectively and quickly carry out quantitative prediction on the elements contained in the object to be detected.

Description

XRF element quantitative analysis method based on PCA-SVR

Technical Field

The invention relates to the field of element detection, in particular to an XRF element quantitative analysis method based on PCA-SVR.

Background

With the gradual development of energy spectrum scientific research, the online qualitative and quantitative detection technology becomes a new development trend. Through perfect extension research in recent ten years, the analysis of element content by X-ray fluorescence (XRF) spectroscopy becomes a novel analysis technology, and the method is widely applied to various fields such as metallurgy, building materials, ground mines, commercial inspection, environmental protection, food sanitation, nonferrous metals and the like. The method has the advantages of rapid analysis, no damage to sample properties, wide analysis range, stable and reliable result, rapid realization of simultaneous analysis of multiple elements, simple operation and the like.

The traditional method mainly carries out accurate qualitative and accurate quantitative analysis on trace elements through an XRF spectrometer, is easy to have the problems of overlapping of peak counts among element spectral lines, uncertainty of element information, high element detection limit and the like, and how to improve the accuracy of an element quantitative analysis result under the condition of spectral line overlapping interference becomes the key point of the research of the invention. Therefore, the principal component analysis-support vector regression (PCA-SVR) algorithm is applied to the quantitative analysis of the elements, the problems of inaccurate calculation and lack of data inspection of the traditional X fluorescence spectrometer are solved, and an optional inspection method is provided for the quantitative analysis of the X fluorescence spectrometer result.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a method for predicting the content of elements in a substance to be tested based on a PCA-SVR algorithm.

In order to achieve the purpose, the invention provides an XRF element quantitative analysis method based on PCA-SVR, which comprises the following steps:

step 1: determining a standard sample set, supposing that n samples to be detected are concentrated in the standard sample set, taking a union set of all elements (No. 12-92 elements in an element periodic table) which can be identified by an ED-XRF fluorescence spectrometer (an energy dispersion type X-ray fluorescence spectrometer) in the standard sample set to form an element set contained in the n samples to be detected, and obtaining an element set A with content in the standard sample set;

step 2: reading in element peak information and content information. Taking any sample to be detected as a sample to be identified, and testing the corresponding element peak value information and content information in the element set A by an ED-XRF fluorescence spectrometer to obtain the actually measured component value (or peak count) X and content value Y of each element.

And step 3: and determining the input and the output of the PCA-SVR model. Constructing a PCA-SVR model, wherein a certain element needing quantitative analysis is called a target element, and an element which interferes with the target element is called an interference element. And taking an actually measured component value (or peak count) matrix consisting of the target elements and the interference elements in the standard sample set as the input of the PCA-SVR model, and taking an actually measured component value matrix consisting of the element content of the target elements as the output of the PCA-SVR model. For example, a matrix X of measured component values (or peak counts) of elements_nmIs a sample containing n samples to be tested, each sample to be tested is composed of component values of m elements, X_nmThe first column of the matrix is the measured component value of a single target element, and the other m-1 columns are composed of other target elements and interference elements corresponding to all the target elements. Likewise, the matrix Y of measured content values of the elements_n1Is a sample containing n parts of samples to be detected, and each part of samples to be detected consists of the concentration value of the single element;

and 4, step 4: XRF spectral data was normalized. Will be original X_nmStandardizing the matrix to obtain a standardized matrix

X_nmRow vector of ith row in matrix

Represents a vector of component values (or peak counts) of m elements contained in the ith test sample. To X_nmThe matrix is normalized as follows:

wherein i is a normalized matrix

And i is 1,2, n, j is a standardized matrix

And j is 1,2_ijTo represent

The component value (or peak count) of the jth element in (j),

is a matrix X_nmSample average value of j column of (1), x'_ijRepresents the component value (or peak count) of the jth element of the ith sample to be tested after standard transformation, S_jIs a matrix X_nmThe standard deviation of the samples in column j,

representing a normalized matrix

Row vectors for the ith row.

And 5: normalizing matrix

Correlation coefficient matrix R:

in the formula, n is the number of samples to be detected,

is a normalized matrix.

Step 6: normalizing a matrix

Unit feature vector of

Solving an equation system Rb for the determined characteristic root lambda and the correlation coefficient matrix R obtained in the step 5_j＝λb_jObtain a feature vector b_jThen for each feature vector b_jAfter normalization, m normalized unit feature vectors can be obtained

Wherein j is a normalized matrix

And 7: normalizing m unit feature vectors

Respectively converted into m main components, main component u_jThe calculation formula of (2) is as follows:

wherein i is 1,2, and n, j is 1,2, and m,

for normalizing the matrix

The row vector of the ith row.

And 8: mapping the original X by a non-linear function_nmMapping of m-dimensional data to k-dimensional data of each row in the matrix, i.e. by PCA reductionAfter dimension measurement, k main components are obtained, and new k-dimension data are used for reflecting information expressed by original m-dimension data. The k-dimensional element component value (or peak count) feature data is then mapped from the low-dimensional nonlinear separable space into a high-dimensional linear separable feature space. Constructing a classification hyperplane in this high-dimensional linearly separable feature space:

wherein p is 1,2, k (k is less than or equal to m), h_pFor class marking in p-dimension, in hyperplane

Is defined as h _p1 in the hyperplane

Is defined as h_pIs-1. w is the feature weight vector, b is the offset, x_pRepresents the element component value (or peak count) vector of the sample to be detected after the PCA is reduced to the p dimension,

to convert data x_pA non-linear mapping function mapped to a high-dimensional linearly separable feature space, wherein x is omitted for simplifying the formula_pSubscript i in (1) different samples to be tested correspond to different x_p。

And step 9: in order to control the calculation speed and reduce the error in the sample training, a penalty factor C and a relaxation variable xi are introduced_pAnd (3) constraining, and converting the classified hyperplane problem into a quadratic programming model:

in the formula, w is a feature weight vector, C is a penalty factor, xi_pAs a relaxation variable, h_pIs a class mark, b is an offset,

to convert data x_pA non-linear mapping function that maps to a high-dimensional linearly separable feature space.

Step 10: and performing parameter optimization by using a cross-validation method based on grid search, and training a PCA-SVR model. Obtaining an optimal parameter penalty factor C' and a relaxation variable xi by continuously iterating and searching for an optimal parameter_p', and introducing a Lagrange multiplier alpha_pAnd solving the formula (9) by the kernel function K, wherein different samples to be detected correspond to different alpha_pAnd xi_p. When the minimum classification hyperplane meeting the precision requirement of the above formula (9) is the target element content prediction result

Else, iteration is continued until optimal parameters C ' and ξ ' are found '_p. The calculation formula for predicting the content of the single target element of any ith sample to be detected is as follows:

wherein i is 1,2, n, p is 1,2, k, α_pIs a Lagrangian multiplier, h_pFor class labels, K is the kernel function and b is the offset.

Step 11: comparing the single target element content predicted by calculating k principal components in the step 10 with the actual content result condition of the single target element: the Root Mean Square Error (RMSE) is calculated for each of k by taking different values, and the optimum number of principal components is selected. The RMSE is used for measuring the closeness degree of a predicted value and an actual value, the smaller the RMSE is, the more accurate the selection of the number of the main components is, and the more accurate the element content prediction is. In general, RMSE decreases as the number of principal components increases until a minimum or constant value is reached. When the RMSE curve is obviously reduced and then gradually levels off, the corresponding k value is the optimal main component number k_optimal. I.e. the original matrix X_nmMapping m-dimensional data of each row to k_optimalDimension dataAbove, with k_optimalThe dimension data reflects the original matrix X_nmInformation expressed by medium m-dimensional data, k_optimalM is less than or equal to m, and the RMSE evaluation index is calculated as follows:

in the formula (I), the compound is shown in the specification,

is a predicted value y of the content of a single target element in the ith sample to be tested_iThe actual value of the content of the single target element in the ith sample to be detected is obtained.

Step 12: when the number of the main components is k_optimalComparing the prediction result with the actual content result of the standard sample, and calculating the determination coefficient (R)²)，R²The method is used for reasonably evaluating the prediction effect of the PCA-SVR model and is used for describing the fitting degree of a regression line and an observed value. R²The larger the element content, the more accurate the element content prediction. R²The calculation formula of (2) is as follows:

in the formula, y_iIs the real value of the content of a single target element in the ith sample to be detected,

is a predicted value of the content of the single target element in the ith sample to be detected,

the average value of the true value of the content of the single target element in the ith sample to be detected is obtained.

The method has the advantages of simple operation process, scientificity, reasonability, simple flow, convenience in operation, high prediction accuracy, intuitive result and popular and understandable property; the operation mode of the invention has the characteristics of high detection precision, high prediction accuracy, small calculation complexity and the like, can solve the problems of X fluorescence energy spectrum peak value overlapping interference, inaccurate measurement method of the traditional instrument and the like, reduces the influence of the environmental background, reduces the error caused by statistical fluctuation, and can effectively and quickly carry out quantitative prediction on the elements contained in the object to be detected.

Drawings

FIG. 1 is a flow chart of a PCA-SVR-based XRF elemental quantitative analysis method of the present invention;

FIG. 2 is a spectrum of a standard soil sample according to the present invention;

FIG. 3 is a graph of the results of principal component analysis based on the present invention;

FIG. 4 is a diagram showing the result of prediction of the content of soil elements according to the present invention.

Detailed Description

The following provides a more detailed description of the embodiments and the operation of the present invention with reference to the accompanying drawings.

The embodiment provides an XRF element quantitative analysis method based on PCA-SVR, the working flow of which is shown in FIG. 1, and the specific steps for obtaining element information and detection limit in a standard soil sample are as follows:

step 1: and determining a soil sample set, wherein n soil samples are set in the soil sample set, namely a sample 1 and a sample 2 … … sample 57. All elements capable of being identified by a spectrometer are taken to form an element set A contained in the soil sample, and a total 57 element sets A1-A57 are finally obtained, namely, a union set of A1-A57 is taken to obtain the element set A with the content in the soil sample set, wherein the elements in the element set A are included in No. 12-92 elements in the periodic table.

Step 2: 57 national standard samples are adopted as standard samples, and comprise GSS series soil component analysis standard substances, GBW series soil component analysis standard substances and GSD water system sediment component analysis standard substances, namely GSS-1-GSS-27, GSS-32, GBW 0070003-GBW 0070006 and GSD-2 a-GSD-33. An XRF spectrogram of a sample to be detected, an element component value X (or peak count) and a content value Y contained in the sample can be simultaneously obtained by an ED-XRF fluorescence spectrometer by using an intelligent energy dispersion fluorescence analysis method, and the XRF spectrogram of a standard soil sample is shown in figure 2.

And step 3: in the element set a, a union set of a target element to be studied and a corresponding interfering element thereof is taken As an input variable of the PCA-SVR model, the study objects in this embodiment are ten soil harmful elements of 23(V), 24(Cr), 25(Mn), 27(Co), 29(Cu), 30(Zn), 33(As), 42(Mo), 48(Cd), and 82(Pb), and the Cd element is taken As an example herein to perform detailed element content prediction. The information on the composition of some standard soil samples is shown in table 1.

Table 1 partial standard soil sample composition information

Taking 57 parts of standard soil samples as an example, the component content of the target element is recorded, and the detailed information is shown in table 2 when the original data is completely collected.

TABLE 2 national Standard soil sample composition information (ppm)

The measured component value matrix X consisting of the target element and the interference element thereof_nmAnd taking the target element content matrix as the input of the PCA-SVR model and the output of the PCA-SVR model. Details of the interference elements are shown in table 3.

TABLE 3 main interference elements of the target elements

Taking Cd element content prediction As an example, the input of the PCA-SVR model is a57 × 21 component data matrix, that is, a matrix containing 57 samples, each sample is composed of component values of twenty-one elements (a union of all target elements and their corresponding interfering elements) of Cd, V, Ti, As, K, Cr, Se, Fe, Ni, Zn, Sr, Cu, P, Co, Mn, Pb, Ca, Mo, Nb, Ag, Sb, wherein the first column is a component value of a single target element (Cd), the remaining 20 columns are component values of other target elements and all interfering elements, and the column positions of the 20 columns of data can be randomly arranged. Similarly, the output of the PCA-SVR model is a57 × 1 component data matrix, i.e., a matrix containing 57 samples, each sample consisting of a single target element (Cd) content value.

And 4, step 4: XRF spectral data was normalized. The original size of matrix X is n × m_nmCarrying out standardization processing to obtain a standardized matrix

X_nmRow vector of ith row in matrix

A vector consisting of component values (or peak counts) of m elements contained in the ith test sample. To X_nmThe matrix is normalized as follows:

wherein i is a normalized matrix

And i is 1,2, n, j is a standardized matrix

And j equals 1,2, a, m, x_ijTo represent

The component value (or peak count) of the jth element in (j),

is a matrix X_nmJ column sample mean, x'_ijRepresents the component value (or peak count) of the jth element of the ith sample to be tested after standard transformation, S_jIs a matrix X_nmThe standard deviation of the samples in column j,

representing a normalized matrix

Row vector for row i.

And 5: normalizing a matrix

Correlation coefficient matrix R of (a):

in the formula, n is the number of samples to be detected,

is a normalized matrix.

And 6: normalizing a matrix

Unit feature vector of

Wherein j is a normalized matrix

And 7: normalizing m unit feature vectors

wherein i is 1,2, and n, j is 1,2, and m,

for standardizing matrices

The row vector of the ith row.

And 8: mapping matrix X by a non-linear function_nmMapping the m-dimensional data of each row in the system to k-dimensional data, namely obtaining k principal components after dimensionality reduction by a Principal Component Analysis (PCA), reflecting information expressed by the original m-dimensional data by the k-dimensional data, mapping k-dimensional element component data from a low-dimensional nonlinear separable space to a high-dimensional linear separable feature space, and constructing a classification hyperplane in the high-dimensional linear separable feature space:

wherein p is 1,2, k is less than or equal to m, h_pFor class marking in p-dimension, in hyperplane

Is defined as h _p1 is ═ 1; in the hyperplane

Is defined as h_p-1, w is the feature weight vector, b is the offset, x_pRepresenting the element component value vector of the sample to be detected after the PCA is reduced to the p dimension,

to convert data x_pA non-linear mapping function mapped to a high-dimensional linearly separable feature space, wherein x is omitted for simplifying the formula_pSubscript i (different samples to be tested, corresponding to different x) in (1)_p)。

in the formula, w is a feature weight vector, C is a penalty factor, and xi_pAs a relaxation variable, h_pIs a class mark, b is an offset,

Step 10: and performing parameter optimization by using a cross-validation method based on grid search, and training a PCA-SVR model. Obtaining an optimal parameter penalty factor C ' and an optimal relaxation variable xi ' by continuously iterating and searching for an optimal parameter '_pAnd introducing a Lagrange multiplier alpha_pAnd solving the formula (9) by the kernel function K, wherein different samples to be detected correspond to different alpha_pAnd xi_pUsually in the parameters w, b, x_p，C，ξ_p，α_pThe subscript p in K varies with the input sample to be tested, e.g. x_p，ξ_p，α_pWithout the subscript p, the usual does not change, e.g., w, b, C, K. When the minimum classification hyperplane meeting the precision requirement of the above formula (9) is the content prediction result of the single target element (Cd)

Else, iteration is continued until optimal parameters C ' and ξ ' are found '_p. The calculation formula for predicting the content of the single target element (Cd) in any ith sample to be detected is as follows:

wherein i is 1,2, n, p is 1,2, k, α_pIs the Lagrange multiplier, h, of the ith sample to be measured_pFor class labels, K is the kernel function and b is the offset.

Step 11: calculating k principal components u in step 10_jComparing the predicted target element content with the actual target element content result, and taking the k as differentRespectively, the Root Mean Square Error (RMSE) is calculated to select the optimum number of principal components. The RMSE is used for measuring the closeness degree of a predicted value and an actual value, the smaller the RMSE is, the more accurate the selection of the number of the main components is, and the more accurate the element content prediction is. In general, RMSE decreases as the number of principal components increases until a minimum or constant value is reached. When the RMSE curve is obviously reduced and then gradually levels off, the corresponding k value is the optimal main component number k_optimalI.e. matrix X_nmMapping m-dimensional data of each row to k_optimalOn dimension data, with k_optimalDimensional data reflects matrix X_nmInformation expressed by medium m-dimensional data, k_optimalM is less than or equal to m, and the RMSE evaluation index is calculated as follows:

in the formula (I), the compound is shown in the specification,

is the predicted value y of the content of a single target element in the ith sample to be tested_iThe actual value of the content of the single target element in the ith sample to be detected is obtained.

As shown in fig. 3, the RMSE curves obtained by the conventional Partial Least Squares Regression (PLSR) method and the present invention based on the PCA-SVR method are compared, and the PCA-SVR method is optimal when the number of principal components is 4 and the PLSR method is optimal when the number of principal components is 8; under the condition of the same number of principal components, the RMSE of the PCA-SVR method is smaller than that of the PLSR method, and the prediction is more accurate.

in the formula, y_iIs the true value of the content of a single target element (Cd) in the ith sample to be detected,

is a predicted value of the content of the single target element (Cd) in the ith sample to be detected,

the average value of the true value of the content of the single target element (Cd) in the ith sample to be detected is obtained.

The element content prediction method for the other 9 kinds of study objects 23(V), 24(Cr), 25(Mn), 27(Co), 29(Cu), 30(Zn), 33(As), 42(Mo), and 82(Pb) in the element set a was the same As that for the element 48 (Cd).

The element determination coefficient R of the standard soil sample based on the PCA-SVR method of the invention and the traditional Partial Least Squares Regression (PLSR) method is adopted²Results are compared and detailed information is shown in table 4:

TABLE 4 determination coefficient R for prediction of element content of standard soil sample²Comparison of results

Taking Cd as an example, the content prediction result is shown in fig. 4, and it can be seen that the XRF element quantitative prediction result based on PCA-SVR better conforms to the actual result of element content compared to the Partial Least Squares Regression (PLSR) method. The PCA-SVR algorithm effectively solves the problem of spectral line overlapping, improves the accuracy of the element quantitative analysis result and embodies the superiority of the method of the invention.

While the invention has been described with reference to specific embodiments, any feature disclosed in this specification may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise; all of the disclosed features, or all of the method or process steps, may be combined in any combination, except mutually exclusive features and/or steps; any non-essential addition or replacement made by a person skilled in the art according to the technical features of the technical solution of the present invention is within the scope of the present invention.

Claims

1. An XRF element quantitative analysis method based on PCA-SVR is characterized by comprising the following steps:

step 1: determining a standard sample set, wherein the standard sample set comprises n samples to be detected, taking a union set of all elements which can be identified by an ED-XRF fluorescence spectrometer in the standard sample set to form an element set contained in the n samples to be detected, and obtaining an element set A with content in the standard sample set, wherein the elements which can be identified by the ED-XRF fluorescence spectrometer are No. 12-92 elements in an element periodic table;

step 2: reading element peak information and content information: taking any sample to be detected as a sample to be identified, and testing corresponding element peak value information and content information in an element set A by an ED-XRF fluorescence spectrometer to obtain the actually measured component value and content value of each element in the sample to be identified;

and step 3: determining the input and the output of the PCA-SVR model: constructing a PCA-SVR model, wherein a certain element needing quantitative analysis is called a target element, an element which interferes with the target element is called an interference element, and an actually measured component value matrix X consisting of all the target elements researched in the element set A and the corresponding interference elements is constructed_nmAs the input of the PCA-SVR model, the actually measured content value matrix Y formed by the element content of the target element_n1As the output of the PCA-SVR model, wherein the measured component values matrix X_nmIs a matrix comprising n samples to be tested, each sample to be tested consisting of the actually measured component values of m elements, X_nmThe first column of the matrix is the measured component value of a single target element, the remaining m-1 columns are other target elements in the element set A andmeasured component values of interference elements corresponding to all target elements; the measured content value matrix Y_n1Is a matrix comprising n samples to be tested, each sample to be tested consisting of the single target element content value;

and 4, step 4: XRF spectral data normalization: will matrix X_nmCarrying out standardization processing to obtain a standardized matrix

Matrix X_nmRow vector of the ith row

Representing the m element actual measurement component value vectors contained in the ith sample to be tested, and aligning the matrix X_nmThe normalization process is performed as follows:

wherein i is a normalized matrix

And i is 1,2, n, j is a standardized matrix

Column (2) ofAnd j is 1,2_ijTo represent

The measured component value of the jth element in (a),

is a matrix X_nmSample mean, x 'of column j'_ijRepresenting the component value, S, of the jth element of the ith sample to be tested after standard transformation_jIs a matrix X_nmThe standard deviation of the samples in column j,

representing a normalized matrix

A row vector for row i;

and 5: normalizing a matrix

Correlation coefficient matrix R:

step 6: normalizing matrix

Unit feature vector of

Solving an equation system Rb for the determined characteristic root lambda and the correlation coefficient matrix R obtained in the step 5_j＝λb_jGet the feature vector b_jThen for each feature vector b_jObtaining m normalized unit characteristic vectors after normalization

Wherein, | | · | | is a p-norm;

and 7: normalizing m unit feature vectors

and 8: mapping matrix X by a non-linear function_nmMapping the m-dimensional data of each row in the system to k-dimensional data, namely obtaining k principal components after dimensionality reduction by a Principal Component Analysis (PCA), reflecting information expressed by original m-dimensional element component value data by k-dimensional element component value data, then mapping the k-dimensional element component value data from a low-dimensional nonlinear separable space to a high-dimensional linear separable feature space, and constructing a classification hyperplane in the high-dimensional linear separable feature space:

Is defined as h_p1 is ═ 1; in a hyperplane

Is defined as h_p＝-1，w is the feature weight vector, b is the offset, x_pRepresenting the element component value vector of the sample to be detected after the PCA is reduced to the p dimension,

to convert data x_pA non-linear mapping function mapped to a high-dimensional linearly separable feature space, wherein x is omitted for simplifying the formula_pSubscript i in (1), i.e. different samples to be tested, corresponds to different x_p；

And step 9: introducing a penalty factor C and a relaxation variable xi_pAnd (3) constraining, and converting the classified hyperplane problem into a quadratic programming model:

step 10: performing parameter optimization by using a cross-validation method based on grid search, and training the PCA-SVR model: obtaining an optimal parameter penalty factor C ' and an optimal relaxation variable xi ' by continuously iterating and searching for an optimal parameter '_pAnd introducing a Lagrange multiplier alpha_pAnd solving the formula (9) by the kernel function K, wherein different samples to be detected correspond to different alpha_pAnd the resulting optimum relaxation variable ξ'_pAre also different; the minimum classification hyperplane meeting the precision requirement of the formula (9) is the prediction result of the content of the target element

The calculation formula for predicting the content of the single target element of any ith sample to be detected is as follows:

step 11: comparing the single target element content predicted in the step 10 with the actual single target element content result: taking k as different values to respectively calculate the root mean square error RMSE which follows the principal componentThe number is increased and decreased until reaching the minimum value or constant value, at which time the corresponding k value is the optimal number k of principal components_optimalI.e. matrix X_nmMapping m-dimensional data of each row to k_optimalOn dimension data, with k_optimalDimensional data reflects matrix X_nmInformation expressed by medium m-dimensional data, k_optimalM is less than or equal to m, and the RMSE evaluation index is calculated as follows:

in the formula (I), the compound is shown in the specification,

is the predicted value y of the content of a single target element in the ith sample to be tested_iThe true value of the content of the single target element in the ith sample to be detected is obtained;

step 12: when the number of the main components is k_optimalComparing the predicted single target element content with the actual single target element content result, and calculating the decision coefficient R²To evaluate the predictive effect of the model, R²The calculation formula of (2) is as follows:

in the formula (I), the compound is shown in the specification,

2. The method for PCA-SVR based XRF elemental quantitative analysis of claim 1 wherein n-57 and m-21.

3. The PCA-SVR-based XRF elemental analysis method of claim 2, characterized in that the samples to be tested in said standard sample set comprise GSS series soil composition analysis standard substance, GBW series soil composition analysis standard substance and GSD water system sediment composition analysis standard substance.

4. The PCA-SVR-based XRF elemental quantitative analysis method of claim 3 characterized in that all the target elements studied in said element set A include vanadium (V), chromium (Cr), manganese (Mn), cobalt (Co), copper (Cu), zinc (Zn), arsenic (As), molybdenum (Mo), cadmium (Cd), lead (Pb).

5. The PCA-SVR-based XRF elemental quantitative analysis method of claim 4 characterized in that said single target element is cadmium (Cd).