CN114492007A

CN114492007A - Factor effect online identification method and device based on hierarchical error control

Info

Publication number: CN114492007A
Application number: CN202210047863.9A
Authority: CN
Inventors: 施文; 谢翔; 陈晓红
Original assignee: Central South University
Current assignee: Central South University
Priority date: 2022-01-17
Filing date: 2022-01-17
Publication date: 2022-05-13

Abstract

The invention discloses a factor effect online identification method and device based on hierarchical error control, wherein the method comprises the following steps: initializing parameters of main/interactive effect identification; identifying variation factors in the simulation model driven by the online data in real time, and constructing a real-time design matrix; calculating the base effect of the variation factor from the output of the simulation model; converting the base effect into a new sample for main/interactive effect hypothesis test, sampling the two effects from the new sample to obtain Bootstrap samples, and repeating the steps to obtain B Bootstrap samples; calculating the test statistic of each Bootstrap sample, and calculating the p value of the main/interactive effect test; and comparing the p value of the main/interactive effect test with the respective test level to determine whether the variable factor has important main/interactive effect, and completing the identification of the variable factor by any observation point. The invention can identify the importance of factor effect on line and control the error of the judgment result.

Description

Factor effect online identification method and device based on hierarchical error control

Technical Field

The invention belongs to the field of simulation modeling and analysis, and particularly relates to a factor effect online identification method and device based on hierarchical error control.

Background

In the industrial 4.0 background, technologies such as internet of things, business intelligence, 5G and cloud computing are integrated into large-scale applications, thereby realizing real-time data transmission, processing and feedback. The digital twin is the most advanced digital research paradigm at present as a system modeling method based on real-time data and models. The digital twin builds a digital world through simulation modeling, and dynamically updates simulation model parameters through real-time data; one of the characteristics of the digital world is control, the dynamic state which is difficult to detect under different time dimensions is found based on a digital-analog linkage simulation mode, and then action instructions are transmitted to realize the value increment of physical entities. Such online data-driven simulations require more powerful tools to resolve the system internal complexity and observe its dynamic characteristics to support real-time decisions.

Factor effect identification/sensitivity analysis is an important technical means for describing potential input/output relationships of a simulation system, and can be combined with experimental design, sampling and statistical inference to identify factors (inputs) having significant influence on system output. Of these, the sequential branching method and the Morris-based effect method are generally considered to be the most effective two factor effect recognition methods. However, most of the traditional factor effect recognition research focuses on an offline simulation model under static data, and is limited to controlling the probability of two types of errors in one global sensitivity analysis. In the online data-driven simulation model, the value range of the factor may be changed continuously, and if the production capacity is reduced due to equipment failure in an intelligent manufacturing shop, the classification of the factor on the original importance of the system performance may also be changed correspondingly, that is, the significance of the main effect and the interactive (nonlinear) effect of the factor is changed. If the off-line factor effect identification method is simply applied to on-line setting, the efficiency and effectiveness requirements of real-time simulation experiment analysis are difficult to meet.

With the wide attention paid to online real-time decision in recent years, the rise of real-time error control research provides a solution for the invention. The most advanced real-time error control methods at present include monotonous generalized alpha investment rules, discarding-spending algorithms, etc., but these methods are all based on independence assumptions at different time points, and do not consider the situation of checking multiple assumptions with correlation (such as main effect assumption and interactive effect assumption considered in factor effect real-time identification) at the same time point. Therefore, there is a need to develop an adaptive online method to identify the important effects of the varying factors and control the overall error level in real time.

Disclosure of Invention

Aiming at the defects of the online data-driven simulation analysis technology under the current digital twin background, the invention provides a factor effect online identification method and device based on hierarchical error control.

In order to achieve the technical purpose, the invention adopts the following technical scheme:

a factor effect online identification method based on hierarchical error control comprises the following steps:

step 1, initializing parameters of main effect recognition and interactive effect recognition according to the FDR level eta of the whole factor effect recognition;

step 2, aiming at k factors existing in the online data-driven simulation model, acquiring a single factor which changes in the value range of any observation point t in the model, and recording the single factor as a change factor l_tFurther constructing a real-time design matrix; wherein the variation factor l_t∈{1，2，…，k}；

Step 3, inputting the real-time design matrix into a simulation model to obtain simulation output, and calculating the base effect of the variation factor;

and 4, converting and obtaining a new base effect sample for main effect hypothesis test and a new base effect sample for interactive effect test according to the original variation factor base effect sample:

step 5, independently sampling the new basic effect samples subjected to the main effect hypothesis test in a returning mode for N times to obtain Bootstrap samples subjected to the main effect hypothesis test, and independently sampling the new basic effect samples subjected to the interactive effect test in a returning mode for N times to obtain Bootstrap samples subjected to the interactive effect hypothesis test;

repeating the step 5 for B times to respectively obtain B Bootstrap samples for main effect hypothesis test and B Bootstrap samples for interactive effect hypothesis test;

step 6, respectively counting Bootstrap samples of each main effect hypothesis test and Bootstrap samples of each interactive effect hypothesis test to obtain corresponding test statistics;

step 7, by checking the principle effect hypothesisComparing Bootstrap sample statistic with primordial effect sample main effect statistic, and calculating p value of main effect test

And calculating a p-value for the interaction effect test by comparing the Bootstrap sample statistic of the interaction effect hypothesis test with the interaction effect statistic of the primordial effect sample

Step 8, determining the observation point of the most recent recognition of the important main effect

Observation points of important interaction effects

Obtaining the check level of the main effect layer of the observation point t by using the LORD1 rule

And the level of examination of the interaction effect layer

Step 9, judging the p value of the main effect layer

Greater than the level of examination of the layer

If yes, determining the variation factor l_tHas no important main effect, otherwise, the variation factor l is determined_tHas important main effects; and judging the p value of the interactive effect layer

Greater than the level of examination of the layer

If yes, determining the variation factor l_tHas no important interaction effect, otherwise, the variation factor l is judged_tHas important interaction effect; completing the random observation point t to the variation factor l_tThe identification of (2).

Further, the initializing parameters of the main effect recognition and the interactive effect recognition specifically includes:

step 1.1, according to the FDR level eta identified by the overall factor effect, the FDR level identified by the main effect is distributed

And interaction recognition FDR levels

And satisfy

Step 1.2, the maximum test level at which the initial observation point t is 1 is identified for the main effect

And defining a maximum increment of the check level in the recognition of the main effect

Maximum test level for interaction effect recognition when initial observation point t is 1

And defining maximum increment of check level in interactive effect recognition

Further, the method for constructing the real-time design matrix in the step 2 comprises the following steps: obtaining a variation factor l_tN sampling matrix random form

Then longitudinally splicing to construct a real-time design matrix

The sampling matrix is in random form

Is a2 xk dimensional matrix in which a behavior factor is combined

Another combination of behavior factors

Δ is a combination of two line factors

The difference in the above.

Further, each sampling matrix is in a random form

The acquisition process comprises the following steps:

step a1, constructing a (k +1) × k-dimensional sampling matrix B composed of 0 s and 1 s such that the j +1 th row differs from the 1 st row by 1 in the j column, defining the sampling matrix B as follows:

step A2, converting the sampling matrix B into the ith random form required by the observation point t

So that each factor j epsilon {1, 2, …, k } can only be within the value range of [ -1, 1 [ ]]Is taken at p discrete positions, i.e. x_jE { -1, -1+2/(p-1), …, 1}, and defining the conversion formula as follows:

wherein D is^*Is a diagonal matrix of k dimensions, the diagonal elements of which are 1 or-1; x is the number of^*Each factor x in_jThe value range of (a) is { -1, -1+2/(p-1), …, 1-delta }; j. the design is a square_k+1，kA (k +1) × k-dimensional matrix in which all elements are 1; p^*The random permutation matrix is k multiplied by k, only one element in each row and each column of the random permutation matrix is 1, and the rest elements are 0;

step A3, selecting

Line 0 in (1) and the variation factor l_tThe corresponding rows form a new matrix to obtain a random form of a real-time sampling matrix

Wherein the matrix is only in the l_tThe columns differ by a.

Further, the step 3 specifically includes: will design the matrix in real time

The values of the factors are reversely normalized to a real range, the factor combinations are input into a simulation model line by line to obtain corresponding simulation output, and finally different pairs of factor combinations are calculated

And

with respect to the variation factor l_tThe radical effect of (1); ith radical effect

The calculation formula of (c):

wherein the content of the first and second substances,

and

respectively in random form of sampling matrix

The 0 th line and the variation factor line of (c),

are respectively a combination of factors

And

simulation output;

line 0, in

With each other row differing by a in only one column.

Further, the specific process of step 4 is as follows:

step 4.1, setting a factor main effect threshold value delta_MEAnd interaction threshold Δ_IESetting the dominant effect hypothesis test

Interactive effect hypothesis testing

Step 4.2, calculating N original base effect samples

Sample mean of

And standard deviation of sample

The calculation formula is as follows:

step 4.3, respectively enabling the ith to be less than or equal to N original base effect samples

Converting the data into samples required by the hypothesis test of the main effect according to the following conversion formula

And samples required for hypothesis testing of interaction effects

Further, the calculation method for obtaining the test statistic by counting each Bootstrap sample in the step 6 is as follows:

step 6.1, calculating Bootstrap samples of each main effect hypothesis test

Sample mean of

Computing Bootstrap samples for each interaction hypothesis test

Sample mean of

And standard deviation of sample

Wherein B ∈ {1, …, B }, different Bootstrap samples for distinguishing the dominant effect hypothesis test, and different Bootstrap samples for distinguishing the interactive effect hypothesis test;

step 6.2, calculating Bootstrap samples of each main effect hypothesis test

Test statistic of

And calculating Bootstrap samples for each interaction effect hypothesis test

Test statistic of

In the formula,. DELTA._IEIs the threshold value of the interaction effect of the factor.

Further, the p value calculation method of the main effect test and the interactive effect test is as follows:

step 7.1: computing dominant effect statistics of primordial effect samples

And interaction effect statistics

In the formula (I), the compound is shown in the specification,

and

the sample mean and the sample standard deviation of the N original base effect samples are respectively.

Step 7.2: calculating the variation factor l_tHypothesis testing for p-value

Checking p-value of sum interaction hypothesis

Bootstrap samples representing the b-th hypothesis test for major effects

The test statistic of (a) is,

bootstrap samples examined for the b-th cross-effect hypothesis

The test statistic of (1).

Further, the step 8 specifically includes:

step 8.1, generating a monotonically non-increasing non-negative sequence

So that

Step 8.2, determining the observation point which has identified the important main effect for the last time in the first t-1 observation points

If no significant primary effect has been identified in the past, then order

Calculating the inspection level of the main effect layer of the observation point t

Step 8.3, determining the observation point which has identified the important interaction effect for the last time in the first t-1 observation points

If no significant interaction effect has been identified in the past, then order

Then calculating the inspection level of the t interaction effect layer of the observation point

An online identification device for factor effect based on hierarchical error control comprises a memory and a processor, wherein the memory stores a computer program, and the computer program is executed by the processor to enable the processor to realize the online identification method for factor effect based on hierarchical error control according to any one of the above technical solutions.

Compared with the prior art, the invention has the beneficial effects that: the invention provides a factor effect online identification method based on hierarchical error control, which comprises the steps of obtaining a base effect sample of a variable factor by constructing a real-time design matrix and inputting the real-time design matrix into a simulation model when the value range of a single factor of a dynamic simulation model is observed to change in real time, converting the base effect sample into a Bootstrap sample and comparing the Bootstrap sample with an original base effect sample, calculating to obtain a p value for hypothesis test of a main effect and an interactive (nonlinear) effect, and obtaining a p value based on historical factor effectThe recognition result assigns a test level to the dominant and interactive (non-linear) hypothesis tests, thereby controlling the overall error level while determining the importance of the factor dominant and interactive (non-linear) effects; the method simplifies the design matrix of the Morris-based effect method so as to be suitable for single variable factor effect identification, and can improve the actual economy from k/(k +1) to 1/2; compared with the Morris-based effect method, the method uses the expression as

Two direct recognition factors, the method of the invention uses factor principal effect and interaction effect threshold delta_MEAnd Δ_IEThe method has more theoretical basis and explanation; for hypothesis testing of main effects and interactive effects, the method is combined with a non-parametric Bootstrap hypothesis testing method, hypothesis base effect samples do not need to obey established distribution, and unbiased results can be obtained; for the dependency relationship between the p values of the main effect layer and the interactive effect layer, the method adopts a layered structure and combines a LORD1 method to control the error level of factor effect recognition in the whole real-time process, which is beneficial to meeting different preferences of different decision makers on main effect and interactive effect recognition, and recognizing important influence factors of a simulation system in real time, thereby facilitating government and enterprise departments to improve key links in time so as to improve the system performance.

Drawings

FIG. 1 is a schematic flow diagram of the process of the present invention;

FIG. 2 is a flow chart of a complaint-recall vehicle in accordance with an embodiment of the present invention;

FIG. 3 is a graph illustrating the variation of parameters observed in 2020 in accordance with the present invention;

FIG. 4 is a graph of a low and high level setting of the variation factor in an embodiment of the present invention;

FIG. 5 is a diagram illustrating an identification result of significant effects of a variation factor according to an embodiment of the present invention.

Detailed Description

The following describes embodiments of the present invention in detail, which are developed based on the technical solutions of the present invention, and give detailed implementation manners and specific operation procedures to further explain the technical solutions of the present invention.

Aiming at the factors with the value range changing at any time point, the method identifies the obvious main effect and the interactive (nonlinear) effect (the interactive effect is abbreviated as the interactive effect) of the changed factors in real time. The method includes the steps that an experimental design matrix in a Morris base effect method is simplified, the base effect of a current variation factor is obtained through real-time sampling, and the mean value and the standard deviation of a base effect sample are estimated; because the distribution of the basal effect sample is unknown, adopting hypothesis test based on non-parametric Bootstrap to obtain the p value of the current variation factor about the main effect and the interactive effect hypothesis test; and dividing the p values of the main effect and the interactive effect into two independent layers for processing, and respectively obtaining corresponding significance levels based on historical identification results of different effect layers so as to judge whether the main effect and the interactive effect of the current variation factor are significant. The invention solves the problem of real-time hypothesis testing that the real-time arriving main effect and the interactive effect hypothesis are not independent by adopting the hierarchical structure to identify the importance of the factor effect on line and controlling the error of the judgment result.

The following detailed description of the present patent will be made in conjunction with the accompanying drawings and examples.

The embodiment of the invention provides a factor effect online identification method based on hierarchical error control, which is realized by the specific steps shown in figure 1. In the embodiment, an online automobile recall simulation model is constructed, and the specific flow from customer complaints to recall/non-recall based on the model is shown in fig. 2; the embodiment also uses the published data of the automobile recall in the middle of 2019 in 2015, performs initialization setting on 29 parameters in the model, uses the Chinese automobile recall data in 2020 as a real-time input to observe the dynamic change of the parameters, and pays attention to the real-time influence of the changed parameters on the system performance (the average flow time of complaints in the system), wherein fig. 3 shows the change of 10 observation point parameters in 2020. The specific implementation mode comprises the following steps:

step 1, initializing parameters of main effect recognition and interactive effect recognition according to the FDR level eta of the overall factor effect recognition; the method specifically comprises the following steps:

step 1.1, setting the FDR level eta of the whole factor effect recognition as 0.05, and distributing the FDR level of the main effect recognition

And interaction recognition FDR levels

0.025 and 0.025 respectively;

And defining a maximum increment of the verify level in the recognition of the main effect

Step 2, aiming at k factors existing in the online data-driven simulation model, acquiring a single factor which changes in the value range of any observation point t in the model, and recording the single factor as a change factor l_tAnd further constructing a real-time design matrix. Wherein the variation factor l_t∈{1，2，…，k}。

In a specific example, a parameter P with the number 1 is observed on 23.01.2020_UDIs changed from 0.0285 to 0.03, the current time point is marked as observation point t equal to 1, and the change factor l is marked₁ Get factor l 1₁Random form of N-200 sampling matrices

Longitudinal splicing construction real-time design matrix

Where i ∈ {1, 2, …, 200 };

the sampling matrix is in random form

Is a2 × 29 dimensional matrix in which a behavior factor is combined

Another combination of behavior factors

Δ is a combination of two factors

The difference in the above.

Further, the variation factor l₁N sampling matrix random form of 1

The acquisition process was as follows and repeated N200 times:

step a1, constructing a (29+1) × 29-dimensional sampling matrix B composed of 0 s and 1 s such that the difference between row 2 and row 1 in column 1 is 1, then using the sampling matrix B as follows:

step A2, converting the sampling matrix B into the ith random form required by the observation point t being 1

So that 29 factors can only be in the value range of-1, 1]And using Δ 2/3, i.e., { -1, -1/3, 1/3, 1}, using the following conversion equation:

wherein D is^*Is a diagonal matrix with k-29 dimensions, whose diagonal elements are 1 or-1; x is the number of^*Is a combination of factors

Making each factor take values only in { -1, -1/3, 1/3 }; j. the design is a square_29+1，29A matrix of (29+1) × 29 dimensions with elements all being 1; p^*A 29 × 29 random permutation matrix, in which only one element in each row and column is 1, and the rest elements are 0;

step A3, in order to calculate the variation factor l alone₁1, each selected

The initial input line (i.e., line 0) in (c) and the factor l₁Forming a new matrix by the row corresponding to 1 to obtain a real-time sampling matrix random form

Wherein the matrix is only in the l₁Column 1 differs by Δ; the initial input line is

With each other row differing by a factor of delta only in one column.

And 3, inputting the real-time design matrix into a simulation model to obtain simulation output, and calculating the base effect of the variation factor.

Defining the expectation that changing each factor from a low level to a high level will not reduce the simulation output, all factors with observation point t of 1 take the current level as a low level and increase (or decrease) the low level by 0.1 as a high level to keep the average flow time monotonically non-decreasing.

Firstly, real-time design matrix

The values of the factors are reversely normalized to a real range formed by a low level and a high level, and then the factors are processed line by lineCombining the input simulation models to obtain corresponding simulation output, and calculating each pair of different factor combinations

And

with respect to the variation factor l₁Define the ith radical effect

The calculation formula of (2) is as follows:

wherein, the first and the second end of the pipe are connected with each other,

and

respectively corresponding to random form of sampling matrix

The initial input row and the factor row of,

are respectively a combination of factors

And

simulation output; finally obtaining the original base effect sample

FIG. 4 shows the variation factor l for 10 observation points_tLow and high level settings of e.g. variation factor l ₁1 by current level 0.03 (low)Level) falls from 0.1 to 0.027 (high level), the average flow time is monotonously not decreased.

Value x of each factor_j(j ∈ {1, …, 29}) the formula for inverse normalization is as follows:

step 4, according to the original variation factor base effect sample, converting to obtain a base effect new sample for main effect hypothesis test and a base effect new sample for interactive effect test, and the specific process is as follows:

step 4.1, setting a factor main effect threshold value delta_ME0.05 and interaction threshold Δ_IESet the principle effect hypothesis test at 0.05

Interactive effect hypothesis testing

Step 4.2, calculating the sample mean value of all the N-200 original base effect samples

And standard deviation of sample

The calculation formula is as follows:

step 4.3, respectively enabling the ith to be less than or equal to 200 original base effect samples

Conversion into samples required for the hypothesis test of the main effect

And samples required for hypothesis testing of interaction effects

The conversion formula is as follows:

step 5, from the new sample of the primary effect of the hypothesis test

In the Bootstrap sample, N is independently sampled 200 times to obtain the main effect hypothesis test

And new samples of base effects from interaction effects tests

In the Bootstrap sample, N is independently sampled 200 times to obtain the mutual effect hypothesis test

The step needs to be repeated 1000 times, and 1000 Bootstrap samples for main effect hypothesis test are obtained respectively

Bootstrap sample for testing with interactive effect hypothesis

Step 6, Bootstrap samples for each main effect hypothesis test

Bootstrap samples tested with each interactive effect hypothesis

Respectively carrying out statistics to obtain corresponding test statistics

And

then B Bootstrap statistics are obtained by two effect hypothesis tests

And

where B is {1, …, B }.

Further, the test statistic

Sum statistics

The specific calculation process of (2) is as follows:

step 6.1, calculating Bootstrap samples of each main effect hypothesis test

Sample mean of

Computing Bootstrap samples for each interaction hypothesis test

Sample mean of

And standard deviation of sample

Wherein B is epsilon {1, …, B }, and is used for distinguishing different Bootstrap samples of the main effect hypothesis test and different Bootstrap samples of the interactive effect hypothesis test;

step 6.2, calculating Bootstrap samples of each main effect hypothesis test

Test statistic of

And calculating Bootstrap samples for each interaction effect hypothesis test

Test statistic of

Step 7, comparing Bootstrap sample statistic of the main effect hypothesis test with the primordial effect sample main effect statistic, and calculating the p value of the main effect test

The calculation process is as follows:

step 7.1: computing dominant effect statistics of primordial effect samples

And interaction effect statistics

Step 7.2: calculating the variation factor l₁Checking p-value for 1-prime effect hypothesis

Checking p-value of sum interaction hypothesis

Step 8, determining the observation point of the most recent important main effect identified by the cut-off observation point t being 1

Observation points of important interaction effects

The LORD1 rule is used to obtain the test level of the observation point t-1 main effect layer

And the level of examination of the interaction effect layer

The specific process is as follows:

step 8.1, generating a monotonically non-increasing non-negative sequence

So that

In this example

Step 8.2, since the cut-off observation point t is 1, no important main effect is identified yet, that is, the historical observation point of the important main effect which is identified last time is

Calculating the inspection level of the observation point t-1 main effect layer

Step 8.3, because the cut-off observation point t is 1, no important interaction effect is identified yet, that is, the historical observation point for identifying the important interaction effect last time is

Calculating the test level of the observation point t-1 interactive effect hypothesis

Is given by the formula

Step 9, judging the p value of the main effect layer

Greater than the level of examination of the layer

Greater than the level of examination of the layer

If yes, determining the variation factor l_tHas no important interaction effect, otherwise, the variation factor l is judged_tHas important interaction effect; completing observation point t to variable factor l_tIdentification of (1).

In the present embodiment, specifically, in observation point t ═ 1,

determining a shift factor P_UDHas no important main effect; while

Determining a variation factor P_UDHas important interaction effect.

In this example, 10 variable factors were observed in 2020, let l_tFor the sequence number of the variable factor, the above steps are repeated to complete the identification of the single variable factor effect of the observation point t.

Notably, when the most recent observation point t is the distance from the historical observation point t, the important main/interactive historical observation point is identified

Or

The above steps 8.2 and 8.3 need to be calculated based on the historical observation points and the maximum increment of the test level, and if the observation point 2 obtains the test level, the process is as follows:

step 8.2, because the cut-off observation point t is 2, no important main effect is identified yet, that is, the historical observation point which has identified the important main effect last time is

Calculating the test level of the observation point t-2 main effect hypothesis

The formula is as follows:

step 8.3, because the observation point t is 1, the important interaction effect is identified, that is, the observation point of the important interaction effect is identified last time

Calculating an observation point t ═2 level of examination of the Interactive Effect hypothesis

Is given by the formula

Fig. 5 shows the results of the 10-variation factor significant primary/interaction effect identification. As can be seen from FIG. 5, the method of the present invention considers the mean value and standard deviation index of the Morris-based effect method, generates the p value of the hypothesis test of the main effect and the interactive effect under the condition of no parameter, has certain flexibility in the dynamic distribution of the test level, can accurately identify the important effect of the variation factor, and can facilitate the management department to adjust the key link of the corresponding real system in time.

The invention breaks through the traditional offline factor effect identification method, can obtain unbiased results by combining a non-parametric Bootstrap hypothesis testing method aiming at the condition of single factor change in an online data-driven simulation model and considering the unknown characteristic of the distribution of a base effect sample, adopts a layered structure and combines an LORD1 method to control the error level of factor effect identification in the whole real-time process, effectively identifies important influence factors of a simulation system in real time, finds the bottleneck links of a real system, and has important significance for improving systems such as a supply line, a workshop and the like.

It should be noted that, the values of the parameters in the foregoing embodiments do not limit the scope of the present invention, and the values of the parameters may be set and adjusted according to actual needs.

The hierarchical structure adopted by the invention controls the hierarchical Online-FDR to be lower than a set level, so that the overall Online-FDR is always controlled below eta and cannot be influenced by the dependence of a p value between a main effect layer and an interactive effect layer, and the theoretical basis is the following theorem.

The Online-FDRs of the theorem current main effect layer and the interaction effect layer are respectively controlled at

And

within and satisfy

Can make the observation point

Hypothesis tested Total Online-FDR satisfaction

And (3) proving that: at the observation point t, if the layered Online-FDR is controlled at

And

within, the following can be obtained:

wherein

And

the number of important main effects and interaction effects are identified for the cut-off observation point t, respectively, and

and

the number of important main effects and interaction effects are identified for the errors, respectively. Therefore, for the first 2t factor effect test, canTo obtain the total Online-FDR of

The first inequality holds depending on

And is

The second inequality follows the premise assumption. Therefore, when

In this case, the FDR (t) < eta.

Claims

1. A factor effect online identification method based on hierarchical error control is characterized by comprising the following steps:

Observation points of important interaction effects

Separately obtaining the inspection level of the main effect layer of the observation point t by using the LORD1 rule

And the level of examination of the interaction effect layer

Step 9, judging the p value of the main effect layer

Greater than the level of examination of the layer

Greater than the level of examination of the layer

If yes, determining the variation factor l_tHas no important interaction effect, otherwise, the variation factor l is judged_tHas important interaction effect; completing the random observation point t to the variation factor l_tIdentification of (1).

2. The method according to claim 1, wherein the initialization of parameters for the recognition of the main effect and the recognition of the interaction effect is specifically:

And interaction recognition FDR levels

And satisfy

Identifying a maximum verify level at which an observation point t ═ 1 is initialized for interaction effects

And defining a maximum increment of the check level in interactive effect recognition

3. The method of claim 1, wherein the method for constructing the real-time design matrix in step 2 comprises: obtaining a variation factor l_tN sampling matrix random form

Then longitudinally splicing to construct a real-time design matrix

The sampling matrix is in random form

Is a2 xk dimensional matrix in which a behavior factor is combined

Another combination of behavior factors

Δ is a combination of two line factors

The difference in the above.

4. The method of claim 3, wherein each sampling matrix is in a random form

The acquisition process comprises the following steps:

So that each factor j epsilon {1, 2, …, k } can only be within the value range of [ -1, 1 [ ]]Is taken over p discrete positions, i.e. x_jE { -1, -1+2/(p-1), …, 1}, and defining the conversion formula as follows:

wherein D is^*Is a diagonal matrix of k dimensions, the diagonal elements of which are 1 or-1; x is the number of^*Each factor x in_jThe value range of (a) is { -1, -1+2/(p-1), …, 1-delta }; j. the design is a square_k+1，kA (k +1) × k-dimensional matrix in which all elements are 1; p is^*The random permutation matrix is k multiplied by k, only one element in each row and each column of the random permutation matrix is 1, and the rest elements are 0;

step A3, selecting

Wherein the matrix is only in the l_tThe columns differ by a.

5. The method according to claim 1, wherein step 3 is specifically: will design the matrix in real time

The values of the factors are reversely normalized to a real range, the factor combinations are input into a simulation model line by line to obtain corresponding simulation output, and finally, each pair of different factor combinations is calculated

And

with respect to the variation factor l_tThe radical effect of (2); ith radical effect

The calculation formula of (2):

wherein the content of the first and second substances,

and

respectively in random form of sampling matrix

The 0 th line and the variation factor line,

are respectively a combination of factors

And

simulation output;

line 0, in

With each other row differing by a in only one column.

6. The method according to claim 1, wherein the step 4 comprises the following specific processes:

Interactive effect hypothesis testing

Step 4.2, calculating N original base effect samples

Sample mean of

And standard deviation of sample

The calculation formula is as follows:

And samples required for hypothesis testing of interaction effects

7. The method of claim 1, wherein the step 6 of performing statistics on each Bootstrap sample to obtain test statistics is calculated by:

step 6.1, calculating Bootstrap samples of each main effect hypothesis test

Sample mean of

Computing Bootstrap samples for each interaction hypothesis test

Sample mean of

And standard deviation of sample

step 6.2, calculating Bootstrap samples of each main effect hypothesis test

Test statistic of

And calculating Bootstrap samples for each interaction effect hypothesis test

Test statistic of

8. The method of claim 1, wherein the p-value calculation method for the main effect test and the interactive effect test is as follows:

step 7.1: computing dominant effect statistics of primordial effect samples

And interaction effect statistics

In the formula (I), the compound is shown in the specification,

and

Step 7.2: calculating the variation factor l_tHypothesis testing for p-value

Checking p-value of sum interaction hypothesis

Bootstrap samples representing the b-th hypothesis test for major effects

The test statistic of (a) is,

bootstrap samples examined for the b-th cross-effect hypothesis

The test statistic of (1).

9. The method according to claim 1, wherein step 8 is specifically:

step 8.1, generating a monotonically non-increasing non-negative sequence

So that

If no significant primary effect has been identified in the past, then order

。

10. An online identification device for factor effect based on hierarchical error control, comprising a memory and a processor, the memory having stored therein a computer program, characterized in that the computer program, when executed by the processor, causes the processor to carry out the method according to any one of claims 1 to 9.