CN110598243B

CN110598243B - Virtual sample capacity expansion method based on historical data of mechanical product

Info

Publication number: CN110598243B
Application number: CN201910681887.8A
Authority: CN
Inventors: 裘乐淼; 李恒; 张树有; 王自立; 谭建荣
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2019-07-26
Filing date: 2019-07-26
Publication date: 2021-04-30
Anticipated expiration: 2039-07-26
Also published as: CN110598243A

Abstract

The invention discloses a virtual sample capacity expansion method based on historical data of mechanical products, which comprises the steps of firstly determining the virtual sample capacity of a virtual sample in a small sample problem according to actually measured historical data; then, constructing a sample pool for generating a virtual sample based on historical data of mechanical production and related priori knowledge of the mechanical production; then, sampling samples based on a wheel roulette sampling idea, and designing a virtual sample generation rule based on an agent model idea and a Jacobi's rotation theory; and finally, feasible expansion samples are reserved according to the sample rationality judgment condition, so that the expansion of the virtual sample for training the small sample regression problem for predicting the mechanical assembly precision is realized. The achievement of the invention can be used for expanding the sample capacity of the small-capacity sample machine learning training model, can solve the problem of insufficient sample quantity in the mechanical assembly precision prediction, and has important significance for researching the small sample problem of the tolerance transfer of the customized product by using a machine learning regression method.

Description

Virtual sample capacity expansion method based on historical data of mechanical product

Technical Field

The invention relates to a virtual sample capacity expansion method, in particular to a virtual sample capacity expansion method based on historical data of mechanical products.

Background

Machine learning techniques based on large sample data have been widely used in different fields. With the introduction of intelligent manufacturing concepts, the combination of machine learning techniques and the mechanical field is becoming more and more compact. However, with the continuous improvement of the mechanical design and manufacturing level, the mechanical products have the trend of diversified design requirements, nonstandard production and personalized customized production, so that some mechanical field problems no longer have the condition of generating large-capacity sample data, and the application of the machine learning technology is limited.

At present, machine learning in the mechanical field is limited to application scenes with sufficient sample quantity, such as image identification, signal identification and the like. And in most cases, a classification problem such as object recognition. The machine learning regression problem of small-capacity samples in the related field of traditional mechanical design is not fully taken into consideration. At present, tolerance transfer research mostly adopts a tolerance analysis method, and the method mostly uses a linear transfer model for tolerance transfer research and does not consider nonlinear errors. The introduction of machine learning techniques has helped improve this situation.

Most complex mechanical products are produced in small batches in a personalized and customized mode, and the problem of small samples is necessarily faced in the process of assembly precision prediction and deviation control of the products. At present, the related research in the mechanical field using the virtual sample capacity expansion technology is mostly limited to application scenarios in which sample labels are clear or are easy to obtain, such as single small sample generation. The virtual sample generation methods for regression-like problems are less studied.

For the prediction of the assembly precision of mechanical products, factors influencing the quality of parts in actual machining are various. The factors of production equipment precision, personnel operation factors, production inherent noise and the like cause the sizes of parts produced by different manufacturers to obey a certain probability distribution. At present, the research mostly adopts Gaussian distribution approximation to replace an actual product size distribution model. The data samples generated may not be consistent with the actual production situation. Therefore, historical data of research problems are introduced, and the virtual sample generation method is used for expansion, which is beneficial to introducing priori knowledge in the mechanical field.

Disclosure of Invention

The invention aims to provide a virtual sample capacity expansion method based on historical data of mechanical products aiming at the defects of the prior art.

The purpose of the invention is realized by the following technical scheme: a virtual sample capacity expansion method based on historical data of mechanical products comprises the following steps:

(1) reading historical data of the mechanical product as a sample in an original training set D, wherein the sample comprises input features and output features; obtaining potential parameters including virtual sample capacity n' and initialized candidate sample library capacity n according to the original training set D_Dc′；

The input features are dimensional tolerance, shape tolerance and position tolerance;

the output characteristic is assembly accuracy;

the original training set D { (x {)₁,y₁),(x₂,y₂)...,(x_n,y_n) }; where n is the number of samples, x_iIs a d-dimensional input feature, y_iAs an output characteristic, y_iIs a one-dimensional vector, i is 1 to n.

The value range of the virtual sample capacity n' is n multiplied by 2²≤n′≤n×2^d；

The initialized candidate sample library capacity n_Dc′＝n′；

(2) Based on the characteristic distribution rule in the original training set D obtained in the step (1), carrying out grouping, copying and expanding on the original training set D to obtain a sample pool D_pThe method comprises the following substeps:

(2.1) determining a sample set G_k: according to the output characteristic y obtained in the step (1)_iThe distribution of the magnitude of the values divides it into K intervals (g)_k-1,g_k]The interval end point g is calculated by the following formula_k：

Wherein K is 1, 2.., K; y is_minAs output characteristic y_iMinimum value of, y_maxAs output characteristic y_iMaximum value of (d); when k is 1, the 1 st interval is (g)₀,g₁]Wherein g is₀＝y_min(ii) a Correspondingly, dividing the original training set D into K groups according to the following formula to obtain a sample set G_k：

When k is 1, G₁＝{D_i|g₀≤y_i≤g₁,i＝1,2,...,n}

K2, 3, K, G_k＝{D_i|g_k-1＜y_i≤g_k,i＝1,2,...,n}

Wherein D is_iThe ith sample in the original training set D is taken;

(2.2) determining the extended number n 'of samples'_k: for the sample set G obtained in the step (2.1)_kRespectively copying the samples in each group, and collecting the samples in each group to obtain a sample set G_kNumber n of samples of_kExpanded to original

Multiplying to obtain an extended sample set G'_k(ii) a Wherein n is_kIs a set of samples G of each group_kThe number of samples in (1);

(2.3) expanding sample set G 'obtained in the step (2.2)'_kThe groups of samples form a sample pool D together_p；

(3) Sample cell D obtained from step (2.3)_pThe method comprises the following steps of performing medium random sampling, designing a virtual sample generation rule based on an agent model and a Jacobi rotation theory, performing sample capacity expansion operation on a selected sample, and forming a candidate virtual sample library Dc' by all candidate virtual samples obtained after the capacity expansion operation, wherein the method comprises the following substeps:

(3.1) offset of input features: from the sample cell D_pIn which a sample (x) is randomly taken_p,y_p) Generating virtual input x 'by an input feature offset operation'_p＝x_p± Δ, where positive and negative are randomly determined, and the offset Δ is determined by:

(3.2) constructing a response model J (x ') based on Jacobian's moment theory '_p): according to the assembly condition of the mechanical product assembly, a local coordinate system is constructed at the geometric center of each tolerance, and a response model J (x'_p) Determined according to the following formula:

wherein F is the number of local coordinate systems;

in the form of a jacobian matrix,

is the tolerance curl.

(3.3) generation of output features: constructing virtual output y 'based on proxy model'_pThe construction method comprises the following steps:

y′_p＝J(x′_p)+ε_p

wherein epsilon_pIs Gaussian random noise;

(3.4) obtaining a set of candidate virtual samples (x'_p,y′_p)：

(3.5) repeating the steps (3.1) - (3.4) until the number of the candidate virtual samples reaches the initial candidate sample library capacity n acquired in the step (1)_Dc′And a candidate virtual sample library Dc' is constructed.

(4) And (3) performing sample rationality screening on the candidate virtual sample library Dc' formed in the step (3.5), and reserving the candidate virtual samples meeting the sample rationality judgment condition as virtual expansion samples, wherein the method comprises the following substeps:

(4.1) randomly extracting n from the candidate virtual sample library Dc' constructed in the step (3.5)_mMixing the group candidate virtual samples with the original training set D obtained in the step (1) to obtain a mixed sample D_m；

(4.2) formulating rationality judgment indexes and conditions: the rationality determination index Pd (-) includes a sample mean E (-) and a sample variance σ²(. cndot.), sample skewness Skaew (), sample Kurt () expressed as:

Pd(·)＝{E(·),σ²(·),Skew(·),Kurt(·)}

the rationality determination conditions are:

Pd(D_m)≥ξPd(D)

where ξ is the confidence;

(4.3) mixing sample D obtained in the step (4.1)_mAnd (3) carrying out rationality judgment according to the rationality judgment condition formulated in the step (4.2) to obtain a virtual capacity expansion sample, wherein the rationality judgment comprises the following two conditions:

(4.3.1) if the sample D is mixed_mIf the criterion Pd (-) satisfies the criterion of the rationality judgment in the step (4.2), n extracted in the step (4.1) is retained_mThe group candidate virtual samples are used as virtual expansion samples;

(4.3.2) if the sample D is mixed_mDoes not satisfy the rationality judgment condition, and extracts n from the step (4.1)_mThe group of candidate virtual samples is put back into the candidate virtual sample library Dc';

(4.4) repeating the steps (4.1) - (4.3) and continuously obtaining the virtual sample capacity n'; when the number of the reserved virtual expansion samples reaches the virtual sample capacity n' obtained in the step (1), or new virtual expansion samples cannot be generated after three continuous screenings, stopping extraction;

(5) judging whether the number of the reserved virtual expansion samples reaches the virtual sample capacity n' obtained in the step (1) or not to obtain a virtual sample set, wherein the method comprises the following two conditions:

(5.1) if the number M of the virtual expansion samples reserved in the step (4) reaches the virtual sample capacity n', completing the expansion of the virtual samples to obtain a virtual sample set;

(5.2) if the number M of the virtual expansion samples reserved in the step (4) does not reach the capacity n' of the virtual samples, updating the capacity of the candidate sample library to be n_Dc′And (4) n '-M, jumping to the step (3) to form a new candidate virtual sample library Dc', continuously obtaining virtual expansion samples, and supplementing the insufficient part.

Further, the dimensional tolerance in the step (1) includes a basic size and a dimensional deviation.

Further, the shape tolerance in the step (1) includes straightness, flatness, roundness, cylindricity, line profile, and surface profile.

Further, the position tolerance in the step (1) comprises parallelism, perpendicularity, inclination, coaxiality, symmetry, position degree, circular run-out and full run-out.

Further, the assembling precision in the step (1) comprises assembling size precision, assembling angle deviation, rotation deviation, coaxiality and verticality.

Further, each set of extended sample set G 'in the step (2.2)'_kThe number of samples in

Further, n in the step (4.1)_mGet

An internal integer value.

Further, in the step (4.1), when the number of candidate dummy samples in the candidate dummy sample library Dc' is less than n_mWhen grouping, all the candidate virtual samples in the current candidate virtual sample library Dc' are taken to be mixed with the original training set D to obtain a mixed sample D_m。

Further, the value range of the confidence coefficient xi in the step (4.2) is 0.9-1.

The invention has the beneficial effects that: firstly, determining the virtual sample capacity of a virtual sample in a small sample problem according to actually measured historical data; then, constructing a sample pool for generating a virtual sample based on historical data of mechanical production and related priori knowledge of the mechanical production; then, sampling samples based on a wheel roulette sampling idea, and designing a virtual sample generation rule based on an agent model idea and a Jacobi's rotation theory; and finally, feasible expansion samples are reserved according to the sample rationality judgment condition, so that the expansion of the virtual sample for training the small sample regression problem for predicting the mechanical assembly precision is realized. The achievement of the invention can be used for expanding the sample capacity of the small-capacity sample machine learning training model, can solve the problem of insufficient sample quantity in the mechanical assembly precision prediction, and has important significance for researching the small sample problem of the tolerance transfer of the customized product by using a machine learning regression method.

Drawings

FIG. 1 is a flow chart of the present invention;

FIG. 2 is an example assembly view of the present invention;

FIG. 3 is a diagram of example dimensions and local coordinate systems of the present invention;

FIG. 4 is an example tolerance routing diagram of the present invention.

Detailed description of the invention

The method is described below on the basis of examples, but the method is not limited to the example problems. In the following detailed description of the present method, the following specific technical details are set forth in detail. The methods and operations provided are not described in any great detail in order to avoid obscuring the essence of the present method.

The overall flow chart of the virtual sample capacity expansion method based on the historical data of the mechanical product aiming at the problem of small sample regression of mechanical assembly precision prediction is shown in figure 1 and comprises the following steps:

(1) and reading real historical data of the mechanical product as samples in an original training set D, wherein the samples comprise input features and output features. Obtaining potential parameters including virtual sample capacity n' according to original training set D, and initializing candidate sample library capacity n_Dc′。

the output characteristic is assembly accuracy;

the dimensional tolerance includes a basic size, a dimensional deviation.

The shape tolerance includes straightness, flatness, roundness, cylindricity, line profile, and surface profile.

The position tolerance comprises parallelism, perpendicularity, inclination, coaxiality, symmetry, position degree, circular run-out and full run-out.

The assembly precision refers to the precision actually achieved after assembly, and comprises assembly size precision, assembly angle deviation, rotation deviation, coaxiality and perpendicularity.

The virtual sample capacity n' ranges from n × 2²≤n′≤n×2^dN' is arbitrarily taken within the range;

the initialized candidate sample library capacity n_Dc′＝n′；

(2) Based on the characteristic distribution rule in the original training set D obtained in the step (1), carrying out grouping, copying and expanding on the original training set D to obtain a sample pool D_p(ii) a The characteristic distribution refers to the distribution of the magnitude of the output characteristic value; the method comprises the following substeps:

(2.1) determining a sample set G_k: according to the output characteristic y obtained in the step (1)_iThe distribution of the magnitude of the values divides it equally into K intervals (g)_k-1,g_k]The interval end point g is calculated by the following formula_k：

Wherein K is 1, 2.., K; y is_minAs output characteristic y_iMinimum value of, y_maxAs output characteristic y_iMaximum value of (d); in particular, when k is 1, the 1 st interval is (g)₀,g₁]Wherein g is₀＝y_min；

Correspondingly, dividing the original training set D into K groups according to the following formula to obtain a sample set G_k：

G_k＝{D_i|g_k-1＜y_i≤g_k,i＝1,2,...,n}

Wherein D is_iThe ith sample in the original training set D is taken; in particular, when k is 1, G₁＝{D_i|g₀≤y_i≤g₁,i＝1,2,...,n}；

Multiplying to obtain an extended sample set G'_k(ii) a Respective sets of extended sample sets G'_kThe number of samples in

Wherein n is_kIs a set of samples G of each group_kAnd the number of samples in (2) and the output characteristic y in step (2.1)_iThe distribution of the numerical values is related; e (n)_k) Represents n_kThe average value of (a) of (b),

indicating a ceiling operation.

(2.3) expanding sample set G 'obtained in the step (2.2)'_kThe groups of samples form a sample pool D together_p：

(3) Sample cell D obtained from step (2.3)_pIn-process random sampling, designing a virtual sample generation rule based on a proxy model and a Jacobi momentum theory, and performing sample expansion on a selected sampleCapacity operation, wherein all candidate virtual samples obtained after the capacity expansion operation form a candidate virtual sample library Dc', and the method comprises the following substeps:

(3.1) offset of input features: from the sample cell D_pIn which a sample (x) is randomly taken_p,y_p) Generating virtual input x 'by an input feature offset operation'_p＝x_p± Δ, where positive and negative are determined randomly, and the offset Δ is a parametric estimate of the original training set D, determined by:

wherein the content of the first and second substances,

is to x_iAveraging;

wherein F is the number of local coordinate systems;

in the form of a jacobian matrix,

is the tolerance curl.

Jacobian matrix

Calculated by the following formula:

dx_f ^F＝dx_F-dx_f

dy_f ^F＝dy_F-dy_f

dz_f ^F＝dz_F-dz_f

wherein dx is_F,dy_F,dz_FIs the global coordinate of the F-th coordinate system, dx_f,dy_f,dz_fIs the global coordinate of the f-th coordinate system. When the x-axis direction of the f-th coordinate system is the same as the x-axis direction of the 0 th coordinate system, cx is 1; otherwise, cx is 0; cy and cz are the same.

Tolerance rotation

Calculated by the following formula:

wherein u, v, w represent the linear deviations in the x, y, z directions; α, β, γ represent angular deviations about the x, y, z axes.

(3.3) generation of output features: constructing virtual output y 'based on proxy model'_pThe assembled functional requirement, i.e. the output signature in the sample, is considered to be a combination of a fixed response model and a local bias to the input signature, hence the virtual output y'_pThe construction method comprises the following steps:

y′_p＝J(x′_p)+ε_p

wherein epsilon_pIs Gaussian random noise;

(3.4) obtaining a set of candidate virtual samples (x'_p,y′_p)：

(4.1) randomly extracting n from the candidate virtual sample library Dc' constructed in the step (3.5)_mMixing the group candidate virtual samples with the original training set D obtained in the step (1) to obtain a mixed sample D_m(ii) a Wherein n is_mCan take any value, but should not be too large, it is recommended to take

An internal integer value; when the number of candidate virtual samples in the candidate virtual sample library Dc' is less than n_mAnd when the virtual samples are grouped, the remaining candidate virtual samples are taken to be mixed with the original training set D.

Pd(·)＝{E(·),σ²(·),Skew(·),Kurt(·)}

the rationality determination conditions are:

Pd(D_m)≥ξPd(D)

and xi is a confidence coefficient and ranges from 0.9 to 1.

(4.3) mixing sample D obtained in the step (4.1)_mPerforming rationality judgment according to the rationality judgment conditions formulated in the step (4.2), and judging the mixed sample D_mCharacteristic pointWhether cloth changes obviously relative to original training set D, obtains virtual dilatation sample, includes following two kinds of situations:

(4.3.1) if the sample D is mixed_mIf the rationality judgment index Pd (-) satisfies the rationality judgment condition in the step (4.2), the selected n is retained_mThe group candidate virtual sample is used as the final virtual expansion sample;

(4.3.2) if the sample D is mixed_mDoes not satisfy the rationality determination condition, n_mThe group samples are placed back in the candidate virtual sample library Dc'.

(4.4) repeating the steps (4.1) - (4.3) and continuing random extraction of n_mPerforming rationality judgment on the candidate virtual samples, and stopping extraction when the number of reserved virtual expansion samples reaches the virtual sample capacity n' obtained in the step (1) or new reasonable virtual expansion samples cannot be generated by continuous and repeated screening;

(5.1) if the number of the virtual expansion samples reserved in the step (4) reaches the virtual sample capacity n', completing the expansion of the virtual samples to obtain a virtual sample set;

(5.2) if the number M of the reserved virtual expansion samples does not reach the virtual sample capacity n', updating the capacity of the candidate sample library to be n_Dc′And (4) jumping back to the step (3) to form a new candidate virtual sample library Dc', continuously obtaining virtual expansion samples, and supplementing the insufficient part.

Therefore, virtual sample capacity expansion operation based on actual measurement historical data is completed, and the virtual and real mixed sample set subjected to capacity expansion can be used for regression problem training in small sample machine learning.

Examples

Fig. 1 is a flowchart of a virtual sample capacity expansion method implemented by an example of the method. As shown in fig. 1, the virtual sample capacity expansion method based on historical data for the machine field according to the present invention includes the following steps:

(1) reading the existing real historical data of the research problem, extracting effective actual measurement samples, and obtaining potential parameters according to the actual samples.

In this example, a handle base assembly is illustrated, and FIG. 2 is a schematic view of an example assembly of the method. The assembly body consists of two parts: a handle and a base. Although relatively simple in construction, the assembly contains 3 dimensional tolerance variables, and 4 typical form and position tolerances; figure 3 identifies the 7 tolerance elements and associated dimensions of the mechanical product assembly described above.

And analyzing the assembly body, and extracting effective characteristic variables to construct a sample (x, y). Where x is a 4-dimensional input feature, in this example, 3 dimensional tolerance variables with 1 independent shape tolerance;_yfor a 1-dimensional output characteristic, the output characteristic in this example refers to the deviation of the handle end face in the direction of the axis of rotation after assembly is complete. Table 1 shows the input characteristics and the corresponding variables.

TABLE 1

Input feature x^d	Type of tolerance	Range of variables
			x¹	Dimensional deviation of 15 + -0.05	14.95～15.05
x²	Shaft dimensional tolerance phi 15g6	14.983～14.994
			x³	Hole size tolerance phi 15H7	15～15.018
x⁴	Face profile degree of 0.1	-0.05～0.05

Reading historical data to obtain 20 original training sets

As shown in table 2:

TABLE 2

From the actual sample volumes, the final produced virtual sample volume is determined according to the following equation:

n×2²≤n′≤n×2^d

in the formula, n' is 100 since n is 20 and d is 4. Simultaneous initialization of candidate sample pool capacity n_Dc′＝100。

(2) Based on the prior knowledge of the original training set D and the mechanical assembly, the characteristic distribution rule in the original training set D is mined, the original training set D is subjected to grouping, copying and expanding, and the copied actual measurement sample set is used as a sample pool D for expanding operation_p。

Further, the step 2 specifically includes:

according to the steps 2.1-2.2, dividing the original training set D into 5 groups, and equally dividing the output characteristics of the samples into 5 groups. Table 3 shows the statistics of the parameters, and the expansion factor of each group. The extended groups of samples jointly form a sample pool D for generating virtual samples_p。

TABLE 3

(3) From the sample cell D_pAnd (3) performing random sampling, performing virtual sample expansion on the selected samples based on virtual generation rules generated by a proxy model and a Jacobi's momentum theory, and forming a candidate virtual sample library by all generated virtual samples.

Step 3.1 is performed from the expanded sample pool D_pIn which a sample (x) is randomly taken_p,y_p) And determining a sample characteristic offset as [ 0.010.0010.0010.01 ]]Wherein positive and negative are randomly generated.

Tolerance transfer model is constructed based on Jacobian's rotation theory, and response model J (x ') is constructed by simple assembly structure shown in FIG. 2 '_p) The description is given; from the assembly of the mechanical product assembly shown in fig. 2, a local coordinate system as shown in fig. 3 is constructed at the geometric center of each tolerance.

First, a local coordinate system is determined, as shown in FIG. 3, with coordinate system 0 at the geometric center of reference A, coordinate system 1/2 at the geometric center of the base end face, coordinate 3/4 at the geometric center of the handle mating face, and coordinate 5 at the handle tip. According to the Jacobi's rotation theory, and the relevant parameters in Table 1, the Jacobi matrix of the local coordinate system is as follows:

the overall jacobian matrix is:

fig. 4 is a delivery route for the base handle assembly, which includes two sets of internal delivery (FE1, FE3) and one set of external delivery (FE 2). The tolerance expression for the input features to which the present invention relates can be obtained from tables 4 and 5, where simple constraints are determined by the formula in table 4, such as dimensional tolerances in the input features. The composite constraint tolerance representation is determined by the formula in table 5. The examples herein are given by way of illustration.

TABLE 4

TABLE 5

The fit of the shaft hole in this example is a composite constraint of class 1 in Table 5, corresponding to external transfer FE2 in FIG. 4. While bringing in the corresponding quantity and constant, T₁、T₂、T₃Is determined by the following formula:

J(x_i')＝[J]·[T₁T₂T₃]

y_i'＝J(x_i')+ε_i

and (5) repeatedly executing the steps 3.1-3.4 to generate 100 candidate virtual sample libraries Dc' to be screened.

(4) And (4) performing reasonableness screening on the candidate virtual sample library Dc' generated in the step (3).

The confidence ξ is determined to be 0.95.

Randomly extracting n from the candidate virtual sample library Dc_mMixing 5 groups of data with original training set D, and judging mixed sample D_mSample mean E (-) and sample variance σ of²(. cndot.), sample skewness Skaew (. cndot.), and sample kurtosis K urt (. cndot.).

If Pd (D)_m) 0.95Pd (D) and not more than Pd (D), the selected 5 groups of data are retained as the final virtual sample. If the above condition is not satisfied, the 5 groups of samples are put back into the candidate virtual sample library Dc'; the random drawing of 5 groups of data is continued and step 4.3 is executed.

(5) And if the number of the reserved virtual expansion samples reaches the virtual sample capacity of 100, completing the expansion operation and obtaining a virtual sample library.

If the number of the reserved virtual expansion samples reaches the virtual sample capacity, the number of the reserved virtual expansion samples is m, and the capacity of the updated candidate sample library is n_Dc′N' -m; executing the step 3-4, judging whether the number of the currently reserved virtual expansion samples reaches the virtual sample capacity 100, and completing the expansion of the virtual samples if the number of the currently reserved virtual expansion samples reaches the virtual sample capacity 100 to obtain a virtual sample set; and if not, executing the steps 3-4 again until the number of the currently reserved virtual expansion samples reaches the virtual sample capacity 100 and obtaining a virtual sample set.

Therefore, the virtual sample capacity expansion operation based on the measured historical data is completed.

Claims

1. A virtual sample capacity expansion method based on historical data of mechanical products is characterized by comprising the following steps:

the output characteristic is assembly accuracy;

the original training set D { (x {)₁,y₁),(x₂,y₂)...,(x_n,y_n) }; where n is the number of samples, x_iIs a d-dimensional input feature, y_iAs an output characteristic, y_iIs a one-dimensional vector, i is 1 to n;

The initialized candidate sample library capacity n_Dc′＝n′；

When k is 1, G₁＝{D_i|g₀≤y_i≤g₁,i＝1,2,...,n}

K2, 3, K, G_k＝{D_i|g_k-1＜y_i≤g_k,i＝1,2,...,n}

Wherein D is_iThe ith sample in the original training set D is taken;

wherein F is the number of local coordinate systems;

in the form of a jacobian matrix,

is the tolerance rotation;

y′_p＝J(x′_p)+ε_p

wherein epsilon_pIs Gaussian random noise;

(3.4) obtaining a set of candidate virtual samples (x'_p,y′_p)：

(3.5) repeating the steps (3.1) - (3.4) until the number of the candidate virtual samples reaches the initial candidate sample library capacity n acquired in the step (1)_Dc′Forming a candidate virtual sample library Dc';

Pd(·)＝{E(·),σ²(·),Skew(·),Kurt(·)}

the rationality determination conditions are:

Pd(D_m)≥ξPd(D)

where ξ is the confidence;

(4.31) if the sample D is mixed_mIf the criterion Pd (-) satisfies the criterion of the rationality judgment in the step (4.2), n extracted in the step (4.1) is retained_mThe group candidate virtual samples are used as virtual expansion samples;

2. The method for virtual sample expansion based on historical data of mechanical products according to claim 1, wherein the dimensional tolerance in step (1) comprises basic dimensions and dimensional deviation.

3. The method for virtual sample expansion based on historical data of mechanical products according to claim 1, wherein the shape tolerance in step (1) comprises straightness, flatness, roundness, cylindricity, line profile and surface profile.

4. The method for virtual sample capacity expansion based on historical data of mechanical products according to claim 1, wherein the position tolerance in the step (1) comprises parallelism, perpendicularity, inclination, coaxiality, symmetry, position degree, circular run-out and full run-out.

5. The virtual sample capacity expansion method based on historical data of mechanical products as claimed in claim 1, wherein the assembly precision in the step (1) comprises assembly dimension precision, assembly angle deviation, rotation deviation, coaxiality and verticality.

6. The method for virtual sample expansion based on mechanical product historical data according to claim 1, wherein each set of extended sample set G 'in the step (2.2)'_kThe number of samples in

7. The method for virtual sample expansion based on historical data of mechanical products according to claim 1, wherein n in the step (4.1)_mGet

An internal integer value.

8. The method for expanding virtual samples based on historical data of mechanical products according to claim 1, wherein in the step (4.1), when the number of candidate virtual samples in the candidate virtual sample library Dc' is less than n_mWhen grouping, all the candidate virtual samples in the current candidate virtual sample library Dc' are taken to be mixed with the original training set D to obtain a mixed sample D_m。

9. The virtual sample capacity expansion method based on historical data of mechanical products as claimed in claim 1, wherein the confidence coefficient xi in the step (4.2) is in a range of 0.9-1.