CN112784205A

CN112784205A - Partial differential equation data processing method, system, storage medium, device and application

Info

Publication number: CN112784205A
Application number: CN202110131994.0A
Authority: CN
Inventors: 王斌; 刘吉悦; 王颖; 张建龙; 李尧青; 刘子铭
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2021-01-30
Filing date: 2021-01-30
Publication date: 2021-05-11

Abstract

The invention belongs to the technical field of partial differential equation processing implied by a learning system from data, and discloses a partial differential equation data processing method, a system, a storage medium, equipment and application, wherein an attention mechanism is utilized to sample time space data; constructing a differential term alternative library by using prior knowledge and a basis function representation method; forming a corresponding coefficient of a differential term in a deep network learning alternative library by a plurality of single-layer regression networks; regularization is added into the loss function, and interference terms are reduced through sparse regression to obtain a final partial differential equation. The method can fully utilize the space and time information of observation data, reduce the data quantity required by the model, and improve the precision of the learned partial differential equation while ensuring the efficiency; the method solves the problems that the traditional method for learning the partial differential equation implicit in the system through manual experience and experimental verification is low in efficiency, is easily influenced by subjective factors of people, and is difficult to find a mechanism behind a complex system.

Description

Partial differential equation data processing method, system, storage medium, device and application

Technical Field

The invention belongs to the technical field of processing of partial differential equations implied by learning systems from data, and particularly relates to a method, a system, a storage medium, equipment and application for processing partial differential equation data.

Background

At present: differential equations, and in particular partial differential equations, play an important role in many disciplines and can be used to describe the physical laws behind a given system. Traditionally, partial differential equations have been derived mathematically or physically according to some basic principles, such as Schrodinger's equation in quantum mechanics to molecular dynamics models, Boltzmann's equation to Navy-Stokes equation, and so on. However, the mechanisms behind many complex systems in modern applications (such as many problems in multiphase flow, neuroscience, finance, bioscience, etc.) are not generally understood, and the partial differential equations of these systems are usually derived from empirical equations. With the rapid development of sensors, computing power and data storage technologies over the last decade, large amounts of data can now be easily collected, stored and processed. Such a large amount of data provides new opportunities for discovering (possibly new) physical laws from the data. Therefore, building a model to learn partial differential equations from data to approximate observed complex dynamic data would be of great interest to humans in analyzing and understanding the underlying mechanisms of complex systems in modern applications.

Early summary of the implicit mechanism from the system often relied on human experience, or hypothesis was proposed and then experimental verification was performed. The method is greatly influenced by human subjectivity, the efficiency is low, and the implicit partial differential equation of a complex or newly-appeared system is difficult to obtain by means of an empirical formula. Solutions to the partial differential equations implied by computer technology summary systems have therefore gradually emerged as computing power and data storage technologies have advanced. Joshbangard and Michael Schmidt made preliminary attempts in 2007 and 2009, respectively, on how to learn implicit partial differential equations in systems under data drive, and their main idea was to compare the numerical differential of input data with the analytical differential of candidate functions and determine a nonlinear dynamic system using symbolic regression and evolutionary algorithms. Emmanuel de Bezenac uses a nonlinear dynamics sparse identification method (SINDy) to carry out partial differential equation modeling on sea surface temperature in 2017, and the main idea of the SINDy is to firstly construct a candidate library of sufficiently large partial differential equation items and then learn an implicit partial differential equation from sea surface temperature data to predict the sea surface temperature. In 2018, Marizar Raissi proposes a model for learning unknown parameters on the premise that a nonlinear response function form of a partial differential equation is known, and the main idea of Marizar Raissi is to introduce a regularization idea between two continuous time steps through a Gaussian process. Generally, a method of learning partial differential equations from data focuses on representing observed data with a relatively simple model and obtaining an analytical form of the partial differential equations.

In the existing method of learning the partial differential equation implied by the system from data, there are several problems and drawbacks: (1) the traditional method for learning the partial differential equation implicit in the system through manual experience and experimental verification has low efficiency, is easily influenced by human subjective factors, and is difficult to find a mechanism behind a complex system. (2) The existing method for learning the partial differential equation hidden in the system under data driving has large limitation, more limitation on terms of the partial differential equation which can be learned, and the learning effect is easily influenced by noise. (3) The existing method for learning the partial differential equation implied by the system through deep learning needs a large amount of sample data, while some systems are difficult to acquire enough data for training, and the hardware cost of deep learning is high, and the operation efficiency is not high, so that the use of the methods is limited.

Through the above analysis, the problems and defects of the prior art are as follows:

(1) the traditional method for learning the partial differential equation implicit in the system through manual experience and experimental verification has low efficiency, is easily influenced by human subjective factors, and is difficult to find a mechanism behind a complex system, so that the finally obtained partial differential equation has overlarge error and unstable effect.

(2) The method for learning the partial differential equation implied by the system under the drive of data in the prior art has large limitation, and has more limitation on terms of the partial differential equation which can be learned, so that the learned partial differential equation does not conform to the analytic form of a real equation; and the learning effect is easily influenced by noise, so that the error of the finally obtained partial differential equation is overlarge.

(3) The existing method of learning the partial differential equation implied by the system through deep learning needs a large amount of sample data, while some systems are difficult to acquire enough data for training, and the hardware cost of deep learning is high, and the operation efficiency is not high, so that the use of the methods is limited.

The difficulty in solving the above problems and defects is: how to learn the partial differential equation implied by the system in the case of small data amount, and how to improve learning efficiency and accuracy.

The significance of solving the problems and the defects is as follows: the method for exploring the partial differential equation implied by the system through the data driving method can save a large amount of labor cost, help people to understand the operation rule of the complex system and promote the progress in the field of learning the partial differential equation implied by the system from data.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a partial differential equation data processing method, a system, a storage medium, equipment and application.

The invention is realized in such a way that a partial differential equation data processing method comprises the following steps:

sampling time space data by using an attention mechanism to improve learning precision and initial value robustness;

constructing a differential term alternative library by using prior knowledge and a basis function representation method to obtain a reasonable and complete alternative library;

forming corresponding coefficients of differential terms in a deep network learning alternative library by a plurality of single-layer regression networks to finish primary partial differential equation learning;

regularization is added into the loss function, and interference terms are reduced through sparse regression to obtain a final partial differential equation.

The method utilizes the attention mechanism to sample time space data, and because some systems controlled by the partial differential equation (such as image denoising and image segmentation tasks based on the partial differential equation) are more concerned about the final value of the partial differential equation, the attention mechanism is introduced to improve the precision and the initial value robustness during time sampling, so that the method is more effective in the task of concerning the final value result of the partial differential equation; constructing a differential term alternative library by using prior knowledge and a basis function representation method, so that the alternative library is complete and reasonable, and the calculated amount is reduced; then, learning coefficients corresponding to the candidate items in the candidate library by using a deep network formed by a plurality of single-layer regression networks so as to complete preliminary partial differential equation learning; and finally, adding a regularization term into the loss function to enable the network to become a partial differential equation learner by utilizing a sparse regression method so as to reduce irrelevant interference terms.

Further, the data sampling of the partial differential equation data processing method specifically includes: firstly, performing spatial sampling on input time sequence data, wherein random sampling is used for spatial sampling, and partial points in an input space are selected as observation objects; then time sampling is carried out, uniform sampling is used for time sampling, new sample values at intervals are obtained on the basis of observation points selected by space sampling, an attention mechanism is introduced during time sampling, and the weight of the data of the second half part is increased by controlling the time sampling rate;

further, the network parameters of the partial differential equation data processing method during data sampling are set as follows: boundary width: 5; spatial sampling rate: 2 percent; total time sampling rate: 25 percent; number of time-sampled equal segments: 5; attention mechanism correction: the sampling rate of each section is increased by 5%; segment time sampling rate: 15%, 20%, 25%, 30%, 35%.

Further, the partial differential equation data processing method constructs a differential term alternative library: equations containing partial derivatives or partial differentials of unknown functions are called partial differential equations, the general form of which is expressed as:

u_t(t，x，y)＝F(x，y，u，u_x，u_y，u_xx，u_xy，u_yy，...),(x,y)∈R²,t∈[0,T].；

wherein t representsM, x and y represent space, partial differential equation u_tIs the differential term x, y, u_x，u_y，u_xx，u_xy，u_yy,..; the partial differential equation is expressed in the form:

ω_t＝Θ(ω，u，v)ξ；

wherein Θ represents a matrix composed of various differential terms ω, u and v, and ξ represents coefficients of the differential terms, i.e. a candidate library of the differential terms is constructed, and then the coefficient ξ corresponding to each differential term is learned to obtain a final partial differential equation. Constructing a candidate library based on a basis function representation method, and constructing x, y, u_x，u_y，u_xx，u_xy，u_yyThe basic differential terms are equal, and when a specific system is aimed at, part of complex differential terms closely related to the system can be added into an alternative library according to prior knowledge; the alternative library is not too small, otherwise the accuracy of the final learned partial differential equation is influenced; the alternative library is not too large, otherwise, the operation efficiency is influenced;

alternative bank compression is performed. After the original differential item candidate base is obtained, the candidate base matrix is compressed according to the data sampling result of the previous step to reduce the calculation amount:

wherein

Representing matrix compression, and taking a new compressed matrix as a final differential item alternative library;

the candidate item constructing method selected when constructing the differential item candidate library comprises a forward difference method, a backward difference method, a central difference method and a Chebyshev polynomial interpolation method; when the subsequently obtained partial differential equation needs to be solved iteratively, a Chebyshev polynomial interpolation method is adopted to construct a differential term candidate library.

Further, the differential term coefficient learning of the partial differential equation data processing method: constructing a depth network to obtain a corresponding coefficient of each differential term in the candidate library, wherein the depth network is formed by stacking a plurality of single-layer regression networks to improve the precision of the network training result layer by layer, and the single regression network comprises the following components: obtaining the coefficient xi of the differential term by a regression method for the compressed differential term candidate library, wherein the loss function to be minimized is as follows:

wherein

Indicating the estimated value, the coefficient of the differential term obtained from the ith layer is xi_iFinally, the coefficient finally output by the single-layer regression network is used as the coefficient xi of the partial differential equation;

the regression mode of the network selects least square regression, and an lstsq tool in Numpy is selected to complete the task of least square regression.

Further, the regularization of the partial differential equation data processing method includes: the loss function is modified to:

wherein lambda represents a regularization coefficient, and an interference term with a small coefficient can be omitted from a partial differential equation subjected to sparse regression, so that the finally obtained result is closer to a real partial differential equation;

during the regularization, L1 regularization is selected for sparse regression, and the regularization coefficient lambda of the network is from 10 to 10 as the number of network layers is increased^-6To 10^-5And gradually changing, and the obtained partial differential equation term and the coefficient corresponding to each differential term form a partial differential equation finally learned by the model.

It is a further object of the invention to provide a computer device comprising a memory and a processor, the memory storing a computer program which, when executed by the processor, causes the processor to perform the steps of:

sampling the time-space data by using an attention mechanism;

constructing a differential term alternative library by using prior knowledge and a basis function representation method;

forming a corresponding coefficient of a differential term in a deep network learning alternative library by a plurality of single-layer regression networks;

It is another object of the present invention to provide a computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to perform the steps of:

sampling the time-space data by using an attention mechanism;

Another object of the present invention is to provide a partial differential equation data processing system implementing the partial differential equation data processing method, the partial differential equation data processing system including:

the data sampling module is used for sampling data firstly, and the data sampling is carried out in two aspects of space and time respectively;

the differential term alternative library construction module is used for constructing an alternative library and selecting differential terms to construct a differential term alternative library by utilizing the prior knowledge and a basis function representation method;

the differential term coefficient learning module is used for learning the coefficient corresponding to the candidate item in the candidate library by utilizing a depth network formed by a plurality of single-layer regression networks;

and the regularization screening differential term module is used for adding a regularization term into the loss function to enable the network to become a partial differential equation learner by using a sparse regression method so as to reduce irrelevant interference terms, and the finally obtained partial differential equation term and the coefficient corresponding to each differential term form a partial differential equation finally learned by the network.

Another object of the present invention is to provide a data learning terminal, which is used for implementing the partial differential equation data processing method.

By combining all the technical schemes, the invention has the advantages and positive effects that: according to the method, data are sampled firstly, the data sampling is carried out in two aspects of space and time, and because some systems controlled by partial differential equations (such as image denoising and image segmentation tasks based on the partial differential equations) are more concerned about the final values of the partial differential equations, an attention mechanism is introduced during the time sampling to improve the precision and the initial value robustness, so that the method is more effective in a task of concerning the final value results of the partial differential equations; then constructing an alternative library, and selecting reasonable differential terms to construct a differential term alternative library by using mathematical methods such as priori knowledge, basis function representation and the like; then, learning coefficients corresponding to the candidate items in the candidate library by using a deep network formed by a plurality of single-layer regression networks; and finally, adding a regularization term into the loss function to enable the network to become a partial differential equation learner by utilizing a sparse regression method so as to reduce irrelevant interference terms, wherein the finally obtained partial differential equation term and the coefficient corresponding to each differential term form a partial differential equation finally learned by the network.

Compared with other existing methods for learning partial differential equations from a system, the method can fully utilize space and time information of observation data, reduce data quantity required by a model, and improve the accuracy of the learned partial differential equations while ensuring efficiency. The invention designs a deep differential equation network to learn the partial differential equation implied by the system, and is used for solving the problems that the traditional method for learning the partial differential equation implied by the system through artificial experience and experimental verification has low efficiency, is easily influenced by human subjective factors, and is difficult to find a mechanism behind a complex system. The deep differential equation network can learn a plurality of differential terms of the partial differential equation, and has stronger initial value robustness on the task of processing more sensitive final value data. The deep differential equation network needs small data quantity, low hardware requirement and high operation efficiency.

Table 1 the method of the present invention learns the effect of the classical partial differential equation:

table 2 the method of the present invention learns the effect of the image processing partial differential equation:

average running time of the algorithm: 5.018 s.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings needed to be used in the embodiments of the present application will be briefly described below, and it is obvious that the drawings described below are only some embodiments of the present application, and it is obvious for those skilled in the art that other drawings can be obtained from the drawings without creative efforts.

Fig. 1 is a flowchart of a partial differential equation data processing method according to an embodiment of the present invention.

Fig. 2 is a schematic structural diagram of a partial differential equation data processing system according to an embodiment of the present invention.

Fig. 3 is a flowchart of an implementation of a partial differential equation data processing method according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

In view of the problems in the prior art, the present invention provides a partial differential equation data processing method, system, storage medium, device and application, and the present invention is described in detail below with reference to the accompanying drawings.

As shown in fig. 1, the partial differential equation data processing method provided by the present invention includes the following steps:

s101: sampling the time-space data by using an attention mechanism;

s102: constructing a differential term alternative library by using prior knowledge and a basis function representation method;

s103: forming a corresponding coefficient of a differential term in a deep network learning alternative library by a plurality of single-layer regression networks;

s104: regularization is added into the loss function, and interference terms are reduced through sparse regression to obtain a final partial differential equation.

Those skilled in the art of the method for processing partial differential equation data provided by the present invention can also use other steps to implement, and the method for processing partial differential equation data provided by the present invention in fig. 1 is only a specific example.

As shown in fig. 2, the partial differential equation data processing system provided by the present invention includes:

the data sampling module is used for sampling data firstly, the data sampling is respectively carried out in two aspects of space and time, and because some systems controlled by partial differential equations (such as image denoising and image segmentation tasks based on partial differential equations) are more concerned about the final values of the partial differential equations, an attention mechanism is introduced during the time sampling to improve the precision and the initial value robustness, so that the data sampling module is more effective in the task of concerning the final value results of the partial differential equations;

the differential term alternative library construction module is used for constructing an alternative library, and reasonable differential terms are selected to construct the differential term alternative library by using mathematical methods such as priori knowledge, basis function representation and the like;

The technical solution of the present invention is further described below with reference to the accompanying drawings.

As shown in fig. 3, the partial differential equation data processing method provided by the present invention includes the following steps:

the method comprises the following steps: and (6) sampling data. Firstly, performing spatial sampling on input time sequence data, wherein random sampling is used for spatial sampling, and partial points in an input space are selected as observation objects; and then time sampling is carried out, uniform sampling is used for time sampling, new sample values at intervals are obtained on the basis of observation points selected by space sampling, an attention mechanism is introduced during time sampling, and the weight of the data of the second half part is increased by controlling the time sampling rate so as to increase the precision and the initial value robustness of the model.

The attention mechanism affects the time sampling of the input as follows: the total time sampling rate is 25% of all time sequences, all input time sequences are equally divided into five parts, the time sampling rate of the first part is 15%, the time sampling rate of the second part is 20%, and the like, and the attention mechanism correction value of each section of sampling rate is 5%.

The network parameters at the time of data sampling are set as follows: boundary width: 5; spatial sampling rate: 2 percent; total time sampling rate: 25 percent; number of time-sampled equal segments: 5; attention mechanism correction: the sampling rate of each segment is increased by 5%. Segment time sampling rate: 15%, 20%, 25%, 30%, 35%.

Step two: constructing a differential item alternative library: equations containing the partial derivative (or partial differential) of the unknown function are called partial differential equations, the general form of which can be expressed as:

u_t(t，x，y)＝F(x，y，u，u_x，u_y，u_xx，u_xy，u_yy，...)，(x，y)∈R²，t∈[0，T].；

where t represents time, x and y represent space, partial differential equation u_tIs the differential term x, y, u_x，u_y，u_xx，u_xy，u_yy,.. The partial differential equation is expressed in the present invention as follows:

ω_t＝Θ(ω，u，v)ξ；

wherein Θ represents a matrix composed of various differential terms ω, u and v, and ξ represents coefficients of the differential terms, i.e. a candidate library of the differential terms is constructed, and then the coefficient ξ corresponding to each differential term is learned to obtain a final partial differential equation. Constructing a candidate library based on a basis function representation method, and constructing x, y, u_x，u_y，u_xx，u_xy，u_yyAnd waiting for basic differential terms, and adding part of complex differential terms closely related to a specific system into an alternative library according to prior knowledge when the specific system is targeted. The alternative library is not too small, otherwise the accuracy of the final learned partial differential equation is influenced; the alternative library should not be too large, which would affect the efficiency of operation.

The alternative bank compression is followed. After the original differential item candidate base is obtained, the candidate base matrix is compressed according to the data sampling result of the previous step to reduce the calculation amount:

wherein

And (4) representing matrix compression, and taking the compressed new matrix as a final differential item candidate library.

When a subsequently obtained partial differential equation needs to be solved iteratively, if a solved numerical method is consistent with a construction method, the solving precision is highest, and the Chebyshev polynomial interpolation method is adopted to construct the differential item candidate library in the specific embodiment.

Step three: learning of differential term coefficients: and constructing a deep network to obtain a corresponding coefficient of each differential term in the alternative library. The deep network is stacked by a plurality of single-layer regression networks to improve the precision of the network training result layer by layer, wherein the single regression network is composed of: obtaining the coefficient xi of the differential term by a regression method for the compressed differential term candidate library, wherein the loss function to be minimized is as follows:

wherein

Indicating the estimated value, the coefficient of the differential term obtained from the ith layer is xi_iAnd finally, taking the coefficient finally output by the single-layer regression network as the coefficient xi of the partial differential equation.

Step four: regularization: when constructing the differential term alternative library, an over-complete library is generally constructed in order to be able to represent complex partial differential equations, and therefore interference terms which are not included in the real partial differential equations are included therein. In order to eliminate the interferences and enable the learned partial differential equation to be closer to a real equation, the invention introduces a regularization term and modifies a loss function into:

where λ represents the regularization coefficient. Thus, the partial differential equation subjected to sparse regression omits the interference terms with small coefficients, so that the final result is closer to the real partial differential equation.

During regularization, L1 regularization is selected for sparse regression, and considering that regularization coefficients are not suitable to be set to be too large initially during deep network training, otherwise, terms which may cause some real partial differential equations to exist are ignored in the first layers of the network due to the small coefficients, and therefore the terms are omitted in the first layers of the networkThe regularization coefficient lambda of the network is from 10 as the number of network layers increases^-6To 10^-5Gradually changing. The finally obtained partial differential equation term and the coefficient corresponding to each differential term form a partial differential equation finally learned by the model.

The technical effects of the present invention will be described in detail with reference to simulations.

1. The simulation experiment is completed by Python language on a PC with a CPU being an Intel (R) core (TM) i7-9700, a CPU3.60GHz, a RAM 16.00GB and a ubuntu 16.04 operating system.

2. The experimental data of the invention is the data generated by various classical partial differential equations and the real noise-containing data generated when the partial differential equations are used for image processing. The classical partial differential equations comprise Burgers equations, KdV equations, NLS equations and Navier Stokes equations; the image processing by the partial differential equation includes image denoising data by the thermal diffusion equation and level set image segmentation data by the GAC model. The results of the experiment are shown in tables 1 and 2.

average running time of the algorithm: 5.018 s.

According to the results, the depth differential equation network can learn partial differential equations with abundant types of differential terms, can obtain higher precision and higher efficiency under the condition of smaller required data volume, and still has good learning effect when processing real data containing noise interference generated when image processing is carried out through the partial differential equations. The hardware portion may be implemented using dedicated logic; the software portions may be stored in a memory and executed by a suitable instruction execution system, such as a microprocessor or specially designed hardware. Those skilled in the art will appreciate that the apparatus and methods described above may be implemented using computer executable instructions and/or embodied in processor control code, such code being provided on a carrier medium such as a disk, CD-or DVD-ROM, programmable memory such as read only memory (firmware), or a data carrier such as an optical or electronic signal carrier, for example. The apparatus and its modules of the present invention may be implemented by hardware circuits such as very large scale integrated circuits or gate arrays, semiconductors such as logic chips, transistors, or programmable hardware devices such as field programmable gate arrays, programmable logic devices, etc., or by software executed by various types of processors, or by a combination of hardware circuits and software, e.g., firmware.

The above description is only for the purpose of illustrating the present invention and the appended claims are not to be construed as limiting the scope of the invention, which is intended to cover all modifications, equivalents and improvements that are within the spirit and scope of the invention as defined by the appended claims.

Claims

1. A partial differential equation data processing method, characterized by comprising:

sampling the time-space data by using an attention mechanism;

2. The partial differential equation data processing method according to claim 1, wherein the data sampling of the partial differential equation data processing method specifically includes: firstly, performing spatial sampling on input time sequence data, wherein random sampling is used for spatial sampling, and partial points in an input space are selected as observation objects; and then time sampling is carried out, uniform sampling is used for time sampling, new sample values at intervals are obtained on the basis of observation points selected by space sampling, an attention mechanism is introduced during time sampling, and the weight of the data of the second half part is increased by controlling the time sampling rate.

3. The partial differential equation data processing method according to claim 2, wherein the network parameter settings at the time of data sampling of the partial differential equation data processing method are as follows: boundary width: 5; spatial sampling rate: 2 percent; total time sampling rate: 25 percent; number of time-sampled equal segments: 5; attention mechanism correction: the sampling rate of each section is increased by 5%; segment time sampling rate: 15%, 20%, 25%, 30%, 35%.

4. The partial differential equation data processing method according to claim 1, wherein the partial differential equation data processing method constructs a candidate library of differential terms: equations containing partial derivatives or partial differentials of unknown functions are called partial differential equations, the general form of which is expressed as:

where t represents time, x and y represent space, partial differential equation u_tIs the differential term x, y, u_x，u_y，u_xx，u_xy，u_yy,..; the partial differential equation is expressed in the form:

ω_t＝Θ(ω，u，v)ξ；

wherein theta represents a matrix composed of various differential terms omega, u and v, and xi represents coefficients of the differential terms, namely, a candidate library of the differential terms is constructed, then the coefficient xi corresponding to each differential term is learned to obtain a final partial differential equation, and the construction of the candidate library is based on a basis function representation method to construct x, y, u and u_x，u_y，u_xx，u_xy，u_yyThe basic differential terms are equal, and when a specific system is aimed at, part of complex differential terms closely related to the system can be added into an alternative library according to prior knowledge; the alternative library is not too small, otherwise the accuracy of the final learned partial differential equation is influenced; the alternative library is not too large, otherwise, the operation efficiency is influenced;

and (3) compressing the alternative library, and compressing the alternative library matrix according to the data sampling result of the previous step after obtaining the original differential term alternative library so as to reduce the calculated amount:

C′ω_t＝C′Θ(ω，u，v)ξ；

wherein C' represents matrix compression, and the new compressed matrix is used as a final differential item alternative library;

5. The partial differential equation data processing method according to claim 1, characterized in that a differential term coefficient of the partial differential equation data processing method learns: constructing a depth network to obtain a corresponding coefficient of each differential term in the candidate library, wherein the depth network is formed by stacking a plurality of single-layer regression networks to improve the precision of the network training result layer by layer, and the single regression network comprises the following components: obtaining the coefficient xi of the differential term by a regression method for the compressed differential term candidate library, wherein the loss function to be minimized is as follows:

wherein

6. The partial differential equation data processing method according to claim 1, wherein the regularization of the partial differential equation data processing method comprises: the loss function is modified to:

7. A computer device, characterized in that the computer device comprises a memory and a processor, the memory storing a computer program which, when executed by the processor, causes the processor to carry out the steps of:

sampling the time-space data by using an attention mechanism;

8. A computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to perform the steps of:

sampling the time-space data by using an attention mechanism;

9. A partial differential equation data processing system for implementing the partial differential equation data processing method according to any one of claims 1 to 6, the partial differential equation data processing system comprising:

10. A data learning terminal, characterized in that the data learning terminal is used for realizing the partial differential equation data processing method of any one of claims 1 to 6.