CN113779643A

CN113779643A - Signature handwriting recognition system and method based on pre-training technology and storage medium

Info

Publication number: CN113779643A
Application number: CN202111122613.9A
Authority: CN
Inventors: 覃勋辉; 祁松茂; 曾川
Original assignee: Chongqing Aos Online Information Technology Co ltd
Current assignee: Chongqing Sign Digital Technology Co ltd
Priority date: 2021-09-24
Filing date: 2021-09-24
Publication date: 2021-12-10
Anticipated expiration: 2041-09-24
Also published as: CN113779643B

Abstract

An electronic signature recognition system and method based on a pre-training technology relate to an electronic signature handwriting recognition technology, and the system and method are used for collecting electronic signature stroke related characteristic data, and acquiring signature sample data and signature to-be-detected data; the method comprises the steps of resampling, normalizing and discretizing signature sample data and data to be signed to obtain data with fixed length, respectively inputting the signature sample data and the data to be signed with the fixed length into a sample encoder, determining common characteristics and difference characteristics of the signature sample data and the data to be signed by the encoder through a cosine similarity loss function, and determining that the signature sample data and the data to be signed come from the same signer or different signers. The invention is widely used in places such as e-commerce, e-government affairs and the like needing signature identification.

Description

Signature handwriting recognition system and method based on pre-training technology and storage medium

Technical Field

The invention relates to the technical field of computer information processing, in particular to a handwritten electronic signature comparison method based on a neural network pre-training technology.

Background

The traditional document and protocol signing depends on signature, the signature is often also a judicial identification mode, and the electronic handwritten signature is brought into a remote evidence obtaining system in the court trial process, so the accuracy authentication of a signature comparison algorithm brings convenience to the judicial justice, and the appearance of a new signature comparison authentication technology is more and more urgent. In the handwritten electronic signature comparison neighborhood, the same person can write different signature tracks called as intra-individual difference, and a forger with worry can write extremely similar tracks called as inter-individual identity, so that difficulty is caused in image signature comparison.

Chinese patent application publication No. CN112560636A, "a handwritten signature comparison method and system based on image recognition" proposes signature comparison by a method of comparing with an image in a database, which is poor in generalization capability. The proposed method uses too single characteristics to fully utilize the characteristics acquired by the acquisition plate, such as pressure, rotation angle, etc. The Chinese patent application with the publication number of CN109409254A discloses an electronic contract handwritten signature identification method based on a twin neural network, and discloses an electronic contract handwritten signature identification method based on a twin neural network, wherein a plurality of times of user handwritten signature background reservation is carried out in a real-name authentication link, and a handwritten signature is stored as an image file; selecting two of the handwritten signature image files and one of the handwritten signature image files which is not the user, inputting the two handwritten signature image files and the one of the handwritten signature image files into a twin neural network model for vectorization calculation to obtain a loss function, and obtaining a training set cost function through the loss function; training other handwritten signature image files of the user through an optimization algorithm, and outputting an identification model; when signing online, the handwritten signature on the touch screen input device is instantly stored as an image, the image is input into the identification model, the identification result is output, and the storage and identification work of the handwritten signature is completed. However, the method adopts the image file to form the training set, does not use a large amount of characteristic information of the handwriting, and has low recognition accuracy.

CN 113158887 a is an electronic signature authentication method and device for improving the identification accuracy of electronic signatures. Collecting signature information including a timestamp, coordinate information and pressure information of each pixel point of an electronic signature track; restoring the signature track on the signature track layer according to the coordinate information of the signature track; acquiring a pressure value from the pressure information, and setting the pressure value on a pixel point corresponding to the coordinate information of the signature track on the signature pressure layer; combining the signature track layer and the signature pressure layer to generate a feature map; inputting the characteristic graph into a trained convolutional neural network, randomly selecting the characteristic graphs corresponding to two signature information from a signature information data set to form a data pair, marking the data pair, distinguishing whether the two signature information in the data pair are both correct signatures through marking, building a convolutional neural network model, comparing the characteristic graph with the signature template graph of the signature through the convolutional neural network, and outputting an authentication result. Although the patent application fully utilizes the collected characteristics of the electronic version, supervised learning consumes a large amount of manpower and resources, and the industrial cost is greatly increased.

Most electronic signature comparison technologies are handwriting around electronic signatures, an image processing mode is adopted, but the same signer can sign different signatures at different time, a counterfeiter with worry can also forge highly similar signatures, along with the development of the technology, the electronic signature board can capture more features, and a large amount of information is omitted through an authentication mode of image signature comparison. In the conventional method, such as deephsv, idn and the like, image data is used as input data, handwriting comparison is used as a two-classification processing problem based on images, and a large number of positive and negative sample learning models are needed. However, it is very difficult to collect high quality negative sample data, resulting in a low accuracy of signature identification.

In addition, the conventional handwritten electronic signature comparison method classifies input sample images and statement images, and the recognition method is not suitable for large-scale handwriting retrieval tasks and other applications due to reasons such as calculated amount, so that the conventional handwriting comparison technology based on images mainly aims at modeling track features and does not fully utilize collected rich multi-dimensional writing feature information. The neural network model with high complexity has stronger identification capability, but the neural network model with high complexity easily causes the problem of overfitting on small data, and the lack of negative samples causes the signature identification accuracy to be difficult to improve.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides an electronic signature comparison method based on a pre-training technology, which can solve the problem of overfitting of a small amount of sample data on a large model, and uses a method for enhancing noise data for the purpose of data sufficiency to solve the problem of lack of negative samples.

The invention provides an electronic signature handwriting recognition system based on a pre-training technology, which comprises a data acquisition module, a data preprocessing module, a signature handwriting pre-training module and a signature handwriting characteristic recognition module, wherein the data acquisition module acquires electronic signature stroke related characteristic data to acquire signature sample data; the data preprocessing module carries out resampling, normalization and discretization on the collected positive sample data to generate negative sample data N for electronic signature handwriting pre-training, a neural network is trained in an autoregressive pre-training mode of the positive sample data P based on a mask, and a neural network model after the pre-training is finished serves as an encoder E; the signature handwriting pre-training module trains an electronic signature handwriting pre-training model by combining a classification loss function with positive sample data and a noise regression loss function with negative sample data respectively to obtain a sample encoder 1 and a sample encoder 2 which have the same structure; and updating the weight parameters of the two encoders until a convergence weight is obtained, and constructing a twin network model of the sample encoder which comprises two same structures and shares the weight as a signature handwriting characteristic identification module.

Further, from the sample encoder 1 output vector y and the sample encoder 2 output vector y ', using the cosine distance Loss _ match ═ (cos (y, y'))²And constructing a classification loss function: loss _ positive ═(1-cos(y,y'))²And noise regression function: loss _ noise ═ 1-cos (y', y) - μ)². Updating the weight parameters of the two weight-shared encoders further comprises calling a formula using a classification loss function

Calculating the partial derivative of the current weight coefficient W of the encoder 1, and calling a formula W through a preset learning rate lr_new1＝w-lr×W₁' update encoder 1 weight parameters until convergence. Calling a formula using a noise regression function

Calculating the partial derivative of the current weight coefficient W of the encoder 2, and calling the formula W according to the preset learning rate lr_new2＝w-lr×W₂' update encoder 2 weight parameters until convergence; and obtaining the weight parameters of the two sample encoders, and obtaining the sharing weight of the sharing weight sample encoder in the signature handwriting characteristic identification module by a gradient average or addition synchronization method for the convergence weight parameters of the two encoders.

The training of the neural network in the autoregressive pre-training mode further comprises: and (3) masking partial data sequence by using random mask to the original characteristic sequence of the acquired signature handwriting, inputting the partial data sequence into a neural network model for pre-training, outputting the pre-training characteristic sequence of the signature handwriting by the neural network model, enabling the model to output the pre-training characteristic sequence to approach the original characteristic sequence, and completing the pre-training.

The invention also provides an electronic signature handwriting recognition method based on the pre-training technology, which comprises the steps that a data acquisition module acquires the relevant characteristic data of the electronic signature handwriting, and signature sample data and signature to-be-detected data are acquired; the data preprocessing module is used for resampling, normalizing and discretizing the signature sample data and the signature to-be-detected data to obtain signature sample data and signature to-be-detected data with fixed lengths; respectively training an electronic signature handwriting pre-training model by using a classification loss function in combination with positive sample data and using a noise regression function in combination with negative sample data, and constructing a signature identification module comprising a sample encoder 1 and a sample encoder 2 which have the same structure and share a weight; the method comprises the steps that signature sample data and to-be-detected signature data with fixed lengths are input into a signature identification module, a sample encoder 1 and a sample encoder 2 respectively determine the common characteristic and the difference characteristic of the signature sample data and the to-be-detected signature data by utilizing a classification loss function and a noise regression function, and the signature sample data and the to-be-detected signature data are determined to be from the same signer or different signers.

Further, from the sample encoder 1 output vector y and the sample encoder 2 output vector y ', using the cosine distance Loss _ match ═ (cos (y, y'))²Determine Loss function Loss _ positive (1-cos (y, y'))²And acquiring the common characteristic of the signature sample characteristic and the signature to-be-detected characteristic.

Further, based on the output vector y of the sample encoder 1 and the output vector y 'of the sample encoder 2, using the formula Loss _ noise ═ 1-cos (y', y) -C)²And determining a noise regression function, and acquiring the difference characteristic between the signature sample characteristic and the signature to-be-detected characteristic.

And obtaining a weight parameter by performing bias derivation on the weight parameters W of the sample encoder 1 and the sample encoder 2 through a classification loss function and a noise regression function, and obtaining a shared weight for the obtained weight parameters of the two sample encoders through a gradient average or addition synchronization method.

Reducing the loss values of the two loss functions by adopting a gradient descent method, obtaining a parameter W 'by performing partial derivation on a weight parameter W of an encoder of the signature identification module, and obtaining a parameter W' according to a formula W_newThe weight parameters of the encoder are updated W-lrW'.

The invention also claims a computer-readable storage medium on which a computer program is stored which can be loaded and executed by a processor to perform the above-described method.

The single handwriting vector is obtained through the representation learning, and the vector is easy to use and more convenient in the electronic signature comparison stage. Abundant multi-dimensional writing characteristic information collected is fully utilized, a small amount of positive and negative sample data can be utilized, the problem that small data easily causes overfitting is solved, and signature identification accuracy is improved.

Drawings

FIG. 1 is a schematic diagram of an electronic signature comparison system;

FIG. 2 is a schematic diagram of a pre-trained electronic signature recognition model;

FIG. 3 is a graph of a handwriting pre-training model trained using a classification loss function in conjunction with positive sample data;

FIG. 4 trains a handwriting pre-training model using a noise regression loss function in conjunction with negative sample data.

Detailed Description

Fig. 1 is a schematic diagram of a signature recognition system according to the present invention, which includes a data acquisition module, a data preprocessing module, a data pre-training module, and a signature feature recognition module. The data acquisition module acquires the original handwriting sequence characteristic data of the electronic signature as positive sample data, wherein the sequence characteristic data comprise handwriting characteristic information such as an abscissa X, an ordinate Y, pressure P, signature process duration T and the like of the original handwriting of the signature; the preprocessing module resamples the signature original handwriting characteristic data, the data length is fixed to the same length, normalization and discretization are carried out, and a characteristic data sequence with a fixed length is obtained; the data pre-training module comprises a model pre-training mode in an autoregressive mode and a model pre-training mode in a noise regression learning mode based on a large amount of sample data, and then the low-sample transfer learning is achieved; and the signature characteristic identification module compares and identifies the original handwriting characteristics of the random signature with the original handwriting characteristics of the original signature, and determines the real identity of the signer according to whether the two original handwriting characteristics are consistent.

Fig. 2 is a schematic diagram of a pre-trained electronic signature recognition model.

Firstly, a signature handwriting pre-training module trains an electronic signature handwriting pre-training model by respectively using a classification loss function and a noise regression loss function in combination with positive sample data to obtain a sample encoder 1 and a sample encoder 2 with the same structure, and a twin network model of the sample encoder which comprises the same structure and shares weight is constructed as a signature handwriting feature recognition module. The two encoders can be regarded as a copy of the same model due to the weight sharing. Gradient finding due to two encoder inputs in backward propagationDifferent gradients are obtained, and the gradients of the two encoders are considered together when updating the network, and the common gradient is obtained by averaging or adding, and then the model is updated. Receiving data of a data acquisition module and generated noise enhancement data, converting the corresponding data into vectors, sending the respective converted vectors into an encoder 1 and an encoder 2, outputting twin network learning handwriting characterization formed by the encoder 1 and the encoder 2 through an output layer, and bringing the output of the output layer into a noise regression function Loss _ noise ═ 1-cos (y', y) -mu)²In the medium-pre-training signature feature recognition model, differences of data vectors are output through pre-training encoders 1 and 2, but commonalities of signatures of the same signer cannot be learned, so that a classification Loss function Loss _ positive (1-cos (y, y')) with zero noise is adopted²And pre-training a signature feature recognition model, wherein the encoder 1 and the encoder 2 output loss values between data feature vectors, and the loss values represent the difference between signature sample data and signature to-be-detected data. The noise regression function and the classification loss function are reduced by a gradient descent method so that the loss values of the two are close to 0. The specific gradient descent method is that the classification loss function and the noise regression function can be used for solving the partial derivative W 'of the current weight parameter of the twin network, and the weight parameter of the encoder is updated according to the formula Wnew-W-lrW' by setting a certain learning rate lr until the weight parameter is converged.

The data acquisition module acquires various handwritten signature characteristic information and data, and can adopt the following data acquisition modes: c + + sign board drive; JS H5 mobile terminal, Android APP mobile terminal. The data acquisition module can acquire sequence data of a plurality of characteristics such as a signature abscissa X, a signature ordinate Y, a signature pressure P, a signature time T, an angular speed and the like.

In the data preprocessing stage, the raw sequence data is subjected to normalization (such as batch normalization and layer normalization), resampling, discretization, data enhancement, data generation and other operations to obtain preprocessed data. The length of the generated sequence data is different due to different writing time of writers, the input length adopts a fixed value, original data is resampled, original signature data is fixed to a fixed length, when the data length of the sampling signature data sequence is smaller than the input length, upsampling is carried out in a zero padding mode, and when the data length is larger than the input length, the data length of the sampling data sequence is downsampled to the fixed length of the input length. Obtaining positive sample data with fixed length, adding Gaussian random noise to the obtained fixed length data sequence as negative sample data

And training a neural network by adopting an autoregressive pretraining mode based on a mask, wherein a trained neural network model is used as a coding module, and the coding module comprises two signature recognition encoders with the same structure. The neural network model can be based on cnn, lstm, transformer models or their superposition. And (3) masking partial data sequence by using random mask to the original characteristic sequence of the acquired signature handwriting, inputting the partial data sequence into a neural network model for pre-training, and outputting the pre-training characteristic sequence of the signature handwriting by the neural network model so as to finish pre-training when the pre-training characteristic sequence output by the model approaches to the original characteristic sequence.

The training of the neural network in the autoregressive pretraining manner specifically includes using the autoregressive manner to complete pretraining, so that the target of the model output data sequence is an input data sequence, and the data processed by the preprocessing module exists in a sequence form, for example, X ═ X (X ═ X)₁、x₂...x_n) For the abscissa of the written trace, Y ═ Y₁,y₂...y_n) As the ordinate of the signature script, P ═ P₁,p₂,..p_n) To sign the pressure of the handwriting, T ═ T₁,t₂,...t_n) The writing time is. Masking partial data by random masking, masking partial sequences in the data sequence characteristics after masking processing, wherein the corresponding characteristic data are as follows: x '((X1, X2.. mask _ token.. xn)), Y' ((Y1, y2... mask _ token.. yn), P '((P1, p2... mask _ token.. pn), T' ((T1, T2.. mask _ token.. tn)). Partial features in the sequence are covered by a mask _ token, and various feature data sequences with partial feature values covered by masks are input into a pre-training model. That is, the output sequence is a signature sequence X ═ X (X1, X2,. X.. xn), and a signature sequence y (y ═ X.. xn)₁,y₂...y...y_n) Characteristic sequence (p)₁,p₂.._p...p_n)，(t₁,t₂,...t_n) And finishing pre-training.

The pre-trained model is used as an encoder of an electronic signature handwriting pre-training model to construct a signature feature recognition model, and the signature feature recognition model comprises two pre-training encoders (an encoder 1 and an encoder 2) with the same structure. And updating the weight parameters of the pre-training signature recognition encoder by adopting a Loss function Loss _ noise of noise regression and a classification Loss function Loss _ positive in a gradient descending manner, and taking the convergence weight obtained after updating as the shared weight of the encoder 1 and the encoder 2.

And adding Gaussian random noise C (C is a noise vector with one dimension the same as the length of the sample sequence) into a data sequence B obtained after the original sample is preprocessed to obtain a comparison data sequence B '═ B + C, and inputting B and B' into two encoders through an input layer 1 and an input layer 2 respectively.

The signature characteristic identification module is established based on the encoder, the signature characteristic identification module comprises an input layer, an encoder 1, an encoder 2 and an output layer, the input layer 1 and the input layer 2 respectively acquire signature handwriting sample data B acquired by the data acquisition module, and comparison data B 'of random noise is added to the sample data, the data sequence is converted into vectors and is sent to the encoder 1 and the encoder 2, the data are encoded through the encoder 1 and the encoder 2, the signature sample data B enters the encoder 1 through the input layer 1, the sample data B is encoded to output a characteristic vector y, the random noise is added to the signature sample data to generate comparison signature data B', the comparison signature data B 'enters the encoder 2 through the input layer 2, and the corresponding characteristic vector y' is encoded and output. Pre-training an encoder through a noise enhancement learning and noise regression learning mode to obtain difference characteristic values and loss characteristic values of the signature sample data and the signature comparison data, and determining the difference and the commonality of the signature sample data and the signature comparison data according to the difference characteristic values and the loss characteristic values, so as to distinguish whether the signature comparison handwriting is the handwriting of a signer.

FIG. 3 illustrates training a handwriting pre-training model using a classification loss function in conjunction with positive sample data.

And the signature handwriting pre-training module trains the signature recognition encoder by combining the classification loss function with the positive sample data to obtain a sample encoder 1.

Fig. 4 shows that the noise regression function is used to train the signature recognition encoder with the negative sample data to obtain the sample encoder 2.

An encoder of a signature handwriting recognition module is trained by adopting positive samples, an input layer 1 of the signature handwriting recognition module receives input of a positive sample set B ═ B1, B2.. Bn, n is the number of the positive sample sets, an input layer 2 receives input of a negative sample set B '═ B' 1, B '2., B'm }, m is the number of the negative sample sets, namely, B and B 'are from two signatures of different signers or different signatures of the same signer, the input layer converts input feature sequences into feature vectors V1 and V2, then vectors converted by the input layer 1 and the input layer 2 are sent into the encoder 1 and the encoder 2, the vectors V1 and V2 are output after encoding processing, output layer output vectors y and y', y are output of the encoder 1, y 'is output of the encoder 2, and the output layer obtains output vectors y and y'. And respectively solving partial derivatives of the updated weight parameters of the encoder 1 and the encoder 2 by adopting a noise regression function and a loss function, reducing the difference characteristic value and the loss characteristic value by using a gradient descent method until a convergence condition is reached, updating the weight parameters of two twin networks sharing the weight in the signature characteristic identification module, and obtaining the weight parameters of the two encoders in the signature characteristic identification module.

Establishing different training tasks according to different values of the noise C, and when C is 0, according to a formula: loss _ positive (1-cos (y, y'))²Calculating a classification loss function, and determining common characteristics of signature comparison data and signature sample data; when C is not zero, the formula is called: loss _ noise ═ 1-cos (y', y) - μ)²A noise regression function is calculated and the difference characteristic of the signature comparison data and the signature sample data is determined, where μ represents a value of the noise C. Respectively updating parameters of the encoder 1 and the encoder 2 by using a classification loss function and a noise regression function, and calling formulas by using the classification loss function

The current weight coefficient w of the encoder 1 is subjected to partial derivation, and a formula is called by using a noise regression function

Obtaining the partial derivatives of the current weight coefficient W of the encoder 2 to obtain the weight parameters of the two sample encoders, and calling the formula W according to the preset learning rate lr_newW-lrW' until convergence. (value W which can be derived from the partial derivation as described above₁' and W₂' separately substituting the formula to obtain updated weight parameters. Such as calling formula W_new1＝w-lr×W₁' update encoder 1 weight parameters, call formula W_new2＝w-lr×W₂Update encoder 2 weight parameters) update the weight parameters until a convergence weight is obtained. The condition for the loss to reach the predetermined value may be preset as a convergence condition, depending on the specific task. And obtaining shared weight by the two obtained convergence weight parameters through a gradient average or addition synchronization method, wherein the shared weight is used as the weight parameter of the encoders of the two twin network structures in the signature feature recognition model.

Calling formulas using classification loss functions

Calculating a partial derivative of the current weight coefficient W of the encoder 2, and obtaining convergence through a preset learning rate lr; and obtaining the weight parameters of the two sample encoders, and obtaining the sharing weight of the sharing weight sample encoder in the signature handwriting characteristic identification module by a gradient average or addition synchronization method for the convergence weight parameters of the two encoders.

For the encoder receiving the input of the negative sample set, adopting the training and updating of the encoder weight parameters based on the noise regression Loss function to obtain the difference characteristic values of the signature sample characteristic and the signature comparison characteristic, and obtaining the difference characteristic value according to the noise regression function Loss _ noise ═ 1-cos(y',y)-μ)²And determining the difference characteristic Loss _ noise to obtain the difference between the signature sample characteristic and the signature comparison characteristic. Calling a formula using a noise regression function

Calculating the biased reciprocal of the current weight coefficient W of the encoder, and calling the formula W through the preset learning rate lr_newThe weight parameter is updated to obtain the convergence weight W-lrW'.

Although the difference of data can be learned through noise regression learning, the commonalities of signatures of the same signer cannot be learned, the noise C is set to be zero, and a classification Loss function Loss _ positive ═ 1-cos (y, y')/is adopted for an encoder receiving a positive sample set)²And updating the encoder, obtaining Loss characteristics Loss _ positive of the sample characteristics and the comparison characteristics, and determining the common characteristics of the signature sample data and the signature detection data. Calling formulas using classification loss functions

Calculating the inverse bias of the parameter W, and calling the formula W through a preset learning rate lr_newThe weight parameter is updated to obtain the convergence weight W-lrW'.

Updating the weight parameters of the signature comparison model according to a preset learning rate to obtain a convergence weight, reducing the loss value by adopting a shared gradient (average or addition) reduction method on the weight coefficients obtained by the noise regression loss function and the classification loss function to enable the loss values of the two to be close to 0, updating the weight parameters of the encoders 1 and 2, and obtaining the loss value when the convergence weight is the final difference characteristic value and the final loss characteristic value. Thereby constructing a twin network model containing two same structures and sharing a weight sample encoder as a signature feature identification module.

The method comprises the steps of collecting electronic signature stroke related characteristic data, and acquiring signature sample data and signature to-be-detected data; the method comprises the steps of resampling, normalizing and discretizing signature sample data and data to be detected of a signature to obtain data with fixed length, inputting a signature data sequence to be detected and a sample signature data sequence with fixed length into a sample encoder 1 and a sample encoder 2 from an input layer 1 and an input layer 2 respectively, determining common characteristics and difference characteristics of the signature sample data and the data to be detected of the signature by the two sample encoders respectively through a cosine distance loss function and a noise regression function, and determining whether the signature sample data and the data to be detected of the signature come from signature data of the same signer or signature data of different signers. The invention is widely used in places such as e-commerce, e-government affairs and the like needing signature identification.

The invention also claims a computer-readable storage medium having stored thereon a computer program which can be loaded and executed by a processor to perform the method as described herein above.

Claims

1. An electronic signature handwriting recognition system based on a pre-training technology is characterized by comprising a data acquisition module, a data preprocessing module, a signature handwriting pre-training module and a signature handwriting characteristic recognition module, wherein the data acquisition module acquires electronic signature stroke related characteristic data to acquire signature positive sample data; the data preprocessing module carries out resampling, normalization and discretization on the collected positive sample data to generate negative sample data for electronic signature handwriting pre-training, a neural network is trained in an autoregressive pre-training mode of the positive sample data based on a mask, and a neural network model after the pre-training is completed serves as an encoder; the signature handwriting pre-training module trains an electronic signature handwriting pre-training model by combining a classification loss function with positive sample data and a noise regression function with negative sample data respectively to obtain a sample encoder 1 and a sample encoder 2 which have the same structure; and updating the weight parameters of the two sample encoders until a convergence weight is obtained, and constructing a twin network model of the two sample encoders with the same structure and sharing the weight as a signature handwriting feature recognition module.

2. The system of claim 1, wherein the cosine distance Loss _ compare (cos (y, y')) is used based on the sample encoder 1 output vector y and the sample encoder 2 output vector y²And constructing a classification loss function: loss _ positive (1-cos (y, y'))²And noise regression function:

Loss_noise＝(1-cos(y',y)-μ)²。

3. the system of claim 1, wherein the formula is invoked using a classification loss function

Calculating the partial derivative of the current weight coefficient W of the sample encoder 1, and calling the formula W through the preset learning rate lr_new1＝w-lr×W₁' update encoder 1 weight parameters until convergence.

4. The system of claim 1, wherein the formula is invoked using a noise regression function

Calculating the partial derivative of the current weight coefficient W of the sample encoder 2, and calling the formula W through a preset learning rate lr_new2＝w-lr×W₂' update encoder 2 weight parameters until convergence.

5. The system according to claim 3 or 4, wherein the shared weight of the shared weight sample encoder in the signature handwriting characteristic identification module is obtained by gradient average or addition synchronization method for the convergence weight parameters of the two sample encoders.

6. The system of one of claims 1 or 5, the training of the neural network in an autoregressive pre-training mode further comprising: and (3) masking partial data sequence by using random mask to the original characteristic sequence of the acquired signature handwriting, inputting the partial data sequence into a neural network model for pre-training, outputting the pre-training characteristic sequence of the signature handwriting by the neural network model, enabling the pre-training characteristic sequence output by the model to approach the original characteristic sequence, and completing the pre-training.

7. A method for recognizing electronic signature handwriting based on a pre-training technology is characterized in that a data acquisition module acquires relevant characteristic data of the electronic signature handwriting, and acquires signature sample data and signature to-be-detected data; the data preprocessing module is used for resampling, normalizing and discretizing the signature sample data and the signature to-be-detected data to obtain signature sample data and signature to-be-detected data with fixed lengths; respectively training an electronic signature handwriting pre-training model by using a classification loss function in combination with positive sample data and using a noise regression function in combination with negative sample data, and constructing a signature identification module comprising a sample encoder 1 and a sample encoder 2 which have the same structure and share a weight; the method comprises the steps that signature sample data and to-be-detected signature data with fixed lengths are input into a signature identification module, a sample encoder 1 and a sample encoder 2 respectively determine the common characteristic and the difference characteristic of the signature sample data and the to-be-detected signature data by utilizing a classification loss function and a noise regression function, and the signature sample data and the to-be-detected signature data are determined to be from the same signer or different signers.

8. The method of claim 7, wherein, based on the sample encoder 1 output vector y and the sample encoder 2 output vector y', based on the classification loss function: loss _ positive (1-cos (y, y'))²Acquiring common characteristics Loss _ positive of the signature sample characteristics and the signature to-be-detected characteristics, and according to a noise regression function: loss _ noise ═ 1-cos (y', y) - μ)²And acquiring the difference characteristic Loss _ noise of the signature sample characteristic and the signature to-be-detected characteristic.

9. The method of claim 7, wherein the formula is invoked using a classification loss function

Calculating the partial derivative of the current weight coefficient W of the encoder 1, and calling a formula W through a preset learning rate lr_new1＝w-lr×W₁' update encoder 1 weight parameters until convergence.

10. The method of claim 7, wherein the formula is invoked using a noise regression function

Calculating the partial derivative of the current weight coefficient W of the encoder 2, and calling the formula W according to the preset learning rate lr_new2＝w-lr×W₂' update encoder 2 weight parameters until convergence.

11. The method as claimed in claim 9 or 10, wherein the shared weight of the shared weight sample encoder in the signature handwriting feature recognition module is obtained by gradient average or addition synchronization method for the convergence weight parameters of the two encoders.

12. A computer-readable storage medium, having stored thereon a computer program which can be loaded and executed by a processor to perform the method of any one of claims 7 to 11.