CN109063431A - Identity recognition method based on weighted keystroke feature curve difference degree

Info

Publication number: CN109063431A
Application number: CN201810644782.0A
Authority: CN
Inventors: 王林; 贺冰清
Original assignee: Xian University of Technology
Current assignee: Xian University of Technology
Priority date: 2018-06-21
Filing date: 2018-06-21
Publication date: 2018-12-21
Anticipated expiration: 2038-06-21
Also published as: CN109063431B

Abstract

The invention discloses an identity recognition method based on the weighted keystroke feature curve difference degree. The steps are as follows: first, a keystroke interval time data set and a semi-temporal feature data set are extracted; then the mean and standard deviation of each data set are calculated, followed by the upper and lower boundaries of the keystroke interval time feature curve and of the semi-temporal feature curve, and then the weighted feature curve difference degree of the keystroke interval times and the feature curve difference degree of the semi-temporal features; finally, the user's identity is recognized using the weighted feature curve difference degree and the feature curve difference degree. Compared with traditional keystroke authentication algorithms that use only keystroke duration and keystroke time interval, the authentication algorithm based on the feature curve difference degree performs better: it reduces the false rejection rate, the false acceptance rate and the equal error rate, and improves recognition accuracy.

Description

Identity recognition method based on weighted keystroke feature curve difference degree

Technical field

The invention belongs to the technical field of biometric authentication methods and relates to a user identity recognition method that uses the weighted keystroke feature curve difference degree.

Background

In recent years, people have come to use a large number of online web applications, including social media platforms (such as Facebook, Twitter and Weibo), cloud storage services (such as Dropbox and Google Drive) and online games. However, the cybercrime that accompanies these web applications has been spreading around the world. In serious cybercrime, offenders break into victims' accounts over the Internet and steal sensitive information, including passwords and financial assets. To address this theft problem, an additional biometric mechanism can be applied when logging in to an online program or device, improving the security of user accounts. Among current computer security measures, one is traditional password-based identity verification, but passwords are easily leaked. Another replaces the simple password with a physical token (such as a smart card), but this requires the system to be equipped with corresponding hardware, which increases cost, and physical tokens can be lost, stolen or copied. Because human biometric characteristics are hard to replicate and hard to change, biometric recognition has become a research hotspot. Common biometric recognition technologies include fingerprint recognition, face recognition and iris recognition. However, these technologies require relatively costly hardware, which makes them inconvenient to apply and difficult to popularize.

Keystroke dynamics identity authentication is a biometric technique that recognizes identity from keystroke features (such as keystroke latency and keystroke force). It collects keystroke data by monitoring the user's keyboard input and builds a classification model of the user's keystroke behavior in order to distinguish user identities. Keystroke dynamics authentication not only addresses the security problems of traditional password-based authentication but, compared with other biometric technologies, also needs no additional expensive hardware, so it offers advantages such as low cost and high flexibility.

Summary of the invention

The object of the present invention is to provide a user identity recognition method using the weighted keystroke feature curve difference degree. It solves the problem that existing authentication methods recognize identity using only the magnitude of each keystroke feature in the keystroke feature vector and do not use the rate of change between adjacent feature values, which leads to low accuracy.

The technical solution adopted by the invention is an identity recognition method based on the weighted keystroke feature curve difference degree, implemented according to the following steps:

Step 1: collect data and establish the semi-temporal feature data set and the keystroke interval time data set;

Step 2: calculate the mean and standard deviation of the keystroke interval time data set and the mean and standard deviation of the semi-temporal feature data set;

Step 3: calculate the upper and lower boundaries of the keystroke interval time feature curve from the mean and standard deviation of the keystroke interval time data set, and the upper and lower boundaries of the semi-temporal feature curve from the mean and standard deviation of the semi-temporal feature data set;

Step 4: calculate the weighted feature curve difference degree of the keystroke interval times from the upper and lower boundaries of the keystroke interval time feature curve, and the feature curve difference degree of the semi-temporal features from the upper and lower boundaries of the semi-temporal feature curve;

Step 5: recognize the user's identity using the weighted feature curve difference degree and the feature curve difference degree.

The invention is further characterized as follows.

Step 1 is implemented as follows:

1.1 From the raw keystroke information of free text, select k representative specific digraphs (two-key character sequences) to form the specific character sequence set SK;

1.2 Calculate the frequency of use λ_j (j = 1, 2, ..., k) of each digraph and construct the user's keystroke interval time data set S_pp and semi-temporal feature data set S_st, which are expressed as follows:

S_st = { V_i^st = [WPM_i, P_{i,N_UD}, P_{i,error}, P_{i,CapsLock}, P_{i,Shift}] | i = 1, 2, ..., n }   (2)

where k is the number of selected specific digraphs; V_i^pp ∈ R^k is the i-th keystroke interval time vector sample, whose j-th element (j = 1, ..., k) is the keystroke interval time of the j-th specific digraph in the i-th sample, the last element being that of the k-th digraph; m is the number of collected keystroke interval time vector samples; V_i^st ∈ R^5 is the i-th semi-temporal feature vector sample; WPM_i, P_{i,N_UD}, P_{i,error}, P_{i,CapsLock} and P_{i,Shift} are, respectively, the average keystroke speed, the frequency of occurrence of negative interval times (RP), the input error rate, the Caps Lock key usage frequency and the Shift key usage frequency of the i-th sample. P_{N_UD}, P_{error}, P_{Shift} and P_{CapsLock} vary in [0, 1], whereas the average keystroke speed WPM varies in [0, +∞). WPM is normally of the order of 10², which differs markedly from the magnitude of the other semi-temporal features. n is the number of collected semi-temporal feature vector samples;
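As an illustration of step 1.2, the following minimal sketch counts digraph occurrences in a stream of typed keys and derives a frequency of use for each digraph in SK. The function name `digraph_frequencies`, the sample key stream, and the choice to normalize the counts over the selected digraphs only are assumptions made for illustration; the source does not state how λ_j is normalized.

```python
from collections import Counter


def digraph_frequencies(keys, selected):
    """Return a relative frequency of use for each selected digraph.

    Assumption: frequencies are normalized over the selected digraphs only,
    so they sum to 1. The source does not state the normalization.
    """
    # Count every pair of consecutively typed keys.
    counts = Counter(a + b for a, b in zip(keys, keys[1:]))
    total = sum(counts[d] for d in selected)
    if total == 0:
        return {d: 0.0 for d in selected}
    return {d: counts[d] / total for d in selected}


# Hypothetical key stream and digraph set, for illustration only.
typed = list("zhongwenshuru")
SK = ["zh", "ng", "en", "sh"]
lam = digraph_frequencies(typed, SK)
```

Normalizing over SK keeps the weights comparable between users who type different amounts of text, but any scheme that preserves the ordering of the frequencies would serve the same purpose.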

1.3 Normalize the average keystroke speed WPM in the semi-temporal feature data set S_st. In the normalization formula (3), max{ WPM_i | i = 1, ..., n } is the maximum average keystroke speed among the samples, denoted WPM_max. After normalization, the semi-temporal feature data set S_st is abbreviated as

S_st = { V_i^st = [v_{i,1}, v_{i,2}, v_{i,3}, v_{i,4}, v_{i,5}] | i = 1, 2, ..., n }   (4)

where v_{i,1} is the normalized average keystroke speed of the i-th sample, v_{i,2} = P_{i,N_UD}, v_{i,3} = P_{i,error}, v_{i,4} = P_{i,CapsLock} and v_{i,5} = P_{i,Shift}.
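The normalization in step 1.3 can be sketched as below. The text only states that the maximum average keystroke speed WPM_max is used, so dividing each WPM_i by WPM_max is an assumption, as are the function name `normalize_wpm` and the sample values.

```python
def normalize_wpm(samples):
    """Scale the first component (WPM) of each 5-element sample into [0, 1].

    Assumption: the normalized value is WPM_i / WPM_max; the source only
    says that the maximum average keystroke speed is used.
    """
    wpm_max = max(s[0] for s in samples)
    if wpm_max <= 0:
        raise ValueError("WPM_max must be positive")
    # Leave the four frequency-type features untouched.
    return [[s[0] / wpm_max, *s[1:]] for s in samples]


# Hypothetical samples: [WPM, P_N_UD, P_error, P_CapsLock, P_Shift]
raw = [
    [120.0, 0.10, 0.05, 0.00, 0.20],
    [60.0, 0.20, 0.10, 0.01, 0.15],
]
normalized = normalize_wpm(raw)
```

After this step all five components lie in [0, 1], so no single feature dominates the later boundary and difference-degree calculations.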

In step 2, the mean and standard deviation of the keystroke interval time data set and of the semi-temporal feature data set are calculated as follows:

The mean of all elements in data set S_pp and the mean of all elements in data set S_st are given by formulas (5) and (6), respectively.

The standard deviation of all elements in data set S_pp and the standard deviation of the elements in data set S_st are given by formulas (7) and (8), respectively.
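A minimal sketch of step 2, assuming the mean and standard deviation are taken component by component across the samples of a data set, and assuming the population standard deviation (the text does not say whether the population or sample form is meant). The name `column_stats` and the numbers are illustrative only.

```python
from statistics import fmean, pstdev


def column_stats(samples):
    """Return per-component means and standard deviations of a data set.

    Assumption: population standard deviation (divide by the sample count).
    """
    columns = list(zip(*samples))  # one tuple per feature position
    means = [fmean(c) for c in columns]
    stds = [pstdev(c) for c in columns]
    return means, stds


# Hypothetical keystroke interval times (ms) for k = 2 digraphs, m = 3 samples.
intervals = [[100.0, 200.0], [120.0, 220.0], [140.0, 240.0]]
means, stds = column_stats(intervals)
```

The same function applies unchanged to the five-component semi-temporal samples.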

In step 3, the upper and lower boundaries of the keystroke interval time feature curve and of the semi-temporal feature curve are calculated as follows:

Let upper-boundary and lower-boundary vectors be defined for the elements of data set S_pp and, likewise, for the elements of data set S_st. The upper and lower boundaries of the keystroke interval time feature curve are calculated by formula (9), and the upper boundary v_{u,l} and lower boundary v_{d,l} of the semi-temporal feature curve by formula (10),

where the two thresholds, one for each data set, are adjustable.
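The boundary calculation of step 3 can be sketched as follows. The form "mean plus or minus threshold times standard deviation" is an assumption inferred from the description of the thresholds (range 0 to 3, justified by a normality assumption); the function name `feature_boundaries` and the inputs are hypothetical.

```python
def feature_boundaries(means, stds, threshold=2.0):
    """Return (lower, upper) boundary vectors of a feature curve.

    Assumption: boundary = mean -/+ threshold * std for each component.
    """
    if not 0 <= threshold <= 3:
        raise ValueError("threshold is expected to lie in the range 0 to 3")
    lower = [m - threshold * s for m, s in zip(means, stds)]
    upper = [m + threshold * s for m, s in zip(means, stds)]
    return lower, upper


# Hypothetical per-digraph means and standard deviations (ms).
lower, upper = feature_boundaries([120.0, 220.0], [10.0, 15.0], threshold=2.0)
```

A separate threshold can be passed for S_pp and S_st by calling the function once per data set.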

In step 4, the weighted feature curve difference degree of the keystroke interval times and the feature curve difference degree of the semi-temporal features are calculated as follows:

For any keystroke interval time vector sample, its weighted feature curve difference degree with respect to data set S_pp is calculated by formula (11),

where λ_j is the frequency of use of each specific digraph, j = 1, 2, ..., k;

For any semi-temporal feature vector sample V_s^st = [v_{s,1}, v_{s,2}, ..., v_{s,5}], its feature curve difference degree with respect to data set S_st is calculated by formula (12).

The weighted feature curve difference degree of each element of the keystroke interval time data set S_pp is calculated from the frequency of use of each digraph in set SK and formula (11), and these values form the keystroke interval time feature curve difference degree set Q_pp. The feature curve difference degree of each element of the semi-temporal feature data set S_st is calculated by formula (12), and these values form the semi-temporal feature curve difference degree set Q_st. The elements of Q_pp are the weighted feature curve difference degrees of the elements V_i^pp ∈ R^k of data set S_pp, and the elements of Q_st are the feature curve difference degrees of the elements V_i^st ∈ R^5 of data set S_st.
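The following is a minimal sketch of a weighted difference degree, not the exact formulas (11) and (12). It assumes the difference degree is the weighted sum of the distances by which each component exceeds the upper or lower boundary, with the digraph frequencies of use as weights and unit weights for the unweighted (semi-temporal) case. The names `exceedance` and `weighted_difference` and all numbers are hypothetical.

```python
def exceedance(value, lower, upper):
    """Distance by which a value lies outside [lower, upper]; 0 if inside."""
    if value > upper:
        return value - upper
    if value < lower:
        return lower - value
    return 0.0


def weighted_difference(sample, lower, upper, weights=None):
    """Weighted sum of boundary exceedances over all components.

    Assumption: with weights=None every component has weight 1, which
    stands in for the unweighted difference degree of S_st.
    """
    if weights is None:
        weights = [1.0] * len(sample)
    return sum(
        w * exceedance(v, lo, up)
        for v, lo, up, w in zip(sample, lower, upper, weights)
    )


# Hypothetical sample with k = 3 digraphs; the last two components
# lie outside the boundaries by 30 and 20 respectively.
q_pp = weighted_difference(
    [100.0, 250.0, 80.0],
    lower=[90.0, 180.0, 100.0],
    upper=[130.0, 220.0, 140.0],
    weights=[0.5, 0.3, 0.2],
)
```

An internal sample has zero exceedance in every component, so its difference degree is zero under this sketch, which matches the definition given for internal samples.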

In step 5, the user's identity is recognized using the weighted feature curve difference degree and the feature curve difference degree as follows:

The test sample is judged according to inequalities (15) and (16), in which the two thresholds are adjustable.

If inequalities (15) and (16) both hold, the test sample is deemed to belong to the user; otherwise, the test sample is deemed not to belong to the user.

The two thresholds in step 4 take values in the range 0 to 3.

The two thresholds in step 5 take values not less than 0.
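The decision rule of step 5 can be sketched as below. The text only says that two inequalities with adjustable, non-negative thresholds must hold simultaneously; the form "difference degree less than or equal to threshold" and the names `belongs_to_user`, `theta_pp` and `theta_st` are assumptions.

```python
def belongs_to_user(q_pp, q_st, theta_pp, theta_st):
    """Accept the test sample only if both difference degrees are small.

    Assumption: each inequality has the form q <= theta.
    """
    if theta_pp < 0 or theta_st < 0:
        raise ValueError("thresholds must be not less than 0")
    return q_pp <= theta_pp and q_st <= theta_st


# Hypothetical difference degrees and thresholds.
accepted = belongs_to_user(q_pp=13.0, q_st=0.0, theta_pp=15.0, theta_st=0.1)
rejected = belongs_to_user(q_pp=13.0, q_st=0.5, theta_pp=15.0, theta_st=0.1)
```

Requiring both conditions means a sample that matches the user's interval times but not the semi-temporal features (or the reverse) is still rejected.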

The beneficial effect of the invention is as follows: compared with traditional keystroke authentication algorithms that use only keystroke duration and keystroke time interval, the identity recognition method using the weighted keystroke feature curve difference degree performs better as a user authentication algorithm. It reduces the false rejection rate (FRR), the false acceptance rate (FAR) and the equal error rate (EER), and improves recognition accuracy.

Brief description of the drawings

Fig. 1 is the keystroke duration feature curve of the identity recognition method of the invention using the weighted keystroke feature curve difference degree;

Fig. 2 is the semi-temporal feature curve of the identity recognition method of the invention;

Fig. 3 shows the upper and lower boundary curves of the keystroke features of data set S_pp;

Fig. 4 shows the upper and lower boundary curves of the keystroke features of data set S_st;

Fig. 5 is the curve of the performance index EER of the free-text keystroke feature authentication algorithm versus TP;

Fig. 6 is a schematic diagram of the region division of a keystroke data set;

Fig. 7 is a schematic diagram of an internal sample;

Fig. 8 is a schematic diagram of an external sample;

Fig. 9 is a schematic diagram of the weighted feature curve difference degree.

Detailed description

The invention is described in detail below with reference to the drawings and specific embodiments.

The identity recognition method of the invention based on the weighted keystroke feature curve difference degree is implemented according to the following steps:

Step 1: collect data and establish the semi-temporal feature data set and the keystroke interval time data set. The specific implementation is as follows:

S_st = { V_i^st = [v_{i,1}, v_{i,2}, v_{i,3}, v_{i,4}, v_{i,5}] | i = 1, 2, ..., n }   (4)

Step 2: calculate the mean and standard deviation of the keystroke interval time data set and the mean and standard deviation of the semi-temporal feature data set, as follows:

The curve represented by any element V_i^pp of data set S_pp has abscissa j and, as ordinate, the keystroke interval time of the j-th digraph, where j = 1, ..., k. Similarly, the curve represented by any element V_i^st of data set S_st has abscissa l and ordinates v_{i,l}, where l = 1, ..., 5. For ease of description, the curve of any element V_i^pp of S_pp is called the keystroke interval time feature curve and the curve of any element V_i^st of S_st is called the semi-temporal feature curve; both may also be referred to collectively as keystroke feature curves.

The standard deviation of all elements in data set S_pp and the standard deviation of the elements in data set S_st are given by formulas (7) and (8), respectively.

Step 3: calculate the upper and lower boundaries of the keystroke interval time feature curve from the mean and standard deviation of the keystroke interval time data set, and the upper and lower boundaries of the semi-temporal feature curve from the mean and standard deviation of the semi-temporal feature data set, as follows:

In formulas (9) and (10), the two thresholds are adjustable and take values in the range 0 to 3.

This value range is determined according to the central limit theorem (that is, assuming that the collected keystroke timing features follow a normal distribution). The larger the thresholds, the wider the range between the upper and lower boundaries and the higher the probability that a sample lies within the boundaries, so the FRR decreases and the FAR increases. The smaller the thresholds, the narrower the range and the lower that probability, so the FRR increases and the FAR decreases. The thresholds should be chosen so that the EER is as small as possible; within the range 0 to 3, a value of 2 can generally be chosen.
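To make the effect of the threshold concrete under the stated normality assumption, the sketch below computes the probability that a normally distributed feature falls within the mean plus or minus c standard deviations. The function name `coverage` is illustrative, and treating the boundaries as mean plus or minus c times the standard deviation is an assumption.

```python
from statistics import NormalDist


def coverage(c):
    """Probability that a normal variable lies within c standard deviations."""
    standard = NormalDist()  # mean 0, standard deviation 1
    return standard.cdf(c) - standard.cdf(-c)


# Thresholds at the integer points of the 0 to 3 range.
for c in (1, 2, 3):
    print(f"c = {c}: {coverage(c):.4f}")
```

Under this assumption a threshold of 2 keeps roughly 95% of a genuine user's values for a single component inside the boundaries, while a threshold of 3 keeps almost all of them at the cost of admitting more impostor samples.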

Step 4: calculate the weighted feature curve difference degree of the keystroke interval times from the upper and lower boundaries of the keystroke interval time feature curve, and the feature curve difference degree of the semi-temporal features from the upper and lower boundaries of the semi-temporal feature curve, as follows:

The upper and lower boundary curves of data sets S_pp and S_st divide the whole two-dimensional plane into an interior region and an exterior region, as shown in Fig. 6. For any keystroke interval time vector sample, if every element lies between the corresponding lower and upper boundaries, the sample lies entirely in the interior region of S_pp and is called an internal sample of data set S_pp, as shown in Fig. 7; otherwise, it is called an external sample of data set S_pp, as shown in Fig. 8. Similarly, for any semi-temporal feature vector sample, if v_{d,l} ≤ v_{s,l} ≤ v_{u,l} holds for every l, it is called an internal sample of data set S_st; otherwise, it is called an external sample of data set S_st.
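The internal/external classification described above amounts to a component-wise range check, which can be sketched as follows. The function name `is_internal` and the boundary values are hypothetical.

```python
def is_internal(sample, lower, upper):
    """True if every component lies between its lower and upper boundary."""
    return all(lo <= v <= up for v, lo, up in zip(sample, lower, upper))


# Hypothetical boundaries for a data set with three components.
lower = [90.0, 180.0, 100.0]
upper = [130.0, 220.0, 140.0]

inside = is_internal([100.0, 200.0, 120.0], lower, upper)
outside = is_internal([100.0, 250.0, 120.0], lower, upper)
```

A single component outside its boundaries is enough to make the whole sample an external sample.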

It follows from these definitions that when a sample is an external sample of a data set, its feature curve necessarily forms, in the exterior region, several closed regions with the upper or lower boundary curve of the data set, as shown by the shaded regions in Fig. 8. The larger the total area of all closed regions, the greater the difference between the sample and the data set, and the more likely it is that the sample does not belong to the data set. Taking the characteristics of free-text keystroke feature information into account, this chapter suitably modifies the fixed-text keystroke feature curve difference degree and proposes the concept of the weighted keystroke feature curve difference degree, which relates it to the frequency of use of the specific digraphs.

In fixed-text keystroke feature research, the physical meaning of the keystroke feature curve difference degree of a sample is the sum of the areas of all closed regions that the sample's keystroke feature curve forms, in the exterior region, with the upper or lower boundary curve of the corresponding data set. The area of each closed region depends mainly on the distance by which each element of the feature vector exceeds the upper or lower boundary, such as d₂, d₄ and d₇ in Fig. 9. Because the frequencies of use of the specific digraphs selected from free text differ, the distance by which each element of the feature vector exceeds the upper or lower boundary is multiplied by a corresponding weight coefficient, such as λ₂d₂, λ₄d₄ and λ₇d₇ in Fig. 8, so that the fluctuation tolerance range of a specific digraph's keystroke interval time is inversely proportional to its frequency of use. The weight coefficient applied to each element of the feature vector is directly proportional to the frequency of use of the corresponding digraph; in general, the frequency of use can be taken directly as the weight coefficient.

Compared with the keystroke feature curve difference degree for fixed text, the distinguishing property of the weighted keystroke feature curve difference degree obtained in this way is that, when any two elements of the feature vector exceed the upper or lower boundary by equal absolute distances, the element with the larger weight coefficient changes the feature curve difference degree more than the element with the smaller weight coefficient. Keystroke interval times of frequently used digraphs are more stable and fluctuate less than those of infrequently used digraphs, so they should be treated differently: the fluctuation tolerance range of a frequently used digraph's keystroke interval time should be smaller than that of an infrequently used digraph. The weighted keystroke feature curve difference degree is therefore better suited to free-text keystroke feature authentication.

Here the elements of Q_pp are the weighted feature curve difference degrees of the elements of data set S_pp, and the elements of Q_st are the feature curve difference degrees of the elements V_i^st ∈ R^5 of data set S_st.

Step 5: recognize the user's identity using the weighted feature curve difference degree and the feature curve difference degree, as follows:

If a keystroke interval time vector sample and a semi-temporal feature vector sample V_st are internal samples of sets S_pp and S_st respectively, their feature curve difference degree is defined to be zero; otherwise, the feature curve difference degree of a sample equals the sum of the areas of all closed regions that the sample's feature curve forms, in the exterior region, with the upper or lower boundary feature curve of the corresponding data set.

The test sample is then judged according to inequalities (15) and (16),

in which the two thresholds are adjustable and take values not less than 0;

Embodiment 1

An example of a specific user identity recognition is described below.

Step 1: collect data and establish the semi-temporal feature data set and the keystroke interval time data set

Experimental data were collected mainly on PCs running Windows, with a conventional mechanical keyboard as the keystroke information acquisition device. In addition, a user keystroke information acquisition program was written in the VC++ 6.0 development environment; the program stores the keystroke information of a user typing freely on the keyboard in a specified file. Before data collection began, the acquisition program was installed on the computer used by each participating user. During data collection, the user was asked to run the acquisition program each time the computer was turned on; the program interface is shown in Fig. 9. After the user clicks the [Start] button, the program collects the user's free keystroke information in the background and stores it in the file key_record.txt. The acquisition program does not disturb the user's normal use of the computer during collection. Before shutting down the computer, the user clicks the [End] button to exit the acquisition program.

After the raw data of all participants had been collected, the digraphs habitually used by each participant and their usage counts (frequencies) were extracted; the statistics are shown in Table 1.

Table 1

Table 1 lists, for each participant, the 15 most frequently used digraphs during the experiment (in descending order of frequency of use) and their usage counts. The analysis shows that the digraphs "in", "an", "ng", "zh", "wo", "en", "sh" and "ji" are shared by all participants and are used frequently, which indicates that these digraphs have a certain generality. Therefore, these 8 digraphs are chosen in the experiments of this chapter to form the specific character sequence set SK, i.e. SK = { in, an, ng, zh, wo, en, sh, ji }.
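The selection of digraphs shared by all participants can be sketched as a set intersection over each participant's list of most frequent digraphs. The function name `common_digraphs` and the three short lists below are hypothetical; they are not the contents of Table 1.

```python
def common_digraphs(top_lists):
    """Return the digraphs that appear in every participant's list."""
    common = set(top_lists[0])
    for digraphs in top_lists[1:]:
        common &= set(digraphs)
    return sorted(common)


# Hypothetical top-digraph lists for three participants (illustration only).
participants = [
    ["in", "an", "ng", "zh", "sh", "ou"],
    ["an", "in", "zh", "ng", "ai", "sh"],
    ["ng", "zh", "in", "an", "sh", "ji"],
]
shared = common_digraphs(participants)
```

Restricting SK to digraphs that every participant types often ensures that each user contributes enough samples for every component of the interval time vector.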

After the specific character sequence set SK has been selected, the frequency of use of each digraph in SK is calculated from the raw collected data and denoted λ_j, the frequency of use of the j-th digraph in SK, j = 1, 2, ..., 8. While a participant types freely, one digraph keystroke interval time vector sample and one semi-temporal feature vector sample are collected at regular intervals. Together with the data in Table 1, each participant has at least 200 digraph keystroke interval time vector samples and 200 semi-temporal feature vector samples.

Step 2: calculate the mean and standard deviation of the keystroke interval time data set and of the semi-temporal feature data

The experimental scheme for free-text keystroke feature authentication is essentially the same as that for fixed text; only the keystroke feature information and the authentication algorithm used in the experiment differ.

The first 20%, 40%, 60% and 80% of the samples in each participant's keystroke data set are taken in turn as training samples to build that participant's keystroke feature model. For convenience in analyzing the results, the variable TP denotes the percentage of the participant's total samples used for this purpose.

Then the remaining 80%, 60%, 40% and 20% of the samples in each participant's keystroke data set, respectively, are taken as test samples to calculate the participant's false rejection rate (FRR).

Next, all samples of the other 9 participants are used as test samples to attack the participant's model, and the participant's false acceptance rate (FAR) is calculated.

This process is repeated until the FRR and FAR of all 10 users have been calculated. Finally, the averages of all participants' FRR and FAR are taken as the performance indicators of the identity authentication algorithm.
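The evaluation protocol above can be sketched as follows, using the standard definitions of FRR (fraction of genuine samples rejected) and FAR (fraction of impostor samples accepted). The function names, the rounding used in the split, and the decision lists are assumptions for illustration and do not reproduce the experimental data.

```python
def split_by_tp(samples, tp):
    """Split samples into the first tp fraction (training) and the rest (test)."""
    n_train = round(len(samples) * tp)
    return samples[:n_train], samples[n_train:]


def false_rejection_rate(genuine_accepted):
    """Fraction of the user's own test samples that were rejected."""
    rejected = sum(1 for ok in genuine_accepted if not ok)
    return rejected / len(genuine_accepted)


def false_acceptance_rate(impostor_accepted):
    """Fraction of other participants' samples that were accepted."""
    accepted = sum(1 for ok in impostor_accepted if ok)
    return accepted / len(impostor_accepted)


# Hypothetical data: 200 samples split at TP = 20%, and made-up decisions.
train, test = split_by_tp(list(range(200)), 0.2)
frr = false_rejection_rate([True, True, False, True])
far = false_acceptance_rate([False, False, True, False, False])
```

Averaging `frr` and `far` over all participants gives the overall performance indicators described above.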

Step 4: user identity recognition

The experiments gave the false rejection rate (FRR), false acceptance rate (FAR) and equal error rate (EER) of the various algorithms for TP equal to 20%, 40%, 60% and 80%; the results are shown in Table 2. Table 2 shows that, for these values of TP, the EER of the authentication algorithm based on the weighted keystroke feature curve difference degree is 20.11%, 16.28%, 13.48% and 10.32%, respectively, clearly better than the other 2 comparison algorithms: authentication accuracy is higher and the user's keystroke features are authenticated more effectively. This is mainly because the Manhattan distance algorithm and the relative distance algorithm use only the keystroke interval times and semi-temporal features as keystroke features for user authentication, whereas the authentication algorithm proposed in this chapter, when calculating the weighted keystroke feature curve difference degree, includes not only the traditional keystroke interval times but also information such as the rate of change of the interval times and the frequency of use of the digraphs. The proposed algorithm can therefore describe a user's keystroke features more accurately and so improve authentication accuracy.

The performance indicators of the free-text keystroke feature authentication algorithms are shown in Table 2, and the curve of the performance index EER versus TP is shown in Fig. 5.

Table 2: Performance indicators of the free-text keystroke feature authentication algorithms

These experimental results show that, compared with traditional keystroke authentication algorithms that use only keystroke duration and keystroke time interval, the identity recognition method using the weighted keystroke feature curve difference degree performs better as a user authentication algorithm: it reduces the false rejection rate (FRR), the false acceptance rate (FAR) and the equal error rate (EER), and improves recognition accuracy.

Claims

1. An identity recognition method based on the weighted keystroke feature curve difference degree, characterized in that it is implemented according to the following steps:

Step 2: calculate the mean and standard deviation of the keystroke interval time data set and the mean and standard deviation of the semi-temporal feature data set;

Step 3: calculate the upper and lower boundaries of the keystroke interval time feature curve from the mean and standard deviation of the keystroke interval time data set, and the upper and lower boundaries of the semi-temporal feature curve from the mean and standard deviation of the semi-temporal feature data set;

Step 4: calculate the weighted feature curve difference degree of the keystroke interval times from the upper and lower boundaries of the keystroke interval time feature curve, and the feature curve difference degree of the semi-temporal features from the upper and lower boundaries of the semi-temporal feature curve;

Step 5: recognize the user's identity using the weighted feature curve difference degree and the feature curve difference degree.

2. The identity recognition method based on the weighted keystroke feature curve difference degree according to claim 1, characterized in that step 1 is implemented as follows:

1.1 From the raw keystroke information of free text, select k representative specific digraphs (two-key character sequences) to form the specific character sequence set SK;

1.2 Calculate the frequency of use λ_j (j = 1, 2, ..., k) of each digraph and construct the user's keystroke interval time data set S_pp and semi-temporal feature data set S_st, which are expressed as follows:

where k is the number of selected specific digraphs; V_i^pp ∈ R^k is the i-th keystroke interval time vector sample, whose j-th element (j = 1, ..., k) is the keystroke interval time of the j-th specific digraph in the i-th sample, the last element being that of the k-th digraph; m is the number of collected keystroke interval time vector samples; V_i^st ∈ R^5 is the i-th semi-temporal feature vector sample; WPM_i, P_{i,N_UD}, P_{i,error}, P_{i,CapsLock} and P_{i,Shift} are, respectively, the average keystroke speed, the frequency of occurrence of negative interval times (RP), the input error rate, the Caps Lock key usage frequency and the Shift key usage frequency of the i-th sample; P_{N_UD}, P_{error}, P_{Shift} and P_{CapsLock} vary in [0, 1], whereas the average keystroke speed WPM varies in [0, +∞); WPM is normally of the order of 10², which differs markedly from the magnitude of the other semi-temporal features; n is the number of collected semi-temporal feature vector samples;

In the normalization formula, max{ WPM_i | i = 1, ..., n } is the maximum average keystroke speed among the samples, denoted WPM_max. After normalization, the semi-temporal feature data set S_st is abbreviated as

S_st = { V_i^st = [v_{i,1}, v_{i,2}, v_{i,3}, v_{i,4}, v_{i,5}] | i = 1, 2, ..., n }   (4)

3. The identity recognition method based on the weighted keystroke feature curve difference degree according to claim 1, characterized in that, in step 2, the mean and standard deviation of the keystroke interval time data set and of the semi-temporal feature data set are calculated as follows:

The mean of all elements in data set S_pp and the mean of all elements in data set S_st are given by formulas (5) and (6), respectively.

The standard deviation of all elements in data set S_pp and the standard deviation of the elements in data set S_st are given by formulas (7) and (8), respectively.

4. The identity recognition method based on the weighted keystroke feature curve difference degree according to claim 1, characterized in that, in step 3, the upper and lower boundaries of the keystroke interval time feature curve and of the semi-temporal feature curve are calculated as follows:

In the boundary formulas, the two thresholds, one for each data set, are adjustable.

5. The identity recognition method based on the weighted keystroke feature curve difference degree according to claim 1, characterized in that, in step 4, the weighted feature curve difference degree of the keystroke interval times and the feature curve difference degree of the semi-temporal features are calculated as follows:

For any keystroke interval time vector sample, its weighted feature curve difference degree with respect to data set S_pp is calculated by formula (11).

For any semi-temporal feature vector sample V_s^st = [v_{s,1}, v_{s,2}, ..., v_{s,5}], its feature curve difference degree with respect to data set S_st is calculated by formula (12).

The weighted feature curve difference degree of each element of the keystroke interval time data set S_pp is calculated from the frequency of use of each digraph in set SK and formula (11), and these values form the keystroke interval time feature curve difference degree set Q_pp; the feature curve difference degree of each element of the semi-temporal feature data set S_st is calculated by formula (12), and these values form the semi-temporal feature curve difference degree set Q_st. The elements of Q_pp are the weighted feature curve difference degrees of the elements V_i^pp ∈ R^k of data set S_pp, and the elements of Q_st are the feature curve difference degrees of the elements V_i^st ∈ R^5 of data set S_st.

6. The identity recognition method based on the weighted keystroke feature curve difference degree according to claim 1, characterized in that, in step 5, the user's identity is recognized using the weighted feature curve difference degree and the feature curve difference degree as follows:

The test sample is judged according to inequalities (15) and (16), in which the two thresholds are adjustable;

If inequalities (15) and (16) both hold, the test sample is deemed to belong to the user; otherwise, the test sample is deemed not to belong to the user.

7. The identity recognition method based on the weighted keystroke feature curve difference degree according to claim 1, characterized in that the two thresholds in step 4 take values in the range 0 to 3.

8. The identity recognition method based on the weighted keystroke feature curve difference degree according to claim 5, characterized in that the two thresholds in step 5 take values not less than 0.