CN110445753A - The partition method and device of terminal device abnormal access - Google Patents

The partition method and device of terminal device abnormal access Download PDF

Info

Publication number
CN110445753A
CN110445753A CN201910580052.3A CN201910580052A CN110445753A CN 110445753 A CN110445753 A CN 110445753A CN 201910580052 A CN201910580052 A CN 201910580052A CN 110445753 A CN110445753 A CN 110445753A
Authority
CN
China
Prior art keywords
terminal device
euclidean distance
nonlinear combination
combination feature
feature set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910580052.3A
Other languages
Chinese (zh)
Inventor
黎立桂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN201910580052.3A priority Critical patent/CN110445753A/en
Priority to PCT/CN2019/103663 priority patent/WO2020258509A1/en
Publication of CN110445753A publication Critical patent/CN110445753A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/02Network architectures or network communication protocols for network security for separating internal from external traffic, e.g. firewalls
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/14Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
    • H04L63/1408Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic by monitoring network traffic
    • H04L63/1425Traffic logging, e.g. anomaly detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer And Data Communications (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The present invention is safety detection technology field, the present invention provides the partition method and device of a kind of terminal device abnormal access, the method includes the first device parameters of the terminal device according to history to obtain multiple first nonlinear combination feature sets, and divides the first nonlinear combination feature set into normal condition cluster and abnormality cluster;The second device parameter of the terminal device current accessed is obtained by the shell script on terminal device, generates the second nonlinear combination feature set;Calculate separately the first Euclidean distance of the mass center of the second nonlinear combination feature set and the normal condition cluster, and the second Euclidean distance between the mass center of abnormality cluster;When first Euclidean distance is greater than second Euclidean distance, determine that the terminal device carries out abnormal access, and isolation processing is carried out to the abnormal access of corresponding terminal device.This method is conducive to improve the isolating power to the access of terminal device present terminal unit exception.

Description

The partition method and device of terminal device abnormal access
Technical field
The present invention relates to network detection technique fields, specifically, the present invention relates to a kind of terminal device abnormal access Partition method and device.
Background technique
In the access of terminal device network, the access for providing the available terminal device transmission of server of network connection is asked Seek the data for being clicking and dragging on track of terminal device.Currently, judging whether the terminal device is normal terminal device, usually Normal end can be belonged to by the terminal device of the Data Detection for being clicking and dragging on track of the terminal device initiation access request End equipment or web crawlers.But since the detection method is not easy to distinguish the terminal device and web crawlers normally accessed, Cause the error rate of detection higher, is easy the terminal device that will normally access detection and is determined as abnormal access, influence normally to use Family access causes to be not easy to carry out isolation processing to web crawlers.
Summary of the invention
To overcome the above technical problem, when especially passing through terminal device logs network in the prior art, according to user's It is easy the problem of real user is determined as abnormal user using Trace Data, spy proposes following technical scheme:
In a first aspect, the present invention provides a kind of partition method of terminal device abnormal access, comprising the following steps:
Multiple first nonlinear combination feature sets are obtained according to the first device parameter of the terminal device of history, and will be described First nonlinear combination feature set divides normal condition cluster and abnormality cluster into;Wherein, the first nonlinear combination feature set For the nonlinear characteristic information for the terminal device that history obtains, this feature information includes the attribute data and access number of terminal device According to;
It obtains the second device parameter of the terminal device current accessed by the shell script on terminal device, generates the Two nonlinear combination feature sets;Wherein, the second nonlinear combination feature set is the non-linear of the terminal device currently obtained Characteristic information, this feature information include the attribute data and access data of terminal device;
The first Euclidean distance of the mass center of the second nonlinear combination feature set and the normal condition cluster is calculated separately, And the second Euclidean distance between the mass center of abnormality cluster;
When first Euclidean distance is greater than second Euclidean distance, determine that the terminal device carries out abnormal access, And isolation processing is carried out to the abnormal access of corresponding terminal device.
The second nonlinear combination feature set and the normal shape are calculated separately described in one of the embodiments, Before first Euclidean distance of the mass center of state cluster, and the step of the second Euclidean distance between the mass center of abnormality cluster, Further include:
Initial mass center is preset to the normal condition cluster and abnormality cluster of the first nonlinear combination feature set respectively.
It is described in one of the embodiments, to calculate separately the second nonlinear combination feature set and the normal condition First Euclidean distance of the mass center of cluster, and the step of the second Euclidean distance between the mass center of abnormality cluster includes:
Calculate separately the second nonlinear combination feature set and normal condition accessed each time in the set time period First Euclidean distance of the mass center of cluster, and the second Euclidean distance between the mass center of abnormality cluster;
Same terminal device is obtained in the set time period to the first Euclidean distance described in the preceding setting number of sequence Set and second Euclidean distance set, and respectively obtain first Euclidean distance set and it is described second it is European away from From set mode value;
And respectively using the mode value of the set of first Euclidean distance and the set of second Euclidean distance as institute State same terminal device first Euclidean distance and second Euclidean distance in the set time period.
The shell script by terminal device obtains the terminal device and currently visits in one of the embodiments, The second device parameter asked, generate the second nonlinear combination feature set the step of include:
The second device parameter of current accessed is obtained by the shell script on terminal device, generates the second nonlinear combination Feature set, and from the second nonlinear combination feature set obtain terminal device user agent;
By parsing to user agent, the model of the terminal device is obtained.
The type of the terminal device is obtained by parsing to user agent described in one of the embodiments, Number the step of after, further includes:
According to the model of the terminal device, each spy of the second nonlinear combination feature set of the terminal device is obtained Levy the normal range (NR) of numerical value;
The character numerical value of the second nonlinear combination feature set is compared with the normal range (NR);
The nonlinear combination feature set of the character numerical value counterpart terminal equipment other than the normal range (NR) is obtained as second Nonlinear combination feature set.
It is non-to obtain multiple first for the first device parameter of the terminal device according to history in one of the embodiments, The step of linear combination feature set includes:
The first device parameter for obtaining the terminal device of history acquisition, extracts the characteristic information of first device parameter, Multiple first nonlinear combination feature sets are generated according to the characteristic information.
It is non-to obtain multiple first for the first device parameter of the terminal device according to history in one of the embodiments, After the step of linear combination feature set, further includes:
Network access idle section is obtained according to the amount of access of the terminal device of history;
Obtain the idle access frequency of the same terminal device in the same network access idle section;
The idle access frequency is compared with preset value;
If the idle access frequency is greater than preset value, the second nonlinear combination feature set of counterpart terminal equipment is obtained.
Second aspect, the present invention also provides a kind of isolating devices of terminal device abnormal access comprising:
Module is obtained, the first device parameter for the terminal device according to history obtains multiple first nonlinear combinations spies Collection, and divide the first nonlinear combination feature set into normal condition cluster and abnormality cluster;
Generation module is set for obtaining the second of the terminal device current accessed by the shell script on terminal device Standby parameter, generates the second nonlinear combination feature set;
Computing module, for calculating separately the second nonlinear combination feature set and the mass center of the normal condition cluster First Euclidean distance, and the second Euclidean distance between the mass center of abnormality cluster;
Isolation module determines the terminal device for being greater than second Euclidean distance when first Euclidean distance Abnormal access is carried out, and isolation processing is carried out to the abnormal access of corresponding terminal device.
The third aspect, the present invention also provides a kind of servers comprising:
One or more processors;
Memory;
One or more computer programs, wherein one or more of computer programs are stored in the memory And be configured as being executed by one or more of processors, one or more of computer programs are configured to carry out first The partition method of terminal device abnormal access described in aspect embodiment.
Fourth aspect, the present invention also provides a kind of computer readable storage medium, on the computer readable storage medium It is stored with computer program, realizes that terminal device described in first aspect embodiment is different when which is executed by processor The partition method that frequentation is asked.
The partition method and device of a kind of terminal device abnormal access provided by the present invention respectively acquire the history Terminal device network access and the access of current network generate multiple first nonlinear combination feature sets and corresponding respectively Second nonlinear combination feature set, and by the data point of the multiple first nonlinear combination feature set be divided into normal condition cluster and Abnormality cluster calculates separately to obtain the first Europe of the mass center of the second nonlinear combination feature set and the normal condition cluster Formula distance, and the second Euclidean distance between the mass center of abnormality cluster;According to first Euclidean distance and described the The comparison result of two Euclidean distances determines whether the network access is abnormal access, and carries out isolation processing to abnormal access.
Technical solution provided by the present invention feature generated when initiating network access request by the terminal device Information is converted into corresponding data point, and the spatial relation mutual according to data point, and obtains determining result with this. In this way, can be by characteristic information that terminal device terminal network accesses to pass through the comparison knot of the distance parameter between related Fruit intuitively reflects whether the terminal device network access is abnormal access, and carries out isolation processing to the abnormal access.It should Method is easier to distinguish the terminal device and web crawlers that normally access, reduces the general of web crawlers access invasion website Rate is effectively isolated web crawlers.
The additional aspect of the isolation present invention and advantage of terminal device abnormal access will be set forth in part in the description, These will become apparent from the description below, or practice through the invention is recognized.
Detailed description of the invention
Above-mentioned and/or additional aspect and advantage of the invention will become from the following description of the accompanying drawings of embodiments Obviously and it is readily appreciated that, in which:
Fig. 1 is the applied environment figure of the isolation scheme of the embodiment execution terminal device abnormal access in the present invention;
Fig. 2 is the flow chart of the partition method of the terminal device abnormal access of one embodiment in the present invention;
Fig. 3 is the flow chart of the partition method of the terminal device abnormal access of another embodiment in the present invention
Fig. 4 is the schematic diagram of the isolating device of the terminal device abnormal access of one embodiment in the present invention;
Fig. 5 is the structural schematic diagram of the server of one embodiment in the present invention.
Specific embodiment
The embodiment of the present invention is described below in detail, examples of the embodiments are shown in the accompanying drawings, wherein from beginning to end Same or similar label indicates same or similar element or element with the same or similar functions.Below with reference to attached The embodiment of figure description is exemplary, and for explaining only the invention, and is not construed as limiting the claims.
Those skilled in the art of the present technique are appreciated that unless expressly stated, singular " one " used herein, " one It is a ", " described " and "the" may also comprise plural form.It is to be further understood that being arranged used in specification of the invention Diction " comprising " refer to that there are the feature, integer, step, operation, element and/or component, but it is not excluded that in the presence of or addition Other one or more features, integer, step, operation, element, component and/or their group.It should be understood that when we claim member Part is " connected " or when " coupled " to another element, it can be directly connected or coupled to other elements, or there may also be Intermediary element.In addition, " connection " used herein or " coupling " may include being wirelessly connected or wirelessly coupling.It is used herein to arrange Diction "and/or" includes one or more associated wholes for listing item or any cell and all combinations.
Those skilled in the art of the present technique are appreciated that unless otherwise defined, all terms used herein (including technology art Language and scientific term), there is meaning identical with the general understanding of those of ordinary skill in fields of the present invention.Should also Understand, those terms such as defined in the general dictionary, it should be understood that have in the context of the prior art The consistent meaning of meaning, and unless idealization or meaning too formal otherwise will not be used by specific definitions as here To explain.
Those skilled in the art of the present technique are appreciated that " terminal " used herein above, " terminal device " both include wireless communication The equipment of number receiver, only has the equipment of the wireless signal receiver of non-emissive ability, and including receiving and emitting hardware Equipment, have on both-way communication chain road, can execute both-way communication reception and emit hardware equipment.This equipment It may include: honeycomb or other communication apparatus, shown with single line display or multi-line display or without multi-line The honeycomb of device or other communication apparatus;PCS (Personal Communications Service, person communication system), can With combine voice, data processing, fax and/or data communication capabilities;PDA (Personal Digital Assistant, it is personal Digital assistants), it may include radio frequency receiver, pager, the Internet/intranet access, web browser, notepad, day It goes through and/or GPS (Global Positioning System, global positioning system) receiver;Conventional laptop and/or palm Type computer or other equipment, have and/or the conventional laptop including radio frequency receiver and/or palmtop computer or its His equipment." terminal " used herein above, " terminal device " can be it is portable, can transport, be mounted on the vehicles (aviation, Sea-freight and/or land) in, or be suitable for and/or be configured in local runtime, and/or with distribution form, operate in the earth And/or any other position operation in space." terminal " used herein above, " terminal device " can also be communicating terminal, on Network termination, music/video playback terminal, such as can be PDA, MID (Mobile Internet Device, mobile Internet Equipment) and/or mobile phone with music/video playing function, it is also possible to the equipment such as smart television, set-top box.
Those skilled in the art of the present technique are appreciated that remote network devices used herein above comprising but be not limited to count The cloud that calculation machine, network host, single network server, multiple network server collection or multiple servers are constituted.Here, Yun Youji It is constituted in a large number of computers or network servers of cloud computing (Cloud Computing), wherein cloud computing is distributed computing One kind, a super virtual computer consisting of a loosely coupled set of computers.In the embodiment of the present invention, distal end It can be realized and be communicated by any communication modes between the network equipment, terminal device and WNS server, including but not limited to, is based on The mobile communication of 3GPP, LTE, WIMAX, based on TCP/IP, the computer network communication of udp protocol and based on bluetooth, infrared The low coverage wireless transmission method of transmission standard.
Refering to what is shown in Fig. 1, Fig. 1 is the applied environment figure of the embodiment of the present invention;In the embodiment, the technology of the present invention side Case can be based on realizing on server, and as shown in figure 1, terminal device 110 and 120 can access server by internet 130, the network request that terminal device 110 and/or 120 is issued to server 130, server 130 is counted according to network request According to interaction.When carrying out data interaction, server 130 obtains terminal according to the solicited message of terminal device 110 and/or 120 and sets Standby 110 and/or 120 access data and attribute data, and the terminal device is carried out abnormality detection according to the data.
It is easy for real user to be determined as abnormal user to solve the problems, such as to detect abnormal data at present, the present invention provides A kind of partition method of terminal device abnormal access.It can refer to Fig. 2, Fig. 2 is the terminal device abnormal access of one embodiment Partition method flow chart, method includes the following steps:
S210, multiple first nonlinear combination feature sets are obtained according to the first device parameter of the terminal device of history, and Divide the first nonlinear combination feature set into normal condition cluster and abnormality cluster.
When server and terminal device carry out data interaction, according to the network request that terminal device issues, obtaining should The relevant parameter of terminal device.In this step, server obtains from the network request that the terminal device of history is issued One device parameter, server parse first device parameter, and non-linear according to the result of parsing acquisition multiple first Assemblage characteristic collection.
The first nonlinear combination feature set is that one access note of terminal device of data interaction was carried out with server The feature set generated is recorded, which is the nonlinear characteristic information for the terminal device that history obtains, should Characteristic information includes the attribute data and access data of terminal device.Such as attribute data may include the model of terminal device, end The screen resolution x*y of the end equipment or available screen resolution X*Y of browser, access data may include terminal device to service Device issues the frequency etc. of request.
The corresponding characteristic information of the first nonlinear combination feature set, in the present embodiment, this feature information is specially Corresponding characteristic value.Corresponding coordinate is set, and marks the access of the terminal device of history each time on coordinate and records generation Feature set or a n dimension strong point.The feature set to be formed is recorded about different access, and corresponding normal shape is formed on coordinate State cluster and abnormality cluster.The considerations of being absolutely greater than abnormal conditions according to normal condition, big cluster are normal condition clusters, and tuftlet is different Normal manifold of states.
Further, it in order to eliminate the dimension relation between variable, so that data be made to be comparable, is marked to characteristic value Before, the characteristic information value in each feature set is standardized.For example, in the obtained feature set for accessing record each time In may include the variable of hundred-mark system and the variable of 5 score values, can be in same mark only by all data normalizations It is compared in standard.
S220, the second device parameter that the terminal device current accessed is obtained by the shell script on terminal device, Generate the second nonlinear combination feature set.
In order to can the network access of terminal device described in real-time detection whether handle abnormality, it is right according to detection needs The state that the terminal device currently accesses each time is detected.In this step, the server passes through network connection, to The terminal device provides shell script, to obtain the second device parameter that the terminal device currently accesses each time.It is described Second device parameter is identical as the property of first device parameter.The second nonlinear combination feature set currently obtains The nonlinear characteristic information of terminal device, this feature information include the attribute data and access data of terminal device.
Server is parsed according to second device parameter, is extracted and is obtained the feature letter of second device parameter Breath is generated according to the characteristic information about the second nonlinear combination when the preceding terminal device for issuing network request to server Feature set.
Characteristic information included by the second nonlinear combination feature set at least with the first nonlinear combination feature The characteristic information of collection kind is corresponding, compares so as to subsequent.
S230, the first European of the second nonlinear combination feature set and the mass center of the normal condition cluster is calculated separately Distance, and the second Euclidean distance between the mass center of abnormality cluster.
In this step, it by being compared with the normal condition of the network of the terminal device of history access, obtains current Whether the network access for accessing the terminal device of server is abnormality.
On the coordinate of characteristic information for marking the first nonlinear combination feature set, what mark was generated from step S220 And the characteristic information of the second nonlinear combination feature set is obtained by standardization, obtain corresponding data point.
The coordinate is being utilized, is calculating separately data point that the second nonlinear combination feature set obtains to described and institute The first Euclidean distance of the mass center of normal condition cluster is stated, and the second Euclidean distance between the mass center of abnormality cluster, obtained To the second nonlinear combination feature set actual range with normal condition cluster and with abnormality cluster respectively.
S240, when first Euclidean distance is greater than second Euclidean distance, it is abnormal to determine that the terminal device carries out Access, and isolation processing is carried out to the abnormal access of corresponding terminal device.The first Euclidean distance obtained according to step S230 With the second Euclidean distance, and it is compared.If comparison result is that first Euclidean distance is greater than second Euclidean distance, It indicates that present terminal equipment initiates network access request closer to abnormality to server, at this moment, determines the current end It is abnormal access state that end equipment, which initiates network access request to server,.
The network request currently initiated for terminal device is judged as abnormal access request, and server is directly to the network Request carries out isolation processing, that is, refuses to respond abnormal access request, and terminal device described in new demand of laying equal stress on accesses verifying; If the network request that terminal device is currently initiated is judged as normal access request, directly in response to request.
In the present invention, the first nonlinear combination feature set and the second nonlinear combination feature set be individually by Made of multiple characteristic value collections.Multiple characteristic values that access is formed by nonlinear combination feature set each time form corresponding Dimension vector.Therefore, in this application, the first nonlinear characteristic collection caused by each access and described first non-linear Feature set is measured with its dimension vector.And for the normal condition cluster and abnormality of the first nonlinear characteristic collection Cluster is to be formed by manifold of states by the dimension vector of corresponding multiple first nonlinear characteristic collection.Therefore, of the invention The calculating of Euclidean distance in technical solution is the meter of the space length between the dimension vector to corresponding nonlinear characteristic collection It calculates.
A kind of partition method of terminal device abnormal access provided by the invention initiates network by the terminal device of history The data point of first nonlinear combination feature set of access divides normal condition cluster and abnormality cluster into, and by present terminal equipment To server initiate obtained the second nonlinear combination feature set of network access request respectively with normal condition cluster and abnormality Cluster carries out Euclidean distance calculating, and according to the comparison between distance, obtaining the network access request that present terminal equipment is initiated is No is abnormality.It is special that data produced by the present invention accesses the terminal device network form the first, second nonlinear combination Collection, and the data point that will be obtained from the first, second nonlinear combination feature set is marked on coordinate.Technical solution of the present invention It directly compares, is avoided in the prior art after data caused by the network access request of terminal device initiation are handled Only in the generated usage record such as authentication procedures of user's using terminal equipment the click time and dragging track Data are easy the problem of real user is determined as abnormal user according to caused as abnormality detection, more precisely react The state for the network access request that present terminal equipment is initiated to server, and by more simply and intuitively in a manner of data comparison Obtain abnormality detection as a result, be conducive to carry out isolation processing to the corresponding abnormal access of abnormality detection.It is described in above scheme On the basis of, before step S230, that is, calculating separately the second nonlinear combination feature set and the normal condition cluster Before the second Euclidean distance between first Euclidean distance of mass center and the mass center of abnormality cluster, further includes:
Initial mass center is preset to the normal condition cluster and abnormality cluster of the first nonlinear combination feature set respectively.
In this step, especially for the calculating for never carrying out Euclidean distance with the second nonlinear combination feature set In the case where, the position of corresponding mass center is sought according to the dimension vector of normal condition cluster and abnormality cluster respectively, is corresponded to Initial mass center.The mass center is the mass centre of corresponding states cluster, can be according to the collected state of the dimension vector of manifold of states Variation and change.
When calculating the European of the second nonlinear combination feature set and the normal condition cluster and abnormality cluster for the first time Apart from when, only need to calculate the second nonlinear combination feature set at a distance from the initial mass center.
If need to continue calculated for subsequent terminal device issues the second nonlinear combination feature set caused by network request It, only need to be according to the second nonlinear combination feature set newly generated before at a distance from normal condition cluster and abnormality cluster The value of the position of data point can directly acquire the value of the position of new mass center with the value of the position of the initial mass center, avoid every After the network request of the secondary terminal device for having increased history newly, need again according to the data of all normal condition clusters and abnormality cluster Point calculates mass center.In this way, being conducive to control the operand of subsequent abnormality detection, to keep the isolation of terminal device abnormal access Efficiency.
According to the use habit of the logging in network of general normal users, for a terminal device in a short period of time to Server issues network request under normal circumstances can be more than one, therefore, can in a short period of time may with regard to same terminal device More than one network request can be issued to server.
In this regard, corresponding step S230 can include:
S231, the second nonlinear combination feature set for accessing each time in the set time period and normal is calculated separately First Euclidean distance of the mass center of manifold of states, and the second Euclidean distance between the mass center of abnormality cluster;
S232, same terminal device is obtained in the set time period described in the preceding setting number of sequence first it is European away from From set and second Euclidean distance set, and respectively obtain first Euclidean distance set and second Europe The mode value of the set of formula distance;
S233 is simultaneously made respectively with the mode value of the set of first Euclidean distance and the set of second Euclidean distance For the same terminal device first Euclidean distance and second Euclidean distance in the set time period.
Specifically, same terminal device is every issues primary network request to server, just will form a pair corresponding first Euclidean distance and the second Euclidean distance.
In the set time period, the second nonlinear combination spy when same terminal device issues network request each time is obtained Collection, and corresponding dimension vector is obtained according to multiple characteristic values in the second nonlinear combination feature set.In the present embodiment In, according to the dimension vector of the second nonlinear combination feature set respectively with the mass center of the normal condition cluster and the abnormality The mass center of cluster is corresponding to seek first Euclidean distance and the second Euclidean distance of same terminal device network request each time.In In set period of time, obtained multiple first Euclidean distances of network request and more are repeatedly issued according to same terminal device respectively In a second Euclidean distance the acquiring size of distance value sort it is preceding setting number first Euclidean distance set and The set of second Euclidean distance, point as described in the number that sets as 100, and respectively with regard to this 100 described first it is European away from From respective mode value is obtained in set composed by composed set and second Euclidean distance, i.e., in this setting number It is middle the largest number of distance values occur.And with the set of obtained first Euclidean distance and second Euclidean distance Gather corresponding mode value as corresponding same terminal device first Euclidean distance in the set time period and described the Two Euclidean distances.In this way, same terminal device generated pending data in a short period of time can be simplified, exclude as far as possible because of idol Abnormal data caused by right factor influences the result of related Euclidean distance value.
For step S220, comprising:
A1, the second device parameter that current accessed is obtained by the shell script on terminal device, it is non-linear to generate second Assemblage characteristic collection, and from the second nonlinear combination feature set obtain terminal device user agent;
A2, by being parsed to user agent, obtain the model of the terminal device.
In step A1-A2, server obtains the second current device parameter by the detection script of terminal device, obtains The second nonlinear combination feature set current about the terminal device for issuing network request, and from the second nonlinear combination feature Concentrate the user agent for obtaining the terminal device.By parsing to the user agent, the corresponding terminal device is obtained Model.
On this basis, by that can also include: after step S220
A3, according to the model of the terminal device, obtain each of the second nonlinear combination feature set of the terminal device The normal range (NR) of a character numerical value;
A4, the character numerical value of the second nonlinear combination feature set is compared with the normal range (NR);
A5, the character numerical value obtained other than the normal range (NR) correspond to nonlinear combination feature set as second non-linear group Close feature set.
According to the model of the obtained terminal device of above-mentioned steps A2, to the second nonlinear combination feature of the terminal device Each character numerical value of collection limits normal range value respectively.
The each character numerical value that will acquire the second nonlinear combination feature set of the terminal device, and it is obtained above right The normal range value of model is answered to compare, according to the prediction of result of comparison, accordingly whether terminal device is in abnormality, In this, as the foundation that web crawlers is isolated.
If a certain characteristic value of corresponding terminal device is not in the normal range value of its model, the terminal device It is likely to be at abnormality, it is likely that be utilized and carry out crawler deployment access network.
Using the nonlinear combination feature set of the terminal device for being likely to be at abnormality as described second non-linear group Close feature set, screened in advance with will pass through the data treated and compared, reduce the later period progress Euclidean distance seek and data The data processing works such as comparison.
For will acquire each character numerical value of the second nonlinear combination feature set of the terminal device, obtained with above-mentioned Correspondence model the process that compares of normal range value, the pixel ratio that may include terminal device, the ginseng about resolution ratio The comprehensive condition of number either verification time and verifying number.
Specifically, following example can be referred to:
(1) the whether consistent assemblage characteristic of the system platform of pixel ratio and terminal device:
Windows system computerized real pixel is than generally 1 or so, if terminal device acquired in server Windows system computerized pixel ratio is more than or equal to 2, then it is possible that being that official's simulator of computer simulation mobile phone is attacked Identifying code, it is more likely that be abnormal terminal device;
The real pixel of iPhone is than being generally 2-3 or so, if the pixel ratio of iPhone acquired in server It is 1, then it is likely used only to being the attack identifying code such as crawler or automation equipment or official's simulator, then corresponding terminal device It is likely to abnormal terminal device;
The pixel ratio of apple mac computer is generally 1-2 or so, if the pixel ratio of mac computer acquired in server is 3, then corresponding terminal device is likely to abnormal terminal device.
(2) nonlinear combination feature and the whether consistent assemblage characteristic of system platform based on resolution ratio are generated:
Browser can use screen resolution x, the difference of y product xy and terminal device screen resolution x1, y1 product x1y1 X1y1-xy and the whether consistent assemblage characteristic of system platform:
For example the xy-x1y1 of the computer end chrome browser obtained under normal circumstances by front end is not 0, that is, just Computer end chrome browser passes through terminal device acquired in server in normal situation available point of screen resolution and browser Resolution has certain difference, and passes through the screen resolution and browser of terminal device acquired in server when computer simulation mobile phone Available resolution is identical or widely different (for example x1y1-xy is greater than 150000), then corresponding terminal device is likely to It is abnormal terminal device.
(3) resolution ratio X*Y value normal range (NR) and the whether consistent assemblage characteristic of system platform;
When especially resolution ratio is too low, corresponding low side devices are disposed for crawler).
It, can for step S210 specifically:
The first device parameter for obtaining the terminal device of history acquisition, extracts the characteristic information of first device parameter, Multiple first nonlinear combination feature sets are generated according to the characteristic information.
For the first device parameter about terminal device of history acquisition, server is mentioned for first device parameter Relevant characteristic information, such as pixel ratio, resolution ratio or verification time and frequency characteristic information are taken, and is directed to each characteristic information Respective first nonlinear combination feature set is generated, and the initiation network request of comprehensive same terminal device each time is generated All first nonlinear combination feature sets can form the data point of a multi-C vector, and can be in the enterprising rower of corresponding coordinate Note, to carry out subsequent data analysis and statistics.
After step S210, can also include:
B1, the idle section that network access is obtained according to the amount of access of the terminal device of history;
B2, the idle access frequency for obtaining the same terminal device in the same network access idle section;
B3, the idle access frequency and preset value are compared;
If B4, the idle access frequency are greater than preset value, the second nonlinear combination feature of counterpart terminal equipment is obtained Collection.
According to above-mentioned steps B1-B4, network access idle section is obtained according to the access number of the terminal device of history.And root The idle access frequency of same terminal device is obtained according to network access idle section.The display of same terminal device will be accessed Frequency is compared with preset value.In the present embodiment, the preset value can be by obtaining the terminal device of history described The amount of access of network access idle section counts to obtain.
If the access frequency of the terminal device is greater than preset value, which is set as monitored object, and Obtain its second nonlinear combination feature set.Other characteristic informations are extracted by the second nonlinear combination feature set of acquisition, into The further monitoring of row, comparative analysis improve the capture effect of abnormal conditions so that the detection of network access exception has more specific aim Rate promotes the effect that the abnormal access caused by web crawlers is isolated.
Based on inventive concept identical with the partition method of above-mentioned terminal device abnormal access, the embodiment of the present invention is also provided A kind of isolating device of terminal device abnormal access, as shown in Figure 4, comprising:
Module 410 is obtained, the first device parameter for the terminal device according to history obtains multiple first non-linear groups Feature set is closed, and divides the first nonlinear combination feature set into normal condition cluster and abnormality cluster;Wherein, described first Nonlinear combination feature set is the nonlinear characteristic information for the terminal device that history obtains, and this feature information includes terminal device Attribute data and access data;
Generation module 420, for obtaining the of the terminal device current accessed by the shell script on terminal device Two device parameters generate the second nonlinear combination feature set;Wherein, the second nonlinear combination feature set currently obtains The nonlinear characteristic information of terminal device, this feature information include the attribute data and access data of terminal device;
Computing module 430, for calculating separately the matter of the second nonlinear combination feature set Yu the normal condition cluster First Euclidean distance of the heart, and the second Euclidean distance between the mass center of abnormality cluster;
Isolation module 440 determines that the terminal is set for being greater than second Euclidean distance when first Euclidean distance It is standby to carry out abnormal access, and isolation processing is carried out to the abnormal access of corresponding terminal device.
Referring to FIG. 5, Fig. 5 is the schematic diagram of internal structure of server in one embodiment.As shown in figure 4, the server Including processor 510, storage medium 520, memory 530 and the network interface 540 connected by system bus.Wherein, the clothes The storage medium 520 of business device is stored with operating system, database and computer-readable instruction, and control letter can be stored in database Sequence is ceased, when which is executed by processor 510, processor 510 may make to realize that a kind of terminal device is abnormal The partition method of access, processor 510 are able to achieve the isolating device of one of embodiment illustrated in fig. 4 terminal device abnormal access In acquisition module 410, the function of generation module 420, computing module 430 and isolation model 440.The processor of the server 510, for providing calculating and control ability, support the operation of entire server.It can be stored in the memory 530 of the server Computer-readable instruction may make processor 510 to execute a kind of terminal when the computer-readable instruction is executed by processor 510 The partition method of unit exception access.The network interface 540 of the server is used for and terminal connection communication.Those skilled in the art It is appreciated that structure shown in Fig. 5, only the block diagram of part-structure relevant to application scheme, is not constituted to this The restriction for the server that application scheme is applied thereon, specific server may include more more or fewer than as shown in the figure Component perhaps combines certain components or with different component layouts.
In one embodiment, the invention also provides a kind of storage medium for being stored with computer-readable instruction, the meters When calculation machine readable instruction is executed by one or more processors, so that one or more processors execute following steps: according to going through First device parameter of the terminal device of history obtains multiple first nonlinear combination feature sets, and by first nonlinear combination Feature set divides normal condition cluster and abnormality cluster into;Wherein, the first nonlinear combination feature set is the end that history obtains The nonlinear characteristic information of end equipment, this feature information include the attribute data and access data of terminal device;It is set by terminal Standby upper shell script obtains the second device parameter of the terminal device current accessed, generates the second nonlinear combination feature Collection;Wherein, the second nonlinear combination feature set is the nonlinear characteristic information of the terminal device currently obtained, this feature letter Breath includes the attribute data and access data of terminal device;Calculate separately the second nonlinear combination feature set and described normal First Euclidean distance of the mass center of manifold of states, and the second Euclidean distance between the mass center of abnormality cluster;When described One Euclidean distance is greater than second Euclidean distance, determines that the terminal device carries out abnormal access, and set to corresponding terminal Standby abnormal access carries out isolation processing.
Based on the above embodiments it is found that the maximum beneficial effect of the present invention is:
The partition method and device of a kind of terminal device abnormal access provided by the present invention respectively acquire the history Terminal device network access and the access of current network generate multiple first nonlinear combination feature sets and corresponding respectively Second nonlinear combination feature set, and by the data point of the multiple first nonlinear combination feature set be divided into normal condition cluster and Abnormality cluster calculates separately to obtain the first Europe of the mass center of the second nonlinear combination feature set and the normal condition cluster Formula distance, and the second Euclidean distance between the mass center of abnormality cluster;According to first Euclidean distance and described the The comparison result of two Euclidean distances determines whether network access is abnormal access, and to abnormal access it is corresponding request into Row isolation processing.
Technical solution provided by the present invention feature generated when initiating network access request by the terminal device Information is converted into corresponding data point, and the spatial relation mutual according to data point, and obtains determining result with this. In this way, can be by characteristic information that terminal device terminal network accesses to pass through the comparison knot of the distance parameter between related Fruit intuitively reflects whether the terminal device network access is abnormal access, makes it easier to judge that the terminal device is It is no that invasion website is accessed by web crawlers, improve the isolation effect to the abnormal access of terminal device.
To sum up, the present invention passes through the partition method and device of terminal device abnormal access, by visiting terminal device network It asks that characteristic information data generated carries out visual distance versus, and obtains determining whether the judgement result of abnormal access Technical solution, solve in the prior art user when by terminal device logs network use Trace Data it is easy will be true User is determined as the problem of abnormal user, improves the detectability to terminal device abnormal access.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, which can be stored in a computer-readable storage and be situated between In matter, the program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, storage medium above-mentioned can be Storage mediums or the random access memories such as magnetic disk, CD, read-only memory (Read-Only Memory, ROM) (Random Access Memory, RAM) etc..
Each technical characteristic of embodiment described above can be combined arbitrarily, for simplicity of description, not to above-mentioned reality It applies all possible combination of each technical characteristic in example to be all described, as long as however, the combination of these technical characteristics is not deposited In contradiction, all should be considered as described in this specification.
The embodiments described above only express several embodiments of the present invention, and the description thereof is more specific and detailed, but simultaneously Limitations on the scope of the patent of the present invention therefore cannot be interpreted as.It should be pointed out that for those of ordinary skill in the art For, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to guarantor of the invention Protect range.Therefore, the scope of protection of the patent of the invention shall be subject to the appended claims.

Claims (10)

1. a kind of partition method of terminal device abnormal access, which comprises the following steps:
Multiple first nonlinear combination feature sets are obtained according to the first device parameter of the terminal device of history, and by described first Nonlinear combination feature set divides normal condition cluster and abnormality cluster into;Wherein, the first nonlinear combination feature set is to go through The nonlinear characteristic information for the terminal device that history obtains, this feature information include the attribute data and access data of terminal device;
The second device parameter of the terminal device current accessed is obtained by the shell script on terminal device, and it is non-to generate second Linear combination feature set;Wherein, the second nonlinear combination feature set is the nonlinear characteristic of the terminal device currently obtained Information, this feature information include the attribute data and access data of terminal device;
The first Euclidean distance of the mass center of the second nonlinear combination feature set and the normal condition cluster is calculated separately, and The second Euclidean distance between the mass center of abnormality cluster;
When first Euclidean distance is greater than second Euclidean distance, determine that the terminal device carries out abnormal access, and right The abnormal access of corresponding terminal device carries out isolation processing.
2. the method according to claim 1, wherein
The mass center for calculating separately the second nonlinear combination feature set and the normal condition cluster first it is European away from From, and before the step of the second Euclidean distance between the mass center of abnormality cluster, further includes:
Initial mass center is preset to the normal condition cluster and abnormality cluster of the first nonlinear combination feature set respectively.
3. method according to claim 1 or 2, which is characterized in that
First Euclidean distance of the mass center for calculating separately the second nonlinear combination feature set and the normal condition cluster, And the step of the second Euclidean distance between the mass center of abnormality cluster, includes:
Calculate separately the second nonlinear combination feature set and normal condition cluster accessed each time in the set time period First Euclidean distance of mass center, and the second Euclidean distance between the mass center of abnormality cluster;
Same terminal device is obtained in the set time period to the set for the first Euclidean distance described in preceding setting number that sorts With the set of second Euclidean distance, and respectively obtain first Euclidean distance set and second Euclidean distance The mode value of set;
And respectively using the mode value of the set of first Euclidean distance and the set of second Euclidean distance as described same One terminal device first Euclidean distance and second Euclidean distance in the set time period.
4. the method according to claim 1, wherein
The shell script by terminal device obtains the second device parameter of the terminal device current accessed, generates the The step of two nonlinear combination feature sets includes:
The second device parameter of current accessed is obtained by the shell script on terminal device, generates the second nonlinear combination feature Collect, and obtains the user agent of terminal device from the second nonlinear combination feature set;
By parsing to user agent, the model of the terminal device is obtained.
5. according to the method described in claim 4, it is characterized in that,
It is described by being parsed to user agent, obtaining the model of the terminal device the step of after, further includes:
According to the model of the terminal device, each characteristic of the second nonlinear combination feature set of the terminal device is obtained The normal range (NR) of value;
The character numerical value of the second nonlinear combination feature set is compared with the normal range (NR);
The nonlinear combination feature set for obtaining the character numerical value counterpart terminal equipment other than the normal range (NR) is non-thread as second Property assemblage characteristic collection.
6. the method according to claim 1, wherein
The step of the first device parameter acquisition multiple first nonlinear combination feature sets of the terminal device according to history, wraps It includes:
The first device parameter for obtaining the terminal device of history acquisition, extracts the characteristic information of first device parameter, according to The characteristic information generates multiple first nonlinear combination feature sets.
7. the method according to claim 1, wherein
First device parameter of the terminal device according to history obtain the step of multiple first nonlinear combination feature sets it Afterwards, further includes:
Network access idle section is obtained according to the amount of access of the terminal device of history;
Obtain the idle access frequency of the same terminal device in the same network access idle section;
The idle access frequency is compared with preset value;
If the idle access frequency is greater than preset value, the second nonlinear combination feature set of counterpart terminal equipment is obtained.
8. a kind of isolating device of terminal device abnormal access characterized by comprising
Module is obtained, the first device parameter for the terminal device according to history obtains multiple first nonlinear combination features Collection, and divide the first nonlinear combination feature set into normal condition cluster and abnormality cluster;
Generation module, the second equipment for obtaining the terminal device current accessed by the shell script on terminal device are joined Number generates the second nonlinear combination feature set;
Computing module, first of the mass center for calculating separately the second nonlinear combination feature set and the normal condition cluster Euclidean distance, and the second Euclidean distance between the mass center of abnormality cluster;
Isolation module determines that the terminal device carries out for being greater than second Euclidean distance when first Euclidean distance Abnormal access, and isolation processing is carried out to the abnormal access of corresponding terminal device.
9. a kind of server characterized by comprising
One or more processors;
Memory;
One or more computer programs, wherein one or more of computer programs are stored in the memory and quilt It is configured to be executed by one or more of processors, one or more of computer programs are configured to carry out according to right It is required that the partition method of 1 to 7 described in any item terminal device abnormal access.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium Program realizes claim 1 to 7 described in any item terminal device abnormal access when the computer program is executed by processor Partition method.
CN201910580052.3A 2019-06-28 2019-06-28 The partition method and device of terminal device abnormal access Pending CN110445753A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201910580052.3A CN110445753A (en) 2019-06-28 2019-06-28 The partition method and device of terminal device abnormal access
PCT/CN2019/103663 WO2020258509A1 (en) 2019-06-28 2019-08-30 Method and device for isolating abnormal access of terminal device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910580052.3A CN110445753A (en) 2019-06-28 2019-06-28 The partition method and device of terminal device abnormal access

Publications (1)

Publication Number Publication Date
CN110445753A true CN110445753A (en) 2019-11-12

Family

ID=68428743

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910580052.3A Pending CN110445753A (en) 2019-06-28 2019-06-28 The partition method and device of terminal device abnormal access

Country Status (2)

Country Link
CN (1) CN110445753A (en)
WO (1) WO2020258509A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111600880A (en) * 2020-05-14 2020-08-28 深信服科技股份有限公司 Method, system, storage medium and terminal for detecting abnormal access behavior
CN116150542A (en) * 2023-04-21 2023-05-23 河北网新数字技术股份有限公司 Dynamic page generation method and device and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107465648A (en) * 2016-06-06 2017-12-12 腾讯科技(深圳)有限公司 The recognition methods of warping apparatus and device
CN109446768A (en) * 2018-10-09 2019-03-08 北京北信源软件股份有限公司 Application access abnormal behavior detection method and system
CN109714311A (en) * 2018-11-15 2019-05-03 北京天地和兴科技有限公司 A method of the unusual checking based on clustering algorithm
CN109800130A (en) * 2019-01-31 2019-05-24 郑州云海信息技术有限公司 A kind of apparatus monitoring method, device, equipment and medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101219538B1 (en) * 2009-07-29 2013-01-08 한국전자통신연구원 Apparatus for detecting network attack based on visual data analysis and its method thereof
CN107819631B (en) * 2017-11-23 2021-03-02 东软集团股份有限公司 Equipment anomaly detection method, device and equipment
CN109391620B (en) * 2018-10-22 2021-06-25 武汉极意网络科技有限公司 Method, system, server and storage medium for establishing abnormal behavior judgment model
CN109886290B (en) * 2019-01-08 2024-05-28 平安科技(深圳)有限公司 User request detection method and device, computer equipment and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107465648A (en) * 2016-06-06 2017-12-12 腾讯科技(深圳)有限公司 The recognition methods of warping apparatus and device
CN109446768A (en) * 2018-10-09 2019-03-08 北京北信源软件股份有限公司 Application access abnormal behavior detection method and system
CN109714311A (en) * 2018-11-15 2019-05-03 北京天地和兴科技有限公司 A method of the unusual checking based on clustering algorithm
CN109800130A (en) * 2019-01-31 2019-05-24 郑州云海信息技术有限公司 A kind of apparatus monitoring method, device, equipment and medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111600880A (en) * 2020-05-14 2020-08-28 深信服科技股份有限公司 Method, system, storage medium and terminal for detecting abnormal access behavior
CN116150542A (en) * 2023-04-21 2023-05-23 河北网新数字技术股份有限公司 Dynamic page generation method and device and storage medium

Also Published As

Publication number Publication date
WO2020258509A1 (en) 2020-12-30

Similar Documents

Publication Publication Date Title
US12074888B2 (en) Network security monitoring method, network security monitoring device, and system
CN111092852A (en) Network security monitoring method, device, equipment and storage medium based on big data
CN113159615B (en) Intelligent information security risk measuring system and method for industrial control system
CN103905440B (en) Network security situation awareness analysis method based on log and SNMP information fusion
CN107861713A (en) Data calling method, device and computer-readable recording medium
CN102929613B (en) The tuning apparatus and method of operating system
CN109886290A (en) Detection method, device, computer equipment and the storage medium of user's request
Lv et al. A deep convolution generative adversarial networks based fuzzing framework for industry control protocols
CN107370806A (en) HTTP conditional codes monitoring method, device, storage medium and electronic equipment
CN107958456A (en) Dispensing detection method, device and electronic equipment
CN110392046A (en) The method for detecting abnormality and device of network access
CN110535850B (en) Processing method and device for account login, storage medium and electronic device
CN109117352B (en) Server performance prediction method and device
CN107203470B (en) Page adjustment method and device
CN110445753A (en) The partition method and device of terminal device abnormal access
CN112948224A (en) Data processing method, device, terminal and storage medium
CN105637488A (en) Tracing source code for end user monitoring
CN108809926A (en) Inbreak detection rule optimization method, device, electronic equipment and storage medium
CN106646110A (en) Low-voltage distribution network fault positioning system based on GIS and Petri technologies
EP4102772A1 (en) Method and apparatus of processing security information, device and storage medium
CN113849363A (en) Service monitoring method and related device
CN106027284A (en) Network fault diagnosis method and device
CN109688099A (en) Server end hits library recognition methods, device, equipment and readable storage medium storing program for executing
CN103024767A (en) Mobile communication service end-to-end performance evaluation method and system
CN110417744A (en) The safe determination method and device of network access

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20191112