CN103198091B - Method and device for processing online data requests based on user behavior - Google Patents

Method and device for processing online data requests based on user behavior

Info

Publication number
CN103198091B
Authority
CN
China
Prior art keywords
online data
url
client
mark
request
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210516508.8A
Other languages
Chinese (zh)
Other versions
CN103198091A (en
Inventor
罗晓华
邵峰
梁文锋
邱晟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Netease Hangzhou Network Co Ltd
Original Assignee
Netease Hangzhou Network Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Netease Hangzhou Network Co Ltd filed Critical Netease Hangzhou Network Co Ltd
Priority to CN201210516508.8A priority Critical patent/CN103198091B/en
Publication of CN103198091A publication Critical patent/CN103198091A/en
Application granted granted Critical
Publication of CN103198091B publication Critical patent/CN103198091B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Abstract

Embodiments of the present invention provide a method for processing online data requests based on user behavior. The method includes: obtaining, in real time, the URL information of the web pages browsed by each client; judging whether the URL information and a URL rule satisfy a matching condition and, if so, adding the identifier of the client and the online data processing action in the URL rule to an online data processing request list; and judging whether the identifier of the current client matches the identifier of any client in the online data processing request list and, if so, triggering execution of the online data processing action corresponding to the identifier of the current client. By processing online data on the basis of user behavior, embodiments of the present invention save system resources and thereby significantly improve the server's system performance when responding to online data processing requests. Embodiments of the present invention additionally provide a device, for example a server, for processing online data requests based on user behavior.

Description

Method and device for processing online data requests based on user behavior
Technical field
Embodiments of the present invention relate to the Internet field, and more specifically to a method and device for processing online data requests based on user behavior.
Background
This section is intended to provide background or context for the embodiments of the present invention recited in the claims. The description here may include concepts that could be pursued, but not necessarily concepts that have previously been conceived or pursued. Therefore, unless otherwise indicated herein, what is described in this section is not prior art to the description and claims of this application, and is not admitted to be prior art merely by its inclusion in this section.
As the Internet continues to develop, more and more users process data online over the Internet, for example by sending e-mail or by communicating in real time with instant-messaging software; online data can also be verified and processed in real time by a server. At present, when a user exchanges data with other users through a server over the Internet, the server generally has to verify whether the initiating user's data request is reasonable or legitimate, and processes the real-time online data interaction between the initiating user and the receiving user only after verification succeeds.
In the prior art, servers generally verify the identity of the initiating user by means of feature analysis and block the user when the identity is found to be illegitimate. This approach typically applies data mining techniques such as real-time clustering, classification and feature recognition to check information in the initiating user's request, such as the IP address, user name, mobile phone number and COOKIE (data, usually encrypted, that a website stores on the user's local terminal in order to identify the user and track the session). Once an online data request is found to match a preset security feature, the identity of the initiating user is judged to be illegitimate and the request is given special handling (for example, the response time of the request is extended, or the request is refused outright).
Summary of the invention
However, the inventors found in the course of their research that feature analysis in the prior art relies on data mining techniques such as clustering and classification. Because clustering, classification and similar algorithms consume relatively large amounts of CPU, disk and memory, the server consumes considerable system resources when responding to online data processing requests, which degrades its system performance. Further, when the server receives a massive number of online data requests that must be answered promptly, its system performance may become unstable, so the real-time requirement of online data processing requests cannot be met. Still further, data mining techniques are themselves prone to a certain rate of false positives and false negatives, so the correctness of online data request processing cannot be guaranteed.
There is therefore a strong need for an improved method and device (for example, a server) for processing online data requests based on user behavior, in order to solve the technical problem that, in the prior art, a server consumes considerable system resources when responding to online data processing requests and, further, to meet the real-time requirement of online data processing requests and ensure the correctness of online data request processing.
In this context, embodiments of the present invention aim to provide a method and device for processing online data requests based on user behavior.
In a first aspect of embodiments of the present invention, a method for processing online data requests based on user behavior is provided. The method may include, for example, an offline processing flow and an online processing flow. The offline processing flow includes: obtaining, in real time, the uniform resource locator (URL) information of the web pages browsed by each client; and judging whether the URL information and a preset URL rule satisfy a preset matching condition, the URL rule including a URL identifier, an online data processing action and the correspondence between the two, and, if so, adding the identifier of the matched client and the online data processing action in the matched URL rule to a preset online data processing request list. The online processing flow includes: in response to a current online data processing request submitted by the current user through the current client, judging whether the identifier of the current client matches the identifier of any client in the online data processing request list and, if so, triggering execution of the online data processing action corresponding to the identifier of the current client.
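The two flows can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `UrlRule`, `offline_flow` and `online_flow`, the use of a dictionary as the request list, the restriction to prefix matching and the "first matching rule wins" policy are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UrlRule:
    url_id: str   # URL identifier of the rule (treated here as a path prefix)
    action: str   # online data processing action, e.g. "stop"


def offline_flow(visits, rules, request_list):
    """Offline flow: scan (client_id, url) visits and fill the request list."""
    for client_id, url in visits:
        for rule in rules:
            if url.startswith(rule.url_id):   # prefix matching only, for brevity
                request_list[client_id] = rule.action
                break                          # first matching rule wins (assumption)


def online_flow(client_id, request_list):
    """Online flow: return the action recorded for this client, or None."""
    return request_list.get(client_id)
```

In this sketch the online flow is a single dictionary lookup, which reflects the idea that the expensive work is done ahead of time by the offline flow.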
In one embodiment of the present invention, the preset matching condition includes any one or any combination of the following: a prefix matching condition, an equality matching condition and a regular expression matching condition.
In another embodiment of the present invention, the preset URL rule is stored in a preset URL rule table, and judging whether the URL information and the preset URL rule satisfy the preset matching condition may include, for example: obtaining the URL identifier in a URL rule stored in the preset URL rule table; and judging whether the URL information and the URL identifier satisfy the matching condition.
In a further embodiment of the present invention, the method may also include, for example: updating the URL rules in the preset URL rule table, where updating includes adding, modifying and/or deleting.
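A rule table that supports the three update operations could be sketched as follows. The class name `UrlRuleTable`, its method names and the choice of an in-memory dictionary keyed by URL identifier are hypothetical; the patent does not say how the table is stored.

```python
class UrlRuleTable:
    """In-memory rule table mapping a URL identifier to a processing action."""

    def __init__(self):
        self._rules = {}

    def add(self, url_id, action):
        if url_id in self._rules:
            raise KeyError(f"rule already exists: {url_id}")
        self._rules[url_id] = action

    def modify(self, url_id, action):
        if url_id not in self._rules:
            raise KeyError(f"no such rule: {url_id}")
        self._rules[url_id] = action

    def delete(self, url_id):
        del self._rules[url_id]   # raises KeyError if the rule is absent

    def snapshot(self):
        """Return a copy so a matcher can iterate while the table is updated."""
        return dict(self._rules)
```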
In yet another embodiment of the present invention, judging whether the identifier of the current client matches the identifier of any client in the online data processing request list includes: obtaining the identifiers of all clients in the online data processing request list as a set of identifiers to be matched; extracting the identifier of the current client from the online data processing request of the current client; and judging whether the identifier of the current client matches any identifier in the set of identifiers to be matched.
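The three sub-steps above can be sketched as a set membership test. The function name `match_current_client`, the dictionary-shaped request and its `"client_id"` key are assumptions made for illustration only.

```python
def match_current_client(request, request_list):
    """Return the action for the requesting client, or None if it is not listed."""
    # Step 1: identifiers of all clients in the list form the set to be matched.
    pending_ids = set(request_list)
    # Step 2: extract the current client's identifier from its request.
    client_id = request["client_id"]
    # Step 3: membership test; return the corresponding action on a match.
    if client_id in pending_ids:
        return request_list[client_id]
    return None
```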
In yet another embodiment of the present invention, the identifier of the client is the client's IP address and/or COOKIE.
In a second aspect of embodiments of the present invention, a device for processing online data requests based on user behavior is provided. The device may include, for example, a first apparatus and a second apparatus. The first apparatus includes: an acquisition module configured to obtain, in real time, the uniform resource locator (URL) information of the web pages browsed by each client; a first judging module configured to judge whether the URL information and a preset URL rule satisfy a preset matching condition, the URL rule including a URL identifier, an online data processing action and the correspondence between the two; and an adding module configured to add, when the result of the first judging module is yes, the identifier of the matched client and the online data processing action in the matched URL rule to a preset online data processing request list. The second apparatus includes: a second judging module configured to judge, in response to a current online data processing request submitted by the current user through the current client, whether the identifier of the current client matches the identifier of any client in the online data processing request list; and a triggering module configured to trigger, when the result of the second judging module is yes, execution of the online data processing action corresponding to the identifier of the current client.
In another embodiment of the present invention, the preset URL rule is stored in a preset URL rule table, and the first judging module includes: a first obtaining submodule configured to obtain the URL identifier in a URL rule stored in the preset URL rule table; and a first judging submodule configured to judge whether the URL information and the URL identifier satisfy the matching condition.
In another embodiment of the present invention, the device may also include, for example: an updating module configured to update the URL rules in the preset URL rule table, where updating includes adding, modifying and/or deleting.
In another embodiment of the present invention, the second judging module includes: a second obtaining submodule configured to obtain the identifiers of all clients in the online data processing request list as a set of identifiers to be matched; an extracting submodule configured to extract the identifier of the current client from the online data processing request of the current client; and a second judging submodule configured to judge whether the identifier of the current client matches any identifier in the set of identifiers to be matched.
In embodiments of the present invention, two mutually independent flows, an online processing flow and an offline processing flow, are used to process the online data requests submitted by users. In the offline processing flow, the URL information of the web pages browsed by each client is obtained in real time and, when a client's URL information and the URL identifier in a preset URL rule satisfy the preset matching condition, the client's identifier and the online data processing action in the matched URL rule are added to the preset online data processing request list. If an online user then submits an online data processing request through a client, the online data processing action corresponding to that user can be matched directly from the online data processing request list and executed in response to the user's request.
First, in the offline processing flow, embodiments of the present invention analyze user behavior, namely the URL information of the web pages a user browses, to produce the online data processing request list that is used to process users' online data requests in real time. Compared with the prior art, analyzing user behavior consumes far fewer system resources than data mining techniques (such as clustering and classification), so embodiments of the present invention save system resources and improve the server's system performance when responding to online data processing requests. Second, because the online data processing request list is built from user behavior, its correctness is better assured than with the prior-art approach of feature analysis by data mining. Third, because the online data processing request list has already been saved by the offline processing flow, when the online processing flow needs to respond to a user's online data processing request it can directly match the corresponding online data processing action and execute it, which improves the efficiency with which the server responds to users' online data processing requests and also meets the real-time requirement of such requests.
Brief description of the drawings
The above and other objects, features and advantages of exemplary embodiments of the present invention will become readily understood by reading the following detailed description with reference to the accompanying drawings. In the drawings, several embodiments of the present invention are shown by way of example and not by way of limitation, in which:
Fig. 1 schematically shows a block diagram of an exemplary computing system 100 suitable for implementing embodiments of the present invention;
Fig. 2 schematically shows a framework diagram of an exemplary application scenario of embodiments of the present invention;
Fig. 3 schematically shows a flow chart of a method according to an embodiment of the present invention;
Fig. 4 schematically shows a flow chart of step 302 in an embodiment of the present invention;
Fig. 5 schematically shows a flow chart of step 304 in an embodiment of the present invention;
Fig. 6 schematically shows a structural diagram of a device (for example, a server) according to an embodiment of the present invention;
Fig. 7 schematically shows a structural diagram of the first judging module 612 in the device of an embodiment of the present invention;
Fig. 8 schematically shows a structural diagram of the second judging module 621 in the device of an embodiment of the present invention.
In the drawings, identical or corresponding reference numerals denote identical or corresponding parts.
Detailed description
The principle and spirit of the present invention are described below with reference to several exemplary embodiments. It should be understood that these embodiments are given only to enable those skilled in the art to better understand and implement the present invention, and do not limit the scope of the present invention in any way. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
Fig. 1 shows a block diagram of an exemplary computing system 100 suitable for implementing embodiments of the present invention. As shown in Fig. 1, the computing system 100 may include: a central processing unit (CPU) 101, a random access memory (RAM) 102, a read-only memory (ROM) 103, a system bus 104, a hard disk controller 105, a keyboard controller 106, a serial interface controller 107, a parallel interface controller 108, a display controller 109, a hard disk 110, a keyboard 111, a serial peripheral device 112, a parallel peripheral device 113 and a display 114. Of these devices, the CPU 101, RAM 102, ROM 103, hard disk controller 105, keyboard controller 106, serial interface controller 107, parallel interface controller 108 and display controller 109 are coupled to the system bus 104. The hard disk 110 is coupled to the hard disk controller 105, the keyboard 111 to the keyboard controller 106, the serial peripheral device 112 to the serial interface controller 107, the parallel peripheral device 113 to the parallel interface controller 108, and the display 114 to the display controller 109. It should be understood that the block diagram of Fig. 1 is given only as an example and does not limit the scope of the present invention. In some cases, devices may be added or removed as circumstances require.
Those skilled in the art will appreciate that embodiments of the present invention may be implemented as a system, a method or a computer program product. Accordingly, the present disclosure may take the form of an entirely hardware implementation, an entirely software implementation (including firmware, resident software, microcode and the like), or a combination of hardware and software, generally referred to herein as a "circuit", "module" or "system". In addition, in some embodiments, the present invention may also take the form of a computer program product embodied in one or more computer-readable media containing computer-readable program code.
Any combination of one or more computer-readable media may be used. A computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. A computer-readable storage medium may be, for example but without limitation, an electrical, magnetic, optical, electromagnetic, infrared or semiconductor system, apparatus or device, or any combination of the above. More specific examples (a non-exhaustive list) of computer-readable storage media include: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In this document, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by, or in connection with, an instruction execution system, apparatus or device.
A computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate or transport a program for use by, or in connection with, an instruction execution system, apparatus or device.
The program code contained on a computer-readable medium may be transmitted over any suitable medium, including but not limited to wireless, wireline, optical cable, RF and the like, or any suitable combination of the above.
Computer program code for carrying out the operations of the present invention may be written in one or more programming languages or a combination thereof, including object-oriented programming languages such as Java, Smalltalk and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
Embodiments of the present invention are described below with reference to flow charts of methods and block diagrams of devices (or systems) according to embodiments of the present invention. It should be understood that each block of the flow charts and/or block diagrams, and combinations of blocks in the flow charts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, a special-purpose computer or another programmable data processing apparatus to produce a machine, such that the instructions, when executed by the computer or other programmable data processing apparatus, create means for implementing the functions/operations specified in the blocks of the flow charts and/or block diagrams.
These computer program instructions may also be stored in a computer-readable medium that can direct a computer or other programmable data processing apparatus to operate in a particular manner, such that the instructions stored in the computer-readable medium produce an article of manufacture that includes instruction means implementing the functions/operations specified in the blocks of the flow charts and/or block diagrams.
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus or other device, causing a series of operational steps to be performed on the computer, other programmable data processing apparatus or other device to produce a computer-implemented process, such that the instructions executed on the computer or other programmable apparatus provide a process for implementing the functions/operations specified in the blocks of the flow charts and/or block diagrams.
According to embodiments of the present invention, a method and device for processing online data requests based on user behavior are proposed.
It should be understood that, in this document, any number of elements in the drawings is given by way of example and not limitation, and any naming is used only for distinction and carries no limiting meaning.
The principle and spirit of the present invention are explained in detail below with reference to several representative embodiments of the present invention.
Overview of the invention
The inventors found that, because feature analysis in the prior art relies on data mining techniques that consume relatively large amounts of CPU, disk and memory, the server consumes considerable system resources when responding to online data processing requests. This degrades the server's system performance, so that the server fails to meet the real-time requirement when handling online data processing requests and cannot guarantee that such requests are processed correctly. If feature analysis can be avoided and online data processing requests are instead tied to user behavior, for example the web pages a user browses, the consumption of system resources can be reduced and the real-time performance of the server in handling online data processing requests improved. In addition, because the result of handling an online data processing request is tied to user behavior, it can also be more accurate than with feature analysis.
Having described the basic principle of the present invention, various non-limiting embodiments of the present invention are introduced in detail below.
Overview of the application scenario
Reference is first made to Fig. 2, which is a schematic framework diagram of an exemplary application scenario of embodiments of the present invention. In it, a user interacts with a server 202 through a client 201. Those skilled in the art will understand that the framework shown in Fig. 2 is only one example in which embodiments of the present invention can be implemented. The scope of application of embodiments of the present invention is not limited by any aspect of this framework.
It should be noted that the client 201 here may be any client, whether existing, under development or developed in the future, that can interact with the server 202 over any form of wired or wireless connection (for example, Wi-Fi, LAN, WAN, the Internet), including but not limited to desktop computers, laptop computers and mobile terminals (including smartphones, non-smart mobile phones and various tablet computers), whether existing, under development or developed in the future.
It should also be noted that the server 202 here is only one example of a device, whether existing, under development or developed in the future, that can provide online data processing services to users. Embodiments of the present invention are not limited in this respect.
The server 202 can be divided into a first apparatus serving as a behavior analysis system and a second apparatus serving as an online data processing system. The behavior analysis system obtains, in real time, the uniform resource locator (URL) information of the web pages browsed by each client and, when the URL information and a preset URL rule satisfy the preset matching condition, adds the identifier of the matched client and the online data processing action in the matched URL rule to the preset online data processing request list for the online data processing system to query. The online data processing system, in response to a current online data processing request submitted by the current user through the current client, triggers execution of the online data processing action corresponding to the identifier of the current client when that identifier matches the identifier of any client in the online data processing request list.
Exemplary method
With reference to the application scenario of Fig. 2, a method for processing online data requests based on user behavior according to an exemplary embodiment of the present invention is described below with reference to Fig. 3. It should be noted that the above application scenario is shown only to facilitate understanding of the spirit and principle of the present invention, and embodiments of the present invention are not limited in this respect. On the contrary, embodiments of the present invention can be applied to any applicable scenario.
Fig. 3 shows a flow chart of one embodiment of the method for processing online data requests based on user behavior disclosed by the present invention. Steps 301 to 303 form the offline processing flow, a non-real-time flow run by a background program; that is, the result of the offline processing flow (the online data processing request list) is fed back to the online processing flow in non-real time. Steps 304 and 305 form the online processing flow, a real-time flow run by a foreground program; that is, the online processing flow begins executing steps 304 and 305 as soon as an online data processing request is received. The offline and online processing flows can be independent of each other. The offline processing flow, which obtains in real time the URL information of the web pages browsed by each client, can be performed by the user behavior analysis system serving as the first apparatus. The online processing flow performs steps 304 and 305 only when a client triggers an online data processing request, and can be implemented by the online data processing system serving as the second apparatus. This embodiment may include, for example:
Step 301: obtain, in real time, the uniform resource locator (URL) information of the web pages browsed by each client.
In this embodiment, the behavior analysis system can obtain the URL information of each client in real time by parsing the user behavior log in real time. The user behavior log can be obtained in real time from the online server, that is, the URL information of the web pages browsed by each client is fetched from the online server continuously and without interruption. Only one kind of user behavior information is extracted here, namely the URL information of the web pages browsed by each client, that is, the URL addresses of the pages the user browses.
In a practical application scenario, a user may browse multiple web pages through a client, so the URL information belonging to the same IP address and/or COOKIE can be stored together, according to the client's IP address and/or COOKIE, to form the user's jump URL chain for that client. It can be seen that jump URL chains need to be maintained along two dimensions in this embodiment: the URL chain of the same COOKIE and/or the URL chain of the same IP address.
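Maintaining jump URL chains along the two dimensions could look roughly like the following sketch. The class name `UrlChains` and its attributes are hypothetical, and plain in-memory lists stand in for whatever storage a real behavior analysis system would use.

```python
from collections import defaultdict


class UrlChains:
    """Keeps the ordered URLs visited per IP address and per cookie."""

    def __init__(self):
        self.by_ip = defaultdict(list)
        self.by_cookie = defaultdict(list)

    def record(self, url, ip=None, cookie=None):
        # A visit is appended to every chain for which an identifier is known.
        if ip:
            self.by_ip[ip].append(url)
        if cookie:
            self.by_cookie[cookie].append(url)
```

Keeping both chains means a client that changes cookies but keeps its IP address (or the reverse) can still be followed along one of the two dimensions.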
Step 302: judge whether the URL information and a preset URL rule satisfy a preset matching condition; if so, proceed to step 303.
In this embodiment, a judgment period can be set, for example half an hour, and step 302 is performed on all URL information obtained through step 301 within that half hour. The preset URL rules can be set independently or updated dynamically by those skilled in the art, and are used to discover and restrict access to specific pages. The content of a URL rule may include, for example: a URL identifier, an online data processing action and the correspondence between the two. A URL rule can serve as a rule for judging whether an online data request sent by the current client is legitimate. There can be multiple preset URL rules, which can be stored together as a jump rule table. The preset matching condition includes any one or any combination of the following: a prefix matching condition, an equality matching condition and a regular expression matching condition.
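Selecting the URL information that falls within the judgment period can be sketched as below. The function name `visits_in_window`, the `(timestamp, client_id, url)` shape of a log entry and the inclusive window bounds are assumptions; only the half-hour default comes from the text.

```python
from datetime import datetime, timedelta


def visits_in_window(log_entries, now, window=timedelta(minutes=30)):
    """Return (client_id, url) pairs whose timestamp lies within the window."""
    start = now - window
    return [
        (client_id, url)
        for timestamp, client_id, url in log_entries
        if start <= timestamp <= now
    ]
```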
The preset URL rules are described in detail below by way of example. In this embodiment, the format of a URL rule can be as follows:
addrule <[E/P/R]url1[,url2,...,urln]> <action>
Here, "addrule" denotes a URL rule, and [E/P/R] denotes the way in which the URL information is matched against the n URLs in the rule, where "E" denotes equality matching, "P" denotes prefix matching and "R" denotes regular expression matching; "url1[,url2,...,urln]" denotes the URL addresses in the preset URL rule, with "urln" being the n-th URL address; and "action" denotes the online data processing action to be performed when the URL information matches a URL address in the preset URL rule.
As the format definition shows, a URL rule consists mainly of two parts. The first part is the matching condition, written "[E/P/R]url1[,url2,...,urln]", and the second part is the online data processing action, written "action". The matching condition is a sequence of strings, each of which is the content of one matching condition. Each string is divided into two parts: the first part is the string prefix, which indicates the match type and takes its value from the set {"[E]", "[P]", "[R]"}; the second part is the string suffix, which gives the URL address of the matching condition. The strings of the multiple matching conditions in the sequence are separated by commas.
Two concrete URL rules are described in detail below. For URL rule example one, assume that the following URL rule exists in practice:
addrule [P]/business, [E]/user/subscribe/business.do "stop"
Here, "/business" is a prefix match: if the URL information obtained in step 301 is a URL address beginning with "/business", of the form "/businessXXX", then the preset matching condition is satisfied regardless of what follows "/business". "/user/subscribe/business.do" is an equal-value match: the URL information obtained in step 301 must be equal to "/user/subscribe/business.do" for the preset matching condition to be satisfied. The rule also specifies that the online data processing action when the matching condition is satisfied is "stop", meaning that the request is skipped.
For URL rule example two, assume that the following URL rule exists in practice:
addrule [E]/_vti_bin/owssvr.dll, [R]/getcoupon.do?Id=[0-9]{1,5} "delay 7"
This example specifies that, for a given user, if the URL information of a browsed web page is equal to "/_vti_bin/owssvr.dll", or the "Id" value in the URL information "/getcoupon.do?Id" of a browsed web page is a number of 1 to 5 digits, then the browsed URL information satisfies the matching condition of this preset URL rule. The rule also specifies that the online data processing action when the matching condition is satisfied is "delay 7", meaning that the response to the online data processing request is delayed by 7 seconds.
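The rule format illustrated by the two examples might be parsed as sketched below. This is an illustrative assumption rather than the parser of the embodiment: the function name `parse_rule` is hypothetical, the action is assumed to be enclosed in straight double quotes, and the condition list is split only at commas that introduce a new "[E]", "[P]" or "[R]" prefix, so that a comma inside a regular expression such as `{1,5}` is left intact.

```python
import re


def parse_rule(line):
    """Parse 'addrule [T]url[, [T]url ...] "action"' into (conditions, action)."""
    m = re.fullmatch(r'\s*addrule\s+(.*?)\s*"([^"]*)"\s*', line)
    if m is None:
        raise ValueError("not a URL rule: %r" % line)
    body, action = m.group(1), m.group(2)
    conditions = []
    # Split only where a comma is followed by a new match-type prefix.
    for part in re.split(r",\s*(?=\[[EPR]\])", body):
        cm = re.fullmatch(r"\[([EPR])\](.+)", part.strip())
        if cm is None:
            raise ValueError("bad matching condition: %r" % part)
        conditions.append((cm.group(1), cm.group(2)))
    return conditions, action


rule_one = parse_rule('addrule [P]/business, [E]/user/subscribe/business.do "stop"')
rule_two = parse_rule(
    'addrule [E]/_vti_bin/owssvr.dll, [R]/getcoupon.do?Id=[0-9]{1,5} "delay 7"'
)
```

Each parsed rule is a list of (match type, URL) pairs together with the action string, which keeps the match type of every condition available for the later matching step.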
The preset URL rules in this step can be stored in advance in a preset URL rule list. In that case, the process of judging whether the URL information and a preset URL rule satisfy the preset matching condition can, with reference to Fig. 4, include:
Step 401: obtain the URL identifier in a URL rule saved in the preset URL rule list.
The URL identifier in a URL rule, such as the aforementioned "/business" or "/_vti_bin/owssvr.dll", is first obtained from the preset URL rule list.
Step 402: judge whether the URL information and the URL identifier satisfy the matching condition.
That is, judge whether the URL information of the web page browsed by the user, obtained in real time, and the URL identifier in the URL rule satisfy the matching condition defined by the match type.
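The judgment of step 402 for the three match types might look like the following sketch. The function name `matches` is hypothetical, and the use of a whole-string regular expression match for type "R" is an assumption. Note also that in Python's `re` module an unescaped "?" acts as a quantifier, so the regular expression of URL rule example two is written here with "." and "?" escaped; written exactly as in the rule text, it would not match the intended URL under this engine.

```python
import re


def matches(match_type, pattern, url):
    """Return True if url satisfies the matching condition (match_type, pattern)."""
    if match_type == "E":   # equal-value matching
        return url == pattern
    if match_type == "P":   # prefix matching
        return url.startswith(pattern)
    if match_type == "R":   # regular expression matching (whole URL, by assumption)
        return re.fullmatch(pattern, url) is not None
    raise ValueError("unknown match type: %r" % match_type)


# Escaped form of the regular expression from URL rule example two.
COUPON_PATTERN = r"/getcoupon\.do\?Id=[0-9]{1,5}"

hit_prefix = matches("P", "/business", "/business/buy.do")
hit_regex = matches("R", COUPON_PATTERN, "/getcoupon.do?Id=12345")
miss_regex = matches("R", COUPON_PATTERN, "/getcoupon.do?Id=123456")
```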
In this embodiment, if the URL rules are all saved in the preset URL rule list, the process can also include:
Step 403: update the URL rules in the preset URL rule list, the update including: adding, modifying and/or deleting.
The URL rules in the preset URL rule list can be updated, for example by adding a new URL rule to the list, modifying a URL rule in the list, or deleting a URL rule from the list. Updating the URL rules in this way allows them to meet the needs of different application scenarios.
It will be understood that, although step 403 is shown after step 402 in Fig. 4, those skilled in the art will appreciate that there is in fact no fixed order between step 403 and step 402.
Returning to Fig. 3, when the result of step 302 is yes, proceed to step 303: add the identifier of the matched client and the processing action for online data in the matched URL rule to a preset online data processing request list.
According to the URL information obtained in step 301 and the preset URL rules, the behavior analysis system judges whether the browsed URL information and a preset URL rule satisfy the preset matching condition. Once they do, it constructs an online data processing request list item from the COOKIE and/or IP of the user and the processing action for online data in the matched URL rule, and adds the item to the online data processing request list.
For example, for URL rule example one in step 302, assume a user A whose IP is "112.111.256.222" and whose COOKIE is "abc123", and for whom the URL information of three browsed web pages is obtained in real time: web page one is /business/buy.do, web page two is /help.do, and web page three is /user/subscribe/business.do. Web page one hits [P]/business (satisfying the prefix matching condition) and web page three hits [E]/user/subscribe/business.do (satisfying the equal-value matching condition). Two online data processing request list items are therefore constructed, "IP 112.111.256.222 stop" and "COOKIE abc123 stop", and both are added to the preset online data processing request list.
For URL rule example two in step 302, assume a user B whose IP is "165.124.128.111" and whose COOKIE is "1c2d3e", and who has browsed two pages: page one is /_vti_bin/owssvr.dll and page two is /getcoupon.do?Id=12345. The URL addresses of these two pages also satisfy the matching condition of this URL rule, so two online data processing request list items are constructed, "IP 165.124.128.111 delay 7" and "COOKIE 1c2d3e delay 7", and both are added to the preset online data processing request list.
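The construction of request list items in step 303 might be sketched as follows, using the values of users A and B from the examples. Representing each item as a (kind, value, action) tuple and the name `build_request_items` are assumptions made for illustration.

```python
def build_request_items(ip, cookie, action):
    """Build one request list item per available client identifier."""
    items = []
    if ip:
        items.append(("IP", ip, action))
    if cookie:
        items.append(("COOKIE", cookie, action))
    return items


# The preset online data processing request list, initially empty.
request_list = []
request_list.extend(build_request_items("112.111.256.222", "abc123", "stop"))
request_list.extend(build_request_items("165.124.128.111", "1c2d3e", "delay 7"))
```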
Further, to implement this step, the behavior analysis system can map each URL rule to a rule state chain, in which each matching condition of the URL rule corresponds to a unique state. Within the behavior analysis system, the states in the rule state chains can be allocated so as to be globally unique, and the whole behavior analysis system retains one globally unique initial state. Thus, for URL rule example one, the behavior analysis system can allocate to user A an information table (UserTableA) belonging to that user:
UserID={IP:112.111.256.222, COOKIE:abc123}
UrlLink={}
StateLink={s0}
In the initial stage, user A has not accessed any page, so the URL chain (UrlLink) is empty and the state chain (StateLink) is initialized to s0, i.e. state 0. User A then accesses page one, "/business/buy.do". Because page one satisfies a matching condition of URL rule example one, state s1 of URL rule example one is hit, given the current page and state s0 in the state chain. State s1 is therefore added to the StateLink of the user, the URL information of page one is added to UrlLink, and the user information table is updated to:
UserID={IP:112.111.256.222, COOKIE:abc123}
UrlLink={/business/buy.do}
StateLink={s0, s1}
The user then accesses page two, "/help.do". Given the existing states s0 and s1 and the URL information of page two, no state in any URL rule is matched, so the state chain in the user information table is unchanged and only UrlLink needs to be modified. The user information table is updated to:
UserID={IP:112.111.256.222, COOKIE:abc123}
UrlLink={/business/buy.do, /help.do}
StateLink={s0, s1}
The user then accesses page three, "/user/subscribe/business.do". Given states s0 and s1 and the URL information of page three, state s2 of URL rule example one is hit. State s2 is therefore added to StateLink in place of state s1, and page three is added to UrlLink. The user information table is now updated to:
UserID={IP:112.111.256.222, COOKIE:abc123}
UrlLink={/business/buy.do, /help.do, /user/subscribe/business.do}
StateLink={s0, s2}
Because s2 is the end state of URL rule example one, the behavior of this user, i.e. the URL information the user has browsed, now satisfies the preset matching condition of URL rule example one. Based on the online data processing action of URL rule example one and the client identifier of this user, the system builds the online data processing request list items "IP:112.111.256.222 stop" and "COOKIE:abc123 stop" and adds them to the online data processing request list.
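The walk through the rule state chain for user A can be sketched as follows. This is a simplified illustration under stated assumptions: only a single rule is tracked, its states are named s1..sn locally rather than allocated globally, the conditions are assumed to be hit in order, and the page-one URL is spelled "/business/buy.do" so that it satisfies the "/business" prefix. The names `visit`, `new_user_table` and `RULE_ONE` are hypothetical.

```python
def matches(match_type, pattern, url):
    """Equal-value ("E") or prefix ("P") matching; regex rules are omitted here."""
    if match_type == "E":
        return url == pattern
    if match_type == "P":
        return url.startswith(pattern)
    raise ValueError("unsupported match type: %r" % match_type)


RULE_ONE = {
    "conditions": [("P", "/business"), ("E", "/user/subscribe/business.do")],
    "action": "stop",
}


def new_user_table(ip, cookie):
    """Create the per-user information table with the initial state s0."""
    return {"UserID": {"IP": ip, "COOKIE": cookie}, "UrlLink": [], "StateLink": ["s0"]}


def visit(table, url, rule):
    """Record a page view; return request list items once the end state is hit."""
    table["UrlLink"].append(url)
    current = int(table["StateLink"][-1][1:])       # "s1" -> 1
    conditions = rule["conditions"]
    if current >= len(conditions):
        return []                                    # rule already completed
    match_type, pattern = conditions[current]
    if not matches(match_type, pattern, url):
        return []                                    # state chain unchanged
    current += 1
    table["StateLink"] = ["s0", "s%d" % current]     # new state replaces the old one
    if current < len(conditions):
        return []                                    # not yet at the end state
    user = table["UserID"]
    return [("IP", user["IP"], rule["action"]),
            ("COOKIE", user["COOKIE"], rule["action"])]


table_a = new_user_table("112.111.256.222", "abc123")
results = [visit(table_a, url, RULE_ONE) for url in
           ("/business/buy.do", "/help.do", "/user/subscribe/business.do")]
```

Only the third page view returns request list items, because only then does the state chain reach the end state of the rule.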
In this embodiment, the offline processing flow described above can obtain in real time, in the background, the URL information of the web pages browsed by users, and is triggered to produce the online data processing request list. A feedback time, for example 2 hours, can also be set; when the feedback time arrives, the online data processing request list is fed back to the online processing flow, so that the online processing flow can obtain the list generated by the offline processing flow when handling online data requests. When a user submits an online data processing request through a client, the online processing flow beginning at step 304 is performed, which can specifically include:
Step 304: in response to a current online data processing request submitted by a current user through a current client, judge whether the identifier of the current client matches the identifier of any client in the online data processing request list; if so, proceed to step 305.
After obtaining the online data processing request list, the behavior analysis system can feed the list back to the online data processing system according to the preset feedback time. From the point of view of the online data processing system, the list fed back by the behavior analysis system represents the result, obtained by analyzing the historical behavior of users, of how their online data processing requests are to be handled. Before handling each online data processing request, the online data processing system first parses the client identifier of the user who sent the request to obtain the IP and COOKIE information, and then judges from the IP and COOKIE whether it matches the identifier of any client in the online data processing request list.
The identifier of the client can be the IP address and/or COOKIE of the client.
Fig. 5 is a flow chart of judging whether the identifier of the current client matches the identifier of any client in the online data processing request list, which can specifically include:
Step 501: obtain the identifiers of all clients in the online data processing request list as a set of identifiers to be matched.
In a specific implementation, the online data processing system first obtains from the online data processing request list the identifiers of all saved clients as the set of identifiers to be matched, i.e. the IP and/or COOKIE information of each client.
Step 502: extract the identifier of the current client from the online data processing request of the current client.
The identifier of the current client, i.e. its IP and/or COOKIE information, is then extracted from the online data processing request of the current client.
Step 503: judge whether the identifier of the current client matches any identifier to be matched in the identifier set.
That is, judge whether the IP and/or COOKIE information of the current client is identical to IP and/or COOKIE information in the set of identifiers to be matched. If it is identical, the identifiers are considered to match; otherwise they are considered not to match.
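Steps 501 to 503 might be sketched as a single lookup, as below. The tuple layout of the request list and the name `find_action` are assumptions for the example, and checking the IP before the COOKIE is an arbitrary choice that the embodiment does not prescribe.

```python
def find_action(request_list, ip, cookie):
    """Return the action for the current client, or None if nothing matches."""
    # Step 501: collect the identifiers to be matched from the request list.
    to_match = {}
    for kind, value, action in request_list:
        to_match.setdefault((kind, value), action)
    # Steps 502-503: compare the current client's IP and COOKIE with the set.
    for key in (("IP", ip), ("COOKIE", cookie)):
        if key in to_match:
            return to_match[key]
    return None


sample_list = [
    ("IP", "112.111.256.222", "stop"),
    ("COOKIE", "abc123", "stop"),
    ("IP", "165.124.128.111", "delay 7"),
    ("COOKIE", "1c2d3e", "delay 7"),
]
action_for_b = find_action(sample_list, "10.9.8.7", "1c2d3e")
action_for_unknown = find_action(sample_list, "10.9.8.7", "zzz")
```

Storing the identifiers in a dictionary makes each lookup independent of the length of the list, which fits the stated aim of answering online requests quickly.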
Then, returning to Fig. 3, step 305: trigger execution of the processing action for online data corresponding to the identifier of the current client.
In this embodiment, if step 304 determines that the identifier of the current client matches the identifier of a client in the online data processing request list, then, because the list stores client identifiers, processing actions for online data and the correspondence between them, the processing action corresponding to the identifier of the current client can be performed directly. For example, if the IP of the client is "112.111.256.222", it matches the item "IP:112.111.256.222 stop" in the online data processing request list, and the "stop" action is performed, i.e. subsequent processing of this online data processing request is stopped. If the COOKIE of the client is "1c2d3e", it matches the item "COOKIE 1c2d3e delay 7" in the list, and the online data processing system performs the action "delay 7", i.e. it pauses for 7 seconds before returning the result of the request to the client.
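The execution of the "stop" and "delay" actions in step 305 might be sketched as follows. The function `handle_request`, its `process` callback and the injectable `sleep` parameter are hypothetical; returning None for a stopped request is likewise an assumption about how "stop" is surfaced to the caller.

```python
import time


def handle_request(action, process, sleep=time.sleep):
    """Apply the matched action, then run process() unless the request is stopped."""
    if action is None:                 # client not in the request list
        return process()
    name, _, argument = action.partition(" ")
    if name == "stop":                 # stop subsequent processing of the request
        return None
    if name == "delay":                # pause before returning the result
        sleep(float(argument))
        return process()
    raise ValueError("unknown action: %r" % action)


# Usage with a stand-in sleep function, so the example does not actually wait.
recorded_delays = []
stopped = handle_request("stop", lambda: "order accepted", sleep=recorded_delays.append)
delayed = handle_request("delay 7", lambda: "order accepted", sleep=recorded_delays.append)
```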
It can be seen that, in this embodiment, the offline processing flow analyzes user behavior, i.e. the URL information of the web pages browsed by users, to provide an online data processing request list with which the online data requests submitted by users are handled in real time. Compared with the prior art, the system resources consumed by analyzing user behavior are far smaller than those consumed by data mining techniques (such as clustering and classification). Embodiments of the present invention therefore save system resources and improve the system performance of the server when responding to online data processing requests. Secondly, the online data processing request list in embodiments of the present invention is built on user behavior, so its correctness is better assured than that of the feature analysis by data mining techniques used in the prior art. Thirdly, because the offline processing flow has already saved the online data processing request list, the online processing flow, when it needs to respond to an online data processing request from a user, can more quickly match the processing action for that online data and perform it directly, which improves the efficiency with which the server responds to the online data processing requests of users and also meets the real-time requirement of such requests.
Furthermore, with the processing method for online data requests provided by embodiments of the present invention, the server can respond more quickly when a user initiates an online data processing request, which also improves the user experience of online data request processing.
It will be understood that embodiments of the present invention can be applied to detecting automated network scalping (rendered literally as "steal-brush" behavior). With the rapid development of e-commerce, online shopping has become a new consumption habit. Some online speculators, seeing its commercial value, use network technology to register accounts in bulk and snap up special-offer and discounted goods, seriously disrupting the order of online shopping. A widely known recent example is the "network scalper", who searches online around the clock, obtains information on discounted goods earlier than ordinary consumers, places bulk orders at the first opportunity, and then resells the goods at a higher price. This situation is particularly prominent in low-cost or zero-cost online shopping sectors such as group buying and coupons. Such behavior by online speculators is a form of network scalping.
Online shopping service providers employ a wide range of technical means to detect network scalping ("steal-brush" behavior). Most network scalping is automated: it is carried out in bulk by machine programs using some technical means. The online data processing method provided by embodiments of the present invention can be applied to detecting automated network scalping. By setting page jump flow rules (URL rules) in advance, it can be judged from the URL addresses in those rules which users have hit these URL addresses. The IP and/or cookie of the client of each suspicious user who has hit a URL address in a page jump flow rule is saved, together with the processing action to be performed, in the online data processing request list (which in this application scenario can be called a user scalping list). The users in this list are the suspicious users. Once such a user is detected triggering an online data processing request, such as an online transaction request, the corresponding action such as "stop" or "delay" is performed, preventing the current online transaction of that user from proceeding. Using embodiments of the present invention to detect network scalping saves more of the system resources of the server.
Example devices
Having described the method of the exemplary embodiments of the present invention, reference is now made to Fig. 6, which is a schematic structural diagram of one embodiment of a device (for example, a server) disclosed by embodiments of the present invention. The device of the exemplary embodiments can include, for example, a first device 61 and a second device 62.
The first device 61 can specifically include: an acquisition module 611, configured to obtain in real time the uniform resource locator (URL) information of the web pages browsed by each client; a first judgment module 612, configured to judge whether the URL information and a preset URL rule satisfy a preset matching condition, the URL rule including a URL identifier, a processing action for online data, and the correspondence between the two; and an adding module 613, configured to add, where the result of the first judgment module is yes, the identifier of the matched client and the processing action for online data in the matched URL rule to a preset online data processing request list.
Where the preset URL rules are saved in a preset URL rule list, as shown in Fig. 7, the first judgment module 612 can include, for example:
a first acquisition submodule 701, configured to obtain the URL identifier in a URL rule saved in the preset URL rule list;
a first judgment submodule 702, configured to judge whether the URL information and the URL identifier satisfy the matching condition.
The first device 61 can also include:
an update module 703, configured to update the URL rules in the preset URL rule list, the update including: adding, modifying and/or deleting.
As shown in Fig. 6, the second device 62 can specifically include: a second judgment module 621, configured to judge, in response to a current online data processing request submitted by a current user through a current client, whether the identifier of the current client matches the identifier of any client in the online data processing request list; and a trigger module 622, configured to trigger, where the result of the second judgment module is yes, execution of the processing action for online data corresponding to the identifier of the current client.
As shown in Fig. 8, the second judgment module 621 can specifically include:
a second acquisition submodule 801, configured to obtain the identifiers of all clients in the online data processing request list as a set of identifiers to be matched;
an extraction submodule 802, configured to extract the identifier of the current client from the online data processing request of the current client;
a second judgment submodule 803, configured to judge whether the identifier of the current client matches any identifier to be matched in the identifier set.
According to some embodiments of the present invention, the preset matching condition includes any one or any combination of the following: a prefix matching condition, an equal-value matching condition, and a regular expression matching condition.
According to some embodiments of the present invention, the identifier of the client is the IP address and/or COOKIE of the client.
It can be seen that, with the device (for example, a server) provided by embodiments of the present invention, the offline processing flow analyzes user behavior, i.e. the URL information of the web pages browsed by users, to provide an online data processing request list with which the online data requests submitted by users are handled in real time. Compared with the prior art, the system resources consumed by analyzing user behavior are far smaller than those consumed by data mining techniques (such as clustering and classification). Embodiments of the present invention therefore save system resources and improve the system performance of the server when responding to online data processing requests. Secondly, the online data processing request list in embodiments of the present invention is built on user behavior, so its correctness is better assured than that of the feature analysis by data mining techniques used in the prior art. Thirdly, because the offline processing flow has already saved the online data processing request list, the online processing flow, when it needs to respond to an online data processing request from a user, can more quickly match the processing action for that online data and perform it directly, which improves the efficiency with which the server responds to the online data processing requests of users and also meets the real-time requirement of such requests.
It should be noted that, although several devices or sub-devices of the device (for example, a server) are mentioned in the detailed description above, this division is not mandatory. In fact, according to embodiments of the present invention, the features and functions of two or more of the devices described above can be embodied in one device. Conversely, the features and functions of one device described above can be further divided so as to be embodied by multiple devices.
In addition, although the operations of the method of the present invention are depicted in the drawings in a particular order, this does not require or imply that the operations must be performed in that order, or that all of the operations shown must be performed to achieve the desired result. On the contrary, the steps depicted in the flow charts can be performed in a different order. Additionally or alternatively, some steps can be omitted, multiple steps can be combined into one step, and/or one step can be broken down into multiple steps.
It should be noted that, in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. The verbs "include" and "comprise" and their conjugations, as used in the application documents, do not exclude the presence of elements or steps other than those stated in the application documents. The article "a" or "an" preceding an element does not exclude the presence of a plurality of such elements.
Although the spirit and principles of the present invention have been described with reference to several specific embodiments, it should be understood that the present invention is not limited to the specific embodiments disclosed, and that the division into aspects does not mean that features in those aspects cannot be combined to advantage; that division is merely for convenience of presentation. The present invention is intended to cover the various modifications and equivalent arrangements included within the spirit and scope of the appended claims. The scope of the appended claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.

Claims (10)

1. A method for processing online data requests based on user behavior, comprising an offline processing flow and an online processing flow, wherein,
the offline processing flow comprises:
obtaining in real time the uniform resource locator (URL) information of the web pages browsed by each client;
judging whether the URL information and a preset URL rule satisfy a preset matching condition, the URL rule comprising: a URL identifier, a processing action for online data, and the correspondence between the two, and if so, adding the identifier of the matched client and the processing action for online data in the matched URL rule to a preset online data processing request list;
the online processing flow comprises:
in response to a current online data processing request submitted by a current user through a current client, judging whether the identifier of the current client matches the identifier of any client in the online data processing request list, and if so, triggering execution of the processing action for online data corresponding to the identifier of the current client; wherein the online data processing request list stores client identifiers, processing actions for online data, and the correspondence between them.
2. The method according to claim 1, wherein the preset matching condition comprises any one or any combination of the following: a prefix matching condition, an equal-value matching condition, and a regular expression matching condition.
3. The method according to claim 1, wherein the preset URL rule is saved in a preset URL rule list, and the judging whether the URL information and the preset URL rule satisfy the preset matching condition comprises:
obtaining the URL identifier in a URL rule saved in the preset URL rule list;
judging whether the URL information and the URL identifier satisfy the matching condition.
4. The method according to claim 3, further comprising:
updating the URL rules in the preset URL rule list, the updating comprising: adding, modifying and/or deleting.
5. The method according to claim 1, wherein the judging whether the identifier of the current client matches the identifier of any client in the online data processing request list comprises:
obtaining the identifiers of all clients in the online data processing request list as a set of identifiers to be matched;
extracting the identifier of the current client from the online data processing request of the current client;
judging whether the identifier of the current client matches any identifier to be matched in the identifier set.
6. The method according to any one of claims 1 to 5, wherein the identifier of the client is the IP address and/or COOKIE of the client.
7. A device for processing online data requests based on user behavior, comprising: a first device and a second device, wherein:
the first device comprises: an acquisition module, configured to obtain in real time the uniform resource locator (URL) information of the web pages browsed by each client; a first judgment module, configured to judge whether the URL information and a preset URL rule satisfy a preset matching condition, the URL rule comprising: a URL identifier, a processing action for online data, and the correspondence between the two; and an adding module, configured to add, where the result of the first judgment module is yes, the identifier of the matched client and the processing action for online data in the matched URL rule to a preset online data processing request list;
the second device comprises: a second judgment module, configured to judge, in response to a current online data processing request submitted by a current user through a current client, whether the identifier of the current client matches the identifier of any client in the online data processing request list; and a trigger module, configured to trigger, where the result of the second judgment module is yes, execution of the processing action for online data corresponding to the identifier of the current client; wherein the online data processing request list stores client identifiers, processing actions for online data, and the correspondence between them.
8. The device according to claim 7, wherein the preset URL rule is saved in a preset URL rule list, and the first judgment module comprises:
a first acquisition submodule, configured to obtain the URL identifier in a URL rule saved in the preset URL rule list;
a first judgment submodule, configured to judge whether the URL information and the URL identifier satisfy the matching condition.
9. The device according to claim 8, wherein the first device further comprises:
an update module, configured to update the URL rules in the preset URL rule list, the updating comprising: adding, modifying and/or deleting.
10. The device according to claim 7, wherein the second judgment module comprises:
a second acquisition submodule, configured to obtain the identifiers of all clients in the online data processing request list as a set of identifiers to be matched;
an extraction submodule, configured to extract the identifier of the current client from the online data processing request of the current client;
a second judgment submodule, configured to judge whether the identifier of the current client matches any identifier to be matched in the identifier set.
CN201210516508.8A 2012-12-04 2012-12-04 The processing method of a kind of online data based on user behavior request and equipment Active CN103198091B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210516508.8A CN103198091B (en) 2012-12-04 2012-12-04 The processing method of a kind of online data based on user behavior request and equipment

Publications (2)

Publication Number Publication Date
CN103198091A CN103198091A (en) 2013-07-10
CN103198091B true CN103198091B (en) 2016-12-21

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462213A (en) * 2014-12-05 2015-03-25 成都逸动无限网络科技有限公司 User behavior analysis method and system based on big data
CN104680387A (en) * 2015-02-27 2015-06-03 百度在线网络技术(北京)有限公司 Information display method and device
CN106548067B (en) * 2015-09-21 2020-05-22 百度在线网络技术(北京)有限公司 Method and apparatus for forwarding access requests
CN105704120B (en) * 2016-01-05 2019-03-19 中云网安科技(北京)有限公司 A method of the secure access network based on self study form
CN110674174B (en) * 2019-09-24 2020-09-01 北京九章云极科技有限公司 Data real-time processing method and data real-time processing system
CN112822302B (en) * 2019-11-18 2023-03-24 百度在线网络技术(北京)有限公司 Data normalization method and device, electronic equipment and storage medium
CN111415263A (en) * 2020-04-07 2020-07-14 中国建设银行股份有限公司 Data matching method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101557427A (en) * 2009-05-11 2009-10-14 阿里巴巴集团控股有限公司 Method for providing diffluent information and realizing the diffluence of clients, system and server thereof
CN102624703A (en) * 2011-12-31 2012-08-01 成都市华为赛门铁克科技有限公司 Method and device for filtering uniform resource locators (URLs)

Also Published As

Publication number Publication date
CN103198091A (en) 2013-07-10

Similar Documents

Publication Publication Date Title
CN103198091B (en) The processing method of a kind of online data based on user behavior request and equipment
CN107590174B (en) Page access method and device
US10902077B2 (en) Search result aggregation method and apparatus based on artificial intelligence and search engine
CN104639420B (en) The information processing method and system of instant messaging
CN108780448A (en) Web page editing in domain
KR102472572B1 (en) Method for profiling user's intention and apparatus therefor
WO2013107376A1 (en) Information processing method and system for network trading platform
CN111178950A (en) User portrait construction method and device and computing equipment
KR20200142489A (en) Automatic advertisement execution device, method for automatically generating campaign information for an advertisement medium to execute an advertisement and computer program for executing the method
CN105279224A (en) Information push method and device
US11436297B2 (en) Landing page generation
CN106251168A (en) Information-pushing method and system
CN107679217A (en) Association method for extracting content and device based on data mining
US20140173031A1 (en) Information providing apparatus, information providing method, and network system
CN105574089A (en) Mapping knowledge domain generation method and device, and object comparison method and device
CN104598815A (en) Identification method and device of malicious advertisement program and client side
CN104142990A (en) Search method and device
CN110827112A (en) Deep learning commodity recommendation method and device, computer equipment and storage medium
Guarino et al. A machine learning-based approach to identify unlawful practices in online terms of service: analysis, implementation and evaluation
CN113094492A (en) Comment information display method, comment information processing system, comment information processing device, comment information equipment and storage medium
CN105354344A (en) SEO (search engine optimization) system and method
US11256703B1 (en) Systems and methods for determining long term relevance with query chains
CN111222918B (en) Keyword mining method and device, electronic equipment and storage medium
CN104050174B (en) A kind of personal page generation method and device
CN112860986A (en) System and method for generating individual content to users of a service

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant