CN107895011A - Processing method, system, storage medium and electronic device for session information - Google Patents

Info

Publication number
CN107895011A
Authority
CN
China
Prior art keywords
session
session information
historical
information
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711112871.2A
Other languages
Chinese (zh)
Other versions
CN107895011B (en)
Inventor
周宜兵
郑佰云
邢钦华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ctrip Travel Network Technology Shanghai Co Ltd
Original Assignee
Ctrip Travel Network Technology Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ctrip Travel Network Technology Shanghai Co Ltd
Priority to CN201711112871.2A
Publication of CN107895011A
Application granted
Publication of CN107895011B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201 Market modelling; Market analysis; Collecting market data

Abstract

The invention provides a method, system, storage medium and electronic device for processing session information. The method includes: receiving session information generated by a website user's session, and obtaining the non-registered identity identifier, the registered identity identifier and the remaining signature fields of the session information; determining in turn whether the non-registered identity identifier, the registered identity identifier and the remaining signature fields of the session information match, respectively, those of a historical session in a database; and determining whether the difference between the time fields of the session information and the matched historical session falls within one session cycle; if so, merging the session information into the matched historical session; if not, creating an independent session for the session information. By identifying session information along multiple dimensions, the invention can accurately gather, from massive data, the session information that reflects one user's behavior into the same user session, providing a data basis for risk-control big data analysis.

Description

Processing method, system, storage medium and electronic device for session information
Technical field
The present invention relates to the field of Internet technology, and in particular to a method, system, storage medium and electronic device for processing session information.
Background
A user session is the set of behavioral data recorded while a user uses a website's products or services, including behavioral data such as page views, clicks and transactions.
At present, more and more methods and systems perform big data analysis on user behavior, such as product recommendation systems and advertising systems, to help companies improve user experience, strengthen product features and increase profitability. However, the identification and integration of user behavior data is mostly based on a single dimension, the user ID. That is, behavioral data can be correctly collected and identified only for users who are registered and logged in, and the behavior of cross-device users (those who use a product or service on a PC and on a mobile device during the same period) cannot be handled.
In the risk-control field, a big data analysis engine can analyze user sessions for data such as transaction fraud, rule violations and the user's transaction environment, which a rule engine or anti-fraud system can use as a reference for transactions. Providing business systems such as anti-fraud systems with more, and more complete and reliable, multi-dimensional user information helps them improve their accuracy and intelligence.
However, because user session information is currently processed along a single dimension, the resulting analysis data is incomplete and insufficiently accurate, and cannot provide a good data basis for risk-control big data analysis.
It should be noted that the information disclosed in the background section above is intended only to aid understanding of the background of the disclosure, and may therefore include information that does not constitute prior art known to a person of ordinary skill in the art.
Summary of the invention
In view of the defects in the prior art, the problem to be solved by the invention is how to identify and process user session information along multiple dimensions, so that the behavioral data of one user is gathered from massive data into the same session.
According to one aspect of the invention, a method for processing session information is provided, the method including:
Step S101: receive session information generated by a website user's session, the session information carrying at least the non-registered identity identifier of the user;
Step S102: determine whether the non-registered identity identifier of the session information matches the non-registered identity identifier of a historical session in a database; if so, perform step S106; if not, perform step S103;
Step S103: determine whether the session information carries the registered identity identifier of the user; if so, perform step S104; if not, perform step S105;
Step S104: determine whether the registered identity identifier of the session information matches the registered identity identifier of a historical session in the database; if so, perform step S106; if not, perform step S105;
Step S105: obtain the client type of the session information and, according to the client type, obtain the similarity between each signature field of the session information and each signature field of the historical sessions in the database; if there is a historical session whose signature-field similarity to the session information is higher than a threshold, perform step S106; if there is none, perform step S108;
Step S106: determine whether the difference between the time fields of the session information and the matched historical session falls within one session cycle; if so, perform step S107; if not, perform step S108;
Step S107: merge the session information into the matched historical session;
Step S108: create an independent session for the session information.
Preferably, step S105 includes: step S1051, obtaining the client type of the session information, the client type including web client and mobile client; step S1052, obtaining the weight of each signature field of the session information according to the client type; step S1053, traversing the historical sessions in the database, and obtaining the client type of each historical session and the weight of each signature field of each historical session under its client type; step S1054, determining whether the sum of the weights of the signature fields that match between the session information and a historical session is greater than a threshold; if so, performing step S106; if not, performing step S108.
Preferably, the signature fields include a device identification number and a mobile phone identification number.
Preferably, step S101 includes: step S1011, receiving multiple groups of session information generated by multiple user sessions on the website, each group of session information carrying at least the non-registered identity identifier of the user; step S1012, distributing the groups of session information to different worker threads according to the non-registered identity identifier of each group, the different worker threads executing steps S103 to S108 in parallel in real time.
Preferably, in step S1011, a number is assigned to each group of session information, and the session information with different numbers is distributed to the corresponding worker threads using a hash algorithm with a modulo operation.
Preferably, the historical sessions include historical sessions in the active state in a local cache database and historical sessions in the expired state in a remote synchronization database.
Preferably, in step S106, it is determined whether the difference between the time fields of the session information and the matched historical session is within 30 minutes; if so, the session information and the matched historical session are determined to lie within one session cycle and step S107 is performed; if not, step S108 is performed.
According to another aspect of the invention, a system for processing session information is provided, the system including:
a session acquisition module, for receiving session information generated by a website user's session, the session information carrying at least the non-registered identity identifier of the user;
a non-registered identity identification module, for determining whether the non-registered identity identifier of the session information matches the non-registered identity identifier of a historical session in a database, triggering the session merging module if so and the registered identity identification module if not;
a registered identity identification module, for determining whether the registered identity identifier of the session information matches the registered identity identifier of a historical session in the database, triggering the session merging module if so and the signature field identification module if not;
a signature field identification module, for obtaining the client type of the session information, obtaining according to the client type the similarity between each signature field of the session information and each signature field of the historical sessions in the database, selecting the historical sessions whose signature-field similarity to the session information is higher than a threshold, and triggering the session merging module;
a session merging module, for determining whether the difference between the time fields of the session information and the matched historical session falls within one session cycle, merging the session information into the matched historical session if so, and creating an independent session for the session information if not.
Preferably, the above system for processing session information further includes: a local storage module, for caching historical sessions in the active state; and a remote synchronization module, for caching historical sessions in the expired state. The local storage module and the remote synchronization module are communicatively connected to the non-registered identity identification module, the registered identity identification module and the signature field identification module.
Preferably, the above system for processing session information further includes: an expiry processing module, for periodically extracting expired historical sessions from the local storage module and sending them to the work queue of the remote synchronization module.
According to another aspect of the invention, a computer-readable storage medium is provided, on which a computer program is stored; when executed by a processor, the program implements the steps of the above method for processing session information.
According to another aspect of the invention, an electronic device is provided, including: a processor; and a memory for storing instructions executable by the processor; wherein the processor is configured to perform the steps of the above method for processing session information by executing the executable instructions.
In view of this, the beneficial effects of the invention compared with the prior art are:
1) by identifying user session information along multiple dimensions, the invention can accurately gather, from massive data and according to key fields, the behavioral data of one user into the same user session;
2) billions of records per day can be processed in real time, the behavior trail of every user can be observed in real time, and the demands on machine performance are low;
3) a data basis is provided for risk-control big data analysis, and part of the data is supplied to the anti-fraud system.
It should be understood that the general description above and the detailed description below are only exemplary and explanatory, and do not limit the disclosure.
Brief description of the drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and, together with the description, serve to explain its principles. Obviously, the drawings described below are only some embodiments of the disclosure; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a method for processing session information in an exemplary embodiment of the invention;
Fig. 2 is a schematic diagram of the steps for determining the similarity between the signature fields of session information and the signature fields of a historical session in an exemplary embodiment of the invention;
Fig. 3 is a schematic diagram of the modules of a system for processing session information in an exemplary embodiment of the invention;
Fig. 4 is a schematic diagram of a computer-readable storage medium in an exemplary embodiment of the invention;
Fig. 5 is a schematic diagram of an electronic device in an exemplary embodiment of the invention.
Detailed description
Example embodiments are now described more fully with reference to the accompanying drawings. The example embodiments can, however, be implemented in many forms and should not be construed as limited to the examples set forth here; rather, these embodiments are provided so that the invention will be thorough and complete, and will fully convey the concept of the example embodiments to those skilled in the art. The described features, structures or characteristics may be combined in any suitable manner in one or more embodiments.
In addition, the drawings are only schematic illustrations of the invention and are not necessarily drawn to scale. Identical reference numerals in the drawings denote identical or similar parts, so their repeated description is omitted. Some of the block diagrams shown in the drawings are functional entities and do not necessarily correspond to physically or logically independent entities. These functional entities may be implemented in software, in one or more hardware modules or integrated circuits, or in different networks and/or processor devices and/or microcontroller devices.
Fig. 1 is a schematic flowchart of a method for processing session information in an embodiment. Referring to Fig. 1, the method of this embodiment mainly includes the following steps:
Step S101: receive session information generated by a website user's session, the session information carrying at least the non-registered identity identifier of the user. In website statistics, a user session is the activity of a user who has visited the website from a particular IP address recently (usually at any time within the past 30 minutes); it takes the form of the data set generated by user behavior such as browsing, searching and placing orders. The non-registered identity identifier is the visitor ID: whenever a user enters the website, the user is assigned a visitor ID that uniquely identifies that user.
Further, step S101 may specifically include: step S1011, receiving multiple groups of session information generated by multiple user sessions on the website, each group of session information carrying at least the non-registered identity identifier of the user; step S1012, distributing the groups of session information to different worker threads according to the non-registered identity identifier of each group, the different worker threads executing steps S103 to S108 in parallel in real time. In step S1011, a number is assigned to each group of session information, and the session information with different numbers is distributed to the corresponding worker threads using a hash algorithm with a modulo operation.
Specifically, when session information is distributed, it can first be checked whether some worker thread has already processed other session information with the same visitor ID as the session information to be distributed. If so, the session information is distributed to the worker thread that processed the other session information with that visitor ID, to speed up identification. If not, it is distributed using the hash algorithm. For example, suppose there are 20 groups of session information to distribute, numbered 1 to 20, and 8 worker threads, numbered 1 to 8. The number of each group of session information is taken modulo 8, and the resulting value identifies the worker thread for that group. Each worker thread then identifies and processes the session information in its pending queue using the steps below.
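The distribution rule just described can be sketched as follows. This is a minimal illustration, not the patented implementation: the class name `Dispatcher`, the method `assign` and the use of 0-based worker indices (so that a remainder of 0 is a valid worker) are assumptions made for the example; the text itself only specifies "same visitor ID goes to the same worker, otherwise number modulo the worker count".

```python
from typing import Dict

NUM_WORKERS = 8  # the example in the text uses 8 worker threads


class Dispatcher:
    """Routes session records to worker indices (0-based, an assumption)."""

    def __init__(self, num_workers: int = NUM_WORKERS) -> None:
        self.num_workers = num_workers
        # visitor ID -> worker that already handled a record for that visitor
        self.visitor_to_worker: Dict[str, int] = {}

    def assign(self, number: int, visitor_id: str) -> int:
        # Sticky routing: reuse the worker that already saw this visitor ID.
        worker = self.visitor_to_worker.get(visitor_id)
        if worker is None:
            # Otherwise fall back to the modulo rule from the text.
            worker = number % self.num_workers
            self.visitor_to_worker[visitor_id] = worker
        return worker
```

Keeping all records of one visitor on one worker means the visitor-ID match in step S102 never has to coordinate across threads, which is presumably why the text checks for an existing assignment before hashing.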
Step S102: determine whether the non-registered identity identifier of the session information matches the non-registered identity identifier of a historical session in the database; if so, perform step S106; if not, perform step S103. The historical sessions include historical sessions in the active state in the local cache database and historical sessions in the expired state in the remote synchronization database. In general, when a user performs no operation within a set period, for example 30 minutes, the user's previous session is considered to have ended and becomes an expired session.
Step S103: determine whether the session information carries the registered identity identifier of the user; if so, perform step S104; if not, perform step S105. The registered identity identifier is the user ID, that is, the ID of a registered user of the website. Any user who enters the website (registered or not) is assigned a visitor ID; once a registered member logs in to their account, their identity is identified by the user ID.
Step S104: determine whether the registered identity identifier of the session information matches the registered identity identifier of a historical session in the database; if so, perform step S106; if not, perform step S105. As with the non-registered identity identifier, the historical sessions here also include historical sessions in the active state in the local cache database and historical sessions in the expired state in the remote synchronization database.
Step S105: obtain the client type of the session information and, according to the client type, obtain the similarity between each signature field of the session information and each signature field of the historical sessions in the database; if there is a historical session whose signature-field similarity to the session information is higher than a threshold, perform step S106; if there is none, perform step S108. The signature fields are feature fields that identify the session information, including the device identification number, the mobile phone identification number (IMEI), the login location and so on. The visitor ID and user ID above are of course also feature fields of a session, but because the preceding steps already identify along the visitor ID and user ID dimensions, the signature fields here mainly refer to the remaining feature fields, other than the visitor ID and user ID, that can identify the session information.
Specifically, referring to Fig. 2, step S105 includes: step S1051, obtaining the client type of the session information, the client type including web client (PC, H5, etc.) and mobile client (mobile app, etc.). Step S1052, obtaining the weight of each signature field of the session information according to the client type. The weights of the signature fields differ between client types: for example, on a PC client the weight of the device identification number is 3 and the weight of the login location is 5, while on an app client the weight of the mobile phone identification number is 5 and the weight of the login location is 3. Step S1053, traversing the historical sessions in the database, and obtaining the client type of each historical session and the weight of each signature field of each historical session under its client type. Step S1054, determining whether the sum of the weights of the signature fields that match between the session information and a historical session is greater than a threshold; if so, performing step S106; if not, performing step S108. The threshold can be set according to actual conditions, and the invention places no limit on it. When the session information and a historical session have matching signature fields, and the sum of the weights of those matching fields is greater than the set threshold, the similarity between the signature fields of the session information and of the historical session is high, and the session information and the historical session can be determined to belong to the same user.
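A minimal sketch of the weighted comparison in steps S1051 to S1054 follows. The weights are the example values quoted in the text; the field keys (`device_id`, `imei`, `login_location`), the function names and the threshold value are hypothetical. The sketch also assumes that the weights of the incoming record's client type are the ones summed, which the text does not state explicitly.

```python
from typing import Dict

# Example weights from the text; the dictionary keys are illustrative names.
WEIGHTS: Dict[str, Dict[str, int]] = {
    "pc": {"device_id": 3, "login_location": 5},
    "app": {"imei": 5, "login_location": 3},
}


def match_score(current: Dict[str, str], client_type: str,
                history: Dict[str, str]) -> int:
    """Sum the weights of signature fields whose values match exactly."""
    score = 0
    for field_name, weight in WEIGHTS[client_type].items():
        value = current.get(field_name)
        # A field counts only if both sides carry it with the same value.
        if value is not None and value == history.get(field_name):
            score += weight
    return score


def is_same_user(current: Dict[str, str], client_type: str,
                 history: Dict[str, str], threshold: int) -> bool:
    """Step S1054: the summed weight must be greater than the threshold."""
    return match_score(current, client_type, history) > threshold
```

For instance, with a hypothetical threshold of 5, a PC record that shares only its device identification number with a historical session scores 3 and is not matched, whereas one that shares both the device identification number and the login location scores 8 and is.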
Step S106: determine whether the difference between the time fields of the session information and the matched historical session falls within one session cycle; if so, perform step S107; if not, perform step S108. Step S107: merge the session information into the matched historical session. Step S108: create an independent session for the session information. When step S102, step S104 or step S105 determines that the non-registered identity identifier, registered identity identifier or signature fields of the session information match those of a historical session, the session information and the historical session come from the operations of the same user. On that basis, it is determined whether the difference between the time fields of the session information and the matched historical session falls within one session cycle, for example within 30 minutes. If so, the session information is merged into the historical session, that is, gathered together with the historical session it matches, so that the user's behavior trail can be continuously tracked and merged. If not, either the session information comes from a new user (or new visitor) or no unexpired historical session in the active state exists for it, so an independent session is created for the session information to store the behavior trail left by the user's subsequent operations on the website.
In this embodiment, user session information is identified and processed along multiple dimensions, so that the behavioral data of one user can be accurately gathered, from massive data and according to key fields, into the same user session. Data show that with the method of this embodiment, billions of records per day can be processed in real time, the behavior trail of every user can be observed in real time, and the demands on machine performance are low. The method also provides a data basis for risk-control big data analysis and supplies part of the data to the anti-fraud system.
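The whole cascade of steps S102 to S108 can be sketched in a few lines. This is an illustration under stated assumptions, not the patented implementation: the `Record` and `Session` types, the in-memory list of sessions and the `signature_match` callback (standing in for step S105) are all hypothetical, as is the choice to prefer the most recently active session when several match and to treat a difference of exactly 30 minutes as still inside the cycle.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List, Optional

SESSION_CYCLE_SECONDS = 30 * 60  # the text's example session cycle


@dataclass
class Record:
    visitor_id: str
    timestamp: float
    user_id: Optional[str] = None


@dataclass
class Session:
    visitor_id: str
    user_id: Optional[str]
    last_seen: float
    records: List[Record] = field(default_factory=list)


def _latest(candidates: Iterable[Session]) -> Optional[Session]:
    # Assumption: among several matches, prefer the most recently active one.
    return max(candidates, key=lambda s: s.last_seen, default=None)


def find_match(record: Record, sessions: List[Session],
               signature_match: Callable[[Record, Session], bool]) -> Optional[Session]:
    # S102: match on visitor ID.
    match = _latest(s for s in sessions if s.visitor_id == record.visitor_id)
    # S103/S104: match on user ID only if the record carries one.
    if match is None and record.user_id is not None:
        match = _latest(s for s in sessions if s.user_id == record.user_id)
    # S105: fall back to signature-field similarity.
    if match is None:
        match = _latest(s for s in sessions if signature_match(record, s))
    return match


def process(record: Record, sessions: List[Session],
            signature_match: Callable[[Record, Session], bool] = lambda r, s: False
            ) -> Session:
    match = find_match(record, sessions, signature_match)
    # S106: the time difference must fall within one session cycle.
    if match is not None and abs(record.timestamp - match.last_seen) <= SESSION_CYCLE_SECONDS:
        match.records.append(record)  # S107: merge into the matched session
        match.last_seen = max(match.last_seen, record.timestamp)
        return match
    new_session = Session(record.visitor_id, record.user_id, record.timestamp, [record])
    sessions.append(new_session)      # S108: create an independent session
    return new_session
```

Note that a match on identity alone is not enough to merge: a record that matches an old session but arrives after the cycle has elapsed still starts a new session, which is the behavior the paragraph above describes.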
Fig. 3 is a schematic diagram of the modules of a system for processing session information in an embodiment. Referring to Fig. 3, the system of this embodiment includes:
A session acquisition module 301, for receiving session information generated by a website user's session, the session information carrying at least the non-registered identity identifier of the user. Specifically, all session information of the website over a period of time is stored, for example, in a server (kafka). The session acquisition module 301 can select data of specified types from the server (for example data of the transaction type, the browsing type or the flight-ticket type), convert the data uniformly into the standard format to be identified and processed, and then supply the converted data to the session processing module 302.
The session processing module 302 is mainly used to distribute, identify and gather data and to handle expiry. The session processing module 302 includes multiple worker threads 303 (owing to space limits only one is shown in the figure, which should not be construed as limiting the invention). After the session processing module 302 receives session information, it selects a suitable worker thread by a hash algorithm according to the visitor ID or user ID of the session information, and places the data in that worker thread's queue.
The worker thread 303 specifically includes a non-registered identity identification module 3031, a registered identity identification module 3032 and a signature field identification module 3033. The non-registered identity identification module 3031 is the visitor ID identification module, for determining whether the non-registered identity identifier (visitor ID) of the session information matches the non-registered identity identifier of a historical session in the database. The registered identity identification module 3032 is the user ID identification module, for determining whether the registered identity identifier (user ID) of the session information matches the registered identity identifier of a historical session in the database. The signature field identification module 3033 is for obtaining the client type of the session information and obtaining, according to the client type, the similarity between each signature field of the session information and each signature field of the historical sessions in the database.
The historical sessions above include the historical sessions in the active state stored in the local cache 31 and the historical sessions in the expired state stored in the remote database 32. The local storage module 304 caches the sessions produced during processing, establishes the communication connection between the session processing module 302 and the local cache 31, and provides a session query interface. That is, using the signature fields of the received session information (including the visitor ID, the user ID and the other signature fields), the worker thread 303 queries the local cache 31 through the local storage module 304 for historical sessions that share a signature field, selects from them the historical session that matches according to the session identification method, appends the session information to the selected historical session, and stores or updates it in the local cache 31 through the interface provided by the local storage module 304; if no historical session qualifies, a new independent session is created. Further, when session information for a particular event is encountered (such as placing an order or making a payment), the session information can be sent to the session collection module 306 for forwarding; and if, while session information is being processed, the size of a session (its in-memory footprint) becomes too large, the session can be split proportionally.
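One way to support "query the local cache for historical sessions that share a signature field" is an inverted index from field values to session IDs. The sketch below is only an illustration of that lookup idea; the class `LocalSessionIndex`, its methods and the field names are hypothetical, and the source does not say how the local cache 31 is actually organized.

```python
from collections import defaultdict
from typing import Dict, Set, Tuple


class LocalSessionIndex:
    """Inverted index: (field name, field value) -> IDs of sessions carrying it."""

    def __init__(self) -> None:
        self._index: Dict[Tuple[str, str], Set[str]] = defaultdict(set)

    def add(self, session_id: str, fields: Dict[str, str]) -> None:
        # Register every signature field of the session, visitor ID included.
        for name, value in fields.items():
            self._index[(name, value)].add(session_id)

    def candidates(self, fields: Dict[str, str]) -> Set[str]:
        # Union of all sessions sharing at least one field value with the query.
        found: Set[str] = set()
        for name, value in fields.items():
            found |= self._index.get((name, value), set())
        return found
```

The candidate set returned here would then be narrowed by the identification steps (visitor ID, user ID, weighted signature fields) before any merge takes place.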
In addition, the worker thread 303 also periodically extracts expired sessions from the local cache 31, removes them from it, and sends them to the queue of the session remote synchronization thread.
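The periodic expiry pass can be sketched as below. It is a simplified illustration: the cache is modelled as a plain dictionary from a session ID to its last-activity timestamp, the 30-minute timeout is taken from the example given for step S102, and the function name `evict_expired` is hypothetical.

```python
from queue import Queue
from typing import Dict, List

SESSION_TIMEOUT_SECONDS = 30 * 60  # example timeout from the text


def evict_expired(cache: Dict[str, float], now: float,
                  sync_queue: "Queue[str]") -> List[str]:
    """Remove sessions idle for longer than the timeout and queue them for sync."""
    # Collect first, then delete, so the dict is not mutated while iterating.
    expired = [sid for sid, last_seen in cache.items()
               if now - last_seen > SESSION_TIMEOUT_SECONDS]
    for sid in expired:
        del cache[sid]
        sync_queue.put(sid)  # handed to the remote synchronization thread
    return expired
```

A worker would call this on a timer with the current time; the remote synchronization thread consumes `sync_queue` independently.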
The remote synchronization module 305 is used to synchronize locally expired sessions into the remote database 32 and to merge them with the remote sessions. Specifically, the synchronization thread first queries the remote database 32 through the remote synchronization module 305 according to the signature fields of the received session information, applies the session identification method to the query results and the received session information to determine whether a matching historical session exists, merges them if so, and stores or merges the final result into the remote database 32.
The session collection module 306 is used to periodically collect expired sessions from the remote database 32 and to receive the sessions pushed by the session processing module 302, and then to push them to the kafka server. Specifically, the session collection module 306 periodically takes from its queue the sessions sent by the worker thread 303 and sends them to the kafka server through a kafka producer client; it also periodically pulls expired sessions from the remote database 32, sends them to the kafka server, sets the session state to unavailable and updates the state of the remote session.
Further, the worker thread 303 also includes a session identification module 3034. After the historical session that matches the session information has been selected, the session identification module 3034 determines whether the difference between the time fields of the session information and the matched historical session falls within one session cycle; if so, it merges the session information into the matched historical session; if not, it creates a new independent session for the session information, which is then stored in the corresponding cache module.
In an exemplary embodiment of the present invention, a kind of computer-readable recording medium is additionally provided, is stored thereon with meter Calculation machine program, the place of session information described in any one above-mentioned embodiment can be realized when the program is by such as computing device The step of reason method.In some possible embodiments, various aspects of the invention are also implemented as a kind of program product Form, it includes program code, and when described program product is run on the terminal device, described program code is used to making described Terminal device perform the processing method description of the above-mentioned session information of this specification according to the various illustrative embodiments of the present invention The step of.
Referring to Fig. 4, a program product 400 for implementing the above method according to an embodiment of the present invention is described. It may take the form of a portable compact disc read-only memory (CD-ROM) containing program code, and may run on a terminal device such as a personal computer. However, the program product of the present invention is not limited to this. In this document, a readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in connection with an instruction execution system, apparatus, or device.
The program product 400 may use any combination of one or more readable media. A readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples (a non-exhaustive list) of readable storage media include: an electrical connection with one or more wires, a portable disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
The readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which readable program code is carried. Such a propagated data signal may take various forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A readable signal medium may also be any readable medium other than a readable storage medium, and such a medium can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device. The program code contained on a readable medium may be transmitted over any suitable medium, including but not limited to wireless, wired, optical cable, RF, or any suitable combination of the above.
Program code for performing the operations of the present invention may be written in any combination of one or more programming languages, including object-oriented programming languages such as Java and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on a remote computing device or server. Where a remote computing device is involved, it may be connected to the user's computing device through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it may be connected to an external computing device (for example, through the Internet using an Internet service provider).
In an exemplary embodiment of the present invention, an electronic device is also provided. The electronic device may include a processor and a memory for storing instructions executable by the processor. The processor is configured to perform the steps of the session information processing method described in any one of the above embodiments by executing the executable instructions.
Those skilled in the art will understand that aspects of the present invention may be implemented as a system, a method, or a program product. Accordingly, aspects of the present invention may take the following forms: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.), or an embodiment combining hardware and software aspects, which may be collectively referred to herein as a "circuit", "module", or "system".
An electronic device 500 according to this embodiment of the present invention is described below with reference to Fig. 5. The electronic device 500 shown in Fig. 5 is only an example and should not impose any limitation on the function or scope of use of the embodiments of the present invention.
As shown in Fig. 5, the electronic device 500 takes the form of a general-purpose computing device. The components of the electronic device 500 may include, but are not limited to: at least one processing unit 510, at least one storage unit 520, a bus 530 connecting different system components (including the storage unit 520 and the processing unit 510), a display unit 540, and so on.
The storage unit stores program code that can be executed by the processing unit 510, so that the processing unit 510 performs the steps according to the various exemplary embodiments of the present invention described in the session information processing method section of this specification. For example, the processing unit 510 may perform the steps shown in Fig. 1.
The storage unit 520 may include readable media in the form of volatile storage units, such as a random access memory (RAM) unit 5201 and/or a cache storage unit 5202, and may further include a read-only memory (ROM) unit 5203.
The storage unit 520 may also include a program/utility 5204 having a set of (at least one) program modules 5205. Such program modules 5205 include, but are not limited to: an operating system, one or more application programs, other program modules, and program data. Each of these examples, or some combination of them, may include an implementation of a network environment.
The bus 530 may represent one or more of several types of bus structures, including a storage unit bus or storage unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus structures.
The electronic device 500 may also communicate with one or more external devices 600 (such as a keyboard, a pointing device, or a Bluetooth device), with one or more devices that enable a user to interact with the electronic device 500, and/or with any device (such as a router or a modem) that enables the electronic device 500 to communicate with one or more other computing devices. Such communication may take place through an input/output (I/O) interface 550. The electronic device 500 may also communicate with one or more networks (such as a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet) through a network adapter 560. The network adapter 560 may communicate with the other modules of the electronic device 500 through the bus 530. It should be understood that, although not shown in the figure, other hardware and/or software modules may be used in conjunction with the electronic device 500, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems.
From the above description of the embodiments, those skilled in the art will readily understand that the example embodiments described here may be implemented in software, or in software combined with the necessary hardware. Therefore, the technical solution according to the embodiments of the present invention may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (such as a CD-ROM, a USB flash drive, or a removable hard disk) or on a network, and which includes a number of instructions that cause a computing device (such as a personal computer, a server, or a network device) to perform the session information processing method according to the embodiments of the present invention.
In summary, the present invention identifies user session information (sessions) along multiple dimensions, namely the non-registered identity identifier, the registered identity identifier, and client-based signature fields, so that the behavior data of the same user can be accurately gathered from massive data into the same user session according to key fields. Data show that with this method billions of records per day can be processed in real time, the complete user behavior trajectory can be observed in real time, and the demands on machine performance are low. The method also provides a data basis for risk-control big data analysis and supplies part of the data to the anti-fraud system.
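The multi-dimensional identification summarized above, which corresponds to steps S102 to S105 of the method, can be sketched as a three-stage lookup. This is an illustrative sketch under stated assumptions: the keys `anon_id` and `user_id`, the function name `find_match`, and the linear scan over an in-memory list are hypothetical, and the signature comparison is passed in as a callable because its details depend on the client type.

```python
from typing import Callable, Optional


def find_match(info: dict, history: list,
               signature_match: Callable[[dict, dict], bool]) -> Optional[dict]:
    """Return the historical session matching `info`, or None if there is none."""
    # S102: compare the non-registered identity identifier (assumed key "anon_id").
    for hist in history:
        if hist.get("anon_id") == info["anon_id"]:
            return hist
    # S103/S104: compare the registered identity identifier, if one is carried.
    user_id = info.get("user_id")
    if user_id is not None:
        for hist in history:
            if hist.get("user_id") == user_id:
                return hist
    # S105: fall back to comparing client-based signature fields.
    for hist in history:
        if signature_match(info, hist):
            return hist
    return None
```

A session found this way still has to pass the session-cycle check of step S106 before the new information is merged into it.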
Other embodiments of the present invention will readily occur to those skilled in the art after considering the specification and practicing the invention disclosed here. This application is intended to cover any variations, uses, or adaptations of the present invention that follow its general principles and include common knowledge or customary technical means in the art that are not disclosed by the present invention. The specification and embodiments are to be regarded as exemplary only, with the true scope and spirit of the present invention indicated by the appended claims.

Claims (12)

1. A method for processing session information, characterized in that the method comprises:
Step S101: receiving session information generated by a website user session, the session information carrying at least the non-registered identity identifier of the user;
Step S102: identifying whether the non-registered identity identifier of the session information matches the non-registered identity identifier of a historical session in a database; if so, performing step S106; if not, performing step S103;
Step S103: judging whether the session information carries the registered identity identifier of the user; if so, performing step S104; if not, performing step S105;
Step S104: identifying whether the registered identity identifier of the session information matches the registered identity identifier of a historical session in the database; if so, performing step S106; if not, performing step S105;
Step S105: obtaining the client type of the session information, and obtaining, according to the client type, the similarity between each signature field of the session information and each signature field of the historical sessions in the database; if there is a historical session whose signature-field similarity to the session information is higher than a threshold, performing step S106; if not, performing step S108;
Step S106: judging whether the difference between the time field of the session information and that of the matched historical session is within one session cycle; if so, performing step S107; if not, performing step S108;
Step S107: merging the session information into the matched historical session;
Step S108: creating an independent session for the session information.
2. The method for processing session information according to claim 1, characterized in that step S105 comprises:
Step S1051: obtaining the client type of the session information, the client type including a web page client and a mobile client;
Step S1052: obtaining the weight value of each signature field of the session information according to the client type;
Step S1053: traversing the historical sessions in the database, and obtaining the client type of each historical session and the weight value of each signature field of each historical session under its corresponding client type;
Step S1054: judging whether the sum of the weight values of the matching signature fields of the session information and a historical session is greater than a threshold; if so, performing step S106; if not, performing step S108.
3. The method for processing session information according to claim 2, characterized in that the signature fields include a device identification number and a mobile phone identification number.
4. The method for processing session information according to claim 1, characterized in that step S101 comprises:
Step S1011: receiving multiple groups of session information generated by multiple user sessions of the website, each group of session information carrying at least the non-registered identity identifier of the user;
Step S1012: distributing the groups of session information to different worker threads according to the non-registered identity identifier of each group, the different worker threads executing steps S103 to S108 in parallel and in real time.
5. The method for processing session information according to claim 4, characterized in that in step S1011 a number is assigned to each group of session information, and the session information with different numbers is distributed to the corresponding worker threads using a hash algorithm with a modulo operation.
6. The method for processing session information according to claim 1, characterized in that the historical sessions include historical sessions in an active state in a local cache database and historical sessions in an expired state in a remote synchronization database.
7. The method for processing session information according to claim 1, characterized in that in step S106 it is judged whether the difference between the time field of the session information and that of the matched historical session is within 30 minutes; if so, the session information and the matched historical session are judged to be in the same session cycle and step S107 is performed; if not, step S108 is performed.
8. A system for processing session information, characterized in that the system comprises:
a session acquisition module, for receiving session information generated by a website user session, the session information carrying at least the non-registered identity identifier of the user;
a non-registered identity identifier identification module, for identifying whether the non-registered identity identifier of the session information matches the non-registered identity identifier of a historical session in a database; if so, triggering the session merging module; if not, triggering the registered identity identifier identification module;
a registered identity identifier identification module, for identifying whether the registered identity identifier of the session information matches the registered identity identifier of a historical session in the database; if so, triggering the session merging module; if not, triggering the signature field identification module;
a signature field identification module, for obtaining the client type of the session information, obtaining, according to the client type, the similarity between each signature field of the session information and each signature field of the historical sessions in the database, filtering out the historical sessions whose signature-field similarity to the session information is higher than a threshold, and triggering the session merging module;
a session merging module, for judging whether the difference between the time field of the session information and that of the matched historical session is within one session cycle; if so, merging the session information into the matched historical session; if not, creating an independent session for the session information.
9. The system for processing session information according to claim 8, characterized in that it further comprises:
a local storage module, for caching historical sessions in an active state;
a remote synchronization module, for caching historical sessions in an expired state;
wherein the local storage module and the remote synchronization module are both communicatively connected with the non-registered identity identifier identification module, the registered identity identifier identification module, and the signature field identification module.
10. The processing system according to claim 9, characterized in that it further comprises:
an expiration processing module, for periodically extracting expired historical sessions from the local storage module and sending them to the work queue of the remote synchronization module.
11. A computer-readable storage medium on which a computer program is stored, characterized in that the program, when executed by a processor, implements the steps of the method for processing session information according to any one of claims 1 to 7.
12. An electronic device, characterized by comprising:
a processor; and
a memory, for storing instructions executable by the processor;
wherein the processor is configured to perform the steps of the method for processing session information according to any one of claims 1 to 7 by executing the executable instructions.
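Two of the more mechanical steps recited in the claims, the weighted signature comparison of claim 2 and the hash-modulo dispatch of claim 5, can be sketched as follows. Everything concrete here is an assumption made for illustration: the weight table, the client type keys, the field names `device_id` and `phone_id`, the use of CRC32 as the hash, and hashing the non-registered identity identifier directly are not specified by the claims. For brevity the sketch looks up weights by the incoming session's client type only, whereas step S1053 also considers the client type of each historical session.

```python
import zlib

# Hypothetical integer weights per client type; the claims give no values.
WEIGHTS = {
    "web": {"device_id": 10},
    "mobile": {"device_id": 6, "phone_id": 4},
}


def signature_score(info: dict, hist: dict) -> int:
    """Sum the weights of the signature fields whose values match (step S1054)."""
    weights = WEIGHTS[info["client_type"]]
    return sum(
        weight
        for name, weight in weights.items()
        if info.get(name) is not None and info.get(name) == hist.get(name)
    )


def worker_index(anon_id: str, workers: int) -> int:
    """Pick a worker thread by hashing the identifier and taking the modulo."""
    # CRC32 is used because it is stable across processes, unlike Python's
    # built-in hash() for strings; any deterministic hash would do.
    return zlib.crc32(anon_id.encode("utf-8")) % workers
```

A historical session would count as a match when `signature_score` exceeds the chosen threshold. Because `worker_index` is deterministic, all session information carrying the same identifier lands on the same worker thread.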
CN201711112871.2A 2017-11-03 2017-11-03 Session information processing method, system, storage medium and electronic equipment Active CN107895011B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711112871.2A CN107895011B (en) 2017-11-03 2017-11-03 Session information processing method, system, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711112871.2A CN107895011B (en) 2017-11-03 2017-11-03 Session information processing method, system, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN107895011A true CN107895011A (en) 2018-04-10
CN107895011B CN107895011B (en) 2020-05-26

Family

ID=61805203

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711112871.2A Active CN107895011B (en) 2017-11-03 2017-11-03 Session information processing method, system, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN107895011B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108446183A (en) * 2018-04-13 2018-08-24 广东亿迅科技有限公司 Processing method and processing device based on message distribution
CN108549691A (en) * 2018-04-13 2018-09-18 郑州云海信息技术有限公司 A kind of tracking of database session and analysis method and its device
CN109003605A (en) * 2018-07-02 2018-12-14 北京百度网讯科技有限公司 Intelligent sound interaction processing method, device, equipment and storage medium
CN109118779A (en) * 2018-10-12 2019-01-01 东软集团股份有限公司 Break in traffic rules and regulations information identifying method, equipment and readable storage medium storing program for executing
CN109257448A (en) * 2018-11-21 2019-01-22 网易(杭州)网络有限公司 A kind of synchronous method and device of session information, electronic equipment, storage medium
CN110008081A (en) * 2019-02-21 2019-07-12 阿里巴巴集团控股有限公司 A kind of interaction data processing method and device
CN110502549A (en) * 2019-07-08 2019-11-26 招联消费金融有限公司 User data processing method, device, computer equipment and storage medium
CN111459950A (en) * 2019-01-18 2020-07-28 北京字节跳动网络技术有限公司 Data updating method and device
CN116597855A (en) * 2023-07-18 2023-08-15 深圳市则成电子股份有限公司 Adaptive noise reduction method and device and computer equipment
CN116881429A (en) * 2023-09-07 2023-10-13 四川蜀天信息技术有限公司 Multi-tenant-based dialogue model interaction method, device and storage medium

Citations (3)

Publication number Priority date Publication date Assignee Title
CN102387135A (en) * 2011-09-29 2012-03-21 北京邮电大学 User identity filtering method and firewall
US20140143230A1 (en) * 2012-11-16 2014-05-22 International Business Machines Corporation Contextual search history in collaborative archives
CN106973062A (en) * 2017-04-27 2017-07-21 努比亚技术有限公司 A kind of conversation managing method and server

Non-Patent Citations (1)

Title
羊淑英等: "统一会话管理平台的研究", 《西昌学院学报(自然科学版)》 *

Cited By (16)

Publication number Priority date Publication date Assignee Title
CN108446183A (en) * 2018-04-13 2018-08-24 广东亿迅科技有限公司 Processing method and processing device based on message distribution
CN108549691A (en) * 2018-04-13 2018-09-18 郑州云海信息技术有限公司 A kind of tracking of database session and analysis method and its device
CN108549691B (en) * 2018-04-13 2021-09-17 郑州云海信息技术有限公司 Database session tracking and analyzing method and device
CN109003605A (en) * 2018-07-02 2018-12-14 北京百度网讯科技有限公司 Intelligent sound interaction processing method, device, equipment and storage medium
CN109003605B (en) * 2018-07-02 2020-04-21 北京百度网讯科技有限公司 Intelligent voice interaction processing method, device, equipment and storage medium
CN109118779A (en) * 2018-10-12 2019-01-01 东软集团股份有限公司 Break in traffic rules and regulations information identifying method, equipment and readable storage medium storing program for executing
CN109257448A (en) * 2018-11-21 2019-01-22 网易(杭州)网络有限公司 A kind of synchronous method and device of session information, electronic equipment, storage medium
CN109257448B (en) * 2018-11-21 2021-07-09 网易(杭州)网络有限公司 Session information synchronization method and device, electronic equipment and storage medium
CN111459950A (en) * 2019-01-18 2020-07-28 北京字节跳动网络技术有限公司 Data updating method and device
CN110008081A (en) * 2019-02-21 2019-07-12 阿里巴巴集团控股有限公司 A kind of interaction data processing method and device
CN110008081B (en) * 2019-02-21 2023-02-24 创新先进技术有限公司 Interactive data processing method and device
CN110502549A (en) * 2019-07-08 2019-11-26 招联消费金融有限公司 User data processing method, device, computer equipment and storage medium
CN116597855A (en) * 2023-07-18 2023-08-15 深圳市则成电子股份有限公司 Adaptive noise reduction method and device and computer equipment
CN116597855B (en) * 2023-07-18 2023-09-29 深圳市则成电子股份有限公司 Adaptive noise reduction method and device and computer equipment
CN116881429A (en) * 2023-09-07 2023-10-13 四川蜀天信息技术有限公司 Multi-tenant-based dialogue model interaction method, device and storage medium
CN116881429B (en) * 2023-09-07 2023-12-01 四川蜀天信息技术有限公司 Multi-tenant-based dialogue model interaction method, device and storage medium

Also Published As

Publication number Publication date
CN107895011B (en) 2020-05-26

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant