CN103763138B - Method for improving precision of identification of dynamic users and static users of current network - Google Patents

Method for improving precision of identification of dynamic users and static users of current network


Publication number
CN103763138B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201410013424.1A
Other languages
Chinese (zh)
Other versions
CN103763138A (en)
Inventor
崔晶晶
林佳婕
窦乐健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaoxiang Innovation Artificial Intelligence Technology Co ltd
Original Assignee
BEIJING GEO POLYMERIZATION TECHNOLOGY Co Ltd
Application filed by BEIJING GEO POLYMERIZATION TECHNOLOGY Co Ltd
Priority to CN201410013424.1A
Publication of CN103763138A
Application granted
Publication of CN103763138B
Expired - Fee Related


Abstract

The invention relates to the technical field of computer networks and discloses a method for improving the precision of identification of dynamic users and static users of a current network. The method analyzes the raw data of RADIUS signaling messages and HTTP access messages and combines a NoSQL database, a local cache, distributed cache synchronization, a user cache aging mechanism, and asynchronous database access. It remedies the shortcomings of user classification under big-data conditions, distinguishes dynamic users from static users effectively, and improves the accuracy of user identification.

Description

Method for improving the precision of identification of dynamic users and static users of a current network
Technical field
The present invention relates to the technical field of computer networks, and in particular to a method for improving the precision of identification of dynamic users and static users of a current network.
Background
User analysis is an important part of network analysis. Before users can be analyzed, each user must be identified, and dynamic users must be distinguished from static users. Several user identification methods are currently in common use:
1. User identification based on IP or IP+Agent
The IP address is the easiest information to obtain, and web logs generally contain both the IP address and the Agent information. However, because of spoofed IP addresses, proxies, dynamic IP addresses, and LANs that share a single public IP egress, the accuracy of this approach is low.
2. User identification based on user ID
Identification based on user ID is the most accurate, because users do not normally share their IDs. However, identifying users by ID requires that a registration and login service be provided and that the user ID be recorded in the clickstream data.
3. User identification based on cookies
The cookie field is commonly used to identify users on the network, and analyzing cookies yields fairly accurate user information. For example, CN201310134318.4, "A method and device for identifying a user", describes a method that uses statistics to find, for each website, the long-lived cookie field that uniquely identifies a user; associates these cookies according to the redirect relationships among the user's access messages to generate a user cookie-value relation table and the corresponding user IDs; collects the access messages sent by users; and marks each access message according to the relation table and the corresponding user ID in order to identify the user. This replaces identification based on information such as ADSL accounts and IP addresses and can effectively improve the accuracy and efficiency of user identification. However, the user ID is private user information known only to the corresponding website, as are the settings and storage location of its cookies, so obtaining the cookies is complicated. In addition, the user attributes obtained by analyzing cookies are limited.
Summary of the invention
(1) Technical problem solved by the invention
To address the defects of the prior art, the present invention proposes a method for improving the precision of identification of dynamic users and static users of a current network. The method remedies the shortcomings of user classification under big-data conditions, effectively distinguishes dynamic users from static users, and improves the accuracy of user identification.
(2) Technical solution
To achieve the above object, the present invention proposes a new solution: it analyzes the raw data of signaling messages and access messages and combines a NoSQL database, a local cache, distributed cache synchronization, a user cache aging mechanism, and asynchronous database access to improve the precision of identification of dynamic and static users of the current network.
In one aspect, the present invention provides a method for improving the precision of identification of dynamic users and static users of a current network, characterized in that the method comprises the steps of:
(1) identifying dynamic users and static users using the online message of the RADIUS protocol;
(2) identifying dynamic users and static users using HTTP messages;
(3) repeating steps (1) and (2) until dynamic users and static users are accurately identified.
In the method, each user carries two flags: user type and user status. The user type is either dynamic user or static user; the user status is either valid user or invalid user and is used to assist in identifying dynamic and static users.
Preferably, the specific steps for identifying dynamic users and static users using the online message of the RADIUS protocol in step (1) are as follows:
S1: capture an online message and query the local cache;
S2: if the local cache contains no UID record, go to step S7; if it contains a UID record, determine whether the UID in the cache equals the UID in the RADIUS message; if they are equal, go to step S3, otherwise go to step S5;
S3: determine whether the user is valid;
S4: if the user is valid, first take the user offline, then store the user in the local cache and go to step S7; if the user is invalid, store the user directly in the local cache and go to step S7;
S5: determine whether the user is valid;
S6: if the user is valid, mark the user as a dynamic user; if the user is invalid, store the user in the local cache and go to step S7;
S7: mark the user as a dynamic user, synchronize the other caches, and update the NoSQL database.
Preferably, the specific steps for identifying dynamic users and static users using HTTP messages in step (2) are as follows:
S1: capture an HTTP message and query the local cache;
S2: if the local cache contains no corresponding record, go to step S7; if it contains a corresponding record, go to step S3;
S3: determine whether the user is valid;
S4: if the user is valid, go to step S5; if the user is invalid, go to step S7;
S5: determine whether the user is a static user;
S6: if the user is not a static user, mark the user as a dynamic user; if the user is a static user, determine whether the user has exceeded the aging time; if not, the user is confirmed as a static user; if so, delete the local record, synchronize the other caches, and save to the database;
S7: check whether the user belongs to this operator; if so, query the NoSQL database asynchronously; if not, the traffic is transit traffic and the message is filtered out directly.
Preferably, the caches are distributed caches, and synchronization between caches uses the synchronization mode of the distributed cache.
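The patent does not specify how the caches are synchronized. Purely as an illustration, the sketch below shows one simple possibility in which every write or delete on one cache node is forwarded to its peers. The class `SyncedCache` and its methods are hypothetical names, and the in-process peer list stands in for whatever network transport a real distributed cache would use.

```python
from typing import Dict, List


class SyncedCache:
    """Local cache that forwards every write and delete to its peers (assumed design)."""

    def __init__(self) -> None:
        self.data: Dict[str, dict] = {}
        self.peers: List["SyncedCache"] = []

    def put(self, key: str, value: dict, replicate: bool = True) -> None:
        self.data[key] = value
        if replicate:
            for peer in self.peers:
                # replicate=False stops the peer from echoing the write back
                peer.put(key, value, replicate=False)

    def delete(self, key: str, replicate: bool = True) -> None:
        self.data.pop(key, None)
        if replicate:
            for peer in self.peers:
                peer.delete(key, replicate=False)


node_a, node_b = SyncedCache(), SyncedCache()
node_a.peers.append(node_b)
node_b.peers.append(node_a)
node_a.put("10.0.0.7", {"uid": "u1", "type": "dynamic"})
print(node_b.data["10.0.0.7"]["type"])  # prints "dynamic"
```

A production system would more likely rely on an existing distributed cache product than on hand-written replication; the sketch only shows the intended effect, namely that a change made on one node becomes visible on the others.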
Preferably, step (1) may also use the update message of the RADIUS protocol to identify dynamic users and static users; the steps are the same as for the online message.
(3) Technical effect
The present invention combines a local cache with a NoSQL database and uses a distributed synchronization mechanism for the local caches together with asynchronous database access. It can make use of the user data recorded by the operator, effectively remedies the shortcomings of the prior art in classifying users under big-data conditions, and determines more accurately whether a user is a dynamic user or a static user.
Brief description of the drawings
Fig. 1 is a flow chart of the method for improving the precision of identification of dynamic users and static users of a current network according to the present invention;
Fig. 2 is a flow chart of the method for identifying dynamic users and static users using the online message of the RADIUS protocol according to the present invention;
Fig. 3 is a flow chart of the method for identifying dynamic users and static users using HTTP messages according to the present invention.
Detailed description of the invention
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments without creative work fall within the scope of protection of the present invention.
The method of the present invention distinguishes dynamic users from static users by jointly analyzing the RADIUS messages and HTTP messages in the raw traffic of the current network. It uses a NoSQL database together with local caching, asynchronous access, and a distributed cache synchronization mechanism to improve the precision with which static and dynamic users of the current network are identified. The data source is the raw mirrored traffic of the operator's central equipment room.
A user who connects to the network by dialing up with a username and password each time is a dynamic user; a user who shows only browsing behavior and never dials up is considered a static user.
The present invention assigns each user two flags: user type and user status. The user type is either dynamic user or static user; the user status is either valid user or invalid user and is used to assist in identifying dynamic and static users.
A NoSQL database stores the user data. The local cache holds each user's UID and status. The distributed cache holds each user's UID, whether the user is valid (determined by whether the user has gone offline), and the user's attribute (dynamic user or static user).
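The two flags and the cached fields described above could be modeled as follows. This is a minimal sketch under stated assumptions: the names `UserType`, `UserStatus`, and `CacheRecord` are illustrative, and the `last_seen` timestamp (useful for an aging check) is a hypothetical field that the patent does not name.

```python
from dataclasses import dataclass
from enum import Enum


class UserType(Enum):
    DYNAMIC = "dynamic"  # dials up with a username and password
    STATIC = "static"    # shows only browsing traffic, never dials up


class UserStatus(Enum):
    VALID = "valid"      # still online: no offline message has been seen
    INVALID = "invalid"  # has gone offline


@dataclass
class CacheRecord:
    uid: str                # user account identifier taken from the RADIUS message
    ip: str                 # IP address observed in the traffic
    status: UserStatus
    user_type: UserType
    last_seen: float = 0.0  # hypothetical: time of the last message, for aging


record = CacheRecord("a1b2", "10.0.0.7", UserStatus.VALID, UserType.DYNAMIC)
print(record.user_type.value)  # prints "dynamic"
```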
Fig. 1 is a flow chart of the method for improving the precision of identification of dynamic users and static users of a current network according to the present invention. The method comprises the steps of:
(1) identifying dynamic users and static users using the online message of the RADIUS protocol;
(2) identifying dynamic users and static users using HTTP messages;
(3) repeating steps (1) and (2) until dynamic users and static users are accurately identified.
Fig. 2 is a flow chart of the method for identifying dynamic users and static users using the online message of the RADIUS protocol according to the present invention.
RADIUS messages are used for authentication and accounting of dial-up users. There are three types of RADIUS message: the online message, the update message, and the offline message. Parsing a message yields the user account UID and the IP address; the account name must be MD5-hashed.
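As an illustration of the parsing step, the sketch below takes RADIUS attributes that have already been decoded into a dictionary and returns the hashed account name and the IP address. It assumes that the account name is carried in the standard `User-Name` attribute and the address in `Framed-IP-Address`; the patent does not name the attributes, and the function name `extract_uid_and_ip` is hypothetical. Note that MD5 is a one-way hash rather than reversible encryption.

```python
import hashlib
from typing import Dict, Tuple


def extract_uid_and_ip(attrs: Dict[str, str]) -> Tuple[str, str]:
    """Return (UID, IP) from decoded RADIUS attributes.

    Assumption: the UID is the MD5 hex digest of the User-Name attribute.
    """
    account_name = attrs["User-Name"]
    uid = hashlib.md5(account_name.encode("utf-8")).hexdigest()
    return uid, attrs["Framed-IP-Address"]


uid, ip = extract_uid_and_ip({"User-Name": "abc", "Framed-IP-Address": "10.0.0.7"})
print(uid)  # prints "900150983cd24fb0d6963f7d28e17f72"
```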
The specific steps of the method are as follows:
S1: capture an online message and query the local cache;
S2: if the local cache contains no UID record, go to step S7; if it contains a UID record, determine whether the UID in the cache equals the UID in the RADIUS message; if they are equal, go to step S3, otherwise go to step S5;
S3: determine from the flag field in the local cache table whether the user is valid;
S4: if the user is valid, the user's earlier offline message was lost, so first take the user offline and update the offline time, then store the user in the local cache and go to step S7; if the user is invalid, store the user directly in the local cache and go to step S7;
S5: determine from the flag field in the local cache table whether the user is valid;
S6: if the user is valid, mark the user as a dynamic user; if the user is invalid, store the user in the local cache and go to step S7;
S7: mark the user as a dynamic user, synchronize the other caches, and update the NoSQL database.
The update message of the RADIUS protocol can also be used to identify dynamic users and static users; the procedure is the same as for the online message.
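Steps S1 to S7 can be sketched roughly as follows. This is a simplified illustration, not the patented implementation: it assumes the local cache is keyed by IP address, and the names `Record`, `handle_online_message`, and `outbox` are hypothetical. The `outbox` list stands in for cache synchronization and the NoSQL update.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Record:
    uid: str
    valid: bool       # status flag: True while the user is online
    is_dynamic: bool  # type flag


def handle_online_message(cache: Dict[str, Record], uid: str, ip: str,
                          now: float,
                          outbox: List[Tuple[str, str, float]]) -> Record:
    """Apply steps S1-S7 to one online message (cache keyed by IP: assumed)."""
    old = cache.get(ip)                                   # S1
    if old is not None and old.uid == uid and old.valid:  # S2-S4
        # Same UID still marked valid: the earlier offline message was lost,
        # so close the stale session and record the offline time first.
        old.valid = False
        outbox.append(("offline", old.uid, now))
    # The other branches (no record, invalid record, or a different UID,
    # S5-S6) all end the same way: the user is stored and marked dynamic.
    record = Record(uid=uid, valid=True, is_dynamic=True)  # S7
    cache[ip] = record
    outbox.append(("online", uid, now))  # stand-in for sync + NoSQL update
    return record


cache: Dict[str, Record] = {}
outbox: List[Tuple[str, str, float]] = []
handle_online_message(cache, "u1", "10.0.0.7", 100.0, outbox)
handle_online_message(cache, "u1", "10.0.0.7", 200.0, outbox)  # offline message lost
print(outbox)
```

Every path through the online-message procedure ends with the user marked dynamic, which is why the sketch collapses several branches; only the lost-offline-message case needs extra bookkeeping.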
Fig. 3 is a flow chart of the method for identifying dynamic users and static users using HTTP messages according to the present invention. The specific steps are as follows:
S1: capture an HTTP message and query the local cache;
S2: check whether the local cache table contains a record corresponding to the IP address of the message; if not, go to step S7; if it does, go to step S3;
S3: determine from the flag field in the local cache table whether the user is valid;
S4: if the user is valid, go to step S5; if the user is invalid, go to step S7;
S5: determine from the user attribute field in the local cache table whether the user is a static user;
S6: if the user is not a static user, mark the user as a dynamic user; if the user is a static user, determine whether the user has exceeded the aging time; if not, the user is confirmed as a static user; if so, delete the local record, synchronize the other caches, and save to the database;
S7: check whether the user belongs to this operator; if so, query the NoSQL database asynchronously and save the result to the local cache; if not, the traffic is transit traffic and the message is filtered out directly.
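The HTTP procedure can be sketched in the same spirit. The code below is an illustration under several assumptions: the cache is keyed by IP address, the aging check compares the current time with a hypothetical `last_seen` timestamp, the operator check is passed in as a callable, and the asynchronous NoSQL query is represented by placing the IP address on a queue that a background worker would drain. None of these names come from the patent.

```python
import queue
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Record:
    uid: str
    valid: bool
    is_static: bool
    last_seen: float  # hypothetical timestamp used for the aging check


def handle_http_message(cache: Dict[str, Record], ip: str, now: float,
                        aging_seconds: float,
                        is_operator_ip: Callable[[str], bool],
                        lookups: "queue.Queue[str]") -> str:
    """Apply steps S1-S7 to one HTTP message and return the outcome."""
    record = cache.get(ip)                        # S1, S2
    if record is not None and record.valid:       # S3, S4
        if not record.is_static:                  # S5, S6
            return "dynamic"
        if now - record.last_seen <= aging_seconds:
            return "static"
        # Aged out: drop the local record. A real system would also
        # synchronize the other caches and persist the change.
        del cache[ip]
        return "aged_out"
    # S7: no usable record for this IP address.
    if is_operator_ip(ip):
        lookups.put(ip)  # a worker would query the NoSQL database later
        return "lookup_queued"
    return "filtered"    # transit traffic


demo_cache = {"10.0.0.7": Record("u1", True, False, 0.0)}
demo_queue: "queue.Queue[str]" = queue.Queue()
print(handle_http_message(demo_cache, "10.0.0.7", 50.0, 100.0,
                          lambda addr: addr.startswith("10."), demo_queue))
```

Queuing the lookup instead of querying inline keeps the packet-processing path from blocking on the database, which appears to be the point of the asynchronous access described in step S7.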
This technique can identify the dynamic users and static users of the current network. When messages are lost, a user may be misclassified because a message was not captured, but with a large volume of data the invention can correct the user attribute. Throughout the process, once a user's attribute is dynamic user it does not change again; if the attribute is static user, a later message may show the user to be a dynamic user, in which case the attribute must be changed to dynamic user. As data accumulates over time, the proportions and details of the local operator's dynamic and static users can be identified with increasing accuracy.
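The correction rule described here is one-directional: dynamic is sticky, while static can be upgraded. A minimal sketch, with the hypothetical function name `update_user_type` and plain strings standing in for the attribute values:

```python
def update_user_type(current: str, observed: str) -> str:
    """Merge a new observation into the stored attribute.

    Once a user is dynamic the attribute never changes; a static user
    becomes dynamic as soon as any message shows dynamic behavior.
    """
    if current == "dynamic" or observed == "dynamic":
        return "dynamic"
    return "static"


print(update_user_type("static", "dynamic"))  # prints "dynamic"
print(update_user_type("dynamic", "static"))  # prints "dynamic"
```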
The identification process is described in detail below, taking one user as an example:
When a user A dials up, the RADIUS online message is captured and the user account UID and IP address are parsed from it. The local cache is then queried; if it contains no UID record, user A is considered a dynamic user.
If the local cache contains a UID, it is determined whether the UID in the cache and the UID in the message belong to the same user. If they differ, it is determined whether the user is valid; if the user is valid, the user is taken offline and then stored in the cache and identified as a dynamic user. If the user is invalid, the user is directly identified as a dynamic user.
If the UID in the local cache and the UID in the message belong to the same user, it is determined whether the user is valid. If the user is valid, the user is identified as a dynamic user; if the user is invalid, the user is stored in the local cache and identified as a dynamic user.
The traffic that user A generates after dialing up consists of ordinary HTTP messages. The IP address is parsed from the message and used to query the local cache. If a record is found, it is determined whether user A is valid. If user A is valid, it is determined whether user A is a static user: if not, user A is a dynamic user; if so, it is determined whether user A has exceeded the aging time. If the aging time has not been exceeded, user A is confirmed as a static user; if it has been exceeded, the local record is deleted, the other caches are synchronized, and the change is saved to the database.
Through long-term analysis of the message traffic, it is finally confirmed whether the user is a dynamic user or a static user.
The present invention analyzes the raw data of RADIUS signaling messages and HTTP access messages in combination. Compared with the prior-art approaches based on IP, IP+Agent, user ID, or cookies, this improves the accuracy of user identification.
The present invention combines a local cache with a NoSQL database and uses a distributed synchronization mechanism for the local caches together with asynchronous database access. It can make use of the user data recorded by the operator, effectively remedies the shortcomings of the prior art in classifying users under big-data conditions, and determines more accurately whether a user is a dynamic user or a static user.
Although the present invention is disclosed above by way of preferred embodiments, these are not intended to limit it. Those skilled in the art may make any changes and modifications without departing from the spirit and scope of the present invention. The scope of protection of the present invention is defined by the claims.

Claims (3)

1. A method for improving the precision of identification of dynamic users and static users of a current network, characterized in that the method comprises the steps of:
(1) identifying dynamic users and static users using the online message of the RADIUS protocol;
(2) identifying dynamic users and static users using HTTP messages;
(3) repeating steps (1) and (2) until dynamic users and static users are accurately identified;
wherein each user carries two flags, user type and user status; the user type is either dynamic user or static user; the user status is either valid user or invalid user and is used to assist in identifying dynamic users and static users;
wherein the specific steps of step (1) are as follows:
S1: capture an online message and query the local cache;
S2: if the local cache contains no UID record, go to step S7; if it contains a UID record, determine whether the UID in the cache equals the UID in the RADIUS message; if they are equal, go to step S3, otherwise go to step S5;
S3: determine whether the user is valid;
S4: if the user is valid, first take the user offline, then store the user in the local cache and go to step S7; if the user is invalid, store the user directly in the local cache and go to step S7;
S5: determine whether the user is valid;
S6: if the user is valid, mark the user as a dynamic user; if the user is invalid, store the user in the local cache and go to step S7;
S7: mark the user as a dynamic user, synchronize the other caches, and update the NoSQL database;
and the specific steps of step (2) are as follows:
S1: capture an HTTP message and query the local cache;
S2: if the local cache contains no corresponding record, go to step S7; if it contains a corresponding record, go to step S3;
S3: determine whether the user is valid;
S4: if the user is valid, go to step S5; if the user is invalid, go to step S7;
S5: determine whether the user is a static user;
S6: if the user is not a static user, mark the user as a dynamic user; if the user is a static user, determine whether the user has exceeded the aging time; if not, the user is confirmed as a static user; if so, delete the local record, synchronize the other caches, and save to the database;
S7: check whether the user belongs to this operator; if so, query the NoSQL database asynchronously; if not, the traffic is transit traffic and the message is filtered out directly.
2. The method of claim 1, characterized in that the caches are distributed caches, and synchronization between caches uses the synchronization mode of the distributed cache.
3. The method of claim 1, characterized in that step (1) may also use the update message of the RADIUS protocol to identify dynamic users and static users, the steps being the same as for the online message.
CN201410013424.1A 2014-01-13 2014-01-13 Method for improving precision of identification of dynamic users and static users of current network Expired - Fee Related CN103763138B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410013424.1A CN103763138B (en) 2014-01-13 2014-01-13 Method for improving precision of identification of dynamic users and static users of current network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410013424.1A CN103763138B (en) 2014-01-13 2014-01-13 Method for improving precision of identification of dynamic users and static users of current network

Publications (2)

Publication Number Publication Date
CN103763138A (en) 2014-04-30
CN103763138B (en) 2017-01-11

Family

ID=50530301

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410013424.1A Expired - Fee Related CN103763138B (en) 2014-01-13 2014-01-13 Method for improving precision of identification of dynamic users and static users of current network

Country Status (1)

Country Link
CN (1) CN103763138B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106059637A (en) * 2016-05-31 2016-10-26 华中科技大学 Degree of freedom optimization method for multi-user MIMO (Multi-input Multi-output) broadcast channel under mixed CSI (Channel State Information)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004111809A1 (en) * 2003-06-18 2004-12-23 Telefonaktiebolaget Lm Ericsson (Publ) An arrangement and a method relating to ip network access
CN101379757A (en) * 2006-02-07 2009-03-04 思科技术公司 Methods and systems for providing telephony services and enforcing policies in a communication network

Also Published As

Publication number Publication date
CN103763138A (en) 2014-04-30


Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB03 Change of inventor or designer information

Inventor after: Cui Jingjing

Inventor after: Lin Jiajie

Inventor after: Dou Lejian

Inventor before: Cui Jingjing

Inventor before: Lin Jiajie

Inventor before: Liu Lina

Inventor before: Dou Lejian

COR Change of bibliographic data

Free format text: CORRECT: INVENTOR; FROM: CUI JINGJING LIN JIAJIE LIU LINA DOU LEJIAN TO: CUI JINGJING LIN JIAJIE DOU LEJIAN

GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220425

Address after: 100000 room 116, building 3, Shuangqiao (Shuangqiao dairy factory), Chaoyang District, Beijing

Patentee after: Beijing Xiaoxiang innovation Artificial Intelligence Technology Co.,Ltd.

Address before: 901, floor 9, building 5, courtyard 1, Shangdi East Road, Haidian District, Beijing 100028

Patentee before: BEIJING GEO POLYMERIZATION TECHNOLOGY Co.,Ltd.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170111