CN106302849A - A kind of method carrying out moving solid fusion by carrier data - Google Patents

A kind of method carrying out moving solid fusion by carrier data Download PDF

Info

Publication number
CN106302849A
CN106302849A CN201610630393.3A CN201610630393A CN106302849A CN 106302849 A CN106302849 A CN 106302849A CN 201610630393 A CN201610630393 A CN 201610630393A CN 106302849 A CN106302849 A CN 106302849A
Authority
CN
China
Prior art keywords
network
account
user
carrier data
fusion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610630393.3A
Other languages
Chinese (zh)
Inventor
马占国
裴亚明
孙永军
丁琦
王猛
吴双双
崔晶晶
林佳婕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING GEO POLYMERIZATION TECHNOLOGY Co Ltd
Original Assignee
BEIJING GEO POLYMERIZATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING GEO POLYMERIZATION TECHNOLOGY Co Ltd filed Critical BEIJING GEO POLYMERIZATION TECHNOLOGY Co Ltd
Priority to CN201610630393.3A priority Critical patent/CN106302849A/en
Publication of CN106302849A publication Critical patent/CN106302849A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/45Network directories; Name-to-address mapping

Abstract

A kind of method carrying out moving solid fusion by carrier data is disclosed, it can identify which ID belongs to same user more accurately, portray the attribute of this user comprehensively, conveniently user is analysed in depth, significant to various precision marketings and finance reference.It includes step: (1) extracts each account information of user by carrier data;(2) account ID of fixing network is mapped, identify the ID belonging to same user;(3) account ID of mobile network is mapped, identify the ID belonging to same user;(4) ID of fixing network and mobile network is mapped, be fixed the fusion of network and mobile network.Additionally provide a kind of system carrying out moving solid fusion by carrier data.

Description

A kind of method carrying out moving solid fusion by carrier data
Technical field
The invention belongs to the technical field that big data process, carry out moving solid melting by carrier data more particularly to one The method closed.
Background technology
Owing to the data message of each enterprises and institutions is asymmetric, and there is competitive relation in big data company substantially, then adds On relate to data safety and privacy concern, so each big Internet firm is all without completely by open for data, only can be with The affiliate that limited degree of belief is higher carries out data files.
Ali system and the electricity business such as Jingdone district have the consumption data of user, Baidu to have search data, Tengxun and the Sina of user micro- The rich social data having user, but carry out data files between these companies may be the least.
Prior art is all to be analyzed single attribute data, is difficult to portray the attribute of a people comprehensively.Right The when that people carrying out attribute description, consumption data, search data, social data etc. are again very important, it is therefore necessary to right The Taobao ID of user, Jingdone district ID, Baidu ID, No. QQ, Sina microblogging ID etc. carries out detecting, identify and mapping, in order to user is entered Row is analysed in depth, and this is all very important to various precision marketings and finance reference.And due to data deficiency, there is presently no See the document above-mentioned ID being detected and mapping.
Summary of the invention
The technology of the present invention solves problem: overcome the deficiencies in the prior art, it is provided that one is carried out by carrier data Moving the solid method merged, it can identify which ID belongs to same user more accurately, portray the attribute of this user comprehensively, side Just user is analysed in depth, significant to various precision marketings and finance reference.
The technical solution of the present invention is: this method being carried out by carrier data moving solid fusion, it include with Lower step:
(1) each account information of user is extracted by carrier data;
(2) account ID of fixing network is mapped, identify the ID belonging to same user;
(3) account ID of mobile network is mapped, identify the ID belonging to same user;
(4) ID of fixing network and mobile network is mapped, be fixed the fusion of network and mobile network.
Owing to the present invention uses carrier data to be the solid fusion of shifting, and each ID of map user, therefore, it is possible to more accurate Ground identifies which ID belongs to same user, portrays the attribute of this user comprehensively, conveniently analyses in depth user, to various Precision marketing and finance reference are significant.
Additionally provide a kind of system carrying out moving solid fusion by carrier data, comprising:
Data extractor, its configuration extracts each account information of user by carrier data;
Fixing network account adapter, account ID of fixing network is mapped by its configuration, identifies and belongs to same The ID of user;
Mobile network's account adapter, the ID in mobile network is mapped by its configuration, identifies and belongs to same use The ID at family;
Fusion device, the ID of fixing network and mobile network is mapped by its configuration, is fixed network and mobile network The fusion of network.
Accompanying drawing explanation
Fig. 1 is the flow chart carrying out moving the method for solid fusion by carrier data according to the present invention.
Fig. 2 is the integrated stand composition according to the present invention.
Fig. 3 is the schematic diagram of the rating matrix according to the present invention.
Fig. 4 is the schematic diagram of the ID that the rating matrix of Fig. 3 is corresponding.
Detailed description of the invention
When accessing a webpage, most of webpages all can nested advertisement, such as Jingdone district, Taobao, Tencent QQ, Baidu etc. all There is the biggest advertisement scheduling platform, and the probability of nested different gray advertisement is the highest in a webpage, each Advertisement is all under the territory controlled oneself, then just can form the feelings of the advertisement having multiple not same area under a tree i.e. father's page Condition.
Can be by converging in very short time window under same fixed network account, REFER is the ID in same webpage or territory, recognizes It is probably an ID of uniform machinery for it, when two ID are under different webpages, occurs, then it is assumed that these ID are the most simultaneously The ID of same people, occurrence probability is the highest simultaneously, and probability is the biggest.
At mobile phone terminal, if user uses an APP, when carrying out data transmission with server, it may appear that capital East account, No. QQ, Taobao's account, IMEI number, cell-phone number and IDFA etc., when accessing different APP, it may appear that different above-mentioned ID accounts Number.The mapping mapping of each ID can be carried out by IMEI number or cell-phone number.
As it is shown in figure 1, this method being carried out by carrier data moving solid fusion, it comprises the following steps:
(1) each account information of user is extracted by carrier data;
(2) account ID of fixing network is mapped, identify the ID belonging to same user;
(3) account ID of mobile network is mapped, identify the ID belonging to same user;
(4) ID of fixing network and mobile network is mapped, be fixed the fusion of network and mobile network.
Admittedly merge owing to the present invention uses carrier data to do shifting, and mate each ID of user, therefore, it is possible to more accurate Ground identifies which ID belongs to same user, portrays the attribute of this user comprehensively, conveniently analyses in depth user, to various Precision marketing and finance reference are significant.
Fig. 2 is the integrated stand composition according to the present invention.IDmapingRawDatagenner: from each operator collect former Beginning data, for the data source of IDmaping.The original log that HttpLogData: every day generates. MobileIDmapingCleaner: the mobile daily record to input is carried out, 1 couple 1 relation data of output ID Yu ID, for rear Issue provides help according to process.PCAPPIDMerger: to primary ID MAPPING process, and export APPID in PC MAPING result, and it is stored in appointment position.MobileIDmapingMergeTool: the data moved in net are carried out The Mobile data (data surfed the Net by wifi) of fixed network end are carried out mapping by mapping.PcIDmapingTool: will In PC, mobiledata and PcappData carries out mapping, the final mapping result of input Pc.
It addition, account ID to fixing network carries out mapping and includes in described step (2): carry out the ID of pc end mapping and The ID of mobile device maps.
It addition, the ID of PC end is mapped, use collaborative filtering, determine user's according to the webpage behavior of user Similarity.Under normal circumstances, same user network page line is to be close, is only concerned webpage and the website of a few class content.Or The time difference sent to same DSP platform is the least.By the access time to url or useragent including different ID As the scoring to user.Rating matrix is as shown in Figure 3.ID corresponding to rating matrix is as shown in Figure 4.
It addition, described step (2) include following step by step:
(2.1) extract the two or more ID comprised in a message out, be dispersed as 1 relation pair to 1;
(2.2) the International Mobile Equipment Identity code IMEI filtering interference (plants because the IMEI of a lot of mountain vallage mobile phones is batch Enter, can cause the IMEI of a lot of mobile phone identical);
(2.3) by UNICOM's subgraph algorithm, all ID being associated are together in series, obtain result set.
It addition, in described step (2.2), select IMSI to be filtered by HIVE method.
It addition, in described step (3), prove that UID is polymerized the account of all extractions by user identity, its account collection is one The account information of individual.
It addition, described step (3) include following step by step:
(3.1) by mapping stipulations MAPREDUCE method, former data are dispersed as 1 relation pair to 1;
(3.2) it is polymerized all accounts by HIVE method with UID dimension, obtains result.
It addition, in described step (4), by each account of user of each account ID of user of mobile network's end Yu fixing network-side ID carries out cross validation, finds identical No. ID, then carries out global mapping, is fixed the fusion of network and mobile network.
Additionally provide a kind of system carrying out moving solid fusion by carrier data, comprising:
Data extractor, its configuration extracts each account information of user by carrier data;
Fixing network account adapter, account ID of fixing network is mapped by its configuration, identifies and belongs to same The ID of user;
Mobile network's account adapter, the ID in mobile network is mapped by its configuration, identifies and belongs to same use The ID at family;
Fusion device, the ID of fixing network and mobile network is mapped by its configuration, is fixed network and mobile network The fusion of network.
The above, be only presently preferred embodiments of the present invention, and the present invention not makees any pro forma restriction, every depends on Any simple modification, equivalent variations and the modification made above example according to the technical spirit of the present invention, the most still belongs to the present invention The protection domain of technical scheme.

Claims (9)

1. the method carrying out moving solid fusion by carrier data, it is characterised in that: it comprises the following steps:
(1) each account information of user is extracted by carrier data;
(2) account ID of fixing network is mapped, identify the ID belonging to same user;
(3) account ID of mobile network is mapped, identify the ID belonging to same user;
(4) ID of fixing network and mobile network is mapped, be fixed the fusion of network and mobile network.
The method carrying out moving solid fusion by carrier data the most according to claim 1, it is characterised in that: described step (2) in, account ID to fixing network carries out mapping and includes: map the ID of pc end and the ID of mobile device maps.
The method carrying out moving solid fusion by carrier data the most according to claim 2, it is characterised in that: to PC end ID maps, and uses collaborative filtering, determines the similarity of user according to the webpage behavior of user.
The method carrying out moving solid fusion by carrier data the most according to claim 3, it is characterised in that: described step (2) include following step by step:
(2.1) extract the two or more ID comprised in a message out, be dispersed as 1 relation pair to 1;
(2.2) the International Mobile Equipment Identity code IMEI of interference is filtered;
(2.3) by UNICOM's subgraph algorithm, all ID being associated are together in series, obtain result set.
The method carrying out moving solid fusion by carrier data the most according to claim 4, it is characterised in that: described step (2.2), in, IMSI to be filtered is selected by HIVE method.
The method carrying out moving solid fusion by carrier data the most according to claim 5, it is characterised in that: described step (3) in, proving that UID is polymerized the account of all extractions by user identity, its account collection is the account information of a people.
The method carrying out moving solid fusion by carrier data the most according to claim 6, it is characterised in that: described step (3) include following step by step:
(3.1) by mapping stipulations MAPREDUCE method, former data are dispersed as 1 relation pair to 1;
(3.2) it is polymerized all accounts by HIVE method with UID dimension, obtains result.
The method carrying out moving solid fusion by carrier data the most according to claim 7, it is characterised in that: described step (4), in, each account ID of user of each account ID of user of mobile network's end Yu fixing network-side is carried out cross validation, finds phase Same No. ID, then carries out global mapping, is fixed the fusion of network and mobile network.
9. one kind carries out moving the system of solid fusion by carrier data, it is characterised in that: comprising:
Data extractor, its configuration extracts each account information of user by carrier data;
Fixing network account adapter, account ID of fixing network is mapped by its configuration, identifies and belongs to same user ID;
Mobile network's account adapter, the ID in mobile network is mapped by its configuration, identifies and belongs to same user's ID;
Fusion device, the ID of fixing network and mobile network is mapped by its configuration, is fixed network and mobile network Merge.
CN201610630393.3A 2016-08-04 2016-08-04 A kind of method carrying out moving solid fusion by carrier data Pending CN106302849A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610630393.3A CN106302849A (en) 2016-08-04 2016-08-04 A kind of method carrying out moving solid fusion by carrier data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610630393.3A CN106302849A (en) 2016-08-04 2016-08-04 A kind of method carrying out moving solid fusion by carrier data

Publications (1)

Publication Number Publication Date
CN106302849A true CN106302849A (en) 2017-01-04

Family

ID=57664782

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610630393.3A Pending CN106302849A (en) 2016-08-04 2016-08-04 A kind of method carrying out moving solid fusion by carrier data

Country Status (1)

Country Link
CN (1) CN106302849A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107515915A (en) * 2017-08-18 2017-12-26 晶赞广告(上海)有限公司 User based on user behavior data identifies correlating method
CN109285036A (en) * 2018-09-21 2019-01-29 中国联合网络通信集团有限公司 Internet of things service marketing method, device and storage medium
CN110992096A (en) * 2019-12-03 2020-04-10 秒针信息技术有限公司 Prediction model training method and device and media identification prediction method and device
CN111414406A (en) * 2019-01-04 2020-07-14 上海宏路数据技术股份有限公司 Method and system for identifying same user in different channel transactions
CN111476596A (en) * 2020-03-19 2020-07-31 深圳市酷开网络科技有限公司 Family population data processing method, system and storage medium based on homologous equipment
CN112217675A (en) * 2020-10-12 2021-01-12 北京电信规划设计院有限公司 Combined analysis method for big data of fixed and mobile communication network

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678652A (en) * 2013-12-23 2014-03-26 山东大学 Information individualized recommendation method based on Web log data
CN103780613A (en) * 2014-01-21 2014-05-07 北京集奥聚合科技有限公司 Method and system for linking fixed network and mobile network
CN105227352A (en) * 2015-09-02 2016-01-06 新浪网技术(中国)有限公司 A kind of update method of user ID collection and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678652A (en) * 2013-12-23 2014-03-26 山东大学 Information individualized recommendation method based on Web log data
CN103780613A (en) * 2014-01-21 2014-05-07 北京集奥聚合科技有限公司 Method and system for linking fixed network and mobile network
CN105227352A (en) * 2015-09-02 2016-01-06 新浪网技术(中国)有限公司 A kind of update method of user ID collection and device

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107515915A (en) * 2017-08-18 2017-12-26 晶赞广告(上海)有限公司 User based on user behavior data identifies correlating method
CN107515915B (en) * 2017-08-18 2020-02-18 晶赞广告(上海)有限公司 User identification association method based on user behavior data
CN109285036A (en) * 2018-09-21 2019-01-29 中国联合网络通信集团有限公司 Internet of things service marketing method, device and storage medium
CN109285036B (en) * 2018-09-21 2021-05-18 中国联合网络通信集团有限公司 Internet of things service processing method and device and storage medium
CN111414406A (en) * 2019-01-04 2020-07-14 上海宏路数据技术股份有限公司 Method and system for identifying same user in different channel transactions
CN111414406B (en) * 2019-01-04 2021-06-04 上海嗨普智能信息科技股份有限公司 Method and system for identifying same user in different channel transactions
CN110992096A (en) * 2019-12-03 2020-04-10 秒针信息技术有限公司 Prediction model training method and device and media identification prediction method and device
CN110992096B (en) * 2019-12-03 2023-08-29 秒针信息技术有限公司 Prediction model training method and device and media identification prediction method and device
CN111476596A (en) * 2020-03-19 2020-07-31 深圳市酷开网络科技有限公司 Family population data processing method, system and storage medium based on homologous equipment
CN111476596B (en) * 2020-03-19 2023-05-02 深圳市酷开网络科技股份有限公司 Household population data processing method, system and storage medium based on homologous equipment
CN112217675A (en) * 2020-10-12 2021-01-12 北京电信规划设计院有限公司 Combined analysis method for big data of fixed and mobile communication network
CN112217675B (en) * 2020-10-12 2023-03-24 北京电信规划设计院有限公司 Combined analysis method for big data of fixed and mobile communication network

Similar Documents

Publication Publication Date Title
CN106302849A (en) A kind of method carrying out moving solid fusion by carrier data
EP3089055B1 (en) Method and device for displaying information flows in social network, and server
Wang et al. How do developers react to restful api evolution?
Bradbury Data mining with LinkedIn
CN103546446B (en) Phishing website detection method, device and terminal
US20160063095A1 (en) Unstructured data guided query modification
CN104504081A (en) Intelligent analysis system for all-media detection and monitoring big data behaviors
CN104394118A (en) User identity identification method and system
CN102662965A (en) Method and system of automatically discovering hot news theme on the internet
US10002187B2 (en) Method and system for performing topic creation for social data
CN107515915A (en) User based on user behavior data identifies correlating method
CN103559619A (en) Response method and system for garment size information
CN107766470B (en) Intelligent statistical method, intelligent statistical display method and device for data sharing
CN104537341A (en) Human face picture information obtaining method and device
CN104035972A (en) Knowledge recommending method and system based on micro blogs
CN102664926A (en) Method and system for user information sharing
CN103390000A (en) Web searching method and web searching system
CN109684402A (en) One kind being based on big data platform metadata genetic connection implementation method
CN108536700A (en) A kind of method that nothing buries a collector journal
JP2019212345A (en) Internet content providing server and computer-readable recording medium including implemented method therefor
CN106126654B (en) A kind of inter-network station user-association method based on user name similarity
CN103544150A (en) Method and system for providing recommendation information for mobile terminal browser
CN103365961A (en) Accurate search-oriented website structurization labeling method and system
US9544384B2 (en) Method and system for pushing associated users in social networking service network
CN105204806A (en) Individual display method and device for mobile terminal webpage

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170104