CN106302849A - Method for mobile-fixed fusion using carrier data
- Publication number
- CN106302849A (application CN201610630393.3A)
- Authority
- CN
- China
- Prior art keywords
- network
- account
- user
- carrier data
- fusion
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L61/00—Network arrangements, protocols or services for addressing or naming
- H04L61/45—Network directories; Name-to-address mapping
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Disclosed is a method for mobile-fixed fusion using carrier data. The method identifies more accurately which IDs belong to the same user and characterizes that user's attributes comprehensively, which makes in-depth user analysis easier and is valuable for precision marketing and financial credit reporting. The method comprises the steps of: (1) extracting each item of account information of a user from carrier data; (2) mapping the account IDs of the fixed network to identify the IDs belonging to the same user; (3) mapping the account IDs of the mobile network to identify the IDs belonging to the same user; (4) mapping the IDs of the fixed network and the mobile network to each other to fuse the fixed network and the mobile network. A system for mobile-fixed fusion using carrier data is also provided.
Description
Technical field
The invention belongs to the technical field of big data processing, and more particularly relates to a method for mobile-fixed fusion using carrier data.
Background
Because the data held by different enterprises and institutions is asymmetric, because big data companies largely compete with one another, and because data security and privacy are at stake, the major Internet companies do not open their data completely. They share data only with a limited number of highly trusted partners.
E-commerce companies such as Alibaba and JD.com hold users' consumption data, Baidu holds users' search data, and Tencent and Sina Weibo hold users' social data, but the likelihood of data sharing between these companies is very small.
The prior art analyzes data of a single attribute only, which makes it difficult to characterize a person comprehensively. When describing a person's attributes, consumption data, search data and social data are all important. It is therefore necessary to detect, identify and map a user's Taobao ID, JD.com ID, Baidu ID, QQ number, Sina Weibo ID and so on, so that the user can be analyzed in depth. This is important for precision marketing and financial credit reporting. Owing to the lack of data, no document that detects and maps the above IDs has been seen so far.
Summary of the invention
The technical problem solved by the invention is to overcome the deficiencies of the prior art by providing a method for mobile-fixed fusion using carrier data. The method identifies more accurately which IDs belong to the same user and characterizes that user's attributes comprehensively, which makes in-depth user analysis easier and is valuable for precision marketing and financial credit reporting.
The technical solution of the invention is a method for mobile-fixed fusion using carrier data, comprising the following steps:
(1) extracting each item of account information of a user from carrier data;
(2) mapping the account IDs of the fixed network to identify the IDs belonging to the same user;
(3) mapping the account IDs of the mobile network to identify the IDs belonging to the same user;
(4) mapping the IDs of the fixed network and the mobile network to each other to fuse the fixed network and the mobile network.
Because the invention uses carrier data to perform mobile-fixed fusion and maps each ID of a user, it can identify more accurately which IDs belong to the same user and characterize that user's attributes comprehensively, which makes in-depth user analysis easier and is valuable for precision marketing and financial credit reporting.
A system for mobile-fixed fusion using carrier data is also provided, comprising:
a data extractor, configured to extract each item of account information of a user from carrier data;
a fixed-network account adapter, configured to map the account IDs of the fixed network and identify the IDs belonging to the same user;
a mobile-network account adapter, configured to map the IDs in the mobile network and identify the IDs belonging to the same user;
a fusion device, configured to map the IDs of the fixed network and the mobile network to each other and fuse the fixed network and the mobile network.
Brief description of the drawings
Fig. 1 is a flowchart of the method for mobile-fixed fusion using carrier data according to the invention.
Fig. 2 is the overall architecture diagram according to the invention.
Fig. 3 is a schematic diagram of the rating matrix according to the invention.
Fig. 4 is a schematic diagram of the IDs corresponding to the rating matrix of Fig. 3.
Detailed description of the invention
When a web page is accessed, most pages embed advertisements. JD.com, Taobao, Tencent QQ, Baidu and others each operate a large advertisement scheduling platform, and the probability that a single page embeds advertisements from several different companies is high. Each advertisement is served under its owner's own domain, so a tree forms in which one parent page carries advertisements from several different domains.
IDs that appear under the same fixed-network account within a very short time window, and whose Referer (REFER) is the same web page or domain, can be aggregated and regarded as probably being IDs on the same machine. When two IDs appear together in this way under several different web pages, they are considered to be IDs of the same person. The more often they occur together, the higher that probability.
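The co-occurrence heuristic can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the record layout `(account, timestamp, referer, id)`, the 5-second window and the name `cooccurrence_counts` are all hypothetical, since the source specifies none of them.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_counts(records, window=5.0):
    """Count, for each ID pair, the distinct pages on which both IDs
    appear within `window` seconds under the same fixed-network account.

    records: iterable of (account, timestamp, referer, id) tuples
             (a hypothetical layout, not specified by the source).
    """
    by_account = defaultdict(list)
    for account, ts, referer, ident in records:
        by_account[account].append((ts, referer, ident))

    pages = defaultdict(set)  # (id_a, id_b) -> set of referers
    for events in by_account.values():
        events.sort()  # chronological order within one account
        for (t1, r1, a), (t2, r2, b) in combinations(events, 2):
            # Two different IDs, same referring page, close in time.
            if a != b and r1 == r2 and t2 - t1 <= window:
                pages[tuple(sorted((a, b)))].add(r1)
    return {pair: len(refs) for pair, refs in pages.items()}
```

A pair that co-occurs on more distinct pages gets a higher count, which can then be thresholded or ranked. The pairwise loop is quadratic per account and is only meant to show the idea.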
On the mobile terminal, when a user uses an APP and the APP exchanges data with its server, identifiers such as the JD.com account, QQ number, Taobao account, IMEI, mobile phone number and IDFA appear in the traffic. Different APPs expose different subsets of these IDs. The IDs can be mapped to one another through the IMEI or the mobile phone number.
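Mapping through the IMEI or phone number amounts to grouping every account ID by the device or subscriber identifier it was seen with. The sketch below assumes each message has already been parsed into a dictionary. The field names (`imei`, `phone`, `qq`, `jd`, `taobao`) and the function name are illustrative assumptions, not taken from the source.

```python
from collections import defaultdict

ANCHOR_KEYS = ("imei", "phone")  # assumed field names for the anchor IDs


def group_by_anchor(messages):
    """Collect every account ID seen alongside the same IMEI or phone number.

    messages: iterable of dicts such as {"imei": "...", "qq": "..."}
              (a hypothetical parsed-message layout).
    Returns {(anchor_type, anchor_value): {(id_type, id_value), ...}}.
    """
    groups = defaultdict(set)
    for msg in messages:
        # Everything that is not an anchor is treated as an account ID.
        others = {(k, v) for k, v in msg.items() if k not in ANCHOR_KEYS and v}
        for key in ANCHOR_KEYS:
            value = msg.get(key)
            if value:
                groups[(key, value)].update(others)
    return dict(groups)
```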
As shown in Fig. 1, the method for mobile-fixed fusion using carrier data comprises the following steps:
(1) extracting each item of account information of a user from carrier data;
(2) mapping the account IDs of the fixed network to identify the IDs belonging to the same user;
(3) mapping the account IDs of the mobile network to identify the IDs belonging to the same user;
(4) mapping the IDs of the fixed network and the mobile network to each other to fuse the fixed network and the mobile network.
Because the invention uses carrier data to perform mobile-fixed fusion and matches each ID of a user, it can identify more accurately which IDs belong to the same user and characterize that user's attributes comprehensively, which makes in-depth user analysis easier and is valuable for precision marketing and financial credit reporting.
Fig. 2 is the overall architecture diagram according to the invention. Its components are:
IDmapingRawDatagenner: collects raw data from each carrier as the data source for IDmaping.
HttpLogData: the original logs generated each day.
MobileIDmapingCleaner: cleans the input mobile logs and outputs 1-to-1 ID-to-ID relation data to support later data processing.
PCAPPIDMerger: performs ID mapping on the raw data, outputs the APP ID mapping result for the PC side, and stores it in a specified location.
MobileIDmapingMergeTool: maps the data in the mobile network, and maps the mobile data on the fixed-network side (data from devices that go online through Wi-Fi).
PcIDmapingTool: maps mobiledata and PcappData on the PC side and outputs the final PC mapping result.
In addition, mapping the account IDs of the fixed network in step (2) comprises mapping the IDs of the PC side and mapping the IDs of mobile devices.
In addition, the IDs of the PC side are mapped using collaborative filtering, with user similarity determined from the users' web browsing behavior. Normally the browsing behavior of a single user is consistent: the user cares about only a few categories of pages and sites, or the time difference between requests sent to the same DSP platform is very small. The access times of the URLs or user agents that carry the different IDs are used as the users' ratings. The rating matrix is shown in Fig. 3, and the IDs corresponding to the rating matrix are shown in Fig. 4.
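A rating matrix of this kind, and a similarity measure over it, can be sketched as follows. The source does not state the exact scoring rule or the similarity function, so this sketch assumes visit counts as scores and cosine similarity between ID rows. The `(id, url)` input layout and both function names are hypothetical.

```python
import math
from collections import defaultdict


def rating_matrix(visits):
    """Build {id: {url: score}}, scoring each URL by its visit count.

    visits: iterable of (id, url) pairs (a hypothetical layout).
    """
    matrix = defaultdict(lambda: defaultdict(float))
    for ident, url in visits:
        matrix[ident][url] += 1.0
    return matrix


def cosine(u, v):
    """Cosine similarity between two sparse rating rows."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(
        sum(x * x for x in v.values())
    )
    return dot / norm if norm else 0.0
```

Two IDs whose rows are nearly parallel browse the same pages in the same proportions, which is the behavioral signal the paragraph above describes.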
In addition, step (2) comprises the following sub-steps:
(2.1) extracting the two or more IDs contained in one message and splitting them into 1-to-1 relation pairs;
(2.2) filtering out interfering International Mobile Equipment Identity (IMEI) codes (the IMEIs of many counterfeit phones are written in batches, so many phones share the same IMEI);
(2.3) connecting all associated IDs using a connected-subgraph algorithm to obtain the result set.
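Sub-steps (2.1) and (2.3) can be sketched as pair expansion followed by a connected-components pass. The sketch uses union-find, which is one common way to compute connected components. The source names only a connected-subgraph algorithm, so the choice of union-find and the function names are assumptions.

```python
from itertools import combinations


def to_pairs(message_ids):
    """Step (2.1): split the IDs found in one message into 1-to-1 pairs."""
    return list(combinations(sorted(set(message_ids)), 2))


def connected_components(pairs):
    """Step (2.3): link all associated IDs using union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    groups = {}
    for x in list(parent):
        groups.setdefault(find(x), set()).add(x)
    return sorted(groups.values(), key=lambda s: sorted(s))
```

Each returned set is one candidate user: every ID in it is reachable from every other through a chain of shared messages.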
In addition, in step (2.2), the IMSIs to be filtered are selected using HIVE.
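The source does not give the selection criterion and performs it in HIVE. As a plain-Python stand-in, the sketch below assumes one plausible rule: a device identifier linked to more than `max_links` distinct IDs is treated as interfering and its pairs are dropped. The threshold, the `imei:` prefix convention and both function names are hypothetical.

```python
from collections import defaultdict


def noisy_device_ids(pairs, prefix="imei:", max_links=3):
    """Return device IDs linked to more than `max_links` distinct IDs.

    The threshold rule is an assumption; the source only says that
    interfering identifiers are selected and filtered.
    """
    linked = defaultdict(set)
    for a, b in pairs:
        if a.startswith(prefix):
            linked[a].add(b)
        if b.startswith(prefix):
            linked[b].add(a)
    return {dev for dev, others in linked.items() if len(others) > max_links}


def drop_noisy(pairs, noisy):
    """Remove every pair that touches an interfering device ID."""
    return [(a, b) for a, b in pairs if a not in noisy and b not in noisy]
```

Filtering before the connected-components pass matters because a single shared identifier would otherwise merge many unrelated users into one group.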
In addition, in step (3), all extracted accounts are aggregated by the user identification UID. The resulting account set is the account information of one person.
In addition, step (3) comprises the following sub-steps:
(3.1) splitting the raw data into 1-to-1 relation pairs using MapReduce (MAPREDUCE);
(3.2) aggregating all accounts along the UID dimension using HIVE to obtain the result.
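The two sub-steps follow the usual map-then-group pattern. The sketch below emulates it in plain Python rather than on MapReduce or HIVE. The record layout `{"uid": ..., "accounts": [...]}` and the function names are assumptions for illustration.

```python
from collections import defaultdict


def map_phase(record):
    """Step (3.1): emit one (uid, account) pair per account in a raw record."""
    uid = record["uid"]
    for account in record["accounts"]:
        yield uid, account


def reduce_phase(pairs):
    """Step (3.2): aggregate all accounts under each UID."""
    by_uid = defaultdict(set)
    for uid, account in pairs:
        by_uid[uid].add(account)
    return dict(by_uid)
```

The reduce step corresponds to a group-by on the UID column followed by collecting the distinct accounts of each group.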
In addition, in step (4), each account ID of a user on the mobile-network side is cross-validated against each account ID of the user on the fixed-network side to find identical IDs. A global mapping is then performed to fuse the fixed network and the mobile network.
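One way to read this cross-validation step is as a merge of ID sets that share at least one identical ID. The sketch below assumes each network has already produced one set of IDs per user. The function name `fuse` and the set-based representation are illustrative assumptions.

```python
def fuse(fixed_groups, mobile_groups):
    """Merge ID sets from both networks that share at least one ID."""
    merged = []
    for group in list(fixed_groups) + list(mobile_groups):
        group = set(group)
        # Absorb every existing set that overlaps the incoming one.
        overlapping = [g for g in merged if g & group]
        for g in overlapping:
            group |= g
            merged.remove(g)
        merged.append(group)
    return merged
```

A fixed-network set and a mobile-network set that both contain the same QQ number, for example, collapse into a single fused set, while sets without any shared ID stay separate.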
A system for mobile-fixed fusion using carrier data is also provided, comprising:
a data extractor, configured to extract each item of account information of a user from carrier data;
a fixed-network account adapter, configured to map the account IDs of the fixed network and identify the IDs belonging to the same user;
a mobile-network account adapter, configured to map the IDs in the mobile network and identify the IDs belonging to the same user;
a fusion device, configured to map the IDs of the fixed network and the mobile network to each other and fuse the fixed network and the mobile network.
The above is only a preferred embodiment of the invention and does not limit the invention in any form. Any simple modification, equivalent change or variation made to the above embodiment in accordance with the technical essence of the invention still falls within the protection scope of the technical solution of the invention.
Claims (9)
1. A method for mobile-fixed fusion using carrier data, characterized in that it comprises the following steps:
(1) extracting each item of account information of a user from carrier data;
(2) mapping the account IDs of the fixed network to identify the IDs belonging to the same user;
(3) mapping the account IDs of the mobile network to identify the IDs belonging to the same user;
(4) mapping the IDs of the fixed network and the mobile network to each other to fuse the fixed network and the mobile network.
2. The method for mobile-fixed fusion using carrier data according to claim 1, characterized in that in step (2), mapping the account IDs of the fixed network comprises mapping the IDs of the PC side and mapping the IDs of mobile devices.
3. The method for mobile-fixed fusion using carrier data according to claim 2, characterized in that the IDs of the PC side are mapped using collaborative filtering, with user similarity determined from the users' web browsing behavior.
4. The method for mobile-fixed fusion using carrier data according to claim 3, characterized in that step (2) comprises the following sub-steps:
(2.1) extracting the two or more IDs contained in one message and splitting them into 1-to-1 relation pairs;
(2.2) filtering out interfering International Mobile Equipment Identity (IMEI) codes;
(2.3) connecting all associated IDs using a connected-subgraph algorithm to obtain the result set.
5. The method for mobile-fixed fusion using carrier data according to claim 4, characterized in that in step (2.2), the IMSIs to be filtered are selected using HIVE.
6. The method for mobile-fixed fusion using carrier data according to claim 5, characterized in that in step (3), all extracted accounts are aggregated by the user identification UID, the resulting account set being the account information of one person.
7. The method for mobile-fixed fusion using carrier data according to claim 6, characterized in that step (3) comprises the following sub-steps:
(3.1) splitting the raw data into 1-to-1 relation pairs using MapReduce (MAPREDUCE);
(3.2) aggregating all accounts along the UID dimension using HIVE to obtain the result.
8. The method for mobile-fixed fusion using carrier data according to claim 7, characterized in that in step (4), each account ID of a user on the mobile-network side is cross-validated against each account ID of the user on the fixed-network side to find identical IDs, and a global mapping is then performed to fuse the fixed network and the mobile network.
9. A system for mobile-fixed fusion using carrier data, characterized in that it comprises:
a data extractor, configured to extract each item of account information of a user from carrier data;
a fixed-network account adapter, configured to map the account IDs of the fixed network and identify the IDs belonging to the same user;
a mobile-network account adapter, configured to map the IDs in the mobile network and identify the IDs belonging to the same user;
a fusion device, configured to map the IDs of the fixed network and the mobile network to each other and fuse the fixed network and the mobile network.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610630393.3A CN106302849A (en) | 2016-08-04 | 2016-08-04 | Method for mobile-fixed fusion using carrier data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106302849A (en) | 2017-01-04 |
Family
ID=57664782
Family Applications (1)
Application Number | Status | Priority Date | Filing Date | Title |
---|---|---|---|---|
CN201610630393.3A CN106302849A (en) | Pending | 2016-08-04 | 2016-08-04 | Method for mobile-fixed fusion using carrier data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106302849A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103678652A (en) * | 2013-12-23 | 2014-03-26 | 山东大学 | Information individualized recommendation method based on Web log data |
CN103780613A (en) * | 2014-01-21 | 2014-05-07 | 北京集奥聚合科技有限公司 | Method and system for linking fixed network and mobile network |
CN105227352A (en) * | 2015-09-02 | 2016-01-06 | 新浪网技术(中国)有限公司 | A kind of update method of user ID collection and device |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107515915A (en) * | 2017-08-18 | 2017-12-26 | 晶赞广告(上海)有限公司 | User based on user behavior data identifies correlating method |
CN107515915B (en) * | 2017-08-18 | 2020-02-18 | 晶赞广告(上海)有限公司 | User identification association method based on user behavior data |
CN109285036A (en) * | 2018-09-21 | 2019-01-29 | 中国联合网络通信集团有限公司 | Internet of things service marketing method, device and storage medium |
CN109285036B (en) * | 2018-09-21 | 2021-05-18 | 中国联合网络通信集团有限公司 | Internet of things service processing method and device and storage medium |
CN111414406A (en) * | 2019-01-04 | 2020-07-14 | 上海宏路数据技术股份有限公司 | Method and system for identifying same user in different channel transactions |
CN111414406B (en) * | 2019-01-04 | 2021-06-04 | 上海嗨普智能信息科技股份有限公司 | Method and system for identifying same user in different channel transactions |
CN110992096A (en) * | 2019-12-03 | 2020-04-10 | 秒针信息技术有限公司 | Prediction model training method and device and media identification prediction method and device |
CN110992096B (en) * | 2019-12-03 | 2023-08-29 | 秒针信息技术有限公司 | Prediction model training method and device and media identification prediction method and device |
CN111476596A (en) * | 2020-03-19 | 2020-07-31 | 深圳市酷开网络科技有限公司 | Family population data processing method, system and storage medium based on homologous equipment |
CN111476596B (en) * | 2020-03-19 | 2023-05-02 | 深圳市酷开网络科技股份有限公司 | Household population data processing method, system and storage medium based on homologous equipment |
CN112217675A (en) * | 2020-10-12 | 2021-01-12 | 北京电信规划设计院有限公司 | Combined analysis method for big data of fixed and mobile communication network |
CN112217675B (en) * | 2020-10-12 | 2023-03-24 | 北京电信规划设计院有限公司 | Combined analysis method for big data of fixed and mobile communication network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | C06 | Publication | |
 | PB01 | Publication | |
 | C10 | Entry into substantive examination | |
 | SE01 | Entry into force of request for substantive examination | |
 | RJ01 | Rejection of invention patent application after publication | Application publication date: 20170104 |