CN108011761A - The method of collection and analysis visitor's data based on big data - Google Patents
The method of collection and analysis visitor's data based on big data Download PDFInfo
- Publication number
- CN108011761A CN108011761A CN201711274116.4A CN201711274116A CN108011761A CN 108011761 A CN108011761 A CN 108011761A CN 201711274116 A CN201711274116 A CN 201711274116A CN 108011761 A CN108011761 A CN 108011761A
- Authority
- CN
- China
- Prior art keywords
- data
- smart machine
- collection
- facility information
- visitor
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/147—Network analysis or design for predicting network behaviour
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/06—Generation of reports
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/02—Network architectures or network communication protocols for network security for separating internal from external traffic, e.g. firewalls
- H04L63/0227—Filtering policies
- H04L63/0236—Filtering by address, protocol, port number or service, e.g. IP-address or URL
Abstract
A kind of method of collection and analysis visitor's data based on big data, including:Step 1, the facility information of smart machine is gathered;Step 2, the facility information of the smart machine of collection is filtered;Step 3, the facility information of the smart machine after filtering is integrated.The method of collection and analysis visitor data of the present invention based on big data utilizes intelligence wifi routers, full-automatic continual gathered data, it is not necessary to spend too many manpower and time, can data acquisition, coverage of the present invention is wider, gather generation data more comprehensively, pass through advanced algorithm, data are carried out with filtering cleaning, classification is integrated, statistical analysis, generate real-time guest system, customer portrait system, customer action system, powerful data analysis situation is provided for businessman.
Description
Technical field
The present invention relates to physical field, more particularly to the information processing technology, particularly a kind of collection based on big data and
The method for analyzing visitor's data.
Background technology
At this stage, the level of consumption of people is increasingly lifted, and consumption of going out, the frequency of amusement is also higher and higher, many large-scale
Place, such as shopping center, market, designer-label store, sales office, tourist attractions etc. are all passenger flow accumulation regions.With the increase of flow of the people,
The competition of businessman is also aggravating rapidly, and the cost that businessman obtains effective client is also increasing, for some Code in Hazardous Special Locations, such as brand
Shop, sales office etc., businessman particularly pay attention to visiting client.The data message of visiting subscriber how is obtained, and then is based on big data
Analysis, makes correct marketing activity or very important decision, is businessman's urgent problem to be solved.
Collecting and analyzing for present visitor's data is still done by traditional artificial, that is, arrange work personnel, record of filling in a form,
Do tabular analysis etc..Such data acquisition not only inefficiency, covering surface is not complete, and data dimension is not complete, post analysis cost
Also it is higher.Businessman can only utilize low volume data, do some simple graphic analyses, can not more depth mining data behind valency
Value.
The content of the invention
For above-mentioned technological deficiency, it is an object of the invention to provide a kind of solution above-mentioned technical problem based on big data
Collection and analyze visitor's data method.
In order to solve the above technical problems, the method for collection and analysis visitor's data provided by the invention based on big data,
Including:Step 1, the facility information of smart machine is gathered;Step 2, the facility information of the smart machine of collection is filtered;Step
Rapid 3, the facility information of the smart machine after filtering is integrated.
Step 1 includes:Step 1.1, the wireless signal of terminal scanning smart machine and with smart machine establish connect;Step
1.2, gather the facility information of smart machine;Step 1.3, the equipment of the facility information of terminal, the smart machine collected is believed
Breath and environmental information are integrated into data packet and send to data server and store.
In step 1.3, the facility information of terminal, the facility information of the smart machine collected and environmental information are integrated into
Data packet is sent to data server and is stored with the form of csv files.
The facility information of terminal is the mac identification codes of terminal;The facility information of smart machine identifies for the mac of smart machine
Code;Environmental information includes at least current time, and smart machine enters case field time and signalling channel.
Step 2 includes:Step 2.1, the data packet of storage is handled;Step 2.2, the data packet after processing is carried out
Filtering.
Step 2.1, processing is carried out to split to the data packet of storage using 3 seconds as interval to the data packet of storage,
Form the mac identification code data of smart machine.
Step 2.2 includes:Step 2.2.1, filters the mac identification code data of the smart machine of passerby;Step 2.1.2, mistake
Filter the mac identification code data of the smart machine of staff.
In step 2.2.1, if time departure-entry time≤480 second of the mac identification code data of smart machine, sentence
The holder of the fixed smart machine is passerby, and the mac identification code data of the smart machine are removed.
Step 3 includes:Step 3.1, the real-time traffic of smart machine holder is integrated;Step 3.2, statistics smart machine is held
The information for the person of having.
In step 3.2, the information of statistics smart machine holder includes at least:Native place, residence, place of working, gender account for
Than, marriage accounting, mobile phone model accounting.
The method of collection and analysis visitor data of the present invention based on big data utilizes intelligence wifi routers, automatically not
The gathered data of interruption, it is not necessary to spend too many manpower and time, you can data acquisition, coverage of the present invention is wider,
Gather generation data more comprehensively, by advanced algorithm, data are carried out with filtering cleaning, classification is integrated, statistical analysis, and generation is real
When guest system, customer portrait system, customer action system, powerful data analysis situation is provided for businessman.
Brief description of the drawings
Fig. 1 is the method flow diagram of collection and analysis visitor data of the present invention based on big data.
Embodiment
It is further detailed to the method work of collection of the present invention based on big data and analysis visitor's data below in conjunction with the accompanying drawings
Explanation.
As shown in Figure 1, the method for collection and analysis visitor data of the present invention based on big data includes:
1)Part of data acquisition
Data acquisition first by hardware device be intelligent wifi routers, replica router is than normal domestic use router feature
It is powerful, there are wifi probe functions.After i.e. user opens wireless network, the wireless signal in this region of wifi probes meeting active scan,
Gather the mac codes of user equipment.
Specific acquisition step:
1st, intelligent wifi routers are powered first, and Logistics networks are unobstructed.
2nd, the wif probes that router carries can scan the wireless signal in certain area coverage, i.e. mobile phone wireless is sent
Handshake request, the success once intelligence wifi routers and user equipment are shaken hands, will analyze handshake data, collecting device
The related datas such as mac codes.
3rd, intelligence wifi routers can scan the mobile phone that all wifi in this region are opened, and collection of shaking hands every three minutes
Data(It is scanned before being likely to be, it may be possible to new), the mac data messages that then will be collected into, with csv trays
Formula passes back to data server.The data of passback include:Unique mac identification codes of intelligent wifi routers, subscriber connecting equipment
Unique mac identification codes, current time, signalling channel etc..
4th, data server receives the data file of intelligent wifi passbacks, splits csv files using program, and every number
According to deposit database.
2)Data filtering
Intelligent wifi routers will return the data of collection every three minutes, if user rests on place always,
Can then there is the information back of the user always to server, until user leaves.That is cleaned again using being first put in storage here does
Method.Retain the data logging of most original, the later stage filters data.
Specific filtration step and scheme:
1st, the data of passback will be gathered, one group was separated with 3 seconds, filters out the user mac of repetition, and retain user mac's
Minimum acquisition time, and maximum acquisition time,
2nd, complete user's mac data are drawn, including:Time on the same day, user equipment mac, user enter case field time, use
Case field time is left at family, the user data that will be gathered daily, in units of day, is stored in new tables of data.
3rd, passerby's data are filtered, because intelligence wifi router scans scopes are larger, about 50 meters~100 meters of radius, if
Place position passenger flow is more, can scan the crowd that some are not target customers, such as passerby.A socket gauge is had at this time
Whether the user mac information for then going to judge to collect is passerby, if it is, being filtered.
Passerby's filtering rule:Recorded if there is a mac, time departure-entry time≤480 second, i.e. user stop
Duration little Yu 8 minutes, then be judged as passerby.
4th, filtering black list user data, trade company staff, cleaning worker, these users of neighbouring resident personnel are because meeting
Frequently appear near businessman, information is easily collected by intelligent wifi routers, so being defined as blacklist to these personnel
User, is filtered.
The filtering rule of black list user:Data in 15 days are recorded, meet that following any one is considered as " black list user "
When stay time is small more than 3 within single day.
When any two days stay times are small more than 2.5 in 15 days.
When any three days stay times are small more than 2 in 15 days.
When any five days stay times are small more than 1 in 15 days.
Passerby's rule and blacklist rule can be according to the self-defined settings of Data acquisition and issuance scene.
3)Data Integration
Data Integration is divided into the real-time passenger flow of visitor and visitor's behavior portrait
1st, the real-time passenger flow of visitor is integrated, and on the basis of user's initial data according to collection, by certain algorithm, calculates the same day
Total number of persons is gathered, same day user be averaged stay time, daily integral point passenger flow situation, accumulative collection number, number of repeatedly visiting, often
Day old and new customers's number.
2nd, visitor's behavior is drawn a portrait, because the mac addresses of user equipment are unique, in theory, a mac data,
It correspond to a user.
Using all user mac as a colony, request data supplier is removed, so as to return to the portrait mark of this colony
Label.Including native place provinces and cities top500, residential quarter top100, work mansion top100, Condom top500 under line, is seen under line
Room top500(Top100 and top500 herein refers to latitude and longitude coordinates, can be gone to calculate with this latitude and longitude coordinates accurately small
Area and mansion name etc.)Gender accounting, married unmarried accounting, mobile phone model accounting, category accounting of doing shopping, mobile phone application
The portrait related datas such as apptop10, shopping app application top10, hobby top10.
As desired, city latitude and longitude coordinates storehouse is carried out, such as real estate developer, it is present city can be made
Selling the latitude and longitude coordinates basic database of building.
The group behavior track returned using data supplier, the i.e. longitude and latitude of the top500 that group of subscribers was gone are sat
Mark goes the base coordinate storehouse matching done with step 2, it can be deduced that the new flat that these group of subscribers were gone, can be ground
Produce developer and decision support is provided, can pinpoint dispensing advertisement for Condom under the line of residential quarter etc., do marketing activity.
The preferred embodiment to the invention is illustrated above, but the present invention is not limited to embodiment,
Those skilled in the art can also be made on the premise of without prejudice to the invention spirit a variety of equivalent deformations or
Replace, these equivalent deformations or replacement are all contained in scope of the present application.
Claims (10)
- A kind of 1. method of collection and analysis visitor's data based on big data, it is characterised in that include the following steps:Step 1, the facility information of smart machine is gathered;Step 2, the facility information of the smart machine of collection is filtered;Step 3, the equipment of the smart machine after filtering is believed Breath is integrated.
- 2. the method for collection and analysis visitor's data according to claim 1 based on big data, it is characterised in that step 1 includes:Step 1.1, the wireless signal of terminal scanning smart machine and with smart machine establish connect;Step 1.2, the facility information of smart machine is gathered;Step 1.3, the facility information of terminal, the facility information of the smart machine collected and environmental information are integrated into data packet Send to data server and store.
- 3. the method for collection and analysis visitor's data according to claim 2 based on big data, it is characterised in that step In 1.3, the facility information of terminal, the facility information of the smart machine collected and environmental information are integrated into data packet with csv The form of file sends to data server and stores.
- 4. collection based on big data according to Claims 2 or 3 and the method for analyzing visitor's data, it is characterised in that The facility information of terminal is the mac identification codes of terminal;The facility information of smart machine is the mac identification codes of smart machine;Environment Information includes at least current time, and smart machine enters case field time and signalling channel.
- 5. the method for collection and analysis visitor's data according to claim 4 based on big data, it is characterised in that step 2 include:Step 2.1, the data packet of storage is handled;Step 2.2, the data packet after processing is filtered.
- 6. the method for collection and analysis visitor's data according to claim 5 based on big data, it is characterised in that step 2.1, processing is carried out to split to the data packet of storage using 3 seconds as interval to the data packet of storage, forms smart machine Mac identification code data.
- 7. the method for collection and analysis visitor's data according to claim 6 based on big data, it is characterised in that step 2.2 including:Step 2.2.1, filters the mac identification code data of the smart machine of passerby;Step 2.1.2, the mac identification code data of the smart machine of filtration personnel.
- 8. the method for collection and analysis visitor's data according to claim 7 based on big data, it is characterised in that step 2.2.1 in, if time departure-entry time≤480 second of the mac identification code data of smart machine, judge the smart machine Holder be passerby, the mac identification code data of the smart machine are removed.
- 9. the method for collection and analysis visitor's data according to claim 7 based on big data, it is characterised in that step 3 include:Step 3.1, the real-time traffic of smart machine holder is integrated;Step 3.2, the information of smart machine holder is counted.
- 10. the method for collection and analysis visitor's data according to claim 9 based on big data, it is characterised in that step In rapid 3.2, the information of statistics smart machine holder includes at least:Native place, residence, place of working, gender accounting, marriage account for Than, mobile phone model accounting.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711274116.4A CN108011761A (en) | 2017-12-06 | 2017-12-06 | The method of collection and analysis visitor's data based on big data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711274116.4A CN108011761A (en) | 2017-12-06 | 2017-12-06 | The method of collection and analysis visitor's data based on big data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108011761A true CN108011761A (en) | 2018-05-08 |
Family
ID=62056854
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711274116.4A Withdrawn CN108011761A (en) | 2017-12-06 | 2017-12-06 | The method of collection and analysis visitor's data based on big data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108011761A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110188069A (en) * | 2019-05-21 | 2019-08-30 | 广东和新科技有限公司 | A kind of csv file storage method, device and computer equipment |
CN110223110A (en) * | 2019-05-27 | 2019-09-10 | 浙江华坤道威数据科技有限公司 | A kind of DSP advertisement analysis system based on big data |
CN111126736A (en) * | 2018-11-01 | 2020-05-08 | 百度在线网络技术(北京)有限公司 | Enterprise passenger flow determining method and device, server and storage medium |
CN111163490A (en) * | 2019-12-13 | 2020-05-15 | 南京华苏软件有限公司 | Method for analyzing household residents based on mobile phone mac |
CN112188478A (en) * | 2020-09-29 | 2021-01-05 | 浙江新再灵科技股份有限公司 | Resident population data acquisition method based on big data analysis |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103458456A (en) * | 2013-08-27 | 2013-12-18 | 中国科学院信息工程研究所 | Method and device for user behavior detection based on mobile terminal Wi-Fi data |
CN104778642A (en) * | 2015-03-20 | 2015-07-15 | 北京邮电大学 | Data processing method, server and monitoring system for campus users based on WiFi |
CN106230653A (en) * | 2016-07-21 | 2016-12-14 | 深圳奇迹智慧网络有限公司 | A kind of data integration method with MAC code as major key |
CN106651437A (en) * | 2016-11-15 | 2017-05-10 | 武汉璞华大数据技术有限公司 | Method for marketing promotion based on big data |
CN106792456A (en) * | 2016-12-21 | 2017-05-31 | 浙江省公众信息产业有限公司 | Data analysis system and method |
CN106982411A (en) * | 2017-03-20 | 2017-07-25 | 华南理工大学 | A kind of real-time passenger flow statistical method based on WIFI probe datas |
-
2017
- 2017-12-06 CN CN201711274116.4A patent/CN108011761A/en not_active Withdrawn
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103458456A (en) * | 2013-08-27 | 2013-12-18 | 中国科学院信息工程研究所 | Method and device for user behavior detection based on mobile terminal Wi-Fi data |
CN104778642A (en) * | 2015-03-20 | 2015-07-15 | 北京邮电大学 | Data processing method, server and monitoring system for campus users based on WiFi |
CN106230653A (en) * | 2016-07-21 | 2016-12-14 | 深圳奇迹智慧网络有限公司 | A kind of data integration method with MAC code as major key |
CN106651437A (en) * | 2016-11-15 | 2017-05-10 | 武汉璞华大数据技术有限公司 | Method for marketing promotion based on big data |
CN106792456A (en) * | 2016-12-21 | 2017-05-31 | 浙江省公众信息产业有限公司 | Data analysis system and method |
CN106982411A (en) * | 2017-03-20 | 2017-07-25 | 华南理工大学 | A kind of real-time passenger flow statistical method based on WIFI probe datas |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111126736A (en) * | 2018-11-01 | 2020-05-08 | 百度在线网络技术(北京)有限公司 | Enterprise passenger flow determining method and device, server and storage medium |
CN111126736B (en) * | 2018-11-01 | 2023-09-01 | 百度在线网络技术(北京)有限公司 | Enterprise passenger flow volume determining method and device, server and storage medium |
CN110188069A (en) * | 2019-05-21 | 2019-08-30 | 广东和新科技有限公司 | A kind of csv file storage method, device and computer equipment |
CN110188069B (en) * | 2019-05-21 | 2021-06-29 | 广东和新科技有限公司 | CSV file storage method and device and computer equipment |
CN110223110A (en) * | 2019-05-27 | 2019-09-10 | 浙江华坤道威数据科技有限公司 | A kind of DSP advertisement analysis system based on big data |
CN111163490A (en) * | 2019-12-13 | 2020-05-15 | 南京华苏软件有限公司 | Method for analyzing household residents based on mobile phone mac |
CN112188478A (en) * | 2020-09-29 | 2021-01-05 | 浙江新再灵科技股份有限公司 | Resident population data acquisition method based on big data analysis |
CN112188478B (en) * | 2020-09-29 | 2023-04-07 | 浙江新再灵科技股份有限公司 | Resident population data acquisition method based on big data analysis |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108011761A (en) | The method of collection and analysis visitor's data based on big data | |
CN105354196B (en) | Information-pushing method and information push-delivery apparatus | |
CN104050589B (en) | The method and system that a kind of social public platform ad click conversion ratio is assessed | |
CN105657666B (en) | A kind of commercial employment crowd residence recognition methods based on mobile phone location data | |
JP4364936B1 (en) | Liquid population estimation system, liquid population estimation method, and liquid population estimation program | |
CN111222744A (en) | Method for determining built environment and rail passenger flow distribution relation based on signaling data | |
CN105208528B (en) | A kind of system and method for identifying with administrative staff | |
CN102849382A (en) | Digitalized intelligent processing method for classifying community garbage | |
TR201909349T4 (en) | Empirical expert identification and question guidance system and method. | |
CN105451234B (en) | A kind of suspicious number analysis method and device based on Signalling exchange data | |
CN108632746A (en) | A kind of method of determining region flow of the people | |
CN106658564B (en) | The recognition methods of domestic consumer a kind of and device | |
CN106998262A (en) | A kind of System and method for for recognizing Internet user | |
CN107169805A (en) | A kind of advertisement placement method, apparatus and system | |
CN110290467B (en) | Method and device for acquiring stop point, business area service range and influence factor | |
CN109151780A (en) | A kind of collecting method based on WIFI probe, apparatus and system | |
CN107637098B (en) | Method and system for estimating the number of persons in one or more persons population a posteriori | |
EP3241367B1 (en) | Method and system for real-time computing of origin-destination matrices relating to attendees at a public happening through analysis of mobile communication network data | |
CN103810623A (en) | Real-time automatic marketing method and system | |
CN108696420A (en) | Information push method in communication network and device | |
CN103701693A (en) | Message handling method and system in communication process | |
CN114741612B (en) | Consumption habit classification method, system and storage medium based on big data | |
CN101557582B (en) | Method and device for mobile communication user information statistics | |
CN105376709A (en) | Regional population statistical method and system based on mobile communication technology | |
CN102075850B (en) | Method and device for determining occupational information of mobile subscriber |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20180508 |
|
WW01 | Invention patent application withdrawn after publication |