CN108011761A - The method of collection and analysis visitor's data based on big data - Google Patents

The method of collection and analysis visitor's data based on big data

Info

Publication number
CN108011761A
Authority
CN
China
Prior art keywords
data
smart device
collection
device information
visitor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201711274116.4A
Other languages
Chinese (zh)
Inventor
肖梦清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
E-House (china) Enterprise Group Ltd By Share Ltd
Original Assignee
E-House (china) Enterprise Group Ltd By Share Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by E-House (china) Enterprise Group Ltd By Share Ltd
Priority to CN201711274116.4A
Publication of CN108011761A
Legal status: Withdrawn

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14 Network analysis or design
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14 Network analysis or design
    • H04L41/147 Network analysis or design for predicting network behaviour
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/06 Generation of reports
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/02 Network architectures or network communication protocols for network security for separating internal from external traffic, e.g. firewalls
    • H04L63/0227 Filtering policies
    • H04L63/0236 Filtering by address, protocol, port number or service, e.g. IP-address or URL

Abstract

A method for collecting and analyzing visitor data based on big data, including: step 1, collecting device information of smart devices; step 2, filtering the collected device information of the smart devices; step 3, integrating the filtered device information of the smart devices. The method uses smart wifi routers to collect data fully automatically and without interruption, so data can be gathered without spending much manpower or time. The invention covers a wider area and collects more comprehensive data. Through its algorithms, the data is filtered and cleaned, classified and integrated, and statistically analyzed to generate a real-time visitor system, a customer portrait system and a customer behavior system, providing merchants with strong data analysis support.

Description

The method of collection and analysis visitor's data based on big data
Technical field
The present invention relates to the field of physics, and more particularly to information processing technology, specifically a method for collecting and analyzing visitor data based on big data.
Background art
At present, people's level of consumption keeps rising, and they go out to shop and seek entertainment more and more often. Many large venues, such as shopping centers, malls, brand stores, sales offices and tourist attractions, are areas where visitor traffic concentrates. As foot traffic grows, competition among merchants is intensifying rapidly and the cost of acquiring a genuine customer keeps rising. For certain special venues, such as brand stores and sales offices, merchants pay particular attention to visiting customers. How to obtain data on visiting customers and then, through big data analysis, plan the right marketing activities or make major decisions is a problem merchants urgently need to solve.
Visitor data is still collected and analyzed manually today: staff are assigned to fill in forms, keep records and build analysis tables. Data collection of this kind is inefficient, its coverage is incomplete, the data dimensions are incomplete, and the cost of later analysis is high. Merchants can only use a small amount of data for simple chart analysis and cannot mine the value behind the data in any depth.
Summary of the invention
In view of the above technical shortcomings, the object of the present invention is to provide a method for collecting and analyzing visitor data based on big data that solves the above technical problems.
To solve the above technical problems, the method for collecting and analyzing visitor data based on big data provided by the present invention includes: step 1, collecting device information of smart devices; step 2, filtering the collected device information of the smart devices; step 3, integrating the filtered device information of the smart devices.
Step 1 includes: step 1.1, a terminal scans for the wireless signals of smart devices and establishes a connection with each smart device; step 1.2, the device information of the smart device is collected; step 1.3, the device information of the terminal, the collected device information of the smart device and environmental information are combined into a data packet, which is sent to a data server and stored.
In step 1.3, the device information of the terminal, the collected device information of the smart device and the environmental information are combined into a data packet that is sent to the data server as a csv file and stored.
The device information of the terminal is the MAC identification code of the terminal; the device information of the smart device is the MAC identification code of the smart device; the environmental information includes at least the current time, the time at which the smart device entered the venue, and the signal channel.
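As an illustration of how such a data packet might be laid out as csv, here is a minimal Python sketch. The field names, the ISO 8601 time format and the header row are assumptions made for the example; the patent names only the kinds of information carried, not a concrete layout.

```python
import csv
import io
from dataclasses import astuple, dataclass, fields


@dataclass
class ProbeRecord:
    # All field names are hypothetical; the patent lists only the kinds of data.
    terminal_mac: str  # MAC identification code of the collecting terminal
    device_mac: str    # MAC identification code of the smart device
    seen_at: str       # current time (ISO 8601 is an assumption)
    entered_at: str    # time the smart device entered the venue
    channel: int       # signal channel


def to_csv(records):
    """Serialize records to csv text with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow([f.name for f in fields(ProbeRecord)])
    writer.writerows(astuple(r) for r in records)
    return buf.getvalue()


packet = to_csv([
    ProbeRecord("AA:BB:CC:00:00:01", "11:22:33:44:55:66",
                "2017-12-06T10:00:03", "2017-12-06T09:58:00", 6),
])
```

The resulting text could then be written to a file and uploaded; how the upload happens is not specified in the source.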
Step 2 includes: step 2.1, processing the stored data packets; step 2.2, filtering the processed data packets.
In step 2.1, processing the stored data packets means splitting them at 3-second intervals to form the MAC identification code data of the smart devices.
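One possible reading of this splitting step is sketched below in Python: sightings are bucketed into fixed 3-second windows, repeated MACs inside a window collapse to one entry, and the first and last window in which each MAC appears is recorded. The input shape (epoch seconds paired with a MAC string) and the function names are assumptions for illustration, not the patent's own implementation.

```python
WINDOW_SECONDS = 3  # interval stated in the text


def split_into_windows(sightings, window=WINDOW_SECONDS):
    """Bucket (epoch_seconds, mac) sightings into fixed windows.

    Each window holds a set of MACs, so repeats inside a window collapse.
    """
    windows = {}
    for ts, mac in sightings:
        windows.setdefault(ts // window, set()).add(mac)
    return windows


def first_last_seen(windows, window=WINDOW_SECONDS):
    """Return {mac: (first_window_start, last_window_start)} in epoch seconds."""
    seen = {}
    for bucket in sorted(windows):
        start = bucket * window
        for mac in windows[bucket]:
            first, _ = seen.get(mac, (start, start))
            seen[mac] = (first, start)
    return seen


sightings = [(0, "aa"), (1, "aa"), (2, "bb"), (3, "aa"), (7, "aa")]
windows = split_into_windows(sightings)
spans = first_last_seen(windows)
```

The per-MAC spans are the kind of entry and departure times that a later dwell-time filter would need.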
Step 2.2 includes: step 2.2.1, filtering out the MAC identification code data of passers-by's smart devices; step 2.2.2, filtering out the MAC identification code data of staff members' smart devices.
In step 2.2.1, if, for the MAC identification code data of a smart device, departure time - entry time ≤ 480 seconds, the holder of that smart device is judged to be a passer-by and the MAC identification code data of that smart device is removed.
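The 480-second rule can be expressed directly in code. The following Python sketch assumes each visit is stored as an (entry time, departure time) pair keyed by MAC; that structure and the function names are hypothetical.

```python
from datetime import datetime, timedelta

PASSERBY_MAX_DWELL = timedelta(seconds=480)  # threshold from the text


def is_passerby(entered_at, left_at, limit=PASSERBY_MAX_DWELL):
    """True when departure time - entry time <= limit."""
    return left_at - entered_at <= limit


def drop_passersby(visits, limit=PASSERBY_MAX_DWELL):
    """visits: {mac: (entered_at, left_at)}; returns only non-passers-by."""
    return {mac: span for mac, span in visits.items()
            if not is_passerby(*span, limit)}


t0 = datetime(2017, 12, 6, 10, 0, 0)
visits = {
    "aa": (t0, t0 + timedelta(seconds=480)),  # exactly 8 minutes: passer-by
    "bb": (t0, t0 + timedelta(seconds=481)),  # just over 8 minutes: kept
}
kept = drop_passersby(visits)
```

Keeping the threshold as a parameter fits the source's remark that the rule can be tuned per deployment.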
Step 3 includes: step 3.1, integrating the real-time traffic of smart device holders; step 3.2, compiling statistics on the smart device holders.
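To make step 3.1 concrete, the Python sketch below derives a few daily traffic figures from already-filtered visits. The metric names, the visit tuple layout and the `known_macs` input (MACs seen on earlier days) are assumptions chosen for illustration; the patent does not specify the algorithm.

```python
from collections import Counter
from datetime import date, datetime


def daily_traffic(visits, day, known_macs):
    """Summarize one day's filtered visits.

    visits: list of (mac, entered_at, left_at) tuples with datetime values.
    known_macs: set of MACs already seen on earlier days (hypothetical input).
    """
    todays = [v for v in visits if v[1].date() == day]
    macs = {mac for mac, _, _ in todays}
    dwell = [(left - entered).total_seconds() for _, entered, left in todays]
    return {
        "visitors": len(macs),
        "avg_dwell_seconds": sum(dwell) / len(dwell) if dwell else 0.0,
        "arrivals_by_hour": Counter(entered.hour for _, entered, _ in todays),
        "returning": len(macs & known_macs),
        "new": len(macs - known_macs),
    }


visits = [
    ("aa", datetime(2017, 12, 6, 10, 0), datetime(2017, 12, 6, 10, 30)),
    ("bb", datetime(2017, 12, 6, 11, 0), datetime(2017, 12, 6, 11, 10)),
    ("cc", datetime(2017, 12, 5, 9, 0), datetime(2017, 12, 5, 9, 20)),
]
summary = daily_traffic(visits, date(2017, 12, 6), known_macs={"aa", "cc"})
```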
In step 3.2, the statistics on smart device holders include at least: place of origin, place of residence, place of work, gender ratio, marital status ratio, and mobile phone model ratio.
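Ratios such as the gender or phone model share reduce to counting labels and dividing by the total. A minimal Python sketch follows; the sample labels are invented for the example.

```python
from collections import Counter


def proportions(values):
    """Return each distinct value's share of the total, as a fraction."""
    counts = Counter(values)
    total = sum(counts.values())
    return {k: c / total for k, c in counts.items()} if total else {}


# Hypothetical phone model labels for four visitors.
phone_share = proportions(["iPhone", "iPhone", "Huawei", "Xiaomi"])
```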
The method of the present invention uses smart wifi routers to collect data fully automatically and without interruption, so data can be gathered without spending much manpower or time. The invention covers a wider area and collects more comprehensive data. Through its algorithms, the data is filtered and cleaned, classified and integrated, and statistically analyzed to generate a real-time visitor system, a customer portrait system and a customer behavior system, providing merchants with strong data analysis support.
Brief description of the drawings
Fig. 1 is a flow chart of the method for collecting and analyzing visitor data based on big data according to the present invention.
Detailed description
The method for collecting and analyzing visitor data based on big data according to the present invention is described in further detail below with reference to the accompanying drawing.
As shown in Fig. 1, the method for collecting and analyzing visitor data based on big data according to the present invention includes:
1) Data acquisition
Data acquisition first relies on a hardware device, the smart wifi router. A smart router is more capable than an ordinary home router and has a wifi probe function: once a user has turned on wireless networking, the wifi probe actively scans for wireless signals in its area and collects the MAC code of the user's device.
Specific acquisition steps:
1. The smart wifi router is first powered on, and an unobstructed network connection is ensured.
2. The router's built-in wifi probe scans for wireless signals within a certain coverage area, namely the handshake requests that mobile phones send over wireless. Once the smart wifi router and a user device complete a handshake, the router analyzes the handshake data and collects related data such as the device's MAC code.
3. The smart wifi router scans all mobile phones in the area that have wifi turned on and collects handshake data every three minutes (a device may have been scanned before or may be new). It then sends the collected MAC data back to the data server as a csv file. The returned data includes the unique MAC identification code of the smart wifi router, the unique MAC identification code of the connected user device, the current time, the signal channel, and so on.
4. The data server receives the data file returned by the smart wifi router, splits the csv file with a program, and stores every record in a database.
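The server-side step of splitting the csv file and storing each record could look roughly like the Python sketch below. SQLite, the table name `raw_log` and the column names are stand-ins chosen for the example; the source does not say which database or schema is used.

```python
import csv
import io
import sqlite3


def store_csv(conn, csv_text):
    """Parse returned csv text and insert every row into a raw log table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS raw_log ("
        "router_mac TEXT, device_mac TEXT, seen_at TEXT, channel INTEGER)"
    )
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [
        (r["router_mac"], r["device_mac"], r["seen_at"], int(r["channel"]))
        for r in reader
    ]
    conn.executemany("INSERT INTO raw_log VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")  # in-memory database for the example
sample = (
    "router_mac,device_mac,seen_at,channel\n"
    "AA:BB:CC:00:00:01,11:22:33:44:55:66,2017-12-06 10:00:03,6\n"
)
stored = store_csv(conn, sample)
count = conn.execute("SELECT COUNT(*) FROM raw_log").fetchone()[0]
```

Storing the rows untouched matches the store-first, clean-later approach the description takes for filtering.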
2) Data filtering
The smart wifi router returns collected data every three minutes. If a user stays at the venue, that user's information keeps being sent back to the server until the user leaves. The approach taken here is to store first and clean afterwards: the original data log is retained and the data is filtered later.
Specific filtering steps and scheme:
1. The returned data is divided into groups at 3-second intervals, repeated user MACs are filtered out, and the minimum and maximum acquisition time of each user MAC are retained.
2. Complete user MAC data is derived, including: the date, the user device MAC, the time the user entered the venue and the time the user left the venue. The user data collected each day is stored, by day, in a new data table.
3. Passer-by data is filtered out. Because the scanning range of the smart wifi router is fairly large, with a radius of about 50 to 100 meters, a venue in a busy location may pick up people who are not target customers, such as passers-by. A set of rules is therefore needed to judge whether collected user MAC information belongs to a passer-by, and if so, it is filtered out.
Passer-by filtering rule: if a MAC record has departure time - entry time ≤ 480 seconds, i.e. the user's dwell time is no more than 8 minutes, the user is judged to be a passer-by.
4. Blacklisted user data is filtered out. Merchant staff, cleaners and nearby residents appear near the merchant frequently, so their information is easily collected by the smart wifi router. These people are therefore defined as blacklisted users and filtered out.
Filtering rule for blacklisted users: data is recorded over 15 days, and a user who meets any one of the following conditions is regarded as a "blacklisted user":
Dwell time on a single day exceeds 3 hours.
Dwell time on any two days within the 15 days exceeds 2.5 hours.
Dwell time on any three days within the 15 days exceeds 2 hours.
Dwell time on any five days within the 15 days exceeds 1 hour.
The passer-by rule and the blacklist rule can be customized to suit the data collection and analysis scenario.
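The blacklist conditions above can be checked with a small table of thresholds. The Python sketch below assumes each condition means "at least N days in the 15-day window, each with a dwell time above the stated number of hours"; that interpretation, the input shape and the names are assumptions rather than the patent's stated implementation.

```python
# (minimum number of days, dwell hours each of those days must exceed)
BLACKLIST_RULES = [(1, 3.0), (2, 2.5), (3, 2.0), (5, 1.0)]


def is_blacklisted(daily_hours, rules=BLACKLIST_RULES):
    """daily_hours: dwell time in hours per day for one MAC over 15 days."""
    return any(
        sum(1 for h in daily_hours if h > hours) >= days
        for days, hours in rules
    )


staff = is_blacklisted([3.5, 0.0, 0.2])           # one day over 3 hours
regular = is_blacklisted([1.1, 1.2, 1.1, 1.3, 1.5])  # five days over 1 hour
visitor = is_blacklisted([2.6, 0.5])              # only one day over 2.5 hours
```

Because the rules are data rather than code, they can be swapped per venue, in line with the note that both rules are customizable.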
3) Data integration
Data integration is divided into real-time visitor traffic and visitor behavior portraits.
1. Real-time visitor traffic integration: based on the collected raw user data, algorithms calculate the total number of people collected that day, the average dwell time of that day's users, the daily traffic on the hour, the cumulative number of people collected, the number of repeat visitors, and the daily numbers of new and returning customers.
2. Visitor behavior portrait: because the MAC address of a user device is unique, in theory one MAC record corresponds to one user.
All user MACs are treated as one group and submitted to a data supplier, which returns the portrait labels of that group. These include: top 500 provinces and cities of origin, top 100 residential communities, top 100 office buildings, top 500 offline visited sites and top 500 offline house viewings (the top 100 and top 500 here are latitude and longitude coordinates, from which the exact community, building name and so on can be worked out), as well as gender ratio, married/unmarried ratio, mobile phone model ratio, shopping category ratio, top 10 mobile apps, top 10 shopping apps, top 10 hobbies and other portrait data.
As needed, a city latitude and longitude coordinate library is built; a real estate developer, for example, can build a base database of the coordinates of the properties currently for sale in the city.
The group behavior track returned by the data supplier, i.e. the latitude and longitude coordinates of the top 500 places the user group has visited, is matched against the base coordinate library built in step 2. This yields the new housing developments the user group has visited, which can support a real estate developer's decisions, for example targeting advertising precisely at residential communities and offline visited sites, or running marketing activities.
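Matching returned coordinates against a base coordinate library can be sketched as a nearest-distance check. The Python example below uses the haversine great-circle distance and a 0.5 km match radius; the radius, the property names and the coordinates are all hypothetical, and the patent does not state which matching method is used.

```python
import math

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def match_properties(points, properties, max_km=0.5):
    """Return names of properties within max_km of any visited point."""
    return [
        name
        for name, (plat, plon) in properties.items()
        if any(haversine_km(lat, lon, plat, plon) <= max_km for lat, lon in points)
    ]


# Hypothetical base coordinate library and supplier-returned points.
properties = {"Property A": (31.2304, 121.4737), "Property B": (39.9042, 116.4074)}
visited = [(31.2310, 121.4740)]
matched = match_properties(visited, properties)
```

A linear scan is adequate for a few hundred points; a spatial index would be a reasonable substitute at larger scale.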
The preferred embodiments of the invention have been described above, but the invention is not limited to these embodiments. Those skilled in the art can make various equivalent modifications or substitutions without departing from the spirit of the invention, and all such equivalent modifications or substitutions fall within the scope of this application.

Claims (10)

  1. A method for collecting and analyzing visitor data based on big data, characterized by comprising the following steps:
    Step 1, collecting device information of smart devices;
    Step 2, filtering the collected device information of the smart devices;
    Step 3, integrating the filtered device information of the smart devices.
  2. The method for collecting and analyzing visitor data based on big data according to claim 1, characterized in that step 1 includes:
    Step 1.1, a terminal scans for the wireless signals of smart devices and establishes a connection with each smart device;
    Step 1.2, the device information of the smart device is collected;
    Step 1.3, the device information of the terminal, the collected device information of the smart device and environmental information are combined into a data packet, which is sent to a data server and stored.
  3. The method for collecting and analyzing visitor data based on big data according to claim 2, characterized in that, in step 1.3, the device information of the terminal, the collected device information of the smart device and the environmental information are combined into a data packet that is sent to the data server as a csv file and stored.
  4. The method for collecting and analyzing visitor data based on big data according to claim 2 or 3, characterized in that the device information of the terminal is the MAC identification code of the terminal; the device information of the smart device is the MAC identification code of the smart device; and the environmental information includes at least the current time, the time at which the smart device entered the venue, and the signal channel.
  5. The method for collecting and analyzing visitor data based on big data according to claim 4, characterized in that step 2 includes:
    Step 2.1, processing the stored data packets;
    Step 2.2, filtering the processed data packets.
  6. The method for collecting and analyzing visitor data based on big data according to claim 5, characterized in that, in step 2.1, processing the stored data packets means splitting them at 3-second intervals to form the MAC identification code data of the smart devices.
  7. The method for collecting and analyzing visitor data based on big data according to claim 6, characterized in that step 2.2 includes:
    Step 2.2.1, filtering out the MAC identification code data of passers-by's smart devices;
    Step 2.2.2, filtering out the MAC identification code data of staff members' smart devices.
  8. The method for collecting and analyzing visitor data based on big data according to claim 7, characterized in that, in step 2.2.1, if, for the MAC identification code data of a smart device, departure time - entry time ≤ 480 seconds, the holder of that smart device is judged to be a passer-by and the MAC identification code data of that smart device is removed.
  9. The method for collecting and analyzing visitor data based on big data according to claim 7, characterized in that step 3 includes:
    Step 3.1, integrating the real-time traffic of smart device holders;
    Step 3.2, compiling statistics on the smart device holders.
  10. The method for collecting and analyzing visitor data based on big data according to claim 9, characterized in that, in step 3.2, the statistics on smart device holders include at least: place of origin, place of residence, place of work, gender ratio, marital status ratio, and mobile phone model ratio.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711274116.4A CN108011761A (en) 2017-12-06 2017-12-06 The method of collection and analysis visitor's data based on big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711274116.4A CN108011761A (en) 2017-12-06 2017-12-06 The method of collection and analysis visitor's data based on big data

Publications (1)

Publication Number Publication Date
CN108011761A true CN108011761A (en) 2018-05-08

Family

ID=62056854

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711274116.4A Withdrawn CN108011761A (en) 2017-12-06 2017-12-06 The method of collection and analysis visitor's data based on big data

Country Status (1)

Country Link
CN (1) CN108011761A (en)

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103458456A (en) * 2013-08-27 2013-12-18 中国科学院信息工程研究所 Method and device for user behavior detection based on mobile terminal Wi-Fi data
CN104778642A (en) * 2015-03-20 2015-07-15 北京邮电大学 Data processing method, server and monitoring system for campus users based on WiFi
CN106230653A (en) * 2016-07-21 2016-12-14 深圳奇迹智慧网络有限公司 A kind of data integration method with MAC code as major key
CN106651437A (en) * 2016-11-15 2017-05-10 武汉璞华大数据技术有限公司 Method for marketing promotion based on big data
CN106792456A (en) * 2016-12-21 2017-05-31 浙江省公众信息产业有限公司 Data analysis system and method
CN106982411A (en) * 2017-03-20 2017-07-25 华南理工大学 A kind of real-time passenger flow statistical method based on WIFI probe datas

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111126736A (en) * 2018-11-01 2020-05-08 百度在线网络技术(北京)有限公司 Enterprise passenger flow determining method and device, server and storage medium
CN111126736B (en) * 2018-11-01 2023-09-01 百度在线网络技术(北京)有限公司 Enterprise passenger flow volume determining method and device, server and storage medium
CN110188069A (en) * 2019-05-21 2019-08-30 广东和新科技有限公司 A kind of csv file storage method, device and computer equipment
CN110188069B (en) * 2019-05-21 2021-06-29 广东和新科技有限公司 CSV file storage method and device and computer equipment
CN110223110A (en) * 2019-05-27 2019-09-10 浙江华坤道威数据科技有限公司 A kind of DSP advertisement analysis system based on big data
CN111163490A (en) * 2019-12-13 2020-05-15 南京华苏软件有限公司 Method for analyzing household residents based on mobile phone mac
CN112188478A (en) * 2020-09-29 2021-01-05 浙江新再灵科技股份有限公司 Resident population data acquisition method based on big data analysis
CN112188478B (en) * 2020-09-29 2023-04-07 浙江新再灵科技股份有限公司 Resident population data acquisition method based on big data analysis

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20180508