CN106649636A - Personnel mobility analysis method and device based on mobile terminal - Google Patents

Personnel mobility analysis method and device based on mobile terminal Download PDF

Info

Publication number
CN106649636A
CN106649636A CN201611122229.8A CN201611122229A CN106649636A CN 106649636 A CN106649636 A CN 106649636A CN 201611122229 A CN201611122229 A CN 201611122229A CN 106649636 A CN106649636 A CN 106649636A
Authority
CN
China
Prior art keywords
information
data
pretreatment
mark
mobile terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611122229.8A
Other languages
Chinese (zh)
Inventor
张祖兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Ruian Technology Co Ltd
Original Assignee
Beijing Ruian Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Ruian Technology Co Ltd filed Critical Beijing Ruian Technology Co Ltd
Priority to CN201611122229.8A priority Critical patent/CN106649636A/en
Publication of CN106649636A publication Critical patent/CN106649636A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/221Column-oriented storage; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2477Temporal data queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Fuzzy Systems (AREA)
  • Quality & Reliability (AREA)
  • Remote Sensing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The embodiment of the invention discloses a personnel mobility analysis method and device based on a mobile terminal. The method comprises the steps that network data in the mobile terminal is collected in real time according to a preset collection rule, and the collected data is preprocessed to obtain preprocessed information, wherein the preprocessed information at least comprises position information, user identifiers and collection time; the preprocessed information is marked according to a preset marking rule, and after grouping is conducted, the preprocessed information with the best representativeness in each group is determined as key information; the key information is displayed in a geographic information system according to preset forms, wherein the preset forms comprise graphs and/or characters. According to the method and device, the personal mobility state can be updated automatically, and the personnel mobility analysis efficiency is improved.

Description

A kind of mobility of people analysis method and device based on mobile terminal
Technical field
The present embodiments relate to data analysis technique, more particularly to a kind of mobility of people analysis based on mobile terminal Method and device.
Background technology
Flow of personnel and statistics have important function to regional population's analysis.
Current mobility of people analysis also rests on the artificial statistics stage, i.e., it is main or by artificial gathered data and point Analysis, so not only consumes more time when early stage gathered data, whole process analysis efficiency is substantially reduced.
In view of this, it is special to propose the present invention.
The content of the invention
The embodiment of the present invention provides a kind of mobility of people analysis method and device based on mobile terminal, to realize improving The purpose of mobility of people analysis efficiency.
In a first aspect, a kind of mobility of people analysis method based on mobile terminal is embodiments provided, including:
According to default collection rule the network data in mobile terminal is acquired in real time, pre- place is carried out to gathered data Reason obtains pretreatment information, and the pretreatment information at least includes positional information, ID and acquisition time;
Carry out mark to the pretreatment information according to default mark rule, it is grouped after will most represent in each packet The pretreatment information of property is defined as key message;
The key message is included in GIS-Geographic Information System according to presets, the presets include chart And/or word.
Second aspect, the embodiment of the present invention additionally provides a kind of mobility of people analytical equipment based on mobile terminal, bag Include:
Collection pretreatment module, for being adopted to the network data in mobile terminal in real time according to default collection rule Collection, pretreatment is carried out to gathered data and obtains pretreatment information, and the pretreatment information at least includes positional information, ID And acquisition time;
Mark grouping module, for carrying out mark to the pretreatment information according to default mark rule, it is grouped after will Most representational pretreatment information is defined as key message in each packet;
Display module, it is described default for the key message to be included in GIS-Geographic Information System according to presets Form includes chart and/or word.
The embodiment of the present invention is right by being acquired to the network data in mobile terminal in real time according to default collection rule Gathered data carries out pretreatment and obtains pretreatment information, and mark is carried out to pretreatment information according to default mark rule, and Jing divides Most representational pretreatment information in each packet is defined as into key message after group, finally by key message according to default shape Formula is displayed in GIS-Geographic Information System, wherein, when corresponding positional information, ID and collection are contained in key message Between, that is, the real time position of each user is represented, key message is converted into corresponding chart and/or written form and is illustrated in In GIS-Geographic Information System, you can intuitively show the real time position of a large number of users in specific region, thus it is capable of achieving to automatically update Flow of personnel state, improves the efficiency of mobility of people analysis.
Description of the drawings
Fig. 1 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention one is provided Figure;
Fig. 2 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention two is provided Figure;
Fig. 3 is that a kind of structure of mobility of people analytical equipment based on mobile terminal that the embodiment of the present invention three is provided is shown It is intended to.
Specific embodiment
With reference to the accompanying drawings and examples the present invention is described in further detail.It is understood that this place is retouched The specific embodiment stated is used only for explaining the present invention, rather than limitation of the invention.It also should be noted that, in order to just Part related to the present invention rather than entire infrastructure are illustrate only in description, accompanying drawing.
Embodiment one
Fig. 1 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention one is provided Figure, the present embodiment is applicable to carry out the situation of mobility of people analysis automatically, and the method can be by the people based on mobile terminal Performing, the device can be realized member's liquidity analysis device by way of hardware and/or software.With reference to Fig. 1, the present embodiment is carried For specifically included based on the mobility of people analysis method of mobile terminal:
The default collection rule of S101, basis is acquired in real time to the network data in mobile terminal, and gathered data is entered Row pretreatment obtains pretreatment information.
Wherein, the pretreatment information at least includes positional information, ID and acquisition time.Positional information can be The gps coordinate of user, ID can be cell-phone number.
Wherein it is possible to be acquired to the network data in mobile terminal in real time using data acquisition equipment, data acquisition Equipment can be the application software, or web crawlers in mobile terminal with certain collection authority, as long as user uses When mobile terminal will produce volume of data, data acquisition equipment just can constantly gather what is wanted according to default collection rule Data.Exemplary, the identity information of user, such as identification card number, age, sex can be collected when user's registration website Deng, the address information of user can be collected when user does shopping, the logical of user can be collected when user checks address list Information in news, the positional information of user can be collected when user is using map.The data for collecting can generate tab Text is pre-processed again.Wherein, the function of tab (being also tab stop) is being hung down in the case where form is not used Nogata is to pressing column alignment text.
Wherein, default collection rule could be for gathering the regular expression of particular data, for example, can be that collection is used Regular expression, regular expression of collection user mobile phone number of family position etc..Due to the number collected according to regular expression According to still there is some to be invalid data, therefore the data also needed to gathering carry out the conversion of rubbish filtering, data cleansing and form Valid data are extracted Deng operation.
S102, carry out mark to the pretreatment information according to default mark rule, it is grouped after by each packet most Representative pretreatment information is defined as key message.
Wherein, pretreatment information is carried out mark can preferably flag data, be easy to follow-up data processing procedure more preferable Corresponding data is searched on ground, preferably can carry out mark to pretreatment information according to acquisition time, not the data addition after mark The mark period.
Wherein, although data have eliminated most invalid data after pretreatment, but because collection is real Shi Jinhang's, the valid data amount for staying is still very big, therefore data can be marked (during as marked according to ID and mark Section) data are grouped, and the key message in the packet is filtered out according to certain screening rule, key message can be this Most representational pretreatment information in packet.
S103, the key message is included in GIS-Geographic Information System according to presets.
Wherein, the presets include chart and/or word.
Wherein, GIS-Geographic Information System (Geographic Information System, GIS) is that one kind is specific very Important space information system.It is in the case where computer hardware and software system is supported, to earth top layer all or in part (including big Gas-bearing formation) the relevant geographic distribution data in space is acquired, stores, managing, computing, the technology system that analyzes, be shown and described System.
The technical scheme of the present embodiment, by being carried out to the network data in mobile terminal in real time according to default collection rule Collection, pretreatment is carried out to gathered data and obtains pretreatment information, and according to default mark rule pretreatment information is carried out to beat Mark, it is grouped after most representational pretreatment information in each packet is defined as into key message, finally key message is pressed Be displayed in GIS-Geographic Information System according to presets, wherein, corresponding positional information, ID are contained in key message with And acquisition time, that is, the real time position of each user is represented, key message is converted into corresponding chart and/or written form And be illustrated in GIS-Geographic Information System, you can the real time position of a large number of users in specific region is intuitively shown, is thus capable of achieving Flow of personnel state is automatically updated, the efficiency of mobility of people analysis is improved.
On the basis of above-mentioned technical proposal, the pretreatment information can also preferably include subscriber identity information, age At least one in information and locality information;The ID is user mobile phone number.
Wherein, subscriber identity information can be the numbers such as identification card number, home address, name, the pet name, mailbox, social account According to locality information can be districts and cities' code locality.
Embodiment two
Fig. 2 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention two is provided Figure, the present embodiment, preferably to operating S101, S102 and S103 further to optimize, is joined on the basis of above-described embodiment one Fig. 2 is examined, concrete grammar is as follows:
The default collection rule of S201, basis is acquired in real time to the network data in mobile terminal.
S202, gathered data is carried out data correlation, cleaning duplicate data, and be converted into preset format pretreatment letter Breath.
Wherein it is possible to the data for collecting are associated by pretreatment cluster, clean duplicate data go forward side by side row format turn Change.
Specifically, association is referred to and together forms the data correlation of containing identical content two or more than two newly Data, exemplary, containing cell-phone number 1, identification card number and name, data 2 are containing cell-phone number 1, address information and year for data 1 Age, two datas all contain cell-phone number 1, then can be associated together two datas, form data 3, and the data 3 contain cell-phone number 1st, identification card number, name, address information and age.
Specifically, it is mutually to compare the data with the data of former storage to clean duplicate data, washes duplicate data.
Specifically, form conversion is to carry out data to unify conversion, and the Supplementing Data information to gathering, as collected Positional information is a latitude and longitude coordinates, then also add to the latitude and longitude coordinates place place name in data, after then processing Data be converted into the pretreatment information of preset format.Wherein, preset format can be protobuf forms, and protobuf is Protocol buffers, are a kind of forms of data exchange of google, it independently of language, independently of platform.google There is provided the realization of multilingual:Java, c#, c++, go and python, each realization all contains the compiling of corresponding language Device and library file, because it is a kind of binary form, than carrying out the fast many of data exchange using xml, can use it The data exchange under data communication or isomerous environment between Distributed Application, it is all very excellent as a kind of efficiency and compatibility Elegant binary data transmission form, can be used for the numerous areas such as network transmission, configuration file, data storage.
S203, by pretreatment information Jing distributed information system caching after land in distributed file system preserve.
Wherein, distributed message system can be kafka, and Kafka is a high-performance, distributed message system, extensively For scenes such as log collection, stream data process, online and offline message distributions.Traditional ActiveMQ is compared, Kafka is simultaneously Row ability and handling capacity are higher.Due to the data volume difference that data acquisition equipment is gathered in different periods, therefore exist such Situation:The pretreatment information that a certain period obtains is considerably less, and the pretreatment information that a certain period obtains rapidly explodes again, if do not had There is kafka to cache, be then easy to the situation that system crash occurs, therefore kafka primarily serves the effect of equally loaded.
Wherein, distributed file system can be HDFS (Hadoop Distributed File System), and HDFS is The distributed storage file system of Hadoop systems, be especially suitable for storage process super large file, super large file typically refer to hundred MB, The file of hundreds of TB sizes is set, and at present in actual applications, HDFS can be used for the data of storage management PB level.HDFS Design set up on the basis of more response " write-once, repeatedly read-write " task, it means that a data set is once Generated by data source, will be replicated and be distributed in different memory nodes, be then responding to various data analysis tasks Request.
Further, because HDFS there are certain requirements to data memory format, typically by data rowization, and with parquet Form is landed in HDFS.Wherein, parquet is the column storage format towards analytic type business, can skip and not meet bar The data of part, only read the data for needing, and reduce I/O data amount, and compressed encoding is reducing disk storage space.Due to same row Data type be the same, more efficient compressed encoding (such as Run Length Encoding and Delta can also be used Encoding) further save storage empty, only read the row for needing, supporting vector computing can obtain more preferable scan performance, Simultaneously can be with follow-up spark sql (being a spark component for processing structure data) seamless combination.
Further, can be every subsequently to carry out according to date field sectional lists to landing to the pretreatment information in HDFS The data preanalysis of day/when is called, while can also be set up according to critical fielies such as acquisition time and ID index, in case subsequently Real-time query data are prepared.
S204, according to acquisition time to pre-process information every data carry out mark.
Specifically, for the pretreatment information being stored in HDFS, can be somebody's turn to do according to the parsing of date catalogue using spark sql The Parquet files landed under date catalogue, and the data after parsing are converted into RDD (Resilient Distributed Datasets, elasticity distribution formula data set).Wherein, RDD is an abstract concept of distributed memory, and RDD provides a kind of high The limited shared drive model of degree, i.e. RDD is the set of read-only record partitioning, can only be by performing what is determined in other RDD Conversion operation and create, but these limit cause realize that fault-tolerant expense is very low.For developer, RDD can be regarded as One object of Spark, itself is run in internal memory, and computing is read and write in internal memory and reaches as high as the 100 of disk read-write computing Times.Therefore, Parquet file translations can be improved into processing speed of the spark systems to data for RDD.
Further, the RDD in internal memory is traveled through, and to RDD marks.It is preferred that mark rule is to be carried out beating according to acquisition time Mark.Exemplary, it is assumed that the pretreatment information in HDFS is that then mark rule can be 24 little with daily date field sectional lists When be the cycle, according to acquisition time, morning zero point is designated as 1, every 10 minutes mark values+1, for marking per bar record daily Which period belonged to, and the when segment mark after mark is added in the data, stored in the RDD and original HDFS in internal memory Data all accordingly add when segment mark.After 24 hours, then the data by second day acquisition time for morning zero point are designated as 1, every 10 minutes mark values+1, by that analogy.
S205, according to after mark when segment mark and ID pretreatment information is grouped, and by after packet Per data with the storage of key-Value forms.
Wherein, the data in key set include ID and when segment mark, the data in the Value set Including ID, when segment mark, positional information, subscriber identity information, age information and locality at least in information Kind.
Specifically, it is possible to use in RDD groupby operations according to ID (such as cell-phone number) and when segment mark carry out point Group, i.e., data of the identical cell-phone number in a certain period are one group, and Key-Value formatted datas, wherein user are become after packet Cell-phone number is designated, positional information is longitude and latitude, then Key-Value is specially ((cell-phone number, when segment mark), (cell-phone number, when Segment mark, Jing dimensions, districts and cities' code, subscriber identity information, acquisition time)), grouped data is convenient for data statistics, merges And than peering, RDD continues to be placed in internal memory after packet.
S206, all Value set is ranked up according to acquisition time, and by Preset Time in each packet Value set is defined as key message.
Specifically, the Value set in packet RDD can be ranked up according to acquisition time, so each packet RDD In data be according to the ordered data of Time alignment.Further, the Value of ad-hoc location in each packet can be gathered, That is the Value set of preset time period is defined as key message.Exemplary, the information for storing in a certain packet is mobile phone Number 1 12:00 to 12:10 points of all data, per data according to acquisition time order ordered arrangement, it is considered that per group of number Most middle data accuracy highest according in, it is most representative, then it is 12 by data most middle in this group of data:05 point Value set is defined as key message.
Further, if follow-up data processes operation and needs to show corresponding flow of personnel feelings according to home zone of mobile phone number Condition, then also need to be associated operation with home zone of mobile phone number.Specifically, by the RDD in internal memory with home zone of mobile phone number storehouse Join operations, the mobile phone ownership place that mark is recorded per bar are carried out, and are converted into new RDD to be saved in internal memory and original HDFS, newly RDD data forms can for (cell-phone number, when segment mark, Jing dimensions, districts and cities' code, subscriber identity information, acquisition time, mobile phone Ownership place), can so generate the daily high frequency of mass data (10 minutes) track record.
The corresponding key message of the default screening conditions screening specific user colony of S207, basis.
Wherein, the default screening conditions include specifying age bracket, specify districts and cities, designated mobile phone ownership place, specify and adopt Any one in collection ground information or its combination.
Specifically, it is possible to use spark sql filter out specific user colony every N minutes according to default screening conditions Key message.Exemplary, obtain key message of the specified all users of districts and cities every N minutes using spark sql;Use Spark sql are obtained and are specified districts and cities and specify all users of roaming place every the key message of N minutes;Using spark sql Obtain specified districts and cities and the age is presetting key message of all users in age bracket every N minutes.
S208, the positional information in the key message after screening is included in GIS-Geographic Information System according to presets.
Specifically, the longitude and latitude in the key message after screening is presented in the form of chart and/or word in GIS and is tied Really.
Embodiment three
Fig. 3 is that a kind of structure of mobility of people analytical equipment based on mobile terminal that the embodiment of the present invention three is provided is shown It is intended to, the present embodiment is applicable to carry out the situation of mobility of people analysis automatically, the device can pass through hardware and/or software Mode is realized.With reference to Fig. 3, what the present embodiment was provided is specifically included based on the mobility of people analytical equipment of mobile terminal:
Collection pretreatment module 310, for being carried out to the network data in mobile terminal in real time according to default collection rule Collection, pretreatment is carried out to gathered data and obtains pretreatment information, and the pretreatment information at least includes positional information, Yong Hubiao Know and acquisition time;
Mark grouping module 320, for carrying out mark to the pretreatment information according to default mark rule, it is grouped after Most representational pretreatment information in each packet is defined as into key message;
Display module 330, it is described pre- for the key message to be included in GIS-Geographic Information System according to presets If form includes chart and/or word.
In the present embodiment, the pretreatment information can also include subscriber identity information, age information and locality information In at least one;
The ID is user mobile phone number.
It is described pretreatment is carried out to gathered data to obtain pretreatment information including in the present embodiment:
Data correlation, cleaning duplicate data are carried out to gathered data, and is converted into the pretreatment information of preset format;
Pretreatment information is landed in distributed file system Jing after distributed information system caching and is preserved.
In the present embodiment, the mark grouping module can include:
Mark unit, for carrying out mark to the every data for pre-processing information according to acquisition time;
Grouped element, for according to after mark when segment mark and ID pretreatment information is grouped, and will Every data after packet with the storage of key-Value forms, the data in key set include ID and when segment mark Note, the data in Value set include ID, when segment mark, positional information, subscriber identity information, age information And the locality at least one in information;
Sequencing unit, for being ranked up to all Value set according to acquisition time, and during by presetting in each packet Between Value set be defined as key message.
In the present embodiment, the display module can include:
Screening unit, it is described default for according to the corresponding key message of default screening conditions screening specific user colony Screening conditions include specifying age bracket, specify districts and cities, designated mobile phone ownership place, specify locality in information any one or Person it combines;
Display unit, for the positional information in the key message after screening to be included in geography information according to presets In system.
The mobility of people analytical equipment based on mobile terminal that the present embodiment is provided, is carried with any embodiment of the present invention For same inventive concept is belonged to based on the mobility of people analysis method of mobile terminal, can perform any embodiment institute of the present invention The mobility of people analysis method based on mobile terminal for providing, possesses the corresponding functional module of execution method and beneficial effect.
Note, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes, Readjust and substitute without departing from protection scope of the present invention.Therefore, although the present invention is carried out by above example It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also More other Equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.

Claims (10)

1. a kind of mobility of people analysis method based on mobile terminal, it is characterised in that include:
According to default collection rule the network data in mobile terminal is acquired in real time, gathered data is carried out to pre-process To pretreatment information, the pretreatment information is at least including positional information, ID and acquisition time;
Carry out mark to the pretreatment information according to default mark rule, it is grouped after will be most representational in each packet Pretreatment information is defined as key message;
The key message is included in GIS-Geographic Information System according to presets, the presets include chart and/or Word.
2. the mobility of people analysis method based on mobile terminal according to claim 1, it is characterised in that the pre- place Reason information also includes subscriber identity information, age information and the locality at least one in information;
The ID is user mobile phone number.
3. the mobility of people analysis method based on mobile terminal according to claim 1 and 2, it is characterised in that described Pretreatment carried out to gathered data obtain pretreatment information including:
Data correlation, cleaning duplicate data are carried out to gathered data, and is converted into the pretreatment information of preset format;
Pretreatment information is landed in distributed file system Jing after distributed information system caching and is preserved.
4. the mobility of people analysis method based on mobile terminal according to claim 2, it is characterised in that it is described according to Default mark rule carries out mark to the pretreatment information, it is grouped after by most representational pretreatment letter in each packet Breath is defined as key message to be included:
Mark is carried out to the every data for pre-processing information according to acquisition time;
According to after mark when segment mark and ID pretreatment information is grouped, and by packet after every data with Key-Value forms are stored, the data in key set include ID and when segment mark, in the Value set Data include ID, when segment mark, positional information, subscriber identity information, age information and locality in information extremely Few one kind;
All Value set is ranked up according to acquisition time, and the Value set of Preset Time in each packet is determined For key message.
5. the mobility of people analysis method based on mobile terminal according to claim 2, it is characterised in that described by institute Stating key message and being displayed in GIS-Geographic Information System according to presets includes:
The corresponding key message of specific user colony is screened according to default screening conditions, the default screening conditions include specifying year Any one in age section, specified districts and cities, designated mobile phone ownership place, specified locality information or its combination;
Positional information in key message after screening is included in GIS-Geographic Information System according to presets.
6. a kind of mobility of people analytical equipment based on mobile terminal, it is characterised in that include:
Collection pretreatment module is right for being acquired to the network data in mobile terminal in real time according to default collection rule Gathered data carries out pretreatment and obtains pretreatment information, and the pretreatment information at least includes positional information, ID and adopts The collection time;
Mark grouping module, for carrying out mark to the pretreatment information according to default mark rule, it is grouped after will be each Most representational pretreatment information is defined as key message in packet;
Display module, for the key message to be included in GIS-Geographic Information System according to presets, the presets Including chart and/or word.
7. the mobility of people analytical equipment based on mobile terminal according to claim 6, it is characterised in that the pre- place Reason information also includes subscriber identity information, age information and the locality at least one in information;
The ID is user mobile phone number.
8. the mobility of people analytical equipment based on mobile terminal according to claim 6 or 7, it is characterised in that described Pretreatment carried out to gathered data obtain pretreatment information including:
Data correlation, cleaning duplicate data are carried out to gathered data, and is converted into the pretreatment information of preset format;
Pretreatment information is landed in distributed file system Jing after distributed information system caching and is preserved.
9. the mobility of people analytical equipment based on mobile terminal according to claim 7, it is characterised in that the mark Grouping module includes:
Mark unit, for carrying out mark to the every data for pre-processing information according to acquisition time;
Grouped element, for according to after mark when segment mark and ID pretreatment information is grouped, and will packet Every data afterwards with the storage of key-Value forms, the data in key set include ID and when segment mark, institute State the data in Value set include ID, when segment mark, positional information, subscriber identity information, age information and adopt At least one in collection ground information;
Sequencing unit, for being ranked up to all Value set according to acquisition time, and by Preset Time in each packet Value set is defined as key message.
10. the mobility of people analytical equipment based on mobile terminal according to claim 7, it is characterised in that described aobvious Show that module includes:
Screening unit, for according to the corresponding key message of default screening conditions screening specific user colony, the default screening Condition includes specifying age bracket, specifies districts and cities, designated mobile phone ownership place, specifies any one or its locality in information Combination;
Display unit, for the positional information in the key message after screening to be included in GIS-Geographic Information System according to presets In.
CN201611122229.8A 2016-12-08 2016-12-08 Personnel mobility analysis method and device based on mobile terminal Pending CN106649636A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611122229.8A CN106649636A (en) 2016-12-08 2016-12-08 Personnel mobility analysis method and device based on mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611122229.8A CN106649636A (en) 2016-12-08 2016-12-08 Personnel mobility analysis method and device based on mobile terminal

Publications (1)

Publication Number Publication Date
CN106649636A true CN106649636A (en) 2017-05-10

Family

ID=58818719

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611122229.8A Pending CN106649636A (en) 2016-12-08 2016-12-08 Personnel mobility analysis method and device based on mobile terminal

Country Status (1)

Country Link
CN (1) CN106649636A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111026918A (en) * 2019-11-12 2020-04-17 上海麦克风文化传媒有限公司 Temperature data large-scale analysis framework
CN111641917A (en) * 2020-05-14 2020-09-08 成都众树信息科技有限公司 System and method for counting passenger flow of shopping mall store
CN112347093A (en) * 2020-11-05 2021-02-09 哈尔滨航天恒星数据系统科技有限公司 Method for facilitating cleaning, integrating and storing of mass multi-source heterogeneous data
CN113094292A (en) * 2020-01-09 2021-07-09 上海宝存信息科技有限公司 Data storage device and non-volatile memory control method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103853763A (en) * 2012-12-03 2014-06-11 腾讯科技(深圳)有限公司 Information acquiring method and device
CN105022781A (en) * 2015-05-31 2015-11-04 临沂大学 Online public opinion geographical location real time monitoring control system
CN106096631A (en) * 2016-06-02 2016-11-09 上海世脉信息科技有限公司 A kind of recurrent population's Classification and Identification based on the big data of mobile phone analyze method
CN106156461A (en) * 2015-03-27 2016-11-23 北京市天元网络技术股份有限公司 A kind of analysis method and system carrying out movement of population based on mobile communications network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103853763A (en) * 2012-12-03 2014-06-11 腾讯科技(深圳)有限公司 Information acquiring method and device
CN106156461A (en) * 2015-03-27 2016-11-23 北京市天元网络技术股份有限公司 A kind of analysis method and system carrying out movement of population based on mobile communications network
CN105022781A (en) * 2015-05-31 2015-11-04 临沂大学 Online public opinion geographical location real time monitoring control system
CN106096631A (en) * 2016-06-02 2016-11-09 上海世脉信息科技有限公司 A kind of recurrent population's Classification and Identification based on the big data of mobile phone analyze method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵时亮 等: "基于移动通信的人口流动信息大数据分析方法与应用", 《人口与社会》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111026918A (en) * 2019-11-12 2020-04-17 上海麦克风文化传媒有限公司 Temperature data large-scale analysis framework
CN113094292A (en) * 2020-01-09 2021-07-09 上海宝存信息科技有限公司 Data storage device and non-volatile memory control method
CN113094292B (en) * 2020-01-09 2022-12-02 上海宝存信息科技有限公司 Data storage device and non-volatile memory control method
CN111641917A (en) * 2020-05-14 2020-09-08 成都众树信息科技有限公司 System and method for counting passenger flow of shopping mall store
CN112347093A (en) * 2020-11-05 2021-02-09 哈尔滨航天恒星数据系统科技有限公司 Method for facilitating cleaning, integrating and storing of mass multi-source heterogeneous data

Similar Documents

Publication Publication Date Title
US11816126B2 (en) Large scale unstructured database systems
CN110704411B (en) Knowledge graph building method and device suitable for art field and electronic equipment
CN106934014B (en) Hadoop-based network data mining and analyzing platform and method thereof
Bellini et al. Smart city architecture for data ingestion and analytics: Processes and solutions
CN107451225B (en) Scalable analytics platform for semi-structured data
CN106202207A (en) A kind of index based on HBase ORM and searching system
CN106030573A (en) Implementation of semi-structured data as a first-class database element
US20100318492A1 (en) Data analysis system and method
CN106649636A (en) Personnel mobility analysis method and device based on mobile terminal
CN104239377A (en) Platform-crossing data retrieval method and device
JP5075653B2 (en) Database management method, database management apparatus, database management program, and database system
Alarabi et al. TAREEG: A MapReduce-based system for extracting spatial data from OpenStreetMap
CN115238015A (en) Space-time big data platform based on micro-service
CN105550375A (en) Heterogeneous data integrating method and system
CN105574051A (en) Method for updating user satisfaction rule and processing system
CN110570928A (en) HBase and ozone based medical image file access method
Valkanas et al. Location extraction from social networks with commodity software and online data
CN105518644A (en) Method for processing and displaying real-time social data on map
CN104298669A (en) Person geographic information mining model based on social network
CN106462591B (en) Partition filtering using intelligent indexing in memory
Jhummarwala et al. Parallel and distributed GIS for processing geo-data: an overview
Zhou et al. An efficient access model of Massive spatiotemporal Vehicle trajectory data in Smart City
Xia et al. DAPR-tree: a distributed spatial data indexing scheme with data access patterns to support Digital Earth initiatives
Rehman et al. Building socially-enabled event-enriched maps
US8527478B1 (en) Handling bulk and incremental updates while maintaining consistency

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170510

RJ01 Rejection of invention patent application after publication