CN106649636A - Personnel mobility analysis method and device based on mobile terminal - Google Patents
Personnel mobility analysis method and device based on mobile terminal Download PDFInfo
- Publication number
- CN106649636A CN106649636A CN201611122229.8A CN201611122229A CN106649636A CN 106649636 A CN106649636 A CN 106649636A CN 201611122229 A CN201611122229 A CN 201611122229A CN 106649636 A CN106649636 A CN 106649636A
- Authority
- CN
- China
- Prior art keywords
- information
- data
- pretreatment
- mark
- mobile terminal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/221—Column-oriented storage; Management thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2474—Sequence data queries, e.g. querying versioned data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2477—Temporal data queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/258—Data format conversion from or to a database
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Probability & Statistics with Applications (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Fuzzy Systems (AREA)
- Quality & Reliability (AREA)
- Remote Sensing (AREA)
- Telephonic Communication Services (AREA)
Abstract
The embodiment of the invention discloses a personnel mobility analysis method and device based on a mobile terminal. The method comprises the steps that network data in the mobile terminal is collected in real time according to a preset collection rule, and the collected data is preprocessed to obtain preprocessed information, wherein the preprocessed information at least comprises position information, user identifiers and collection time; the preprocessed information is marked according to a preset marking rule, and after grouping is conducted, the preprocessed information with the best representativeness in each group is determined as key information; the key information is displayed in a geographic information system according to preset forms, wherein the preset forms comprise graphs and/or characters. According to the method and device, the personal mobility state can be updated automatically, and the personnel mobility analysis efficiency is improved.
Description
Technical field
The present embodiments relate to data analysis technique, more particularly to a kind of mobility of people analysis based on mobile terminal
Method and device.
Background technology
Flow of personnel and statistics have important function to regional population's analysis.
Current mobility of people analysis also rests on the artificial statistics stage, i.e., it is main or by artificial gathered data and point
Analysis, so not only consumes more time when early stage gathered data, whole process analysis efficiency is substantially reduced.
In view of this, it is special to propose the present invention.
The content of the invention
The embodiment of the present invention provides a kind of mobility of people analysis method and device based on mobile terminal, to realize improving
The purpose of mobility of people analysis efficiency.
In a first aspect, a kind of mobility of people analysis method based on mobile terminal is embodiments provided, including:
According to default collection rule the network data in mobile terminal is acquired in real time, pre- place is carried out to gathered data
Reason obtains pretreatment information, and the pretreatment information at least includes positional information, ID and acquisition time;
Carry out mark to the pretreatment information according to default mark rule, it is grouped after will most represent in each packet
The pretreatment information of property is defined as key message;
The key message is included in GIS-Geographic Information System according to presets, the presets include chart
And/or word.
Second aspect, the embodiment of the present invention additionally provides a kind of mobility of people analytical equipment based on mobile terminal, bag
Include:
Collection pretreatment module, for being adopted to the network data in mobile terminal in real time according to default collection rule
Collection, pretreatment is carried out to gathered data and obtains pretreatment information, and the pretreatment information at least includes positional information, ID
And acquisition time;
Mark grouping module, for carrying out mark to the pretreatment information according to default mark rule, it is grouped after will
Most representational pretreatment information is defined as key message in each packet;
Display module, it is described default for the key message to be included in GIS-Geographic Information System according to presets
Form includes chart and/or word.
The embodiment of the present invention is right by being acquired to the network data in mobile terminal in real time according to default collection rule
Gathered data carries out pretreatment and obtains pretreatment information, and mark is carried out to pretreatment information according to default mark rule, and Jing divides
Most representational pretreatment information in each packet is defined as into key message after group, finally by key message according to default shape
Formula is displayed in GIS-Geographic Information System, wherein, when corresponding positional information, ID and collection are contained in key message
Between, that is, the real time position of each user is represented, key message is converted into corresponding chart and/or written form and is illustrated in
In GIS-Geographic Information System, you can intuitively show the real time position of a large number of users in specific region, thus it is capable of achieving to automatically update
Flow of personnel state, improves the efficiency of mobility of people analysis.
Description of the drawings
Fig. 1 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention one is provided
Figure;
Fig. 2 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention two is provided
Figure;
Fig. 3 is that a kind of structure of mobility of people analytical equipment based on mobile terminal that the embodiment of the present invention three is provided is shown
It is intended to.
Specific embodiment
With reference to the accompanying drawings and examples the present invention is described in further detail.It is understood that this place is retouched
The specific embodiment stated is used only for explaining the present invention, rather than limitation of the invention.It also should be noted that, in order to just
Part related to the present invention rather than entire infrastructure are illustrate only in description, accompanying drawing.
Embodiment one
Fig. 1 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention one is provided
Figure, the present embodiment is applicable to carry out the situation of mobility of people analysis automatically, and the method can be by the people based on mobile terminal
Performing, the device can be realized member's liquidity analysis device by way of hardware and/or software.With reference to Fig. 1, the present embodiment is carried
For specifically included based on the mobility of people analysis method of mobile terminal:
The default collection rule of S101, basis is acquired in real time to the network data in mobile terminal, and gathered data is entered
Row pretreatment obtains pretreatment information.
Wherein, the pretreatment information at least includes positional information, ID and acquisition time.Positional information can be
The gps coordinate of user, ID can be cell-phone number.
Wherein it is possible to be acquired to the network data in mobile terminal in real time using data acquisition equipment, data acquisition
Equipment can be the application software, or web crawlers in mobile terminal with certain collection authority, as long as user uses
When mobile terminal will produce volume of data, data acquisition equipment just can constantly gather what is wanted according to default collection rule
Data.Exemplary, the identity information of user, such as identification card number, age, sex can be collected when user's registration website
Deng, the address information of user can be collected when user does shopping, the logical of user can be collected when user checks address list
Information in news, the positional information of user can be collected when user is using map.The data for collecting can generate tab
Text is pre-processed again.Wherein, the function of tab (being also tab stop) is being hung down in the case where form is not used
Nogata is to pressing column alignment text.
Wherein, default collection rule could be for gathering the regular expression of particular data, for example, can be that collection is used
Regular expression, regular expression of collection user mobile phone number of family position etc..Due to the number collected according to regular expression
According to still there is some to be invalid data, therefore the data also needed to gathering carry out the conversion of rubbish filtering, data cleansing and form
Valid data are extracted Deng operation.
S102, carry out mark to the pretreatment information according to default mark rule, it is grouped after by each packet most
Representative pretreatment information is defined as key message.
Wherein, pretreatment information is carried out mark can preferably flag data, be easy to follow-up data processing procedure more preferable
Corresponding data is searched on ground, preferably can carry out mark to pretreatment information according to acquisition time, not the data addition after mark
The mark period.
Wherein, although data have eliminated most invalid data after pretreatment, but because collection is real
Shi Jinhang's, the valid data amount for staying is still very big, therefore data can be marked (during as marked according to ID and mark
Section) data are grouped, and the key message in the packet is filtered out according to certain screening rule, key message can be this
Most representational pretreatment information in packet.
S103, the key message is included in GIS-Geographic Information System according to presets.
Wherein, the presets include chart and/or word.
Wherein, GIS-Geographic Information System (Geographic Information System, GIS) is that one kind is specific very
Important space information system.It is in the case where computer hardware and software system is supported, to earth top layer all or in part (including big
Gas-bearing formation) the relevant geographic distribution data in space is acquired, stores, managing, computing, the technology system that analyzes, be shown and described
System.
The technical scheme of the present embodiment, by being carried out to the network data in mobile terminal in real time according to default collection rule
Collection, pretreatment is carried out to gathered data and obtains pretreatment information, and according to default mark rule pretreatment information is carried out to beat
Mark, it is grouped after most representational pretreatment information in each packet is defined as into key message, finally key message is pressed
Be displayed in GIS-Geographic Information System according to presets, wherein, corresponding positional information, ID are contained in key message with
And acquisition time, that is, the real time position of each user is represented, key message is converted into corresponding chart and/or written form
And be illustrated in GIS-Geographic Information System, you can the real time position of a large number of users in specific region is intuitively shown, is thus capable of achieving
Flow of personnel state is automatically updated, the efficiency of mobility of people analysis is improved.
On the basis of above-mentioned technical proposal, the pretreatment information can also preferably include subscriber identity information, age
At least one in information and locality information;The ID is user mobile phone number.
Wherein, subscriber identity information can be the numbers such as identification card number, home address, name, the pet name, mailbox, social account
According to locality information can be districts and cities' code locality.
Embodiment two
Fig. 2 is a kind of flow process of mobility of people analysis method based on mobile terminal that the embodiment of the present invention two is provided
Figure, the present embodiment, preferably to operating S101, S102 and S103 further to optimize, is joined on the basis of above-described embodiment one
Fig. 2 is examined, concrete grammar is as follows:
The default collection rule of S201, basis is acquired in real time to the network data in mobile terminal.
S202, gathered data is carried out data correlation, cleaning duplicate data, and be converted into preset format pretreatment letter
Breath.
Wherein it is possible to the data for collecting are associated by pretreatment cluster, clean duplicate data go forward side by side row format turn
Change.
Specifically, association is referred to and together forms the data correlation of containing identical content two or more than two newly
Data, exemplary, containing cell-phone number 1, identification card number and name, data 2 are containing cell-phone number 1, address information and year for data 1
Age, two datas all contain cell-phone number 1, then can be associated together two datas, form data 3, and the data 3 contain cell-phone number
1st, identification card number, name, address information and age.
Specifically, it is mutually to compare the data with the data of former storage to clean duplicate data, washes duplicate data.
Specifically, form conversion is to carry out data to unify conversion, and the Supplementing Data information to gathering, as collected
Positional information is a latitude and longitude coordinates, then also add to the latitude and longitude coordinates place place name in data, after then processing
Data be converted into the pretreatment information of preset format.Wherein, preset format can be protobuf forms, and protobuf is
Protocol buffers, are a kind of forms of data exchange of google, it independently of language, independently of platform.google
There is provided the realization of multilingual:Java, c#, c++, go and python, each realization all contains the compiling of corresponding language
Device and library file, because it is a kind of binary form, than carrying out the fast many of data exchange using xml, can use it
The data exchange under data communication or isomerous environment between Distributed Application, it is all very excellent as a kind of efficiency and compatibility
Elegant binary data transmission form, can be used for the numerous areas such as network transmission, configuration file, data storage.
S203, by pretreatment information Jing distributed information system caching after land in distributed file system preserve.
Wherein, distributed message system can be kafka, and Kafka is a high-performance, distributed message system, extensively
For scenes such as log collection, stream data process, online and offline message distributions.Traditional ActiveMQ is compared, Kafka is simultaneously
Row ability and handling capacity are higher.Due to the data volume difference that data acquisition equipment is gathered in different periods, therefore exist such
Situation:The pretreatment information that a certain period obtains is considerably less, and the pretreatment information that a certain period obtains rapidly explodes again, if do not had
There is kafka to cache, be then easy to the situation that system crash occurs, therefore kafka primarily serves the effect of equally loaded.
Wherein, distributed file system can be HDFS (Hadoop Distributed File System), and HDFS is
The distributed storage file system of Hadoop systems, be especially suitable for storage process super large file, super large file typically refer to hundred MB,
The file of hundreds of TB sizes is set, and at present in actual applications, HDFS can be used for the data of storage management PB level.HDFS
Design set up on the basis of more response " write-once, repeatedly read-write " task, it means that a data set is once
Generated by data source, will be replicated and be distributed in different memory nodes, be then responding to various data analysis tasks
Request.
Further, because HDFS there are certain requirements to data memory format, typically by data rowization, and with parquet
Form is landed in HDFS.Wherein, parquet is the column storage format towards analytic type business, can skip and not meet bar
The data of part, only read the data for needing, and reduce I/O data amount, and compressed encoding is reducing disk storage space.Due to same row
Data type be the same, more efficient compressed encoding (such as Run Length Encoding and Delta can also be used
Encoding) further save storage empty, only read the row for needing, supporting vector computing can obtain more preferable scan performance,
Simultaneously can be with follow-up spark sql (being a spark component for processing structure data) seamless combination.
Further, can be every subsequently to carry out according to date field sectional lists to landing to the pretreatment information in HDFS
The data preanalysis of day/when is called, while can also be set up according to critical fielies such as acquisition time and ID index, in case subsequently
Real-time query data are prepared.
S204, according to acquisition time to pre-process information every data carry out mark.
Specifically, for the pretreatment information being stored in HDFS, can be somebody's turn to do according to the parsing of date catalogue using spark sql
The Parquet files landed under date catalogue, and the data after parsing are converted into RDD (Resilient Distributed
Datasets, elasticity distribution formula data set).Wherein, RDD is an abstract concept of distributed memory, and RDD provides a kind of high
The limited shared drive model of degree, i.e. RDD is the set of read-only record partitioning, can only be by performing what is determined in other RDD
Conversion operation and create, but these limit cause realize that fault-tolerant expense is very low.For developer, RDD can be regarded as
One object of Spark, itself is run in internal memory, and computing is read and write in internal memory and reaches as high as the 100 of disk read-write computing
Times.Therefore, Parquet file translations can be improved into processing speed of the spark systems to data for RDD.
Further, the RDD in internal memory is traveled through, and to RDD marks.It is preferred that mark rule is to be carried out beating according to acquisition time
Mark.Exemplary, it is assumed that the pretreatment information in HDFS is that then mark rule can be 24 little with daily date field sectional lists
When be the cycle, according to acquisition time, morning zero point is designated as 1, every 10 minutes mark values+1, for marking per bar record daily
Which period belonged to, and the when segment mark after mark is added in the data, stored in the RDD and original HDFS in internal memory
Data all accordingly add when segment mark.After 24 hours, then the data by second day acquisition time for morning zero point are designated as
1, every 10 minutes mark values+1, by that analogy.
S205, according to after mark when segment mark and ID pretreatment information is grouped, and by after packet
Per data with the storage of key-Value forms.
Wherein, the data in key set include ID and when segment mark, the data in the Value set
Including ID, when segment mark, positional information, subscriber identity information, age information and locality at least in information
Kind.
Specifically, it is possible to use in RDD groupby operations according to ID (such as cell-phone number) and when segment mark carry out point
Group, i.e., data of the identical cell-phone number in a certain period are one group, and Key-Value formatted datas, wherein user are become after packet
Cell-phone number is designated, positional information is longitude and latitude, then Key-Value is specially ((cell-phone number, when segment mark), (cell-phone number, when
Segment mark, Jing dimensions, districts and cities' code, subscriber identity information, acquisition time)), grouped data is convenient for data statistics, merges
And than peering, RDD continues to be placed in internal memory after packet.
S206, all Value set is ranked up according to acquisition time, and by Preset Time in each packet
Value set is defined as key message.
Specifically, the Value set in packet RDD can be ranked up according to acquisition time, so each packet RDD
In data be according to the ordered data of Time alignment.Further, the Value of ad-hoc location in each packet can be gathered,
That is the Value set of preset time period is defined as key message.Exemplary, the information for storing in a certain packet is mobile phone
Number 1 12:00 to 12:10 points of all data, per data according to acquisition time order ordered arrangement, it is considered that per group of number
Most middle data accuracy highest according in, it is most representative, then it is 12 by data most middle in this group of data:05 point
Value set is defined as key message.
Further, if follow-up data processes operation and needs to show corresponding flow of personnel feelings according to home zone of mobile phone number
Condition, then also need to be associated operation with home zone of mobile phone number.Specifically, by the RDD in internal memory with home zone of mobile phone number storehouse
Join operations, the mobile phone ownership place that mark is recorded per bar are carried out, and are converted into new RDD to be saved in internal memory and original HDFS, newly
RDD data forms can for (cell-phone number, when segment mark, Jing dimensions, districts and cities' code, subscriber identity information, acquisition time, mobile phone
Ownership place), can so generate the daily high frequency of mass data (10 minutes) track record.
The corresponding key message of the default screening conditions screening specific user colony of S207, basis.
Wherein, the default screening conditions include specifying age bracket, specify districts and cities, designated mobile phone ownership place, specify and adopt
Any one in collection ground information or its combination.
Specifically, it is possible to use spark sql filter out specific user colony every N minutes according to default screening conditions
Key message.Exemplary, obtain key message of the specified all users of districts and cities every N minutes using spark sql;Use
Spark sql are obtained and are specified districts and cities and specify all users of roaming place every the key message of N minutes;Using spark sql
Obtain specified districts and cities and the age is presetting key message of all users in age bracket every N minutes.
S208, the positional information in the key message after screening is included in GIS-Geographic Information System according to presets.
Specifically, the longitude and latitude in the key message after screening is presented in the form of chart and/or word in GIS and is tied
Really.
Embodiment three
Fig. 3 is that a kind of structure of mobility of people analytical equipment based on mobile terminal that the embodiment of the present invention three is provided is shown
It is intended to, the present embodiment is applicable to carry out the situation of mobility of people analysis automatically, the device can pass through hardware and/or software
Mode is realized.With reference to Fig. 3, what the present embodiment was provided is specifically included based on the mobility of people analytical equipment of mobile terminal:
Collection pretreatment module 310, for being carried out to the network data in mobile terminal in real time according to default collection rule
Collection, pretreatment is carried out to gathered data and obtains pretreatment information, and the pretreatment information at least includes positional information, Yong Hubiao
Know and acquisition time;
Mark grouping module 320, for carrying out mark to the pretreatment information according to default mark rule, it is grouped after
Most representational pretreatment information in each packet is defined as into key message;
Display module 330, it is described pre- for the key message to be included in GIS-Geographic Information System according to presets
If form includes chart and/or word.
In the present embodiment, the pretreatment information can also include subscriber identity information, age information and locality information
In at least one;
The ID is user mobile phone number.
It is described pretreatment is carried out to gathered data to obtain pretreatment information including in the present embodiment:
Data correlation, cleaning duplicate data are carried out to gathered data, and is converted into the pretreatment information of preset format;
Pretreatment information is landed in distributed file system Jing after distributed information system caching and is preserved.
In the present embodiment, the mark grouping module can include:
Mark unit, for carrying out mark to the every data for pre-processing information according to acquisition time;
Grouped element, for according to after mark when segment mark and ID pretreatment information is grouped, and will
Every data after packet with the storage of key-Value forms, the data in key set include ID and when segment mark
Note, the data in Value set include ID, when segment mark, positional information, subscriber identity information, age information
And the locality at least one in information;
Sequencing unit, for being ranked up to all Value set according to acquisition time, and during by presetting in each packet
Between Value set be defined as key message.
In the present embodiment, the display module can include:
Screening unit, it is described default for according to the corresponding key message of default screening conditions screening specific user colony
Screening conditions include specifying age bracket, specify districts and cities, designated mobile phone ownership place, specify locality in information any one or
Person it combines;
Display unit, for the positional information in the key message after screening to be included in geography information according to presets
In system.
The mobility of people analytical equipment based on mobile terminal that the present embodiment is provided, is carried with any embodiment of the present invention
For same inventive concept is belonged to based on the mobility of people analysis method of mobile terminal, can perform any embodiment institute of the present invention
The mobility of people analysis method based on mobile terminal for providing, possesses the corresponding functional module of execution method and beneficial effect.
Note, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that
The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes,
Readjust and substitute without departing from protection scope of the present invention.Therefore, although the present invention is carried out by above example
It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also
More other Equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.
Claims (10)
1. a kind of mobility of people analysis method based on mobile terminal, it is characterised in that include:
According to default collection rule the network data in mobile terminal is acquired in real time, gathered data is carried out to pre-process
To pretreatment information, the pretreatment information is at least including positional information, ID and acquisition time;
Carry out mark to the pretreatment information according to default mark rule, it is grouped after will be most representational in each packet
Pretreatment information is defined as key message;
The key message is included in GIS-Geographic Information System according to presets, the presets include chart and/or
Word.
2. the mobility of people analysis method based on mobile terminal according to claim 1, it is characterised in that the pre- place
Reason information also includes subscriber identity information, age information and the locality at least one in information;
The ID is user mobile phone number.
3. the mobility of people analysis method based on mobile terminal according to claim 1 and 2, it is characterised in that described
Pretreatment carried out to gathered data obtain pretreatment information including:
Data correlation, cleaning duplicate data are carried out to gathered data, and is converted into the pretreatment information of preset format;
Pretreatment information is landed in distributed file system Jing after distributed information system caching and is preserved.
4. the mobility of people analysis method based on mobile terminal according to claim 2, it is characterised in that it is described according to
Default mark rule carries out mark to the pretreatment information, it is grouped after by most representational pretreatment letter in each packet
Breath is defined as key message to be included:
Mark is carried out to the every data for pre-processing information according to acquisition time;
According to after mark when segment mark and ID pretreatment information is grouped, and by packet after every data with
Key-Value forms are stored, the data in key set include ID and when segment mark, in the Value set
Data include ID, when segment mark, positional information, subscriber identity information, age information and locality in information extremely
Few one kind;
All Value set is ranked up according to acquisition time, and the Value set of Preset Time in each packet is determined
For key message.
5. the mobility of people analysis method based on mobile terminal according to claim 2, it is characterised in that described by institute
Stating key message and being displayed in GIS-Geographic Information System according to presets includes:
The corresponding key message of specific user colony is screened according to default screening conditions, the default screening conditions include specifying year
Any one in age section, specified districts and cities, designated mobile phone ownership place, specified locality information or its combination;
Positional information in key message after screening is included in GIS-Geographic Information System according to presets.
6. a kind of mobility of people analytical equipment based on mobile terminal, it is characterised in that include:
Collection pretreatment module is right for being acquired to the network data in mobile terminal in real time according to default collection rule
Gathered data carries out pretreatment and obtains pretreatment information, and the pretreatment information at least includes positional information, ID and adopts
The collection time;
Mark grouping module, for carrying out mark to the pretreatment information according to default mark rule, it is grouped after will be each
Most representational pretreatment information is defined as key message in packet;
Display module, for the key message to be included in GIS-Geographic Information System according to presets, the presets
Including chart and/or word.
7. the mobility of people analytical equipment based on mobile terminal according to claim 6, it is characterised in that the pre- place
Reason information also includes subscriber identity information, age information and the locality at least one in information;
The ID is user mobile phone number.
8. the mobility of people analytical equipment based on mobile terminal according to claim 6 or 7, it is characterised in that described
Pretreatment carried out to gathered data obtain pretreatment information including:
Data correlation, cleaning duplicate data are carried out to gathered data, and is converted into the pretreatment information of preset format;
Pretreatment information is landed in distributed file system Jing after distributed information system caching and is preserved.
9. the mobility of people analytical equipment based on mobile terminal according to claim 7, it is characterised in that the mark
Grouping module includes:
Mark unit, for carrying out mark to the every data for pre-processing information according to acquisition time;
Grouped element, for according to after mark when segment mark and ID pretreatment information is grouped, and will packet
Every data afterwards with the storage of key-Value forms, the data in key set include ID and when segment mark, institute
State the data in Value set include ID, when segment mark, positional information, subscriber identity information, age information and adopt
At least one in collection ground information;
Sequencing unit, for being ranked up to all Value set according to acquisition time, and by Preset Time in each packet
Value set is defined as key message.
10. the mobility of people analytical equipment based on mobile terminal according to claim 7, it is characterised in that described aobvious
Show that module includes:
Screening unit, for according to the corresponding key message of default screening conditions screening specific user colony, the default screening
Condition includes specifying age bracket, specifies districts and cities, designated mobile phone ownership place, specifies any one or its locality in information
Combination;
Display unit, for the positional information in the key message after screening to be included in GIS-Geographic Information System according to presets
In.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611122229.8A CN106649636A (en) | 2016-12-08 | 2016-12-08 | Personnel mobility analysis method and device based on mobile terminal |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611122229.8A CN106649636A (en) | 2016-12-08 | 2016-12-08 | Personnel mobility analysis method and device based on mobile terminal |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106649636A true CN106649636A (en) | 2017-05-10 |
Family
ID=58818719
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611122229.8A Pending CN106649636A (en) | 2016-12-08 | 2016-12-08 | Personnel mobility analysis method and device based on mobile terminal |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106649636A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111026918A (en) * | 2019-11-12 | 2020-04-17 | 上海麦克风文化传媒有限公司 | Temperature data large-scale analysis framework |
CN111641917A (en) * | 2020-05-14 | 2020-09-08 | 成都众树信息科技有限公司 | System and method for counting passenger flow of shopping mall store |
CN112347093A (en) * | 2020-11-05 | 2021-02-09 | 哈尔滨航天恒星数据系统科技有限公司 | Method for facilitating cleaning, integrating and storing of mass multi-source heterogeneous data |
CN113094292A (en) * | 2020-01-09 | 2021-07-09 | 上海宝存信息科技有限公司 | Data storage device and non-volatile memory control method |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103853763A (en) * | 2012-12-03 | 2014-06-11 | 腾讯科技(深圳)有限公司 | Information acquiring method and device |
CN105022781A (en) * | 2015-05-31 | 2015-11-04 | 临沂大学 | Online public opinion geographical location real time monitoring control system |
CN106096631A (en) * | 2016-06-02 | 2016-11-09 | 上海世脉信息科技有限公司 | A kind of recurrent population's Classification and Identification based on the big data of mobile phone analyze method |
CN106156461A (en) * | 2015-03-27 | 2016-11-23 | 北京市天元网络技术股份有限公司 | A kind of analysis method and system carrying out movement of population based on mobile communications network |
-
2016
- 2016-12-08 CN CN201611122229.8A patent/CN106649636A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103853763A (en) * | 2012-12-03 | 2014-06-11 | 腾讯科技(深圳)有限公司 | Information acquiring method and device |
CN106156461A (en) * | 2015-03-27 | 2016-11-23 | 北京市天元网络技术股份有限公司 | A kind of analysis method and system carrying out movement of population based on mobile communications network |
CN105022781A (en) * | 2015-05-31 | 2015-11-04 | 临沂大学 | Online public opinion geographical location real time monitoring control system |
CN106096631A (en) * | 2016-06-02 | 2016-11-09 | 上海世脉信息科技有限公司 | A kind of recurrent population's Classification and Identification based on the big data of mobile phone analyze method |
Non-Patent Citations (1)
Title |
---|
赵时亮 等: "基于移动通信的人口流动信息大数据分析方法与应用", 《人口与社会》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111026918A (en) * | 2019-11-12 | 2020-04-17 | 上海麦克风文化传媒有限公司 | Temperature data large-scale analysis framework |
CN113094292A (en) * | 2020-01-09 | 2021-07-09 | 上海宝存信息科技有限公司 | Data storage device and non-volatile memory control method |
CN113094292B (en) * | 2020-01-09 | 2022-12-02 | 上海宝存信息科技有限公司 | Data storage device and non-volatile memory control method |
CN111641917A (en) * | 2020-05-14 | 2020-09-08 | 成都众树信息科技有限公司 | System and method for counting passenger flow of shopping mall store |
CN112347093A (en) * | 2020-11-05 | 2021-02-09 | 哈尔滨航天恒星数据系统科技有限公司 | Method for facilitating cleaning, integrating and storing of mass multi-source heterogeneous data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11816126B2 (en) | Large scale unstructured database systems | |
CN110704411B (en) | Knowledge graph building method and device suitable for art field and electronic equipment | |
CN106934014B (en) | Hadoop-based network data mining and analyzing platform and method thereof | |
Bellini et al. | Smart city architecture for data ingestion and analytics: Processes and solutions | |
CN107451225B (en) | Scalable analytics platform for semi-structured data | |
CN106202207A (en) | A kind of index based on HBase ORM and searching system | |
CN106030573A (en) | Implementation of semi-structured data as a first-class database element | |
US20100318492A1 (en) | Data analysis system and method | |
CN106649636A (en) | Personnel mobility analysis method and device based on mobile terminal | |
CN104239377A (en) | Platform-crossing data retrieval method and device | |
JP5075653B2 (en) | Database management method, database management apparatus, database management program, and database system | |
Alarabi et al. | TAREEG: A MapReduce-based system for extracting spatial data from OpenStreetMap | |
CN115238015A (en) | Space-time big data platform based on micro-service | |
CN105550375A (en) | Heterogeneous data integrating method and system | |
CN105574051A (en) | Method for updating user satisfaction rule and processing system | |
CN110570928A (en) | HBase and ozone based medical image file access method | |
Valkanas et al. | Location extraction from social networks with commodity software and online data | |
CN105518644A (en) | Method for processing and displaying real-time social data on map | |
CN104298669A (en) | Person geographic information mining model based on social network | |
CN106462591B (en) | Partition filtering using intelligent indexing in memory | |
Jhummarwala et al. | Parallel and distributed GIS for processing geo-data: an overview | |
Zhou et al. | An efficient access model of Massive spatiotemporal Vehicle trajectory data in Smart City | |
Xia et al. | DAPR-tree: a distributed spatial data indexing scheme with data access patterns to support Digital Earth initiatives | |
Rehman et al. | Building socially-enabled event-enriched maps | |
US8527478B1 (en) | Handling bulk and incremental updates while maintaining consistency |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170510 |
|
RJ01 | Rejection of invention patent application after publication |