CN106991525A - The air quality and resident trip visual analysis method and system driven based on big data - Google Patents
The air quality and resident trip visual analysis method and system driven based on big data Download PDFInfo
- Publication number
- CN106991525A CN106991525A CN201710173669.4A CN201710173669A CN106991525A CN 106991525 A CN106991525 A CN 106991525A CN 201710173669 A CN201710173669 A CN 201710173669A CN 106991525 A CN106991525 A CN 106991525A
- Authority
- CN
- China
- Prior art keywords
- poi
- liveness
- data
- cum rights
- air quality
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000004458 analytical method Methods 0.000 title claims abstract description 22
- 230000000007 visual effect Effects 0.000 title claims abstract description 19
- 238000009825 accumulation Methods 0.000 claims description 25
- 238000010586 diagram Methods 0.000 claims description 13
- 230000003442 weekly effect Effects 0.000 claims description 12
- 238000013507 mapping Methods 0.000 claims description 9
- 238000000034 method Methods 0.000 claims description 8
- 230000001186 cumulative effect Effects 0.000 claims description 7
- 230000002146 bilateral effect Effects 0.000 claims description 6
- 239000003086 colorant Substances 0.000 claims description 6
- 239000011159 matrix material Substances 0.000 claims description 6
- 238000005201 scrubbing Methods 0.000 claims description 6
- 230000004456 color vision Effects 0.000 claims description 3
- 238000003064 k means clustering Methods 0.000 claims description 3
- 239000004744 fabric Substances 0.000 claims 1
- 239000011800 void material Substances 0.000 claims 1
- 238000012800 visualization Methods 0.000 abstract description 4
- 230000003993 interaction Effects 0.000 abstract description 3
- 238000012544 monitoring process Methods 0.000 description 3
- 238000007405 data analysis Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- CBENFWSGALASAD-UHFFFAOYSA-N Ozone Chemical compound [O-][O+]=O CBENFWSGALASAD-UHFFFAOYSA-N 0.000 description 1
- UCKMPCXJQFINFW-UHFFFAOYSA-N Sulphide Chemical compound [S-2] UCKMPCXJQFINFW-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000013079 data visualisation Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 150000004767 nitrides Chemical class 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000013618 particulate matter Substances 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Engineering & Computer Science (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- Economics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Marketing (AREA)
- Entrepreneurship & Innovation (AREA)
- Educational Administration (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Development Economics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Game Theory and Decision Science (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Instructional Devices (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses the air quality driven based on big data and resident trip visual analysis method and system, including following steps:(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction;(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Deviation ratio reflects the situation of change of POI cum rights liveness;(3) same type POI is clustered;(4) air quality and the visual analysis of resident trip.The present invention is with low cost, safeguards simple, deployment is rapid, and visualization interface interaction has diversity, and each user can be with multi-granularity analysis air quality and resident trip situation.
Description
Technical field
The present invention relates to the air quality driven based on big data and resident trip visual analysis method and system.
Background technology
Along with the development of China's process of industrialization, with sulfide (SOx), nitride (NOx), ozone (O3), carbide
(COx), the pollution problem that the industrial excreta based on particulate matter (particle diameter is less than or equal to 10 μm and 2.5 μm) is caused to air quality
Increasingly serious, go off daily and life to people cause extreme influence, show according to investigations, when air quality is poor, people
It is more willing to stay indoors to reduce nonessential travel behaviour.
With the development of science and technology, data are largely gathered and stored, data volume is in explosive growth, how from these data
In excavate valuable information as urgent problem.When in face of huge and complicated data, traditional data
Excavate and data analysing method seems unable to do what one wishes in heuristic data.In order to obtain the value contained in data, various data
Analysis is used and given birth to method for digging.
Therefore we need a kind of effective method to solve these problems.In the last few years, as visually to interact
Analysis ratiocination science based on interface, visual analysis is data mining, data analysis provides a kind of brand-new means, it with
The features such as interactivity, visuality, is welcomed warmly by numerous researchers, turns into study hotspot gradually.
Therefore, for air quality and resident trip visual research for probing between air quality and resident trip
Relation it is significant, it not only can for explore resident travel behaviour important references are provided, can also cause traffic,
Attention of the relevant departments such as medical treatment to air quality.Therefore probe into air quality and no matter resonable the visual research of resident trip is
Still all there is very important researching value in actual applications by upper.
The content of the invention
The problem of present invention is analyzed for air quality and resident trip, designs a kind of air matter driven based on big data
Amount and resident trip visual analysis method and system, preferably help the departments such as traffic, medical treatment to air quality and resident trip
Analyzed, and a set of visual analysis system is provided and help customer analysis Characteristics of Air Quality, Resident Trip Characteristics, displaying is empty
Makings amount bar chart, temperature box traction substation, POI cum rights liveness stack figure and flow graph, POI cum rights liveness deviation ratio calendar thermal maps
With multidimensional block diagram, urban air-quality and resident trip are explored.The purpose of the present invention is by the following technical programs
Come what is realized:A kind of air quality and resident trip visual analysis method driven based on big data, this method includes following step
Suddenly:
(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction:First respectively to sky
Gas qualitative data, temperature data, POI data and taxi taking difficulty data carry out data scrubbing and sequence, wherein data scrubbing master
If lookup and rejecting to data exception in various data sources and missing values, then according to timestamp by all data according to when
Between sort, this is conducive to follow-up time series data to visualize.The taxi taking difficulty data include taxi taking difficulty distributed point
Geographical coordinate and weights.The POI data includes the geographical coordinate and POI types of POI distributed points.
(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Partially
Shifting rate reflects the situation of change of POI cum rights liveness.
The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is
It is no less than the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to this POI liveness if condition is met
Weights.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI
Cum rights liveness.
The calculating of POI cum rights liveness deviation ratios is specially:
Offsett=(POIWeightt-Averweek,hour)/(POIWeightt)-1
Wherein, Averweek,hourFor the cum rights of POI per hour liveness average per week, POIWeighttFor current hour
POI cum rights liveness, OffsettFor deviation ratio.
3) same type POI is clustered:Euclidean distance around each taxi taking difficulty distributed point is calculated to be less than or equal in T range
All POI distributed points, are designated as POIdidi.Count POIdidiThe POI distributed points of middle same type, calculate the position of cluster centre,
And set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, the clustering algorithm based on k-means is to POI
Distributed point is clustered, and will calculate latitude and longitude coordinates of the new cluster centre latitude and longitude coordinates as POI centers.
4) air quality and the visual analysis of resident trip, be specially:
(4.1) colour vision is encoded:When mapping color, due to air quality index AQI difference, using dynamic
Mapping scheme, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
(4.2) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, and rectangle is from left to right
Order represents the priority on daily date, and the fill color of rectangle is determined according to the scheme of step 4.1, highly according to air quality index
AQI is determined.Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right,
Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data
A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data.
(4.3) flow graph-accumulation graph analytic unit:The abscissa of the accumulation graph and flow graph scope that refers to fix time is sat per hour
Mark, with per week for basic scale.Ordinate is that POI cum rights enlivens angle value.Represented in accumulation graph with the area-graph of different colours
Different types of POI, accumulation graph is arranged along reference axis one side, and time range one or more POI cum rights liveness is specified in displaying
Situation of change.Flow graph is arranged along coordinate bilateral, and the change feelings of time range one or more POI cum rights liveness are specified in displaying
Condition.
(4.4) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is opening up in terms of scatter diagram higher-dimension
Exhibition, for showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner
Come, and the size of numerical value is represented with shade, show identical POI in different air qualities and temperature feelings by calendar thermal map
The situation of change of POI cum rights liveness deviation ratio under condition.GeoMap be used for show same type POI cluster liveness weights and
Geographical distribution situation.
A kind of air quality driven based on big data and resident trip visual analysis system, the system are included with the following group
Part:
(1) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, rectangle from left to right suitable
Sequence represents the priority on daily date;The height of rectangle determines that fill color uses dynamic mapping side according to air quality index AQI
Case, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right,
Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data
A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data.
(2) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time,
With per week for basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph difference is represented with the area-graph of different colours
The POI of type, accumulation graph is arranged along reference axis one side, and the change of time range one or more POI cum rights liveness is specified in displaying
Change situation.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying,
The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is
It is no less than the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to this POI liveness if condition is met
Weights.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI
Cum rights liveness.
(3) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension,
For showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner,
And the size of numerical value is represented with shade, show identical POI in different air qualities and temperature conditions by calendar thermal map
The situation of change of lower POI cum rights liveness deviation ratio.GeoMap is used for showing the liveness weights and ground of same type POI clusters
Manage distribution situation.
The calculating of liveness weights of same type POI clusters is specially:Calculate around each taxi taking difficulty distributed point
Euclidean distance is less than or equal to POI distributed points all in T range, is designated as POIdidi.Count POIdidiThe POI of middle same type points
Layout, calculate the position of cluster centre, and set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, base
POI distributed points are clustered in k-means clustering algorithm, will calculate new cluster centre latitude and longitude coordinates as
The latitude and longitude coordinates of POI centers.
The beneficial effects of the invention are as follows:Of the invention different from traditional air quality visualization, the present invention, which is proposed, to be directed to
The data visualization of air quality and resident trip, user can explore air quality to global mode again from the overall situation to local
To the liveness situation of change of city different zones, change to the trip purpose of analysis air quality influence resident.Pass through interaction
Means, reduce analysis librarian use system cost, reach good bandwagon effect, system can be from air quality, temperature
Degree, POI cum rights liveness and deviation ratio four levelses illustrate air quality and a variety of rules of resident trip.
Brief description of the drawings
Fig. 1 bar shapeds-box traction substation analytic unit;
Fig. 2 flow graphs-accumulation graph analytic unit;
Fig. 3 scatterplot matrix-GeoMap- calendar thermal map analytic units;
Fig. 4 systems front and back end dependence graph.
Embodiment
It is described in detail with reference to embodiment and accompanying drawing.
The data basis of institute's foundation of the present invention has:Air quality data is each region and environment above protection administrator portion
Door or its environmental monitoring station authorized issue data, including daily paper and Times.The time cycle of Times data is 1 hour, each
The integral point moment issues the real-time report of each monitoring station, and the index reported in real time includes SO2、NO2、O3、CO、PM2.5、PM10Concentration, day
Count off evidence is one day SO2、NO2、O3、CO、PM2.5、PM1024 hour concentration average values;Atmospheric environment data be each region and more than
Meteorology protection administrative responsibile institution or its weather monitoring station authorized issue, including daily paper and Times.Week time of Times data
Phase is 1 hour, and each integral point moment issues the real-time report of each detection website, and the index reported in real time includes air pressure, temperature, wet
The data such as degree, precipitation and wind direction.Daily paper data are one day air pressure, temperature, humidity, precipitation and the hour data of wind direction 24
Average;The taxi taking difficulty data that resident trip data provides for drop drop firmament big data platform, wherein data cycle
For 1 hour, each integral point provided the taxi taking difficulty of different location.Each integral point data include:Longitude, dimension, difficulty or ease of calling a taxi
Degree;POI distributed datas are POI detailed data, include POI addresses, POI titles, POI longitudes, POI latitudes and POI types.
A kind of air quality and resident trip visual analysis method driven based on big data that the present invention is provided, including with
Under several steps:
(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction:First respectively to sky
Gas qualitative data, temperature data, POI data and taxi taking difficulty data carry out data scrubbing and sequence, wherein data scrubbing master
If lookup and rejecting to data exception in various data sources and missing values, then according to timestamp by all data according to when
Between sort, this is conducive to follow-up time series data to visualize.The taxi taking difficulty data include taxi taking difficulty distributed point
Geographical coordinate and weights.The POI data includes the geographical coordinate and POI types of POI distributed points.
(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Partially
Shifting rate reflects the situation of change of POI cum rights liveness.
The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is
It is no to can use 0.5km less than the threshold value T, T that pre-set, the weights of taxi taking difficulty distributed point are set to this if condition is met
The weights of POI liveness.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI
Cum rights liveness.
The calculating of POI cum rights liveness deviation ratios is specially:
Offsett=(POIWeightt-Averweek,hour)/(POIWeightt)-1
Wherein, Averweek,hourFor the cum rights of POI per hour liveness average per week, POIWeighttFor current hour
POI cum rights liveness, OffsettFor deviation ratio.
3) same type POI is clustered:Euclidean distance around each taxi taking difficulty distributed point is calculated to be less than or equal in T range
All POI distributed points, are designated as POIdidi.Count POIdidiThe POI distributed points of middle same type, calculate the position of cluster centre,
And set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, the clustering algorithm based on k-means is to POI
Distributed point is clustered, and will calculate latitude and longitude coordinates of the new cluster centre latitude and longitude coordinates as POI centers.
4) air quality and the visual analysis of resident trip, be specially:
(4.1) colour vision is encoded:When mapping color, due to air quality index AQI difference, using dynamic
Mapping scheme, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
(4.2) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, and rectangle is from left to right
Order represents the priority on daily date, and the fill color of rectangle is determined according to the scheme of step 4.1, highly according to air quality index
AQI is determined.Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right,
Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data
A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data, as shown in Figure 1.
(4.3) flow graph-accumulation graph analytic unit:The abscissa of the accumulation graph and flow graph scope that refers to fix time is sat per hour
Mark, with per week for basic scale.Ordinate is that POI cum rights enlivens angle value.Represented in accumulation graph with the area-graph of different colours
Different types of POI, accumulation graph is arranged along reference axis one side, and time range one or more POI cum rights liveness is specified in displaying
Situation of change.Flow graph is arranged along coordinate bilateral, and the change feelings of time range one or more POI cum rights liveness are specified in displaying
Condition, as shown in Figure 2.
(4.4) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is opening up in terms of scatter diagram higher-dimension
Exhibition, for showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner
Come, and the size of numerical value is represented with shade, show identical POI in different air qualities and temperature feelings by calendar thermal map
The situation of change of POI cum rights liveness deviation ratio under condition.GeoMap be used for show same type POI cluster liveness weights and
Geographical distribution situation, as shown in Figure 3.
A kind of air quality driven based on big data and resident trip visual analysis system, the system are included with the following group
Part:
(1) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, rectangle from left to right suitable
Sequence represents the priority on daily date;The height of rectangle determines that fill color uses dynamic mapping side according to air quality index AQI
Case, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right,
Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data
A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data, as shown in Figure 1.
(2) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time,
With per week for basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph difference is represented with the area-graph of different colours
The POI of type, accumulation graph is arranged along reference axis one side, and the change of time range one or more POI cum rights liveness is specified in displaying
Change situation.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying,
As shown in Figure 2.The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is
It is no less than the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to this POI liveness if condition is met
Weights.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI
Cum rights liveness.
(3) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension,
For showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner,
And the size of numerical value is represented with shade, show identical POI in different air qualities and temperature conditions by calendar thermal map
The situation of change of lower POI cum rights liveness deviation ratio.GeoMap is used for showing the liveness weights and ground of same type POI clusters
Distribution situation is managed, as shown in Figure 3.
The calculating of liveness weights of same type POI clusters is specially:Calculate around each taxi taking difficulty distributed point
Euclidean distance is less than or equal to POI distributed points all in T range, is designated as POIdidi.Count POIdidiThe POI of middle same type points
Layout, calculate the position of cluster centre, and set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, base
POI distributed points are clustered in k-means clustering algorithm, will calculate new cluster centre latitude and longitude coordinates as
The latitude and longitude coordinates of POI centers.
In the preprocessing process of the inventive method, POI cum rights liveness calculates main by each difficulty or ease of calling a taxi of statistics
Spend the cumulative of a surrounding different type POI number and the metering of POI cum rights liveness is obtained with this;POI cum rights liveness
Drift condition of the real-time POI liveness of deviation ratio principal statistical with respect to history POI cum rights liveness averages.By draw column-
Box traction substation, accumulation-flow graph, scatterplot matrix-GeoMap- calendar thermal maps, user is by the interaction between a variety of visualization views, no
Only important references can be provided to explore the travel behaviour of resident, the relevant departments such as traffic, medical treatment can also be caused to air matter
The attention of amount, constructive opinion is provided for relevant departments.
Described above is the case study on implementation that the present invention is provided, and illustrates effective visualization component of a variety of aspects,
Obviously the present invention is not only limited to above-mentioned case study on implementation, without departing from essence spirit of the present invention and without departing from substantive content of the present invention
A variety of deformations can be done on the premise of involved scope to it to be carried out.
Claims (2)
1. a kind of air quality and resident trip visual analysis method driven based on big data, it is characterised in that this method bag
Include following steps:
(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction:First respectively to air matter
Measure data, temperature data, POI data and taxi taking difficulty data and carry out data scrubbing and sequence, wherein data scrubbing is mainly
Lookup and rejecting to data exception in various data sources and missing values, then arrange all data according to the time according to timestamp
Sequence.The taxi taking difficulty data include the geographical coordinate and weights of taxi taking difficulty distributed point.The POI data includes POI
The geographical coordinate and POI types of distributed point.
(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Deviation ratio
Reflect the situation of change of POI cum rights liveness.
The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges whether Euclidean distance is small
In the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to the power of this POI liveness if condition is met
Value.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI cum rights
Liveness.
The calculating of POI cum rights liveness deviation ratios is specially:
Offsett=(POIWeightt-Averweek,hour)/(POIWeightt)-1
Wherein, Averweek,hourFor the cum rights of POI per hour liveness average per week, POIWeighttFor current hour POI cum rights
Liveness, OffsettFor deviation ratio.
3) same type POI is clustered:Euclidean distance around each taxi taking difficulty distributed point is calculated to be less than or equal to own in T range
POI distributed points, be designated as POIdidi.Count POIdidiThe POI distributed points of middle same type, calculate the position of cluster centre, and set
The weights for putting taxi taking difficulty distributed point are the weights of cluster centre.Wherein, the clustering algorithm based on k-means is distributed to POI
Point is clustered, and will calculate latitude and longitude coordinates of the new cluster centre latitude and longitude coordinates as POI centers.
4) air quality and the visual analysis of resident trip, be specially:
(4.1) colour vision is encoded:When mapping color, due to air quality index AQI difference, using dynamic mapping
Scheme, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
(4.2) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, the order of rectangle from left to right
The priority on daily date is represented, the fill color of rectangle is determined according to the scheme of step 4.1, highly according to air quality index AQI
It is determined that.Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right, under
Dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data four
3/1 mono- to four quantiles represent the median of data according to scope, small rectangular center horizontal line position, as shown in Figure 1.
(4.3) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time, with
Per week is basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph inhomogeneity is represented with the area-graph of different colours
The POI of type, accumulation graph is arranged along reference axis one side, and the change of time range one or more POI cum rights liveness is specified in displaying
Situation.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying, such as
Shown in Fig. 2.
(4.4) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension, is used
To show air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner, and
The size of numerical value is represented with shade, shows identical POI under different air qualities and temperature conditions by calendar thermal map
The situation of change of POI cum rights liveness deviation ratios.GeoMap is used for showing the liveness weights and geography of same type POI clusters
Distribution situation, as shown in Figure 3.
2. a kind of air quality driven based on big data and resident trip visual analysis system, it is characterised in that the system bag
Include following component:
(1) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, the sequence list of rectangle from left to right
Show the priority on daily date;The height of rectangle determines that fill color uses dynamic mapping scheme, i.e., according to air quality index AQI
Dynamically adjusted according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation, lower void from left to right
Line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents four points of data
One of to 3/4ths quantiles according to scope, small rectangular center horizontal line position represents the median of data.
(2) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time, with every
Week is basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph different type is represented with the area-graph of different colours
POI, accumulation graph arranged along reference axis one side, and the change feelings of time range one or more POI cum rights liveness are specified in displaying
Condition.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying.POI bands
Power liveness calculating be specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges whether Euclidean distance is small
In the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to the power of this POI liveness if condition is met
Value.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI cum rights
Liveness.
(3) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension, is used for
Show air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner, is used in combination
Shade represents the size of numerical value, passes through calendar thermal map and shows identical POI POI under different air qualities and temperature conditions
The situation of change of cum rights liveness deviation ratio.GeoMap is used for showing the liveness weights and geography point of same type POI clusters
Cloth situation.
The calculating of liveness weights of same type POI clusters is specially:Calculate Euclidean around each taxi taking difficulty distributed point
Distance is less than or equal to POI distributed points all in T range, is designated as POIdidi.Count POIdidiThe POI distributed points of middle same type,
The position of cluster centre is calculated, and sets the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, based on k-
Means clustering algorithm is clustered to POI distributed points, will calculate new cluster centre latitude and longitude coordinates as in POI
The latitude and longitude coordinates of heart position.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710173669.4A CN106991525B (en) | 2017-03-22 | 2017-03-22 | Air quality and resident trip visual analysis method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710173669.4A CN106991525B (en) | 2017-03-22 | 2017-03-22 | Air quality and resident trip visual analysis method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106991525A true CN106991525A (en) | 2017-07-28 |
CN106991525B CN106991525B (en) | 2021-06-18 |
Family
ID=59411741
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710173669.4A Active CN106991525B (en) | 2017-03-22 | 2017-03-22 | Air quality and resident trip visual analysis method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106991525B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110286663A (en) * | 2019-06-28 | 2019-09-27 | 云南中烟工业有限责任公司 | Method for improving based on zonal cigarette physical index standardized production |
CN111380583A (en) * | 2018-12-28 | 2020-07-07 | 株式会社基恩士 | Gas flowmeter |
CN112699284A (en) * | 2021-01-11 | 2021-04-23 | 四川大学 | Bus stop optimization visualization method based on multi-source data |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2561458A1 (en) * | 2005-09-29 | 2007-03-29 | Ronald Kurnik | Ct determination by cluster analysis with variable cluster endpoint |
US7826965B2 (en) * | 2005-06-16 | 2010-11-02 | Yahoo! Inc. | Systems and methods for determining a relevance rank for a point of interest |
US20140163873A1 (en) * | 2011-02-02 | 2014-06-12 | Mapquest, Inc. | Systems and methods for generating electronic map displays with points-of-interest information |
CN105493109A (en) * | 2013-06-05 | 2016-04-13 | 微软技术许可有限责任公司 | Air quality inference using multiple data sources |
CN105679009A (en) * | 2016-02-03 | 2016-06-15 | 西安交通大学 | Taxi-taking/order-receiving POI recommendation system and method based on taxi GPS data mining |
CN105825672A (en) * | 2016-04-11 | 2016-08-03 | 中山大学 | City guidance area extraction method based on floating car data |
-
2017
- 2017-03-22 CN CN201710173669.4A patent/CN106991525B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7826965B2 (en) * | 2005-06-16 | 2010-11-02 | Yahoo! Inc. | Systems and methods for determining a relevance rank for a point of interest |
CA2561458A1 (en) * | 2005-09-29 | 2007-03-29 | Ronald Kurnik | Ct determination by cluster analysis with variable cluster endpoint |
US20140163873A1 (en) * | 2011-02-02 | 2014-06-12 | Mapquest, Inc. | Systems and methods for generating electronic map displays with points-of-interest information |
CN105493109A (en) * | 2013-06-05 | 2016-04-13 | 微软技术许可有限责任公司 | Air quality inference using multiple data sources |
CN105679009A (en) * | 2016-02-03 | 2016-06-15 | 西安交通大学 | Taxi-taking/order-receiving POI recommendation system and method based on taxi GPS data mining |
CN105825672A (en) * | 2016-04-11 | 2016-08-03 | 中山大学 | City guidance area extraction method based on floating car data |
Non-Patent Citations (6)
Title |
---|
HSUN-PING HSIEH ET AL: "Inferring Air Quality for Station Location Recommendation Based on Urban Big Data", 《KDD "15: PROCEEDINGS OF THE 21TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING》 * |
NICK QI ZHU: "《Data Visualization with D3.js Cookbook》", 31 October 2013 * |
ZEQIAN SHEN ET AL: "Visual Analysis of Large Heterogeneous Social Networks by Semantic and Structural Abstraction", 《IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS》 * |
任磊等: "大数据可视分析综述", 《软件学报》 * |
张明月: "基于出租车轨迹的载客点与热点区域推荐", 《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑》 * |
王爽等: "基于城市网络空间的 POI 分布密度分析及可视化", 《城市勘测》 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111380583A (en) * | 2018-12-28 | 2020-07-07 | 株式会社基恩士 | Gas flowmeter |
CN110286663A (en) * | 2019-06-28 | 2019-09-27 | 云南中烟工业有限责任公司 | Method for improving based on zonal cigarette physical index standardized production |
CN110286663B (en) * | 2019-06-28 | 2021-05-25 | 云南中烟工业有限责任公司 | Regional cigarette physical index standardized production improving method |
CN112699284A (en) * | 2021-01-11 | 2021-04-23 | 四川大学 | Bus stop optimization visualization method based on multi-source data |
Also Published As
Publication number | Publication date |
---|---|
CN106991525B (en) | 2021-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Leconte et al. | Using Local Climate Zone scheme for UHI assessment: Evaluation of the method using mobile measurements | |
Zhou et al. | Impacts of weather on public transport ridership: Results from mining data from different sources | |
Yeprintsev et al. | Technologies for creating geographic information resources for monitoring the socio-ecological conditions of cities | |
Kuik et al. | Air quality modelling in the Berlin–Brandenburg region using WRF-Chem v3. 7.1: sensitivity to resolution of model grid and input data | |
Sayegh et al. | Understanding how roadside concentrations of NOx are influenced by the background levels, traffic density, and meteorological conditions using Boosted Regression Trees | |
Ronda et al. | Urban finescale forecasting reveals weather conditions with unprecedented detail | |
Sailor et al. | Development of a national anthropogenic heating database with an extrapolation for international cities | |
Liu et al. | Urban land uses and traffic ‘source-sink areas’: Evidence from GPS-enabled taxi data in Shanghai | |
Zhao et al. | High-resolution daily AOD estimated to full coverage using the random forest model approach in the Beijing-Tianjin-Hebei region | |
Arnds et al. | Spatio-temporal variance and meteorological drivers of the urban heat island in a European city | |
Wu et al. | Analysis of Spatio-temporal patterns and related factors of thermal comfort in subtropical coastal cities based on local climate zones | |
Leconte et al. | Analysis of nocturnal air temperature in districts using mobile measurements and a cooling indicator | |
Ming et al. | Spatial pattern of anthropogenic heat flux in monocentric and polycentric cities: The case of Chengdu and Chongqing | |
Cao et al. | Using a distributed air sensor network to investigate the spatiotemporal patterns of PM2. 5 concentrations | |
CN108877227A (en) | A kind of global dynamic trip requirements estimation method based on multi-source traffic data | |
Zhang et al. | Exploring the effects of 3D urban form on urban air quality: Evidence from fifteen megacities in China | |
Timsic et al. | Spatial variability in stable isotope values of surface waters of Eastern Canada and New England | |
CN106991525A (en) | The air quality and resident trip visual analysis method and system driven based on big data | |
CN110176141A (en) | A kind of traffic zone division method and system based on POI and traffic characteristic | |
Han et al. | Heterogeneity of influential factors across the entire air quality spectrum in Chinese cities: A spatial quantile regression analysis | |
Deng et al. | Unraveling the association between the built environment and air pollution from a geospatial perspective | |
Li et al. | Evaluation and analysis of the impact of coastal urban impervious surfaces on ecological environments | |
Balen et al. | Air quality monitoring device for vehicular ad hoc networks: enviodev | |
Wang et al. | Impacts of spatiotemporally uneven urbanization on sea breeze fronts in a mega-river delta | |
Tole | Changes in the built vs. non-built environment in a rapidly urbanizing region: a case study of the Greater Toronto Area |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |