CN106991525A - The air quality and resident trip visual analysis method and system driven based on big data - Google Patents

The air quality and resident trip visual analysis method and system driven based on big data Download PDF

Info

Publication number
CN106991525A
CN106991525A CN201710173669.4A CN201710173669A CN106991525A CN 106991525 A CN106991525 A CN 106991525A CN 201710173669 A CN201710173669 A CN 201710173669A CN 106991525 A CN106991525 A CN 106991525A
Authority
CN
China
Prior art keywords
poi
liveness
data
cum rights
air quality
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710173669.4A
Other languages
Chinese (zh)
Other versions
CN106991525B (en
Inventor
谢波
姜波
潘伟丰
王家乐
殷骏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Gongshang University
Original Assignee
Zhejiang Gongshang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Gongshang University filed Critical Zhejiang Gongshang University
Priority to CN201710173669.4A priority Critical patent/CN106991525B/en
Publication of CN106991525A publication Critical patent/CN106991525A/en
Application granted granted Critical
Publication of CN106991525B publication Critical patent/CN106991525B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Development Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Game Theory and Decision Science (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Instructional Devices (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses the air quality driven based on big data and resident trip visual analysis method and system, including following steps:(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction;(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Deviation ratio reflects the situation of change of POI cum rights liveness;(3) same type POI is clustered;(4) air quality and the visual analysis of resident trip.The present invention is with low cost, safeguards simple, deployment is rapid, and visualization interface interaction has diversity, and each user can be with multi-granularity analysis air quality and resident trip situation.

Description

The air quality and resident trip visual analysis method and system driven based on big data
Technical field
The present invention relates to the air quality driven based on big data and resident trip visual analysis method and system.
Background technology
Along with the development of China's process of industrialization, with sulfide (SOx), nitride (NOx), ozone (O3), carbide (COx), the pollution problem that the industrial excreta based on particulate matter (particle diameter is less than or equal to 10 μm and 2.5 μm) is caused to air quality Increasingly serious, go off daily and life to people cause extreme influence, show according to investigations, when air quality is poor, people It is more willing to stay indoors to reduce nonessential travel behaviour.
With the development of science and technology, data are largely gathered and stored, data volume is in explosive growth, how from these data In excavate valuable information as urgent problem.When in face of huge and complicated data, traditional data Excavate and data analysing method seems unable to do what one wishes in heuristic data.In order to obtain the value contained in data, various data Analysis is used and given birth to method for digging.
Therefore we need a kind of effective method to solve these problems.In the last few years, as visually to interact Analysis ratiocination science based on interface, visual analysis is data mining, data analysis provides a kind of brand-new means, it with The features such as interactivity, visuality, is welcomed warmly by numerous researchers, turns into study hotspot gradually.
Therefore, for air quality and resident trip visual research for probing between air quality and resident trip Relation it is significant, it not only can for explore resident travel behaviour important references are provided, can also cause traffic, Attention of the relevant departments such as medical treatment to air quality.Therefore probe into air quality and no matter resonable the visual research of resident trip is Still all there is very important researching value in actual applications by upper.
The content of the invention
The problem of present invention is analyzed for air quality and resident trip, designs a kind of air matter driven based on big data Amount and resident trip visual analysis method and system, preferably help the departments such as traffic, medical treatment to air quality and resident trip Analyzed, and a set of visual analysis system is provided and help customer analysis Characteristics of Air Quality, Resident Trip Characteristics, displaying is empty Makings amount bar chart, temperature box traction substation, POI cum rights liveness stack figure and flow graph, POI cum rights liveness deviation ratio calendar thermal maps With multidimensional block diagram, urban air-quality and resident trip are explored.The purpose of the present invention is by the following technical programs Come what is realized:A kind of air quality and resident trip visual analysis method driven based on big data, this method includes following step Suddenly:
(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction:First respectively to sky Gas qualitative data, temperature data, POI data and taxi taking difficulty data carry out data scrubbing and sequence, wherein data scrubbing master If lookup and rejecting to data exception in various data sources and missing values, then according to timestamp by all data according to when Between sort, this is conducive to follow-up time series data to visualize.The taxi taking difficulty data include taxi taking difficulty distributed point Geographical coordinate and weights.The POI data includes the geographical coordinate and POI types of POI distributed points.
(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Partially Shifting rate reflects the situation of change of POI cum rights liveness.
The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is It is no less than the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to this POI liveness if condition is met Weights.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI Cum rights liveness.
The calculating of POI cum rights liveness deviation ratios is specially:
Offsett=(POIWeightt-Averweek,hour)/(POIWeightt)-1
Wherein, Averweek,hourFor the cum rights of POI per hour liveness average per week, POIWeighttFor current hour POI cum rights liveness, OffsettFor deviation ratio.
3) same type POI is clustered:Euclidean distance around each taxi taking difficulty distributed point is calculated to be less than or equal in T range All POI distributed points, are designated as POIdidi.Count POIdidiThe POI distributed points of middle same type, calculate the position of cluster centre, And set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, the clustering algorithm based on k-means is to POI Distributed point is clustered, and will calculate latitude and longitude coordinates of the new cluster centre latitude and longitude coordinates as POI centers.
4) air quality and the visual analysis of resident trip, be specially:
(4.1) colour vision is encoded:When mapping color, due to air quality index AQI difference, using dynamic Mapping scheme, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
(4.2) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, and rectangle is from left to right Order represents the priority on daily date, and the fill color of rectangle is determined according to the scheme of step 4.1, highly according to air quality index AQI is determined.Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right, Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data.
(4.3) flow graph-accumulation graph analytic unit:The abscissa of the accumulation graph and flow graph scope that refers to fix time is sat per hour Mark, with per week for basic scale.Ordinate is that POI cum rights enlivens angle value.Represented in accumulation graph with the area-graph of different colours Different types of POI, accumulation graph is arranged along reference axis one side, and time range one or more POI cum rights liveness is specified in displaying Situation of change.Flow graph is arranged along coordinate bilateral, and the change feelings of time range one or more POI cum rights liveness are specified in displaying Condition.
(4.4) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is opening up in terms of scatter diagram higher-dimension Exhibition, for showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner Come, and the size of numerical value is represented with shade, show identical POI in different air qualities and temperature feelings by calendar thermal map The situation of change of POI cum rights liveness deviation ratio under condition.GeoMap be used for show same type POI cluster liveness weights and Geographical distribution situation.
A kind of air quality driven based on big data and resident trip visual analysis system, the system are included with the following group Part:
(1) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, rectangle from left to right suitable Sequence represents the priority on daily date;The height of rectangle determines that fill color uses dynamic mapping side according to air quality index AQI Case, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right, Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data.
(2) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time, With per week for basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph difference is represented with the area-graph of different colours The POI of type, accumulation graph is arranged along reference axis one side, and the change of time range one or more POI cum rights liveness is specified in displaying Change situation.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying, The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is It is no less than the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to this POI liveness if condition is met Weights.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI Cum rights liveness.
(3) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension, For showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner, And the size of numerical value is represented with shade, show identical POI in different air qualities and temperature conditions by calendar thermal map The situation of change of lower POI cum rights liveness deviation ratio.GeoMap is used for showing the liveness weights and ground of same type POI clusters Manage distribution situation.
The calculating of liveness weights of same type POI clusters is specially:Calculate around each taxi taking difficulty distributed point Euclidean distance is less than or equal to POI distributed points all in T range, is designated as POIdidi.Count POIdidiThe POI of middle same type points Layout, calculate the position of cluster centre, and set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, base POI distributed points are clustered in k-means clustering algorithm, will calculate new cluster centre latitude and longitude coordinates as The latitude and longitude coordinates of POI centers.
The beneficial effects of the invention are as follows:Of the invention different from traditional air quality visualization, the present invention, which is proposed, to be directed to The data visualization of air quality and resident trip, user can explore air quality to global mode again from the overall situation to local To the liveness situation of change of city different zones, change to the trip purpose of analysis air quality influence resident.Pass through interaction Means, reduce analysis librarian use system cost, reach good bandwagon effect, system can be from air quality, temperature Degree, POI cum rights liveness and deviation ratio four levelses illustrate air quality and a variety of rules of resident trip.
Brief description of the drawings
Fig. 1 bar shapeds-box traction substation analytic unit;
Fig. 2 flow graphs-accumulation graph analytic unit;
Fig. 3 scatterplot matrix-GeoMap- calendar thermal map analytic units;
Fig. 4 systems front and back end dependence graph.
Embodiment
It is described in detail with reference to embodiment and accompanying drawing.
The data basis of institute's foundation of the present invention has:Air quality data is each region and environment above protection administrator portion Door or its environmental monitoring station authorized issue data, including daily paper and Times.The time cycle of Times data is 1 hour, each The integral point moment issues the real-time report of each monitoring station, and the index reported in real time includes SO2、NO2、O3、CO、PM2.5、PM10Concentration, day Count off evidence is one day SO2、NO2、O3、CO、PM2.5、PM1024 hour concentration average values;Atmospheric environment data be each region and more than Meteorology protection administrative responsibile institution or its weather monitoring station authorized issue, including daily paper and Times.Week time of Times data Phase is 1 hour, and each integral point moment issues the real-time report of each detection website, and the index reported in real time includes air pressure, temperature, wet The data such as degree, precipitation and wind direction.Daily paper data are one day air pressure, temperature, humidity, precipitation and the hour data of wind direction 24 Average;The taxi taking difficulty data that resident trip data provides for drop drop firmament big data platform, wherein data cycle For 1 hour, each integral point provided the taxi taking difficulty of different location.Each integral point data include:Longitude, dimension, difficulty or ease of calling a taxi Degree;POI distributed datas are POI detailed data, include POI addresses, POI titles, POI longitudes, POI latitudes and POI types.
A kind of air quality and resident trip visual analysis method driven based on big data that the present invention is provided, including with Under several steps:
(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction:First respectively to sky Gas qualitative data, temperature data, POI data and taxi taking difficulty data carry out data scrubbing and sequence, wherein data scrubbing master If lookup and rejecting to data exception in various data sources and missing values, then according to timestamp by all data according to when Between sort, this is conducive to follow-up time series data to visualize.The taxi taking difficulty data include taxi taking difficulty distributed point Geographical coordinate and weights.The POI data includes the geographical coordinate and POI types of POI distributed points.
(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Partially Shifting rate reflects the situation of change of POI cum rights liveness.
The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is It is no to can use 0.5km less than the threshold value T, T that pre-set, the weights of taxi taking difficulty distributed point are set to this if condition is met The weights of POI liveness.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI Cum rights liveness.
The calculating of POI cum rights liveness deviation ratios is specially:
Offsett=(POIWeightt-Averweek,hour)/(POIWeightt)-1
Wherein, Averweek,hourFor the cum rights of POI per hour liveness average per week, POIWeighttFor current hour POI cum rights liveness, OffsettFor deviation ratio.
3) same type POI is clustered:Euclidean distance around each taxi taking difficulty distributed point is calculated to be less than or equal in T range All POI distributed points, are designated as POIdidi.Count POIdidiThe POI distributed points of middle same type, calculate the position of cluster centre, And set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, the clustering algorithm based on k-means is to POI Distributed point is clustered, and will calculate latitude and longitude coordinates of the new cluster centre latitude and longitude coordinates as POI centers.
4) air quality and the visual analysis of resident trip, be specially:
(4.1) colour vision is encoded:When mapping color, due to air quality index AQI difference, using dynamic Mapping scheme, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
(4.2) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, and rectangle is from left to right Order represents the priority on daily date, and the fill color of rectangle is determined according to the scheme of step 4.1, highly according to air quality index AQI is determined.Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right, Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data, as shown in Figure 1.
(4.3) flow graph-accumulation graph analytic unit:The abscissa of the accumulation graph and flow graph scope that refers to fix time is sat per hour Mark, with per week for basic scale.Ordinate is that POI cum rights enlivens angle value.Represented in accumulation graph with the area-graph of different colours Different types of POI, accumulation graph is arranged along reference axis one side, and time range one or more POI cum rights liveness is specified in displaying Situation of change.Flow graph is arranged along coordinate bilateral, and the change feelings of time range one or more POI cum rights liveness are specified in displaying Condition, as shown in Figure 2.
(4.4) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is opening up in terms of scatter diagram higher-dimension Exhibition, for showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner Come, and the size of numerical value is represented with shade, show identical POI in different air qualities and temperature feelings by calendar thermal map The situation of change of POI cum rights liveness deviation ratio under condition.GeoMap be used for show same type POI cluster liveness weights and Geographical distribution situation, as shown in Figure 3.
A kind of air quality driven based on big data and resident trip visual analysis system, the system are included with the following group Part:
(1) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, rectangle from left to right suitable Sequence represents the priority on daily date;The height of rectangle determines that fill color uses dynamic mapping side according to air quality index AQI Case, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right, Lower dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data A quarter is to 3/4ths quantiles according to scope, and small rectangular center horizontal line position represents the median of data, as shown in Figure 1.
(2) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time, With per week for basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph difference is represented with the area-graph of different colours The POI of type, accumulation graph is arranged along reference axis one side, and the change of time range one or more POI cum rights liveness is specified in displaying Change situation.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying, As shown in Figure 2.The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges that Euclidean distance is It is no less than the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to this POI liveness if condition is met Weights.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI Cum rights liveness.
(3) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension, For showing air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner, And the size of numerical value is represented with shade, show identical POI in different air qualities and temperature conditions by calendar thermal map The situation of change of lower POI cum rights liveness deviation ratio.GeoMap is used for showing the liveness weights and ground of same type POI clusters Distribution situation is managed, as shown in Figure 3.
The calculating of liveness weights of same type POI clusters is specially:Calculate around each taxi taking difficulty distributed point Euclidean distance is less than or equal to POI distributed points all in T range, is designated as POIdidi.Count POIdidiThe POI of middle same type points Layout, calculate the position of cluster centre, and set the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, base POI distributed points are clustered in k-means clustering algorithm, will calculate new cluster centre latitude and longitude coordinates as The latitude and longitude coordinates of POI centers.
In the preprocessing process of the inventive method, POI cum rights liveness calculates main by each difficulty or ease of calling a taxi of statistics Spend the cumulative of a surrounding different type POI number and the metering of POI cum rights liveness is obtained with this;POI cum rights liveness Drift condition of the real-time POI liveness of deviation ratio principal statistical with respect to history POI cum rights liveness averages.By draw column- Box traction substation, accumulation-flow graph, scatterplot matrix-GeoMap- calendar thermal maps, user is by the interaction between a variety of visualization views, no Only important references can be provided to explore the travel behaviour of resident, the relevant departments such as traffic, medical treatment can also be caused to air matter The attention of amount, constructive opinion is provided for relevant departments.
Described above is the case study on implementation that the present invention is provided, and illustrates effective visualization component of a variety of aspects, Obviously the present invention is not only limited to above-mentioned case study on implementation, without departing from essence spirit of the present invention and without departing from substantive content of the present invention A variety of deformations can be done on the premise of involved scope to it to be carried out.

Claims (2)

1. a kind of air quality and resident trip visual analysis method driven based on big data, it is characterised in that this method bag Include following steps:
(1) original air qualitative data, temperature data, POI data and taxi taking difficulty data reconstruction:First respectively to air matter Measure data, temperature data, POI data and taxi taking difficulty data and carry out data scrubbing and sequence, wherein data scrubbing is mainly Lookup and rejecting to data exception in various data sources and missing values, then arrange all data according to the time according to timestamp Sequence.The taxi taking difficulty data include the geographical coordinate and weights of taxi taking difficulty distributed point.The POI data includes POI The geographical coordinate and POI types of distributed point.
(2) POI cum rights liveness and deviation ratio are calculated:The size of flow of the people around POI cum rights liveness reflection POI;Deviation ratio Reflect the situation of change of POI cum rights liveness.
The calculating of POI cum rights liveness is specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges whether Euclidean distance is small In the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to the power of this POI liveness if condition is met Value.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI cum rights Liveness.
The calculating of POI cum rights liveness deviation ratios is specially:
Offsett=(POIWeightt-Averweek,hour)/(POIWeightt)-1
Wherein, Averweek,hourFor the cum rights of POI per hour liveness average per week, POIWeighttFor current hour POI cum rights Liveness, OffsettFor deviation ratio.
3) same type POI is clustered:Euclidean distance around each taxi taking difficulty distributed point is calculated to be less than or equal to own in T range POI distributed points, be designated as POIdidi.Count POIdidiThe POI distributed points of middle same type, calculate the position of cluster centre, and set The weights for putting taxi taking difficulty distributed point are the weights of cluster centre.Wherein, the clustering algorithm based on k-means is distributed to POI Point is clustered, and will calculate latitude and longitude coordinates of the new cluster centre latitude and longitude coordinates as POI centers.
4) air quality and the visual analysis of resident trip, be specially:
(4.1) colour vision is encoded:When mapping color, due to air quality index AQI difference, using dynamic mapping Scheme, i.e., dynamically adjust according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
(4.2) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, the order of rectangle from left to right The priority on daily date is represented, the fill color of rectangle is determined according to the scheme of step 4.1, highly according to air quality index AQI It is determined that.Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation from left to right, under Dotted line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents data four 3/1 mono- to four quantiles represent the median of data according to scope, small rectangular center horizontal line position, as shown in Figure 1.
(4.3) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time, with Per week is basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph inhomogeneity is represented with the area-graph of different colours The POI of type, accumulation graph is arranged along reference axis one side, and the change of time range one or more POI cum rights liveness is specified in displaying Situation.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying, such as Shown in Fig. 2.
(4.4) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension, is used To show air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner, and The size of numerical value is represented with shade, shows identical POI under different air qualities and temperature conditions by calendar thermal map The situation of change of POI cum rights liveness deviation ratios.GeoMap is used for showing the liveness weights and geography of same type POI clusters Distribution situation, as shown in Figure 3.
2. a kind of air quality driven based on big data and resident trip visual analysis system, it is characterised in that the system bag Include following component:
(1) bar shaped-box traction substation analytic unit:Daily air quality index shows with rectangle, the sequence list of rectangle from left to right Show the priority on daily date;The height of rectangle determines that fill color uses dynamic mapping scheme, i.e., according to air quality index AQI Dynamically adjusted according to air quality index value:
Wherein ColorrectFor the fill color of rectangle.
Temperature when box traction substation represents per weekly, box traction substation represents weekly dotted line on the priority on date, box traction substation, lower void from left to right Line represents a quarter data area and lower a quarter data area respectively, and the small rectangle in box traction substation center represents four points of data One of to 3/4ths quantiles according to scope, small rectangular center horizontal line position represents the median of data.
(2) flow graph-accumulation graph analytic unit:The abscissa of accumulation graph and flow graph refers to scope coordinate per hour of fixing time, with every Week is basic scale.Ordinate is that POI cum rights enlivens angle value.In accumulation graph different type is represented with the area-graph of different colours POI, accumulation graph arranged along reference axis one side, and the change feelings of time range one or more POI cum rights liveness are specified in displaying Condition.Flow graph is arranged along coordinate bilateral, and the situation of change of time range one or more POI cum rights liveness is specified in displaying.POI bands Power liveness calculating be specially:
(2.1) Euclidean distance between taxi taking difficulty distributed point and each POI distributed points is calculated, judges whether Euclidean distance is small In the threshold value T pre-set, the weights of taxi taking difficulty distributed point are set to the power of this POI liveness if condition is met Value.
(2.2) the cumulative of all kinds POI liveness is counted according to POI types difference respectively and is used as this type POI cum rights Liveness.
(3) scatterplot matrix-GeoMap- calendars thermal map analytic unit:Scatterplot matrix diagram is the expansion in terms of scatter diagram higher-dimension, is used for Show air quality, temperature and POI cum rights liveness.Calendar thermal map shows multidimensional data in a two-dimensional manner, is used in combination Shade represents the size of numerical value, passes through calendar thermal map and shows identical POI POI under different air qualities and temperature conditions The situation of change of cum rights liveness deviation ratio.GeoMap is used for showing the liveness weights and geography point of same type POI clusters Cloth situation.
The calculating of liveness weights of same type POI clusters is specially:Calculate Euclidean around each taxi taking difficulty distributed point Distance is less than or equal to POI distributed points all in T range, is designated as POIdidi.Count POIdidiThe POI distributed points of middle same type, The position of cluster centre is calculated, and sets the weights of taxi taking difficulty distributed point to be the weights of cluster centre.Wherein, based on k- Means clustering algorithm is clustered to POI distributed points, will calculate new cluster centre latitude and longitude coordinates as in POI The latitude and longitude coordinates of heart position.
CN201710173669.4A 2017-03-22 2017-03-22 Air quality and resident trip visual analysis method and system Active CN106991525B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710173669.4A CN106991525B (en) 2017-03-22 2017-03-22 Air quality and resident trip visual analysis method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710173669.4A CN106991525B (en) 2017-03-22 2017-03-22 Air quality and resident trip visual analysis method and system

Publications (2)

Publication Number Publication Date
CN106991525A true CN106991525A (en) 2017-07-28
CN106991525B CN106991525B (en) 2021-06-18

Family

ID=59411741

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710173669.4A Active CN106991525B (en) 2017-03-22 2017-03-22 Air quality and resident trip visual analysis method and system

Country Status (1)

Country Link
CN (1) CN106991525B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110286663A (en) * 2019-06-28 2019-09-27 云南中烟工业有限责任公司 Method for improving based on zonal cigarette physical index standardized production
CN111380583A (en) * 2018-12-28 2020-07-07 株式会社基恩士 Gas flowmeter
CN112699284A (en) * 2021-01-11 2021-04-23 四川大学 Bus stop optimization visualization method based on multi-source data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2561458A1 (en) * 2005-09-29 2007-03-29 Ronald Kurnik Ct determination by cluster analysis with variable cluster endpoint
US7826965B2 (en) * 2005-06-16 2010-11-02 Yahoo! Inc. Systems and methods for determining a relevance rank for a point of interest
US20140163873A1 (en) * 2011-02-02 2014-06-12 Mapquest, Inc. Systems and methods for generating electronic map displays with points-of-interest information
CN105493109A (en) * 2013-06-05 2016-04-13 微软技术许可有限责任公司 Air quality inference using multiple data sources
CN105679009A (en) * 2016-02-03 2016-06-15 西安交通大学 Taxi-taking/order-receiving POI recommendation system and method based on taxi GPS data mining
CN105825672A (en) * 2016-04-11 2016-08-03 中山大学 City guidance area extraction method based on floating car data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7826965B2 (en) * 2005-06-16 2010-11-02 Yahoo! Inc. Systems and methods for determining a relevance rank for a point of interest
CA2561458A1 (en) * 2005-09-29 2007-03-29 Ronald Kurnik Ct determination by cluster analysis with variable cluster endpoint
US20140163873A1 (en) * 2011-02-02 2014-06-12 Mapquest, Inc. Systems and methods for generating electronic map displays with points-of-interest information
CN105493109A (en) * 2013-06-05 2016-04-13 微软技术许可有限责任公司 Air quality inference using multiple data sources
CN105679009A (en) * 2016-02-03 2016-06-15 西安交通大学 Taxi-taking/order-receiving POI recommendation system and method based on taxi GPS data mining
CN105825672A (en) * 2016-04-11 2016-08-03 中山大学 City guidance area extraction method based on floating car data

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
HSUN-PING HSIEH ET AL: "Inferring Air Quality for Station Location Recommendation Based on Urban Big Data", 《KDD "15: PROCEEDINGS OF THE 21TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING》 *
NICK QI ZHU: "《Data Visualization with D3.js Cookbook》", 31 October 2013 *
ZEQIAN SHEN ET AL: "Visual Analysis of Large Heterogeneous Social Networks by Semantic and Structural Abstraction", 《IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS》 *
任磊等: "大数据可视分析综述", 《软件学报》 *
张明月: "基于出租车轨迹的载客点与热点区域推荐", 《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑》 *
王爽等: "基于城市网络空间的 POI 分布密度分析及可视化", 《城市勘测》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111380583A (en) * 2018-12-28 2020-07-07 株式会社基恩士 Gas flowmeter
CN110286663A (en) * 2019-06-28 2019-09-27 云南中烟工业有限责任公司 Method for improving based on zonal cigarette physical index standardized production
CN110286663B (en) * 2019-06-28 2021-05-25 云南中烟工业有限责任公司 Regional cigarette physical index standardized production improving method
CN112699284A (en) * 2021-01-11 2021-04-23 四川大学 Bus stop optimization visualization method based on multi-source data

Also Published As

Publication number Publication date
CN106991525B (en) 2021-06-18

Similar Documents

Publication Publication Date Title
Leconte et al. Using Local Climate Zone scheme for UHI assessment: Evaluation of the method using mobile measurements
Zhou et al. Impacts of weather on public transport ridership: Results from mining data from different sources
Yeprintsev et al. Technologies for creating geographic information resources for monitoring the socio-ecological conditions of cities
Kuik et al. Air quality modelling in the Berlin–Brandenburg region using WRF-Chem v3. 7.1: sensitivity to resolution of model grid and input data
Sayegh et al. Understanding how roadside concentrations of NOx are influenced by the background levels, traffic density, and meteorological conditions using Boosted Regression Trees
Ronda et al. Urban finescale forecasting reveals weather conditions with unprecedented detail
Sailor et al. Development of a national anthropogenic heating database with an extrapolation for international cities
Liu et al. Urban land uses and traffic ‘source-sink areas’: Evidence from GPS-enabled taxi data in Shanghai
Zhao et al. High-resolution daily AOD estimated to full coverage using the random forest model approach in the Beijing-Tianjin-Hebei region
Arnds et al. Spatio-temporal variance and meteorological drivers of the urban heat island in a European city
Wu et al. Analysis of Spatio-temporal patterns and related factors of thermal comfort in subtropical coastal cities based on local climate zones
Leconte et al. Analysis of nocturnal air temperature in districts using mobile measurements and a cooling indicator
Ming et al. Spatial pattern of anthropogenic heat flux in monocentric and polycentric cities: The case of Chengdu and Chongqing
Cao et al. Using a distributed air sensor network to investigate the spatiotemporal patterns of PM2. 5 concentrations
CN108877227A (en) A kind of global dynamic trip requirements estimation method based on multi-source traffic data
Zhang et al. Exploring the effects of 3D urban form on urban air quality: Evidence from fifteen megacities in China
Timsic et al. Spatial variability in stable isotope values of surface waters of Eastern Canada and New England
CN106991525A (en) The air quality and resident trip visual analysis method and system driven based on big data
CN110176141A (en) A kind of traffic zone division method and system based on POI and traffic characteristic
Han et al. Heterogeneity of influential factors across the entire air quality spectrum in Chinese cities: A spatial quantile regression analysis
Deng et al. Unraveling the association between the built environment and air pollution from a geospatial perspective
Li et al. Evaluation and analysis of the impact of coastal urban impervious surfaces on ecological environments
Balen et al. Air quality monitoring device for vehicular ad hoc networks: enviodev
Wang et al. Impacts of spatiotemporally uneven urbanization on sea breeze fronts in a mega-river delta
Tole Changes in the built vs. non-built environment in a rapidly urbanizing region: a case study of the Greater Toronto Area

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant