CN113487465B - City overlapping structure characteristic detection method and system based on label propagation algorithm - Google Patents
City overlapping structure characteristic detection method and system based on label propagation algorithm Download PDFInfo
- Publication number
- CN113487465B CN113487465B CN202110691291.3A CN202110691291A CN113487465B CN 113487465 B CN113487465 B CN 113487465B CN 202110691291 A CN202110691291 A CN 202110691291A CN 113487465 B CN113487465 B CN 113487465B
- Authority
- CN
- China
- Prior art keywords
- city
- layer
- node
- urban
- index
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims description 8
- 230000003993 interaction Effects 0.000 claims abstract description 48
- 238000000034 method Methods 0.000 claims abstract description 37
- 238000005259 measurement Methods 0.000 claims abstract description 29
- 238000012549 training Methods 0.000 claims abstract description 15
- 230000006870 function Effects 0.000 claims description 16
- 230000002452 interceptive effect Effects 0.000 claims description 16
- 238000011160 research Methods 0.000 claims description 14
- 238000000638 solvent extraction Methods 0.000 claims description 10
- 238000007781 pre-processing Methods 0.000 claims description 6
- 238000004458 analytical method Methods 0.000 claims description 4
- 239000011159 matrix material Substances 0.000 claims description 4
- 238000012805 post-processing Methods 0.000 claims description 4
- 238000000605 extraction Methods 0.000 claims description 3
- 238000012545 processing Methods 0.000 claims description 3
- 229910052739 hydrogen Inorganic materials 0.000 claims 1
- 239000001257 hydrogen Substances 0.000 claims 1
- 125000004435 hydrogen atom Chemical class [H]* 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 6
- 230000008901 benefit Effects 0.000 abstract description 4
- 230000008569 process Effects 0.000 description 8
- 241000894007 species Species 0.000 description 4
- 238000011161 development Methods 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 230000006399 behavior Effects 0.000 description 1
- 230000000739 chaotic effect Effects 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000003696 structure analysis method Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Tourism & Hospitality (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Development Economics (AREA)
- Educational Administration (AREA)
- Evolutionary Computation (AREA)
- Health & Medical Sciences (AREA)
- Economics (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention relates to the field of urban planning, and provides a method and a system for detecting urban overlapping structure features based on a label propagation algorithm, wherein the method comprises the following steps: obtaining city grids and processed trajectory data; carrying out weighted matching on the urban grids and the processed trajectory data to obtain a four-layer directed weighting network; inputting a graph division model into a four-layer directed weighting network for unsupervised training to obtain a four-layer urban community structure, and extracting an overlapping structure in the four-layer urban community structure; and constructing a measurement index through the point-of-interest data, and identifying the overlapped structure through the measurement index to obtain the land use characteristics and the space interaction mode of the overlapped structure. The method introduces the method for dividing the images in the network science into city planning, has better benefit, and can divide the city structure in batch and automatically; and the spatial interaction information of the city hidden in the urban resident activity can be fully excavated, and the land utilization characteristics and the spatial interaction relationship of the city are excavated.
Description
Technical Field
The invention relates to the field of urban planning, in particular to a method and a system for detecting urban overlapping structure characteristics based on a label propagation algorithm.
Background
In the past thirty years, sufficient labor, good infrastructure and cheap land brought by the urbanization process lay a foundation for rapid development of economy. But can not avoid a plurality of problems in the urbanization process of China. The urban problem is particularly serious for some provincial cities or metropolis. The urban diseases are mainly manifested by traffic jam, housing shortage, water supply shortage, energy shortage, environmental deterioration and the like, which cause burden to cities and even restrict the development of the cities. The development structure of the city is closely related to the life and economy of urban residents, the urban spatial structure is determined by combining human activities through scientific means, an operable, scientific and reasonable spatial structure analysis method is provided, and the method becomes an important direction of digital urban research.
The urban structure is gradually complicated and diversified, obvious hierarchy and overlapping are achieved, the urban areas with different hierarchies have obvious hierarchical overlapping relations, the hierarchical overlapping relations are discussed from human activities, the regional change and the spatial distribution of the urban spatial structure can be gradually mastered from local parts to the whole, the interaction between the urban overlapping structure and other plots is far larger than the interaction between the urban overlapping structure and other plots, the urban overlapping structure can be understood as a junction area of urban spatial interaction, and the interaction can be calculated through human behaviors. Some experts have made relevant researches on city structure division methods, which can be divided into statistical survey-based methods and model-based methods, but currently, researches on city structure hierarchy and city overlap are still few. The method based on statistical survey combines survey statistics and expert evaluation modes for carrying out the demarcation, namely, in the process of demarcation of the urban structure, on the basis of on-site survey statistical results, a plurality of experts which have certain cognition on the city and have higher representativeness and authority are selected for carrying out the evaluation. This method is generally associated with great subjectivity and high time, labor and capital costs. The method based on the model is used for defining urban areas through scientific data analysis and big data mining methods under the support of public-source geographic big data, and provides an operable, scientific and reasonable space optimization model. The multi-source geographic data has the advantages of large data volume, strong current, rich sources, low cost and the like. Based on the characteristic of collecting the multi-source geographic data from bottom to top, researchers can easily obtain the spatial-temporal information which is wide in city range, abundant in mass and based on individuals, so that fine geographic analysis and modeling are achieved, and better service is provided for researching city structures.
The above is only for the purpose of assisting understanding of the technical aspects of the present invention, and does not represent an admission that the above is prior art.
Disclosure of Invention
The invention mainly aims to solve the technical problems of high subjectivity, high time, labor and capital cost in the prior art, and can detect the level overlapping structure in a city and identify the characteristics of the level overlapping structure.
In order to achieve the above object, the present invention provides a method for detecting urban overlapping structure features based on a label propagation algorithm, comprising the steps of:
s1: obtaining city map data, taxi track data and interest point data in a research area, and carrying out city unit division on the city map data to obtain city grids; preprocessing the taxi track data to obtain processed track data;
s2: carrying out weighted matching on the urban grids and the processed trajectory data to obtain a four-layer directed weighting network;
s3: carrying out unsupervised training on the four-layer directed weighting network input graph division model, obtaining four-layer urban community structures after the training is finished, and extracting an overlapping structure in the four-layer urban community structures;
s4: and constructing a measurement index through the point of interest data, and identifying the overlapped structure through the measurement index to obtain the land use characteristics and the space interaction mode of the overlapped structure.
Preferably, step S1 is specifically:
s11: processing the urban map data through GIS software, performing spatial fishing net analysis on urban areas of the urban map data, and dividing the urban areas into urban grids; the city grid comprises a plurality of 500mx500m city grid cells;
s12: removing the point data and invalid point data which are not in the urban area in the taxi track data to obtain removed track data;
s13: and extracting the data of the point of getting on or off the vehicle in each piece of the eliminated track data, wherein the processed track data is a set of the data of the point of getting on or off the vehicle.
Preferably, step S2 is specifically:
s21: matching each getting-on and getting-off point data in the processed track data with each city grid unit, simulating each city grid unit into a graph node, and simulating the number of interaction times among the city grid units into the weight of an edge;
s22: the directed weighting network is composed of a plurality of city grid units and interactive relations among the city grid units, and the interactive relations are related to the interactive times among the city grid units;
s23: and path layering is carried out on the processed track data, a first layer is used when the track path is less than 3km, a second layer is used when the track path is less than 5km, a third layer is used when the track path is less than 9km, all the track paths are fourth layers, corresponding directed weighting networks are respectively constructed on the first layer, the second layer, the third layer and the fourth layer, and the four-layer directed weighting networks are obtained.
Preferably, step S3 is specifically:
s31: initializing the memory of each node in the graph partitioning model by using the id of the corresponding node, and obtaining a corresponding unique label by each node;
s32: selecting a certain node as a listener node;
s33: all adjacent nodes of the listener node send own unique labels to the listener node, and the listener node selects the most popular label from all the received labels;
s34: repeating the steps S32-S33 for n times, traversing all the nodes, and obtaining the most popular labels of all the nodes;
s35: post-processing all the labels of the nodes to obtain the four-layer urban community structure, and evaluating the division result of the four-layer urban community structure through an overlapping modularity function, wherein the overlapping modularity function specifically comprises the following steps:
wherein m is the sum of the weights of the edges in the network, A is a weighted adjacency matrix of the network, and if an edge exists between a node v and a node w, A is vw The weight of vw edge is, otherwise, 0 is adopted; k is a radical of v ,k w Respectively, the out-degree weight of the node v and the in-degree weight of the node w, O v ,O w The number of communities to which the node v and the node w belong respectively;
s36: and extracting an overlapping structure in the four-layer city community structure.
Preferably, in step S4, the measurement index includes: measuring indexes of richness, Simpson indexes and entropies;
the land use condition and the function type can be identified through the richness; the land mixing condition can be identified through the Simpson index and the entropy measurement index;
the formula of the richness is specifically as follows:
F i,l denotes the enrichment index, n, of POIs of class I in the ith plot l,i Representing the number of class I land use types in the ith plot, n i Is the number of all POIs in the ith parcel. N is a radical of l N is the total number of POIs of class i, and N is the total number of POIs in the whole research area;
the Simpson index and the entropy measurement index are expressed by a Hill index, and the formula is as follows:
in the formula, D represents the value of the Hill index, p u Representing the proportion of u type POI; when q is 1, it represents entropy, and higher values indicate that the distribution of POI species is more disordered, and lower values indicate that the distribution of POI species is more ordered; q is 2, the inverse of the happson index, which measures the probability that two POIs randomly selected from an urban area belong to the same category; thus, it takes into account both the abundance of POIs and the relative abundance of different types of POIs, with lower values indicating a higher degree of mixed use of land and higher values indicating a lower degree of mixed use of land.
Preferably, in step S4;
the land use characteristics are land use types and land mixing degrees, the functional areas are structures which show certain functional attributes in urban areas, such as residential areas and scenic areas, and the functional structures and the function mixing degrees of the overlapped structures can be measured through the richness; the POI mixing degree of the overlapped structure can be calculated through the Simpson index and the entropy measurement index, and the POI mixing degree can reflect the vitality of the land;
the space interaction mode represents the interaction condition of the overlapping area and the adjacent community, the interaction strength is expressed through the interaction times, an interaction network is built, the travel mode of people is analyzed through the trajectory flow from one functional area to another functional area, the travel mode has a fixed time law such as the interaction mode from a living area to a working area in the morning rush hour and the interaction mode from the living area to a leisure area in the holiday, the interaction hotspot area of the overlapping area is analyzed through the interaction strength, and the functional structure of the interaction area is further identified through the richness.
A city overlapping structure feature detection system based on a label propagation algorithm comprises the following modules:
the data acquisition module is used for acquiring city map data, taxi track data and interest point data in a research area, and performing city unit division on the city map data to obtain city grids; preprocessing the taxi track data to obtain processed track data;
the four-layer directional weighting network acquisition module is used for carrying out weighted matching on the urban grid and the processed track data to obtain a four-layer directional weighting network;
the overlapping structure extraction module is used for carrying out unsupervised training on the four-layer directed weighting network input graph division model, obtaining four-layer urban community structures after the training is finished, and extracting an overlapping structure in the four-layer urban community structures;
and the identification module is used for constructing a measurement index through the interest point data, identifying the overlapped structure through the measurement index and obtaining the land use characteristics and the space interaction mode of the overlapped structure.
The invention has the following beneficial effects:
1. the invention adopts an analogy reasoning method, introduces a method for dividing the network science into city planning, has better benefit, and can divide the city structure in batch and automatically;
2. the invention can fully mine the spatial interaction information of the city hidden in the urban resident activity, pay attention to the overlapping structure existing in the network, and mine the land utilization characteristics and the spatial interaction relationship thereof.
Drawings
FIG. 1 is a flow chart of a method according to an embodiment of the present invention;
FIG. 2 is a system block diagram of an embodiment of the present invention;
the implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Referring to fig. 1, the invention provides a method for detecting characteristics of an urban overlapping structure based on a label propagation algorithm, which is an extension method based on a model method, and applies the idea of network science to urban division on the basis of the previous space division model research, and detects the overlapping structure existing in the city; the method is based on the idea of label propagation in graph division, adopts an analogy reasoning method, aims at the hierarchy and the overlapping property of cities in the urbanization process, discusses the hierarchy overlapping property from human activities, and is beneficial to gradually grasp the regional change and the spatial distribution of the city space structure from local to whole; the method effectively introduces the concept of overlapping nodes in the complex network into city division, combines taxi track data, carries out community detection on long and short distance city community structures, fully excavates spatial interaction information of cities hidden in city resident activities, pays attention to overlapping structures in the city community structures, and excavates land utilization types and spatial interaction relations of the overlapping structures;
the method specifically comprises the following steps:
s1: obtaining city map data, taxi track data and interest point data in a research area, and carrying out city unit division on the city map data to obtain city grids; preprocessing the taxi track data to obtain processed track data;
s2: carrying out weighted matching on the urban grids and the processed trajectory data to obtain a four-layer directed weighting network;
s3: carrying out unsupervised training on the four-layer directed weighting network input graph division model, obtaining four-layer urban community structures after the training is finished, and extracting an overlapping structure in the four-layer urban community structures;
s4: and constructing a measurement index according to the interest point data, and identifying the overlapping structure according to the measurement index to obtain the land use characteristics and the space interaction mode of the overlapping structure.
In this embodiment, step S1 specifically includes:
s11: processing the urban map data through GIS software, performing spatial fishing net analysis on urban areas of the urban map data, and dividing the urban areas into urban grids; the city grid comprises a plurality of 500mx500m city grid cells; in this embodiment, 4853 city grid cells are obtained altogether;
s12: removing the point data and invalid point data which are not in the urban area in the taxi track data to obtain removed track data;
s13: extracting the boarding and alighting point data in each eliminated track data, wherein the processed track data is a set of the boarding and alighting point data; in this embodiment, 793253 pieces of boarding and alighting point data are obtained in total.
In this embodiment, step S2 specifically includes:
s21: matching each getting-on and getting-off point data in the processed track data with each city grid unit, simulating each city grid unit into a graph node, and simulating the number of interaction times among the city grid units into the weight of an edge;
s22: the directed weighting network is composed of a plurality of city grid units and interactive relations among the city grid units, and the interactive relations are related to the interactive times among the city grid units; each city grid unit interacts with a plurality of other city grid units;
s23: path layering is carried out on the processed track data, a first layer is used when the track path is less than 3km, a second layer is used when the track path is less than 5km, a third layer is used when the track path is less than 9km, all the track paths are fourth layers, corresponding directed weighting networks are respectively constructed on the first layer, the second layer, the third layer and the fourth layer, and the four layers of directed weighting networks are obtained;
in the specific implementation, the route layering threshold is determined according to the track proportion of each layer of route, and the tracks less than 3km, less than 5km and less than 9km respectively account for 29.3301%, 51.1679% and 76.0485% of the total track; meanwhile, the network structure of the four layers of directional weighting networks can be changed according to requirements.
In this embodiment, step S3 specifically includes:
s31: initializing the memory of each node in the graph partitioning model by using the id of the corresponding node, and obtaining a corresponding unique label by each node;
s32: selecting a certain node as a listener node;
s33: all adjacent nodes of the listener node send own unique labels to the listener node, and the listener node selects the most popular label from all the received labels;
s34: repeating the steps S32-S33 for n times, traversing all the nodes and obtaining the most popular labels of all the nodes;
in the specific implementation, when a four-layer directed weighting network input graph partitioning model is subjected to unsupervised training, an SLPA model parameter needs to be set, in the graph partitioning model, the iteration number of the model is set to be 100, the number of communities of minimum partitioning is set to be 3, in order to keep the result of each partitioning consistent, the random seed is set to be 5140727168289296997, meanwhile, through comparison detection, a random seed value is used, the modularity fluctuation result of the community partitioning result is less than 0.1, and the normal modularity value is 0.3-0.7. In addition, r parameter for controlling the output of the overlapped community is determined by comparison, wherein r belongs to {0.01,0.05,0.1,0.15,0.2,0.25,0.3,0.35,0.4,0.45 and 0.5}, through comparison experiments, when r is determined to be 0.1, the modularity curve inflection point is determined, finally, r is selected to be 0.1 according to training parameters, and parameter selection can be changed according to requirements;
s35: post-processing all the labels of the nodes to obtain the four-layer urban community structure, and evaluating the division result of the four-layer urban community structure through an overlapping modularity function, wherein the overlapping modularity function specifically comprises the following steps:
wherein m is the weight sum of edges in the network, A is the weighted adjacent matrix of the network, and if an edge exists between the node v and the node w, A is vw The weight of vw edge is, otherwise, 0 is adopted; k is a radical of v ,k w Respectively, the out-degree weight of the node v and the in-degree weight of the node w, O v ,O w The number of communities to which the node v and the node w belong respectively;
in the specific implementation, the overlapping modularity function results of the four-layer urban community structure fluctuate up and down at 0.65, 0.6, 0.55 and 0.3 respectively, and considering the reasons of overlapping nodes and path lengths, it can be explained that the more the overlapping nodes are identified, the more chaotic the area is, the area possibly belongs to a plurality of areas, the result of community division is not good, the longer the path is, the larger the scale of community division is, and the interaction of short paths can influence the division result; for the overlapped nodes, the first community is the community with the maximum label probability, and so on;
s36: and extracting an overlapping structure in the four-layer city community structure.
In this embodiment, in step S4, the point-of-interest data is obtained through the gold API, and the data is reclassified; adopting a crawler method to acquire POI data of each category, wherein 622206 data comprise attributes such as POI name, longitude and latitude, large category, medium category, small category, address and the like; meanwhile, referring to urban land use categories, reclassifying POIs into 16 categories;
the measurement indexes include: richness, simpson index and entropy measure index;
the land use condition and the function type can be identified through the richness; the land mixing condition can be identified through the Simpson index and the entropy measurement index;
the formula of the richness is specifically as follows:
F i,l represents the enrichment index, n, of POIs of class I in the ith plot l,i Representing the number of class I land use types in the ith plot, n i Is the number of all POIs in the ith parcel. N is a radical of l N is the total number of POI class i, and N is the total number of POIs in the whole research area;
the simpson index and the entropy measurement index are expressed by a hill index, and the formula specifically comprises:
in the formula, D represents the value of the Hill index, p u Representing the proportion of u type POI; when q is 1, it represents entropy, and higher values indicate that the distribution of POI species is more disordered, and lower values indicate that the distribution of POI species is more ordered; q is 2, which is the inverse of the fortunate pson index, which measures the probability that two POIs randomly selected from a metropolitan area belong to the same category; therefore, the abundance of the POI and the relative abundance of different types of POI are considered, and the lower the value is, the higher the mixed utilization degree of the land is, and the higher the value is, the lower the mixed utilization degree of the land is.
In this embodiment, in step S4;
the land use characteristics are land use types and land mixing degrees, the functional areas are structures which show certain functional attributes in urban areas, such as residential areas and scenic areas, and the functional structures and the function mixing degrees of the overlapped structures can be measured through the richness; the POI mixing degree of the overlapped structure can be calculated through the Simpson index and the entropy measurement index, and the POI mixing degree can reflect the vitality of the land;
the space interaction mode represents the interaction condition of the overlapping area and the adjacent community, the interaction strength is expressed through the number of times of interaction, an interaction network is constructed, the travel mode of people is analyzed through the trajectory flow from one functional area to another functional area, the travel mode has the interaction mode with a fixed time rule such as the early peak, the residential area, the working area and the holiday, the residential area and the leisure area, the interaction hotspot area of the overlapping area is analyzed through the interaction strength, and the functional structure of the interaction area is further identified through the richness.
Referring to fig. 2, the present invention provides a system for detecting characteristics of an urban overlapping structure based on a label propagation algorithm, including the following modules:
the data acquisition module 10 is configured to acquire city map data, taxi track data, and interest point data in a research area, and perform city unit division on the city map data to obtain city grids; preprocessing the taxi track data to obtain processed track data;
a four-layer directional weighting network obtaining module 20, configured to perform weighted matching on the city grid and the processed trajectory data to obtain a four-layer directional weighting network;
the overlap structure extraction module 30 is configured to perform unsupervised training on the four-layer directed weighting network input graph partitioning model, obtain a four-layer urban community structure after the training is completed, and extract an overlap structure in the four-layer urban community structure;
and the identification module 40 is configured to construct a measurement index according to the point-of-interest data, identify the overlapping structure according to the measurement index, and obtain land use characteristics and a space interaction mode of the overlapping structure.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or system that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or system. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or system that comprises the element.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third and the like do not denote any order, but rather the words first, second and the like may be interpreted as indicating any order.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, and all equivalent structures or equivalent processes performed by the present invention or directly or indirectly applied to other related technical fields are also included in the scope of the present invention.
Claims (4)
1. A city overlapping structure feature detection method based on a label propagation algorithm is characterized by comprising the following steps:
s1: obtaining city map data, taxi track data and interest point data in a research area, and carrying out city unit division on the city map data to obtain city grids; preprocessing the taxi track data to obtain processed track data;
s2: carrying out weighted matching on the urban grids and the processed trajectory data to obtain a four-layer directed weighting network;
step S2 specifically includes:
s21: matching each boarding and alighting point data in the processed track data with each city grid unit, simulating each city grid unit into a graph node, and simulating the number of interactions among the city grid units into the weight of an edge;
s22: the directed weighting network is composed of a plurality of city grid units and interactive relations among the city grid units, and the interactive relations are related to the interactive times among the city grid units;
s23: path layering is carried out on the processed track data, a first layer is used when the track path is less than 3km, a second layer is used when the track path is less than 5km, a third layer is used when the track path is less than 9km, all track paths are fourth layers, corresponding directed weighting networks are respectively constructed on the first layer, the second layer, the third layer and the fourth layer, and the four layers of directed weighting networks are obtained;
s3: performing unsupervised training on the four-layer directed weighting network input graph division model, obtaining four-layer urban community structures after the training is finished, and extracting an overlapping structure in the four-layer urban community structures;
step S3 specifically includes:
s31: initializing the memory of each node in the graph partitioning model by using the id of the corresponding node, and obtaining a corresponding unique label by each node;
s32: selecting a certain node as a listener node;
s33: all adjacent nodes of the listener node send own unique labels to the listener node, and the listener node selects the most popular label from all the received labels;
s34: repeating the steps S32-S33 for n times, traversing all the nodes and obtaining the most popular labels of all the nodes;
s35: post-processing all the labels of the nodes to obtain the four-layer urban community structure, and evaluating the division result of the four-layer urban community structure through an overlapping modularity function, wherein the overlapping modularity function specifically comprises the following steps:
wherein m is the sum of the weights of the edges in the network, A is a weighted adjacency matrix of the network, and if an edge exists between a node v and a node w, A is vw The weight of vw edge is, otherwise, 0 is adopted; k is a radical of v ,k w Respectively, the out-degree weight of the node v and the in-degree weight of the node w, O v ,O w Respectively to which node v and node w belongThe number of communities of;
s36: extracting an overlapping structure in the four-layer city community structure;
s4: constructing a measurement index through the point of interest data, and identifying the overlapped structure through the measurement index to obtain the land use characteristics and the space interaction mode of the overlapped structure;
the land use characteristics are land use types and land mixing degrees, and the space interaction mode represents the interaction condition of the overlapping structure and the adjacent community;
in step S4, the measurement index includes: richness, simpson index and entropy measure index;
the land use condition and the function type can be identified through the richness; the land mixing condition can be identified through the Simpson index and the entropy measurement index;
the formula of the richness is specifically as follows:
F i,l denotes the enrichment index, n, of POIs of class I in the ith plot l,i Representing the number of class I land use types in the ith plot, n i Is the number of all POIs in the ith parcel; n is a radical of l N is the total number of POIs of class i, and N is the total number of POIs in the whole research area;
the simpson index and the entropy measurement index are expressed by a hill index, and the formula specifically comprises:
in the formula, D represents the value of the Hill index, p u Representing the proportion of u-type POI; when q is 1, the representative entropy is shown, the higher the value is, the more disordered the POI type distribution is, and the lower the value is, the more orderly the POI type distribution is; q is the inverse of the happson index when q is 2, measured from oneProbability that two randomly selected POIs in a city area belong to the same category; therefore, the abundance of the POI and the relative abundance of different types of POI are considered, and the lower the value is, the higher the mixed utilization degree of the land is, and the higher the value is, the lower the mixed utilization degree of the land is.
2. The method for detecting urban overlapping structure features based on the label propagation algorithm according to claim 1, wherein the step S1 specifically comprises:
s11: processing the urban map data through GIS software, performing spatial fishing net analysis on urban areas of the urban map data, and dividing the urban areas into urban grids; the city grid comprises a plurality of 500mx500m city grid cells;
s12: removing the point data and invalid point data which are not in the urban area in the taxi track data to obtain removed track data;
s13: and extracting the boarding and alighting point data in each eliminated track data, wherein the processed track data is a set of the boarding and alighting point data.
3. The city overlap structure feature detection method based on label propagation algorithm according to claim 1, characterized in that in step S4;
the functional area is a structure showing certain functional attributes in urban areas, and comprises the following steps: the functional structure and the function mixing degree of the overlapping structure can be measured through the richness; the POI mixing degree of the overlapped structure can be calculated through the Simpson index and the entropy measurement index, and the POI mixing degree can reflect the vitality of the land;
expressing the strength of interaction through the number of times of interaction, constructing an interaction network, and analyzing the travel mode of people through the trajectory flow from one functional area to another functional area, wherein the travel mode comprises the following steps: and the interactive mode with a fixed time law is that the early peak is from the residential area to the working area, and the holiday is from the residential area to the leisure area, the interactive hotspot area of the overlapped structure is analyzed through the interactive intensity, and the functional structure of the interactive area is further identified by using the richness.
4. A city overlapping structure feature detection system based on a label propagation algorithm is characterized by comprising the following modules:
the data acquisition module is used for acquiring city map data, taxi track data and interest point data in a research area, and performing city unit division on the city map data to obtain city grids; preprocessing the taxi track data to obtain processed track data;
the four-layer directional weighting network acquisition module is used for carrying out weighted matching on the urban grid and the processed track data to obtain a four-layer directional weighting network;
the steps for obtaining the four layers of directional weighting networks are as follows:
s21: matching each boarding and alighting point data in the processed track data with each city grid unit, simulating each city grid unit into a graph node, and simulating the number of interactions among the city grid units into the weight of edges;
s22: the directed weighting network is composed of a plurality of city grid units and interactive relations among the city grid units, and the interactive relations are related to the interactive times among the city grid units;
s23: path layering is carried out on the processed track data, a first layer is used when the track path is less than 3km, a second layer is used when the track path is less than 5km, a third layer is used when the track path is less than 9km, all the track paths are fourth layers, corresponding directed weighting networks are respectively constructed on the first layer, the second layer, the third layer and the fourth layer, and the four layers of directed weighting networks are obtained;
the overlapping structure extraction module is used for carrying out unsupervised training on the four-layer directed weighting network input graph division model, obtaining four-layer urban community structures after the training is finished, and extracting an overlapping structure in the four-layer urban community structures;
the step of extracting the overlapping structure in the four-layer city community structure comprises the following steps:
s31: initializing the memory of each node in the graph partitioning model by using the id of the corresponding node, and obtaining a corresponding unique label by each node;
s32: selecting a certain node as a listener node;
s33: all adjacent nodes of the listener node send own unique labels to the listener node, and the listener node selects the most popular label from all the received labels;
s34: repeating the steps S32-S33 for n times, traversing all the nodes, and obtaining the most popular labels of all the nodes;
s35: post-processing all the labels of the nodes to obtain the four-layer urban community structure, and evaluating the division result of the four-layer urban community structure through an overlapping modularity function, wherein the overlapping modularity function specifically comprises the following steps:
wherein m is the weight sum of edges in the network, A is the weighted adjacent matrix of the network, and if an edge exists between the node v and the node w, A is vw The weight of vw edge is, otherwise, 0 is adopted; k is a radical of v ,k w Respectively, the out-degree weight of the node v and the in-degree weight of the node w, O v ,O w The number of communities to which the node v and the node w belong respectively;
s36: extracting an overlapping structure in the four-layer city community structure;
the identification module is used for constructing a measurement index through the interest point data, identifying the overlapped structure through the measurement index and obtaining the land use characteristics and the space interaction mode of the overlapped structure;
the land use characteristics are land use types and land mixing degrees, and the space interaction mode represents the interaction condition of the overlapping structure and the adjacent community;
the measurement index includes: measuring indexes of richness, Simpson indexes and entropies;
the land use condition and the function type can be identified through the richness; the land mixing condition can be identified through the Simpson index and the entropy measurement index;
the formula of the richness is specifically as follows:
F i,l denotes the enrichment index, n, of POIs of class I in the ith plot l,i Representing the number of type I land use types in the ith plot, n i Is the number of all POIs in the ith parcel; n is a radical of hydrogen l N is the total number of POIs of class i, and N is the total number of POIs in the whole research area;
the Simpson index and the entropy measurement index are expressed by a Hill index, and the formula is as follows:
in the formula, D represents the value of the Hill index, p u Representing the proportion of u-type POI; when q is 1, it represents entropy, and higher values indicate that the distribution of POI species is more disordered, and lower values indicate that the distribution of POI species is more ordered; q is 2, which is the inverse of the fortunate pson index, which measures the probability that two POIs randomly selected from a metropolitan area belong to the same category; therefore, the abundance of the POI and the relative abundance of different types of POI are considered, and the lower the value is, the higher the mixed utilization degree of the land is, and the higher the value is, the lower the mixed utilization degree of the land is.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110691291.3A CN113487465B (en) | 2021-06-22 | 2021-06-22 | City overlapping structure characteristic detection method and system based on label propagation algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110691291.3A CN113487465B (en) | 2021-06-22 | 2021-06-22 | City overlapping structure characteristic detection method and system based on label propagation algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113487465A CN113487465A (en) | 2021-10-08 |
CN113487465B true CN113487465B (en) | 2022-09-30 |
Family
ID=77935765
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110691291.3A Active CN113487465B (en) | 2021-06-22 | 2021-06-22 | City overlapping structure characteristic detection method and system based on label propagation algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113487465B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118627750A (en) * | 2024-07-04 | 2024-09-10 | 苏州市中遥数字科技有限公司 | Multi-dimensional image processing system based on high-resolution remote sensing data |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9342854B2 (en) * | 2013-05-08 | 2016-05-17 | Yahoo! Inc. | Identifying communities within a social network based on information propagation data |
CN103729475B (en) * | 2014-01-24 | 2016-10-26 | 福州大学 | Multi-tag in a kind of social networks propagates overlapping community discovery method |
CN107818534B (en) * | 2017-10-31 | 2022-04-01 | 武汉大学 | Human activity network region division method with space constraint |
CN107784598A (en) * | 2017-11-21 | 2018-03-09 | 山西大学 | A kind of network community discovery method |
CN108446862A (en) * | 2018-03-29 | 2018-08-24 | 山东科技大学 | The three-stage policy algorithm of overlapping community detection in a kind of community network |
CN109493119B (en) * | 2018-10-19 | 2020-06-23 | 南京图申图信息科技有限公司 | POI data-based urban business center identification method and system |
CN109614458B (en) * | 2018-12-20 | 2021-07-16 | 中国人民解放军战略支援部队信息工程大学 | Urban community structure mining method and device based on navigation data |
CN110111575B (en) * | 2019-05-16 | 2020-10-27 | 北京航空航天大学 | Urban traffic flow network analysis method based on complex network theory |
CN111698743B (en) * | 2020-06-09 | 2022-09-13 | 嘉兴学院 | Complex network community identification method fusing node analysis and edge analysis |
-
2021
- 2021-06-22 CN CN202110691291.3A patent/CN113487465B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN113487465A (en) | 2021-10-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Blanchard | Mathematical analysis of urban spatial networks | |
Clarke et al. | A self-modifying cellular automaton model of historical urbanization in the San Francisco Bay area | |
CN110458048A (en) | Take population distribution Spatio-temporal Evolution and the cognition of town pattern feature into account | |
CN108629978A (en) | A kind of traffic trajectory predictions method based on higher-dimension road network and Recognition with Recurrent Neural Network | |
Huang et al. | Construction of complex network of green infrastructure in smart city under spatial differentiation of landscape | |
CN109146204A (en) | A kind of wind power plant booster stations automatic addressing method of comprehensiveestimation | |
CN108446470A (en) | Medical facilities analysis method of reachability based on track of vehicle data and population distribution | |
CN108427965A (en) | A kind of hot spot region method for digging based on road network cluster | |
CN108133302A (en) | A kind of public bicycles potential demand Forecasting Methodology based on big data | |
CN110346517A (en) | A kind of smart city industrial air pollution visualization method for early warning and its system | |
CN110836675B (en) | Decision tree-based automatic driving search decision method | |
Ahmed | Modelling spatio-temporal urban land cover growth dynamics using remote sensing and GIS techniques: A case study of Khulna City | |
CN110837973B (en) | Human trip selection information mining method based on traffic trip data | |
CN113806419B (en) | Urban area function recognition model and recognition method based on space-time big data | |
CN112419711B (en) | Closed parking lot parking demand prediction method based on improved GMDH algorithm | |
CN113642757A (en) | Internet of things charging pile construction planning method and system based on artificial intelligence | |
CN113487465B (en) | City overlapping structure characteristic detection method and system based on label propagation algorithm | |
CN104361255A (en) | Simulation method for urban expansion through modified cellular automaton | |
Kang et al. | Potential of urban land use by autonomous vehicles: Analyzing land use potential in Seoul capital area of Korea | |
CN104318319A (en) | Multi-agent urban expansion simulation method based on land acquisition process competition result | |
CN113988659A (en) | Three-dimensional compact digital city design method and device and computer equipment | |
CN116415756B (en) | Urban virtual scene experience management system based on VR technology | |
CN116680586A (en) | Urban resident activity space access mode analysis method based on access probability | |
CN104463442A (en) | Detection method of town and country construction clustering | |
CN115392569A (en) | Electric vehicle charging station site selection and volume fixing method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |