CN113487465B - City overlapping structure characteristic detection method and system based on label propagation algorithm - Google Patents
City overlapping structure characteristic detection method and system based on label propagation algorithm Download PDFInfo
- Publication number
- CN113487465B CN113487465B CN202110691291.3A CN202110691291A CN113487465B CN 113487465 B CN113487465 B CN 113487465B CN 202110691291 A CN202110691291 A CN 202110691291A CN 113487465 B CN113487465 B CN 113487465B
- Authority
- CN
- China
- Prior art keywords
- layer
- city
- node
- urban
- index
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Tourism & Hospitality (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Development Economics (AREA)
- Educational Administration (AREA)
- Evolutionary Computation (AREA)
- Health & Medical Sciences (AREA)
- Economics (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
技术领域technical field
本发明涉及城市规划领域,尤其涉及一种基于标签传播算法的城市重叠结构特征检测方法及系统。The invention relates to the field of urban planning, in particular to a method and system for detecting overlapping structural features of cities based on a label propagation algorithm.
背景技术Background technique
在过去的三十多年间,城镇化进程带来的充足的劳动力、良好的基础设施和低廉的土地,为经济快速发展奠定了基础。但不可回避的是,我国城镇化进程中出现了诸多问题。特别对于一些省会城市或者大都市来说,城市问题尤为严重。“城市病”主要表现为交通拥堵、住房紧张、供水不足、能源紧缺、环境恶化等,给城市造成了负担,甚至制约了城市的发展。城市的发展结构与城市居民生活和经济息息相关,通过科学手段结合人类活动对城市空间结构进行确定,提供可操作、科学合理的空间结构分析方法,成为数字城市研究的重要方向。In the past three decades, the abundant labor force, good infrastructure and cheap land brought by the urbanization process have laid the foundation for rapid economic development. But it is unavoidable that many problems have arisen in the process of urbanization in our country. Especially for some provincial capital cities or metropolises, the urban problem is particularly serious. "Urban disease" is mainly manifested in traffic congestion, housing shortage, insufficient water supply, energy shortage, environmental deterioration, etc., which have caused a burden to the city and even restricted the development of the city. The development structure of a city is closely related to the life and economy of urban residents. The determination of the urban spatial structure through scientific means combined with human activities, and the provision of operational, scientific and reasonable spatial structure analysis methods, has become an important direction of digital city research.
城市结构日趋复杂多样,具有明显的层次性和重叠性,不同层次的城市区域具有较为明显的层级重叠关系,从人类活动探讨这种层次重叠性,有助于从局部到整体逐步把握城市空间结构的区域变化和空间分布,城市的重叠结构与其他地块的交互远远大于自身与自身发生的交互,可以理解为城市空间交互的枢纽区域,而这种交互可以通过人的行为来计算。在此前已有一些专家针对城市结构划分方法做了相关研究,这些方法可以划分为基于统计调查的方法和基于模型的方法,但目前针对城市结构层次性和城市重叠性的研究还较少。其中,基于统计调查的方法结合调查统计和专家评判的方式进行划定,即在城市结构的划定过程中,基于实地调查统计结果,选择数名对城市有一定认识,具有较高代表性和权威性的专家进行评判。该方法通常具有较大的主观性,时间、人力和资金成本高的问题。基于模型的方法在众源地理大数据的支持下,通过科学的数据分析和大数据挖掘方法对城市区域进行划定,提供可操作、科学合理的空间优化模型。众源地理数据具有数据量大,现势性强,来源丰富,成本低等优势。基于众源地理数据自下而上的采集特点,研究人员可以轻松获取城市范围的、海量丰富的、基于个人的时空信息,从而实现精细的地理分析与建模,为研究城市结构提供更好的服务。The urban structure is becoming more and more complex and diverse, with obvious layers and overlaps. Urban areas at different levels have a relatively obvious layer-overlapping relationship. Exploring this layer-overlapping relationship from human activities is helpful to gradually grasp the urban spatial structure from the local to the whole. The interaction between the overlapping structure of the city and other plots is far greater than the interaction between itself and itself. It can be understood as the hub area of urban space interaction, and this interaction can be calculated by human behavior. Some experts have done related researches on urban structure division methods before, these methods can be divided into statistical survey-based methods and model-based methods, but there are few studies on urban structure hierarchy and urban overlap. Among them, the method based on statistical survey is combined with survey statistics and expert judgment. That is, in the process of urban structure delineation, based on the statistical results of field surveys, select a number of people who have a certain understanding of the city and are highly representative and Judging by authoritative experts. This method usually has a lot of subjectivity and high cost of time, manpower and capital. The model-based method delimits urban areas through scientific data analysis and big data mining with the support of crowdsourced geographic big data, and provides an operational, scientific and reasonable spatial optimization model. Crowdsource geographic data has the advantages of large data volume, strong current situation, abundant sources, and low cost. Based on the bottom-up collection characteristics of crowdsourced geographic data, researchers can easily obtain city-wide, massive and rich, personal-based spatiotemporal information, so as to achieve precise geographic analysis and modeling, and provide better insights for studying urban structure. Serve.
上述内容仅用于辅助理解本发明的技术方案,并不代表承认上述内容是现有技术。The above content is only used to assist the understanding of the technical solutions of the present invention, and does not mean that the above content is the prior art.
发明内容SUMMARY OF THE INVENTION
本发明的主要目的在于,解决现有技术中存在的具有较大的主观性,时间、人力和资金成本高的技术问题,且可以检测城市中的层次重叠结构并识别其特征。The main purpose of the present invention is to solve the technical problems of high subjectivity and high cost of time, manpower and capital in the prior art, and to detect and identify the features of overlapping hierarchical structures in cities.
为实现上述目的,本发明提供一种基于标签传播算法的城市重叠结构特征检测方法,包括步骤:In order to achieve the above purpose, the present invention provides a method for detecting overlapping structural features of cities based on a label propagation algorithm, comprising the steps of:
S1:获取研究区域内的城市地图数据、出租车轨迹数据和兴趣点数据,对所述城市地图数据进行城市单元划分,获得城市网格;对所述出租车轨迹数据进行预处理,获得处理后的轨迹数据;S1: Acquire city map data, taxi trajectory data, and point-of-interest data in the study area, divide the city map data into city units, and obtain an urban grid; preprocess the taxi trajectory data, and obtain the processed trajectory data;
S2:将所述城市网格和所述处理后的轨迹数据进行加权匹配,获得四层有向加权网络;S2: Perform weighted matching on the urban grid and the processed trajectory data to obtain a four-layer directed weighted network;
S3:将所述四层有向加权网络输入图划分模型进行无监督训练,训练完成后获得四层城市社区结构,提取所述四层城市社区结构中的重叠结构;S3: Perform unsupervised training on the input graph partition model of the four-layer directed weighted network, obtain a four-layer urban community structure after the training is completed, and extract the overlapping structure in the four-layer urban community structure;
S4:通过所述兴趣点数据构建测量指标,通过所述测量指标对所述重叠结构进行识别,获得所述重叠结构的土地使用特点和空间交互模式。S4: Constructing a measurement index based on the point of interest data, identifying the overlapping structure through the measurement index, and obtaining land use characteristics and a spatial interaction pattern of the overlapping structure.
优选地,步骤S1具体为:Preferably, step S1 is specifically:
S11:通过GIS软件处理所述城市地图数据,对所述城市地图数据的城市区域进行空间渔网分析,将所述城市区域划分为所述城市网格;所述城市网格包括多个500mx500m的城市网格单元;S11: Process the city map data through GIS software, perform spatial fishnet analysis on the city area of the city map data, and divide the city area into the city grid; the city grid includes a plurality of 500mx500m cities grid cell;
S12:剔除所述出租车轨迹数据中不在所述城市区域的点数据和无效的点数据,获得剔除后的轨迹数据;S12: Eliminate the point data and invalid point data that are not in the urban area in the taxi trajectory data, and obtain the eliminated trajectory data;
S13:提取每条所述剔除后的轨迹数据中的上下车点数据,所述处理后的轨迹数据为所述上下车点数据的集合。S13: Extract the pick-up and drop-off point data in each piece of the excluded trajectory data, and the processed trajectory data is a set of the pick-up point data.
优选地,步骤S2具体为:Preferably, step S2 is specifically:
S21:将所述处理后的轨迹数据中各所述上下车点数据与各所述城市网格单元进行匹配,将各所述城市网格单元类比为图节点,将各城市网格单元间的交互次数类比为边的权重;S21: Match each of the pick-up and drop-off point data in the processed trajectory data with each of the urban grid units, compare each of the urban grid units as a graph node, and compare the difference between the urban grid units The number of interactions is analogous to the weight of the edge;
S22:有向加权网络由多个所述城市网格单元和各所述城市网格单元间的交互关系组成,所述交互关系与所述城市网格单元间的交互次数有关;S22: The directed weighted network is composed of a plurality of the urban grid units and the interaction relationship between each of the urban grid units, and the interaction relationship is related to the number of interactions between the urban grid units;
S23:将所述处理后的轨迹数据进行路程分层,轨迹路程小于3km的为第一层,轨迹路程小于5km的为第二层,轨迹路程小于9km的第三层,全部轨迹路程为第四层,对所述第一层、所述第二层、所述第三层和所述第四层分别构建对应的有向加权网络,获得所述四层有向加权网络。S23: Perform route layering on the processed trajectory data, the first layer with the trajectory distance less than 3km, the second layer with the trajectory distance less than 5km, the third layer with the trajectory distance less than 9km, and the fourth layer with all the trajectory distances layer, respectively constructing corresponding directed weighted networks for the first layer, the second layer, the third layer and the fourth layer to obtain the four-layer directed weighted network.
优选地,步骤S3具体为:Preferably, step S3 is specifically:
S31:将所述图划分模型中各节点的内存均用对应节点的id初始化,各节点获得对应的唯一标签;S31: Initialize the memory of each node in the graph partitioning model with the id of the corresponding node, and each node obtains a corresponding unique label;
S32:选择某一节点作为监听器节点;S32: select a node as the listener node;
S33:所述监听器节点的所有相邻节点均向所述监听器节点发送自己的唯一标签,所述监听器节点在收到的所有标签中选择最流行标签;S33: All adjacent nodes of the listener node send their own unique tags to the listener node, and the listener node selects the most popular tag among all the tags received;
S34:重复步骤S32-S33共n次,遍历所有节点,获得所有节点的最流行标签;S34: Repeat steps S32-S33 for a total of n times, traverse all nodes, and obtain the most popular labels of all nodes;
S35:对各所述节点的所有标签进行后处理,获得所述四层城市社区结构,通过重叠模块度函数能够评价所述四层城市社区结构的划分结果,所述重叠模块度函数具体为:S35: Perform post-processing on all the labels of each of the nodes to obtain the four-layer urban community structure. The division result of the four-layer urban community structure can be evaluated through an overlapping modularity function. The overlapping modularity function is specifically:
其中,m为网络中边的权重和,A为网络的带权邻接矩阵,若节点v到节点w之间存在一条边,则Avw为vw边的权重,反之为0;kv,kw分别为节点v的出度权重和和节点w的入度权重和,Ov,Ow分别为节点v和节点w所属的社区数;Among them, m is the weight sum of the edges in the network, A is the weighted adjacency matrix of the network, if there is an edge between node v and node w, then A vw is the weight of the vw edge, otherwise it is 0; k v , k w are the out-degree weight sum of node v and the in-degree weight sum of node w, respectively, O v , O w are the number of communities to which node v and node w belong;
S36:提取所述四层城市社区结构中的重叠结构。S36: Extract the overlapping structure in the four-layer urban community structure.
优选地,步骤S4中,所述测量指标包括:丰富度、辛普森指数和熵测量指标;Preferably, in step S4, the measurement indicators include: richness, Simpson index and entropy measurement indicators;
通过所述丰富度能够识别土地使用情况和功能类型;通过所述辛普森指数和所述熵测量指标可以识别土地混合状况;Land use and functional type can be identified by the abundance; land mixing can be identified by the Simpson index and the entropy measure;
所述丰富度的公式具体为:The formula for the richness is specifically:
Fi,l表示第i个地块中的第l类POI的富集指数,nl,i表示第i个地块中的第l类土地利用类型的数量,ni是第i个地块中所有POI的数量。Nl为第l类POI的总数,N是整个研究区域内POI的总数;F i,l represents the enrichment index of the l-th type of POI in the ith plot, n l,i represents the number of the l-th type of land use in the ith plot, n i is the i-th plot The number of all POIs in . N l is the total number of POIs of type l, and N is the total number of POIs in the entire study area;
所述辛普森指数和所述熵测量指标通过希尔指数来表示,公式具体为:The Simpson index and the entropy measurement index are represented by the Hill index, and the formula is specifically:
公式中D代表希尔指数的值,pu代表第u类POI所占比例;当q=1时,它是代表熵,值越高说明POI种类分布越无序,越低代表POI种类分布越有序;q=2时是幸普森指数的逆值,辛普森指数衡量的是从一个城市区域中随机选择的两个POI属于同一类别的概率;因此,它既考虑了POI的丰富度,又考虑了不同类型POI的相对丰度,值越低说明土地的混合利用程度越高,值越高说明土地的混合利用程度越低。In the formula, D represents the value of the Hill index, and p u represents the proportion of the u-th type of POI; when q=1, it represents the entropy. Ordered; q=2 is the inverse of the Simpson index, which measures the probability that two POIs randomly selected from an urban area belong to the same class; thus, it takes into account both the abundance of POIs and the Considering the relative abundance of different types of POIs, the lower the value, the higher the degree of mixed use of land; the higher the value, the lower the degree of mixed use of land.
优选地,步骤S4中;Preferably, in step S4;
所述土地使用特点为土地使用的类型及土地混合程度,功能区为城市区域中表现出一定功能属性的结构,如居住区、风景区,通过所述丰富度可以度量重叠结构的功能结构及功能混合程度;通过所述辛普森指数和熵测量指标能够计算重叠结构的POI混合程度,所述POI混合程度可以反映地块活力;The characteristics of land use are the type of land use and the degree of land mixing, and functional areas are structures that exhibit certain functional attributes in urban areas, such as residential areas and scenic areas. The richness can be used to measure the functional structure and function of overlapping structures. Mixing degree; the POI mixing degree of overlapping structures can be calculated by the Simpson index and entropy measurement index, and the POI mixing degree can reflect the vitality of the plot;
所述空间交互模式代表所述重叠区域与邻近社区的交互情况,通过交互的次数来表现交互的强弱,构建交互网络,通过一个功能区到另一个功能区的轨迹流来分析人们的出行模式,所述出行模式如早高峰从居住区至工作区,节假日从居住区至休闲区此类具有固定时间规律的交互模式,通过交互强度分析所述重叠区域交互热点区域并进一步使用所述丰富度识别交互区域的功能结构。The spatial interaction pattern represents the interaction between the overlapping area and neighboring communities. The strength of interaction is represented by the number of interactions, an interaction network is constructed, and people’s travel patterns are analyzed through the trajectory flow from one functional area to another. , the travel patterns such as morning rush hour from residential area to work area, holidays from residential area to leisure area and other interactive modes with fixed time rules, analyze the overlapping area interaction hotspot area by interaction strength and further use the richness Identify the functional structure of the interaction area.
一种基于标签传播算法的城市重叠结构特征检测系统,包括以下模块:An urban overlapping structure feature detection system based on label propagation algorithm, including the following modules:
数据获取模块,用于获取研究区域内的城市地图数据、出租车轨迹数据和兴趣点数据,对所述城市地图数据进行城市单元划分,获得城市网格;对所述出租车轨迹数据进行预处理,获得处理后的轨迹数据;The data acquisition module is used to acquire city map data, taxi trajectory data and point-of-interest data in the research area, divide the city map data into city units, and obtain city grids; preprocess the taxi trajectory data , obtain the processed trajectory data;
四层有向加权网络获取模块,用于将所述城市网格和所述处理后的轨迹数据进行加权匹配,获得四层有向加权网络;A four-layer directed weighted network acquisition module is used to perform weighted matching between the urban grid and the processed trajectory data to obtain a four-layer directed weighted network;
重叠结构提取模块,用于将所述四层有向加权网络输入图划分模型进行无监督训练,训练完成后获得四层城市社区结构,提取所述四层城市社区结构中的重叠结构;The overlapping structure extraction module is used to perform unsupervised training on the input graph partition model of the four-layer directed weighted network, obtain a four-layer urban community structure after the training is completed, and extract the overlapping structure in the four-layer urban community structure;
识别模块,用于通过所述兴趣点数据构建测量指标,通过所述测量指标对所述重叠结构进行识别,获得所述重叠结构的土地使用特点和空间交互模式。The identification module is configured to construct a measurement index based on the interest point data, identify the overlapping structure through the measurement index, and obtain the land use characteristics and spatial interaction pattern of the overlapping structure.
本发明具有以下有益效果:The present invention has the following beneficial effects:
1、本发明采用类比推理的方法,将网络科学中图划分的方法引入城市规划,具有较好的效益,同时能够批量化、自动化的进行城市结构划分;1. The present invention adopts the method of analogical reasoning, and introduces the method of graph division in network science into urban planning, which has good benefits, and can divide the urban structure in batches and automatically;
2、本发明能充分挖掘隐藏在城市居民活动中城市的空间交互信息,同时关注网络中存在的重叠结构,挖掘其土地利用特征和其空间交互关系。2. The present invention can fully excavate the spatial interaction information of the city hidden in the activities of urban residents, at the same time pay attention to the overlapping structure existing in the network, and excavate its land use characteristics and its spatial interaction relationship.
附图说明Description of drawings
图1为本发明实施例方法流程图;1 is a flowchart of a method according to an embodiment of the present invention;
图2为本发明实施例系统结构图;FIG. 2 is a system structure diagram of an embodiment of the present invention;
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics and advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
参照图1,本发明提供一种基于标签传播算法的城市重叠结构特征检测方法,是基于模型方法的一种扩展方法,在以往空间划分模型研究的基础上,将网络科学的思想应用到城市划分当中,并检测城市中存在的重叠结构;该方法基于图划分中的标签传播的思想,采用类比推理的方法,针对城市化过程中城市存在的层次性和重叠性,从人类活动探讨这种层次重叠性,有助于从局部到整体逐步把握城市空间结构的区域变化和空间分布;本发明有效的将复杂网络中重叠节点的概念引入到城市划分当中,并结合了出租车轨迹数据,对长短途的城市社区结构进行社区检测,充分挖掘隐藏在城市居民活动中城市的空间交互信息,同时关注城市社区结构中存在的重叠结构,挖掘其土地利用类型和其空间交互关系;Referring to FIG. 1, the present invention provides a method for detecting urban overlapping structure features based on a label propagation algorithm, which is an extension method based on a model method. On the basis of previous research on spatial division models, the idea of network science is applied to urban division. Among them, and detect the overlapping structure existing in the city; this method is based on the idea of label propagation in graph division, adopts the method of analogical reasoning, aiming at the layering and overlapping of cities in the process of urbanization, and explores this layer from human activities. The overlap helps to gradually grasp the regional changes and spatial distribution of the urban spatial structure from the local to the whole; the present invention effectively introduces the concept of overlapping nodes in the complex network into the urban division, and combines the taxi trajectory data, which provides a better solution for long-term urbanization. The short-distance urban community structure is tested for the community, and the spatial interaction information of the city hidden in the activities of urban residents is fully excavated. At the same time, it pays attention to the overlapping structure existing in the urban community structure, and excavates its land use type and its spatial interaction relationship;
具体包括以下步骤:Specifically include the following steps:
S1:获取研究区域内的城市地图数据、出租车轨迹数据和兴趣点数据,对所述城市地图数据进行城市单元划分,获得城市网格;对所述出租车轨迹数据进行预处理,获得处理后的轨迹数据;S1: Acquire city map data, taxi trajectory data, and point-of-interest data in the study area, divide the city map data into city units, and obtain an urban grid; preprocess the taxi trajectory data, and obtain the processed trajectory data;
S2:将所述城市网格和所述处理后的轨迹数据进行加权匹配,获得四层有向加权网络;S2: Perform weighted matching on the urban grid and the processed trajectory data to obtain a four-layer directed weighted network;
S3:将所述四层有向加权网络输入图划分模型进行无监督训练,训练完成后获得四层城市社区结构,提取所述四层城市社区结构中的重叠结构;S3: Perform unsupervised training on the input graph partition model of the four-layer directed weighted network, obtain a four-layer urban community structure after the training is completed, and extract the overlapping structure in the four-layer urban community structure;
S4:通过所述兴趣点数据构建测量指标,通过所述测量指标对所述重叠结构进行识别,获得所述重叠结构的土地使用特点和空间交互模式。S4: Constructing a measurement index based on the point of interest data, identifying the overlapping structure through the measurement index, and obtaining land use characteristics and a spatial interaction pattern of the overlapping structure.
本实施例中,步骤S1具体为:In this embodiment, step S1 is specifically:
S11:通过GIS软件处理所述城市地图数据,对所述城市地图数据的城市区域进行空间渔网分析,将所述城市区域划分为所述城市网格;所述城市网格包括多个500mx500m的城市网格单元;本实施例中,共获取4853个城市网格单元;S11: Process the city map data through GIS software, perform spatial fishnet analysis on the city area of the city map data, and divide the city area into the city grid; the city grid includes a plurality of 500mx500m cities grid unit; in this embodiment, a total of 4853 urban grid units are obtained;
S12:剔除所述出租车轨迹数据中不在所述城市区域的点数据和无效的点数据,获得剔除后的轨迹数据;S12: Eliminate the point data and invalid point data that are not in the urban area in the taxi trajectory data, and obtain the eliminated trajectory data;
S13:提取每条所述剔除后的轨迹数据中的上下车点数据,所述处理后的轨迹数据为所述上下车点数据的集合;本实施例中,共获取793253条上下车点数据。S13 : Extract the pick-up and drop-off point data in each piece of the excluded trajectory data, and the processed trajectory data is the set of the pick-up point data; in this embodiment, a total of 793253 pieces of pick-up and drop-off point data are acquired.
本实施例中,步骤S2具体为:In this embodiment, step S2 is specifically:
S21:将所述处理后的轨迹数据中各所述上下车点数据与各所述城市网格单元进行匹配,将各所述城市网格单元类比为图节点,将各城市网格单元间的交互次数类比为边的权重;S21: Match each of the pick-up and drop-off point data in the processed trajectory data with each of the urban grid units, compare each of the urban grid units as a graph node, and compare the difference between the urban grid units The number of interactions is analogous to the weight of the edge;
S22:有向加权网络由多个所述城市网格单元和各所述城市网格单元间的交互关系组成,所述交互关系与所述城市网格单元间的交互次数有关;其中,每一个城市网格单元均会与若干个其他城市网格单元交互;S22: The directed weighted network is composed of a plurality of the urban grid units and the interaction relationship between the urban grid units, and the interaction relationship is related to the number of interactions between the urban grid units; wherein, each Urban grid cells interact with several other urban grid cells;
S23:将所述处理后的轨迹数据进行路程分层,轨迹路程小于3km的为第一层,轨迹路程小于5km的为第二层,轨迹路程小于9km的第三层,全部轨迹路程为第四层,对所述第一层、所述第二层、所述第三层和所述第四层分别构建对应的有向加权网络,获得所述四层有向加权网络;S23: Perform route layering on the processed trajectory data, the first layer with the trajectory distance less than 3km, the second layer with the trajectory distance less than 5km, the third layer with the trajectory distance less than 9km, and the fourth layer with all the trajectory distances layer, respectively constructing corresponding directed weighted networks for the first layer, the second layer, the third layer and the fourth layer to obtain the four-layer directed weighted network;
具体实现中,路程分层的阈值根据每层路程的轨迹占比来确定,小于3km,小于5km和小于9km的轨迹分别占总轨迹的29.3301%,51.1679%和76.0485%;同时也可根据需求改变四层有向加权网络的网络结构。In the specific implementation, the threshold of the route layering is determined according to the proportion of trajectories of each layer. The trajectories less than 3km, less than 5km and less than 9km account for 29.3301%, 51.1679% and 76.0485% of the total trajectories respectively; they can also be changed according to demand. The network structure of the four-layer directed weighted network.
本实施例中,步骤S3具体为:In this embodiment, step S3 is specifically:
S31:将所述图划分模型中各节点的内存均用对应节点的id初始化,各节点获得对应的唯一标签;S31: Initialize the memory of each node in the graph partitioning model with the id of the corresponding node, and each node obtains a corresponding unique label;
S32:选择某一节点作为监听器节点;S32: select a node as the listener node;
S33:所述监听器节点的所有相邻节点均向所述监听器节点发送自己的唯一标签,所述监听器节点在收到的所有标签中选择最流行标签;S33: All adjacent nodes of the listener node send their own unique tags to the listener node, and the listener node selects the most popular tag among all the tags received;
S34:重复步骤S32-S33共n次,遍历所有节点,获得所有节点的最流行标签;S34: Repeat steps S32-S33 for a total of n times, traverse all nodes, and obtain the most popular labels of all nodes;
具体实现中,在将四层有向加权网络输入图划分模型进行无监督训练时,需设置SLPA模型参数,在图划分模型中,模型的迭代次数设置为100,最小划分的社区数设置为3,为保持每次划分的结果一致,随机种子seed设置为5140727168289296997,同时经过对比检测,使用随机的seed值,其社区划分结果的模块度波动结果小于0.1,正常的模块度值为0.3~0.7。此外,还对比确定了控制重叠社区输出的r参数,r∈{0.01,0.05,0.1,0.15,0.2,0.25,0.3,0.35,0.4,0.45,0.5},通过对比实验,确定了r=0.1时为模块度曲线拐点,最后训练参数选择r=0.1,可根据需求改变参数选择;In the specific implementation, when the four-layer directed weighted network input graph partition model is used for unsupervised training, the SLPA model parameters need to be set. In the graph partition model, the number of iterations of the model is set to 100, and the minimum number of divided communities is set to 3 , in order to keep the results of each division consistent, the random seed seed is set to 5140727168289296997. At the same time, after comparative testing and random seed value, the modularity fluctuation result of the community division result is less than 0.1, and the normal modularity value is 0.3 to 0.7. In addition, the r parameter that controls the output of overlapping communities is also determined by comparison, r∈{0.01,0.05,0.1,0.15,0.2,0.25,0.3,0.35,0.4,0.45,0.5}, through the comparison experiment, it is determined that when r=0.1 It is the inflection point of the modularity curve, and the final training parameter selects r=0.1, and the parameter selection can be changed according to the needs;
S35:对各所述节点的所有标签进行后处理,获得所述四层城市社区结构,通过重叠模块度函数能够评价所述四层城市社区结构的划分结果,所述重叠模块度函数具体为:S35: Perform post-processing on all the labels of each of the nodes to obtain the four-layer urban community structure. The division result of the four-layer urban community structure can be evaluated through an overlapping modularity function. The overlapping modularity function is specifically:
其中,m为网络中边的权重和,A为网络的带权邻接矩阵,若节点v到节点w之间存在一条边,则Avw为vw边的权重,反之为0;kv,kw分别为节点v的出度权重和和节点w的入度权重和,Ov,Ow分别为节点v和节点w所属的社区数;Among them, m is the weight sum of the edges in the network, A is the weighted adjacency matrix of the network, if there is an edge between node v and node w, then A vw is the weight of the vw edge, otherwise it is 0; k v , k w are the out-degree weight sum of node v and the in-degree weight sum of node w, respectively, O v , O w are the number of communities to which node v and node w belong;
具体实现中,四层城市社区结构的重叠模块度函数结果分别在0.65、0.6、0.55、0.3上下波动,考虑到重叠节点和路程长度的原因,可解释为重叠节点识别的越多,则该地区越混乱,可能属于多个地区,社区划分的结果不佳,而路程越长,社区划分的尺度也就越大,短路程的交互会影响划分结果;对于重叠节点,第一社区为标签概率最大的社区,以此类推;In the specific implementation, the results of the overlapping modularity function of the four-story urban community structure fluctuate around 0.65, 0.6, 0.55, and 0.3, respectively. Considering the reasons for overlapping nodes and the length of the journey, it can be explained that the more overlapping nodes are identified, the better the area. The more chaotic, it may belong to multiple regions, the result of community division is not good, and the longer the distance, the larger the scale of community division, and the interaction of short distance will affect the division result; for overlapping nodes, the first community has the highest probability of labeling community, and so on;
S36:提取所述四层城市社区结构中的重叠结构。S36: Extract the overlapping structure in the four-layer urban community structure.
本实施例中,步骤S4中,通过高德API获取兴趣点数据,并对数据进行重分类;采用爬虫的方法对每个类别的POI数据进行获取,共622206条数据,有POI名称,经纬度,大类别,中类别,小类别,地址等属性;同时参考城市土地使用类别,对POI重分类共16类;In this embodiment, in step S4, the POI data is obtained through the AutoNavi API, and the data is reclassified; the POI data of each category is obtained by using the crawler method, a total of 622206 pieces of data, including POI name, latitude and longitude, Large category, medium category, small category, address and other attributes; also refer to the urban land use category, reclassify POI into 16 categories;
所述测量指标包括:丰富度、辛普森指数和熵测量指标;The measurement indicators include: richness, Simpson index and entropy measurement indicators;
通过所述丰富度能够识别土地使用情况和功能类型;通过所述辛普森指数和所述熵测量指标可以识别土地混合状况;Land use and functional type can be identified by the abundance; land mixing can be identified by the Simpson index and the entropy measure;
所述丰富度的公式具体为:The formula for the richness is specifically:
Fi,l表示第i个地块中的第l类POI的富集指数,nl,i表示第i个地块中的第l类土地利用类型的数量,ni是第i个地块中所有POI的数量。Nl为第l类POI的总数,N是整个研究区域内POI的总数;F i,l represents the enrichment index of the l-th type of POI in the ith plot, n l,i represents the number of the l-th type of land use in the ith plot, n i is the i-th plot The number of all POIs in . N l is the total number of POIs of type l, and N is the total number of POIs in the entire study area;
所述辛普森指数和所述熵测量指标通过希尔指数来表示,公式具体为:The Simpson index and the entropy measurement index are represented by the Hill index, and the formula is specifically:
公式中D代表希尔指数的值,pu代表第u类POI所占比例;当q=1时,它是代表熵,值越高说明POI种类分布越无序,越低代表POI种类分布越有序;q=2时是幸普森指数的逆值,辛普森指数衡量的是从一个城市区域中随机选择的两个POI属于同一类别的概率;因此,它既考虑了POI的丰富度,又考虑了不同类型POI的相对丰度,值越低说明土地的混合利用程度越高,值越高说明土地的混合利用程度越低。In the formula, D represents the value of the Hill index, and p u represents the proportion of the u-th type of POI; when q=1, it represents the entropy. Ordered; q=2 is the inverse of the Simpson index, which measures the probability that two POIs randomly selected from an urban area belong to the same class; thus, it takes into account both the abundance of POIs and the Considering the relative abundance of different types of POIs, the lower the value, the higher the degree of mixed use of land; the higher the value, the lower the degree of mixed use of land.
本实施例中,步骤S4中;In this embodiment, in step S4;
所述土地使用特点为土地使用的类型及土地混合程度,功能区为城市区域中表现出一定功能属性的结构,如居住区、风景区,通过所述丰富度可以度量重叠结构的功能结构及功能混合程度;通过所述辛普森指数和熵测量指标能够计算重叠结构的POI混合程度,所述POI混合程度可以反映地块活力;The characteristics of land use are the type of land use and the degree of land mixing, and functional areas are structures that exhibit certain functional attributes in urban areas, such as residential areas and scenic areas. The richness can be used to measure the functional structure and function of overlapping structures. Mixing degree; the POI mixing degree of overlapping structures can be calculated by the Simpson index and entropy measurement index, and the POI mixing degree can reflect the vitality of the plot;
所述空间交互模式代表所述重叠区域与邻近社区的交互情况,通过交互的次数来表现交互的强弱,构建交互网络,通过一个功能区到另一个功能区的轨迹流来分析人们的出行模式,所述出行模式如早高峰从居住区至工作区,节假日从居住区至休闲区此类具有固定时间规律的交互模式,通过交互强度分析所述重叠区域交互热点区域并进一步使用所述丰富度识别交互区域的功能结构。The spatial interaction pattern represents the interaction between the overlapping area and neighboring communities. The strength of interaction is represented by the number of interactions, an interaction network is constructed, and people’s travel patterns are analyzed through the trajectory flow from one functional area to another. , the travel patterns such as morning rush hour from residential area to work area, holidays from residential area to leisure area and other interactive modes with fixed time rules, analyze the overlapping area interaction hotspot area by interaction strength and further use the richness Identify the functional structure of the interaction area.
参考图2,本发明提供一种基于标签传播算法的城市重叠结构特征检测系统,包括以下模块:Referring to Fig. 2, the present invention provides an urban overlapping structure feature detection system based on a label propagation algorithm, including the following modules:
数据获取模块10,用于获取研究区域内的城市地图数据、出租车轨迹数据和兴趣点数据,对所述城市地图数据进行城市单元划分,获得城市网格;对所述出租车轨迹数据进行预处理,获得处理后的轨迹数据;The
四层有向加权网络获取模块20,用于将所述城市网格和所述处理后的轨迹数据进行加权匹配,获得四层有向加权网络;A four-layer directed weighted
重叠结构提取模块30,用于将所述四层有向加权网络输入图划分模型进行无监督训练,训练完成后获得四层城市社区结构,提取所述四层城市社区结构中的重叠结构;The overlapping
识别模块40,用于通过所述兴趣点数据构建测量指标,通过所述测量指标对所述重叠结构进行识别,获得所述重叠结构的土地使用特点和空间交互模式。The
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or system comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or system. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in the process, method, article or system that includes the element.
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。词语第一、第二、以及第三等的使用不表示任何顺序,可将这些词语解释为标识。The above-mentioned serial numbers of the embodiments of the present invention are only for description, and do not represent the advantages or disadvantages of the embodiments. In a unit claim enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, and third, etc. do not denote any order, and these words may be construed as identifications.
以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above are only preferred embodiments of the present invention, and are not intended to limit the scope of the present invention. Any equivalent structure or equivalent process transformation made by using the contents of the description and drawings of the present invention, or directly or indirectly applied in other related technical fields , are similarly included in the scope of patent protection of the present invention.
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110691291.3A CN113487465B (en) | 2021-06-22 | 2021-06-22 | City overlapping structure characteristic detection method and system based on label propagation algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110691291.3A CN113487465B (en) | 2021-06-22 | 2021-06-22 | City overlapping structure characteristic detection method and system based on label propagation algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113487465A CN113487465A (en) | 2021-10-08 |
CN113487465B true CN113487465B (en) | 2022-09-30 |
Family
ID=77935765
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110691291.3A Expired - Fee Related CN113487465B (en) | 2021-06-22 | 2021-06-22 | City overlapping structure characteristic detection method and system based on label propagation algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113487465B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118627750B (en) * | 2024-07-04 | 2025-02-14 | 苏州市中遥数字科技有限公司 | A multi-dimensional image processing system based on high-resolution remote sensing data |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9342854B2 (en) * | 2013-05-08 | 2016-05-17 | Yahoo! Inc. | Identifying communities within a social network based on information propagation data |
CN103729475B (en) * | 2014-01-24 | 2016-10-26 | 福州大学 | Multi-tag in a kind of social networks propagates overlapping community discovery method |
CN107818534B (en) * | 2017-10-31 | 2022-04-01 | 武汉大学 | Human activity network region division method with space constraint |
CN107784598A (en) * | 2017-11-21 | 2018-03-09 | 山西大学 | A kind of network community discovery method |
CN108446862A (en) * | 2018-03-29 | 2018-08-24 | 山东科技大学 | The three-stage policy algorithm of overlapping community detection in a kind of community network |
CN109493119B (en) * | 2018-10-19 | 2020-06-23 | 南京图申图信息科技有限公司 | POI data-based urban business center identification method and system |
CN109614458B (en) * | 2018-12-20 | 2021-07-16 | 中国人民解放军战略支援部队信息工程大学 | Method and Device for Mining Urban Community Structure Based on Navigation Data |
CN110111575B (en) * | 2019-05-16 | 2020-10-27 | 北京航空航天大学 | An Analysis Method of Urban Traffic Flow Network Based on Complex Network Theory |
CN111698743B (en) * | 2020-06-09 | 2022-09-13 | 嘉兴学院 | A complex network community identification method integrating node analysis and edge analysis |
-
2021
- 2021-06-22 CN CN202110691291.3A patent/CN113487465B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN113487465A (en) | 2021-10-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103533501B (en) | A kind of geography fence generation method | |
CN110533038A (en) | A method of urban vitality area and inner city Boundary Recognition based on information data | |
CN109325085A (en) | A method of urban land function identification and change detection | |
CN113393149B (en) | Method and system for optimizing urban residential site, computer equipment and storage medium | |
CN108133302A (en) | A kind of public bicycles potential demand Forecasting Methodology based on big data | |
CN113806419B (en) | Urban area function recognition model and recognition method based on space-time big data | |
Ahmed | Modelling spatio-temporal urban land cover growth dynamics using remote sensing and GIS techniques: A case study of Khulna City | |
CN109272170A (en) | A kind of traffic zone dividing system based on Louvain algorithm | |
CN110176141A (en) | A kind of traffic zone division method and system based on POI and traffic characteristic | |
CN110909434A (en) | Urban public transport trunk line network design method under low-carbon guidance | |
CN107330734A (en) | Business address system of selection based on Co location patterns and body | |
CN115965171A (en) | Micro-park site selection method based on ant colony optimization algorithm | |
CN111274700B (en) | Habitat network construction method based on vector data | |
CN112419711A (en) | Prediction method of parking demand in closed parking lot based on improved GMDH algorithm | |
CN113191553A (en) | Population space distribution estimation method and system based on building scale | |
CN112954623B (en) | Resident occupancy rate estimation method based on mobile phone signaling big data | |
Natera Orozco et al. | Quantifying life quality as walkability on urban networks: The case of Budapest | |
CN106651017A (en) | Method for site selection of land consolidation project based on ant colony optimization algorithm | |
Wang et al. | Exploring regional sustainable commuting patterns based on dockless bike-sharing data and POI data | |
CN113487465B (en) | City overlapping structure characteristic detection method and system based on label propagation algorithm | |
CN113158084B (en) | Method, device, computer equipment and storage medium for processing movement track data | |
CN114139827A (en) | Intelligent sensing and optimizing method for urban functional area function performance | |
Ostroukh et al. | Research on Using Deep Learning for Transport Demand Prediction | |
CN113034005A (en) | Method and system for analyzing space difference influence factors of traditional village population hollowing degree | |
CN117541928B (en) | A method and system for estimating urban building material stock based on convolutional neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20220930 |