WO2020172954A1 - Living circle identification method based on positioning data - Google Patents

Living circle identification method based on positioning data Download PDF

Info

Publication number
WO2020172954A1
WO2020172954A1 PCT/CN2019/081820 CN2019081820W WO2020172954A1 WO 2020172954 A1 WO2020172954 A1 WO 2020172954A1 CN 2019081820 W CN2019081820 W CN 2019081820W WO 2020172954 A1 WO2020172954 A1 WO 2020172954A1
Authority
WO
WIPO (PCT)
Prior art keywords
point
behavior
points
living
circle
Prior art date
Application number
PCT/CN2019/081820
Other languages
French (fr)
Chinese (zh)
Inventor
杨俊宴
金探花
史北祥
王桥
魏晋
史宜
Original Assignee
东南大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 东南大学 filed Critical 东南大学
Publication of WO2020172954A1 publication Critical patent/WO2020172954A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/021Services related to particular areas, e.g. point of interest [POI] services, venue services or geofences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/029Location-based management or tracking services
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Definitions

  • the invention belongs to the technical field of urban planning, and particularly relates to a method for identifying a living circle based on positioning data.
  • the living circle determines the urban facility space allocation and the shaping of the quality of life.
  • the existing methods of defining the scope of the living circle at home and abroad mainly define the living circle in terms of time distance, spatial distance or function.
  • the analysis method of defining the living circle by time distance mainly delimits the boundary of the urban living circle by calculating "15-minute walking distance", "one-hour commuting circle", etc.; defined by spatial distance
  • the living circle of Shanghai mainly uses the "800m walking distance” to determine the reachable space boundary to form the range of the living circle; while the living circle defined by function determines the boundary of the living circle by calculating the distance of major facilities.
  • the above three types of life circle delineation methods all have certain limitations, and more people use subjective feelings and experience to determine the life circle. First of all, in the purely dividing method by traffic time, the continuous movement behavior does not conform to the characteristics of the actual crowd activities.
  • This method lacks consideration of the real behavior of people in life; through the pure spatial distance dividing method, by judging the short-term walking The farthest distance ignores the behavioral characteristics of specific individuals and blurs the differences between crowd activities.
  • the analysis of the level of the living circle only at the level of the city range does not incorporate the actual use of the population.
  • the current life circle identification methods are more subjectively defined methods or small-scale real representation methods, lacking the true restoration of the urban-scale population's living area, and failing to integrate with the current multi-source big data application.
  • the present invention proposes a living circle identification method based on positioning data, aiming at the problem of simply making a circle with the place of residence or work in the existing living circle, ignoring the real situation of the crowd, and identifying the crowd with positioning data
  • the behavior of people is integrated with the behavior points within the daily walking range to get the real living and working life circle of the crowd.
  • the technical solution adopted by the present invention is: a method for identifying a living circle based on positioning data.
  • the method includes the following steps:
  • step 2 Identify the behavior points of the effective positioning data file obtained in step 1, and divide the different behavior characteristics of the behavior points according to the duration of the behavior points in a specific time period: including residential behavior point M, work behavior point N, and life Behavior point set P.
  • step (1) includes:
  • the data collection time is every 5 minutes.
  • the data content includes the user number, the user's location update date and time, and the user's real-time latitude and longitude coordinates.
  • Positioning data can be obtained through mobile phone operators, or through GPS data collection.
  • the anonymous location data number is the user identification number after desensitization, and does not involve the user's private information.
  • the data collection time interval can be set freely in actual work.
  • the standard format of positioning data is as follows:
  • step (2) includes:
  • (2.2) points to sort all tracks in time series trajectory point O n determines which path point O n + 1 from the track point O n is less than S m, such as less than S m, is as O n a Adjacent points; continue the above operations, continue to judge, to the track point O n+m , if the distance from the track point O n is greater than or equal to the threshold S meters, the search stops; calculate the track point O n+m and the track point O n time intervals, such as greater than or equal to the duration T 1 minute, the recording track of all points to O n O n + m-1; such duration is less than T 1 minute, to O n O n + m-1 is not satisfied Request, the result is invalid; go to the next track point, that is, start at track point O n+1 , repeat the above steps;
  • step 2.1.2 Cluster all trajectory points O n to On +m-1 obtained in step 2.1.2 to generate behavior points, where the latitude and longitude of the behavior point is the mean value of all trajectory points O n to O n+m-1 , The time t n when the track point O n is located is the start time of the action point, and the time t n+m-1 when the track point O n+m-1 is located is the end time of the action point;
  • step (2.5) Determine in turn whether the interval between the start time and end time of the action points obtained in step (2.4) has an intersection with the interval 23:00-05:00, record the above-mentioned intersection action points to form a set A1, and calculate the A1
  • the behavior point with the maximum time difference between the end time and the start time of the behavior point is regarded as the residence behavior point M;
  • step (2.6) Determine in turn whether the start time and end time of the behavior points obtained in step (2.4) have an intersection with the 08:00-17:00 interval, record the above-mentioned intersection behavior points to form a set A2, and calculate the behavior points in A2
  • the action point with the maximum time difference between the end time and the start time of is regarded as the work action point N;
  • step (3) includes:
  • step (2) Load the behavior point file obtained in step (2) into ArcGIS software in layers to form three layers: residential behavior point M, work behavior point N, and life behavior point set P.
  • step (4) includes:
  • the living area of each object in the residential area can be combined to obtain the living area of all objects in the residential area; the working and living area of each object in the office area can be combined to obtain a specific The working life circle area of all objects in the office area.
  • the present invention uses positioning data to identify effective behaviors of the crowd through algorithms and rules, and obtains the actual residential behavior points, work behavior points and life behavior points of the crowd. Taking the walking distance in a specific area within a specific time as the radius of the life circle, extract the life behavior points within the radius of the life circle from the residence behavior point and the work behavior point respectively, as the residence life behavior point and work life behavior point.
  • the minimum boundary geometric area for generating living behavior points and living life behavior points is the living life circle
  • the minimum boundary geometric area for generating working behavior points and work life behavior points is the working life circle.
  • the present invention makes full use of positioning data to accurately locate the crowd
  • the real living or working life circle is more scientific and objective than the circle in the traditional sense, providing a scientific basis for the analysis and construction of the living or working life circle planning.
  • FIG. 1 Schematic diagram of behavior point recognition method
  • Fig. 3 is a schematic diagram of the living space of the present invention.
  • Fig. 4 is a schematic diagram of the working and living circle space of the present invention.
  • Figure 5 The application of the present invention in the living circle of a user in the Zhima Village residential area in Xinjiekou area, Nanjing;
  • Fig. 6 The application of the present invention in the residential living circle of Zhima Village, Xinjiekou, Nanjing.
  • EXCEL software to preprocess the anonymous mobile phone location data CSV file, where the anonymous mobile phone location data CSV file is obtained from the mobile phone information push service provider, including the user number, the date, time, and latitude and longitude coordinate information of the user's location update; and , Or use GPS positioning equipment to collect positioning data files;
  • the specific steps for obtaining the mobile phone location data of anonymous users through the mobile phone information push service provider are as follows: Obtain the mobile phone location data of all users in Nanjing on November 11, 2017 through the mobile phone information push service provider.
  • the data collection time is every 5 minutes.
  • the collection time interval can be preset according to actual needs.
  • the data content includes user ID, user location update date and time, user real-time latitude and longitude coordinates, a total of 2.321 billion pieces of collected data.
  • the specific data format requirements are shown in Table 1, including desensitized data.
  • the user's mobile phone location data number is the desensitized mobile phone identification number, which does not involve the user's private information.
  • the distance threshold S is 50 meters
  • the shortest time interval T 1 threshold is 15 minutes.
  • the 50 meters and 15 minutes in this example are just examples, and different values can be set according to actual needs;
  • (2.2) points to sort all tracks in time series trajectory point O n, determines which path point O n + 1 from the track point O n is less than 50 m, such as less than 50 meters, then as O n a Adjacent points; continue the above operations, continue to judge, to the track point O n+m , if the distance from the track point O n is greater than or equal to the threshold 50 meters, the search stops; calculate the track point O n+m and the track point O n time intervals, as a duration greater than or equal to 15 minutes, all the recording track point to O n O n + m-1; such a duration less than 15 minutes, to O n O n + m-1 does not meet the requirements, The result is invalid; go to the next trajectory point, starting from the trajectory point O n+1 , repeat the above steps until a point that meets the conditions is found, go to step (2.3).
  • (2.3) Cluster all track points O n to O n+m-1 obtained in step 2.2 to generate behavior points.
  • the latitude and longitude of the behavior point is the mean value of the latitude and longitude of all the trajectory points O n to On +m-1
  • the time t n when the trajectory point O n is located is the start time of the behavior point
  • the trajectory point O n+m-1 The time t n+m-1 when it is located is the end time of the action point;
  • step (2.5) Determine in turn whether the interval between the start time and the end time of the action point obtained in step (2.4) has an intersection with the interval 23:00-05:00, record all the above-mentioned intersection action points to form a set A1, and calculate A1
  • the behavior point with the maximum time difference between the end time and the start time of the behavior point is taken as the residence behavior point M;
  • step (2.6) Determine in turn whether the interval between the start time and end time of the action point obtained in step (2.4) has an intersection with the interval 08:00-17:00, record all the above-mentioned intersection action points to form a set A2, and calculate A2
  • the behavior point with the maximum time difference between the end time and the start time of the behavior point is regarded as the work behavior point N;
  • the measured D value is 1.5 km, and the life behavior point set Q within 1.5 km from the residential behavior point is extracted.
  • the target layer is the life behavior point set P layer of user 00069f1927ee5cef9919335d75170d3
  • the source layer is circle M
  • the spatial selection method of the layer and target layer features is "completely located within the range of the source layer features”.
  • the exported layer is an SHP file of the "living life behavior point set Q”.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Remote Sensing (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Navigation (AREA)
  • Position Fixing By Use Of Radio Waves (AREA)

Abstract

Disclosed in the present invention is a living circle identification method based on positioning data: first pre-processing a positioning data file; then performing behaviour point identification on the positioning data and, on the basis of the continuous duration of the behaviour points in a specific time period, dividing into residential behaviour point, work behaviour point, and living behaviour point sets; selecting a specific city or area, and measuring or assigning an experience value to acquire a living circle radius D value; extracting living behaviour points that are respectively less than the D value from residential behaviour points M and work behaviour points N to form a residential living behaviour point set Q and a work living behaviour point set S; respectively extracting points M and point set Q and points N and point set S to form a minimum boundary geometric area and forming a residential living circle area and a work living circle area. The present invention makes full use of positioning data to accurately demarcate the real residential or work living circles of a population, being more scientific and objective than the traditional method of demarcating a circle with the place of residence as the centre point, and providing a scientific basis for residential or work living circle planning analysis and construction.

Description

一种基于定位数据的生活圈识别方法A life circle identification method based on positioning data 技术领域Technical field
本发明属于城市规划技术领域,尤其涉及一种基于定位数据的生活圈识别方法。The invention belongs to the technical field of urban planning, and particularly relates to a method for identifying a living circle based on positioning data.
背景技术Background technique
随着城市更新热潮的持续高涨,人们对集约的空间布局与精致的生活品质的需求不断提升。而生活圈作为城市人居环境的基本单元,决定着城市的设施空间配置及生活品质的塑造。国内外现有的生活圈范围界定方法主要以时间距离、空间距离或者是功能来定义生活圈等。With the continuous upsurge of urban renewal, people's demand for intensive spatial layout and exquisite quality of life continues to increase. As the basic unit of the urban human settlement environment, the living circle determines the urban facility space allocation and the shaping of the quality of life. The existing methods of defining the scope of the living circle at home and abroad mainly define the living circle in terms of time distance, spatial distance or function.
此类生活圈范围界定的方法中,以时间距离定义生活圈的分析方法主要通过计算“15分钟步行距离”、“一小时通勤圈”等方式来划定城市生活圈的边界;以空间距离定义的生活圈主要通过“800m的步行距离”来确定可达空间边界,形成生活圈的范围;而以功能来定义的生活圈则通过计算重大设施的距离来确定生活圈边界。以上三类生活圈的划定方法皆存在一定的局限性,更多的以人为主观感受与经验判断来确定生活圈。首先,单纯的通过交通时间划分方法中,持续的移动行为并不符合实际人群活动特征,此方法缺乏对生活中人的真实行为活动进行考虑;通过纯空间距离划分方法,通过判断人群短期步行的最远距离,忽视了具体个体的行为特征,模糊了人群活动之间的差异性。通过重大设施等级的划分方法,仅从城市范围层面,对生活圈的等级分析,并未结合人群的真实使用情况。Among the methods of defining the scope of this kind of living circle, the analysis method of defining the living circle by time distance mainly delimits the boundary of the urban living circle by calculating "15-minute walking distance", "one-hour commuting circle", etc.; defined by spatial distance The living circle of Shanghai mainly uses the "800m walking distance" to determine the reachable space boundary to form the range of the living circle; while the living circle defined by function determines the boundary of the living circle by calculating the distance of major facilities. The above three types of life circle delineation methods all have certain limitations, and more people use subjective feelings and experience to determine the life circle. First of all, in the purely dividing method by traffic time, the continuous movement behavior does not conform to the characteristics of the actual crowd activities. This method lacks consideration of the real behavior of people in life; through the pure spatial distance dividing method, by judging the short-term walking The farthest distance ignores the behavioral characteristics of specific individuals and blurs the differences between crowd activities. Through the classification method of major facilities, the analysis of the level of the living circle only at the level of the city range does not incorporate the actual use of the population.
此外,也存在一些专家学者通过GPS定位数据,对人群活动范围进行真实重现,但分析的方法与技术较为局限,通过一个小范围,少量人,短时间的测度方式来划定某一小范围地段的生活圈边界,难以对城市整体的人群生活圈进行范围的界定。因此,目前生活圈识别的方式更多的是人为主观的界定方式或者小范围的真实表征方法,缺乏城市尺度的人群生活范围的真实还原,并且未能与当下多源大数据的应用相结合。In addition, there are some experts and scholars who use GPS positioning data to truly reproduce the range of crowd activities, but the analysis methods and techniques are more limited. A small area, a small number of people, and a short-term measurement method are used to delimit a small area. The boundary of the living circle of the lot makes it difficult to define the scope of the overall urban living circle. Therefore, the current life circle identification methods are more subjectively defined methods or small-scale real representation methods, lacking the true restoration of the urban-scale population's living area, and failing to integrate with the current multi-source big data application.
发明内容Summary of the invention
发明目的:针对以上问题,本发明提出一种基于定位数据的生活圈识别方法,针对现有生活圈以居住地或工作地简单做圆,忽视人群的真实情况的问题,以定位数据,识别人群的行为,以日常步行范围内的行为点集成面,得出人群真实的居住生活圈及工作生活圈。Purpose of the invention: In view of the above problems, the present invention proposes a living circle identification method based on positioning data, aiming at the problem of simply making a circle with the place of residence or work in the existing living circle, ignoring the real situation of the crowd, and identifying the crowd with positioning data The behavior of people is integrated with the behavior points within the daily walking range to get the real living and working life circle of the crowd.
技术方案:为实现本发明的目的,本发明所采用的技术方案是:一种基于定位数据的生活圈识别方法,该方法包括以下步骤:Technical solution: In order to achieve the purpose of the present invention, the technical solution adopted by the present invention is: a method for identifying a living circle based on positioning data. The method includes the following steps:
(1)获取用户的定位数据文件,运用EXCEL软件对定位数据文件进行预处理。(1) Obtain the user's positioning data file, and use EXCEL software to preprocess the positioning data file.
(2)对步骤1得到的有效定位数据文件进行行为点识别,并根据行为点在特定时间段的持续时长,划分出行为点的不同行为性质:包括居住行为点M、工作行为点N和生活行为点集P。(2) Identify the behavior points of the effective positioning data file obtained in step 1, and divide the different behavior characteristics of the behavior points according to the duration of the behavior points in a specific time period: including residential behavior point M, work behavior point N, and life Behavior point set P.
(3)选取特定城市或地区,实测或赋经验值获取特定时长人群的步行距离,得出特定时长生活圈半径D值。提取距居住行为点M距离小于D值的生活行为点,形成居住生活行为点集Q;和/或,提取距工作行为点N距离小于D值的生活行为点,形成工作行为点集S。(3) Select a specific city or region, measure or assign an empirical value to obtain the walking distance of the crowd of a specific time period, and obtain the value of the radius D of the life circle for a specific time period. Extract the life behavior points whose distance from the residence behavior point M is less than the value of D to form the residence life behavior point set Q; and/or extract the life behavior points whose distance from the work behavior point N is less than the value D to form the work behavior point set S.
(4)提取点M和点集Q形成最小边界几何面域,获取用户的居住生活圈面域Y,和/或提取点N和点集S形成最小边界几何面域,获取用户的工作生活圈面域L。(4) Extract point M and point set Q to form the minimum boundary geometric area, obtain the user’s living area area Y, and/or extract points N and point set S to form the minimum boundary geometric area, obtain the user’s work and life area Area L.
进一步的,所述步骤(1)包括:Further, the step (1) includes:
(1.1)获得脱敏定位数据,数据采集时间为5分钟一次,数据内容包括用户编号,用户位置更新日期与时间,用户实时经纬度坐标。定位数据可以通过手机运营商获得,亦可通过GPS数据采集获得。其中匿名定位数据编号为脱敏后的用户识别号,不涉及到用户的隐私信息。实际工作中数据采集时间间隔可以自由设定。定位数据的标准格式如下:(1.1) Obtain desensitization positioning data. The data collection time is every 5 minutes. The data content includes the user number, the user's location update date and time, and the user's real-time latitude and longitude coordinates. Positioning data can be obtained through mobile phone operators, or through GPS data collection. The anonymous location data number is the user identification number after desensitization, and does not involve the user's private information. The data collection time interval can be set freely in actual work. The standard format of positioning data is as follows:
表1 定位数据格式示例Table 1 Example of positioning data format
Figure PCTCN2019081820-appb-000001
Figure PCTCN2019081820-appb-000001
(1.2)将脱敏的定位数据储存于配置不低于Intel Xeon Processor E5-2620V4处理器,512G SSD,128G DDR4内存的工作站上,并在此工作站上进行具体操作。(1.2) Store the desensitized positioning data on a workstation with no lower configuration than Intel Xeon Processor E5-2620V4 processor, 512G SSD, 128G DDR4 memory, and perform specific operations on this workstation.
(1.3)在Excel程序中,使用‘查找和选择’命令,勾选‘空值’,获取所有空值单元格,右击“删除”命令,选择删除“整行”。使用“数据”命令栏下的“删除重复项”,勾选‘用户ID’列,删除所有用户重复数据。使用“排序和筛选”命令栏下的 “筛选”命令,勾选‘数字筛选’,选择并删除日期或者经纬度不在正常值范围内的数据。(1.3) In the Excel program, use the ‘find and select’ command, check the ‘empty value’ to get all the blank value cells, right click on the “delete” command and select to delete the “whole row”. Use the "Delete Duplicate Items" under the "Data" command column, check the "User ID" column to delete all user duplicate data. Use the "Filter" command under the "Sort and Filter" command bar, check "Digital Filter", select and delete data whose date or latitude and longitude are not within the normal range.
进一步的,所述步骤(2)包括:Further, the step (2) includes:
(2.1)定义距离阈值和最短时间间隔阈值,距离阈值为S米,最短时间间隔阈值为T 1分钟; (2.1) Define the distance threshold and the shortest time interval threshold, the distance threshold is S meters, and the shortest time interval threshold is T 1 minute;
(2.2)对所有轨迹点按照时间序列排序,对轨迹点O n,判断其下一个轨迹点O n+1与轨迹点O n的距离是否小于S米,如小于S米,则作为O n的相邻点;持续以上操作,继续判断,至轨迹点O n+m,若其与轨迹点O n的距离大于或等于阈值S米,即搜索停止;计算轨迹点O n+m和轨迹点O n的时间间隔,如持续时间大于或等于T 1分钟,则记录所有轨迹点O n到O n+m-1;如持续时间小于T 1分钟,则O n到O n+m-1不满足要求,结果无效;转至下一个轨迹点,即轨迹点O n+1开始,重复以上步骤; (2.2) points to sort all tracks in time series trajectory point O n, determines which path point O n + 1 from the track point O n is less than S m, such as less than S m, is as O n a Adjacent points; continue the above operations, continue to judge, to the track point O n+m , if the distance from the track point O n is greater than or equal to the threshold S meters, the search stops; calculate the track point O n+m and the track point O n time intervals, such as greater than or equal to the duration T 1 minute, the recording track of all points to O n O n + m-1; such duration is less than T 1 minute, to O n O n + m-1 is not satisfied Request, the result is invalid; go to the next track point, that is, start at track point O n+1 , repeat the above steps;
(2.3)将步骤2.1.2得到的所有轨迹点O n到O n+m-1聚类,生成行为点,其中行为点的经纬度为所有轨迹点O n到O n+m-1经纬度的均值,轨迹点O n被定位的时间t n为行为点的开始时间,轨迹点O n+m-1被定位时的时间t n+m-1作为行为点的结束时间; (2.3) Cluster all trajectory points O n to On +m-1 obtained in step 2.1.2 to generate behavior points, where the latitude and longitude of the behavior point is the mean value of all trajectory points O n to O n+m-1 , The time t n when the track point O n is located is the start time of the action point, and the time t n+m-1 when the track point O n+m-1 is located is the end time of the action point;
(2.4)从轨迹点O n+m开始,重复步骤2.2与步骤2.3,遍历所有轨迹点,识别出所有的行为点; (2.4) Starting from the track point On +m , repeat steps 2.2 and 2.3 to traverse all track points and identify all behavior points;
(2.5)依次判断步骤(2.4)得到的行为点的开始时间和结束时间的区间是否与23:00-05:00区间有交集,记录上述有交集的行为点形成集合A1,并计算A1中的行为点的结束时间与开始时间的时间差最大值的行为点,作为居住行为点M;(2.5) Determine in turn whether the interval between the start time and end time of the action points obtained in step (2.4) has an intersection with the interval 23:00-05:00, record the above-mentioned intersection action points to form a set A1, and calculate the A1 The behavior point with the maximum time difference between the end time and the start time of the behavior point is regarded as the residence behavior point M;
(2.6)依次判断步骤(2.4)得到的行为点的开始时间和结束时间是否与08:00-17:00区间有交集,记录上述有交集的行为点形成集合A2,并计算A2中的行为点的结束时间与开始时间的时间差最大值的行为点,作为工作行为点N;(2.6) Determine in turn whether the start time and end time of the behavior points obtained in step (2.4) have an intersection with the 08:00-17:00 interval, record the above-mentioned intersection behavior points to form a set A2, and calculate the behavior points in A2 The action point with the maximum time difference between the end time and the start time of is regarded as the work action point N;
(2.7)所有行为点文件中,剔除居住行为点M以及工作行为点N,得到生活行为点集P={p1,p2,p3,……,pn}。(2.7) In all the behavior point files, remove the residence behavior point M and the work behavior point N, and get the life behavior point set P={p1, p2, p3,..., pn}.
进一步的,所述步骤(3)包括:Further, the step (3) includes:
(3.1)实测确定生活圈半径大小:选取拟划定生活圈的城市/地区,选择独立出行的人员,其中0-20岁市民、21-40岁市民、41-60岁市民、大于60岁市民,各个年龄段男女各L人,总实验人数为8*L人,在道路上连续行走T 2分钟,以佳明GPSMAP 62scGPS记录仪器记录他们的所有步行距离,取所有距离的平均值作为生活圈半径D;或者通过 赋经验值,单位分钟步行距离为d,D=d*T 2,其中,时长T 2基于城市和地区的差异,通常取值为15、30或60。 (3.1) Determine the radius of the living circle through actual measurement: select the city/region to be delimited for the living circle, and choose people who travel independently, including citizens aged 0-20, citizens 21-40, citizens 41-60, and citizens older than 60 , There are L men and women in all age groups, the total number of people in the experiment is 8*L, and they walked on the road for T 2 minutes continuously, and recorded all their walking distances with Jiaming GPSMAP 62scGPS recording instrument, and took the average of all distances as the radius of the life circle D; or by assigning an empirical value, the walking distance per minute is d, D=d*T 2 , where the duration T 2 is based on the difference between cities and regions, usually taking the value 15, 30 or 60.
(3.2)将步骤(2)得到的行为点文件,分图层加载到ArcGIS软件中,形成居住行为点M、工作行为点N、到生活行为点集P三个图层。(3.2) Load the behavior point file obtained in step (2) into ArcGIS software in layers to form three layers: residential behavior point M, work behavior point N, and life behavior point set P.
(3.3)在ArcGIS软件中,分别以点M和点N为圆心,画半径为D的圆形,为圆M图层和圆N图层。(3.3) In the ArcGIS software, draw a circle with a radius of D with point M and point N as the center of the circle, which is a circle M layer and a circle N layer.
(3.4)使用ArcGIS软件中“选择”下的“按照位置选择文件”命令,勾选“从以下图层中选择要素”,目标图层为生活行为点集P图层,源图层分别为圆M图层和源N图层,目标图层要素的空间选择方法为“完全位于源图层要素范围内”,选择后,导出图层为“居住生活行为点集Q”和“工作生活行为点集S”的SHP文件。(3.4) Use the "Select file according to location" command under "Select" in ArcGIS software, check "Select features from the following layers", the target layer is the life behavior point set P layer, and the source layers are circles. M layer and source N layer, the spatial selection method of the target layer feature is "completely within the range of the source layer feature", after selection, the exported layer is "living life behavior point set Q" and "working life behavior point Set S" SHP file.
进一步的,所述步骤(4)包括:Further, the step (4) includes:
(4.1)在ArcGIS软件中,使用“合并”命令,将点M和点集Q合并,得到点集X;(4.1) In ArcGIS software, use the "merge" command to merge point M and point set Q to get point set X;
(4.2)在ArcGIS软件中,使用“最小边界几何”命令,在输入要素中,输入点集X,在地理选项中,选择“凸包(CONVEX HULL)”,输出面域Y,所形成的面域范围,即为目标对象的居住生活圈范围。如图3所示,为本发明的空间示意图,其中面域MP 1P 2即为居住生活圈范围。 (4.2) In ArcGIS software, use the "minimum boundary geometry" command, in the input features, input the point set X, in the geographic options, select "CONVEX HULL", output area Y, the formed surface The domain scope is the living circle scope of the target object. As shown in Fig. 3, it is a schematic diagram of the space of the present invention, in which the area MP 1 P 2 is the range of the living circle.
(4.3)在ArcGIS软件中,使用“合并”命令,将点N和点集S合并,得到点集K;(4.3) In ArcGIS software, use the "merge" command to merge point N and point set S to get point set K;
(4.4)在ArcGIS软件中,使用“最小边界几何”命令,在输入要素中,输入点集K,在地理选项中,选择“凸包(CONVEX HULL)”,输出面域L,所形成的面域范围,即为目标对象的工作生活圈范围。如图4所示,为本发明的空间示意图,其中面域NP 3P nP 4即为工作生活圈范围。 (4.4) In ArcGIS software, use the "minimum boundary geometry" command, in the input elements, input point set K, in the geographic options, select "CONVEX HULL", output area L, the formed area The domain scope is the working life circle scope of the target object. As shown in Fig. 4, it is a schematic diagram of the space of the present invention, where the area NP 3 P n P 4 is the range of the working and living circle.
进一步的,还可以将居住区每个对象的居住生活圈面域合并,即得到该居住区所有对象的居住生活圈面域;将办公区每个对象的工作生活圈面域合并,即得到特定办公区所有对象的工作生活圈面域。Furthermore, the living area of each object in the residential area can be combined to obtain the living area of all objects in the residential area; the working and living area of each object in the office area can be combined to obtain a specific The working life circle area of all objects in the office area.
有益效果:与现有技术相比,本发明的技术方案具有以下有益技术效果:Beneficial effects: Compared with the prior art, the technical solution of the present invention has the following beneficial technical effects:
本发明利用定位数据,通过算法和规则识别人群有效的行为,得出人群真实的居住行为点、工作行为点和生活行为点。以特定地区特定时间内的步行距离,作为生活圈半径,提取出分别距离居住行为点和工作行为点生活圈半径范围内的生活行为点,为居住生活行为点和工作生活行为点。生成居住行为点和居住生活行为点的最小边界几何面域为居住生活圈,生成工作行为点和工作生活行为点的最小边界几何面域为工作生活圈, 本发明充分利用定位数据能够精准定位人群真实的居住或工作生活圈,比传统意义上的圆形更为科学和客观,为居住或工作生活圈规划的分析和建设提供科学依据。The present invention uses positioning data to identify effective behaviors of the crowd through algorithms and rules, and obtains the actual residential behavior points, work behavior points and life behavior points of the crowd. Taking the walking distance in a specific area within a specific time as the radius of the life circle, extract the life behavior points within the radius of the life circle from the residence behavior point and the work behavior point respectively, as the residence life behavior point and work life behavior point. The minimum boundary geometric area for generating living behavior points and living life behavior points is the living life circle, and the minimum boundary geometric area for generating working behavior points and work life behavior points is the working life circle. The present invention makes full use of positioning data to accurately locate the crowd The real living or working life circle is more scientific and objective than the circle in the traditional sense, providing a scientific basis for the analysis and construction of the living or working life circle planning.
附图说明Description of the drawings
图1本发明的流程图;Figure 1 Flow chart of the present invention;
图2行为点识别方法示意图;Figure 2 Schematic diagram of behavior point recognition method;
图3本发明的居住生活圈空间示意图;Fig. 3 is a schematic diagram of the living space of the present invention;
图4本发明的工作生活圈空间示意图;Fig. 4 is a schematic diagram of the working and living circle space of the present invention;
图5本发明在南京市新街口片区止马村居住小区某用户居住生活圈的应用;Figure 5 The application of the present invention in the living circle of a user in the Zhima Village residential area in Xinjiekou area, Nanjing;
图6本发明在南京市新街口止马村居住小区居住生活圈的应用。Fig. 6 The application of the present invention in the residential living circle of Zhima Village, Xinjiekou, Nanjing.
具体实施方式detailed description
下面结合附图和实施例对本发明的技术方案作进一步的说明。The technical scheme of the present invention will be further described below in conjunction with the drawings and embodiments.
以下将基于定位数据识别南京新街口止马村居住小区的15分钟居住生活圈及附图来详细地说明本发明的技术方案,如:The technical scheme of the present invention will be described in detail below based on the positioning data to identify the 15-minute residential life circle of the Zhima Village residential area in Xinjiekou, Nanjing and the accompanying drawings, such as:
(1)运用EXCEL软件对匿名手机定位数据CSV文件进行预处理,其中匿名手机定位数据CSV文件从手机信息推送服务商处获取,包括用户编号,用户位置更新的日期,时间和经纬度坐标信息;并且,或通过GPS定位仪器搜集获取定位数据文件;(1) Use EXCEL software to preprocess the anonymous mobile phone location data CSV file, where the anonymous mobile phone location data CSV file is obtained from the mobile phone information push service provider, including the user number, the date, time, and latitude and longitude coordinate information of the user's location update; and , Or use GPS positioning equipment to collect positioning data files;
(1.1)通过手机信息推送服务商获得匿名用户的手机定位数据的具体步骤如下:通过手机信息推送服务商获得南京所有用户2017年11月11号的手机定位数据,数据采集时间为5分钟一次,采集时间间隔可以根据实际需要预先设置,数据内容包括用户编号,用户位置更新日期与时间,用户实时经纬度坐标,共采集数据23.21亿条,具体数据格式要求如表1所示,包含脱敏后的用户编号,用户性别、年龄、位置更新日期、时间、经度、维度信息。其中用户手机定位数据编号为脱敏后的手机识别号,不涉及到用户的隐私信息。将匿名手机定位数据储存于配置不低于Intel Xeon Processor E5-2620V4处理器,512G SSD,128G DDR4内存的工作站上,并在此工作站上进行具体操作。(1.1) The specific steps for obtaining the mobile phone location data of anonymous users through the mobile phone information push service provider are as follows: Obtain the mobile phone location data of all users in Nanjing on November 11, 2017 through the mobile phone information push service provider. The data collection time is every 5 minutes. The collection time interval can be preset according to actual needs. The data content includes user ID, user location update date and time, user real-time latitude and longitude coordinates, a total of 2.321 billion pieces of collected data. The specific data format requirements are shown in Table 1, including desensitized data. User ID, user gender, age, location update date, time, longitude, latitude information. The user's mobile phone location data number is the desensitized mobile phone identification number, which does not involve the user's private information. Store anonymous mobile phone location data on a workstation with no lower configuration than Intel Xeon Processor E5-2620V4 processor, 512G SSD, 128G DDR4 memory, and perform specific operations on this workstation.
表2 数据格式示例Table 2 Data format example
Figure PCTCN2019081820-appb-000002
Figure PCTCN2019081820-appb-000002
(1.2)运用EXCEL软件对步骤1得到的南京所有手机用户定位数据CSV文件进行预处理,删除缺失数据,重复数据和异常数据,得到所有用户的有效定位数据CSV文件。(1.2) Use EXCEL software to preprocess the CSV file of all mobile phone user positioning data in Nanjing obtained in step 1, delete missing data, duplicate data and abnormal data, and obtain effective positioning data CSV files for all users.
例如,在Excel程序中,使用‘查找和选择’命令,勾选‘空值’,获取所有空值单元格,右击“删除”命令,选择删除“整行”。使用“数据”命令栏下的“删除重复项”,勾选‘用户ID’列,删除所有用户重复数据。使用“排序和筛选”命令栏下的“筛选”命令,勾选‘数字筛选’,选择并删除日期或者经纬度不在正常值范围内的数据。For example, in the Excel program, use the ‘find and select’ command, check the ‘empty value’ to get all the blank value cells, right click on the “delete” command, and select to delete the “whole row”. Use the "Delete Duplicate Items" under the "Data" command column, check the "User ID" column to delete all user duplicate data. Use the "Filter" command under the "Sort and Filter" command bar, check "Digital Filter", select and delete data whose date or latitude and longitude are not within the normal range.
(2)对步骤2得到的南京所有手机用户有效定位数据CSV文件进行行为点识别;根据行为点所在的时间段,划分出南京所有手机用户行为点的不同行为性质:包括居住行为点、工作行为点和生活行为点。(2) Perform behavior point recognition on the CSV file of effective positioning data for all mobile phone users in Nanjing obtained in step 2. According to the time period when the behavior point is located, divide the different behavior characteristics of all mobile phone users in Nanjing: including residential behavior points and work behaviors Points and life behavior points.
(2.1)定义距离阈值和最短时间间隔阈值,距离阈值S为50米,最短时间间隔T 1阈值为15分钟,该实例中的50米和15分钟只是示例,可以根据实际需要设置不同的值; (2.1) Define the distance threshold and the shortest time interval threshold. The distance threshold S is 50 meters, and the shortest time interval T 1 threshold is 15 minutes. The 50 meters and 15 minutes in this example are just examples, and different values can be set according to actual needs;
(2.2)对所有轨迹点按照时间序列排序,对轨迹点O n,判断其下一个轨迹点O n+1与轨迹点O n的距离是否小于50米,如小于50米,则作为O n的相邻点;持续以上操作,继续判断,至轨迹点O n+m,若其与轨迹点O n的距离大于或等于阈值50米,即搜索停止;计算轨迹点O n+m和轨迹点O n的时间间隔,如持续时间大于或等于15分钟,则记录所有轨迹点O n到O n+m-1;如持续时间小于15分钟,则O n到O n+m-1不满足要求,结果无效;转至下一个轨迹点,即轨迹点O n+1开始,重复以上步骤,直到找到符合条件的点,转到步骤(2.3)。 (2.2) points to sort all tracks in time series trajectory point O n, determines which path point O n + 1 from the track point O n is less than 50 m, such as less than 50 meters, then as O n a Adjacent points; continue the above operations, continue to judge, to the track point O n+m , if the distance from the track point O n is greater than or equal to the threshold 50 meters, the search stops; calculate the track point O n+m and the track point O n time intervals, as a duration greater than or equal to 15 minutes, all the recording track point to O n O n + m-1; such a duration less than 15 minutes, to O n O n + m-1 does not meet the requirements, The result is invalid; go to the next trajectory point, starting from the trajectory point O n+1 , repeat the above steps until a point that meets the conditions is found, go to step (2.3).
(2.3)将步骤2.2得到的所有轨迹点O n到O n+m-1聚类,生成行为点。其中,行为点 的经纬度为所有轨迹点O n到O n+m-1所有点经纬度的均值,轨迹点O n被定位的时间t n为行为点的开始时间,轨迹点O n+m-1被定位时的时间t n+m-1作为行为点的结束时间; (2.3) Cluster all track points O n to O n+m-1 obtained in step 2.2 to generate behavior points. Among them, the latitude and longitude of the behavior point is the mean value of the latitude and longitude of all the trajectory points O n to On +m-1 , the time t n when the trajectory point O n is located is the start time of the behavior point, and the trajectory point O n+m-1 The time t n+m-1 when it is located is the end time of the action point;
(2.4)从轨迹点O n+m开始,重复步骤2.2与步骤2.3,遍历所有轨迹点,识别出所有的行为点; (2.4) Starting from the track point On +m , repeat steps 2.2 and 2.3 to traverse all track points and identify all behavior points;
(2.5)依次判断步骤(2.4)得到的行为点的开始时间和结束时间的区间是否与23:00-05:00区间有交集,记录上述所有有交集的行为点形成集合A1,并计算A1中的行为点的结束时间与开始时间的时间差最大值的行为点,作为居住行为点M;(2.5) Determine in turn whether the interval between the start time and the end time of the action point obtained in step (2.4) has an intersection with the interval 23:00-05:00, record all the above-mentioned intersection action points to form a set A1, and calculate A1 The behavior point with the maximum time difference between the end time and the start time of the behavior point is taken as the residence behavior point M;
(2.6)依次判断步骤(2.4)得到的行为点的开始时间和结束时间的区间是否与08:00-17:00区间有交集,记录上述所有有交集的行为点形成集合A2,并计算A2中的行为点的结束时间与开始时间的时间差最大值的行为点,作为工作行为点N;(2.6) Determine in turn whether the interval between the start time and end time of the action point obtained in step (2.4) has an intersection with the interval 08:00-17:00, record all the above-mentioned intersection action points to form a set A2, and calculate A2 The behavior point with the maximum time difference between the end time and the start time of the behavior point is regarded as the work behavior point N;
(2.7)所有行为点文件中,剔除居住行为点M以及工作行为点N,得到生活行为点集P文件={p1,p2,p3,……,pn}。(2.7) In all the behavior point files, remove the residence behavior point M and the work behavior point N, and get the life behavior point set P file={p1, p2, p3,..., pn}.
(3)实测获取15分钟步行距离D值,该实施例实测D值为1.5km,提取距离居住行为点1.5km范围内的生活行为点集Q。(3) Obtain the 15-minute walking distance D value through actual measurement. In this embodiment, the measured D value is 1.5 km, and the life behavior point set Q within 1.5 km from the residential behavior point is extracted.
(3.1)选取南京新街口片区和人群特征,实测获取15分钟步行距离为1.5km的具体步骤如下:实测确定生活圈半径大小:选取特定的南京新街口片区,选择独立出行的人员,其中0-20岁市民、21-40岁市民、41-60岁市民、大于60岁市民,各个年龄段男女各10人,总试验者各80人,在新街口城市道路上连续行走15分钟,然后使用佳明GPSMAP 62sc记录仪器记录起始点与终点的距离,取所有距离的平均值作为生活圈半径D,本实施例距离值1.5km。(3.1) Select Nanjing Xinjiekou area and the characteristics of the crowd, and the specific steps to obtain a 15-minute walking distance of 1.5km are as follows: Actual measurement determines the radius of the living circle: select a specific Nanjing Xinjiekou area and choose independent travellers, among which Citizens aged 0-20, citizens 21-40, citizens 41-60, citizens older than 60, 10 men and women of all ages, and 80 test subjects each, walked for 15 minutes on Xinjiekou city roads. Then use Garmin GPSMAP 62sc recording instrument to record the distance between the start point and the end point, and take the average of all distances as the radius D of the living circle. The distance value in this embodiment is 1.5km.
(3.2)以南京止马村居住小区为例,以居住点位于该居住小区的用户00069f1927ee5cef9919335d75170d3为例,提取其一天的行为活动轨迹点数据。(3.2) Taking the residential area of Zhima Village in Nanjing as an example, take the user 00069f1927ee5cef9919335d75170d3 whose residence is located in the residential area as an example to extract the data of the trajectory points of their behavior in one day.
(3.3)将步骤3.2得到的用户00069f1927ee5cef9919335d75170d3行为点CSV文件,分图层加载到ArcGIS软件中,形成用户00069f1927ee5cef9919335d75170d3居住行为点M、工作行为点N、到生活行为点集P三个图层。(3.3) Load the user 00069f1927ee5cef9919335d75170d3 behavior point CSV file obtained in step 3.2 into ArcGIS software in layers to form three layers of user 00069f1927ee5cef9919335d75170d3 residence behavior point M, work behavior point N, and life behavior point set P.
(3.4)在ArcGIS软件中,以用户00069f1927ee5cef9919335d75170d3的居住点M为圆心,画半径距离为1.5km的圆形,为圆M图层。(3.4) In the ArcGIS software, take the residence M of the user 00069f1927ee5cef9919335d75170d3 as the center of the circle, and draw a circle with a radius of 1.5km, which is the circle M layer.
(3.5)提取位于圆M内的生活行为点集P得到居住生活行为点集Q,方法如下:(3.5) Extract the life behavior point set P located in the circle M to obtain the residential life behavior point set Q, the method is as follows:
使用ArcGIS软件中“选择”下的“按照位置选择文件”命令,勾选“从以下图层中选择要素”,目标图层为用户00069f1927ee5cef9919335d75170d3的生活行为点集P 图层,源图层为圆M图层,目标图层要素的空间选择方法为“完全位于源图层要素范围内”,选择后,导出图层为“居住生活行为点集Q”的SHP文件。Use the "Select file by location" command under "Select" in ArcGIS software, check "Select features from the following layers", the target layer is the life behavior point set P layer of user 00069f1927ee5cef9919335d75170d3, the source layer is circle M The spatial selection method of the layer and target layer features is "completely located within the range of the source layer features". After selection, the exported layer is an SHP file of the "living life behavior point set Q".
(4)提取点M和点集Q形成最小边界几何面域,获取形成手机用户的居住生活圈面域。(4) Extract the point M and the point set Q to form the minimum boundary geometric area, and obtain the area that forms the living circle of the mobile phone user.
(4.1)在ArcGIS软件中,使用“合并”命令,将点M和点集Q合并,得到点集X;(4.1) In ArcGIS software, use the "merge" command to merge point M and point set Q to get point set X;
(4.2)在ArcGIS软件中,使用“最小边界几何”命令,在输入要素中,输入点集X,在地理选项中,选择“凸包(CONVEX HULL)”,输出面域Y,所形成的面域范围,即为手机用户00069f1927ee5cef9919335d75170d3的居住生活圈面域。如图5所示,为手机用户00069f1927ee5cef9919335d75170d3的实际生活圈,具体范围为面域P 1P 2P 3P 4P 5P 6P 7(4.2) In ArcGIS software, use the "minimum boundary geometry" command, in the input features, input the point set X, in the geographic options, select "CONVEX HULL", output area Y, the formed surface The domain range is the living area of the mobile phone user 00069f1927ee5cef9919335d75170d3. As shown in Figure 5, it is the actual living circle of the mobile phone user 00069f1927ee5cef9919335d75170d3, and the specific range is the area P 1 P 2 P 3 P 4 P 5 P 6 P 7 .
(4.3)提取居住点为止马村居住小区的所有用户的行为活动点数据,重复上述步骤3.2至步骤4.2,获取该居住小区所有用户居住生活圈几何面域集合{Y 1,Y 2,……,Y n},对所有面域取并集,输出面域Y zmc,即为该居住小区居住生活圈面域。如图6所示,为止马村居住小区的居住生活圈范围,即面域: (4.3) Extract the behavioral activity point data of all users in the Ma Village residential area up to the residential area, repeat the above steps 3.2 to 4.2 to obtain the geometric area set {Y 1 ,Y 2 ,…… , Y n }, take the union of all areas, and output area Y zmc , which is the area of the living circle of the residential area. As shown in Figure 6, the area of the residential life circle of Zhima Village Residential Area:
P M21-H 1-P M32-P M34-P M12-H 2-P M37-H 3-P M22-P M23-P M15-P M36-H 4-P M16-P M17-P M26-P M27,其中P M21是居住行为点为M2的手机用户的生活行为点,依此类推;H为该住区用户居住生活圈面域面域交汇点。 P M2 1-H 1 -P M3 2-P M3 4-P M1 2-H 2 -P M3 7-H 3 -P M2 2-P M2 3-P M1 5-P M3 6-H 4 -P M1 6-P M1 7-P M2 6-P M2 7, where P M2 1 is the life behavior point of the mobile phone user whose residential behavior point is M2, and so on; H is the intersection of the residential area of the user in the residential area point.

Claims (9)

  1. 一种基于定位数据的生活圈识别方法,其特征在于,该方法包括以下步骤:A method for identifying a living circle based on positioning data, characterized in that the method includes the following steps:
    (1)获取用户的定位数据文件,并对定位数据文件进行预处理;(1) Obtain the user's location data file and preprocess the location data file;
    (2)对步骤(1)得到的有效定位数据进行行为点识别,并将行为点划分为居住行为点M、工作行为点N和生活行为点集P;(2) Perform behavior point recognition on the effective positioning data obtained in step (1), and divide the behavior points into residential behavior point M, work behavior point N, and life behavior point set P;
    (3)设置生活圈半径D值,提取距居住行为点M距离小于D值的生活行为点,形成居住生活行为点集Q;和/或,提取距工作行为点N距离小于D值的生活行为点,形成工作生活行为点集S;(3) Set the radius of the life circle D value, extract the life behavior points whose distance from the residence behavior point M is less than the value of D, and form the residence life behavior point set Q; and/or, extract the life behaviors whose distance from the work behavior point N is less than the value D Point, forming a point set S of work and life behavior;
    (4)提取点M和点集Q形成最小边界几何面域,获取用户的居住生活圈面域Y,和/或提取点N和点集S形成最小边界几何面域,获取用户的工作生活圈面域L。(4) Extract point M and point set Q to form the minimum boundary geometric area, obtain the user’s living area area Y, and/or extract points N and point set S to form the minimum boundary geometric area, obtain the user’s work and life area Area L.
  2. 根据权利要求1或2所述的一种基于定位数据的生活圈识别方法,其特征在于,所述定位数据文件包括用户编号,用户位置更新的日期,时间和经纬度坐标信息。A method for identifying a living circle based on positioning data according to claim 1 or 2, wherein the positioning data file includes a user number, date, time, and latitude and longitude coordinate information of the user's location update.
  3. 根据权利要求1或2所述的一种基于定位数据的生活圈识别方法,其特征在于,步骤(1)获取用户的定位数据文件方法如下:定位数据文件可从手机信息推送服务商处获取脱敏的定位数据,或通过GPS定位仪器搜集获取GPS定位数据。The method for identifying a living circle based on positioning data according to claim 1 or 2, characterized in that the method of obtaining the user's positioning data file in step (1) is as follows: the positioning data file can be obtained from a mobile phone information push service provider. Sensitive positioning data, or GPS positioning data collected by GPS positioning equipment.
  4. 根据权利要求1或2所述的一种基于定位数据的生活圈识别方法,其特征在于,所述步骤(1)对定位数据文件进行预处理的方法如下:在EXCEL软件中,使用查找和选择功能内的定位选项,选择并删除缺失用户编号和经纬度信息的数据;使用删除重复项命令,选择并删除重复用户编号的数据;使用数字筛选命令,选择并删除日期或者经纬度不在预设正常值范围内的异常数据。The method for identifying a living circle based on positioning data according to claim 1 or 2, characterized in that, the method for preprocessing the positioning data file in the step (1) is as follows: In EXCEL software, use search and select The positioning option in the function, select and delete the missing user number and latitude and longitude information; use the delete duplicate command to select and delete the data of the repeated user number; use the digital filter command to select and delete the date or the longitude and latitude are not in the preset normal value range Abnormal data within.
  5. 根据权利要求1或2所述的一种基于定位数据的生活圈识别方法,其特征在于,对步骤(2)得到的定位数据文件进行行为点识别,并将行为点划分为居住行为点M、工作行为点N和生活行为点集P的方法如下:The method for identifying a living circle based on positioning data according to claim 1 or 2, characterized in that the positioning data file obtained in step (2) is identified by behavior points, and the behavior points are divided into residential behavior points M, The methods of work behavior point N and life behavior point set P are as follows:
    (2.1)定义距离阈值和最短时间间隔阈值,设距离阈值为S米,最短时间间隔阈值为T 1分钟; (2.1) Define the distance threshold and the shortest time interval threshold, set the distance threshold to S meters, and the shortest time interval threshold to T 1 minute;
    (2.2)对定位数据中所有轨迹点按照时间序列排序,对轨迹点O n,判断其下一个轨迹点O n+1与轨迹点O n的距离是否小于S米,如小于S米,则作为O n的相邻点;持续以上操作,继续判断,至轨迹点O n+m,若其与轨迹点O n的距离大于或等于阈值S米,即搜索停止;计算轨迹点O n+m和轨迹点O n的时间间隔,如持续时间大于或等于预设时间T 1,则记录所有轨迹点O n到O n+m-1;如持续时间小于T 1,则O n到O n+m-1不满足要求,结果无效;转至下一个轨迹点,即轨迹点O n+1开始,重复以上步骤,直到找到符合条件的点,转到 步骤(2.3); (2.2) Sort all the trajectory points in the positioning data according to the time sequence. For the trajectory point O n , determine whether the distance between the next trajectory point O n+1 and the trajectory point O n is less than S meters. If it is less than S meters, it is taken as The adjacent point of O n ; continue the above operation, continue to judge, to the trajectory point O n+m , if the distance from the trajectory point O n is greater than or equal to the threshold S meters, the search stops; calculate the trajectory point O n+m and O n locus of points of time intervals, as a duration equal to or greater than the predetermined time T 1, the recording track of all the points O n O n + m-1; such a duration less than T 1, to the O n O n + m -1 does not meet the requirements, the result is invalid; go to the next trajectory point, starting from the trajectory point O n+1 , repeat the above steps until a point that meets the conditions is found, go to step (2.3);
    (2.3)将步骤(2.2)得到的所有轨迹点O n到O n+m-1聚类,生成行为点,其中行为点的经纬度为所有轨迹点O n到O n+m-1经纬度的均值,轨迹点O n被定位的时间t n为行为点的开始时间,轨迹点O n+m-1被定位时的时间t n+m-1作为行为点的结束时间; (2.3) Cluster all trajectory points O n to O n+m-1 obtained in step (2.2) to generate behavior points, where the latitude and longitude of the behavior point is the mean value of all trajectory points O n to O n+m-1 , The time t n when the track point O n is located is the start time of the action point, and the time t n+m-1 when the track point O n+m-1 is located is the end time of the action point;
    (2.4)从轨迹点O n+m开始,重复步骤2.2与步骤2.3,遍历所有轨迹点,识别出所有的行为点; (2.4) Starting from the track point On +m , repeat steps 2.2 and 2.3 to traverse all track points and identify all behavior points;
    (2.5)依次判断步骤(2.4)得到的行为点的开始时间和结束时间的区间是否与23:00-05:00区间有交集,记录上述所有有交集的行为点形成集合A1,并选择A1中的行为点的结束时间与开始时间的时间差最大值的行为点,作为居住行为点M;(2.5) Determine in turn whether the interval between the start time and end time of the action point obtained in step (2.4) has an intersection with the interval 23:00-05:00, record all the above-mentioned intersection action points to form a set A1, and select A1 The behavior point with the maximum time difference between the end time and the start time of the behavior point is taken as the residence behavior point M;
    (2.6)依次判断步骤(2.4)得到的行为点的开始时间和结束时间是否与08:00-17:00区间有交集,记录上述所有有交集的行为点形成集合A2,并选择A2中的行为点的结束时间与开始时间的时间差最大值的行为点,作为工作行为点N;(2.6) Determine in turn whether the start time and end time of the action points obtained in step (2.4) have an intersection with the 08:00-17:00 interval, record all the above-mentioned intersection action points to form a set A2, and select the behavior in A2 The action point with the maximum time difference between the end time and the start time of the point is regarded as the work action point N;
    (2.7)所有行为点文件中,剔除居住行为点M以及工作行为点N,得到生活行为点集P={p1,p2,p3,……,pn}。(2.7) In all the behavior point files, remove the residence behavior point M and the work behavior point N, and get the life behavior point set P={p1, p2, p3,..., pn}.
  6. 根据权利要求5所述的一种基于定位数据的生活圈识别方法,其特征在于,步骤(3)中,提取距居住行为点M和工作行为点N距离小于D值的生活行为点,并形成居住生活行为点集Q和工作生活行为点集S的具体方法如下:The method for identifying a living circle based on positioning data according to claim 5, characterized in that, in step (3), the living behavior point whose distance from the living behavior point M and the working behavior point N is less than the value of D is extracted, and forming The specific methods of living life behavior point set Q and work life behavior point set S are as follows:
    (3.1)将步骤(2.5)-(2.7)得到的行为点文件,分图层加载到ArcGIS软件中,形成居住行为点M、工作行为点N、生活行为点集P三个图层;(3.1) Load the behavior point files obtained in steps (2.5)-(2.7) into ArcGIS software in layers to form three layers of residential behavior point M, work behavior point N, and life behavior point set P;
    (3.2)在ArcGIS软件中,以点M和点N为圆心,画半径为D的圆形,为圆M图层和圆N图层;(3.2) In ArcGIS software, draw a circle with a radius of D with point M and point N as the center of the circle, which is a circle M layer and a circle N layer;
    (3.3)在ArcGIS软件中,提取落在圆M图层范围内的生活行为点集P中的点,导出图层为居住生活行为点集Q的SHP文件;(3.3) In ArcGIS software, extract the points in the life behavior point set P that fall within the circle M layer, and export the layer as the SHP file of the life behavior point set Q;
    (3.4)在ArcGIS软件中,提取落在圆N图层范围内的生活行为点集P中的点,导出图层为工作生活行为点集S的SHP文件。(3.4) In the ArcGIS software, extract the points in the life behavior point set P falling within the circle N layer, and export the layer as an SHP file of the work life behavior point set S.
  7. 根据权利要求6述的一种基于定位数据的生活圈识别方法,其特征在于,步骤(4)中,提取点M和点集Q形成最小边界几何面域,获取用户的居住生活圈面域Y的具体步骤如下:A method for identifying a living circle based on positioning data according to claim 6, characterized in that, in step (4), the point M and the point set Q are extracted to form a minimum boundary geometric area, and the user's living circle area Y is obtained The specific steps are as follows:
    (4.1)在ArcGIS软件中,将点M和点集Q合并,得到点集X;(4.1) In ArcGIS software, merge point M and point set Q to get point set X;
    (4.2)在ArcGIS软件中,使用“最小边界几何”命令,在输入要素中,输入点集 X,在地理选项中,选择“凸包(CONVEX HULL)”,输出面域Y,所形成的面域范围即为目标对象的居住生活圈范围。(4.2) In ArcGIS software, use the "minimum boundary geometry" command, in the input elements, input the point set X, in the geographic options, select "CONVEX HULL", output area Y, the formed area The domain scope is the living circle scope of the target object.
  8. 根据权利要求6述的一种基于定位数据的生活圈识别方法,其特征在于,步骤(4)中,提取点N和点集S形成最小边界几何面域,获取用户的工作生活圈面域L的具体步骤如下:A method for identifying a living circle based on positioning data according to claim 6, characterized in that, in step (4), the point N and the point set S are extracted to form a minimum boundary geometric area, and the user's work and life circle area L is obtained The specific steps are as follows:
    (4.3)在ArcGIS软件中,将点N和点集S合并,得到点集K;(4.3) In ArcGIS software, merge point N and point set S to get point set K;
    (4.4)在ArcGIS软件中,使用“最小边界几何”命令,在输入要素中,输入点集K,在地理选项中,选择“凸包(CONVEX HULL)”,输出面域L,所形成的面域范围即为目标对象的工作生活圈范围。(4.4) In the ArcGIS software, use the "minimum boundary geometry" command, in the input element, input the point set K, in the geographic options, select "CONVEX HULL", output the area L, the formed area The domain scope is the working life circle scope of the target object.
  9. 根据权利要求6或7所述的一种基于定位数据的生活圈识别方法,其特征在于,将居住区内每个对象的居住生活圈面域合并,即得到居住区所有对象的居住生活圈面域;将办公区每个对象的工作生活圈面域合并,即得到办公区所有对象的工作生活圈面域。The method for identifying the living circle based on positioning data according to claim 6 or 7, characterized in that the living circle area of each object in the residential area is merged to obtain the living circle area of all objects in the residential area Domain: Combine the work and life circle areas of each object in the office area to obtain the work and life circle areas of all objects in the office area.
PCT/CN2019/081820 2019-02-28 2019-04-08 Living circle identification method based on positioning data WO2020172954A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910149875.0 2019-02-28
CN201910149875.0A CN110032609B (en) 2019-02-28 2019-02-28 Living circle identification method based on positioning data

Publications (1)

Publication Number Publication Date
WO2020172954A1 true WO2020172954A1 (en) 2020-09-03

Family

ID=67235723

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/081820 WO2020172954A1 (en) 2019-02-28 2019-04-08 Living circle identification method based on positioning data

Country Status (2)

Country Link
CN (1) CN110032609B (en)
WO (1) WO2020172954A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113298301A (en) * 2021-05-17 2021-08-24 东南大学 Urban service facility evaluation and optimization method based on crowd digital portrait
CN115687549A (en) * 2022-09-28 2023-02-03 广州市城市规划设计有限公司 Quick and efficient life circle defining method, device, equipment and medium
CN117076744A (en) * 2023-10-13 2023-11-17 广州市城市规划勘测设计研究院 Barrier-free facility construction site selection method and device

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110716925B (en) * 2019-09-05 2023-08-04 中电科大数据研究院有限公司 Cross-border behavior recognition method based on track analysis
CN110972073B (en) * 2019-11-14 2021-03-02 常州市规划设计院 Living circle defining method based on mobile phone big data and walking accessibility
CN111966769B (en) * 2020-07-14 2024-01-02 北京城市象限科技有限公司 Method, device, equipment and medium for recommending information based on life circle
CN112087716B (en) * 2020-08-17 2022-06-07 同济大学 Community life circle boundary measuring method, system, computer equipment and storage medium
CN113127594B (en) * 2021-06-17 2021-09-03 脉策(上海)智能科技有限公司 Method, computing device and storage medium for determining grouping data of geographic area
CN113421176B (en) * 2021-07-16 2022-11-01 昆明学院 Intelligent screening method for abnormal data in student score scores
CN113642928B (en) * 2021-08-31 2022-04-01 广州市城市规划勘测设计研究院 Planning simulation measurement and evaluation method applied to city updating community life circle

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102053976A (en) * 2009-10-30 2011-05-11 财团法人工业技术研究院 Mobile device user matching system and method
CN102595323A (en) * 2012-03-20 2012-07-18 北京交通发展研究中心 Method for obtaining resident travel characteristic parameter based on mobile phone positioning data
CN105608153A (en) * 2015-12-18 2016-05-25 晶赞广告(上海)有限公司 Universal POI information association method
CN106384120A (en) * 2016-08-29 2017-02-08 深圳先进技术研究院 Mobile phone positioning data based resident activity pattern mining method and device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8195393B2 (en) * 2009-06-30 2012-06-05 Apple Inc. Analyzing and consolidating track file data
CN105142106B (en) * 2015-07-29 2019-03-26 西南交通大学 The identification of traveler duty residence and Trip chain depicting method based on mobile phone signaling data
CN106570184B (en) * 2016-11-11 2020-08-14 同济大学 Method for extracting recreation-living contact data set from mobile phone signaling data
CN106600960B (en) * 2016-12-22 2020-01-24 西南交通大学 Travel origin-destination point identification method based on space-time clustering analysis algorithm
CN108810818B (en) * 2018-05-24 2020-10-23 深圳市数字城市工程研究中心 Method and device for dividing residential life circle

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102053976A (en) * 2009-10-30 2011-05-11 财团法人工业技术研究院 Mobile device user matching system and method
CN102595323A (en) * 2012-03-20 2012-07-18 北京交通发展研究中心 Method for obtaining resident travel characteristic parameter based on mobile phone positioning data
CN105608153A (en) * 2015-12-18 2016-05-25 晶赞广告(上海)有限公司 Universal POI information association method
CN106384120A (en) * 2016-08-29 2017-02-08 深圳先进技术研究院 Mobile phone positioning data based resident activity pattern mining method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SUN, DAOSHENG; CHAI, YANWEI; ZHANG, YAN: "The Definition and Measurement of Community Life Circle:A Case Study of Qinghe Area in Beijing", URBAN DEVELOPMENT STUDIES, vol. 23, no. 9, 26 September 2016 (2016-09-26), pages 1 - 9, XP009522847, ISSN: 1006-3862 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113298301A (en) * 2021-05-17 2021-08-24 东南大学 Urban service facility evaluation and optimization method based on crowd digital portrait
CN113298301B (en) * 2021-05-17 2024-04-30 东南大学 Urban service facility evaluation and optimization method based on crowd digital portraits
CN115687549A (en) * 2022-09-28 2023-02-03 广州市城市规划设计有限公司 Quick and efficient life circle defining method, device, equipment and medium
CN115687549B (en) * 2022-09-28 2024-04-02 广州市城市规划设计有限公司 Rapid and efficient living circle demarcation method, device, equipment and medium
CN117076744A (en) * 2023-10-13 2023-11-17 广州市城市规划勘测设计研究院 Barrier-free facility construction site selection method and device
CN117076744B (en) * 2023-10-13 2024-02-27 广州市城市规划勘测设计研究院 Barrier-free facility construction site selection method and device

Also Published As

Publication number Publication date
CN110032609B (en) 2023-09-12
CN110032609A (en) 2019-07-19

Similar Documents

Publication Publication Date Title
WO2020172954A1 (en) Living circle identification method based on positioning data
CN107134142B (en) A kind of urban road method for predicting based on multisource data fusion
Badland et al. Understanding the Relationship between Activity and Neighbourhoods (URBAN) Study: research design and methodology
CN108446470B (en) Medical facility accessibility analysis method based on vehicle trajectory data and population distribution
Kanuganti et al. Quantifying accessibility to health care using Two-step Floating Catchment Area Method (2SFCA): A case study in Rajasthan
CN106651027B (en) Internet regular bus route optimization method based on social network
Pflügler et al. Predicting the availability of parking spaces with publicly available data
Gao et al. Identification of urban regions’ functions in Chengdu, China, based on vehicle trajectory data
Yang et al. A data-driven travel mode share estimation framework based on mobile device location data
CN110413855B (en) Region entrance and exit dynamic extraction method based on taxi boarding point
Ji et al. Non-linear effects of street patterns and land use on the bike-share usage
Li et al. Spatial accessibility to shopping malls in Nanjing, China: Comparative analysis with multiple transportation modes
Wang et al. Taxi passenger hot spot mining based on a refined K-means++ algorithm
Sun et al. A data-driven method for measuring accessibility to healthcare using the spatial interpolation model
Pierdicca et al. Identifying the use of a park based on clusters of visitors' movements from mobile phone data
Lu et al. Does density promote non-motorized travel? The effects of individual space–time activity patterns
Yang et al. Unraveling nonlinear and interaction effects of multilevel built environment features on outdoor jogging with explainable machine learning
Tunçer et al. Informed Design Platform
CN117076744B (en) Barrier-free facility construction site selection method and device
Zong et al. Feedback-based coarse time-granularity POI recommendation and itinerary planning
Seo et al. Strategy for introducing autonomous driving mobility service in transportation vulnerable areas: A case study of gyeonggi province in Republic of Korea
CN113343781B (en) City functional area identification method using remote sensing data and taxi track data
Yuting et al. A geospatial analysis of tweets during post-circuit breaker in singapore
Li et al. Analysis and research on the use situation of public bicycles based on spark machine learning
Kong The Spatial Equity of Dockless Micromobility Sharing Systems in Calgary, Canada

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19916737

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19916737

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 19916737

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 18.03.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19916737

Country of ref document: EP

Kind code of ref document: A1