WO2023201938A1 - Missing trajectory filling method and system - Google Patents

Missing trajectory filling method and system Download PDF

Info

Publication number
WO2023201938A1
WO2023201938A1 PCT/CN2022/112691 CN2022112691W WO2023201938A1 WO 2023201938 A1 WO2023201938 A1 WO 2023201938A1 CN 2022112691 W CN2022112691 W CN 2022112691W WO 2023201938 A1 WO2023201938 A1 WO 2023201938A1
Authority
WO
WIPO (PCT)
Prior art keywords
trajectory
sub
distance
trajectories
speed
Prior art date
Application number
PCT/CN2022/112691
Other languages
French (fr)
Chinese (zh)
Inventor
亓晋
陈湘婷
谭雨恬
陈欣冉
孙雁飞
许斌
陆音
Original Assignee
南京邮电大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京邮电大学 filed Critical 南京邮电大学
Publication of WO2023201938A1 publication Critical patent/WO2023201938A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification

Definitions

  • the invention belongs to the technical field of trajectory data processing, and in particular relates to a method and system for filling missing trajectories.
  • trajectory data of a large number of moving objects is showing a rapid growth trend.
  • the study of trajectory data can obtain unknown knowledge about the movement of objects. It is called future research hotspot and application growth point.
  • vehicle trajectory data can be used to improve transportation networks, such as using trajectory data to generate road information.
  • Vehicle trajectory data can also be used for resource allocation, such as predicting vehicle demand in a certain area, so that platforms like Didi can schedule idle vehicles in advance.
  • Vehicle trajectory data can also be used for traffic analysis, such as using trajectory data to discover congested road sections or detect damaged road sections.
  • the GPS (Global Positioning System) receiving module is a common way to obtain trajectory data.
  • any vehicle trajectory can be obtained.
  • the vehicle trajectory data obtained by the base station and upper-layer data management center may be incomplete, and these incomplete trajectory data are processed directly. It will inevitably affect the accuracy of results and the reliability of decision-making. Therefore, post-stage data processing and knowledge discovery are crucial to recover missing trajectory data.
  • the present invention provides a missing trajectory filling method and system, which can complete the missing trajectory data.
  • the first aspect provides a missing trajectory filling method, including:
  • Clustering is used to fill in the missing trajectories based on the comprehensive distance.
  • dividing the original trajectory into several sub-trajectories includes:
  • x ij , y ij , v ij and t ij are respectively the abscissa and ordinate coordinates of the j-th recording point P ij of the i-th sub-trajectory. , speed and recording time;
  • calculating the time distance, space distance and speed distance between each sub-trajectory includes:
  • t i represents the intermediate time of the i-th sub-trajectory st i
  • t j represents the intermediate time of the j-th sub-trajectory st j ;
  • the calculation of the weights of time distance, space distance and speed distance includes:
  • the time distance, spatial distance and velocity distance between each sub-trajectory and other sub-trajectories are used as samples to perform comprehensive weighting calculations, including the following steps:
  • r ik is the correlation coefficient between the i-th data object and the k-th data object
  • P k is the proportion of the indicator value of the k-th data object
  • further obtaining the comprehensive distance between each sub-trajectory includes:
  • dist (st i ,st j ) ⁇ dist t (st i ,st j )+ ⁇ dist s (st i ,st j )+ ⁇ dist v (st i ,st j ) (10)
  • ⁇ , ⁇ , and ⁇ are the weight values of time distance, space distance, and speed distance respectively.
  • ⁇ k and ⁇ correspond to each other.
  • ⁇ k and ⁇ corresponds to ⁇ k and ⁇ when the k-th data object is a spatial distance.
  • using clustering according to the comprehensive distance to fill the missing trajectories includes:
  • c is the number of cluster centers
  • n is the number of sub-trajectories
  • u ij is the membership degree of the j-th sub-trajectory to the i-th sub-trajectory
  • st i represents the i-th sub-trajectory
  • d ij is the distance between the i-th sub-trajectory and the j-th sub-trajectory;
  • the second aspect provides a missing trajectory filling system, including:
  • Data acquisition module used to obtain original trajectories
  • the trajectory division module is used to divide the original trajectory into several sub-trajectories
  • the trajectory filling module is used to fill in missing trajectories using clustering based on comprehensive distance.
  • the present invention considers that in the concept of space and time, two trajectories passing through the same position at different speeds at the same time should be different, so the Euclidean distance cannot be simply used to measure similarity.
  • New The inter-trajectory distance measurement method provides a processing process for the three dimensions of trajectory data: time, space and speed, which improves the accuracy of the final result. It introduces weight parameters to coordinate the influence of each component on the clustering result, making it more accurate.
  • the present invention comprehensively considers the correlation, difference, discreteness and other properties of data, and creates a method of automatically adjusting weight parameters, which solves the problem of low efficiency of manual adjustment of weight parameters, and at the same time makes the calculation of weight parameters more efficient. Scientifically valid.
  • This invention takes into account the uncertainty that a sub-trajectory may belong to multiple trajectory clusters, and uses a fuzzy logic-based clustering method to perform flexible fuzzy division of trajectory data. This method can more realistically and effectively measure the trajectory data according to different application scenarios. The degree of similarity between different sub-trajectories, thereby improving the accuracy of trajectory data filling.
  • Figure 1 is a flow chart of the present invention
  • Figure 2 is a flow chart of the original trajectory in the present invention
  • Figure 3 is a schematic diagram of sub-trajectory segments after the original trajectory is divided according to the present invention.
  • Figure 4 is a schematic diagram of the time distance of neutron trajectories in the case of time separation according to the present invention.
  • Figure 5 is a schematic diagram of the time distance of neutron trajectories in the case of time intersection according to the present invention.
  • Figure 6a is a schematic diagram of the spatial distance from neutron trajectory st i to st j in the present invention
  • Figure 6b is a schematic diagram of the spatial distance from neutron trajectory st j to st i in the present invention
  • Figure 7 is a schematic diagram of the weights of time, space and speed distance in the process of calculating the comprehensive distance of the sub-trajectory in the present invention.
  • the present invention discloses a missing trajectory filling method, which includes the following steps:
  • Step 1 Obtain the original trajectory
  • Step 2 Divide sub-trajectories
  • Finding the sub-trajectory is mainly to find the starting point and the ending point of the sub-trajectory. These two points are both characteristic points. Once these two are determined, the sub-trajectory is determined. You only need to add the starting point and the ending point and the distance between them. The complete sub-trajectory can be obtained by connecting the common recording points in series.
  • Step 3 Calculate the time distance, space distance and speed distance between sub-trajectories
  • the calculation process of the three distances mainly includes the following steps:
  • t i represents the intermediate time of the i-th sub-trajectory st i
  • t j represents the intermediate time of the j-th sub-trajectory st j ;
  • sub-trajectory st i ⁇ P i0 ,P i1 ,...,P iq ⁇
  • sub-trajectory st j ⁇ P j0 ,P j1 ,...,P jw ⁇
  • each element in st i and st j is the recording point of the sub-track.
  • the first element and the last element are also the feature points selected in step 2.
  • the feature points are first the recording points, and q is the recording point of st i .
  • the number of , w is the number of recording points in st j
  • the minimum distance from any recording point in sub-track st i to sub-track st j is taken as the distance from the recording point to st j .
  • the minimum distance from any recording point in sub-track st j to sub-track st i is taken as the distance from the recording point to st i .
  • the average speed of all recording points in the sub-trajectory is taken as the speed of the sub-trajectory segment, and the absolute value of the speed difference between the sub-trajectory segments to be compared is taken as the speed distance between the sub-trajectories, as shown in Equation (3),
  • Step 4 Find the weights of various distances
  • N i ⁇ dist t (st i ,st 0 ),dist s (st i ,st 0 ),dist v (st i ,st 0 ) ⁇ ,...,
  • the improved CRITIC algorithm and entropy weight method are used to perform comprehensive weighting calculations on the position, time, and velocity components of each sub-trajectory data sample. Assume that there are u data objects x 1 , x 2 ,...x i ,...x u in the sample space. According to formulas (4) and (5), the k-th data object in the entire sample space containing u data objects is obtained. Difference coefficient Sk
  • r ik is the correlation coefficient between the i-th data object and the k-th data object
  • P k is the proportion of the indicator value of the k-th data object
  • Step 5 Find the comprehensive distance
  • the traditional similarity measurement method between trajectories does not take into account the impact of multiple attribute dimensions of trajectory data on trajectory data quantification at the same time.
  • ⁇ , ⁇ , and ⁇ are the weight values of time distance, space distance, and speed distance respectively.
  • ⁇ k and ⁇ correspond to each other.
  • ⁇ k and ⁇ corresponds to ⁇ k and ⁇ when the k-th data object is a spatial distance.
  • Step 6 Fill in missing tracks
  • the membership matrix represents the degree to which each sub-trajectory belongs to each cluster.
  • c is the number of cluster centers
  • n is the number of sub-trajectories
  • u ij is the membership degree of the j-th sub-trajectory to the i-th sub-trajectory
  • st i represents the i-th sub-trajectory
  • d ij is the distance between the i-th sub-trajectory and the j-th sub-trajectory;
  • the present invention also provides a missing trajectory filling system, including:
  • Data acquisition module used to obtain original trajectories
  • the trajectory division module is used to divide the original trajectory into several sub-trajectories
  • the trajectory filling module is used to fill in missing trajectories using clustering based on comprehensive distance.
  • embodiments of the present invention may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment that combines software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
  • computer-usable storage media including, but not limited to, disk storage, CD-ROM, optical storage, etc.
  • These computer program instructions may also be stored in a computer-readable memory that causes a computer or other programmable data processing apparatus to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction means, the instructions
  • the device implements the functions specified in a process or processes of the flowchart and/or a block or blocks of the block diagram.
  • These computer program instructions may also be loaded onto a computer or other programmable data processing device, causing a series of operating steps to be performed on the computer or other programmable device to produce computer-implemented processing, thereby executing on the computer or other programmable device.
  • Instructions provide steps for implementing the functions specified in a process or processes of a flowchart diagram and/or a block or blocks of a block diagram.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Remote Sensing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Radar Systems Or Details Thereof (AREA)

Abstract

A missing trajectory filling method, comprising: acquiring an original trajectory; dividing the original trajectory into a plurality of sub-trajectories; calculating a time distance, a space distance and a velocity distance between the sub-trajectories; calculating weights of the time distance, the space distance and the velocity distance; acquiring a comprehensive distance between the sub-trajectories according to the weights of the time distance, the space distance and the velocity distance; and according to the comprehensive distance, filling a missing trajectory by means of clustering. By using a clustering method based on a fuzzy logic, flexible fuzzy division is performed on trajectory data, such that the degree of similarity between different sub-trajectories can be measured more truly and effectively according to different application scenarios, thereby improving the accuracy of trajectory data filling.

Description

缺失轨迹填补方法及系统Missing trajectory filling method and system 技术领域Technical field
本发明属于轨迹数据处理技术领域,尤其涉及一种缺失轨迹填补方法及系统。The invention belongs to the technical field of trajectory data processing, and in particular relates to a method and system for filling missing trajectories.
背景技术Background technique
随着社会的发展与科学技术的进步,以及卫星网络、无线网络、和定位设备的发展,大量移动物体的轨迹数据呈急速增长的趋势,对轨迹数据的研究,获取物体有关运动的未知知识,称为未来研究热点和应用增长点。常见的轨迹分为四种,分别是人类轨迹、车辆轨迹。动物轨迹和自然现象轨迹。例如,车辆轨迹数据可以用来改善交通运输网络,例如利用轨迹数据来生成道路信息。车辆轨迹数据也可以用来资源分配,例如预测某个区域的车辆需求,这样类似滴滴这类平台可以提前调度空闲车辆。车辆轨迹数据还可以用来交通分析,例如用轨迹数据来发掘拥堵路段或者检测道路损毁路段等。With the development of society and the advancement of science and technology, as well as the development of satellite networks, wireless networks, and positioning equipment, the trajectory data of a large number of moving objects is showing a rapid growth trend. The study of trajectory data can obtain unknown knowledge about the movement of objects. It is called future research hotspot and application growth point. There are four common trajectories, namely human trajectories and vehicle trajectories. Animal tracks and natural phenomenon tracks. For example, vehicle trajectory data can be used to improve transportation networks, such as using trajectory data to generate road information. Vehicle trajectory data can also be used for resource allocation, such as predicting vehicle demand in a certain area, so that platforms like Didi can schedule idle vehicles in advance. Vehicle trajectory data can also be used for traffic analysis, such as using trajectory data to discover congested road sections or detect damaged road sections.
GPS(Global Positioning System,全球定位系统)接收模块是获取轨迹数据是常用方式,智能车路协同系统中,理论上,可以获得任意车辆行驶轨迹。而实际中,由于车载设备和路侧单元的故障、无线传输的丢失等诸多原因,基站和上层数据管理中心获得的车辆轨迹数据有可能是不完整的,直接对这些不完整的轨迹数据进行处理势必会影响结果的准确性和决策的可靠性。因此,恢复缺失的轨迹数据后期的数据处理和知识发现至关重要。The GPS (Global Positioning System) receiving module is a common way to obtain trajectory data. In the intelligent vehicle-road collaboration system, in theory, any vehicle trajectory can be obtained. In practice, due to many reasons such as failures of on-board equipment and roadside units, loss of wireless transmission, etc., the vehicle trajectory data obtained by the base station and upper-layer data management center may be incomplete, and these incomplete trajectory data are processed directly. It will inevitably affect the accuracy of results and the reliability of decision-making. Therefore, post-stage data processing and knowledge discovery are crucial to recover missing trajectory data.
发明内容Contents of the invention
为了解决现有技术存在的问题,本发明提供一种缺失轨迹填补方法及系统,能够对缺失的轨迹数据进行补全。In order to solve the problems existing in the existing technology, the present invention provides a missing trajectory filling method and system, which can complete the missing trajectory data.
本发明所要解决的技术问题是通过以下技术方案实现的:The technical problems to be solved by the present invention are achieved through the following technical solutions:
第一方面,提供了一种缺失轨迹填补方法,包括:The first aspect provides a missing trajectory filling method, including:
获取原始轨迹;Get the original trajectory;
将原始轨迹划分为若干子轨迹;Divide the original trajectory into several sub-trajectories;
计算各子轨迹之间的时间距离、空间距离以及速度距离;Calculate the time distance, space distance and speed distance between each sub-trajectory;
计算时间距离、空间距离以及速度距离所占权重;Calculate the weight of time distance, space distance and speed distance;
根据时间距离、空间距离以及速度距离所占权重获取各子轨迹之间的综合距离;Obtain the comprehensive distance between each sub-trajectory based on the weight of time distance, space distance and speed distance;
根据综合距离采用聚类得填补缺失的轨迹。Clustering is used to fill in the missing trajectories based on the comprehensive distance.
结合第一方面,进一步的,所述将原始轨迹划分为若干子轨迹包括:Combined with the first aspect, further, dividing the original trajectory into several sub-trajectories includes:
2-1)逐个扫描原始轨迹中的每个记录点,生成原始轨迹集T={P 0,...P i,...,P n},其中, P i=(x i,y i,v i,t i),x i,y i、v i、t i分别为轨迹集中第i个记录点P i的横坐标、纵坐标,速度,以及记录时间,设子轨迹的起始点P start=P 0,当前记录点P now=P 12-1) Scan each recording point in the original trajectory one by one to generate the original trajectory set T={P 0 ,...P i ,...,P n }, where P i =(xi , y i ,v i ,t i ), x i , y i , vi , and t i are respectively the abscissa, ordinate, speed, and recording time of the i-th recording point Pi in the trajectory set. Let the starting point P of the sub-trajectory be start =P 0 , current recording point P now =P 1 ;
2-2)设子轨迹集为STC,STC={st 0,...st i,...,st n},st i={P i0,...,P ij,...,P in},P ij={x ij,y ij,v ij,t ij},i=0...n,并将STC、st i以及P ij中元素全部初始化为0,其中st i为第i条子轨迹,P i0、P in分别为子轨迹st i的起点和终点,x ij、y ij、v ij以及t ij分别为第i条子轨迹第j个记录点P ij的的横坐标、纵坐标、速度以及记录时间; 2-2) Let the sub-trajectory set be STC, STC={st 0 ,...st i ,...,st n }, st i ={P i0 ,...,P ij ,...,P in }, P ij = {x ij ,y ij ,v ij ,t ij }, i=0...n, and initialize all elements in STC, st i and P ij to 0, where st i is the i-th For the sub-trajectory, P i0 and P in are respectively the starting point and the end point of the sub-trajectory st i . x ij , y ij , v ij and t ij are respectively the abscissa and ordinate coordinates of the j-th recording point P ij of the i-th sub-trajectory. , speed and recording time;
1-3)将当前记录点的方向变化量定义为Δd、子轨迹的方向累计变化量定义为Δd +,将它们初始值均设置为0,d now、d start分别为当前记录点的方向和子轨迹起始点的方向; 1-3) Define the direction change amount of the current recording point as Δd, and the cumulative direction change amount of the sub-track as Δd + . Set their initial values to 0, d now and d start are the direction and sub-direction of the current recording point respectively. The direction of the trajectory starting point;
2-4)将当前记录点的平均速率变化量定义为
Figure PCTCN2022112691-appb-000001
初始值设置为0;
2-4) Define the average rate change of the current recording point as
Figure PCTCN2022112691-appb-000001
The initial value is set to 0;
2-5)若P start≠P n,计算方向变化量Δd的绝对值,|Δd|=|d now-d start|,否则结束特征点判定; 2-5) If P start ≠P n , calculate the absolute value of the direction change Δd, |Δd|=|d now -d start |, otherwise end the feature point determination;
2-6)计算当前记录点的方向累计变化量Δd +的绝对值,|Δd +|=|Δd+Δd +(old)|,Δd +(old)为上一个记录点的方向累计变化量; 2-6) Calculate the absolute value of the cumulative change in direction Δd + of the current recording point, |Δd + | = |Δd + Δd + (old) |, Δd + (old) is the cumulative change in direction of the previous recording point;
2-7)计算平均速率变化量
Figure PCTCN2022112691-appb-000002
2-7) Calculate the average rate change
Figure PCTCN2022112691-appb-000002
2-8)若得到的当前记录点的方向变化量的绝对值|Δd|或者方向累计变化量绝对值|Δd +|大于所设定方向门限值d,则将当前记录点认定为候选的特征点,令P end=P now,并将该子轨迹st i={P start,...P end}添加到子轨迹集STC中,将Δd、Δd +
Figure PCTCN2022112691-appb-000003
重新赋值为0,再令P start=P end,跳转至步骤2-5)继续下一个点的判定,P end为当前子轨迹的结尾点;
2-8) If the obtained absolute value |Δd| of the direction change of the current recording point or the absolute value of the cumulative direction change |Δd + | is greater than the set direction threshold d, the current recording point will be identified as a candidate Feature points, let P end =P now , and add the sub-trajectory st i ={P start ,...P end } to the sub-trajectory set STC, add Δd, Δd + ,
Figure PCTCN2022112691-appb-000003
Reassign the value to 0, then set P start = P end , jump to step 2-5) to continue the determination of the next point, P end is the end point of the current sub-trajectory;
若得到的方向变化量的绝对值|Δd|和方向累计变化量绝对值|Δd +|小于等于所设定方向门限值d,则比较计算出的平均速率变化量
Figure PCTCN2022112691-appb-000004
与速度门限值v的大小,若
Figure PCTCN2022112691-appb-000005
超出速度门限值v,则该记录点被认为是一个候选的特征点,令P end=P now,将子轨迹st i={P start,...,P end}添加到子轨迹集STC中,跳转至步骤2-5)继续下一个记录点的判定;
If the obtained absolute value of direction change |Δd| and the absolute value of cumulative direction change |Δd + | are less than or equal to the set direction threshold value d, then compare the calculated average velocity change
Figure PCTCN2022112691-appb-000004
and the speed threshold value v, if
Figure PCTCN2022112691-appb-000005
exceeds the speed threshold v, then the record point is considered a candidate feature point, let P end =P now , and add the sub-trajectory st i ={P start ,...,P end } to the sub-trajectory set STC , jump to step 2-5) to continue judging the next recording point;
若得到的方向变化量的绝对值|Δd|和方向累计变化量绝对值|Δd +|小于等于所设定方向门限值d,且平均速率变化量
Figure PCTCN2022112691-appb-000006
小于速度门限值v,则令P now=P now+1,跳转至步骤2-5)继续 进行下一个记录点的判断。
If the obtained absolute value of direction change |Δd| and the absolute value of cumulative direction change |Δd + | are less than or equal to the set direction threshold value d, and the average velocity change
Figure PCTCN2022112691-appb-000006
If it is less than the speed threshold value v, let P now = P now + 1 and jump to step 2-5) to continue judging the next recording point.
结合第一方面,进一步的,所述计算各子轨迹之间的时间距离、空间距离以及速度距离包括:Combined with the first aspect, further, calculating the time distance, space distance and speed distance between each sub-trajectory includes:
根据式(1)求取两个子轨迹之间的时间距离Calculate the time distance between two sub-trajectories according to equation (1)
dist t(st i,st j)=|t i-t j|  (1); dist t (st i ,st j )=|t i -t j | (1);
其中,t i表示第i条子轨迹st i的中间时间,t j表示第第j条子轨迹st j的中间时间; Among them, t i represents the intermediate time of the i-th sub-trajectory st i , and t j represents the intermediate time of the j-th sub-trajectory st j ;
根据式(2)求取两个子轨迹之间的空间距离Calculate the spatial distance between two sub-trajectories according to equation (2)
Figure PCTCN2022112691-appb-000007
Figure PCTCN2022112691-appb-000007
其中,
Figure PCTCN2022112691-appb-000008
为子轨迹st i到子轨迹st j的空间距离,
Figure PCTCN2022112691-appb-000009
为子轨迹st j到子轨迹st i的空间距离;
in,
Figure PCTCN2022112691-appb-000008
is the spatial distance from sub-trajectory st i to sub-trajectory st j ,
Figure PCTCN2022112691-appb-000009
is the spatial distance from sub-trajectory st j to sub-trajectory st i ;
根据式(3)求取两个子轨迹之间的速度距离Calculate the speed distance between the two sub-trajectories according to equation (3)
Figure PCTCN2022112691-appb-000010
Figure PCTCN2022112691-appb-000010
其中,
Figure PCTCN2022112691-appb-000011
为子轨迹st i的平均速度,
Figure PCTCN2022112691-appb-000012
为子轨迹st j的平均速度。
in,
Figure PCTCN2022112691-appb-000011
is the average speed of sub-trajectory st i ,
Figure PCTCN2022112691-appb-000012
is the average speed of sub-trajectory st j .
结合第一方面,进一步的,所述计算时间距离、空间距离以及速度距离所占权重包括:Combined with the first aspect, further, the calculation of the weights of time distance, space distance and speed distance includes:
将每条子轨迹与其他子轨迹的时间距离、空间距以及速度距离作为样本进行综合赋权计算,包括如下步骤:The time distance, spatial distance and velocity distance between each sub-trajectory and other sub-trajectories are used as samples to perform comprehensive weighting calculations, including the following steps:
假设样本空间中有u个数据对象x 1,x 2,…x i,…x u,根据下式得到第k个数据对象在含u个数据对象的整个样本空间中的差异系数S kAssume that there are u data objects x 1 , x 2 ,...x i ,...x u in the sample space. According to the following formula, the difference coefficient S k of the k-th data object in the entire sample space containing u data objects is obtained;
Figure PCTCN2022112691-appb-000013
Figure PCTCN2022112691-appb-000013
Figure PCTCN2022112691-appb-000014
Figure PCTCN2022112691-appb-000014
其中,
Figure PCTCN2022112691-appb-000015
为第k个数据对象在整个样本空间的平均值;
in,
Figure PCTCN2022112691-appb-000015
is the average value of the kth data object in the entire sample space;
根据式(6)得到第k个数据对象在含u个数据对象的整个样本空间中的不相关系数R k According to equation (6), the uncorrelated coefficient R k of the k-th data object in the entire sample space containing u data objects is obtained
Figure PCTCN2022112691-appb-000016
Figure PCTCN2022112691-appb-000016
其中,r ik为第i个数据对象与第k个数据对象的相关系数; Among them, r ik is the correlation coefficient between the i-th data object and the k-th data object;
根据式(7)、(8)得到第k个数据对象在含u个数据对象的整个样本空间中的信息熵E k According to equations (7) and (8), the information entropy E k of the k-th data object in the entire sample space containing u data objects is obtained
Figure PCTCN2022112691-appb-000017
Figure PCTCN2022112691-appb-000017
Figure PCTCN2022112691-appb-000018
Figure PCTCN2022112691-appb-000018
其中,P k为第k个数据对象的指标值的比重; Among them, P k is the proportion of the indicator value of the k-th data object;
根据式(9)得到第k个数据对象在含u个数据对象的整个样本空间中的权重ω k According to equation (9), the weight ω k of the k-th data object in the entire sample space containing u data objects is obtained
Figure PCTCN2022112691-appb-000019
Figure PCTCN2022112691-appb-000019
结合第一方面,进一步的,获取各子轨迹之间的综合距离包括:Combined with the first aspect, further obtaining the comprehensive distance between each sub-trajectory includes:
通过式(10)得到各子轨迹之间的综合距离The comprehensive distance between each sub-trajectory is obtained through Equation (10)
dist(st i,st j)=α×dist t(st i,st j)+β×dist s(st i,st j)+γ×dist v(st i,st j)  (10) dist (st i ,st j )=α×dist t (st i ,st j )+β×dist s (st i ,st j )+γ×dist v (st i ,st j ) (10)
其中,α、β、γ分别为时间距离、空间距离以及速度距离的权重值,当第k个数据对象为时间距离时ω k和α对应,当第k个数据对象为空间距离时ω k和β对应,当第k个数据对象为空间距离时ω k和γ对应。 Among them, α, β, and γ are the weight values of time distance, space distance, and speed distance respectively. When the k-th data object is time distance, ω k and α correspond to each other. When the k-th data object is space distance, ω k and β corresponds to ω k and γ when the k-th data object is a spatial distance.
结合第一方面,进一步的,所述根据综合距离采用聚类得填补缺失的轨迹包括:Combined with the first aspect, further, using clustering according to the comprehensive distance to fill the missing trajectories includes:
5-1)初始化隶属度矩阵,用取值范围在[0,1]之内的随机数初始化隶属度矩阵U,并且使该矩阵中所有元素的总和等于1,隶属度矩阵表示每条子轨迹隶属于每个簇的程度;5-1) Initialize the membership matrix, initialize the membership matrix U with a random number within the value range [0,1], and make the sum of all elements in the matrix equal to 1. The membership matrix represents the membership of each sub-trajectory to the extent of each cluster;
5-2)为每个簇选定核心轨迹,即根据式(11)计算模糊C均值聚类的聚类中心c i,i=1,…,c; 5-2) Select the core trajectory for each cluster, that is, calculate the cluster center c i of fuzzy C-means clustering according to Equation (11), i=1,...,c;
Figure PCTCN2022112691-appb-000020
Figure PCTCN2022112691-appb-000020
其中,c为聚类中心的数量,n为子轨迹的数量,u ij为第j条子轨迹隶属于第i条子轨迹的隶属度,st i表示第i条子轨迹; Among them, c is the number of cluster centers, n is the number of sub-trajectories, u ij is the membership degree of the j-th sub-trajectory to the i-th sub-trajectory, st i represents the i-th sub-trajectory;
5-3)更新隶属度矩阵,即根据式(12)重新计算第j条子轨迹隶属于第i条子轨迹的隶属度;5-3) Update the membership matrix, that is, recalculate the membership degree of the j-th sub-trajectory to the i-th sub-trajectory according to equation (12);
Figure PCTCN2022112691-appb-000021
Figure PCTCN2022112691-appb-000021
其中,d ij是第i条子轨迹和第j条子轨迹之间的距离; Among them, d ij is the distance between the i-th sub-trajectory and the j-th sub-trajectory;
5-4)计算代价函数的值,即根据式(13)计算代价函数的值J;5-4) Calculate the value of the cost function, that is, calculate the value J of the cost function according to equation (13);
Figure PCTCN2022112691-appb-000022
Figure PCTCN2022112691-appb-000022
若J小于代价阀值或者相对于上次计算的代价函数J的变化量小于变化量阀值,则结束计算,否则返回步骤5-2);If J is less than the cost threshold or the change relative to the last calculated cost function J is less than the change threshold, end the calculation, otherwise return to step 5-2);
5-5)轨迹填补,即用隶属矩阵作为权重乘以每个簇的聚类中心,得到最终填充的轨迹。5-5) Trajectory filling, that is, using the membership matrix as a weight to multiply the cluster center of each cluster to obtain the final filled trajectory.
第二方面,提供了一种缺失轨迹填补系统,包括:The second aspect provides a missing trajectory filling system, including:
数据获取模块,用于获取原始轨迹;Data acquisition module, used to obtain original trajectories;
轨迹划分模块,用于将原始轨迹划分为若干子轨迹;The trajectory division module is used to divide the original trajectory into several sub-trajectories;
综合距离计算模块,用于计算各子轨迹之间的时间距离、空间距离以及速度距离;Comprehensive distance calculation module, used to calculate the time distance, space distance and speed distance between each sub-trajectory;
计算时间距离、空间距离以及速度距离所占权重;Calculate the weight of time distance, space distance and speed distance;
根据时间距离、空间距离以及速度距离所占权重获取各子轨迹之间的综合距离;Obtain the comprehensive distance between each sub-trajectory based on the weight of time distance, space distance and speed distance;
轨迹填补模块,用于根据综合距离采用聚类得填补缺失的轨迹。The trajectory filling module is used to fill in missing trajectories using clustering based on comprehensive distance.
本发明有益效果:1.本发明考虑在时空观念中,以不同速度在同一时间通过同一位置的两条轨迹应是不同的,所以不能简单的采用欧几里德距离来进行相似度度量,新的轨迹间距离度量方法提供了对轨迹数据时间、空间和速度三个维度的处理过程,提高了最终结果的精准性,引入权重参数来协调各分量对聚类结果的影响程度,更加精准。Beneficial effects of the present invention: 1. The present invention considers that in the concept of space and time, two trajectories passing through the same position at different speeds at the same time should be different, so the Euclidean distance cannot be simply used to measure similarity. New The inter-trajectory distance measurement method provides a processing process for the three dimensions of trajectory data: time, space and speed, which improves the accuracy of the final result. It introduces weight parameters to coordinate the influence of each component on the clustering result, making it more accurate.
2.本发明综合考虑数据的相关性、差异性和离散性等性质,创造了一种自动调控的权重参数的方法,解决了手工调控权重参数效率低的问题,同时使得权重参数的计算更为科学有效。2. The present invention comprehensively considers the correlation, difference, discreteness and other properties of data, and creates a method of automatically adjusting weight parameters, which solves the problem of low efficiency of manual adjustment of weight parameters, and at the same time makes the calculation of weight parameters more efficient. Scientifically valid.
3.本发明考虑到一个子轨迹可能属于多个轨迹簇的不确定性,基于模糊逻辑的聚类方法,对轨迹数据进行柔性的模糊划分,该方法能够根据不同的应用场景更加真实有效地度量不同子轨迹之间的相似程度,从而提高轨迹数据填充的准确性。3. This invention takes into account the uncertainty that a sub-trajectory may belong to multiple trajectory clusters, and uses a fuzzy logic-based clustering method to perform flexible fuzzy division of trajectory data. This method can more realistically and effectively measure the trajectory data according to different application scenarios. The degree of similarity between different sub-trajectories, thereby improving the accuracy of trajectory data filling.
附图说明Description of the drawings
图1为本发明的流程图;Figure 1 is a flow chart of the present invention;
图2为本发明中原始轨迹的流程图;Figure 2 is a flow chart of the original trajectory in the present invention;
图3为本发明原始轨迹划分后的子轨迹段示意图;Figure 3 is a schematic diagram of sub-trajectory segments after the original trajectory is divided according to the present invention;
图4为本发明中子轨迹在时间分离情况下的时间距离示意图;Figure 4 is a schematic diagram of the time distance of neutron trajectories in the case of time separation according to the present invention;
图5为本发明中子轨迹在时间相交情况下的时间距离示意图;Figure 5 is a schematic diagram of the time distance of neutron trajectories in the case of time intersection according to the present invention;
图6a为本发明中子轨迹st i到st j的空间距离示意图; Figure 6a is a schematic diagram of the spatial distance from neutron trajectory st i to st j in the present invention;
图6b为本发明中子轨迹st j到st i的空间距离示意图; Figure 6b is a schematic diagram of the spatial distance from neutron trajectory st j to st i in the present invention;
图7为本发明中计算子轨迹综合距离过程中时间、空间以及速度距离所占权重的示意图。Figure 7 is a schematic diagram of the weights of time, space and speed distance in the process of calculating the comprehensive distance of the sub-trajectory in the present invention.
具体实施方式Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合附图对本发明的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Obviously, the described embodiments are part of the embodiments of the present invention, not all of them. Embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts fall within the scope of protection of the present invention.
为了更好地理解本发明,下面对本发明技术方案中的相关技术进行说明。In order to better understand the present invention, the relevant technologies in the technical solution of the present invention will be described below.
实施例1Example 1
如图1-7所示,本发明公开了一种缺失轨迹填补方法,包括以下步骤:As shown in Figures 1-7, the present invention discloses a missing trajectory filling method, which includes the following steps:
步骤一、获取原始轨迹 Step 1. Obtain the original trajectory
获取包含所有记录点的原始汽车运动的原始轨迹。Get the original trajectory of the original car motion containing all recorded points.
步骤二、划分子轨迹Step 2: Divide sub-trajectories
2-1)首先,逐个扫描原始轨迹中的每个记录点,得到原始轨迹集T={P 0,...P i,...,P n},其中,P i=(x i,y i,v i,t i),x i,y i、v i、t i分别为轨迹集中第i个记录点P i的横坐标、纵坐标,速度,以及记录时间,设子轨迹的起始点P start=P 0,当前记录点P now=P 12-1) First, scan each recording point in the original trajectory one by one to obtain the original trajectory set T={P 0 ,...P i ,...,P n }, where P i =(xi , y i , vi , t i ), x i , y i , vi , and t i are respectively the abscissa, ordinate, speed, and recording time of the i-th recording point P i in the trajectory set. Let the starting point of the sub-trajectory be Starting point P start =P 0 , current recording point P now =P 1 ;
2-2)设子轨迹集为STC,STC={st 0,...st i,...,st n},st i={P i0,...,P ij,...,P in},P ij={x ij,y ij,v ij,t ij},i=0...n,并将STC、st i以及P ij中元素全部初始化为0,其中st i为第i条子轨迹,P i0、P in分别为子轨迹st i的起点和终点,x ij、y ij、v ij以及t ij分别表示第i条子轨迹第j个记录点P ij的的横坐标、纵坐标、速度以及记录时间; 2-2) Let the sub-trajectory set be STC, STC={st 0 ,...st i ,...,st n }, st i ={P i0 ,...,P ij ,...,P in }, P ij = {x ij ,y ij ,v ij ,t ij }, i=0...n, and initialize all elements in STC, st i and P ij to 0, where st i is the i-th For the sub-trajectory, P i0 and P in are respectively the starting point and the end point of the sub-trajectory st i . , speed and recording time;
2-3)将当前记录点的方向变化量定义为Δd、子轨迹的方向累计变化量定义为Δd +,将它们初始值均设置为0,d now、d start分别表示当前记录点的方向和子轨迹起始点的方向; 2-3) Define the direction change amount of the current recording point as Δd, and the cumulative direction change amount of the sub-track as Δd + . Set their initial values to 0. d now and d start respectively represent the direction and sub-track of the current recording point. The direction of the trajectory starting point;
2-4)将当前记录点的平均速率变化量定义为
Figure PCTCN2022112691-appb-000023
初始值设置为0;
2-4) Define the average rate change of the current recording point as
Figure PCTCN2022112691-appb-000023
The initial value is set to 0;
2-5)若P start≠P n,计算方向变化量Δd的绝对值,|Δd|=|d now-d start|,否则结束特征点判定; 2-5) If P start ≠P n , calculate the absolute value of the direction change Δd, |Δd|=|d now -d start |, otherwise end the feature point determination;
2-6)计算当前记录点的方向累计变化量Δd +的绝对值,|Δd +|=|Δd+Δd +(old)|,Δd +(old)为 上一个记录点的方向累计变化量; 2-6) Calculate the absolute value of the cumulative change in direction Δd + of the current recording point, |Δd + | = |Δd + Δd + (old) |, Δd + (old) is the cumulative change in direction of the previous recording point;
2-7)计算平均速率变化量
Figure PCTCN2022112691-appb-000024
2-7) Calculate the average rate change
Figure PCTCN2022112691-appb-000024
2-8)若得到的当前记录点的方向变化量的绝对值|Δd|或者方向累计变化量绝对值|Δd +|大于所设定方向门限值d(该门限值是通过经验值确定),则将当前记录点认定为候选的特征点,令P end=P now,并将该子轨迹st i={P start,...P end}添加到子轨迹集STC中,将Δd、Δd +
Figure PCTCN2022112691-appb-000025
重新赋值为0,再令P start=P end,跳转至步骤2-5)继续下一个点的判定,P end为当前子轨迹的结尾点;
2-8) If the obtained absolute value |Δd| of the direction change of the current recording point or the absolute value of the cumulative direction change |Δd + | is greater than the set direction threshold d (the threshold is determined through empirical values ), the current recording point is identified as a candidate feature point, let P end =P now , and the sub-trajectory st i ={P start ,...P end } is added to the sub-trajectory set STC, and Δd, Δd + ,
Figure PCTCN2022112691-appb-000025
Reassign the value to 0, then set P start = P end , jump to step 2-5) to continue the determination of the next point, P end is the end point of the current sub-trajectory;
若得到的方向变化量的绝对值|Δd|和方向累计变化量绝对值|Δd +|小于等于所设定方向门限值d,则比较计算出的平均速率变化量
Figure PCTCN2022112691-appb-000026
与速度门限值v的大小,若
Figure PCTCN2022112691-appb-000027
超出速度门限值v,则该记录点被认为是一个候选的特征点,令P end=P now,将子轨迹st i={P start,...,P end}添加到子轨迹集STC中,跳转至步骤2-5)继续下一个记录点的判定;
If the obtained absolute value of direction change |Δd| and the absolute value of cumulative direction change |Δd + | are less than or equal to the set direction threshold value d, then compare the calculated average velocity change
Figure PCTCN2022112691-appb-000026
and the speed threshold value v, if
Figure PCTCN2022112691-appb-000027
exceeds the speed threshold v, then the record point is considered a candidate feature point, let P end =P now , and add the sub-trajectory st i ={P start ,...,P end } to the sub-trajectory set STC , jump to step 2-5) to continue judging the next recording point;
若得到的方向变化量的绝对值|Δd|和方向累计变化量绝对值|Δd +|小于等于所设定方向门限值d(根据经验值设定),且平均速率变化量
Figure PCTCN2022112691-appb-000028
小于速度门限值v,则令P now=P now+1,然后跳转至步骤2-5)继续进行下一个记录点的判断。
If the obtained absolute value of direction change |Δd| and the absolute value of cumulative direction change |Δd + | are less than or equal to the set direction threshold d (set based on empirical values), and the average velocity change
Figure PCTCN2022112691-appb-000028
is less than the speed threshold value v, then let P now =P now+1 , and then jump to step 2-5) to continue judging the next recording point.
求取子轨迹主要就是求取子轨迹的起始点和结尾点,这两个点,都是特征点,一旦这两个确定子轨迹就确定了,只需要将起始点和结尾点以及他们之间的普通的记录点串联起来即可以得到完整的子轨迹。Finding the sub-trajectory is mainly to find the starting point and the ending point of the sub-trajectory. These two points are both characteristic points. Once these two are determined, the sub-trajectory is determined. You only need to add the starting point and the ending point and the distance between them. The complete sub-trajectory can be obtained by connecting the common recording points in series.
步骤三、计算子轨迹之间的时间距离、空间距离以及速度距离Step 3: Calculate the time distance, space distance and speed distance between sub-trajectories
在得到子轨迹之后我们需要分别计算子轨迹之间的时间、空间和速度距离,用于之后步骤综合考虑轨迹数据中所有相关的属性维度。After obtaining the sub-trajectories, we need to calculate the time, space and speed distances between the sub-trajectories respectively, which will be used in subsequent steps to comprehensively consider all relevant attribute dimensions in the trajectory data.
三个距离的计算过程主要包括如下步骤:The calculation process of the three distances mainly includes the following steps:
首先,我们来求取时间距离,取两个子轨迹st i、st j,两个子轨迹的开始时间分别为t si和t sj,两个子轨迹的结束时间分别为t ei和t ej,子轨迹中间时间点的差的绝对值取为子轨迹之间的时间距离。 First, let's find the time distance, take two sub-trajectories st i and st j , the start time of the two sub-trajectories are t si and t sj respectively, the end time of the two sub-trajectories are t ei and t ej respectively, the middle of the sub-trajectory The absolute value of the difference in time points is taken as the time distance between subtrajectories.
通过下式求取两个子轨迹之间的时间距离Find the time distance between two sub-trajectories by the following formula
dist t(st i,st j)=|t i-t j|  (1); dist t (st i ,st j )=|t i -t j | (1);
其中,t i表示第i条子轨迹st i的中间时间,t j表示第第j条子轨迹st j的中间时间; Among them, t i represents the intermediate time of the i-th sub-trajectory st i , and t j represents the intermediate time of the j-th sub-trajectory st j ;
接下来,我们求取空间距离,子轨迹st i={P i0,P i1,...,P iq},子轨迹st j={P j0,P j1,...,P jw} Next, we find the spatial distance, sub-trajectory st i ={P i0 ,P i1 ,...,P iq }, sub-trajectory st j ={P j0 ,P j1 ,...,P jw }
其中,st i,st j中的每一个元素都是该条子轨迹的记录点,同时首元素和末元素也是步骤二中选定的特征点,特征点首先是记录点,q为st i记录点点的个数,w为st j中记录点的个数,子轨迹st i中任意一个记录点到子轨迹st j的最小距离作为该记录点到st j的距离。子轨迹st j中任意一个记录点到子轨迹st i的最小距离作为该记录点到st i的距离。 Among them, each element in st i and st j is the recording point of the sub-track. At the same time, the first element and the last element are also the feature points selected in step 2. The feature points are first the recording points, and q is the recording point of st i . The number of , w is the number of recording points in st j , and the minimum distance from any recording point in sub-track st i to sub-track st j is taken as the distance from the recording point to st j . The minimum distance from any recording point in sub-track st j to sub-track st i is taken as the distance from the recording point to st i .
根据轨迹双向匹配计算子轨迹之间的空间距离,如式(2)所示Calculate the spatial distance between sub-trajectories based on two-way trajectory matching, as shown in Equation (2)
Figure PCTCN2022112691-appb-000029
Figure PCTCN2022112691-appb-000029
其中,
Figure PCTCN2022112691-appb-000030
为子轨迹st i到子轨迹st j的空间距离,
Figure PCTCN2022112691-appb-000031
为子轨迹st j到子轨迹st i的空间距离;
in,
Figure PCTCN2022112691-appb-000030
is the spatial distance from sub-trajectory st i to sub-trajectory st j ,
Figure PCTCN2022112691-appb-000031
is the spatial distance from sub-trajectory st j to sub-trajectory st i ;
最后求取速度距离Finally find the speed distance
将子轨迹中所有记录点的速度取平均值作为该子轨迹段的速度,将要相互比较的子轨迹段的速度差取绝对值作为子轨迹之间的速度距离,如式(3)所示,The average speed of all recording points in the sub-trajectory is taken as the speed of the sub-trajectory segment, and the absolute value of the speed difference between the sub-trajectory segments to be compared is taken as the speed distance between the sub-trajectories, as shown in Equation (3),
Figure PCTCN2022112691-appb-000032
Figure PCTCN2022112691-appb-000032
其中,
Figure PCTCN2022112691-appb-000033
为子轨迹st i的平均速度,
Figure PCTCN2022112691-appb-000034
为子轨迹st j的平均速度。
in,
Figure PCTCN2022112691-appb-000033
is the average speed of sub-trajectory st i ,
Figure PCTCN2022112691-appb-000034
is the average speed of sub-trajectory st j .
步骤四、求各种距离所占权重Step 4: Find the weights of various distances
将步骤三种算出来的三个距离作为参数生成新的集合N i作为样本输入 Use the three distances calculated in the three steps as parameters to generate a new set N i as sample input
N i={{dist t(st i,st 0),dist s(st i,st 0),dist v(st i,st 0)},..., N i ={{dist t (st i ,st 0 ),dist s (st i ,st 0 ),dist v (st i ,st 0 )},...,
{dist t(st i,st j),dist s(st i,st j),dist v(st i,st j)},...,{dist t(st i,st n),dist s(st i,st n),dist v(st i,st n)}} {dist t (st i ,st j ),dist s (st i ,st j ),dist v (st i ,st j )},...,{dist t (st i ,st n ),dist s ( st i ,st n ),dist v (st i ,st n )}}
利用改进的CRITIC算法和熵权法分别对每一个子轨迹数据样本的位置、时间、速度分量进行综合赋权计算。假设样本空间中有u个数据对象x 1,x 2,…x i,…x u,根据式(4)、(5)得到第k个数据对象在含u个数据对象的整个样本空间中的差异系数S k The improved CRITIC algorithm and entropy weight method are used to perform comprehensive weighting calculations on the position, time, and velocity components of each sub-trajectory data sample. Assume that there are u data objects x 1 , x 2 ,...x i ,...x u in the sample space. According to formulas (4) and (5), the k-th data object in the entire sample space containing u data objects is obtained. Difference coefficient Sk
Figure PCTCN2022112691-appb-000035
Figure PCTCN2022112691-appb-000035
Figure PCTCN2022112691-appb-000036
Figure PCTCN2022112691-appb-000036
其中,
Figure PCTCN2022112691-appb-000037
为第k个数据对象在整个样本空间的平均值;
in,
Figure PCTCN2022112691-appb-000037
is the average value of the kth data object in the entire sample space;
利用改进的CRITIC算法得到第k个数据对象在含u个数据对象的整个样本空间中的不相关系数R k,如式(6)所示 Use the improved CRITIC algorithm to obtain the uncorrelated coefficient R k of the k-th data object in the entire sample space containing u data objects, as shown in Equation (6)
Figure PCTCN2022112691-appb-000038
Figure PCTCN2022112691-appb-000038
其中,r ik为第i个数据对象与第k个数据对象的相关系数; Among them, r ik is the correlation coefficient between the i-th data object and the k-th data object;
利用熵权法根据式(7)、(8)得到第k个数据对象在含u个数据对象的整个样本空间中的信息熵E k Use the entropy weight method to obtain the information entropy E k of the k-th data object in the entire sample space containing u data objects according to equations (7) and (8).
Figure PCTCN2022112691-appb-000039
Figure PCTCN2022112691-appb-000039
Figure PCTCN2022112691-appb-000040
Figure PCTCN2022112691-appb-000040
其中,P k为第k个数据对象的指标值的比重; Among them, P k is the proportion of the indicator value of the k-th data object;
将CRITIC法和熵权法结合,根据式(9)得到第k个数据对象在含u个数据对象的整个样本空间中的权重ω k Combining the CRITIC method and the entropy weight method, the weight ω k of the k-th data object in the entire sample space containing u data objects is obtained according to Equation (9)
Figure PCTCN2022112691-appb-000041
Figure PCTCN2022112691-appb-000041
步骤五、求取综合距离Step 5: Find the comprehensive distance
传统的轨迹间相似度度量方法没有同时考虑到轨迹数据的多个属性维度对轨迹数据量化的影响,本发明综合考虑了轨迹数据中的时间距离、速率距离和空间距离求取综合距离,如式(10)所示,dist(st i,st j)=α×dist t(st i,st j)+β×dist s(st i,st j)+γ×dist v(st i,st j)  (10) The traditional similarity measurement method between trajectories does not take into account the impact of multiple attribute dimensions of trajectory data on trajectory data quantification at the same time. The present invention comprehensively considers the time distance, rate distance and spatial distance in trajectory data to calculate the comprehensive distance, as shown in Eq. As shown in (10), dist(st i ,st j )=α×dist t (st i ,st j )+β×dist s (st i ,st j )+γ×dist v (st i ,st j ) (10)
其中,α、β、γ分别为时间距离、空间距离以及速度距离的权重值,当第k个数据对象为时间距离时ω k和α对应,当第k个数据对象为空间距离时ω k和β对应,当第k个数据对象为空间距离时ω k和γ对应。 Among them, α, β, and γ are the weight values of time distance, space distance, and speed distance respectively. When the k-th data object is time distance, ω k and α correspond to each other. When the k-th data object is space distance, ω k and β corresponds to ω k and γ when the k-th data object is a spatial distance.
步骤六、填补缺失轨迹Step 6: Fill in missing tracks
6-1)初始化隶属度矩阵,隶属度矩阵表示每条子轨迹隶属于每个簇的程度,用取值范围在[0,1]之内的随机数初始化隶属度矩阵U,并且使该矩阵中所有元素的总和等于1,隶属度 矩阵表示每条子轨迹隶属于每个簇的程度;6-1) Initialize the membership matrix. The membership matrix represents the degree to which each sub-trajectory belongs to each cluster. Initialize the membership matrix U with a random number in the range [0,1], and make the matrix The sum of all elements is equal to 1, and the membership matrix represents the degree to which each sub-trajectory belongs to each cluster;
6-2)为每个簇选定核心轨迹,即根据式(11)计算模糊C均值聚类的聚类中心c i,i=1,…,c; 6-2) Select the core trajectory for each cluster, that is, calculate the cluster center c i of fuzzy C-means clustering according to Equation (11), i=1,...,c;
Figure PCTCN2022112691-appb-000042
Figure PCTCN2022112691-appb-000042
其中,c为聚类中心的数量,n为子轨迹的数量,u ij为第j条子轨迹隶属于第i条子轨迹的隶属度,st i表示第i条子轨迹; Among them, c is the number of cluster centers, n is the number of sub-trajectories, u ij is the membership degree of the j-th sub-trajectory to the i-th sub-trajectory, st i represents the i-th sub-trajectory;
6-3)更新隶属度矩阵,即根据式(12)重新计算第j条子轨迹隶属于第i条子轨迹的隶属度;6-3) Update the membership matrix, that is, recalculate the membership degree of the j-th sub-trajectory to the i-th sub-trajectory according to equation (12);
Figure PCTCN2022112691-appb-000043
Figure PCTCN2022112691-appb-000043
其中,d ij是第i条子轨迹和第j条子轨迹之间的距离; Among them, d ij is the distance between the i-th sub-trajectory and the j-th sub-trajectory;
6-4)计算代价函数的值,轨迹之间的距离决定了一条轨迹隶属于某一个簇的程度。根据式(13)计算代价函数的值J;6-4) Calculate the value of the cost function. The distance between trajectories determines the degree to which a trajectory belongs to a certain cluster. Calculate the value J of the cost function according to Equation (13);
Figure PCTCN2022112691-appb-000044
Figure PCTCN2022112691-appb-000044
若J小于某个确定的代价阀值((例如:e -6))或者相对于上次计算的代价函数J的变化量小于变化量阀值(例如:e -6),则结束算法,否则返回步骤6-2); If J is less than a certain cost threshold ((for example: e -6 )) or the change relative to the last calculated cost function J is less than the change threshold (for example: e -6 ), the algorithm ends, otherwise Return to step 6-2);
6-5)轨迹填补。聚类完成后即得到每个簇的隶属度矩阵,用该矩阵作为权重乘以每个簇的核心轨迹的参数,得到最终填充的轨迹数据。6-5) Track filling. After clustering is completed, the membership matrix of each cluster is obtained. This matrix is used as a weight to multiply the parameters of the core trajectory of each cluster to obtain the final filled trajectory data.
实施例2Example 2
本发明还提供了一种缺失轨迹填补系统,包括:The present invention also provides a missing trajectory filling system, including:
数据获取模块,用于获取原始轨迹;Data acquisition module, used to obtain original trajectories;
轨迹划分模块,用于将原始轨迹划分为若干子轨迹;The trajectory division module is used to divide the original trajectory into several sub-trajectories;
综合距离计算模块,用于计算各子轨迹之间的时间距离、空间距离以及速度距离;Comprehensive distance calculation module, used to calculate the time distance, space distance and speed distance between each sub-trajectory;
计算时间距离、空间距离以及速度距离所占权重;Calculate the weight of time distance, space distance and speed distance;
根据时间距离、空间距离以及速度距离所占权重获取各子轨迹之间的综合距离;Obtain the comprehensive distance between each sub-trajectory based on the weight of time distance, space distance and speed distance;
轨迹填补模块,用于根据综合距离采用聚类得填补缺失的轨迹。The trajectory filling module is used to fill in missing trajectories using clustering based on comprehensive distance.
本领域内的技术人员应明白,本发明的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的 形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art will appreciate that embodiments of the present invention may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment that combines software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.
本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each process and/or block in the flowchart illustrations and/or block diagrams, and combinations of processes and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce a use A device for realizing the functions specified in one process or multiple processes of the flowchart and/or one block or multiple blocks of the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory that causes a computer or other programmable data processing apparatus to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction means, the instructions The device implements the functions specified in a process or processes of the flowchart and/or a block or blocks of the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions may also be loaded onto a computer or other programmable data processing device, causing a series of operating steps to be performed on the computer or other programmable device to produce computer-implemented processing, thereby executing on the computer or other programmable device. Instructions provide steps for implementing the functions specified in a process or processes of a flowchart diagram and/or a block or blocks of a block diagram.

Claims (7)

  1. 一种缺失轨迹填补方法,其特征在于,包括:A missing trajectory filling method, which is characterized by including:
    获取原始轨迹;Get the original trajectory;
    将原始轨迹划分为若干子轨迹;Divide the original trajectory into several sub-trajectories;
    计算各子轨迹之间的时间距离、空间距离以及速度距离;Calculate the time distance, space distance and speed distance between each sub-trajectory;
    计算时间距离、空间距离以及速度距离所占权重;Calculate the weight of time distance, space distance and speed distance;
    根据时间距离、空间距离以及速度距离所占权重获取各子轨迹之间的综合距离;Obtain the comprehensive distance between each sub-trajectory based on the weight of time distance, space distance and speed distance;
    根据综合距离采用聚类得填补缺失的轨迹。Clustering is used to fill in the missing trajectories based on the comprehensive distance.
  2. 根据权利要求1所述的一种缺失轨迹填补方法,其特征在于,所述将原始轨迹划分为若干子轨迹包括:A missing trajectory filling method according to claim 1, characterized in that dividing the original trajectory into several sub-trajectories includes:
    2-1)逐个扫描原始轨迹中的每个记录点,生成原始轨迹集T={P 0,...P i,...,P n},其中,P i=(x i,y i,v i,t i),x i,y i、v i、t i分别为轨迹集中第i个记录点P i的横坐标、纵坐标,速度,以及记录时间,设子轨迹的起始点P start=P 0,当前记录点P now=P 12-1) Scan each recording point in the original trajectory one by one to generate the original trajectory set T={P 0 ,...P i ,...,P n }, where P i =(xi , y i ,v i ,t i ), x i , y i , vi , and t i are respectively the abscissa, ordinate, speed, and recording time of the i-th recording point Pi in the trajectory set. Let the starting point P of the sub-trajectory be start =P 0 , current recording point P now =P 1 ;
    2-2)设子轨迹集为STC,STC={st 0,...st i,...,st n},st i={P i0,...,P ij,...,P in},P ij={x ij,y ij,v ij,t ij},i=0...n,并将STC、st i以及P ij中元素全部初始化为0,其中st i为第i条子轨迹,P i0、P in分别为子轨迹st i的起点和终点,x ij、y ij、v ij以及t ij分别为第i条子轨迹第j个记录点P ij的的横坐标、纵坐标、速度以及记录时间; 2-2) Let the sub-trajectory set be STC, STC={st 0 ,...st i ,...,st n }, st i ={P i0 ,...,P ij ,...,P in }, P ij = {x ij ,y ij ,v ij ,t ij }, i=0...n, and initialize all elements in STC, st i and P ij to 0, where st i is the i-th For the sub-trajectory, P i0 and P in are respectively the starting point and the end point of the sub-trajectory st i . x ij , y ij , v ij and t ij are respectively the abscissa and ordinate coordinates of the j-th recording point P ij of the i-th sub-trajectory. , speed and recording time;
    2-3)将当前记录点的方向变化量定义为Δd、子轨迹的方向累计变化量定义为Δd +,将它们初始值均设置为0,d now、d start分别为当前记录点的方向和子轨迹起始点的方向; 2-3) Define the direction change amount of the current recording point as Δd, and the cumulative direction change amount of the sub-track as Δd + . Set their initial values to 0, d now and d start are the direction and sub-direction of the current recording point respectively. The direction of the trajectory starting point;
    2-4)将当前记录点的平均速率变化量定义为
    Figure PCTCN2022112691-appb-100001
    初始值设置为0;
    2-4) Define the average rate change of the current recording point as
    Figure PCTCN2022112691-appb-100001
    The initial value is set to 0;
    2-5)若P start≠P n,计算方向变化量Δd的绝对值,|Δd|=|d now-d start|,否则结束特征点判定; 2-5) If P start ≠P n , calculate the absolute value of the direction change Δd, |Δd|=|d now -d start |, otherwise end the feature point determination;
    2-6)计算当前记录点的方向累计变化量Δd +的绝对值,|Δd +|=|Δd+Δd +(old)|,Δd +(old)为上一个记录点的方向累计变化量; 2-6) Calculate the absolute value of the cumulative change in direction Δd + of the current recording point, |Δd + | = |Δd + Δd + (old) |, Δd + (old) is the cumulative change in direction of the previous recording point;
    2-7)计算平均速率变化量
    Figure PCTCN2022112691-appb-100002
    2-7) Calculate the average rate change
    Figure PCTCN2022112691-appb-100002
    2-8)若得到的当前记录点的方向变化量的绝对值|Δd|或者方向累计变化量绝对值|Δd +| 大于所设定方向门限值d,则将当前记录点认定为候选的特征点,令P end=P now,并将该子轨迹st i={P start,...P end}添加到子轨迹集STC中,将Δd、Δd +
    Figure PCTCN2022112691-appb-100003
    重新赋值为0,再令P start=P end,跳转至步骤2-5)继续下一个点的判定,P end为当前子轨迹的结尾点;
    2-8) If the obtained absolute value |Δd| of the direction change of the current recording point or the absolute value of the cumulative direction change |Δd + | is greater than the set direction threshold d, the current recording point will be identified as a candidate. Feature points, let P end =P now , and add the sub-trajectory st i ={P start ,...P end } to the sub-trajectory set STC, add Δd, Δd + ,
    Figure PCTCN2022112691-appb-100003
    Reassign the value to 0, then set P start = P end , jump to step 2-5) to continue the determination of the next point, P end is the end point of the current sub-trajectory;
    若得到的方向变化量的绝对值|Δd|和方向累计变化量绝对值|Δd +|小于等于所设定方向门限值d,则比较计算出的平均速率变化量
    Figure PCTCN2022112691-appb-100004
    与速度门限值v的大小,若
    Figure PCTCN2022112691-appb-100005
    超出速度门限值v,则该记录点被认为是一个候选的特征点,令P end=P now,将子轨迹st i={P start,...,P end}添加到子轨迹集STC中,跳转至步骤2-5)继续下一个记录点的判定;
    If the obtained absolute value of direction change |Δd| and the absolute value of cumulative direction change |Δd + | are less than or equal to the set direction threshold value d, then compare the calculated average velocity change
    Figure PCTCN2022112691-appb-100004
    and the speed threshold value v, if
    Figure PCTCN2022112691-appb-100005
    exceeds the speed threshold v, then the record point is considered a candidate feature point, let P end =P now , and add the sub-trajectory st i ={P start ,...,P end } to the sub-trajectory set STC , jump to step 2-5) to continue judging the next recording point;
    若得到的方向变化量的绝对值|Δd|和方向累计变化量绝对值|Δd +|小于等于所设定方向门限值d,且平均速率变化量
    Figure PCTCN2022112691-appb-100006
    小于速度门限值v,则令P now=P now+1,跳转至步骤2-5)继续进行下一个记录点的判断。
    If the obtained absolute value of direction change |Δd| and the absolute value of cumulative direction change |Δd + | are less than or equal to the set direction threshold value d, and the average velocity change
    Figure PCTCN2022112691-appb-100006
    If it is less than the speed threshold value v, let P now = P now + 1 and jump to step 2-5) to continue judging the next recording point.
  3. 根据权利要求1所述的一种缺失轨迹填补方法,其特征在于,所述计算各子轨迹之间的时间距离、空间距离以及速度距离包括:A missing trajectory filling method according to claim 1, characterized in that the calculation of the time distance, spatial distance and speed distance between each sub-trajectory includes:
    根据式(1)求取两个子轨迹之间的时间距离Calculate the time distance between two sub-trajectories according to equation (1)
    dist t(st i,st j)=|t i-t j|  (1); dist t (st i ,st j )=|t i -t j | (1);
    其中,t i表示第i条子轨迹st i的中间时间,t j表示第第j条子轨迹st j的中间时间; Among them, t i represents the intermediate time of the i-th sub-trajectory st i , and t j represents the intermediate time of the j-th sub-trajectory st j ;
    根据式(2)求取两个子轨迹之间的空间距离Calculate the spatial distance between two sub-trajectories according to equation (2)
    Figure PCTCN2022112691-appb-100007
    Figure PCTCN2022112691-appb-100007
    其中,
    Figure PCTCN2022112691-appb-100008
    为子轨迹st i到子轨迹st j的空间距离,
    Figure PCTCN2022112691-appb-100009
    为子轨迹st j到子轨迹st i的空间距离;
    in,
    Figure PCTCN2022112691-appb-100008
    is the spatial distance from sub-trajectory st i to sub-trajectory st j ,
    Figure PCTCN2022112691-appb-100009
    is the spatial distance from sub-trajectory st j to sub-trajectory st i ;
    根据式(3)求取两个子轨迹之间的速度距离Calculate the speed distance between the two sub-trajectories according to equation (3)
    Figure PCTCN2022112691-appb-100010
    Figure PCTCN2022112691-appb-100010
    其中,
    Figure PCTCN2022112691-appb-100011
    为子轨迹st i的平均速度,
    Figure PCTCN2022112691-appb-100012
    为子轨迹st j的平均速度。
    in,
    Figure PCTCN2022112691-appb-100011
    is the average speed of sub-trajectory st i ,
    Figure PCTCN2022112691-appb-100012
    is the average speed of sub-trajectory st j .
  4. 根据权利要求3所述的一种缺失轨迹填补方法,其特征在于,所述计算时间距离、空间距离以及速度距离所占权重包括:A missing trajectory filling method according to claim 3, characterized in that the calculation of the weights of time distance, space distance and speed distance includes:
    将每条子轨迹与其他子轨迹的时间距离、空间距以及速度距离作为样本进行综合赋权计算,包括如下步骤:The time distance, spatial distance and velocity distance between each sub-trajectory and other sub-trajectories are used as samples to perform comprehensive weighting calculations, including the following steps:
    假设样本空间中有u个数据对象x 1,x 2,…x i,…x u,根据下式得到第k个数据对象在含u个数据对象的整个样本空间中的差异系数S kAssume that there are u data objects x 1 , x 2 ,...x i ,...x u in the sample space. According to the following formula, the difference coefficient S k of the k-th data object in the entire sample space containing u data objects is obtained;
    Figure PCTCN2022112691-appb-100013
    Figure PCTCN2022112691-appb-100013
    Figure PCTCN2022112691-appb-100014
    Figure PCTCN2022112691-appb-100014
    其中,
    Figure PCTCN2022112691-appb-100015
    为第k个数据对象在整个样本空间的平均值;
    in,
    Figure PCTCN2022112691-appb-100015
    is the average value of the kth data object in the entire sample space;
    根据式(6)得到第k个数据对象在含u个数据对象的整个样本空间中的不相关系数R k According to equation (6), the uncorrelated coefficient R k of the k-th data object in the entire sample space containing u data objects is obtained
    Figure PCTCN2022112691-appb-100016
    Figure PCTCN2022112691-appb-100016
    其中,r ik为第i个数据对象与第k个数据对象的相关系数; Among them, r ik is the correlation coefficient between the i-th data object and the k-th data object;
    根据式(7)、(8)得到第k个数据对象在含u个数据对象的整个样本空间中的信息熵E k According to equations (7) and (8), the information entropy E k of the k-th data object in the entire sample space containing u data objects is obtained
    Figure PCTCN2022112691-appb-100017
    Figure PCTCN2022112691-appb-100017
    Figure PCTCN2022112691-appb-100018
    Figure PCTCN2022112691-appb-100018
    其中,P k为第k个数据对象的指标值的比重; Among them, P k is the proportion of the indicator value of the k-th data object;
    根据式(9)得到第k个数据对象在含u个数据对象的整个样本空间中的权重ω k According to equation (9), the weight ω k of the k-th data object in the entire sample space containing u data objects is obtained
    Figure PCTCN2022112691-appb-100019
    Figure PCTCN2022112691-appb-100019
  5. 根据权利要求4所述的一种缺失轨迹填补方法,其特征在于,获取各子轨迹之间的综合距离包括:A missing trajectory filling method according to claim 4, characterized in that obtaining the comprehensive distance between each sub-trajectory includes:
    通过式(10)得到各子轨迹之间的综合距离The comprehensive distance between each sub-trajectory is obtained through Equation (10)
    dist(st i,st j)=α×dist t(st i,st j)+β×dist s(st i,st j)+γ×dist v(st i,st j)  (10) dist (st i ,st j )=α×dist t (st i ,st j )+β×dist s (st i ,st j )+γ×dist v (st i ,st j ) (10)
    其中,α、β、γ分别为时间距离、空间距离以及速度距离的权重值,当第k个数据对象为时间距离时ω k和α对应,当第k个数据对象为空间距离时ω k和β对应,当第k个数据对 象为空间距离时ω k和γ对应。 Among them, α, β, and γ are the weight values of time distance, space distance, and speed distance respectively. When the k-th data object is time distance, ω k and α correspond to each other. When the k-th data object is space distance, ω k and β corresponds to ω k and γ when the k-th data object is a spatial distance.
  6. 根据权利要求1所述的一种缺失轨迹填补方法,其特征在于,所述根据综合距离采用聚类得填补缺失的轨迹包括:A missing trajectory filling method according to claim 1, characterized in that the filling of missing trajectories by clustering based on comprehensive distance includes:
    5-1)初始化隶属度矩阵,用取值范围在[0,1]之内的随机数初始化隶属度矩阵U,并且使该矩阵中所有元素的总和等于1,隶属度矩阵表示每条子轨迹隶属于每个簇的程度;5-1) Initialize the membership matrix, initialize the membership matrix U with a random number within the value range [0,1], and make the sum of all elements in the matrix equal to 1. The membership matrix represents the membership of each sub-trajectory to the extent of each cluster;
    5-2)为每个簇选定核心轨迹,即根据式(11)计算模糊C均值聚类的聚类中心c i,i=1,…,c; 5-2) Select the core trajectory for each cluster, that is, calculate the cluster center c i of fuzzy C-means clustering according to Equation (11), i=1,...,c;
    Figure PCTCN2022112691-appb-100020
    Figure PCTCN2022112691-appb-100020
    其中,c为聚类中心的数量,n为子轨迹的数量,u ij为第j条子轨迹隶属于第i条子轨迹的隶属度,st i表示第i条子轨迹; Among them, c is the number of cluster centers, n is the number of sub-trajectories, u ij is the membership degree of the j-th sub-trajectory to the i-th sub-trajectory, st i represents the i-th sub-trajectory;
    5-3)更新隶属度矩阵,即根据式(12)重新计算第j条子轨迹隶属于第i条子轨迹的隶属度;5-3) Update the membership matrix, that is, recalculate the membership degree of the j-th sub-trajectory to the i-th sub-trajectory according to equation (12);
    Figure PCTCN2022112691-appb-100021
    Figure PCTCN2022112691-appb-100021
    其中,d ij是第i条子轨迹和第j条子轨迹之间的距离; Among them, d ij is the distance between the i-th sub-trajectory and the j-th sub-trajectory;
    5-4)计算代价函数的值,即根据式(13)计算代价函数的值J;5-4) Calculate the value of the cost function, that is, calculate the value J of the cost function according to equation (13);
    Figure PCTCN2022112691-appb-100022
    Figure PCTCN2022112691-appb-100022
    若J小于代价阀值或者相对于上次计算的代价函数J的变化量小于变化量阀值,则结束计算,否则返回步骤5-2);If J is less than the cost threshold or the change relative to the last calculated cost function J is less than the change threshold, end the calculation, otherwise return to step 5-2);
    5-5)轨迹填补,即用隶属矩阵作为权重乘以每个簇的聚类中心,得到最终填充的轨迹。5-5) Trajectory filling, that is, using the membership matrix as a weight to multiply the cluster center of each cluster to obtain the final filled trajectory.
  7. 一种缺失轨迹填补系统,其特征在于,包括:A missing track filling system, which is characterized by including:
    数据获取模块,用于获取原始轨迹;Data acquisition module, used to obtain original trajectories;
    轨迹划分模块,用于将原始轨迹划分为若干子轨迹;The trajectory division module is used to divide the original trajectory into several sub-trajectories;
    综合距离计算模块,用于计算各子轨迹之间的时间距离、空间距离以及速度距离;Comprehensive distance calculation module, used to calculate the time distance, space distance and speed distance between each sub-trajectory;
    计算时间距离、空间距离以及速度距离所占权重;Calculate the weight of time distance, space distance and speed distance;
    根据时间距离、空间距离以及速度距离所占权重获取各子轨迹之间的综合距离;Obtain the comprehensive distance between each sub-trajectory based on the weight of time distance, space distance and speed distance;
    轨迹填补模块,用于根据综合距离采用聚类得填补缺失的轨迹。The trajectory filling module is used to fill in missing trajectories using clustering based on comprehensive distance.
PCT/CN2022/112691 2022-04-22 2022-08-16 Missing trajectory filling method and system WO2023201938A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210428947.7A CN114860700A (en) 2022-04-22 2022-04-22 Missing track filling method and system
CN202210428947.7 2022-04-22

Publications (1)

Publication Number Publication Date
WO2023201938A1 true WO2023201938A1 (en) 2023-10-26

Family

ID=82633030

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/112691 WO2023201938A1 (en) 2022-04-22 2022-08-16 Missing trajectory filling method and system

Country Status (2)

Country Link
CN (1) CN114860700A (en)
WO (1) WO2023201938A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117290364A (en) * 2023-11-24 2023-12-26 深圳市成为高科技有限公司 Intelligent market investigation data storage method

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114860700A (en) * 2022-04-22 2022-08-05 南京邮电大学 Missing track filling method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015110762A1 (en) * 2014-01-23 2015-07-30 Université Claude Bernard Lyon 1 Method for determining the trajectory of a point of a moving object
US20160377441A1 (en) * 2015-06-23 2016-12-29 International Business Machines Corporation Quantifying anomalous behavior by identifying anomalies at subtrajectories
CN106383868A (en) * 2016-09-05 2017-02-08 电子科技大学 Road network-based spatio-temporal trajectory clustering method
CN110991475A (en) * 2019-10-17 2020-04-10 中国科学院电子学研究所苏州研究院 Moving object track clustering method based on multi-dimensional distance measurement
CN111862659A (en) * 2020-06-30 2020-10-30 中冶智诚(武汉)工程技术有限公司 GPS track data matching and complementing method
CN112733890A (en) * 2020-12-28 2021-04-30 北京航空航天大学 Online vehicle track clustering method considering space-time characteristics
CN114860700A (en) * 2022-04-22 2022-08-05 南京邮电大学 Missing track filling method and system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11994863B2 (en) * 2019-12-03 2024-05-28 International Business Machines Corporation Trajectory similarity search
CN113009532A (en) * 2021-04-01 2021-06-22 南京理工大学 Moving track data completion method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015110762A1 (en) * 2014-01-23 2015-07-30 Université Claude Bernard Lyon 1 Method for determining the trajectory of a point of a moving object
US20160377441A1 (en) * 2015-06-23 2016-12-29 International Business Machines Corporation Quantifying anomalous behavior by identifying anomalies at subtrajectories
CN106383868A (en) * 2016-09-05 2017-02-08 电子科技大学 Road network-based spatio-temporal trajectory clustering method
CN110991475A (en) * 2019-10-17 2020-04-10 中国科学院电子学研究所苏州研究院 Moving object track clustering method based on multi-dimensional distance measurement
CN111862659A (en) * 2020-06-30 2020-10-30 中冶智诚(武汉)工程技术有限公司 GPS track data matching and complementing method
CN112733890A (en) * 2020-12-28 2021-04-30 北京航空航天大学 Online vehicle track clustering method considering space-time characteristics
CN114860700A (en) * 2022-04-22 2022-08-05 南京邮电大学 Missing track filling method and system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117290364A (en) * 2023-11-24 2023-12-26 深圳市成为高科技有限公司 Intelligent market investigation data storage method
CN117290364B (en) * 2023-11-24 2024-01-30 深圳市成为高科技有限公司 Intelligent market investigation data storage method

Also Published As

Publication number Publication date
CN114860700A (en) 2022-08-05

Similar Documents

Publication Publication Date Title
WO2023201938A1 (en) Missing trajectory filling method and system
CN107103754B (en) Road traffic condition prediction method and system
CN110796168A (en) Improved YOLOv 3-based vehicle detection method
Li et al. A general framework for unmet demand prediction in on-demand transport services
CN114627390B (en) Improved active learning remote sensing sample marking method
WO2021073526A1 (en) Trajectory data-based signal control period division method
US20220357176A1 (en) Methods and data processing systems for predicting road attributes
CN114925238B (en) Federal learning-based video clip retrieval method and system
CN113240265B (en) Urban space division method based on multi-mode traffic data
CN116776208B (en) Training method of seismic wave classification model, seismic wave selecting method, equipment and medium
CN110717493A (en) License plate recognition method containing stacked characters based on deep learning
CN115861738A (en) Category semantic information guided remote sensing target detection active sampling method
CN110135451B (en) Flight path clustering method based on distance from point to line segment set
CN111144462A (en) Unknown individual identification method and device for radar signals
CN109543922B (en) Time-period borrowing and returning amount prediction method for single-vehicle station group shared by piles
Sun et al. Decoupled feature pyramid learning for multi-scale object detection in low-altitude remote sensing images
CN113724293A (en) Vision-based intelligent internet public transport scene target tracking method and system
CN111858817B (en) BilSTM-CRF path inference method for sparse track
CN111899283B (en) Video target tracking method
Zhang et al. Strans-gan: Spatially-transferable generative adversarial networks for urban traffic estimation
CN110503663B (en) Random multi-target automatic detection tracking method based on frame extraction detection
CN114996659B (en) Vehicle track prediction method and system with continuous learning capability
CN113465616B (en) Track abnormal point detection method and device, electronic equipment and storage medium
CN113780092B (en) Crowd counting method based on block weak labeling
CN113379794B (en) Single-target tracking system and method based on attention-key point prediction model

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22938163

Country of ref document: EP

Kind code of ref document: A1