WO2023206904A1 - Pedestrian trajectory tracking method and system, and related apparatus - Google Patents

Pedestrian trajectory tracking method and system, and related apparatus Download PDF

Info

Publication number
WO2023206904A1
WO2023206904A1 PCT/CN2022/117148 CN2022117148W WO2023206904A1 WO 2023206904 A1 WO2023206904 A1 WO 2023206904A1 CN 2022117148 W CN2022117148 W CN 2022117148W WO 2023206904 A1 WO2023206904 A1 WO 2023206904A1
Authority
WO
WIPO (PCT)
Prior art keywords
pedestrian
frame
trajectory
target
candidate
Prior art date
Application number
PCT/CN2022/117148
Other languages
French (fr)
Chinese (zh)
Inventor
李晓川
李仁刚
赵雅倩
郭振华
范宝余
张润泽
王立
Original Assignee
苏州元脑智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 苏州元脑智能科技有限公司 filed Critical 苏州元脑智能科技有限公司
Publication of WO2023206904A1 publication Critical patent/WO2023206904A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30241Trajectory

Abstract

The present application relates to the field of image processing, and provides a pedestrian trajectory tracking method, comprising: obtaining image data; performing feature extraction on the image data, and constructing a candidate box relationship mask; extracting a historical frame feature set of a pedestrian trajectory library, and performing feature calculation with a candidate box in the candidate box relationship mask to obtain a human-box feature distance matrix and a box-human feature distance matrix; calculating a feature distance between the target pedestrian and the candidate box, and classifying, under the trajectory of the target pedestrian, a target candidate box satisfying that the feature distances between the target candidate box and the target pedestrian are minimum with respect to each other, until no detection box satisfying the condition exists in a current frame detection box; and updating the pedestrian track library, and outputting a pedestrian index set of the target pedestrian and a corresponding position trajectory. According to the present application, the problem of insufficient feature richness during pedestrian tracking can be effectively solved, and the pedestrian tracking detection precision is improved. The present application further provides a pedestrian trajectory tracking system, a computer readable storage medium and an electronic device, which achieve the beneficial effects above.

Description

一种行人轨迹跟踪方法、系统及相关装置A pedestrian trajectory tracking method, system and related devices
相关申请的交叉引用Cross-references to related applications
本申请要求于2022年04月30日提交中国专利局,申请号为202210469020.8,申请名称为“一种行人轨迹跟踪方法、系统及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to the Chinese patent application submitted to the China Patent Office on April 30, 2022, with the application number 202210469020.8, and the application title is "A pedestrian trajectory tracking method, system and related devices", the entire content of which is incorporated by reference in this application.
技术领域Technical field
本申请涉及图像处理领域,特别涉及一种行人轨迹跟踪方法、系统及相关装置。The present application relates to the field of image processing, and in particular to a pedestrian trajectory tracking method, system and related devices.
背景技术Background technique
一直以来,行人目标跟踪都是计算机视觉领域最重要的研究方向之一,由于较高的落地价值和实用性,行人目标跟踪受到各方面研究人员的重视。Pedestrian target tracking has always been one of the most important research directions in the field of computer vision. Due to its high implementation value and practicality, pedestrian target tracking has attracted the attention of researchers from all aspects.
多目标跟踪(MOT,Multiple Object Tracking)是目标跟踪领域一个较难的课题,现阶段,该领域从业研究者们通常会结合目标检测和度量学习来实现,通常,我们会采用目标检测算法进行行人定位,并将定位到的行人用度量学习提取特征,进而通过特征的匹配策略来实现同一行人轨迹的计算。但由于跟踪目标数量较多,现有策略会造成较多的漏帧(False Negative)现象及ID漂移(ID-Switch)现象。Multiple Object Tracking (MOT, Multiple Object Tracking) is a difficult topic in the field of target tracking. At this stage, researchers in this field usually combine target detection and metric learning to achieve it. Usually, we use target detection algorithms to detect pedestrians. Positioning, using metric learning to extract features of the located pedestrian, and then realizing the calculation of the trajectory of the same pedestrian through the feature matching strategy. However, due to the large number of tracking targets, the existing strategy will cause more frame missing (False Negative) and ID drift (ID-Switch) phenomena.
因此,发明人意识到,如何提高行人跟踪精度是本领域技术人员亟需解决的技术问题。Therefore, the inventor realized that how to improve pedestrian tracking accuracy is an urgent technical problem that needs to be solved by those skilled in the art.
发明内容Contents of the invention
根据本申请公开的各种实施例,提供一种行人轨迹跟踪方法、行人轨迹跟踪系统、计算机可读存储介质和电子设备。According to various embodiments disclosed in this application, a pedestrian trajectory tracking method, a pedestrian trajectory tracking system, a computer-readable storage medium, and an electronic device are provided.
一种行人轨迹跟踪方法,包括:获取图像数据;对图像数据进行特征提取,并根据提取的特征构建候选框关系掩膜;候选框关系掩膜中的数值表示当前帧的检测框与目标行人是否可形成合理的轨迹关系;提取行人轨迹库的历史帧特征集合,与候选框关系掩膜中的候选框进行特征计算,得到人框特征距离矩阵和框人特征距离矩阵;根据人框特 征距离矩阵和框人特征距离矩阵计算目标行人与候选框的特征距离,响应于存在目标候选框与目标行人的特征距离满足互为最小距离,将目标候选框归入目标行人的轨迹,直至当前帧检测框内无满足条件的检测框;以及更新行人轨迹库,输出目标行人的行人索引集合和对应的位置轨迹。A pedestrian trajectory tracking method includes: acquiring image data; performing feature extraction on the image data, and constructing a candidate frame relationship mask based on the extracted features; the value in the candidate frame relationship mask indicates whether the detection frame of the current frame is related to the target pedestrian A reasonable trajectory relationship can be formed; the historical frame feature set of the pedestrian trajectory library is extracted, and the feature calculation is performed with the candidate frames in the candidate frame relationship mask to obtain the person-frame feature distance matrix and the frame-person feature distance matrix; according to the person-frame feature distance matrix The feature distance matrix of the sum frame person calculates the feature distance between the target pedestrian and the candidate frame. In response to the existence of the feature distance between the target candidate frame and the target pedestrian that satisfies the minimum distance between each other, the target candidate frame is classified into the trajectory of the target pedestrian until the current frame detection frame There is no detection frame that meets the conditions; and the pedestrian trajectory library is updated, and the pedestrian index set and the corresponding position trajectory of the target pedestrian are output.
一种行人轨迹跟踪系统,包括:图像获取模块,用于获取图像数据;特征提取模块,用于对图像数据进行空间特征提取和外观特征提取,并根据提取的特征构建候选框关系掩膜;特征计算模块,用于提取行人轨迹库的历史帧特征集合,与候选框关系掩膜中的候选框进行特征计算,得到人框特征距离矩阵和框人特征距离矩阵;检测模块,用于根据人框特征距离矩阵和框人特征距离矩阵计算目标行人与候选框的特征距离,响应于存在目标候选框与目标行人的特征距离满足互为最小距离,将目标候选框归入目标行人的轨迹,直至当前帧检测框内无满足条件的检测框;以及轨迹更新模块,用于更新行人轨迹库,输出目标行人的行人索引集合和对应的位置轨迹。A pedestrian trajectory tracking system includes: an image acquisition module for acquiring image data; a feature extraction module for extracting spatial features and appearance features from the image data, and constructing a candidate frame relationship mask based on the extracted features; features The calculation module is used to extract the historical frame feature set of the pedestrian trajectory library, and performs feature calculation with the candidate frames in the candidate frame relationship mask to obtain the human frame feature distance matrix and the frame human feature distance matrix; the detection module is used to calculate the feature distance matrix based on the human frame The characteristic distance matrix and the framer characteristic distance matrix calculate the characteristic distance between the target pedestrian and the candidate frame. In response to the existence of the characteristic distance between the target candidate frame and the target pedestrian that satisfies the mutual minimum distance, the target candidate frame is classified into the trajectory of the target pedestrian until the current There is no detection frame that meets the conditions in the frame detection frame; and a trajectory update module is used to update the pedestrian trajectory library and output the pedestrian index set and the corresponding position trajectory of the target pedestrian.
一种非易失性计算机可读存储介质,其上存储有计算机可读指令,计算机可读指令被处理器执行时实现如上的方法的步骤。A non-volatile computer-readable storage medium has computer-readable instructions stored thereon. When the computer-readable instructions are executed by a processor, the steps of the above method are implemented.
一种电子设备,包括存储器,还包括一个或多个处理器,存储器中存有计算机可读指令,处理器调用存储器中的计算机可读指令时实现如上的方法的步骤。An electronic device includes a memory and one or more processors. Computer-readable instructions are stored in the memory. When the processor calls the computer-readable instructions in the memory, the steps of the above method are implemented.
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征和优点将从说明书、附图以及权利要求书变得明显。The details of one or more embodiments of the application are set forth in the accompanying drawings and the description below. Other features and advantages of the application will be apparent from the description, drawings, and claims.
附图说明Description of drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图。In order to explain the embodiments of the present application or the technical solutions in the prior art more clearly, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below. Obviously, the drawings in the following description are only This is an embodiment of the present application. For those of ordinary skill in the art, other drawings can be obtained based on the provided drawings without exerting creative efforts.
图1为根据一个或多个实施例中提供的一种行人轨迹跟踪方法的流程图;Figure 1 is a flow chart of a pedestrian trajectory tracking method provided in one or more embodiments;
图2为根据一个或多个实施例中提供的对图像数据进行特征提取,并进行特征构建候选框关系掩膜步骤的流程图;Figure 2 is a flowchart of the steps for extracting features from image data and constructing candidate frame relationship masks based on features provided in one or more embodiments;
图3为根据一个或多个实施例中候选框关系掩膜可视化示例图;Figure 3 is an example diagram of candidate box relationship mask visualization according to one or more embodiments;
图4为根据一个或多个实施例中行人轨迹跟踪系统结构示意图;Figure 4 is a schematic structural diagram of a pedestrian trajectory tracking system according to one or more embodiments;
图5为根据一个或多个实施例中电子设备结构框图。Figure 5 is a structural block diagram of an electronic device according to one or more embodiments.
具体实施方式Detailed ways
为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments These are part of the embodiments of this application, but not all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative efforts fall within the scope of protection of this application.
请参考图1,图1为根据一个或多个实施例中所提供的一种行人轨迹跟踪方法的流程图,该方法可以应用于电子设备。该方法包括:Please refer to FIG. 1 , which is a flow chart of a pedestrian trajectory tracking method provided in one or more embodiments. The method can be applied to electronic devices. The method includes:
S101:获取图像数据;S101: Obtain image data;
本步骤旨在获取图像数据,在此对于如何获取该图像数据不作限定,通常可以将路边摄像头采集的视频数据作为图像数据来源。响应于源数据为视频数据,可以对其进行图像帧处理,以得到本步骤所需要的图像数据。This step aims to obtain image data. There is no limitation on how to obtain the image data. Video data collected by roadside cameras can usually be used as the source of image data. In response to the fact that the source data is video data, image frame processing can be performed on it to obtain the image data required in this step.
S102:对图像数据进行特征提取,并根据提取的特征构建候选框关系掩膜;S102: Extract features from the image data and construct a candidate frame relationship mask based on the extracted features;
本步骤旨在对图像数据进行特征提取,从而构建候选框关系掩膜。候选框关系掩膜中的数值表示当前帧的检测框与目标行人是否可形成合理的轨迹关系。本步骤的执行对象为步骤S101中获取的图像数据,其可以逐帧按照本步骤进行处理。This step aims to extract features from the image data to construct a candidate box relationship mask. The value in the candidate frame relationship mask indicates whether the detection frame of the current frame and the target pedestrian can form a reasonable trajectory relationship. The execution object of this step is the image data obtained in step S101, which can be processed frame by frame according to this step.
在本申请的某些实施例中,如图2所示,该步骤的对图像数据进行特征提取,并根据提取的特征构建候选框关系掩膜可以包括如下步骤:In some embodiments of the present application, as shown in Figure 2, this step of extracting features from the image data and constructing a candidate frame relationship mask based on the extracted features may include the following steps:
S1021:利用第一网络模型对图像数据中的图像帧进行目标预测,得到第一检测结果;其中,第一检测结果包含各行人的坐标框位置信息和行人数量;S1021: Use the first network model to perform target prediction on the image frames in the image data to obtain the first detection result; wherein the first detection result includes the coordinate frame position information of each pedestrian and the number of pedestrians;
S1022:利用第二网络模型提取坐标框中的特征,得到特征集合;S1022: Use the second network model to extract features in the coordinate frame and obtain a feature set;
S1023:根据轨迹预测公式计算各行人在当前时刻的轨迹预测坐标;S1023: Calculate the trajectory prediction coordinates of each pedestrian at the current moment according to the trajectory prediction formula;
S1024:根据各行人的轨迹预测坐标和坐标框位置信息确定行人在当前时刻的空间可行范围;S1024: Determine the spatial feasible range of the pedestrian at the current moment based on the trajectory prediction coordinates and coordinate frame position information of each pedestrian;
S1025:根据空间可行范围生成各行人对应的候选框关系掩膜;候选框关系掩膜中的数值表示当前帧的检测框与行人是否可形成合理的轨迹关系。S1025: Generate a candidate frame relationship mask corresponding to each pedestrian based on the spatial feasible range; the value in the candidate frame relationship mask indicates whether the detection frame of the current frame and the pedestrian can form a reasonable trajectory relationship.
可以利用双阶段检测器或者单阶段检测器训练行人检测网络,并将训练数据集中行人框标签以及图片输入至行人检测网络,调整网络参数,得到第一网络模型。第一网络模型主要用于执行行人检测,可以将训练数据集中行人框标签及图片输入至行人检测网络,并利用双阶段检测器或者单阶段检测器对行人检测网络进行训练,得到第一网络模 型。The pedestrian detection network can be trained using a dual-stage detector or a single-stage detector, and the pedestrian frame labels and images in the training data set are input to the pedestrian detection network, and the network parameters are adjusted to obtain the first network model. The first network model is mainly used to perform pedestrian detection. The pedestrian frame labels and pictures in the training data set can be input to the pedestrian detection network, and the pedestrian detection network can be trained using a dual-stage detector or a single-stage detector to obtain the first network model. .
而对于第二网络模型,可以基于行人重识别模式(Person Re-Identification)进行模型训练,裁剪训练数据集中的行人框,得到第二网络模型。第二网络模型主要为度量学习网络训练模型。For the second network model, model training can be performed based on the person re-identification mode (Person Re-Identification), and the pedestrian frames in the training data set can be cropped to obtain the second network model. The second network model is mainly a metric learning network training model.
具体的,下文以相关公式对上述过程进行说明:Specifically, the above process is explained below with relevant formulas:
对第i帧图像进行目标检测,使用训练好的第一网络模型预测检测结果,获得检测结果D i={p 1,p 2,…,p m},p j表示每个行人的坐标框位置[x1,y1,x2,y2],及目标框左上角和右下角的坐标,m表示当前帧所检测出的行人数量,这里的检测结果对应上述S1021中的第一检测结果。 Perform target detection on the i-th frame image, use the trained first network model to predict the detection result, and obtain the detection result D i = {p 1 , p 2 ,..., p m }, p j represents the coordinate box position of each pedestrian [x1, y1, x2, y2], and the coordinates of the upper left corner and lower right corner of the target frame, m represents the number of pedestrians detected in the current frame, and the detection result here corresponds to the first detection result in S1021 above.
此后执行行人目标特征提取。使用训练好的第二网络模型对m个预测出的行人检测框提取特征。可获得F i={f 1,f 2,…,f m}。 Pedestrian target feature extraction is then performed. Use the trained second network model to extract features from the m predicted pedestrian detection frames. F i ={f 1 , f 2 ,..., f m } can be obtained.
再进行空间条件限制计算:对行人轨迹库T中r个行人进行轨迹预测,其中x,y分别表示目标行人在二维图像上的坐标,k表示目标行人在T中的索引,而lqx(*)表示用来拟合轨迹曲线的最小二乘法公式。根据轨迹预测公式计算得到每个行人在当前时刻的轨迹预测坐标(x t+1,k,y t+1,k),即每个行人在当前时刻的下一时刻的轨迹坐标,其中,y t+1,k表示第k个目标行人的纵向预测坐标值,x t+1,k表示第k个目标行人的横向预测坐标值。这里,轨迹预测公式可以采用最小二乘法公式,但也不限于这种方式。其中,最小二乘法公式如下: Then perform spatial condition restriction calculation: predict the trajectory of r pedestrians in the pedestrian trajectory library T, where x and y respectively represent the coordinates of the target pedestrian on the two-dimensional image, k represents the index of the target pedestrian in T, and lqx(* ) represents the least squares formula used to fit the trajectory curve. According to the trajectory prediction formula, the trajectory prediction coordinates (x t+1, k , y t+1, k ) of each pedestrian at the current moment are calculated, that is, the trajectory coordinates of each pedestrian at the next moment at the current moment, where, y t+1, k represents the longitudinal predicted coordinate value of the k-th target pedestrian, and x t+1, k represents the lateral predicted coordinate value of the k-th target pedestrian. Here, the trajectory prediction formula can use the least squares formula, but it is not limited to this method. Among them, the least squares formula is as follows:
x t+1,k=lqx({x t,k|t∈T,k∈[1,...,r]}) x t+1,k =lqx({x t,k |t∈T,k∈[1,...,r]})
y t+1,k=lqx({y t,k|t∈T,k∈[1,...,r]}) y t+1,k =lqx({y t,k |t∈T,k∈[1,...,r]})
式中,lqx(*)为用来拟合轨迹曲线的最小二乘法公式,(x t,k,y t,k)表示第k个目标行人在当前时刻的轨迹坐标,其中,x t,k表示第k个目标行人在当前时刻的横坐标值,y t,k表示第k个目标行人在当前时刻的纵坐标值。 In the formula, lqx(*) is the least squares formula used to fit the trajectory curve, (x t,k ,y t,k ) represents the trajectory coordinates of the k-th target pedestrian at the current moment, where, x t,k represents the abscissa value of the k-th target pedestrian at the current moment, and y t,k represents the ordinate value of the k-th target pedestrian at the current moment.
之后,对每个行人k,根据其轨迹预测坐标(x t+1,k,y t+1,k)与其最终时刻的目标框大小[w t,k,h t,k],计算其在当前时刻的空间可行范围 After that, for each pedestrian k, according to its trajectory prediction coordinates (x t+1, k , y t+1, k ) and its final moment target box size [w t, k , h t, k ], calculate its The feasible range of space at the current moment
Figure PCTCN2022117148-appb-000001
Figure PCTCN2022117148-appb-000001
S t,kS t, k =
{(x,y)|x∈[x t+1,kw×w t,k,x t+1,kw×w t,k], {(x, y)|x∈[x t+1,kw ×w t,k ,x t+1,kw ×w t,k ],
y∈[y t+1,kh×h t,k,y t+1,kh×h t,k]} y∈[y t+1, kh ×h t, k , y t+1, kh ×h t, k ]}
其中,λ是目标框的扩展系数,相应地,λ h表示目标框的纵向扩展系数,λ w表示目标框的横向扩展系数,thresh是一个阈值,是一个可以设置的参数,用来限制“可行范围”,某框(即其中一个框)s与目标框S t,k之间的重合程度要大于这个值。S为所有符合条件的s的集合,即“可行范围”。 Among them, λ is the expansion coefficient of the target frame. Correspondingly, λ h represents the vertical expansion coefficient of the target frame, λ w represents the horizontal expansion coefficient of the target frame, and thresh is a threshold, which is a parameter that can be set to limit the "feasible"Range", the degree of overlap between a certain frame (i.e. one of the frames) s and the target frame S t, k must be greater than this value. S is the set of all s that meet the conditions, that is, the "feasible range".
此后可生成候选框关系掩膜,对当前帧的检测结果结合及T中r个行人的空间条件限制,计算T中每个行人的候选框关系掩膜M。在此对于候选框关系掩膜的表示方式不作限定,一种可行的方式,该候选框关系掩膜为包含0和1的M*N矩阵,其中,M为行人检测框的数量,N为行人数量。Afterwards, the candidate frame relationship mask can be generated, and the detection results of the current frame are combined with the spatial condition constraints of r pedestrians in T to calculate the candidate frame relationship mask M for each pedestrian in T. There is no limitation on the representation method of the candidate frame relationship mask. A feasible method is that the candidate frame relationship mask is an M*N matrix containing 0 and 1, where M is the number of pedestrian detection frames and N is the pedestrian. quantity.
可以获得所有行人(假设N个人)在当前帧的可行范围(共N个S),此外还有当前帧检测到的所有行人检测框(假设M个b),通过比对每个b是否属于每个S,可以获得一个M×N的矩阵,矩阵中第i行第j列个元素表示第i个检测框有可能作为行人库里第j个行人候选框的情况。参见图3,图3为根据一个或多个实施例中候选框关系掩膜可视化示例图,图2中1表示真、0表示假,而1表示相关行人之间存在潜在的候选关系。当然也可以采用其他方式标识候选框关系掩膜,在此不一一举例限定,例如采用1表示真,-1表示假等等。We can obtain the feasible range of all pedestrians (assuming N people) in the current frame (a total of N S), in addition to all pedestrian detection frames detected in the current frame (assuming M b), by comparing whether each b belongs to each S, an M×N matrix can be obtained. The elements in the i-th row and j-th column of the matrix represent the situation where the i-th detection frame may be used as the j-th pedestrian candidate frame in the pedestrian library. Referring to Figure 3, Figure 3 is an example diagram of candidate box relationship mask visualization according to one or more embodiments. In Figure 2, 1 represents true, 0 represents false, and 1 represents the existence of potential candidate relationships between related pedestrians. Of course, other methods can also be used to identify the candidate frame relationship mask, which are not limited to examples here. For example, 1 represents true, -1 represents false, and so on.
S103:提取行人轨迹库的历史帧特征集合,与候选框关系掩膜中的候选框进行特征计算,得到人框特征距离矩阵和框人特征距离矩阵;S103: Extract the historical frame feature set from the pedestrian trajectory database, perform feature calculations with the candidate frames in the candidate frame relationship mask, and obtain the person frame feature distance matrix and the frame person feature distance matrix;
本步骤旨在特征提取和特征距离计算。具体的,可以将行人轨迹库中每个行人的历史帧特征集合提取出来,分别与其当前帧的候选框进行余弦距离运算得到人框特征距离矩阵Disttd,此外,当前帧候选框分别与其存在候选关系的行人进行特征距离运算,得到框人距离矩阵Distdt。Disttd表示库中t个行人与当前帧d个候选框之间的距离矩阵,反之Distdt表示候选框与行人的距离矩阵,在“匹配”的过程中,对于第i个行人和第j个框,当且仅当Disttd[i,j]==min(Disttd[i,*])且Distdt[j,i]==min(Distdt[j,*])时,将候选框j分配给行人i。函数min(array)表示数组array的最小值;matric[j,*]表示矩阵matric的第j行数组。This step is aimed at feature extraction and feature distance calculation. Specifically, the historical frame feature set of each pedestrian in the pedestrian trajectory library can be extracted, and the cosine distance operation is performed on the candidate frame of the current frame to obtain the human frame feature distance matrix Disttd. In addition, the candidate frame of the current frame has a candidate relationship with it. Perform feature distance calculation on the pedestrians to obtain the frame-person distance matrix Distdt. Disttd represents the distance matrix between t pedestrians in the library and d candidate boxes in the current frame. Conversely, Distdt represents the distance matrix between the candidate boxes and pedestrians. In the "matching" process, for the i-th pedestrian and j-th box, If and only if Disttd[i,j]==min(Disttd[i,*]) and Distdt[j,i]==min(Distdt[j,*]), candidate box j is assigned to pedestrian i. The function min(array) represents the minimum value of the array array; matric[j,*] represents the j-th row array of the matrix matric.
S104:响应于目标行人与目标候选框的特征距离满足互为最小距离,将目标候选框归入目标行人的轨迹,直至当前帧检测框内无满足条件的检测框;S104: In response to the feature distance between the target pedestrian and the target candidate frame being the minimum distance from each other, classify the target candidate frame into the trajectory of the target pedestrian until there is no detection frame that meets the conditions in the current frame detection frame;
响应于行人k与当前帧候选框p的特征距离满足相互为最小距离的召回条件,将候选框p归入行人k的轨迹。重复此操作直至当前帧检测框Di中再无满足条件的目标,将这部分框设置为新出现的行人,存入行人轨迹库中。In response to the characteristic distance between pedestrian k and the candidate frame p of the current frame satisfying the recall condition of being the minimum distance to each other, the candidate frame p is classified into the trajectory of pedestrian k. Repeat this operation until there is no target that meets the conditions in the current frame detection frame Di. This part of the frame is set as a new pedestrian and stored in the pedestrian trajectory library.
S105:更新行人轨迹库,输出目标行人的行人索引集合和对应的位置轨迹。S105: Update the pedestrian trajectory database and output the pedestrian index set and the corresponding position trajectory of the target pedestrian.
将匹配后的行人框及其位置、特征存入相关行人中,更新其位置信息和特征队列,最后可以输出目标行人的行人索引集合和对应的位置轨迹。The matched pedestrian frame, its position and characteristics are stored in the relevant pedestrian, its position information and feature queue are updated, and finally the pedestrian index set and the corresponding position trajectory of the target pedestrian can be output.
在此对于何时构建行人轨迹库不作具体限定,只要求在应用过程中存在相应的数据库或者数据队列即可。也可以在执行本实施例前先构建行人轨迹库,行人轨迹库包含各行人的历史位置和各历史位置时的特征信息。There is no specific limit on when to build the pedestrian trajectory database. It only requires that a corresponding database or data queue exists during the application process. It is also possible to construct a pedestrian trajectory database before executing this embodiment. The pedestrian trajectory database includes the historical location of each pedestrian and the characteristic information of each historical location.
本申请实施例通过对图像数据进行特征提取,构建候选框关系掩膜,用于对行人和图像数据中识别到的检测框进行轨迹关系的合理性判断,进而通过比对行人轨迹库中的历史帧特征集合,进行特征距离的计算,以确定目标行人和目标候选框,从而确定图像数据中的行人轨迹,实现行人轨迹跟踪。本申请能够有效解决超大场景下空间距离加权导致的行人索引置换问题,以及行人跟踪过程中特征丰富性不足从而导致对度量学习模型依赖程度过高的问题,提高了行人跟踪检测精度。The embodiment of the present application constructs a candidate frame relationship mask by extracting features from the image data, which is used to judge the rationality of the trajectory relationship between pedestrians and detection frames identified in the image data, and then compares the history in the pedestrian trajectory database. The frame feature set is used to calculate the feature distance to determine the target pedestrian and the target candidate frame, thereby determining the pedestrian trajectory in the image data and realizing pedestrian trajectory tracking. This application can effectively solve the problem of pedestrian index replacement caused by spatial distance weighting in extremely large scenes, as well as the problem of insufficient feature richness in the pedestrian tracking process, which leads to excessive dependence on the metric learning model, and improves the accuracy of pedestrian tracking detection.
应该理解的是,虽然图1、2的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图1、2中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flowcharts of Figures 1 and 2 are shown in sequence as indicated by the arrows, these steps are not necessarily executed in the order indicated by the arrows. Unless explicitly stated in this article, there is no strict order restriction on the execution of these steps, and these steps can be executed in other orders. Moreover, at least some of the steps in Figures 1 and 2 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. These sub-steps or stages The order of execution is not necessarily sequential, but may be performed in turn or alternately with other steps or sub-steps of other steps or at least part of the stages.
下面对本申请实施例提供的行人轨迹跟踪系统进行介绍,下文描述的行人轨迹跟踪系统与上文描述的行人轨迹跟踪方法可相互对应参照。The pedestrian trajectory tracking system provided by the embodiment of the present application is introduced below. The pedestrian trajectory tracking system described below and the pedestrian trajectory tracking method described above can be mutually referenced.
参见图4,图4为根据一个或多个实施例中的行人轨迹跟踪系统结构示意图,本申请还提供一种行人轨迹跟踪系统,包括图像获取模块、特征提取模块、特征计算模块、检测模块和轨迹更新模块,其中:Referring to Figure 4, Figure 4 is a schematic structural diagram of a pedestrian trajectory tracking system according to one or more embodiments. This application also provides a pedestrian trajectory tracking system, including an image acquisition module, a feature extraction module, a feature calculation module, a detection module and Track update module, including:
图像获取模块,用于获取图像数据;Image acquisition module, used to acquire image data;
特征提取模块,用于对图像数据进行空间特征提取和外观特征提取,并根据提取的特征构建候选框关系掩膜;The feature extraction module is used to extract spatial features and appearance features from image data, and construct a candidate frame relationship mask based on the extracted features;
特征计算模块,用于提取行人轨迹库的历史帧特征集合,与候选框关系掩膜中的候选框进行特征计算,得到人框特征距离矩阵和框人特征距离矩阵;The feature calculation module is used to extract the historical frame feature set of the pedestrian trajectory library, perform feature calculation with the candidate frames in the candidate frame relationship mask, and obtain the person frame feature distance matrix and the frame person feature distance matrix;
检测模块,用于根据人框特征距离矩阵和框人特征距离矩阵计算目标行人与候选框的特征距离,响应于存在目标候选框与目标行人的特征距离满足互为最小距离,将目标候选框归入目标行人的轨迹,直至当前帧检测框内无满足条件的检测框;The detection module is used to calculate the characteristic distance between the target pedestrian and the candidate frame based on the human frame characteristic distance matrix and the frame human characteristic distance matrix. In response to the existence of the characteristic distance between the target candidate frame and the target pedestrian that satisfies the mutual minimum distance, the target candidate frame is classified into Enter the trajectory of the target pedestrian until there is no detection frame that meets the conditions in the current frame detection frame;
轨迹更新模,用于更新行人轨迹库,输出目标行人的行人索引集合和对应的位置轨迹。The trajectory update module is used to update the pedestrian trajectory library and output the pedestrian index set and the corresponding position trajectory of the target pedestrian.
在本申请的某些实施例中,人轨迹跟踪系统还可以包括轨迹库构建模块,该轨迹库构建模块用于构建行人轨迹库;行人轨迹库包含各行人的历史位置和行人在各历史位置时的特征信息。In some embodiments of the present application, the human trajectory tracking system may also include a trajectory library building module, which is used to construct a pedestrian trajectory library; the pedestrian trajectory library includes the historical location of each pedestrian and the time the pedestrian is at each historical location. characteristic information.
在本申请的某些实施例中,特征提取模块包括第一特征提取单元、第二特征提取单元、轨迹计算单元、空间预测单元以及候选框关系掩膜生成单元,其中:In some embodiments of the present application, the feature extraction module includes a first feature extraction unit, a second feature extraction unit, a trajectory calculation unit, a spatial prediction unit, and a candidate frame relationship mask generation unit, wherein:
第一特征提取单元,用于利用第一网络模型对图像数据中的图像帧进行目标预测,得到第一检测结果;其中,第一检测结果包含各行人的坐标框位置信息和行人数量;The first feature extraction unit is used to use the first network model to perform target prediction on the image frames in the image data to obtain the first detection result; wherein the first detection result includes the coordinate frame position information of each pedestrian and the number of pedestrians;
第二特征提取单元,用于利用第二网络模型提取坐标框中的特征,得到特征集合;The second feature extraction unit is used to extract features in the coordinate frame using the second network model to obtain a feature set;
轨迹计算单元,用于根据轨迹预测公式计算各行人在当前时刻的轨迹预测坐标;The trajectory calculation unit is used to calculate the trajectory prediction coordinates of each pedestrian at the current moment according to the trajectory prediction formula;
空间预测单元,用于根据各行人的轨迹预测坐标和坐标框位置信息确定行人在当前时刻的空间可行范围;The spatial prediction unit is used to determine the spatial feasible range of the pedestrian at the current moment based on the trajectory prediction coordinates and coordinate frame position information of each pedestrian;
候选框关系掩膜生成单元,用于根据空间可行范围生成各行人对应的候选框关系掩膜;候选框关系掩膜中的数值表示当前帧的检测框与行人是否可形成合理的轨迹关系。The candidate frame relationship mask generation unit is used to generate the candidate frame relationship mask corresponding to each pedestrian based on the spatial feasible range; the value in the candidate frame relationship mask indicates whether the detection frame of the current frame and the pedestrian can form a reasonable trajectory relationship.
在本申请的某些实施例中,行人轨迹跟踪系统还包括第一网络模型生成模块,该第一网络模型生成模块用于将训练数据集中行人框标签及图片输入至行人检测网络,并利用双阶段检测器或者单阶段检测器对行人检测网络进行训练,得到第一网络模型。In some embodiments of the present application, the pedestrian trajectory tracking system also includes a first network model generation module. The first network model generation module is used to input pedestrian frame labels and pictures in the training data set into the pedestrian detection network, and uses dual The stage detector or single stage detector trains the pedestrian detection network to obtain the first network model.
在本申请的某些实施例中,行人轨迹跟踪系统还包括第二网络模型生成模块,该第二网络模型生成模块,用于基于行人重识别模式进行模型训练,裁剪训练数据集中的行人框,得到第二网络模型。In some embodiments of the present application, the pedestrian trajectory tracking system also includes a second network model generation module, which is used to perform model training based on the pedestrian re-identification mode and crop the pedestrian frame in the training data set, Obtain the second network model.
关于行人轨迹跟踪系统的具体限定可以参见上文中对于行人轨迹跟踪方法的限定,在此不再赘述。上述行人轨迹跟踪系统中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For specific limitations on the pedestrian trajectory tracking system, please refer to the limitations on the pedestrian trajectory tracking method mentioned above, which will not be described again here. Each module in the above-mentioned pedestrian trajectory tracking system can be implemented in whole or in part through software, hardware, and combinations thereof. Each of the above modules may be embedded in or independent of the processor of the computer device in the form of hardware, or may be stored in the memory of the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
本申请还提供了一种非易失性计算机可读存储介质,其上存有计算机可读指令,该计算机可读指令被执行时可以实现上述任一实施例所提供的步骤。该非易失性计算机可读存储介质可以包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。This application also provides a non-volatile computer-readable storage medium on which computer-readable instructions are stored. When executed, the computer-readable instructions can implement the steps provided in any of the above embodiments. The non-volatile computer-readable storage medium can include: U disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk, etc. A medium on which program code can be stored.
本申请还提供了一种电子设备,如图5所示,可以包括存储器,还包括一个或多个处理器,存储器中存有计算机可读指令,处理器调用存储器中的计算机可读指令时,可以实现上述任一实施例所提供的步骤。This application also provides an electronic device, as shown in Figure 5, which may include a memory and one or more processors. Computer-readable instructions are stored in the memory. When the processor calls the computer-readable instructions in the memory, The steps provided in any of the above embodiments can be implemented.
其中,处理器可以采用数字信号处理DSP(Digital Signal Processing)、现场可编程门阵列FPGA(Field-Programmable Gate Array)、可编程逻辑阵列PLA(Programmable Logic Array)中的至少一种硬件形式来实现。处理器也可以包括主处理器和协处理器,主处理器是用于对在唤醒状态下的数据进行处理的处理器,也称中央处理器CPU(Central Processing Unit);协处理器是用于对在待机状态下的数据进行处理的低功耗处理器。在一些实施例中,处理器可以集成有图像处理器GPU(Graphics Processing Unit),GPU用于负责显示屏所需要显示的内容的渲染和绘制。在本申请的某些实施例中,处理器还可以包括人工智能AI(Artificial Intelligence)处理器,该AI处理器用于处理有关机器学习的计算操作。Among them, the processor can be implemented in at least one hardware form among digital signal processing DSP (Digital Signal Processing), field programmable gate array FPGA (Field-Programmable Gate Array), and programmable logic array PLA (Programmable Logic Array). The processor can also include a main processor and a co-processor. The main processor is used to process data in the wake-up state, also called the central processing unit (CPU); the co-processor is used to process data in the wake-up state. A low-power processor that processes data in standby mode. In some embodiments, the processor may be integrated with a graphics processor GPU (Graphics Processing Unit), which is responsible for rendering and drawing content to be displayed on the display screen. In some embodiments of the present application, the processor may also include an artificial intelligence (AI) processor, which is used to process computing operations related to machine learning.
存储器可以包括一个或多个计算机可读存储介质,该计算机可读存储介质可以是非易失性的。存储器还可包括高速随机存取存储器,以及非易失性存储器,比如一个或多个磁盘存储设备、闪存存储设备。本实施例中,存储器至少用于存储以下计算机可读指令,其中,该计算机可读指令被处理器加载并执行之后,能够实现前述任一实施例公开的行人轨迹跟踪方法中的相关步骤。另外,存储器所存储的资源还可以包括操作系统和数据等,存储方式可以是短暂存储或者永久存储。数据可以包括但不限于上述方法所涉及到的数据。Memory may include one or more computer-readable storage media, which may be non-volatile. Memory may also include high-speed random access memory, and non-volatile memory, such as one or more disk storage devices, flash memory storage devices. In this embodiment, the memory is at least used to store the following computer-readable instructions. After the computer-readable instructions are loaded and executed by the processor, the relevant steps in the pedestrian trajectory tracking method disclosed in any of the foregoing embodiments can be implemented. In addition, the resources stored in the memory may also include operating systems and data, and the storage method may be short-term storage or permanent storage. The data may include but is not limited to the data involved in the above methods.
在本申请的某些实施例中,当然电子设备还可以包括各种网络接口,电源、显示屏、电源、输入输出接口、传感器以及通信总线等组件中的部分组件或者全部组件。In some embodiments of the present application, of course the electronic device may also include various network interfaces, some or all of the components such as power supplies, display screens, power supplies, input and output interfaces, sensors, and communication buses.
说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似部分互相参见即可。对于实施例提供的系统而言,由于其与实施例提供的方法相对应,所以描述的比较简单,相关之处参见方法部分说明即可。Each embodiment in the specification is described in a progressive manner. Each embodiment focuses on its differences from other embodiments. The same and similar parts between the various embodiments can be referred to each other. As for the system provided in the embodiment, since it corresponds to the method provided in the embodiment, the description is relatively simple. For relevant details, please refer to the description in the method section.
本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想。应当指出,对于本技术领域的普通技术人员来说,在不脱离本申请原理的前提下,还可以对本申请进行若干改进和修饰,这些改进和修饰也落入本申请权利要求的保护范围内。This article uses specific examples to illustrate the principles and implementation methods of this application. The description of the above embodiments is only used to help understand the method and its core idea of this application. It should be noted that for those of ordinary skill in the art, several improvements and modifications can be made to the present application without departing from the principles of the present application, and these improvements and modifications also fall within the protection scope of the claims of the present application.
还需要说明的是,在本说明书中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作 之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括要素的过程、方法、物品或者设备中还存在另外的相同要素。It should also be noted that in this specification, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that these entities or operations There is no such actual relationship or sequence between operations. Furthermore, the terms "comprises," "comprises," or any other variations thereof are intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus that includes a list of elements includes not only those elements, but also those not expressly listed other elements, or elements inherent to the process, method, article or equipment. Without further limitation, an element qualified by the statement "comprises a..." does not exclude the presence of additional identical elements in the process, method, article, or device that includes the element.

Claims (15)

  1. 一种行人轨迹跟踪方法,其特征在于,包括:A pedestrian trajectory tracking method, characterized by including:
    获取图像数据;Get image data;
    对所述图像数据进行特征提取,并根据提取的特征构建候选框关系掩膜;所述候选框关系掩膜中的数值表示当前帧的检测框与目标行人是否可形成合理的轨迹关系;Perform feature extraction on the image data, and construct a candidate frame relationship mask based on the extracted features; the value in the candidate frame relationship mask indicates whether the detection frame of the current frame and the target pedestrian can form a reasonable trajectory relationship;
    提取行人轨迹库的历史帧特征集合,与所述候选框关系掩膜中的候选框进行特征计算,得到人框特征距离矩阵和框人特征距离矩阵;及Extract the historical frame feature set from the pedestrian trajectory library, perform feature calculations with the candidate frames in the candidate frame relationship mask, and obtain the person frame feature distance matrix and the frame person feature distance matrix; and
    根据所述人框特征距离矩阵和所述框人特征距离矩阵计算所述目标行人与所述候选框的特征距离,响应于存在目标候选框与所述目标行人的特征距离满足互为最小距离,将所述目标候选框归入所述目标行人的轨迹,直至当前帧检测框内无满足条件的检测框;Calculate the characteristic distance between the target pedestrian and the candidate frame according to the person frame characteristic distance matrix and the frame person characteristic distance matrix, in response to the existence of the characteristic distance between the target candidate frame and the target pedestrian that satisfies the mutual minimum distance, Classify the target candidate frame into the trajectory of the target pedestrian until there is no detection frame that meets the conditions in the current frame detection frame;
    更新所述行人轨迹库,输出所述目标行人的行人索引集合和对应的位置轨迹。The pedestrian trajectory database is updated, and the pedestrian index set and the corresponding position trajectory of the target pedestrian are output.
  2. 根据权利要求1所述的行人轨迹跟踪方法,其特征在于,还包括:The pedestrian trajectory tracking method according to claim 1, further comprising:
    构建所述行人轨迹库;所述行人轨迹库包含各行人的历史位置和所述行人在各所述历史位置时的特征信息。The pedestrian trajectory database is constructed; the pedestrian trajectory database includes the historical location of each pedestrian and the characteristic information of the pedestrian at each historical location.
  3. 根据权利要求1或2所述的行人轨迹跟踪方法,其特征在于,对所述图像数据进行特征提取,并根据提取的特征构建候选框关系掩膜包括:The pedestrian trajectory tracking method according to claim 1 or 2, characterized in that performing feature extraction on the image data and constructing a candidate frame relationship mask based on the extracted features includes:
    利用第一网络模型对所述图像数据中的图像帧进行目标预测,得到第一检测结果;其中,所述第一检测结果包含各行人的坐标框位置信息和行人数量;Using the first network model to perform target prediction on the image frames in the image data, the first detection result is obtained; wherein the first detection result includes the coordinate frame position information of each pedestrian and the number of pedestrians;
    利用第二网络模型提取所述坐标框中的特征,得到特征集合;Use the second network model to extract features in the coordinate frame to obtain a feature set;
    根据轨迹预测公式计算各所述行人在当前时刻的轨迹预测坐标;Calculate the trajectory prediction coordinates of each pedestrian at the current moment according to the trajectory prediction formula;
    根据各所述行人的所述轨迹预测坐标和所述坐标框位置信息确定所述行人在当前时刻的空间可行范围;及Determine the spatial feasible range of the pedestrian at the current moment based on the trajectory prediction coordinates of each pedestrian and the coordinate frame position information; and
    根据所述空间可行范围生成各所述行人对应的候选框关系掩膜。A candidate frame relationship mask corresponding to each pedestrian is generated according to the spatial feasible range.
  4. 根据权利要求3所述的行人轨迹跟踪方法,其特征在于,所述根据轨迹预测公式计算各所述行人在当前时刻的轨迹预测坐标,包括:The pedestrian trajectory tracking method according to claim 3, characterized in that calculating the trajectory prediction coordinates of each pedestrian at the current moment according to the trajectory prediction formula includes:
    根据最小二乘法公式计算各所述行人在当前时刻的轨迹预测坐标。The predicted trajectory coordinates of each pedestrian at the current moment are calculated according to the least squares formula.
  5. 根据权利要求3或4所述的行人轨迹跟踪方法,其特征在于,所述根据各所述行人的所述轨迹预测坐标和所述坐标框位置信息确定所述行人在当前时刻的空间可行范围,包括:The pedestrian trajectory tracking method according to claim 3 or 4, characterized in that the spatial feasible range of the pedestrian at the current moment is determined based on the trajectory prediction coordinates of each pedestrian and the coordinate frame position information, include:
    根据以下公式计算行人k在当前时刻的空间可行范围;Calculate the spatial feasible range of pedestrian k at the current moment according to the following formula;
    Figure PCTCN2022117148-appb-100001
    Figure PCTCN2022117148-appb-100001
    S t,k={(x,y)|x∈[x t+1,kw×w t,k,x t+1,kw×w t,k] S t,k ={(x,y)|x∈[x t+1,kw ×w t,k ,x t+1,kw ×w t,k ]
    y∈[y t+1,kh×h t,k,y t+1,kh×h t,k]} y∈[y t+1, kh ×h t, k , y t+1, kh ×h t, k ]}
    其中,thresh为设置的阈值参数,s表示其中一个框,S t,k表示目标框,(x,y)表示目标行人的坐标,(x t+1,k,y t+1,k)表示第k个目标行人的轨迹预测坐标,[w t+1,k,h t+1,k]表示目标框大小,λ h表示目标框的纵向扩展系数,λ w表示目标框的横向扩展系数,y t+1,k表示第k个目标行人的纵向预测坐标值,x t+1,k表示第k个目标行人的横向预测坐标值,S为符合条件的框的集合。 Among them, thresh is the set threshold parameter, s represents one of the boxes, S t, k represents the target frame, (x, y) represents the coordinates of the target pedestrian, (x t+1,k ,y t+1,k ) represents The trajectory prediction coordinates of the k-th target pedestrian, [w t+1,k ,h t+1,k ] represents the size of the target frame, λ h represents the longitudinal expansion coefficient of the target frame, λ w represents the horizontal expansion coefficient of the target frame, y t+1, k represents the longitudinal predicted coordinate value of the k-th target pedestrian, x t+1, k represents the lateral predicted coordinate value of the k-th target pedestrian, and S is a set of boxes that meet the conditions.
  6. 根据权利要求3至5任意一项所述的行人轨迹跟踪方法,其特征在于,利用第一网络模型对所述图像数据中的图像帧进行目标预测,得到第一检测结果之前,还包括:The pedestrian trajectory tracking method according to any one of claims 3 to 5, characterized in that, before using the first network model to perform target prediction on the image frames in the image data and obtaining the first detection result, it also includes:
    将训练数据集中行人框标签及图片输入至行人检测网络,并利用双阶段检测器或者单阶段检测器对所述行人检测网络进行训练,得到所述第一网络模型。Input the pedestrian frame labels and pictures in the training data set to the pedestrian detection network, and use a dual-stage detector or a single-stage detector to train the pedestrian detection network to obtain the first network model.
  7. 根据权利要求3至6任意一项所述的行人轨迹跟踪方法,其特征在于,所述利用第二网络模型提取所述坐标框中的特征,得到特征集合之前,还包括:The pedestrian trajectory tracking method according to any one of claims 3 to 6, characterized in that before using the second network model to extract features in the coordinate frame and obtaining the feature set, it also includes:
    基于行人重识别模式进行模型训练,裁剪训练数据集中的行人框,得到所述第二网络模型。Model training is performed based on the pedestrian re-identification mode, and pedestrian frames in the training data set are cropped to obtain the second network model.
  8. 根据权利要求1至7任一项所述的行人轨迹跟踪方法,其特征在于,所述候选框关系掩膜为包含0和1的M*N矩阵;其中,M为行人检测框的数量,N为行人数量。The pedestrian trajectory tracking method according to any one of claims 1 to 7, characterized in that the candidate frame relationship mask is an M*N matrix containing 0 and 1; where M is the number of pedestrian detection frames, N is the number of pedestrians.
  9. 根据权利要求1至8任一项所述的行人轨迹跟踪方法,其特征在于,所述提取行人轨迹库的历史帧特征集合,与所述候选框关系掩膜中的候选框进行特征计算,得到人框特征距离矩阵和框人特征距离矩阵,包括:The pedestrian trajectory tracking method according to any one of claims 1 to 8, characterized in that the feature set of historical frames extracted from the pedestrian trajectory library is calculated with the candidate frames in the candidate frame relationship mask to obtain The person-frame feature distance matrix and the frame-person feature distance matrix include:
    提取行人轨迹库中各行人的历史帧特征集合,分别与其当前帧的候选框进行余弦距离运算,得到人框特征距离矩阵;以及Extract the historical frame feature set of each pedestrian in the pedestrian trajectory database, perform cosine distance operations on its candidate frames of the current frame, and obtain the human frame feature distance matrix; and
    将当前帧候选框分别与其存在候选关系的行人进行特征距离运算,得到框人距离矩阵。Perform feature distance operations on the candidate frames of the current frame and the pedestrians with candidate relationships to obtain the frame-person distance matrix.
  10. 根据权利要求1至9任一项所述的行人轨迹跟踪方法,其特征在于,所述响应于存在目标候选框与所述目标行人的特征距离满足互为最小距离,将所述目标候选框归入所述目标行人的轨迹,包括:The pedestrian trajectory tracking method according to any one of claims 1 to 9, characterized in that, in response to the existence of a target candidate frame and the characteristic distance of the target pedestrian satisfying each other's minimum distance, the target candidate frame is classified into Enter the trajectory of the target pedestrian, including:
    响应于Disttd[i,j]==min(Disttd[i,*])且Distdt[j,i]==min(Distdt[j,*]),将第j个候选 框归入第i个行人的轨迹;In response to Disttd[i,j]==min(Disttd[i,*]) and Distdt[j,i]==min(Distdt[j,*]), classify the j-th candidate box into the i-th pedestrian traces of;
    其中,Disttd[i,j]表示第i个行人与第j个候选框的人框特征距离矩阵,Distdt[j,i]表示第i个行人与第j个候选框的框人特征距离矩阵。Among them, Disttd[i, j] represents the person frame feature distance matrix between the i-th pedestrian and the j-th candidate frame, and Distdt[j, i] represents the frame-person feature distance matrix between the i-th pedestrian and the j-th candidate frame.
  11. 根据权利要求1至10任一项所述的行人轨迹跟踪方法,其特征在于,所述获取图像数据,包括:The pedestrian trajectory tracking method according to any one of claims 1 to 10, characterized in that the obtaining image data includes:
    获取视频数据,对所述视频数据进行图像帧处理,得到所述图像数据。Obtain video data, perform image frame processing on the video data, and obtain the image data.
  12. 一种行人轨迹跟踪系统,其特征在于,包括:A pedestrian trajectory tracking system is characterized by including:
    图像获取模块,用于获取图像数据;Image acquisition module, used to acquire image data;
    特征提取模块,用于对所述图像数据进行空间特征提取和外观特征提取,并根据提取的特征构建候选框关系掩膜;所述候选框关系掩膜中的数值表示当前帧的检测框与目标行人是否可形成合理的轨迹关系;A feature extraction module, used to extract spatial features and appearance features from the image data, and construct a candidate frame relationship mask based on the extracted features; the numerical value in the candidate frame relationship mask represents the detection frame and target of the current frame. Whether pedestrians can form a reasonable trajectory relationship;
    特征计算模块,用于提取行人轨迹库的历史帧特征集合,与所述候选框关系掩膜中的候选框进行特征计算,得到人框特征距离矩阵和框人特征距离矩阵;The feature calculation module is used to extract the historical frame feature set of the pedestrian trajectory library, perform feature calculation with the candidate frames in the candidate frame relationship mask, and obtain the person frame feature distance matrix and the frame person feature distance matrix;
    检测模块,用于根据所述人框特征距离矩阵和所述框人特征距离矩阵计算所述目标行人与所述候选框的特征距离,响应于存在目标候选框与所述目标行人的特征距离满足互为最小距离,将所述目标候选框归入所述目标行人的轨迹,直至当前帧检测框内无满足条件的检测框;以及A detection module configured to calculate the characteristic distance between the target pedestrian and the candidate frame according to the human frame characteristic distance matrix and the frame human characteristic distance matrix, in response to the existence of the characteristic distance between the target candidate frame and the target pedestrian satisfying being the minimum distance from each other, the target candidate frame is classified into the trajectory of the target pedestrian until there is no detection frame that meets the conditions in the current frame detection frame; and
    轨迹更新模块,用于更新所述行人轨迹库,输出所述目标行人的行人索引集合和对应的位置轨迹。A trajectory update module is used to update the pedestrian trajectory library and output the pedestrian index set and the corresponding position trajectory of the target pedestrian.
  13. 根据权利要求12所述的行人轨迹跟踪系统,其特征在于,还包括:The pedestrian trajectory tracking system according to claim 12, further comprising:
    轨迹库构建模块,用于构建行人轨迹库;所述行人轨迹库包含各行人的历史位置和所述行人在各所述历史位置时的特征信息。A trajectory library construction module is used to construct a pedestrian trajectory library; the pedestrian trajectory library includes the historical location of each pedestrian and the characteristic information of the pedestrian at each historical location.
  14. 一种非易失性计算机可读存储介质,其上存储有计算机可读指令,其特征在于,所述计算机可读指令被处理器执行时实现如权利要求1至11任一项所述的行人轨迹跟踪方法的步骤。A non-volatile computer-readable storage medium on which computer-readable instructions are stored, characterized in that when the computer-readable instructions are executed by a processor, the pedestrian movement as described in any one of claims 1 to 11 is realized. The steps of the trajectory tracking method.
  15. 一种电子设备,其特征在于,包括存储器,还包括一个或多个处理器,所述存储器中存有计算机可读指令,所述处理器调用所述存储器中的计算机可读指令时实现如权利要求1至11任一项所述的行人轨迹跟踪方法的步骤。An electronic device, characterized in that it includes a memory and one or more processors. Computer-readable instructions are stored in the memory. When the processor calls the computer-readable instructions in the memory, it implements the rights as claimed. The steps of the pedestrian trajectory tracking method according to any one of claims 1 to 11.
PCT/CN2022/117148 2022-04-30 2022-09-06 Pedestrian trajectory tracking method and system, and related apparatus WO2023206904A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210469020.8 2022-04-30
CN202210469020.8A CN114581491B (en) 2022-04-30 2022-04-30 Pedestrian trajectory tracking method, system and related device

Publications (1)

Publication Number Publication Date
WO2023206904A1 true WO2023206904A1 (en) 2023-11-02

Family

ID=81785306

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/117148 WO2023206904A1 (en) 2022-04-30 2022-09-06 Pedestrian trajectory tracking method and system, and related apparatus

Country Status (2)

Country Link
CN (1) CN114581491B (en)
WO (1) WO2023206904A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114581491B (en) * 2022-04-30 2022-07-22 苏州浪潮智能科技有限公司 Pedestrian trajectory tracking method, system and related device
CN115183763A (en) * 2022-09-13 2022-10-14 南京北新智能科技有限公司 Personnel map positioning method based on face recognition and grid method
CN115546192B (en) * 2022-11-03 2023-03-21 中国平安财产保险股份有限公司 Livestock quantity identification method, device, equipment and storage medium
CN115965657B (en) * 2023-02-28 2023-06-02 安徽蔚来智驾科技有限公司 Target tracking method, electronic device, storage medium and vehicle

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160132728A1 (en) * 2014-11-12 2016-05-12 Nec Laboratories America, Inc. Near Online Multi-Target Tracking with Aggregated Local Flow Descriptor (ALFD)
CN109087335A (en) * 2018-07-16 2018-12-25 腾讯科技(深圳)有限公司 A kind of face tracking method, device and storage medium
CN110517292A (en) * 2019-08-29 2019-11-29 京东方科技集团股份有限公司 Method for tracking target, device, system and computer readable storage medium
CN114581491A (en) * 2022-04-30 2022-06-03 苏州浪潮智能科技有限公司 Pedestrian trajectory tracking method, system and related device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10921130B1 (en) * 2019-09-18 2021-02-16 Here Global B.V. Method and apparatus for providing an indoor pedestrian origin-destination matrix and flow analytics
CN113761987A (en) * 2020-06-05 2021-12-07 苏宁云计算有限公司 Pedestrian re-identification method and device, computer equipment and storage medium
CN112734809A (en) * 2021-01-21 2021-04-30 高新兴科技集团股份有限公司 Online multi-pedestrian tracking method and device based on Deep-Sort tracking framework
CN114332168A (en) * 2022-03-14 2022-04-12 苏州浪潮智能科技有限公司 Pedestrian tracking method, pedestrian tracking system, electronic device and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160132728A1 (en) * 2014-11-12 2016-05-12 Nec Laboratories America, Inc. Near Online Multi-Target Tracking with Aggregated Local Flow Descriptor (ALFD)
CN109087335A (en) * 2018-07-16 2018-12-25 腾讯科技(深圳)有限公司 A kind of face tracking method, device and storage medium
CN110517292A (en) * 2019-08-29 2019-11-29 京东方科技集团股份有限公司 Method for tracking target, device, system and computer readable storage medium
CN114581491A (en) * 2022-04-30 2022-06-03 苏州浪潮智能科技有限公司 Pedestrian trajectory tracking method, system and related device

Also Published As

Publication number Publication date
CN114581491B (en) 2022-07-22
CN114581491A (en) 2022-06-03

Similar Documents

Publication Publication Date Title
WO2023206904A1 (en) Pedestrian trajectory tracking method and system, and related apparatus
Ochmann et al. Automatic reconstruction of fully volumetric 3D building models from oriented point clouds
CN109559320B (en) Method and system for realizing visual SLAM semantic mapping function based on hole convolution deep neural network
US20120263346A1 (en) Video-based detection of multiple object types under varying poses
CN111626128A (en) Improved YOLOv 3-based pedestrian detection method in orchard environment
AU2016201908A1 (en) Joint depth estimation and semantic labeling of a single image
CN110084299B (en) Target detection method and device based on multi-head fusion attention
WO2019109793A1 (en) Human head region recognition method, device and apparatus
CN112419202B (en) Automatic wild animal image recognition system based on big data and deep learning
Guo et al. Evaluation-oriented façade defects detection using rule-based deep learning method
CN115512251A (en) Unmanned aerial vehicle low-illumination target tracking method based on double-branch progressive feature enhancement
CN111091101A (en) High-precision pedestrian detection method, system and device based on one-step method
Xu et al. Task-aware meta-learning paradigm for universal structural damage segmentation using limited images
CN115719436A (en) Model training method, target detection method, device, equipment and storage medium
CN114511661A (en) Image rendering method and device, electronic equipment and storage medium
CN108961385A (en) A kind of SLAM patterning process and device
López-Rubio et al. Anomalous object detection by active search with PTZ cameras
CN111738164A (en) Pedestrian detection method based on deep learning
Rong et al. Big data intelligent tourism management platform design based on abnormal behavior identification
KR101826669B1 (en) System and method for video searching
CN116069801B (en) Traffic video structured data generation method, device and medium
WO2023178951A1 (en) Image analysis method and apparatus, model training method and apparatus, and device, medium and program
CN111915713A (en) Three-dimensional dynamic scene creating method, computer equipment and storage medium
Han et al. FloorUSG: Indoor floorplan reconstruction by unifying 2D semantics and 3D geometry
CN115359468A (en) Target website identification method, device, equipment and medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22939730

Country of ref document: EP

Kind code of ref document: A1