CN108764167B - Space-time correlated target re-identification method and system - Google Patents

Space-time correlated target re-identification method and system

Info

Publication number
CN108764167B
CN108764167B (application CN201810543066.3A)
Authority
CN
China
Prior art keywords
time
camera
target
candidate
probability
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810543066.3A
Other languages
Chinese (zh)
Other versions
CN108764167A (en)
Inventor
Zhang Chongyang (张重阳)
Kong Xiyu (孔熙雨)
Gui Lin (归琳)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CN201810543066.3A priority Critical patent/CN108764167B/en
Publication of CN108764167A publication Critical patent/CN108764167A/en
Application granted granted Critical
Publication of CN108764167B publication Critical patent/CN108764167B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/10 Terrestrial scenes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods

Abstract

The invention relates to a space-time associated target re-identification method that uses the pixel motion rate of a target in video data to estimate the probability distribution of the time the target needs to cross between two adjacent cameras separated by a known distance. Based on this crossing-time probability, candidate targets appearing in the video can be screened and pre-filtered: candidates falling outside a reasonable crossing-time interval are discarded, reducing the probability that visually similar targets are mistakenly matched as the tracked target. The invention also relates to a space-time associated target re-identification system. The matching results produced by the method are constrained by space-time position and target motion information and, compared with unconstrained matching that relies on visual features alone, effectively improve re-identification accuracy.

Description

Space-time correlated target re-identification method and system
Technical Field
The invention relates to a target re-identification technology, in particular to a space-time associated target re-identification method and a corresponding target re-identification system.
Background
Target re-identification is the problem of using computer vision techniques to judge whether a specific target appears in an image or video sequence. Specifically, when a specific target is tracked in video, each video source has a fixed position, so cross-video relay tracking is required once the target leaves the field of view; detecting the specific target in other video sources is then a target re-identification problem.
Target re-identification performs feature matching with the visual features of the target obtained from images and returns possible candidate targets. Because the features of different targets can be similar, matching on features alone may yield candidate targets that are not the true tracked target.
A search of the prior art shows that although current target re-identification technology is widely applied to relay tracking, the re-identification module mostly uses only the visual feature information of the target in the image. For example, patent CN201210201004.7 combines GPS and GIS information, i.e., spatial information, for screening, but its use of time and spatial information stops at drawing a GIS map and the target's motion trajectory; the time and spatial information is not used directly to improve the accuracy of target re-identification. Its re-identification module is still based only on visual features and, owing to visual similarity, can still produce a large number of low-likelihood candidate targets within unreasonable time intervals.
In addition, there is a patent that combines target time and position information: application No. CN201610221993.4, a target re-identification method based on space-time constraints. It assigns a shortest motion time to each pair of adjacent cameras, derives the probability that a target appears at a given time from a Weibull distribution and the measured appearance times of candidate targets, and combines this with visual matching features into a joint probability distribution. However, this method does not consider the real-time motion state of the target; the probability of the target appearing at a specific time depends entirely on the measured times. Two main problems follow. First, the time in that patent is the local time of each camera's own clock rather than unified global positioning time service information, so clock differences between cameras leave the times unsynchronized, which directly affects the prediction result. Second, and more importantly, that patent considers only common factors such as crossing time and spatial distance, not individual factors such as the target's displacement direction and speed. Directly assigning a shortest motion time to adjacent cameras carries essentially no more information than the path distance between them in the GIS data; it amounts to a spatial distance constraint only. In practice, targets differ individually in speed: some move fast and some move slowly. To estimate more accurately when a target will appear in a camera's field of view, the possible arrival times of different targets must be predicted from their real-time motion information. For example, suppose two targets with similar visual features are in the field of view of camera A and both move toward camera B, one being the tracked target and the other not. Under the algorithm of that patent, the two targets' entry-time probability distributions are identical Weibull distributions; but if one moves faster and the other slower, their arrival times at camera B differ greatly, a conclusion entirely different from that reached by application No. CN201610221993.4, so its candidate screening is inaccurate.
A further search found no existing target re-identification method that combines visual features, space-time constraints, and the motion information of target individuals while adopting globally unified time service for time synchronization.
Disclosure of Invention
In view of the fact that existing target re-identification methods mainly use the target's visual features and make insufficient use of space-time correlation information, a space-time associated target re-identification method is provided.
To this end, the invention adopts the following technical scheme: the method uses the pixel motion rate of the target in video data to estimate the probability distribution of the time the target needs to cross between two adjacent cameras separated by a known distance. Based on this crossing-time probability, candidate targets appearing in the video can be screened and pre-filtered; candidates falling outside a reasonable crossing-time interval are discarded, reducing the probability that similar targets are mistakenly matched as the tracked target. The resulting matches are constrained by space-time position and target motion information and, compared with unconstrained matching that relies on visual features alone, effectively improve re-identification accuracy.
According to a first object of the present invention, there is provided a method for re-identifying spatio-temporal correlated targets, comprising:
for camera C_i, recording the initial time t_s of a selected target a to be searched and starting tracking; obtaining from the tracking result the pixel motion rate V_a in the image and the motion direction information, and extracting the visual features of target a for re-identification;
using GIS information, obtaining the set of M cameras spatially adjacent to camera C_i and matching the advancing direction of target a; for each adjacent camera M_j in the set, obtaining the actual path length L_{i,j} from camera C_i to camera M_j via GIS or manual measurement;
predicting the crossing time t_{i,j} of target a from camera C_i to the adjacent camera M_j, for the given path length L_{i,j}, with a linear rate-time model t̂_{i,j} = α·L_{i,j}/V_a + β; using the predicted crossing time t̂_{i,j}, taking the targets appearing in camera M_j within the time interval [t_s + t̂_{i,j} − k·σ̂_{i,j}, t_s + t̂_{i,j} + k·σ̂_{i,j}] as candidate targets for re-identification, where σ̂_{i,j} is the statistical standard deviation of t̂_{i,j} and k is a preset coverage multiple; that is, t_{i,j} is assumed to obey a normal distribution, whose standard deviation is obtained from training data;
extracting visual features for re-identification for each candidate target b in camera M_j, and taking the globally unified time service information acquired in synchronization with candidate b's first appearance in camera M_j as the time t_e at which the target appears in camera M_j; obtaining by motion tracking the pixel motion rate V_b of each candidate target b in camera M_j, and predicting its crossing time with a linear rate-time model as t̂_{i,j}^b = η·L_{i,j}/V_b + θ;
for each pair (V_b, L_{i,j}) at camera M_j, assuming the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with mean t_mean and variance σ²; based on this distribution, computing the probability P_timespace that candidate b appears in camera M_j at time t_e given (V_b, L_{i,j}), with (t_e − t_s) ~ N(t_mean, σ²);
based on the visual features of target a and candidate target b, computing the recognition probability P_vision of each candidate b with a target re-identification method; multiplying P_vision and P_timespace for each candidate b, taking the product as the target re-identification probability, and ranking by this probability to obtain the final re-identification result.
Preferably, obtaining the pixel motion rate V_a and motion direction information from the tracking result means: the pixel motion rate is the motion speed measured with image pixels as the unit distance and the image acquisition interval as the unit time; it does not involve the actual motion rate of the target, so no calibration and no additional acquisition equipment are needed. The motion direction is determined by combining camera calibration information and dividing the image plane into direction sectors of N degrees each; the sector into which the target's motion direction falls is taken as the target's motion direction.
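By way of an illustrative sketch only (the tracking-box format, the frame interval, and the default 45-degree sector width are assumptions, not values fixed by the invention), the pixel motion rate and direction sector could be computed from consecutive tracking boxes as follows:

```python
import math

def pixel_motion(boxes, frame_interval=1.0, sector_deg=45.0):
    """Estimate the pixel motion rate and direction sector from tracking boxes.

    boxes: (x, y, w, h) tracking boxes on consecutive frames.
    frame_interval: image acquisition interval, the time unit of the rate.
    sector_deg: assumed width N of each direction sector, in degrees.
    Returns (rate in pixels per time unit, direction sector index).
    """
    if len(boxes) < 2:
        raise ValueError("need at least two tracking boxes")
    # Centers of the first and last tracking boxes.
    cx0, cy0 = boxes[0][0] + boxes[0][2] / 2, boxes[0][1] + boxes[0][3] / 2
    cx1, cy1 = boxes[-1][0] + boxes[-1][2] / 2, boxes[-1][1] + boxes[-1][3] / 2
    dt = (len(boxes) - 1) * frame_interval
    dx, dy = cx1 - cx0, cy1 - cy0
    rate = math.hypot(dx, dy) / dt       # pixel units only; no calibration needed
    angle = math.degrees(math.atan2(dy, dx)) % 360.0
    sector = int(angle // sector_deg)    # sector the motion direction falls into
    return rate, sector
```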
Preferably, obtaining via GIS information the set of M cameras spatially adjacent to camera C_i and matching the advancing direction of target a means: taking the direction sector of target a's motion as the center, plus its two spatially adjacent sectors, as the search range for adjacent cameras; the cameras lying within this direction range and spatially adjacent to camera C_i form the set of adjacent cameras matching target a's advancing direction.
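A minimal sketch of this neighbor search, assuming each GIS neighbor record already carries the direction sector of the path from C_i to that camera (the record layout is an assumption for illustration):

```python
def adjacent_cameras(gis_neighbors, target_sector, n_sectors=8):
    """Select spatially adjacent cameras matching the target's advancing direction.

    gis_neighbors: list of (camera_id, direction_sector, path_length L_ij)
    for cameras spatially adjacent to C_i according to GIS information.
    Keeps neighbors whose sector is the target's sector or one of its two
    neighboring sectors, per the search-range rule above.
    """
    allowed = {(target_sector - 1) % n_sectors,
               target_sector,
               (target_sector + 1) % n_sectors}
    return [(cam_id, L) for cam_id, sector, L in gis_neighbors
            if sector in allowed]
```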
Preferably, predicting the crossing time t_{i,j} of target a from camera C_i to the adjacent camera M_j, for the given path length L_{i,j}, with a linear rate-time model means:
for a target with pixel motion rate V_a, the crossing time over path L_{i,j} satisfies the linear relationship
t̂_{i,j} = α·L_{i,j}/V_a + β
where α and β are model parameters; the linear model is fitted on training data collected offline to obtain its parameters, and the parameters can be dynamically learned and updated from data collected online.
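For instance, under the reconstruction above in which the crossing time is regressed on L_{i,j}/V_a, the parameters α and β could be fitted offline by ordinary least squares; this is one plausible realization, not the only one:

```python
import numpy as np

def fit_linear_rate_time(V, L, t):
    """Fit t = alpha * (L / V) + beta by least squares.

    V, L, t: arrays of pixel rates, path lengths, and measured crossing
    times from offline training data. Returns (alpha, beta).
    """
    x = np.asarray(L, dtype=float) / np.asarray(V, dtype=float)
    A = np.stack([x, np.ones_like(x)], axis=1)
    (alpha, beta), *_ = np.linalg.lstsq(A, np.asarray(t, dtype=float), rcond=None)
    return alpha, beta

def predict_crossing_time(alpha, beta, V_a, L_ij):
    """Predicted crossing time t_hat for a target with pixel rate V_a."""
    return alpha * L_ij / V_a + beta
```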
Preferably, when an image is acquired, globally unified time service information, obtained by reading the GPS or BeiDou global time service module of the acquisition device or other global time service devices and modules, is recorded as the generation time of that frame; the generation time of the first image in which candidate target b appears in M_j is taken as its time of appearance t_e.
Preferably, predicting the crossing time t̂_{i,j}^b with a linear rate-time model means:
for a target with pixel motion rate V_b, the crossing time over path L_{i,j} satisfies the linear relationship
t̂_{i,j}^b = η·L_{i,j}/V_b + θ
where η and θ are model parameters; the linear model is fitted on training data collected offline to obtain its parameters, and the parameters can be dynamically learned and updated from data collected online.
Preferably, assuming that for each pair (V_b, L_{i,j}) at camera M_j the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with mean t_mean and variance σ² means:
the pixel motion rate of candidate target b is quantized into S rate levels, and a pixel rate V_b falling into a rate level is replaced by the mean rate V_mean of that level's interval; for each given condition combination (V_mean, L_{i,j}), the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with parameters (t_mean, σ²), which are fitted on training data collected offline; the model parameters can be dynamically learned and updated from data collected online.
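A sketch of this offline fitting, assuming the S rate levels are defined by fixed bin edges and the path lengths are the discrete per-camera-pair values (the data layout is an illustrative assumption):

```python
import numpy as np
from collections import defaultdict

def fit_crossing_distributions(samples, rate_edges):
    """Fit N(t_mean, sigma^2) of crossing time per (rate level, path length).

    samples: iterable of (V_b, L_ij, measured crossing time) from offline data.
    rate_edges: bin edges defining the S rate levels.
    Returns dict (rate_level, L_ij) -> (t_mean, sigma).
    """
    buckets = defaultdict(list)
    for v, L, t in samples:
        level = int(np.digitize(v, rate_edges))   # quantize pixel rate to a level
        buckets[(level, L)].append(t)
    params = {}
    for key, times in buckets.items():
        times = np.asarray(times, dtype=float)
        sigma = times.std(ddof=1) if len(times) > 1 else 0.0
        params[key] = (times.mean(), sigma)
    return params
```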
Preferably, computing based on this distribution the probability P_timespace that candidate target b appears in camera M_j at time t_e given (V_b, L_{i,j}) means:
assuming candidate target b is the target being searched, its real crossing time from camera C_i to camera M_j is t_b = t_e − t_s; since the crossing time t̂_{i,j}^b obeys a normal distribution, the probability of crossing from C_i to M_j is computed from the normal distribution model and the real crossing time as
P_timespace = N(t_b; t_mean, σ²) = (1/√(2πσ²))·exp(−(t_b − t_mean)²/(2σ²))
and this probability is taken as the probability that candidate b appears in camera M_j at time t_e given (V_b, L_{i,j}).
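Under that normal assumption, the space-time probability reduces to evaluating the normal density at the observed crossing time; a minimal sketch:

```python
import math

def p_timespace(t_s, t_e, t_mean, sigma):
    """Normal density N(t_b; t_mean, sigma^2) of the real crossing time
    t_b = t_e - t_s, used as the space-time probability of candidate b.
    """
    t_b = t_e - t_s
    return math.exp(-((t_b - t_mean) ** 2) / (2.0 * sigma ** 2)) / (
        sigma * math.sqrt(2.0 * math.pi))
```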
According to a second object of the present invention, there is provided a spatiotemporal correlated target re-identification system, comprising:
a target detection and tracking module: for camera C_i, recording the initial time t_s of a selected target a to be searched and starting tracking; obtaining from the tracking result the pixel motion rate V_a in the image and the motion direction information;
a visual feature extraction and re-identification module: based on the results of the target detection and tracking module, extracting the visual features of the target to be searched and of each candidate target for re-identification; obtaining via GIS information the set of M cameras spatially adjacent to camera C_i and matching the advancing direction of target a, and for each adjacent camera M_j in the set obtaining the actual path length L_{i,j} from camera C_i to camera M_j via GIS or manual measurement; predicting the crossing time t_{i,j} of target a from camera C_i to the adjacent camera M_j, for the given path length L_{i,j}, with the linear rate-time model t̂_{i,j} = α·L_{i,j}/V_a + β; using the predicted crossing time t̂_{i,j}, taking the targets appearing in camera M_j within the time interval [t_s + t̂_{i,j} − k·σ̂_{i,j}, t_s + t̂_{i,j} + k·σ̂_{i,j}] as candidate targets for re-identification, where σ̂_{i,j} is the statistical standard deviation of t̂_{i,j} and k is a preset coverage multiple; that is, t_{i,j} is assumed to obey a normal distribution whose standard deviation is obtained from training data;
a space-time association and target screening module: for each candidate target b in camera M_j, taking the globally unified time service information acquired in synchronization with its first appearance in camera M_j as the time t_e at which the target appears in camera M_j; obtaining by motion tracking the pixel motion rate V_b of each candidate target in camera M_j; likewise predicting its crossing time t̂_{i,j}^b with the linear rate-time model; for each pair (V_b, L_{i,j}) at camera M_j, assuming the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with mean t_mean and variance σ², and, based on this distribution, computing the probability P_timespace that b appears in camera M_j at time t_e given (V_b, L_{i,j}), with (t_e − t_s) ~ N(t_mean, σ²);
a recognition probability calculation and reordering module: based on the visual features of target a and candidate target b, computing the recognition probability P_vision of each candidate b with a target re-identification method; multiplying P_vision and P_timespace for each candidate b, taking the product as the target re-identification probability, and ranking by this probability to obtain the final re-identification result.
The visual features of target a and candidate target b include, but are not limited to, traditional hand-crafted features such as color and texture, and deep features learned by a deep neural network model. The invention analyzes video data together with actual distance information, computes the target's motion rate in real time, predicts its arrival time, and filters out candidate targets beyond a reasonable interval, improving the accuracy of target re-identification.
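The invention leaves the visual re-identification method open; purely by way of illustration, assuming deep-feature embeddings are used, P_vision could be derived from cosine similarity normalized over the candidates (the softmax normalization and the temperature are assumptions):

```python
import numpy as np

def p_vision(query_feat, candidate_feats, temperature=0.1):
    """Turn cosine similarities between the query target's feature and each
    candidate's feature into per-candidate probabilities via a softmax.

    query_feat: (d,) embedding of target a; candidate_feats: (n, d).
    """
    q = query_feat / np.linalg.norm(query_feat)
    C = candidate_feats / np.linalg.norm(candidate_feats, axis=1, keepdims=True)
    logits = (C @ q) / temperature
    e = np.exp(logits - logits.max())   # stabilized softmax
    return e / e.sum()
```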
Compared with the prior art, the embodiment of the invention has the following effects:
the method and the system provided by the invention are a method for improving the accuracy of the re-identification process on the problem of re-identification of the target which is generally carried out only by using image data or time and position data by combining target motion information.
According to the method and the system, the relevance relation of the cross-camera video data in time and space and the motion information of the target individual are utilized to correlate the re-recognition candidate target in time and space, and the non-correlated candidate target in time and space is filtered or reduced, so that a more accurate candidate target range is obtained, and the target re-recognition precision is effectively improved.
Drawings
FIG. 1 is a block diagram of an embodiment of spatial-temporal correlation object re-identification according to the present invention.
Detailed Description
The present invention is described in detail below with reference to specific embodiments. The following embodiments will assist those skilled in the art in further understanding the invention, but do not limit it in any way. It should be noted that variations and modifications can be made by persons skilled in the art without departing from the spirit of the invention, all of which fall within the scope of the present invention.
The method uses the pixel motion rate of the target in video data to estimate the probability distribution of the time the target needs to cross between two adjacent cameras separated by a known distance. Based on this crossing-time probability, the candidate targets appearing in the video can be screened and pre-filtered: candidates falling outside a reasonable crossing-time interval are discarded, reducing the probability that similar targets are mistakenly matched as the tracked target.
Specifically, an embodiment of the space-time associated target re-identification method of the present invention may follow these steps:
S1: for camera C_i, record the initial time t_s of a selected target a to be searched and begin tracking; obtain from the tracking result the pixel motion rate V_a in the image and the motion direction information, and extract the visual features used for re-identification. The visual features include, but are not limited to, traditional hand-crafted features such as color and texture, and deep features learned by a deep neural network model.
S2: using GIS information, obtain the set of M cameras spatially adjacent to C_i and matching the advancing direction of the target to be searched; for each adjacent camera M_j in the set, obtain the actual path length L_{i,j} from C_i to M_j via GIS or manual measurement.
S3: the crossing time t_{i,j} of target a from C_i to the adjacent camera M_j, for the given path length L_{i,j}, may be predicted with the linear rate-time model t̂_{i,j} = α·L_{i,j}/V_a + β. Using the predicted crossing time t̂_{i,j}, the targets appearing in M_j within the time interval [t_s + t̂_{i,j} − k·σ̂_{i,j}, t_s + t̂_{i,j} + k·σ̂_{i,j}] may be taken as the candidate targets for re-identification, as in the screening sketch below.
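A sketch of the S3 screening rule (the detection-record layout and the default k = 3 coverage multiple are assumptions for illustration):

```python
def screen_candidates(detections, t_s, t_hat, sigma_hat, k=3.0):
    """Keep detections in camera M_j whose appearance time t_e falls inside
    [t_s + t_hat - k*sigma_hat, t_s + t_hat + k*sigma_hat].

    detections: list of (target_id, appearance time t_e).
    """
    lo = t_s + t_hat - k * sigma_hat
    hi = t_s + t_hat + k * sigma_hat
    return [(tid, te) for tid, te in detections if lo <= te <= hi]
```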
S4: for each candidate target b in M_j, on the one hand, extract its visual features for re-identification, the visual features including, but not limited to, traditional hand-crafted features such as color and texture, and deep features learned by a deep neural network model; on the other hand, take the globally unified time service information acquired in synchronization with b's first appearance in M_j as its time of appearance t_e. Obtain by motion tracking the pixel motion rate V_b of each candidate target in M_j; likewise, predict its crossing time t̂_{i,j}^b with the linear rate-time model. For each pair (V_b, L_{i,j}) at M_j, the crossing time t̂_{i,j}^b of candidate b may be assumed to obey a normal distribution with mean t_mean and variance σ². Based on this distribution, compute the probability P_timespace that b appears in M_j at time t_e given (V_b, L_{i,j}), with (t_e − t_s) ~ N(t_mean, σ²);
S5: based on the visual features of target a and candidate target b, compute the recognition probability P_vision of each candidate b with a target re-identification method; multiply P_vision and P_timespace for each candidate b, take the product as the target re-identification probability, and rank by this probability to obtain the final re-identification result.
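The S5 fusion is a straightforward product and sort; a minimal sketch over an assumed candidate-record structure:

```python
def rerank(candidates):
    """candidates: list of (target_id, p_vision, p_timespace).
    Returns candidates ranked by the joint re-identification probability.
    """
    scored = [(tid, pv * pt) for tid, pv, pt in candidates]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```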
Of course, it will be understood by those skilled in the art that the execution order of the steps in the above embodiment may be adjusted to the actual situation and need not be followed strictly.
In this embodiment, in S1, obtaining the pixel motion rate V_a and its motion direction information from the tracking result means: the pixel motion rate is the motion speed measured with image pixels as the unit distance and the image acquisition interval as the unit time; it does not involve the actual motion rate of the target, so no calibration and no additional acquisition equipment are needed. The motion direction is determined by combining camera calibration information and dividing the image plane into direction sectors of N degrees each; the sector into which the target's motion direction falls is taken as the target's motion direction.
In this embodiment, in S2, obtaining via GIS information the set of M cameras spatially adjacent to C_i and matching the advancing direction of the target to be searched means: taking the target's motion direction sector as the center, plus its two spatially adjacent sectors, as the search range for adjacent cameras; the cameras lying within this direction range and spatially adjacent to C_i form the set of adjacent cameras matching the target's advancing direction.
In this embodiment, in S3, predicting the crossing time t_{i,j} of target a from C_i to the adjacent camera M_j, for the given path length L_{i,j}, with the linear rate-time model means: for a target with pixel motion rate V_a, the crossing time over path L_{i,j} satisfies the linear relationship t̂_{i,j} = α·L_{i,j}/V_a + β, where α and β are model parameters; the linear model can be fitted on training data collected offline to obtain its parameters, and the parameters can be dynamically learned and updated from data collected online.
In this embodiment, in S4, taking the globally unified time service information acquired in synchronization with candidate b's first appearance in M_j as the time t_e at which the target appears in M_j specifically means: when an image is acquired, globally unified time service information, obtained by reading the GPS or BeiDou global time service module of the acquisition device or other global time service devices and modules, is recorded as the generation time of that frame; the generation time of the first image in which target b appears in M_j is taken as b's time of appearance t_e.
In this embodiment, in S4, predicting the crossing time t̂_{i,j}^b with the linear rate-time model specifically means: for a target with pixel motion rate V_b, the crossing time over path L_{i,j} satisfies the linear relationship t̂_{i,j}^b = η·L_{i,j}/V_b + θ, where η and θ are model parameters; the linear model can be fitted on training data collected offline to obtain its parameters, and the parameters can be dynamically learned and updated from data collected online.
In this embodiment, in S4, assuming that for each pair (V_b, L_{i,j}) at M_j the crossing time t̂_{i,j}^b of candidate b obeys a normal distribution with mean t_mean and variance σ² specifically means: the pixel motion rate of candidate target b is quantized into S rate levels; a pixel rate V_b falling into a rate level is replaced by the mean rate V_mean of that level's interval; for each given condition combination (V_mean, L_{i,j}), the crossing time t̂_{i,j}^b of target b obeys a normal distribution with parameters (t_mean, σ²), which are fitted on training data collected offline; the model parameters can be dynamically learned and updated from data collected online.
In this embodiment, in S4, computing based on this distribution the probability P_timespace that b appears in M_j at time t_e given (V_b, L_{i,j}) means: assuming target b is the target being searched, its real crossing time from C_i to M_j is t_b = t_e − t_s; since the crossing time t̂_{i,j}^b obeys a normal distribution, the probability of crossing from C_i to M_j is computed from the normal distribution model and the real crossing time as P_timespace = N(t_b; t_mean, σ²), and this probability is taken as the probability that b appears in M_j at time t_e given (V_b, L_{i,j}).
On this basis, the accuracy of target re-identification is improved by modeling with motion information. In a system embodiment, the target re-identification system mainly comprises a target detection and tracking module, a visual feature extraction and re-identification module, a space-time association and target screening module, and a recognition probability calculation and reordering module, wherein:
a target detection and tracking module: for camera C_i, recording the initial time t_s of a selected target a to be searched and starting tracking; obtaining from the tracking result the pixel motion rate V_a in the image and the motion direction information;
a visual feature extraction and re-identification module: based on the results of the target detection and tracking module, extracting the visual features of the target to be searched and of each candidate target for re-identification, the visual features including, but not limited to, traditional hand-crafted features such as color and texture, and deep features learned by a deep neural network model; obtaining via GIS information the set of M cameras spatially adjacent to camera C_i and matching the advancing direction of target a, and for each adjacent camera M_j in the set obtaining the actual path length L_{i,j} from camera C_i to camera M_j via GIS or manual measurement; predicting the crossing time t_{i,j} of target a from camera C_i to the adjacent camera M_j, for the given path length L_{i,j}, with the linear rate-time model t̂_{i,j} = α·L_{i,j}/V_a + β; using the predicted crossing time t̂_{i,j}, taking the targets appearing in camera M_j within the time interval [t_s + t̂_{i,j} − k·σ̂_{i,j}, t_s + t̂_{i,j} + k·σ̂_{i,j}] as candidate targets for re-identification, where σ̂_{i,j} is the statistical standard deviation of t̂_{i,j} and k is a preset coverage multiple; that is, t_{i,j} is assumed to obey a normal distribution whose standard deviation is obtained from training data;
a space-time association and target screening module: for each candidate target b in camera M_j, taking the globally unified time service information acquired in synchronization with its first appearance in camera M_j as the time t_e at which the target appears in camera M_j; obtaining by motion tracking the pixel motion rate V_b of each candidate target in camera M_j; likewise predicting its crossing time t̂_{i,j}^b with the linear rate-time model; for each pair (V_b, L_{i,j}) at camera M_j, assuming the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with mean t_mean and variance σ², and, based on this distribution, computing the probability P_timespace that b appears in camera M_j at time t_e given (V_b, L_{i,j}), with (t_e − t_s) ~ N(t_mean, σ²);
a recognition probability calculation and reordering module: based on the visual features of target a and candidate target b, computing the recognition probability P_vision of each candidate b with a target re-identification method; multiplying P_vision and P_timespace for each candidate b, taking the product as the target re-identification probability, and ranking by this probability to obtain the final re-identification result.
Referring to FIG. 1, in one embodiment:
the invention takes the cross-camera pedestrian re-identification in the video monitoring system as an embodiment and carries out application description. Cross-camera pedestrian re-identification in a video monitoring system refers to that when a specific pedestrian target a appearing in an initial camera C leaves C and enters the visual field of other cameras under a video monitoring network, based on the space-time correlation target re-identification method, the cross-camera space-time correlation is utilized to carry out candidate target constraint and is fused with the visual information of the target, and the visual characteristics are assisted in a joint probability mode to determine the probability that each candidate target and the target a to be detected are the same target.
The video collected by the initial camera C is sent to the target detection and tracking module, which tracks the selected pedestrian target a as the target to be searched; advanced tracking methods such as correlation filtering combined with deep features can be used. From the tracking result, i.e., the target tracking boxes over consecutive frames, the change of the tracking-box center over time is computed to obtain the pixel motion rate V_a and motion direction information of target a. Similarly, the camera B adjacent to camera C performs detection and tracking with the same target detection and tracking module and obtains the pixel motion rate and other information of each detected candidate target.
The target detection boxes and related outputs of the detection and tracking modules of camera C and camera B are sent to the visual feature extraction and re-identification module, which extracts visual features and computes the re-identification probability P_vision based on those features.
The pixel motion rate V_a and motion direction information of target a output by the detection and tracking modules of the two cameras are passed to the space-time association and target screening module, which screens candidate targets based on space-time association combined with GIS and other information and computes the recognition probability P_timespace of each candidate target under the space-time constraint.
The two probabilities output by the visual feature extraction and re-identification module and the space-time association and target screening module are sent together to the recognition probability calculation and reordering module, which multiplies them to obtain a new recognition probability and reorders the candidates by this probability to obtain the final re-identification result.
The working process of the system in this embodiment and the functions it realizes are as follows:
(1) Time information containing target motion information, position information, and unified time service is generated for each detected target and candidate target, providing accurate, specific, and synchronized space-time information for video analysis.
(2) Target re-identification combines visual features, space-time constraints, and the motion information of target individuals, and adopts globally unified time service for time synchronization.
The specific implementation of each module in the system of the above embodiment may adopt the technology of the corresponding step of the target re-identification method, and is not repeated here.
In summary, the target re-identification method and system provided by the invention combine visual features, space-time constraints, and the motion information of target individuals, adopt globally unified time service for time synchronization, and fuse these to improve target re-identification accuracy. The matching results produced are constrained by space-time position and target motion information and, compared with unconstrained matching that relies on visual features alone, effectively improve re-identification accuracy.
It should be noted that the steps of the target re-identification method provided by the invention may be implemented with the corresponding modules, devices, and units of the target re-identification system; those skilled in the art may refer to the technical solution of the system to implement the step flow of the method, i.e., the system embodiments may be understood as preferred examples for implementing the method, and details are not repeated here.
Those skilled in the art will appreciate that, besides implementing the system and its various devices purely as computer-readable program code, the method steps can equally be realized by implementing the system and its devices in the form of logic gates, switches, application-specific integrated circuits, programmable logic controllers, embedded microcontrollers, and the like. Therefore, the system and its devices may be regarded as hardware components, and the devices they contain for realizing the various functions may be regarded as structures within the hardware components; devices for performing the functions may also be regarded as both software modules implementing the method and structures within the hardware components.
The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes and modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention.

Claims (9)

1. A space-time associated target re-identification method is characterized by comprising the following steps:
for camera C_i, recording the initial time t_s of a selected target a to be searched and starting tracking; obtaining from the tracking result the pixel motion rate V_a in the image and the motion direction information, and extracting the visual features of target a for re-identification;
using GIS information, obtaining the set of M cameras spatially adjacent to camera C_i and matching the advancing direction of target a; for each adjacent camera M_j in the set, obtaining the actual path length L_{i,j} from camera C_i to camera M_j via GIS or manual measurement;
predicting the crossing time t_{i,j} of target a from camera C_i to the adjacent camera M_j, for the given path length L_{i,j}, with a linear rate-time model t̂_{i,j} = α·L_{i,j}/V_a + β; using the predicted crossing time t̂_{i,j}, taking the targets appearing in camera M_j within the time interval [t_s + t̂_{i,j} − k·σ̂_{i,j}, t_s + t̂_{i,j} + k·σ̂_{i,j}] as candidate targets for re-identification, where σ̂_{i,j} is the statistical standard deviation of t̂_{i,j} and k is a preset coverage multiple, i.e., t_{i,j} is assumed to obey a normal distribution whose standard deviation is obtained from training data;
extracting visual features for re-identification for each candidate target b in camera M_j, and taking the globally unified time service information acquired in synchronization with candidate b's first appearance in camera M_j as the time t_e at which the target appears in camera M_j; obtaining by motion tracking the pixel motion rate V_b of each candidate target b in camera M_j, and predicting its crossing time with the linear rate-time model as t̂_{i,j}^b = η·L_{i,j}/V_b + θ;
for each pair (V_b, L_{i,j}) at camera M_j, assuming the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with mean t_mean and variance σ²; based on this distribution, computing the probability P_timespace that candidate b appears in camera M_j at time t_e given (V_b, L_{i,j}), with (t_e − t_s) ~ N(t_mean, σ²);
based on the visual features of target a and candidate target b, computing the recognition probability P_vision of each candidate b with a target re-identification method; multiplying P_vision and P_timespace for each candidate b, taking the product as the target re-identification probability, and ranking by this probability to obtain the final re-identification result.
2. The space-time associated target re-identification method according to claim 1, wherein obtaining the pixel motion rate V_a and motion direction information from the tracking result means: the pixel motion rate is the motion speed measured with image pixels as the unit distance and the image acquisition interval as the unit time, and does not involve the actual motion rate of the target; the motion direction is determined by combining camera calibration information and dividing the image plane into direction sectors of N degrees each, the sector into which the target's motion direction falls being taken as the target's motion direction.
3. The space-time associated target re-identification method according to claim 1, wherein obtaining via GIS information the set of M cameras spatially adjacent to camera C_i and matching the advancing direction of target a means: taking the direction sector of target a's motion as the center, plus its two spatially adjacent sectors, as the search range for adjacent cameras; the cameras lying within this direction range and spatially adjacent to camera C_i form the set of adjacent cameras matching target a's advancing direction.
4. The space-time associated target re-identification method according to claim 1, wherein predicting the crossing time t_{i,j} of target a from camera C_i to the adjacent camera M_j, for the given path length L_{i,j}, with a linear rate-time model means:
for a target with pixel motion rate V_a, the crossing time over path L_{i,j} satisfies the linear relationship
t̂_{i,j} = α·L_{i,j}/V_a + β
where α and β are model parameters; the linear model is fitted on training data collected offline to obtain its parameters, and the parameters can be dynamically learned and updated from data collected online.
5. The space-time associated target re-identification method according to claim 1, wherein when an image is acquired, globally unified time service information, obtained by reading the GPS or BeiDou global time service module of the acquisition device or other global time service devices and modules, is taken as the generation time of that frame, and the generation time of the first image in which candidate target b appears in camera M_j is taken as the time of appearance t_e.
6. The space-time associated target re-identification method according to claim 1, wherein predicting the crossing time with a linear rate-time model means:
for a target with pixel motion rate V_b, the crossing time over path L_{i,j} satisfies the linear relationship
t̂_{i,j}^b = η·L_{i,j}/V_b + θ
where η and θ are model parameters; the linear model is fitted on training data collected offline to obtain its parameters, and the parameters can be dynamically learned and updated from data collected online.
7. The space-time associated target re-identification method according to claim 1, wherein assuming that for each pair (V_b, L_{i,j}) at camera M_j the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with mean t_mean and variance σ² means:
the pixel motion rate of candidate target b is quantized into S rate levels, and a pixel rate V_b falling into a rate level is replaced by the mean rate V_mean of that level's interval; for each given condition combination (V_mean, L_{i,j}), the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with parameters (t_mean, σ²), which are fitted on training data collected offline; the model parameters can be dynamically learned and updated from data collected online.
8. The space-time associated target re-identification method according to claim 1, wherein computing based on this distribution the probability P_timespace that candidate target b appears in camera M_j at time t_e given (V_b, L_{i,j}) means:
assuming candidate target b is the target being searched, its real crossing time from camera C_i to camera M_j is t_b = t_e − t_s; since the crossing time t̂_{i,j}^b obeys a normal distribution, the probability of crossing from C_i to M_j is computed from the normal distribution model and the real crossing time as P_timespace = N(t_b; t_mean, σ²), and this probability is taken as the probability that candidate b appears in camera M_j at time t_e given (V_b, L_{i,j}).
9. A space-time associated target re-identification system, characterized by comprising:
a target detection and tracking module: for camera C_i, recording the initial time t_s of a selected target a to be searched and starting tracking; obtaining from the tracking result the pixel motion rate V_a in the image and the motion direction information;
a visual feature extraction and re-identification module: based on the results of the target detection and tracking module, extracting the visual features of the target to be searched and of each candidate target for re-identification; obtaining via GIS information the set of M cameras spatially adjacent to camera C_i and matching the advancing direction of target a, and for each adjacent camera M_j in the set obtaining the actual path length L_{i,j} from camera C_i to camera M_j via GIS or manual measurement; predicting the crossing time t_{i,j} of target a from camera C_i to the adjacent camera M_j, for the given path length L_{i,j}, with a linear rate-time model t̂_{i,j} = α·L_{i,j}/V_a + β; using the predicted crossing time t̂_{i,j}, taking the targets appearing in camera M_j within the time interval [t_s + t̂_{i,j} − k·σ̂_{i,j}, t_s + t̂_{i,j} + k·σ̂_{i,j}] as candidate targets for re-identification, where σ̂_{i,j} is the statistical standard deviation of t̂_{i,j} and k is a preset coverage multiple, i.e., t_{i,j} is assumed to obey a normal distribution whose standard deviation is obtained from training data;
a space-time association and target screening module: for each candidate target b in camera M_j, taking the globally unified time service information acquired in synchronization with its first appearance in camera M_j as the time t_e at which the target appears in camera M_j; obtaining by motion tracking the pixel motion rate V_b of each candidate target in camera M_j; likewise predicting its crossing time t̂_{i,j}^b with the linear rate-time model; for each pair (V_b, L_{i,j}) at camera M_j, assuming the crossing time t̂_{i,j}^b of candidate target b obeys a normal distribution with mean t_mean and variance σ², and, based on this distribution, computing the probability P_timespace that b appears in camera M_j at time t_e given (V_b, L_{i,j}), with (t_e − t_s) ~ N(t_mean, σ²);
a recognition probability calculation and reordering module: based on the visual features of target a and candidate target b, computing the recognition probability P_vision of each candidate b with a target re-identification method; multiplying P_vision and P_timespace for each candidate b, taking the product as the target re-identification probability, and ranking by this probability to obtain the final re-identification result.
CN201810543066.3A 2018-05-30 2018-05-30 Space-time correlated target re-identification method and system Active CN108764167B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810543066.3A CN108764167B (en) 2018-05-30 2018-05-30 Space-time correlated target re-identification method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810543066.3A CN108764167B (en) 2018-05-30 2018-05-30 Space-time correlated target re-identification method and system

Publications (2)

Publication Number Publication Date
CN108764167A CN108764167A (en) 2018-11-06
CN108764167B true CN108764167B (en) 2020-09-29

Family

ID=64004566

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810543066.3A Active CN108764167B (en) 2018-05-30 2018-05-30 Space-time correlated target re-identification method and system

Country Status (1)

Country Link
CN (1) CN108764167B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109902551A (en) * 2018-11-09 2019-06-18 阿里巴巴集团控股有限公司 The real-time stream of people's statistical method and device of open scene
CN109558831B (en) * 2018-11-27 2023-04-07 成都索贝数码科技股份有限公司 Cross-camera pedestrian positioning method fused with space-time model
CN109598240B (en) * 2018-12-05 2019-11-05 深圳市安软慧视科技有限公司 Video object quickly recognition methods and system again
CN110110598A (en) * 2019-04-01 2019-08-09 桂林电子科技大学 The pedestrian of a kind of view-based access control model feature and space-time restriction recognition methods and system again
CN110087039B (en) * 2019-04-30 2021-09-14 苏州科达科技股份有限公司 Monitoring method, device, equipment, system and storage medium
CN110264497B (en) * 2019-06-11 2021-09-17 浙江大华技术股份有限公司 Method and device for determining tracking duration, storage medium and electronic device
CN110728702B (en) * 2019-08-30 2022-05-20 深圳大学 High-speed cross-camera single-target tracking method and system based on deep learning
CN110796074B (en) * 2019-10-28 2022-08-12 桂林电子科技大学 Pedestrian re-identification method based on space-time data fusion
CN111061825B (en) * 2019-12-10 2020-12-18 武汉大学 Method for identifying matching and correlation of space-time relationship between mask and reloading camouflage identity
CN111178284A (en) * 2019-12-31 2020-05-19 珠海大横琴科技发展有限公司 Pedestrian re-identification method and system based on spatio-temporal union model of map data
CN111666823B (en) * 2020-05-14 2022-06-14 武汉大学 Pedestrian re-identification method based on individual walking motion space-time law collaborative identification
CN113688776B (en) * 2021-09-06 2023-10-20 北京航空航天大学 Space-time constraint model construction method for cross-field target re-identification


Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8098888B1 (en) * 2008-01-28 2012-01-17 Videomining Corporation Method and system for automatic analysis of the trip of people in a retail space using multiple cameras
CN103810476A (en) * 2014-02-20 2014-05-21 中国计量学院 Method for re-identifying pedestrians in video monitoring network based on small-group information correlation
CN107133575A (en) * 2017-04-13 2017-09-05 中原智慧城市设计研究院有限公司 A kind of monitor video pedestrian recognition methods again based on space-time characteristic
CN107255468A (en) * 2017-05-24 2017-10-17 纳恩博(北京)科技有限公司 Method for tracking target, target following equipment and computer-readable storage medium
CN107545256A (en) * 2017-09-29 2018-01-05 上海交通大学 A kind of camera network pedestrian recognition methods again of combination space-time and network consistency

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Shoucheng Ni, "Learning Discriminative and Shareable Patches for Scene Classification," IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016-03-21, pp. 1317-1321. *
Zhang Dongping, "Person re-identification in a camera network with distance metric learning" (距离度量学习的摄像网络中行人重识别), Journal of China University of Metrology (中国计量大学学报), 2017-02-13, Vol. 27, No. 4, pp. 424-428, 434. *

Also Published As

Publication number Publication date
CN108764167A (en) 2018-11-06

Similar Documents

Publication Publication Date Title
CN108764167B (en) Space-time correlated target re-identification method and system
CN111563442B (en) Slam method and system for fusing point cloud and camera image data based on laser radar
Wang et al. Tracklet association by online target-specific metric learning and coherent dynamics estimation
CN102542289B (en) Pedestrian volume statistical method based on plurality of Gaussian counting models
CN108256431B (en) Hand position identification method and device
Azevedo et al. Automatic vehicle trajectory extraction by aerial remote sensing
CN110660082A (en) Target tracking method based on graph convolution and trajectory convolution network learning
CN109598794B (en) Construction method of three-dimensional GIS dynamic model
CN112990310A (en) Artificial intelligence system and method for serving electric power robot
CN110874583A (en) Passenger flow statistics method and device, storage medium and electronic equipment
CN106251362B (en) A kind of sliding window method for tracking target and system based on fast correlation neighborhood characteristics point
CN109583373B (en) Pedestrian re-identification implementation method
CN106815563B (en) Human body apparent structure-based crowd quantity prediction method
CN113592905B (en) Vehicle driving track prediction method based on monocular camera
CN105825520A (en) Monocular SLAM (Simultaneous Localization and Mapping) method capable of creating large-scale map
CN115376034A (en) Motion video acquisition and editing method and device based on human body three-dimensional posture space-time correlation action recognition
CN111402632B (en) Risk prediction method for pedestrian movement track at intersection
CN110119768A (en) Visual information emerging system and method for vehicle location
CN109934096B (en) Automatic driving visual perception optimization method based on characteristic time sequence correlation
CN104182747A (en) Object detection and tracking method and device based on multiple stereo cameras
CN113076808A (en) Method for accurately acquiring bidirectional pedestrian flow through image algorithm
CN104915967B (en) The Forecasting Methodology in vehicle movement path in a kind of tunnel
JP6894395B2 (en) Information acquisition device, information aggregation system, and information aggregation device
Mancusi et al. TrackFlow: Multi-Object Tracking with Normalizing Flows
Guo et al. Research and Implementation of Robot Vision Scanning Tracking Algorithm Based on Deep Learning

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant