CN114882429A - Queue counting method and system based on fusion of multiple information characteristics - Google Patents

Queue counting method and system based on fusion of multiple information characteristics

Info

Publication number
CN114882429A
Authority
CN
China
Prior art keywords
target
matrix
similarity matrix
targets
frame
Prior art date
Legal status
Pending
Application number
CN202210424749.3A
Other languages
Chinese (zh)
Inventor
陆涛
曹颂
钟星
Current Assignee
Suzhou Super Planet Venture Capital Co ltd
Original Assignee
Suzhou Super Planet Venture Capital Co ltd
Priority date
Filing date
Publication date
Application filed by Suzhou Super Planet Venture Capital Co ltd
Priority to CN202210424749.3A
Publication of CN114882429A
Status: Pending (current)


Classifications

    • G06V 20/53 Recognition of crowd images, e.g. recognition of crowd congestion
    • G06F 18/2135 Feature extraction by transforming the feature space, based on approximation criteria, e.g. principal component analysis
    • G06F 18/22 Matching criteria, e.g. proximity measures
    • G06F 18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06N 3/045 Combinations of networks
    • G06N 3/08 Learning methods
    • G06T 7/246 Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • G06V 10/44 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; connectivity analysis, e.g. of connected components
    • G06V 10/761 Proximity, similarity or dissimilarity measures
    • G06V 10/764 Recognition using pattern recognition or machine learning, using classification, e.g. of video objects
    • G06V 10/7715 Feature extraction, e.g. by transforming the feature space; mappings, e.g. subspace methods
    • G06V 10/82 Recognition using pattern recognition or machine learning, using neural networks
    • G06V 40/103 Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • G06T 2207/10016 Video; Image sequence
    • G06T 2207/20081 Training; Learning
    • G06T 2207/20084 Artificial neural networks [ANN]
    • G06T 2207/30196 Human being; Person

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Human Computer Interaction (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a queue counting method and system based on the fusion of multiple information features. The method comprises the following steps: performing monocular visual target detection and tracking; using the tracked position information of each human head in each frame image as an extracted target and obtaining high-level features of the extracted target, the high-level features comprising the trajectory, position, time and speed features of the extracted target; performing computation of an adjacency matrix E of the extracted targets, comprising calculating a trajectory similarity matrix E_c, a position similarity matrix E_l, a time similarity matrix E_t and a speed similarity matrix E_s; and calculating the set of actual queued groups for the N frame images. As a whole, the invention addresses the need, unmet in the prior art, to cluster and distinguish same-group targets in passenger flow and queuing queues with a more sophisticated and accurate computer vision algorithm and system, helping an internet platform improve operating efficiency and dynamically understand customer information.

Description

Queue counting method and system based on fusion of multiple information characteristics
Technical Field
The invention relates to a queue counting method and a queue counting system based on fusion of various information characteristics, relates to flow statistics, and belongs to the technical field of artificial intelligence.
Background
With the rise and development of the service industry, consumer activities of all kinds are emerging one after another. During golden hours or peak periods, many business venues such as hospitals, supermarkets, banks and fast-food restaurants find it difficult to know customer dynamics accurately: queues at service windows grow too long, additional windows are not opened in time, and the actual number of groups queuing cannot be estimated. Existing monitoring technology already greatly helps stores operate; for example, people-counting cameras are very accurate, but their installation requirements are demanding and they are usually limited to hall entrances. There are also AI smart cameras for face recognition and analysis, which risk invading customer privacy and can only analyse individual targets; they cannot distinguish and analyse a group of objects belonging to the same party. In a supermarket checkout queue, for instance, many people belong to group shopping, such as couples, parents with children, or friends, so these methods cannot meet such complex counting requirements.
Therefore, a more sophisticated and accurate computer vision algorithm and system that helps an internet platform improve operating efficiency and dynamically understand customer information would play a great role in the development of the industry.
Patent 1, publication number CN109460811A, discloses a passenger flow statistics device, method and storage medium that installs a ranging sensor and uses an infrared sensor to count people entering and leaving; it is constrained by the sensor, must be installed in a specific area, has harsh installation conditions, and is less universal than an ordinary camera. Patent 2, publication number CN202058221U, discloses a passenger flow statistics device based on binocular vision, mainly used for passenger flow monitoring in public transport systems; the accuracy of the binocular matching algorithm limits the accuracy of the passenger flow statistics, and the efficiency of operation and use is difficult to guarantee. Patent 3, publication number CN104751491A, discloses a crowd tracking and pedestrian volume counting method and device that uses an RGB-D camera to track pedestrians and count passenger flow; RGB-D can provide distance information about the target, but the detection accuracy of the depth camera drops sharply for targets more than 4 m away. Patent 4, publication number CN103164858B, discloses a method for segmenting and tracking adhering crowds based on superpixels and a graph model, in which the human body is segmented using an image segmentation method together with prior knowledge of head shape, and each body region to be analysed is converted into an undirected connected graph; however, the superpixel method is not robust to scenes with changing illumination and is prone to segmentation errors.
None of the prior art, including the above patents, clusters and distinguishes same-group targets in passenger flow or queuing queues.
Disclosure of Invention
The invention aims to overcome the technical defects in the prior art, solve the technical problems and provide a queue counting method and system based on fusion of various information characteristics. A group object refers to a group of people who belong together in a queue.
The invention specifically adopts the following technical scheme: a queue counting method based on fusion of multiple information characteristics comprises the following steps:
step SS 1: performing monocular visual target detection and tracking, comprising: acquiring continuous N frames of images, detecting the head and shoulder positions of each frame of image by using a convolutional neural network, tracking multiple targets by using a Deepsort tracking algorithm, acquiring the position information of each head in each frame of image as an extraction target, and generating a target sequence T;
step SS 2: performing behavior analysis on each sequence in the target sequence T to obtain high-level features of the extracted target, wherein the high-level features comprise track features, position features, time features and speed features of the extracted target;
step SS3: performing computation of an adjacency matrix E of the extracted targets, comprising: calculating a trajectory similarity matrix E_c, a position similarity matrix E_l, a time similarity matrix E_t and a speed similarity matrix E_s, and calculating E as
E = w_c·E_c + w_l·E_l + w_t·E_t + w_s·E_s    formula (1)
wherein E = {e_ij} is a normalized n × n symmetric matrix characterizing the degree of association between the extracted targets, i.e. the probability of belonging to the same group of targets; the larger e_ij is, the greater the probability that target i and target j belong to the same group of targets; n is the number of detected targets; w_c represents the weight of the trajectory similarity matrix; w_l represents the weight of the position similarity matrix; w_t represents the weight of the time similarity matrix; w_s represents the weight of the speed similarity matrix;
step SS4: calculating the set of actual queued groups for the N frame images, comprising: calculating the Laplacian matrix L of the adjacency matrix E of the extracted targets, obtaining the optimal cluster decomposition using a graph cut algorithm, and finally outputting the set of actual queued groups.
As a preferred embodiment, step SS1 specifically includes:
the detecting of the human head-and-shoulder positions in the picture by using the convolutional-neural-network-based method specifically comprises: using a 19-layer convolutional neural network for target detection in each frame image, detecting the part of a person from the shoulders to the top of the head, and outputting target frame information l = (u_p, v_p, w_p, h_p), wherein (u_p, v_p) is the coordinate of the centre point, and w_p and h_p respectively represent the width and height of the target frame;
the tracking of the detected multiple targets by using the DeepSORT tracking algorithm specifically comprises: for each target detected in a frame image, finding a matching box in the next frame; if no matching box can be found, the current frame is taken as a starting point and counted into the track; finally, the target sequence detected in the N frame images is output as:
T = {t_1, t_2, ..., t_i, ..., t_n},
t_i = {l_s, l_(s+1), ..., l_e}, 1 ≤ s ≤ e ≤ N;
where n represents the number of detected targets and N represents the number of frames.
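To make the output of step SS1 concrete, the following Python sketch shows one possible way to hold the per-frame detection boxes l = (u_p, v_p, w_p, h_p) and the tracked target sequence T. The class names, fields and the run_detector_and_deepsort helper are illustrative assumptions, not part of the patent; the detector and the DeepSORT tracker are assumed to be supplied elsewhere.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Box:
    """One head-and-shoulder detection l = (u_p, v_p, w_p, h_p)."""
    u: float      # centre-point x coordinate
    v: float      # centre-point y coordinate
    w: float      # box width
    h: float      # box height
    frame: int    # frame index (1..N) in which the box was detected

@dataclass
class Track:
    """One tracked target t_i = {l_s, ..., l_e}, 1 <= s <= e <= N."""
    track_id: int
    boxes: List[Box] = field(default_factory=list)

    @property
    def start_frame(self) -> int:
        return self.boxes[0].frame

    @property
    def end_frame(self) -> int:
        return self.boxes[-1].frame

# The target sequence T detected over the N frame images is then simply a
# list of n tracks, one per detected target:
# T: List[Track] = run_detector_and_deepsort(frames)   # hypothetical helper
```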
As a preferred embodiment, step SS2 specifically includes: for the i-th target t_i in the target sequence T detected in the N frame images, the following high-level features are calculated:
a trajectory feature A_c characterizing the motion of the target over time, A_c = {p_i}, the set of the target's coordinates in the frames in which it is detected;
a position feature B_c characterizing where the target appears, B_c being the first coordinate of the trajectory feature A_c;
a time feature D_c, a number between 1 and N indicating the frame number from which the target appears in the picture;
a speed feature S_c, a number computed from the trajectory feature A_c according to formula (2) using the Euclidean distance Ω between the target's coordinates in consecutive frames, where p_i denotes the coordinates of the i-th target in the target sequence T.
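As an illustration of the high-level features of step SS2, the sketch below derives the trajectory, position, time and speed features from a track given as a NumPy array of per-frame centre coordinates. Reading the speed feature as the mean Euclidean distance between consecutive trajectory points is an assumption about formula (2), not the patent's exact expression.

```python
import numpy as np

def high_level_features(trajectory: np.ndarray, start_frame: int):
    """trajectory: (m, 2) array of centre points (u, v), one row per frame
    in which the target was detected; start_frame: first frame index (1..N)."""
    A_c = trajectory                  # trajectory feature: the point sequence itself
    B_c = trajectory[0]               # position feature: first coordinate of A_c
    D_c = start_frame                 # time feature: frame at which the target appears
    # speed feature: assumed here to be the mean Euclidean distance (Omega)
    # between the target's positions in consecutive frames
    steps = np.linalg.norm(np.diff(trajectory, axis=0), axis=1)
    S_c = float(steps.mean()) if len(steps) > 0 else 0.0
    return A_c, B_c, D_c, S_c

# Example: a target drifting slowly to the right, first seen in frame 12
traj = np.array([[100.0, 200.0], [101.0, 200.5], [102.2, 201.0]])
print(high_level_features(traj, start_frame=12))
```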
As a preferred embodiment, step SS3 specifically includes: the adjacency matrix E is the weighted result of the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the speed similarity matrix E_s, wherein:
trajectory similarity matrix E_c = {1 - a_ij}, where a_ij is the sum of squared differences between the trajectory feature of any target i and the trajectory feature of target j, normalized to 0 to 1; the smaller the trajectory difference between two targets, the greater the probability that they belong to the same group;
position similarity matrix E_l = {1 - l_ij}, where l_ij is the Euclidean distance between the position feature of any target i and the position feature of target j, normalized to 0 to 1; the closer the positions of two targets, the greater the probability that they belong to the same group;
time similarity matrix E_t = {1 - t_ij}, where t_ij is the L1 distance between the time feature of any target i and the time feature of target j; the closer the times at which two targets appear, the greater the probability that they belong to the same group;
speed similarity matrix E_s = {1 - s_ij}, where s_ij is the L1 distance between the speed of any target i and the speed of target j, normalized to 0 to 1; the closer the speeds of two targets, the greater the probability that they belong to the same group.
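A minimal sketch of how the four similarity matrices and the fused adjacency matrix E of formula (1) could be assembled is given below. The min-max normalization used to map each pairwise distance into 0 to 1, the equal weights, and the requirement that all trajectories be resampled to a common length are illustrative assumptions, since the patent does not fix these details.

```python
import numpy as np

def normalize01(M: np.ndarray) -> np.ndarray:
    """Scale a non-negative distance matrix into [0, 1] (assumed normalization)."""
    rng = M.max() - M.min()
    return (M - M.min()) / rng if rng > 0 else np.zeros_like(M)

def adjacency_matrix(trajs, positions, times, speeds,
                     w_c=0.25, w_l=0.25, w_t=0.25, w_s=0.25):
    """trajs: list of (m, 2) arrays resampled to a common length m;
    positions: (n, 2); times, speeds: (n,). Returns the n x n matrix E."""
    n = len(trajs)
    a = np.zeros((n, n))                       # sum of squared trajectory differences
    for i in range(n):
        for j in range(n):
            a[i, j] = np.sum((trajs[i] - trajs[j]) ** 2)
    l = np.linalg.norm(positions[:, None, :] - positions[None, :, :], axis=-1)
    t = np.abs(times[:, None] - times[None, :])      # L1 distance of time features
    s = np.abs(speeds[:, None] - speeds[None, :])    # L1 distance of speed features
    E_c = 1.0 - normalize01(a)
    E_l = 1.0 - normalize01(l)
    E_t = 1.0 - normalize01(t)
    E_s = 1.0 - normalize01(s)
    return w_c * E_c + w_l * E_l + w_t * E_t + w_s * E_s   # formula (1)
```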
As a preferred embodiment, step SS4 specifically includes:
step SS41: taking the adjacency matrix E as the relation matrix calibrating the relations between targets, the degree information matrix D is obtained, see formula (3); combining the degree information matrix D with the adjacency matrix E according to formula (4) gives the Laplacian matrix L, which is a symmetric matrix;
D = diag(d_1, ..., d_n), d_i = Σ_j e_ij    formula (3)
L = D^(-1/2) (D - E) D^(-1/2)    formula (4)
step SS42: performing SVD on the Laplacian matrix L to obtain the eigenvalue λ and eigenvector v; computing the transformed eigenvector z = D^(1/2) v; the samples i corresponding to components of z smaller than 0 are classified into one class; for the samples i' corresponding to components larger than 0, the energy matrix E' between them, the Laplacian matrix L' and the eigenvector z' are recomputed, and the samples corresponding to components of z' smaller than 0 are classified into a new class; the recursion proceeds in this manner and ends when no component larger than 0 exists in the eigenvector z' or the dimension of z' is 1; the targets classified at each step form a new target group, giving C = (c_1, c_2, ..., c_k), and finally the number of groups k of the whole queue is obtained.
The invention also provides a queue counting system based on the fusion of various information characteristics, which comprises the following components:
the target detection and tracking module specifically executes: performing monocular visual target detection and tracking, comprising: acquiring continuous N frames of images, detecting the positions of the heads and shoulders of the people in the images of each frame of image by using a convolutional neural network, tracking multiple targets by using a Deepsort tracking algorithm, acquiring position information of each head in each frame of image as an extraction target, and generating a target sequence T;
and the target feature extraction module specifically executes: performing behavior analysis on each sequence in the target sequence T to obtain high-level features of the extracted target, wherein the high-level features comprise track features, position features, time features and speed features of the extracted target;
the module for calculating the adjacency matrix specifically executes: performing computation of an adjacency matrix E of the extracted targets, comprising: calculating a trajectory similarity matrix E_c, a position similarity matrix E_l, a time similarity matrix E_t and a speed similarity matrix E_s, and calculating E as
E = w_c·E_c + w_l·E_l + w_t·E_t + w_s·E_s    formula (1)
wherein E = {e_ij} is a normalized n × n symmetric matrix characterizing the degree of association between the extracted targets, i.e. the probability of belonging to the same group of targets; the larger e_ij is, the greater the probability that target i and target j belong to the same group of targets; n is the number of detected targets; w_c represents the weight of the trajectory similarity matrix; w_l represents the weight of the position similarity matrix; w_t represents the weight of the time similarity matrix; w_s represents the weight of the speed similarity matrix;
and a module for calculating the actual number of groups, which specifically executes: calculating the set of actual queued groups for the N frame images, comprising: calculating the Laplacian matrix L of the adjacency matrix E of the extracted targets, obtaining the optimal cluster decomposition using a graph cut algorithm, and finally outputting the set of actual queued groups.
As a preferred embodiment, the target detecting and tracking module specifically includes:
the detecting of the human head-and-shoulder positions in the picture by using the convolutional-neural-network-based method specifically comprises: using a 19-layer convolutional neural network for target detection in each frame image, detecting the part of a person from the shoulders to the top of the head, and outputting target frame information l = (u_p, v_p, w_p, h_p), wherein (u_p, v_p) is the coordinate of the centre point, and w_p and h_p respectively represent the width and height of the target frame;
the tracking of the detected multiple targets by using the DeepSORT tracking algorithm specifically comprises: for each target detected in a frame image, finding a matching box in the next frame; if no matching box can be found, the current frame is taken as a starting point and counted into the track; finally, the target sequence detected in the N frame images is output as:
T = {t_1, t_2, ..., t_i, ..., t_n},
t_i = {l_s, l_(s+1), ..., l_e}, 1 ≤ s ≤ e ≤ N;
where n represents the number of detected targets and N represents the number of frames.
As a preferred embodiment, the target feature extraction module specifically comprises: for the i-th target t_i in the target sequence T detected in the N frame images, the following high-level features are calculated:
a trajectory feature A_c characterizing the motion of the target over time, A_c = {p_i}, the set of the target's coordinates in the frames in which it is detected;
a position feature B_c characterizing where the target appears, B_c being the first coordinate of the trajectory feature A_c;
a time feature D_c, a number between 1 and N indicating the frame number from which the target appears in the picture;
a speed feature S_c, a number computed from the trajectory feature A_c according to formula (2) using the Euclidean distance Ω between the target's coordinates in consecutive frames, where p_i denotes the coordinates of the i-th target in the target sequence T.
As a preferred embodiment, the module for calculating the adjacency matrix specifically comprises: the adjacency matrix E is the weighted result of the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the speed similarity matrix E_s, wherein:
trajectory similarity matrix E_c = {1 - a_ij}, where a_ij is the sum of squared differences between the trajectory feature of any target i and the trajectory feature of target j, normalized to 0 to 1; the smaller the trajectory difference between two targets, the greater the probability that they belong to the same group;
position similarity matrix E_l = {1 - l_ij}, where l_ij is the Euclidean distance between the position feature of any target i and the position feature of target j, normalized to 0 to 1; the closer the positions of two targets, the greater the probability that they belong to the same group;
time similarity matrix E_t = {1 - t_ij}, where t_ij is the L1 distance between the time feature of any target i and the time feature of target j; the closer the times at which two targets appear, the greater the probability that they belong to the same group;
speed similarity matrix E_s = {1 - s_ij}, where s_ij is the L1 distance between the speed of any target i and the speed of target j, normalized to 0 to 1; the closer the speeds of two targets, the greater the probability that they belong to the same group.
As a preferred embodiment, the module for calculating the actual number of groups specifically comprises: taking the adjacency matrix E as the relation matrix calibrating the relations between targets, the degree information matrix D is obtained, see formula (3); combining the degree information matrix D with the adjacency matrix E according to formula (4) gives the Laplacian matrix L, which is a symmetric matrix;
D = diag(d_1, ..., d_n), d_i = Σ_j e_ij    formula (3)
L = D^(-1/2) (D - E) D^(-1/2)    formula (4)
performing SVD on the Laplacian matrix L to obtain the eigenvalue λ and eigenvector v; computing the transformed eigenvector z = D^(1/2) v; the samples i corresponding to components of z smaller than 0 are classified into one class; for the samples i' corresponding to components larger than 0, the energy matrix E' between them, the Laplacian matrix L' and the eigenvector z' are recomputed, and the samples corresponding to components of z' smaller than 0 are classified into a new class; the recursion proceeds in this manner and ends when no component larger than 0 exists in the eigenvector z' or the dimension of z' is 1; the targets classified at each step form a new target group, giving C = (c_1, c_2, ..., c_k), and finally the number of groups k of the whole queue is obtained.
The invention achieves the following beneficial effects: aiming at the technical need, unmet by the prior art, of clustering and distinguishing same-group targets in passenger flow and queuing queues, the invention uses a more sophisticated and accurate computer vision algorithm and system to help an internet platform improve operating efficiency and dynamically understand customer information. It performs monocular visual target detection and tracking, comprising: acquiring N consecutive frame images, detecting the head-and-shoulder positions in each frame image using a convolutional-neural-network-based method, and tracking the detected multiple targets with the DeepSORT tracking algorithm to obtain a target sequence T; it extracts the high-level features of each target in T, the high-level features comprising the trajectory, position, time and speed features of the extracted target; it performs computation of the adjacency matrix E of the extracted targets, comprising calculating the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the speed similarity matrix E_s; and it calculates the set of actual queued groups for the N frame images, comprising: calculating the Laplacian matrix L of the adjacency matrix E of the extracted targets, obtaining the optimal cluster decomposition using a graph cut algorithm, and finally outputting the set of actual queued groups. Performing passenger flow statistics and counting with computer vision technology is of great benefit to the video surveillance industry. Moreover, the invention is the first to propose decomposing and clustering the targets in a picture by means of feature extraction, target transformation and graph theory to calculate the actual number of queued groups, which plays an important role in many practical scenes and, as a whole, fills the technical blind spot that the prior art does not cluster and distinguish same-group targets in passenger flow and queuing queues.
Drawings
FIG. 1 is a flow chart of a queue counting method based on fusion of multiple information characteristics according to the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and the protection scope of the present invention is not limited thereby.
Example 1: as shown in fig. 1, the present invention provides a queue counting method based on multiple information feature fusion, which includes the following steps:
step SS 1: performing monocular visual target detection and tracking, comprising: designing a buffer queue T to store N frames of continuous images for subsequent detection, tracking, feature extraction and adjacency matrix calculation, detecting the head and shoulder positions of each frame of image by using a convolutional neural network-based method, tracking multiple targets by using a Deepsort tracking algorithm, and acquiring the position information of each head in each frame of image as a target sequence T detected in the N frames of image; the size of N is set according to different scenes, and is generally set between 150 and 250;
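One way to realise the buffer queue of N consecutive frames described in step SS1 is a fixed-length deque that hands a full window to the later detection, tracking, feature-extraction and adjacency-matrix stages. The sketch below is illustrative, and process_window is a hypothetical callback standing in for those downstream steps.

```python
from collections import deque

N = 200  # buffer length, typically set between 150 and 250 depending on the scene

frame_buffer = deque(maxlen=N)  # buffer queue holding the N most recent frames

def on_new_frame(frame, process_window):
    """Push a frame; once N consecutive frames are buffered, hand the whole
    window to the downstream pipeline (detection, tracking, features, E, groups)."""
    frame_buffer.append(frame)
    if len(frame_buffer) == N:
        process_window(list(frame_buffer))  # hypothetical downstream pipeline
        frame_buffer.clear()                # start collecting the next window
```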
step SS 2: extracting high-level features of the target by utilizing the position information of each tracked human head in each frame image, wherein the high-level features comprise track features, position features, time features and speed features of the extracted target;
step SS3: performing computation of an adjacency matrix E of the extracted targets, comprising: calculating a trajectory similarity matrix E_c, a position similarity matrix E_l, a time similarity matrix E_t and a speed similarity matrix E_s, and calculating E as
E = w_c·E_c + w_l·E_l + w_t·E_t + w_s·E_s    formula (1)
wherein E = {e_ij} is a normalized n × n symmetric matrix characterizing the degree of association between the extracted targets, i.e. the probability of belonging to the same group of targets; the larger e_ij is, the greater the probability that target i and target j belong to the same group of targets; n is the number of detected targets; w_c represents the weight of the trajectory similarity matrix; w_l represents the weight of the position similarity matrix; w_t represents the weight of the time similarity matrix; w_s represents the weight of the speed similarity matrix;
step SS4: calculating the set of actual queued groups for the N frame images, comprising: calculating the Laplacian matrix L of the adjacency matrix E of the extracted targets, obtaining the optimal cluster decomposition using a graph cut algorithm, and finally outputting the set of actual queued groups.
As a preferred embodiment, step SS1 specifically includes:
the detecting of the human head-and-shoulder positions in the picture by using the convolutional-neural-network-based method specifically comprises: in each frame image, a 19-layer convolutional neural network is used for target detection, only the part of a person from the shoulders to the top of the head is detected, and target frame information l = (u_p, v_p, w_p, h_p) is output, wherein (u_p, v_p) is the coordinate of the centre point, and w_p and h_p respectively represent the width and height of the target frame;
the tracking of the detected multiple targets by using the DeepSORT tracking algorithm specifically comprises: for each target detected in a frame image, finding a matching box in the next frame; if no matching box can be found, the current frame is taken as a starting point and counted into the track; finally, the target sequence detected in the N frame images is output as:
T = {t_1, t_2, ..., t_i, ..., t_n},
t_i = {l_s, l_(s+1), ..., l_e}, 1 ≤ s ≤ e ≤ N;
where n represents the number of detected targets and N represents the number of frames.
As a preferred embodiment, step SS2 specifically includes: for the i-th target t_i in the target sequence T detected in the N frame images, the following high-level features are calculated:
a trajectory feature A_c characterizing the motion of the target over time, A_c = {p_i}, the set of the target's coordinates in the frames in which it is detected;
any detected target necessarily leaves a trajectory point in each frame image in which it appears;
a position feature B_c characterizing where the target appears, B_c being the first coordinate of the trajectory feature A_c;
a time feature D_c, a number between 1 and N indicating the frame number from which the target appears in the picture;
a speed feature S_c, a number computed from the trajectory feature A_c according to formula (2) using the Euclidean distance Ω between the target's coordinates in consecutive frames, where p_i denotes the coordinates of the i-th target in the target sequence T.
As a preferred embodiment, step SS3 specifically includes: the adjacency matrix E is the weighted result of the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the speed similarity matrix E_s, wherein:
trajectory similarity matrix E_c = {1 - a_ij}, where a_ij is the sum of squared differences between the trajectory feature of any target i and the trajectory feature of target j, normalized to 0 to 1; the smaller the trajectory difference between two targets, the greater the probability that they belong to the same group;
position similarity matrix E_l = {1 - l_ij}, where l_ij is the Euclidean distance between the position feature of any target i and the position feature of target j, normalized to 0 to 1; the closer the positions of two targets, the greater the probability that they belong to the same group;
time similarity matrix E_t = {1 - t_ij}, where t_ij is the L1 distance between the time feature of any target i and the time feature of target j; the closer the times at which two targets appear, the greater the probability that they belong to the same group;
speed similarity matrix E_s = {1 - s_ij}, where s_ij is the L1 distance between the speed of any target i and the speed of target j, normalized to 0 to 1; the closer the speeds of two targets, the greater the probability that they belong to the same group.
As a preferred embodiment, step SS4 specifically includes:
step SS41: taking the adjacency matrix E as the relation matrix calibrating the relations between targets, the degree information matrix D is obtained, see formula (3); combining the degree information matrix D with the adjacency matrix E according to formula (4) gives the Laplacian matrix L, which is a symmetric matrix;
D = diag(d_1, ..., d_n), d_i = Σ_j e_ij    formula (3)
L = D^(-1/2) (D - E) D^(-1/2)    formula (4)
step SS42: performing SVD on the Laplacian matrix L to obtain the eigenvalue λ and eigenvector v; computing the transformed eigenvector z = D^(1/2) v; the samples i corresponding to components of z smaller than 0 are classified into one class; for the samples i' corresponding to components larger than 0, the energy matrix E' between them, the Laplacian matrix L' and the eigenvector z' are recomputed, and the samples corresponding to components of z' smaller than 0 are classified into a new class; the recursion proceeds in this manner and ends when no component larger than 0 exists in the eigenvector z' or the dimension of z' is 1; the targets classified at each step form a new target group, giving C = (c_1, c_2, ..., c_k), and finally the number of groups k of the whole queue is obtained.
The invention also provides a queue counting system based on the fusion of multiple information features, which comprises the following modules:
the target detection and tracking module specifically executes: performing monocular visual target detection and tracking, comprising: acquiring N frames of images, detecting the positions of the heads and shoulders of the people in the images of each frame of image by using a convolutional neural network-based method, tracking the detected multiple targets by using a Deepsort tracking algorithm, and acquiring the position information of each head in each frame of image as a target sequence T detected in the N frames of images;
and the target feature extraction module specifically executes: the method comprises the steps of using the position information of each tracked human head in each frame image as an extraction target, and obtaining high-level features of the extraction target, wherein the high-level features comprise track features, position features, time features and speed features of the extraction target;
the module for calculating the adjacency matrix specifically executes: performing computation of an adjacency matrix E of the extracted targets, comprising: calculating a trajectory similarity matrix E_c, a position similarity matrix E_l, a time similarity matrix E_t and a speed similarity matrix E_s, and calculating E as
E = w_c·E_c + w_l·E_l + w_t·E_t + w_s·E_s    formula (1)
wherein E = {e_ij} is a normalized n × n symmetric matrix characterizing the degree of association between the extracted targets, i.e. the probability of belonging to the same group of targets; the larger e_ij is, the greater the probability that target i and target j belong to the same group of targets; n is the number of detected targets; w_c represents the weight of the trajectory similarity matrix; w_l represents the weight of the position similarity matrix; w_t represents the weight of the time similarity matrix; w_s represents the weight of the speed similarity matrix;
and a module for calculating the actual number of groups, which specifically executes: calculating the set of actual queued groups for the N frame images, comprising: calculating the Laplacian matrix L of the adjacency matrix E of the extracted targets, obtaining the optimal cluster decomposition using a graph cut algorithm, and finally outputting the set of actual queued groups.
As a preferred embodiment, the target detecting and tracking module specifically includes:
the detecting of the human head-and-shoulder positions in the picture by using the convolutional-neural-network-based method specifically comprises: using a 19-layer convolutional neural network for target detection in each frame image, detecting the part of a person from the shoulders to the top of the head, and outputting target frame information l = (u_p, v_p, w_p, h_p), wherein (u_p, v_p) is the coordinate of the centre point, and w_p and h_p respectively represent the width and height of the target frame;
the tracking of the detected multiple targets by using the DeepSORT tracking algorithm specifically comprises: for each target detected in a frame image, finding a matching box in the next frame; if no matching box can be found, the current frame is taken as a starting point and counted into the track; finally, the target sequence detected in the N frame images is output as:
T = {t_1, t_2, ..., t_i, ..., t_n},
t_i = {l_s, l_(s+1), ..., l_e}, 1 ≤ s ≤ e ≤ N;
where n represents the number of detected targets and N represents the number of frames.
As a preferred embodiment, the target feature extraction module specifically comprises: for the i-th target t_i in the target sequence T detected in the N frame images, the following high-level features are calculated:
a trajectory feature A_c characterizing the motion of the target over time, A_c = {p_i}, the set of the target's coordinates in the frames in which it is detected;
a position feature B_c characterizing where the target appears, B_c being the first coordinate of the trajectory feature A_c;
a time feature D_c, a number between 1 and N indicating the frame number from which the target appears in the picture;
a speed feature S_c, a number computed from the trajectory feature A_c according to formula (2) using the Euclidean distance Ω between the target's coordinates in consecutive frames, where p_i denotes the coordinates of the i-th target in the target sequence T.
As a preferred embodiment, the module for calculating the adjacency matrix specifically comprises: the adjacency matrix E is the weighted result of the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the speed similarity matrix E_s, wherein:
trajectory similarity matrix E_c = {1 - a_ij}, where a_ij is the sum of squared differences between the trajectory feature of any target i and the trajectory feature of target j, normalized to 0 to 1; the smaller the trajectory difference between two targets, the greater the probability that they belong to the same group;
position similarity matrix E_l = {1 - l_ij}, where l_ij is the Euclidean distance between the position feature of any target i and the position feature of target j, normalized to 0 to 1; the closer the positions of two targets, the greater the probability that they belong to the same group;
time similarity matrix E_t = {1 - t_ij}, where t_ij is the L1 distance between the time feature of any target i and the time feature of target j; the closer the times at which two targets appear, the greater the probability that they belong to the same group;
speed similarity matrix E_s = {1 - s_ij}, where s_ij is the L1 distance between the speed of any target i and the speed of target j, normalized to 0 to 1; the closer the speeds of two targets, the greater the probability that they belong to the same group.
As a preferred embodiment, the module for calculating the actual number of groups specifically comprises: taking the adjacency matrix E as the relation matrix calibrating the relations between targets, the degree information matrix D is obtained, see formula (3); combining the degree information matrix D with the adjacency matrix E according to formula (4) gives the Laplacian matrix L, which is a symmetric matrix;
D = diag(d_1, ..., d_n), d_i = Σ_j e_ij    formula (3)
L = D^(-1/2) (D - E) D^(-1/2)    formula (4)
performing SVD on the Laplacian matrix L to obtain the eigenvalue λ and eigenvector v; computing the transformed eigenvector z = D^(1/2) v; the samples i corresponding to components of z smaller than 0 are classified into one class; for the samples i' corresponding to components larger than 0, the energy matrix E' between them, the Laplacian matrix L' and the eigenvector z' are recomputed, and the samples corresponding to components of z' smaller than 0 are classified into a new class; the recursion proceeds in this manner and ends when no component larger than 0 exists in the eigenvector z' or the dimension of z' is 1; the targets classified at each step form a new target group, giving C = (c_1, c_2, ..., c_k), and finally the number of groups k of the whole queue is obtained.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting the same, and although the present invention is described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that: modifications and equivalents may be made to the embodiments of the invention without departing from the spirit and scope of the invention, which is to be covered by the claims.

Claims (10)

1. A queue counting method based on fusion of multiple information characteristics is characterized by comprising the following steps:
step SS 1: performing monocular visual target detection and tracking, comprising: acquiring continuous N frames of images, detecting the positions of the heads and shoulders of the people in the images of each frame of image by using a convolutional neural network, tracking multiple targets by using a Deepsort tracking algorithm, acquiring position information of each head in each frame of image as an extraction target, and generating a target sequence T;
step SS 2: performing behavior analysis on each sequence in the target sequence T to obtain high-level features of the extracted target, wherein the high-level features comprise track features, position features, time features and speed features of the extracted target;
step SS3: performing computation of an adjacency matrix E of the extracted targets, comprising: calculating a trajectory similarity matrix E_c, a position similarity matrix E_l, a time similarity matrix E_t and a speed similarity matrix E_s, and calculating E as
E = w_c·E_c + w_l·E_l + w_t·E_t + w_s·E_s    formula (1)
wherein E = {e_ij} is a normalized n × n symmetric matrix characterizing the degree of association between the extracted targets, i.e. the probability of belonging to the same group of targets; the larger e_ij is, the greater the probability that target i and target j belong to the same group of targets; n is the number of detected targets; w_c represents the weight of the trajectory similarity matrix; w_l represents the weight of the position similarity matrix; w_t represents the weight of the time similarity matrix; w_s represents the weight of the speed similarity matrix;
step SS4: calculating the set of actual queued groups for the N frame images, comprising: calculating the Laplacian matrix L of the adjacency matrix E of the extracted targets, obtaining the optimal cluster decomposition using a graph cut algorithm, and finally outputting the set of actual queued groups.
2. The method according to claim 1, wherein step SS1 specifically comprises:
the detecting of the human head-and-shoulder positions in the picture by using the convolutional-neural-network-based method specifically comprises: using a 19-layer convolutional neural network for target detection in each frame image, detecting the part of a person from the shoulders to the top of the head, and outputting target frame information l = (u_p, v_p, w_p, h_p), wherein (u_p, v_p) is the coordinate of the centre point, and w_p and h_p respectively represent the width and height of the target frame;
the tracking of the detected multiple targets by using the DeepSORT tracking algorithm specifically comprises: for each target detected in a frame image, finding a matching box in the next frame; if no matching box can be found, the current frame is taken as a starting point and counted into the track; finally, the target sequence detected in the N frame images is output as:
T = {t_1, t_2, ..., t_i, ..., t_n},
t_i = {l_s, l_(s+1), ..., l_e}, 1 ≤ s ≤ e ≤ N;
where n represents the number of detected targets and N represents the number of frames.
3. The method according to claim 1, wherein step SS2 specifically comprises: for the i-th target t_i in the target sequence T detected in the N frame images, the following high-level features are calculated:
a trajectory feature A_c characterizing the motion of the target over time, A_c = {p_i}, the set of the target's coordinates in the frames in which it is detected;
a position feature B_c characterizing where the target appears, B_c being the first coordinate of the trajectory feature A_c;
a time feature D_c, a number between 1 and N indicating the frame number from which the target appears in the picture;
a speed feature S_c, a number computed from the trajectory feature A_c according to formula (2) using the Euclidean distance Ω between the target's coordinates in consecutive frames, where p_i denotes the coordinates of the i-th target in the target sequence T.
4. The method according to claim 1, wherein step SS3 specifically comprises: the adjacency matrix E is the weighted result of the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the speed similarity matrix E_s, wherein:
trajectory similarity matrix E_c = {1 - a_ij}, where a_ij is the sum of squared differences between the trajectory feature of any target i and the trajectory feature of target j, normalized to 0 to 1; the smaller the trajectory difference between two targets, the greater the probability that they belong to the same group;
position similarity matrix E_l = {1 - l_ij}, where l_ij is the Euclidean distance between the position feature of any target i and the position feature of target j, normalized to 0 to 1; the closer the positions of two targets, the greater the probability that they belong to the same group;
time similarity matrix E_t = {1 - t_ij}, where t_ij is the L1 distance between the time feature of any target i and the time feature of target j; the closer the times at which two targets appear, the greater the probability that they belong to the same group;
speed similarity matrix E_s = {1 - s_ij}, where s_ij is the L1 distance between the speed of any target i and the speed of target j, normalized to 0 to 1; the closer the speeds of two targets, the greater the probability that they belong to the same group.
5. The method according to claim 1, wherein step SS4 specifically comprises:
step SS41: taking the adjacency matrix E as the relation matrix calibrating the relations between targets, the degree information matrix D is obtained, see formula (3); combining the degree information matrix D with the adjacency matrix E according to formula (4) gives the Laplacian matrix L, which is a symmetric matrix;
D = diag(d_1, ..., d_n), d_i = Σ_j e_ij    formula (3)
L = D^(-1/2) (D - E) D^(-1/2)    formula (4)
step SS42: performing SVD on the Laplacian matrix L to obtain the eigenvalue λ and eigenvector v; computing the transformed eigenvector z = D^(1/2) v; the samples i corresponding to components of z smaller than 0 are classified into one class; for the samples i' corresponding to components larger than 0, the energy matrix E' between them, the Laplacian matrix L' and the eigenvector z' are recomputed, and the samples corresponding to components of z' smaller than 0 are classified into a new class; the recursion proceeds in this manner and ends when no component larger than 0 exists in the eigenvector z' or the dimension of z' is 1; the targets classified at each step form a new target group, giving C = (c_1, c_2, ..., c_k), and finally the number of groups k of the whole queue is obtained.
6. A queue counting system based on fusion of multiple information features, comprising:
the target detection and tracking module specifically executes: performing monocular visual target detection and tracking, comprising: acquiring continuous N frames of images, detecting the positions of the heads and shoulders of the people in the images of each frame of image by using a convolutional neural network, tracking multiple targets by using a Deepsort tracking algorithm, acquiring position information of each head in each frame of image as an extraction target, and generating a target sequence T;
the target feature extraction module, which specifically executes: performing behavior analysis on each sequence in the target sequence T to obtain high-level features of the extracted targets, the high-level features comprising the trajectory feature, position feature, time feature and speed feature of each extracted target;
the module for calculating the adjacency matrix, which specifically executes: computing the adjacency matrix E of the extracted targets, comprising: calculating the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the velocity similarity matrix E_s, E being calculated in the following manner:
E = w_c · E_c + w_l · E_l + w_t · E_t + w_s · E_s
wherein E = {e_ij} is a normalized n × n symmetric matrix representing the degree of association between the extracted targets, i.e. the probability of belonging to the same group of targets; the larger e_ij is, the greater the probability that target i and target j belong to the same group of targets; n is the number of detected targets; w_c denotes the weight of the trajectory similarity matrix; w_l denotes the weight of the position similarity matrix; w_t denotes the weight of the time similarity matrix; and w_s denotes the weight of the velocity similarity matrix;
and a module for calculating the actual group number, which specifically executes: calculating the set of actually queued groups in the N frames of images, comprising: calculating the Laplacian matrix L of the adjacency matrix E of the extracted targets, obtaining the optimal clustering decomposition by using a graph cut algorithm, and finally outputting the set of actually queued group numbers.
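Put together, the data flow through the four modules of this claim could look like the sketch below; the detector and tracker interfaces and the extract_features helper are hypothetical placeholders, and the sketch reuses adjacency_matrix and spectral_groups from the sketches after claims 4 and 5.

    def count_queue_groups(frames, detector, tracker, weights):
        # Module 1: detection and tracking over N frames -> target sequence T
        for frame in frames:
            boxes = detector(frame)      # hypothetical: head-shoulder boxes per frame
            tracker.update(boxes)        # hypothetical: multi-target tracker update
        T = tracker.tracks()             # hypothetical accessor for finished tracks

        # Module 2: per-target high-level features (trajectory, position, time, speed)
        traj, pos, t_first, speed = extract_features(T)   # hypothetical helper

        # Module 3: adjacency matrix from the fused similarity matrices
        E = adjacency_matrix(traj, pos, t_first, speed, *weights)

        # Module 4: graph-cut / spectral clustering -> set of queued groups and count k
        groups = spectral_groups(E)
        return len(groups), groups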
7. The queue counting system based on fusion of multiple information features according to claim 6, wherein the target detection and tracking module specifically comprises:
the detecting of the human head-and-shoulder positions in the picture by using the convolutional-neural-network-based method specifically comprises: using a 19-layer convolutional neural network for target detection in each frame of image, detecting the part from the shoulders to the top of the head, and outputting target frame information l = (u_p, v_p, w_p, h_p), wherein (u_p, v_p) are the coordinates of the center point, and w_p and h_p respectively denote the width and height of the target frame;
the tracking of the detected multiple targets by using the DeepSORT tracking algorithm specifically comprises: for the targets detected in each frame of image, finding a matching frame in the next frame; if no matching frame can be found, the current frame is taken as a starting point counted into the track; the target sequence detected in the N frames of images is finally output as:
T = {t_1, t_2, ..., t_i, ..., t_n},
t_i = {l_s, l_(s+1), ..., l_e}, 1 ≤ s ≤ e ≤ N;
where n represents the number of detected targets and N represents the number of frames.
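One plain-Python representation of the target sequence T defined above is sketched here; the Box and Track containers are illustrative data structures, not the interface of any particular DeepSORT implementation.

    from dataclasses import dataclass, field
    from typing import List

    @dataclass
    class Box:               # one target frame l = (u_p, v_p, w_p, h_p)
        u: float             # center-point x coordinate
        v: float             # center-point y coordinate
        w: float             # width of the target frame
        h: float             # height of the target frame

    @dataclass
    class Track:             # one t_i = {l_s, ..., l_e}, 1 <= s <= e <= N
        start_frame: int                                  # s, where the track begins
        boxes: List[Box] = field(default_factory=list)    # l_s, l_(s+1), ..., l_e

        @property
        def end_frame(self) -> int:
            return self.start_frame + len(self.boxes) - 1  # e

    # T = {t_1, ..., t_n}: one Track per detected head-shoulder target
    T: List[Track] = []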
8. The queue counting system based on fusion of multiple information features according to claim 6, wherein the target feature extraction module specifically comprises: for the i-th target t_i in the target sequence T detected in the N frames of images, the following high-level features are calculated:
the trajectory feature A_c for characterizing the motion of the target over time (formula image FDA0003609220170000051);
the position feature B_c, characterizing the position of the target, B_c being the first coordinate of the trajectory feature A_c;
the time feature D_c, D_c being a number between 1 and N indicating the frame at which the target first appears in the picture;
the speed feature S_c, a number calculated from the trajectory feature A_c, see formula (2) (formula image FDA0003609220170000052);
wherein Ω represents the Euclidean distance, and the symbol shown in formula image FDA0003609220170000053 represents the coordinates of the i-th target in the target sequence T.
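Because formulas (1) and (2) survive only as images in this filing, the sketch below assumes the trajectory feature is the sequence of box-center coordinates and the speed feature is the mean Euclidean displacement between consecutive points; both choices and the helper names are assumptions, not the claimed definitions.

    import numpy as np

    def track_features(track, n_frames):
        # track: a Track from the sketch after claim 7; n_frames: N
        # A_c: trajectory feature, assumed to be the sequence of box centers
        A_c = np.array([[b.u, b.v] for b in track.boxes])

        # B_c: position feature, the first coordinate of the trajectory feature
        B_c = A_c[0]

        # D_c: time feature, frame index in 1..N at which the target first appears
        D_c = track.start_frame
        assert 1 <= D_c <= n_frames

        # S_c: speed feature, assumed mean Euclidean distance (omega) between
        # consecutive trajectory points, standing in for formula (2)
        steps = np.linalg.norm(np.diff(A_c, axis=0), axis=1)
        S_c = float(steps.mean()) if len(steps) else 0.0

        return A_c, B_c, D_c, S_c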
9. The queue counting system based on fusion of multiple information features according to claim 6, wherein the module for calculating the adjacency matrix specifically comprises: the adjacency matrix E results from weighting the trajectory similarity matrix E_c, the position similarity matrix E_l, the time similarity matrix E_t and the velocity similarity matrix E_s, wherein:
the trajectory similarity matrix E_c = {1 - a_ij}, a_ij being the sum of squared differences between the trajectory feature of any target i and the trajectory feature of any target j, normalized to 0 to 1; the smaller the trajectory difference between two targets, the higher the probability that they belong to the same group;
the position similarity matrix E_l = {1 - l_ij}, l_ij being the Euclidean distance between the position feature of any target i and the position feature of any target j, normalized to 0 to 1; the closer the positions of two targets, the higher the probability that they belong to the same group;
the time similarity matrix E_t = {1 - t_ij}, t_ij being the l1 distance between the time feature of any target i and the time feature of any target j, normalized to 0 to 1; the closer the appearance times of two targets, the higher the probability that they belong to the same group;
the velocity similarity matrix E_s = {1 - s_ij}, s_ij being the l1 distance between the velocity of any target i and the velocity of any target j, normalized to 0 to 1; the closer the velocities of two targets, the higher the probability that they belong to the same group.
10. The queue counting system based on fusion of multiple information features according to claim 6, wherein the module for calculating the actual group number specifically comprises: taking the adjacency matrix E as a relation matrix characterizing the relations between the targets, obtaining a degree information matrix D as shown in formula (3); combining the degree information matrix D with the adjacency matrix E according to formula (4) to obtain the Laplacian matrix L, which is a symmetric matrix;
D = diag(d_1, d_2, ..., d_n), where d_i = Σ_j e_ij    formula (3)
L = D^(-1/2) (D - E) D^(-1/2)    formula (4)
performing SVD on the Laplacian matrix L to obtain the eigenvalues λ and eigenvectors v; computing the transformed eigenvector z = D^(1/2) v; classifying the samples i whose components in z are smaller than 0 into one class; for the samples i' whose components are larger than 0, recomputing the energy matrix E' between them, the Laplacian matrix L' and the eigenvector z', and classifying the samples whose components in z' are smaller than 0 into a new class; proceeding recursively in this manner until z' contains no component larger than 0 or the dimension of z' is 1; the targets classified at each step form a new target group, giving C = (c_1, c_2, ..., c_k), and finally the group number k of the whole queue is obtained.
CN202210424749.3A 2022-04-22 2022-04-22 Queue counting method and system based on fusion of multiple information characteristics Pending CN114882429A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210424749.3A CN114882429A (en) 2022-04-22 2022-04-22 Queue counting method and system based on fusion of multiple information characteristics

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210424749.3A CN114882429A (en) 2022-04-22 2022-04-22 Queue counting method and system based on fusion of multiple information characteristics

Publications (1)

Publication Number Publication Date
CN114882429A true CN114882429A (en) 2022-08-09

Family

ID=82671470

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210424749.3A Pending CN114882429A (en) 2022-04-22 2022-04-22 Queue counting method and system based on fusion of multiple information characteristics

Country Status (1)

Country Link
CN (1) CN114882429A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116561372A (en) * 2023-07-03 2023-08-08 北京瑞莱智慧科技有限公司 Personnel gear gathering method and device based on multiple algorithm engines and readable storage medium
CN116561372B (en) * 2023-07-03 2023-09-29 北京瑞莱智慧科技有限公司 Personnel gear gathering method and device based on multiple algorithm engines and readable storage medium

Similar Documents

Publication Publication Date Title
Sun et al. Benchmark data and method for real-time people counting in cluttered scenes using depth sensors
CN107967451B (en) Method for counting crowd of still image
Kuo et al. How does person identity recognition help multi-person tracking?
WO2017122258A1 (en) Congestion-state-monitoring system
CN111191667B (en) Crowd counting method based on multiscale generation countermeasure network
US20170169297A1 (en) Computer-vision-based group identification
Seow et al. Neural network based skin color model for face detection
Mukherjee et al. A novel framework for automatic passenger counting
CN109325404A (en) A kind of demographic method under public transport scene
CN113011367A (en) Abnormal behavior analysis method based on target track
CN111709300B (en) Crowd counting method based on video image
Guo et al. Improved hand tracking system
CN112232199A (en) Wearing mask detection method based on deep learning
CN111353338B (en) Energy efficiency improvement method based on business hall video monitoring
de Almeida et al. Change detection in human crowds
CN114882429A (en) Queue counting method and system based on fusion of multiple information characteristics
Farhood et al. Counting people based on linear, weighted, and local random forests
Ilao et al. Crowd estimation using region-specific HOG With SVM
US20230154192A1 (en) Person flow analysis method and system, electronic device, and readable storage medium
CN106845361B (en) Pedestrian head identification method and system
Padmashini et al. Vision based algorithm for people counting using deep learning
Pane et al. A people counting system for business analytics
Arivazhagan Versatile loitering detection based on non-verbal cues using dense trajectory descriptors
Schmuck et al. Robocentric conversational group discovery
Lee et al. Fast people counting using sampled motion statistics

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination