CN116403180A - 4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning - Google Patents

4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning

Info

Publication number
CN116403180A
Authority
CN
China
Prior art keywords
tracking
millimeter wave
features
target detection
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310647626.0A
Other languages
Chinese (zh)
Other versions
CN116403180B (en)
Inventor
娄慧丽
陆新飞
薛旦
史颂华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Geometry Partner Intelligent Driving Co ltd
Original Assignee
Shanghai Geometry Partner Intelligent Driving Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Geometry Partner Intelligent Driving Co ltd filed Critical Shanghai Geometry Partner Intelligent Driving Co ltd
Priority to CN202310647626.0A priority Critical patent/CN116403180B/en
Publication of CN116403180A publication Critical patent/CN116403180A/en
Application granted granted Critical
Publication of CN116403180B publication Critical patent/CN116403180B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/56Context or environment of the image exterior to a vehicle by using sensors mounted on the vehicle
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S13/00Systems using the reflection or reradiation of radio waves, e.g. radar systems; Analogous systems using reflection or reradiation of waves whose nature or wavelength is irrelevant or unspecified
    • G01S13/02Systems using reflection of radio waves, e.g. primary radar systems; Analogous systems
    • G01S13/50Systems of measurement based on relative movement of target
    • G01S13/58Velocity or trajectory determination systems; Sense-of-movement determination systems
    • G01S13/585Velocity or trajectory determination systems; Sense-of-movement determination systems processing the video signal in order to evaluate or display the velocity value
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S7/00Details of systems according to groups G01S13/00, G01S15/00, G01S17/00
    • G01S7/02Details of systems according to groups G01S13/00, G01S15/00, G01S17/00 of systems according to group G01S13/00
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/766Arrangements for image or video recognition or understanding using pattern recognition or machine learning using regression, e.g. by projecting features on hyperplanes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Probability & Statistics with Applications (AREA)
  • Radar Systems Or Details Thereof (AREA)

Abstract

The invention relates to a method for realizing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model, wherein the method comprises the following steps: performing point cloud clipping on the currently acquired front and rear frames of millimeter wave radar point clouds; extracting features from the selected input features by using a feature pyramid structure; extracting target key points from the extracted features by using the detection head of the anchor-free CenterNet; performing cross-attention fusion matching on the acquired target key points by using a Transformer attention mechanism; and acquiring the classification, regression, tracking confidence and tracking position information of target detection through the multi-task detection head. The invention also relates to a corresponding device, a processor and a storage medium thereof. The method, device, processor and storage medium for realizing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model have obvious technical advantages in usage scenarios and complexity.

Description

4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning
Technical Field
The invention relates to the technical field of automatic driving, in particular to the technical field of radar-based perception, and more particularly to a method, a device, a processor and a computer readable storage medium for realizing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model.
Background
The current mainstream autonomous driving solutions use vision and lidar as the main sensors. Limited by the sensors themselves, the feasibility of these technologies faces great challenges under occlusion and in severe weather such as rain and fog. Because of the sparsity of its observation point clouds, the traditional 3D millimeter wave radar only plays an auxiliary role in target detection and tracking, serving as an additional attribute in multi-source sensor fusion.
Deep-learning-based automatic driving solutions adapt to a large number of scenes in a data-driven manner and have strong generalization capability. In the field of automatic driving, deep learning target detection and tracking algorithms are mostly based on cameras and lidars. Three-dimensional single-target detection and tracking based on lidar is a challenging problem in robotics and automatic driving. Existing lidar detection and tracking methods often suffer from sparse or partially occluded long-distance objects, which makes the features extracted by the model ambiguous. The blurred features make the target object difficult to locate, ultimately resulting in poor tracking results.
4D millimeter wave imaging radar greatly improves angular resolution and pitch angle measurement accuracy, achieves centimeter-level height positioning, offers high resolution, and provides abundant information such as amplitude, phase, energy distribution, intensity and velocity, opening a new path for target detection and tracking. At present, most solutions based on 4D millimeter wave radar perform target detection by clustering and multi-target tracking by filtering and matching, and require a large number of manually added rules for complex scenes. Current deep learning target detection schemes based on 4D millimeter wave radar resemble those for lidar: three-dimensional object detection networks designed for lidar, such as PointPillars and voxel-based detectors, are used as base models, with power measurements over the Doppler, range, azimuth and elevation dimensions or the provided three-dimensional spatial features as input, to achieve accurate three-dimensional perception.
The improved point cloud resolution and accurate velocity information of 4D millimeter wave imaging radar provide new solutions for target detection and tracking. This patent uses data-driven deep learning and exploits the strong correlation of millimeter wave radar velocity between front and rear frames, performing target detection, speed measurement and multi-target tracking of 4D millimeter wave radar point clouds on front and rear frame millimeter wave radar data by means of single-stage deep learning model prediction.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a method, a device, a processor and a computer readable storage medium thereof for realizing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model.
In order to achieve the above object, the method, the device, the processor and the computer readable storage medium thereof for realizing the target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model of the invention are as follows:
the method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model is mainly characterized by comprising the following steps of:
(1) Performing point cloud clipping on the currently acquired front and rear frames of millimeter wave radar point clouds, and taking the result as the input features for target detection, tracking and speed measurement;
(2) Extracting features from the selected input features by using a feature pyramid structure;
(3) Based on the currently extracted features, extracting target key points by using the detection head of the anchor-free CenterNet, and obtaining the actual position information of the target points;
(4) Performing cross-attention fusion matching on the obtained target key points by using a Transformer attention mechanism, so as to obtain the degree of correlation between the target's front and rear frames;
(5) Obtaining the classification, regression, tracking confidence and tracking position information of target detection through the multi-task detection head, thereby realizing detection, tracking and speed measurement of the target points.
Preferably, the step (1) specifically includes:
selecting front and rear frames of millimeter wave radar point clouds with a time interval of 100 ms and performing point cloud clipping; voxelizing the clipped point clouds into voxel grids with a height of 1; selecting, for each voxel grid, the center offset of the point cloud, the point cloud density within the voxel, the radial velocity of the point cloud and the compensated point cloud velocity as input features; and projecting the input features into a 2D BEV grid for subsequent processing.
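A minimal NumPy sketch of this clipping and voxelization step might look as follows; the detection range, voxel size, input field order and the use of per-voxel means are illustrative assumptions rather than values fixed by the patent:

```python
import numpy as np

def radar_points_to_bev(points, x_range=(0.0, 80.0), y_range=(-40.0, 40.0), voxel=(0.5, 0.5)):
    """Crop a radar point cloud and project it into a 2D BEV feature grid.

    points: (N, 5) array of [x, y, z, radial_velocity, compensated_velocity].
    Returns a (5, H, W) BEV pseudo-image whose channels are the per-cell mean
    center offset (dx, dy), point density, and the two velocities.
    """
    # 1) point cloud clipping to the region of interest
    mask = ((points[:, 0] >= x_range[0]) & (points[:, 0] < x_range[1]) &
            (points[:, 1] >= y_range[0]) & (points[:, 1] < y_range[1]))
    pts = points[mask]

    W = int((x_range[1] - x_range[0]) / voxel[0])
    H = int((y_range[1] - y_range[0]) / voxel[1])
    bev = np.zeros((5, H, W), dtype=np.float32)    # dx, dy, density, v_r, v_comp
    count = np.zeros((H, W), dtype=np.float32)

    gx = ((pts[:, 0] - x_range[0]) / voxel[0]).astype(int)
    gy = ((pts[:, 1] - y_range[0]) / voxel[1]).astype(int)
    cx = x_range[0] + (gx + 0.5) * voxel[0]         # voxel centre coordinates
    cy = y_range[0] + (gy + 0.5) * voxel[1]

    for i in range(pts.shape[0]):                   # accumulate per-voxel sums
        bev[0, gy[i], gx[i]] += pts[i, 0] - cx[i]   # offset from voxel centre (x)
        bev[1, gy[i], gx[i]] += pts[i, 1] - cy[i]   # offset from voxel centre (y)
        bev[3, gy[i], gx[i]] += pts[i, 3]           # radial velocity
        bev[4, gy[i], gx[i]] += pts[i, 4]           # compensated velocity
        count[gy[i], gx[i]] += 1.0

    nonzero = count > 0
    bev[:, nonzero] /= count[nonzero]               # means of offsets / velocities
    bev[2] = count / max(count.max(), 1.0)          # normalized point density
    return bev
```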
Preferably, the step (2) specifically includes:
the feature pyramid structure includes a downsampling layer and an upsampling layer, wherein
the downsampling layer performs feature extraction through a convolution layer and a ResNet-18, and the features extracted by the downsampling layer are downsampled 16 times relative to its input; the features downsampled 2, 4, 8 and 16 times are taken, in sequence, as the input features of the upsampling layer;
the upsampling layer first upsamples the feature layer downsampled 16 times by the downsampling layer to obtain a first upsampling result; the first upsampling result and the 8-times-downsampled features of the downsampling layer undergo channel concatenation, convolution and upsampling to obtain a second upsampling result; the second upsampling result and the 4-times-downsampled features of the downsampling layer undergo channel concatenation, convolution and upsampling to obtain a third upsampling result;
the second upsampling result is directly upsampled to obtain the first-layer output of the feature pyramid structure; the third upsampling result is taken as the second-layer output of the feature pyramid structure; the third upsampling result and the 2-times-downsampled features of the downsampling layer undergo channel concatenation and convolution to obtain the third-layer output of the feature pyramid structure;
and the first-layer output, the second-layer output and the third-layer output are channel-concatenated, and the concatenation result is taken as the final feature extraction result of the feature pyramid structure.
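A compact PyTorch sketch of a pyramid with this shape is given below; it substitutes plain convolution blocks for the ResNet-18 trunk and uses illustrative channel widths, so only the 2×/4×/8×/16× levels and the three concatenated outputs follow the description:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def conv_block(cin, cout, stride=1):
    return nn.Sequential(
        nn.Conv2d(cin, cout, 3, stride=stride, padding=1, bias=False),
        nn.BatchNorm2d(cout),
        nn.ReLU(inplace=True))

class RadarFPN(nn.Module):
    """Down-sampling trunk (2x/4x/8x/16x) followed by an up-sampling path whose
    three outputs are channel-concatenated, mirroring the pyramid described above."""
    def __init__(self, cin=5, c=32):
        super().__init__()
        self.d2  = conv_block(cin,   c,     stride=2)   # 1/2  resolution
        self.d4  = conv_block(c,     2 * c, stride=2)   # 1/4
        self.d8  = conv_block(2 * c, 4 * c, stride=2)   # 1/8
        self.d16 = conv_block(4 * c, 8 * c, stride=2)   # 1/16
        self.u1 = conv_block(8 * c + 4 * c, 4 * c)      # fuse up(1/16) with 1/8
        self.u2 = conv_block(4 * c + 2 * c, 2 * c)      # fuse with 1/4
        self.u3 = conv_block(2 * c + c,     c)          # fuse with 1/2

    def forward(self, x):
        f2  = self.d2(x)
        f4  = self.d4(f2)
        f8  = self.d8(f4)
        f16 = self.d16(f8)

        def up(t):  # 2x bilinear up-sampling
            return F.interpolate(t, scale_factor=2, mode="bilinear", align_corners=False)

        r1 = up(f16)                                    # first up-sampling result (1/8)
        r2 = up(self.u1(torch.cat([r1, f8], dim=1)))    # second up-sampling result (1/4)
        r3 = up(self.u2(torch.cat([r2, f4], dim=1)))    # third up-sampling result (1/2)

        out1 = up(r2)                                   # first pyramid output  (1/2)
        out2 = r3                                       # second pyramid output (1/2)
        out3 = self.u3(torch.cat([r3, f2], dim=1))      # third pyramid output  (1/2)
        return torch.cat([out1, out2, out3], dim=1)     # fused BEV feature map
```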
Preferably, the step (3) specifically includes the following steps:
(3.1) applying two 1×1 convolutions to the extracted features to obtain the key-point confidence of the target frame;
(3.2) extracting the 512 key points with the highest confidence as the tracked target key points;
(3.3) extracting the feature map feature position_feature corresponding to the BEV grid position of each tracked target key point, obtaining the bird's-eye-view-to-key-point projection feature;
(3.4) obtaining the actual position information of the target point (position_x, position_y) from the grid location (grid_x, grid_y), the actual metric size of each grid cell (voxel_x, voxel_y) and the minimum point cloud range (x_min, y_min); the specific calculation formula is as follows:

position_x = grid_x × voxel_x + x_min

position_y = grid_y × voxel_y + y_min
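The key-point extraction and position decoding of step (3) could be sketched as follows in PyTorch; the sigmoid activation, channel widths, voxel size and point cloud range are assumptions, while the top-512 selection and the grid-to-position formula follow the description above:

```python
import torch
import torch.nn as nn

class KeypointHead(nn.Module):
    """Anchor-free (CenterNet-style) key-point head: two 1x1 convolutions produce a
    key-point confidence heat map; the top-K cells give the tracked key points,
    whose BEV grid indices are decoded with position = grid * voxel_size + pc_min."""
    def __init__(self, cin, k=512):
        super().__init__()
        self.k = k
        self.heatmap = nn.Sequential(
            nn.Conv2d(cin, cin, 1), nn.ReLU(inplace=True), nn.Conv2d(cin, 1, 1))

    def forward(self, feat, voxel_size=(0.5, 0.5), pc_min=(0.0, -40.0)):
        # feat: (B, C, H, W) BEV feature map from the feature pyramid
        B, C, H, W = feat.shape
        score = torch.sigmoid(self.heatmap(feat)).view(B, H * W)
        conf, idx = score.topk(self.k, dim=1)                  # top-512 key points

        grid_y = torch.div(idx, W, rounding_mode="floor").float()
        grid_x = (idx % W).float()
        pos_x = grid_x * voxel_size[0] + pc_min[0]             # decode to metres
        pos_y = grid_y * voxel_size[1] + pc_min[1]
        position = torch.stack([pos_x, pos_y], dim=-1)         # (B, K, 2)

        # gather the feature vector of each key-point cell ("position_feature")
        flat = feat.view(B, C, H * W)
        position_feature = flat.gather(2, idx.unsqueeze(1).expand(B, C, self.k))
        return conf, position, position_feature.permute(0, 2, 1)   # (B, K, C)
```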
Preferably, the step (4) specifically includes the following steps:
(4.1) inputting the actual position information of the target point (position_x, position_y) and the feature map feature position_feature into the Transformer attention mechanism;
(4.2) extracting the point sequence features of the front and rear frames by using a self-attention mechanism, in which K (Key) = Q (Query) = V (Value), calculated as follows:
first, the dot product between Q and K is computed and, to prevent the result from becoming too large, divided by sqrt(d_k), where d_k is the dimension of K and K^T is the transpose of the matrix K; the result is then normalized into a probability distribution with the softmax normalized exponential function and multiplied by the matrix V to obtain the final weighted sum:

Attention(Q, K, V) = softmax(Q·K^T / sqrt(d_k)) · V
(4.3) using the obtained weighted-sum result of the current frame and the weighted-sum result of the previous frame as the input of a cross-attention mechanism, and using the key point positions as the position embedding of the Transformer attention mechanism, thereby obtaining the degree of correlation between the target's front and rear frames.
Preferably, the step (4.3) specifically includes:
taking the current-frame weighted-sum result as K (Key) and V (Value) and the previous-frame weighted-sum result as Q (Query), and computing with the formula of step (4.2) to obtain the degree of correlation between the front and rear frame targets.
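A PyTorch sketch of the self-attention and cross-attention matching of step (4) follows; the use of nn.MultiheadAttention and an MLP position embedding are assumptions, while the K = Q = V self-attention per frame and the current-frame-as-K/V, previous-frame-as-Q cross-attention follow the description:

```python
import torch
import torch.nn as nn

class FrameMatcher(nn.Module):
    """Self-attention over each frame's key-point sequence (K = Q = V), then
    cross-attention in which the current frame supplies K/V and the previous
    frame supplies Q. Key-point positions are added as a learned position
    embedding (a small MLP here, which is an assumption)."""
    def __init__(self, dim, heads=4):
        super().__init__()
        self.pos_embed = nn.Sequential(nn.Linear(2, dim), nn.ReLU(inplace=True),
                                       nn.Linear(dim, dim))
        self.self_attn  = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def encode(self, feat, pos):
        x = feat + self.pos_embed(pos)            # K = Q = V for self-attention
        out, _ = self.self_attn(x, x, x)
        return out

    def forward(self, feat_cur, pos_cur, feat_prev, pos_prev):
        cur  = self.encode(feat_cur,  pos_cur)    # current-frame weighted sums
        prev = self.encode(feat_prev, pos_prev)   # previous-frame weighted sums
        # cross attention: previous frame as Query, current frame as Key/Value
        fused, attn = self.cross_attn(prev, cur, cur)
        return fused, attn                        # attn ~ frame-to-frame correlation
```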
Preferably, the step (5) specifically includes the following steps:
(5.1) channel-concatenating the output features of the cross-attention mechanism with the position information of the current-frame key points to obtain the concatenated features;
(5.2) extracting the fused features from the concatenated features by using 3 convolution layers and taking the first two dimensions of the fused features as the tracking center points; and taking the result of applying two convolution layers to the concatenated features as the tracking confidence;
(5.3) normalizing the non-center dimensions of the fused features and the current-frame key point features with the normalized exponential function, then channel-concatenating the normalized features to obtain concatenated features, from which the class confidence, regression box and velocity of target detection are obtained through separate two-layer convolutions.
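A sketch of the multi-task detection head of step (5); the 1-D convolutions over the key-point sequence and the class/box/velocity widths are assumptions:

```python
import torch
import torch.nn as nn

class MultiTaskHead(nn.Module):
    """Multi-task head sketch: the cross-attention output is concatenated with the
    current-frame key-point positions; a 3-conv "voting" stack yields the fused
    feature whose first two channels are read as the tracking centre; a 2-conv
    branch gives the tracking confidence; the remaining fused channels and the
    current-frame key-point features are softmax-normalised, concatenated and fed
    to the class / box / velocity branches."""
    def __init__(self, dim, num_classes=3, box_dim=5):
        super().__init__()
        def convs(cin, cout, n):
            layers = []
            for i in range(n):
                layers += [nn.Conv1d(cin if i == 0 else cout, cout, 1),
                           nn.ReLU(inplace=True)]
            return nn.Sequential(*layers[:-1])        # drop the trailing ReLU
        self.fuse       = convs(dim + 2, dim, 3)      # voting layer: 3 convolutions
        self.track_conf = convs(dim + 2, 1, 2)        # tracking confidence branch
        fused_rest = dim - 2                          # fused channels minus the centre
        self.cls_head = convs(fused_rest + dim, num_classes, 2)
        self.box_head = convs(fused_rest + dim, box_dim, 2)
        self.vel_head = convs(fused_rest + dim, 2, 2) # velocity (vx, vy)

    def forward(self, cross_out, pos_cur, feat_cur):
        # cross_out, feat_cur: (B, K, C); pos_cur: (B, K, 2) -> (B, channels, K)
        x = torch.cat([cross_out, pos_cur], dim=-1).permute(0, 2, 1)
        fused = self.fuse(x)
        track_center = fused[:, :2, :]                # first two dims: tracking centre
        track_score  = torch.sigmoid(self.track_conf(x))
        rest = torch.softmax(fused[:, 2:, :], dim=1)  # normalise non-centre channels
        kp   = torch.softmax(feat_cur.permute(0, 2, 1), dim=1)
        y = torch.cat([rest, kp], dim=1)
        return {"track_center": track_center, "track_score": track_score,
                "cls": self.cls_head(y), "box": self.box_head(y),
                "vel": self.vel_head(y)}
```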
Preferably, the cross-attention mechanism module further comprises:
predicting, from the fused features, the center-point position of each current-frame key point in the previous frame, and flexibly adjusting the center offset threshold with respect to the previous frame as the front/rear frame matching criterion, which serves as the final tracking matching rule; specifically:

track_id_i = track_id_{i-1}, if |x_pred − x_i| < x_thresh, |y_pred − y_i| < y_thresh and track_score_i > score_thresh; otherwise a new track id is assigned,

where track_id_i is the tracking id of the current frame, track_id_{i-1} is the tracking id of the previous frame, (x_pred, y_pred) is the center point predicted for the previous frame, (x_i, y_i) is the center point of the target tracked in the current frame, x_thresh and y_thresh are the allowed center offset thresholds, track_score_i is the tracking confidence, and score_thresh is the tracking threshold.
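The matching rule above could be implemented along these lines; the threshold values are illustrative, not the patent's:

```python
def match_tracks(cur_targets, prev_tracks, x_thresh=1.0, y_thresh=1.0,
                 score_thresh=0.3, next_id=0):
    """Assign a track id to every current-frame target.

    cur_targets: list of dicts with 'pred_prev_center' (where the network predicts
    this target was in the previous frame) and 'track_score'.
    prev_tracks: list of dicts with 'center' and 'id' from the previous frame.
    A target inherits a previous id when the predicted centre lies within the
    allowed offsets and the confidence exceeds the threshold; otherwise a new
    track is opened.
    """
    ids = []
    for tgt in cur_targets:
        px, py = tgt["pred_prev_center"]
        match = None
        if tgt["track_score"] > score_thresh:
            for trk in prev_tracks:
                tx, ty = trk["center"]
                if abs(px - tx) < x_thresh and abs(py - ty) < y_thresh:
                    match = trk["id"]            # matched: reuse previous-frame id
                    break
        if match is None:
            match, next_id = next_id, next_id + 1   # unmatched: open a new track
        ids.append(match)
    return ids, next_id
```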
The device for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model is mainly characterized by comprising the following components:
a processor configured to execute computer-executable instructions;
and the memory stores one or more computer executable instructions which, when executed by the processor, implement the steps of the method for implementing 4D millimeter wave radar target detection, tracking and speed measurement based on the deep learning network model.
The processor for realizing the 4D millimeter wave radar target detection, tracking and speed measurement based on the deep learning network model is mainly characterized in that the processor is configured to execute computer executable instructions, and when the computer executable instructions are executed by the processor, the steps of the method for realizing the 4D millimeter wave radar target detection, tracking and speed measurement based on the deep learning network model are realized.
The computer readable storage medium is mainly characterized in that the computer program is stored thereon, and the computer program can be executed by a processor to realize the steps of the method for realizing 4D millimeter wave radar target detection, tracking and speed measurement based on the deep learning network model.
Compared with the traditional clustering-based tracking and speed measurement methods, the method, device, processor and computer readable storage medium for realizing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model adopt a data-driven approach, do not require manually designing a large number of rules and complex post-processing operations for complex and changeable road scenes, and use only the millimeter wave radar, which gives an advantage in cost control. Compared with existing solutions that use multiple networks for the multiple tasks of target detection, tracking and speed measurement, this technical scheme uses a single-stage deep learning design in which the detection and tracking modules share model features, reducing the computation of secondary feature extraction. In addition, the technical scheme performs target detection on the fused features of the front and rear frames, which can reduce missed detections to a certain extent. Meanwhile, the scheme uses the speed measurement capability of the millimeter wave radar to obtain the velocity features of the current frame while detecting and tracking targets, enabling a secondary tracking match, and thus has outstanding technical advantages in use.
Drawings
Fig. 1 is a schematic diagram of a process flow of the method for realizing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model.
Fig. 2 is a schematic diagram of a point cloud structure of 4D millimeter wave radar data of a previous frame in an embodiment of the present invention.
Fig. 3 is a schematic view of a point cloud structure of the next frame of 4D millimeter wave radar data according to an embodiment of the present invention.
FIG. 4 is a diagram showing the result of tracking matching according to the present invention.
Detailed Description
In order to more clearly describe the technical contents of the present invention, a further description will be made below in connection with specific embodiments.
Before describing in detail embodiments that are in accordance with the present invention, it should be observed that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Referring to fig. 1, the method for implementing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model includes the following steps:
(1) Performing point cloud clipping on the currently acquired front and rear frames of millimeter wave radar point clouds, and taking the result as the input features for target detection, tracking and speed measurement;
(2) Extracting features from the selected input features by using a feature pyramid structure;
(3) Based on the currently extracted features, extracting target key points by using the detection head of the anchor-free CenterNet, and obtaining the actual position information of the target points;
(4) Performing cross-attention fusion matching on the obtained target key points by using a Transformer attention mechanism, so as to obtain the degree of correlation between the target's front and rear frames;
(5) Obtaining the classification, regression, tracking confidence and tracking position information of target detection through the multi-task detection head, thereby realizing detection, tracking and speed measurement of the target points.
As a preferred embodiment of the present invention, the step (1) specifically includes:
selecting front and rear frames of millimeter wave radar point clouds with a time interval of 100 ms and performing point cloud clipping; voxelizing the clipped point clouds into voxel grids with a height of 1; selecting, for each voxel grid, the center offset of the point cloud, the point cloud density within the voxel, the radial velocity of the point cloud and the compensated point cloud velocity as input features; and projecting the input features into a 2D BEV grid for subsequent processing.
As a preferred embodiment of the present invention, the step (2) specifically includes:
the feature pyramid structure includes a downsampling layer and an upsampling layer, wherein
the downsampling layer performs feature extraction through a convolution layer and a ResNet-18, and the features extracted by the downsampling layer are downsampled 16 times relative to its input; the features downsampled 2, 4, 8 and 16 times are taken, in sequence, as the input features of the upsampling layer;
the upsampling layer first upsamples the feature layer downsampled 16 times by the downsampling layer to obtain a first upsampling result; the first upsampling result and the 8-times-downsampled features of the downsampling layer undergo channel concatenation, convolution and upsampling to obtain a second upsampling result; the second upsampling result and the 4-times-downsampled features of the downsampling layer undergo channel concatenation, convolution and upsampling to obtain a third upsampling result;
the second upsampling result is directly upsampled to obtain the first-layer output of the feature pyramid structure; the third upsampling result is taken as the second-layer output of the feature pyramid structure; the third upsampling result and the 2-times-downsampled features of the downsampling layer undergo channel concatenation and convolution to obtain the third-layer output of the feature pyramid structure;
and the first-layer output, the second-layer output and the third-layer output are channel-concatenated, and the concatenation result is taken as the final feature extraction result of the feature pyramid structure.
As a preferred embodiment of the present invention, the step (3) specifically includes the following steps:
(3.1) applying two 1×1 convolutions to the extracted features to obtain the key-point confidence of the target frame;
(3.2) extracting the 512 key points with the highest confidence as the tracked target key points;
(3.3) extracting the feature map feature position_feature corresponding to the BEV grid position of each tracked target key point, obtaining the bird's-eye-view-to-key-point projection feature;
(3.4) obtaining the actual position information of the target point (position_x, position_y) from the grid location (grid_x, grid_y), the actual metric size of each grid cell (voxel_x, voxel_y) and the minimum point cloud range (x_min, y_min); the specific calculation formula is as follows:

position_x = grid_x × voxel_x + x_min

position_y = grid_y × voxel_y + y_min
As a preferred embodiment of the present invention, the step (4) specifically includes the following steps:
(4.1) inputting the actual position information of the target point (position_x, position_y) and the feature map feature position_feature into the Transformer attention mechanism;
(4.2) extracting the point sequence features of the front and rear frames by using a self-attention mechanism, in which K (Key) = Q (Query) = V (Value), calculated as follows:
first, the dot product between Q and K is computed and, to prevent the result from becoming too large, divided by sqrt(d_k), where d_k is the dimension of K and K^T is the transpose of the matrix K; the result is then normalized into a probability distribution with the softmax normalized exponential function and multiplied by the matrix V to obtain the final weighted sum:

Attention(Q, K, V) = softmax(Q·K^T / sqrt(d_k)) · V
(4.3) using the obtained weighted-sum result of the current frame and the weighted-sum result of the previous frame as the input of a cross-attention mechanism, and using the key point positions as the position embedding of the Transformer attention mechanism, thereby obtaining the degree of correlation between the target's front and rear frames.
As a preferred embodiment of the present invention, the step (4.3) specifically includes:
taking the current-frame weighted-sum result as K (Key) and V (Value) and the previous-frame weighted-sum result as Q (Query), and computing with the formula of step (4.2) to obtain the degree of correlation between the front and rear frame targets.
The key point features of the current frame, after passing through the Transformer self-attention mechanism, are output as the K and V of the cross-attention mechanism; the output of the previous-frame key point features after the Transformer self-attention mechanism is taken as the Q of the cross-attention module; based on the acquired K, Q and V, the cross-attention module computes the degree of correlation between the front and rear frame targets using the attention calculation formula.
As a preferred embodiment of the present invention, the step (5) specifically includes the steps of:
(5.1) channel-concatenating the output features of the cross-attention mechanism with the position information of the current-frame key points to obtain the concatenated features;
(5.2) extracting the fused features from the concatenated features by using 3 convolution layers and taking the first two dimensions of the fused features as the tracking center points; and taking the result of applying two convolution layers to the concatenated features as the tracking confidence;
(5.3) normalizing the non-center dimensions of the fused features and the current-frame key point features with the normalized exponential function, then channel-concatenating the normalized features to obtain concatenated features, from which the class confidence, regression box and velocity of target detection are obtained through separate two-layer convolutions.
As a preferred embodiment of the present invention, the cross-attention mechanism module further comprises:
predicting, from the fused features, the center-point position of each current-frame key point in the previous frame, and flexibly adjusting the center offset threshold with respect to the previous frame as the front/rear frame matching criterion, which serves as the final tracking matching rule; specifically:

track_id_i = track_id_{i-1}, if |x_pred − x_i| < x_thresh, |y_pred − y_i| < y_thresh and track_score_i > score_thresh; otherwise a new track id is assigned,

where track_id_i is the tracking id of the current frame, track_id_{i-1} is the tracking id of the previous frame, (x_pred, y_pred) is the center point predicted for the previous frame, (x_i, y_i) is the center point of the target tracked in the current frame, x_thresh and y_thresh are the allowed center offset thresholds, track_score_i is the tracking confidence, and score_thresh is the tracking threshold.
In practical applications, referring to figs. 2 and 3, which show front and rear frames of millimeter wave radar point cloud data with a time interval of 100 ms, the point cloud data is rendered by absolute velocity, and the millimeter wave radar point cloud velocities of the front and rear frames are strongly correlated. In this technical scheme, velocity is used as the main feature for tracking and target detection, and the network is designed in a single-stage deep learning manner that fuses the front and rear frame data to realize the tasks of target detection, tracking and speed measurement. In the figures, the RGB rendering encodes velocity.
According to the technical scheme, during testing only the current-frame millimeter wave radar data needs to undergo feature extraction and BEV-to-point projection; the previous-frame point sequence features and the target detection and tracking results of the previous moment are directly reused. With reference to fig. 1, the overall steps are as follows:
1) Two frames of millimeter wave radar point clouds with a time interval of 100 ms are selected and point cloud clipping is performed; the clipped point clouds are voxelized into voxel grids with a height of 1; the center offset of the point cloud of each voxel grid, the point cloud density within the voxel, the radial velocity of the point cloud and the compensated point cloud velocity are selected as features and projected into the 2D BEV grid. In this technical scheme the point cloud radial velocity and the compensated point cloud velocity are used as input features and serve as the main feature input for 4D millimeter wave tracking and speed measurement.
2) Feature extraction is performed with a feature pyramid structure; the network structure is shown in the feature extraction part of fig. 1. The first part is the downsampling layer, which uses one convolution layer and a ResNet-18 for feature extraction, downsampling 16 times in total. The second part is the upsampling layer: the feature layer downsampled 16 times is upsampled to obtain the first upsampling feature; this upsampling result is channel-concatenated and convolved with the 8-times-downsampled result and then upsampled to obtain the second upsampling result; the second upsampling result is channel-concatenated and convolved with the 4-times-downsampled features and then upsampled to obtain the third upsampling result; the second upsampling result is directly upsampled to obtain the first-layer output of the FPN (feature pyramid structure); the third upsampling result is taken as the second-layer output of the FPN; the third upsampling result is channel-concatenated and convolved with the 2-times-downsampled features to obtain the third-layer output of the FPN; and the three layer outputs are channel-concatenated to obtain the final FPN result. Unlike lidar single-target tracking based on a search region and matching, this technical scheme directly uses the extracted point cloud features as the features matched by the tracking module; the same FPN feature layer result is used for both detection and tracking, so the point cloud is not re-sampled and features are not extracted a second time.
3) Key point extraction uses the detection head of the anchor-free CenterNet, replacing the fully-connected layer with 1×1 convolutions and using two convolutions to obtain the key-point confidence of the target frame. The network structure is shown in fig. 1; in this technical scheme the corresponding point sequence features and point sequence positions are decoded from the BEV grid positions. Specifically, the feature map feature position_feature corresponding to the BEV grid position of each target key point is extracted, giving the bird's-eye-view-to-key-point projection feature. The actual position information of the target point (position_x, position_y) is obtained from the grid location (grid_x, grid_y), the actual metric size of each grid cell (voxel_x, voxel_y) and the minimum point cloud range (x_min, y_min):

position_x = grid_x × voxel_x + x_min

position_y = grid_y × voxel_y + y_min
4) The target positions position_x, position_y and the feature position_feature from step 3) are used as input; as shown in fig. 1, this technical scheme uses a Transformer attention mechanism for feature matching. The attention mechanism is computed as follows: first, the dot product between Q and K is calculated and, to prevent the result from becoming too large, divided by sqrt(d_k), where d_k is the dimension of K; the result is then normalized into a probability distribution with the softmax normalized exponential function and multiplied by the matrix V to obtain the final weighted sum:

Attention(Q, K, V) = softmax(Q·K^T / sqrt(d_k)) · V
The front and rear point sequence features are extracted using a Transformer self-attention module, which uses the key point features as the K, Q and V of the Transformer multi-head attention module and the key point positions as the position embedding. The output of the current frame after the self-attention mechanism is used as the K and V of the cross-attention module, and the output of the previous frame after the self-attention mechanism is used as the Q of the cross-attention module; the cross-attention module computes the degree of correlation between the front and rear frame targets, and the output features of the cross-attention mechanism are channel-concatenated with the position information of the current-frame key points to obtain the concatenated features. The concatenated features are sent to a voting layer, where 3 convolution layers extract the fused features, from which the tracking center points are obtained, and the concatenated features pass through two convolution layers to obtain the tracking confidence. Finally, the non-position part of the fused features and the current-frame key point features are normalized with the normalized exponential function and then channel-concatenated; the class confidence of target detection is obtained through a two-layer convolution, and the regression box and velocity are obtained through two-layer convolutions. This can reduce missed detections to a certain extent.
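Tying the illustrative sketches above together, a single two-frame inference step might be wired up as follows (the module names refer to the sketches in this document, not to the patent's actual implementation):

```python
def infer_two_frames(bev_prev, bev_cur, fpn, kp_head, matcher, mt_head):
    """End-to-end sketch of one detection / tracking / speed-measurement step.
    At test time the previous-frame features would normally be cached from the
    last step instead of being recomputed here."""
    feat_prev = fpn(bev_prev)                          # shared FPN features
    feat_cur  = fpn(bev_cur)
    _, pos_prev, pf_prev = kp_head(feat_prev)          # previous-frame key points
    conf_cur, pos_cur, pf_cur = kp_head(feat_cur)      # current-frame key points
    fused, correlation = matcher(pf_cur, pos_cur, pf_prev, pos_prev)
    outputs = mt_head(fused, pos_cur, pf_cur)          # cls / box / vel / tracking
    return conf_cur, pos_cur, correlation, outputs
```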
5) The post-processing part of the tracking module in this technical scheme: the center-point position of each current-frame key point in the previous frame is predicted from the fused features and used as the final tracking matching rule; the center offset threshold with respect to the previous frame can be flexibly adjusted and serves as the front/rear frame matching criterion. The matching rule is as follows, where track_id_i is the tracking id of the current frame, track_id_{i-1} is the tracking id of the previous frame, (x_pred, y_pred) is the center point predicted for the previous frame, (x_i, y_i) is the center point of the target tracked in the current frame, x_thresh and y_thresh are the allowed center offset thresholds, track_score_i is the tracking confidence, and score_thresh is the tracking threshold:

track_id_i = track_id_{i-1}, if |x_pred − x_i| < x_thresh, |y_pred − y_i| < y_thresh and track_score_i > score_thresh; otherwise a new track id is assigned.
The 4D millimeter wave radar outputs the ego-vehicle speed and the target speed at the same time, so the center position of a current-frame target in the previous frame can be further predicted from the speed output by current-frame target detection and the target's heading angle, and used as a secondary match to improve the tracking matching effect.
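A sketch of this velocity-based secondary matching under a constant-velocity assumption; the field names are illustrative, while the 100 ms frame interval follows the description:

```python
import math

def secondary_match_by_velocity(cur_box, dt=0.1):
    """Predict where a current-frame detection was one frame earlier from its
    estimated speed and heading (constant-velocity assumption), for use as a
    secondary matching key with the same centre-offset rule as above.

    cur_box: dict with 'center' (x, y), 'speed' (m/s) and 'yaw' (rad).
    """
    x, y = cur_box["center"]
    v, yaw = cur_box["speed"], cur_box["yaw"]
    prev_x = x - v * math.cos(yaw) * dt     # step back along the heading direction
    prev_y = y - v * math.sin(yaw) * dt
    return prev_x, prev_y
```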
As shown in fig. 4, the left side is the prediction result of the first frame, and the right side is the predicted tracking result, including the predicted boxes and tracking IDs.
The device for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model comprises:
a processor configured to execute computer-executable instructions;
and the memory stores one or more computer executable instructions which, when executed by the processor, implement the steps of the method for implementing 4D millimeter wave radar target detection, tracking and speed measurement based on the deep learning network model.
The processor for realizing the 4D millimeter wave radar target detection, tracking and speed measurement based on the deep learning network model is configured to execute computer executable instructions, and when the computer executable instructions are executed by the processor, the steps of the method for realizing the 4D millimeter wave radar target detection, tracking and speed measurement based on the deep learning network model are realized.
The computer readable storage medium has stored thereon a computer program executable by a processor to perform the steps of the method for achieving 4D millimeter wave radar target detection, tracking and speed measurement based on a deep learning network model described above.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and further implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.
It is to be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution device.
Those of ordinary skill in the art will appreciate that all or a portion of the steps carried out in the method of the above-described embodiments may be implemented by a program to instruct related hardware, and the program may be stored in a computer readable storage medium, where the program when executed includes one or a combination of the steps of the method embodiments.
The above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, or the like.
In the description of the present specification, reference to the terms "one embodiment," "some embodiments," "examples," "specific examples," or "embodiments," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiments or examples. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
While embodiments of the present invention have been shown and described above, it will be understood that the above embodiments are illustrative and not to be construed as limiting the invention, and that variations, modifications, alternatives and variations may be made to the above embodiments by one of ordinary skill in the art within the scope of the invention.
Compared with the traditional clustering-based tracking and speed measurement methods, the method, device, processor and computer readable storage medium for realizing target detection, tracking and speed measurement of a 4D millimeter wave radar based on a deep learning network model adopt a data-driven approach, do not require manually designing a large number of rules and complex post-processing operations for complex and changeable road scenes, and use only the millimeter wave radar, which gives an advantage in cost control. Compared with existing solutions that use multiple networks for the multiple tasks of target detection, tracking and speed measurement, this technical scheme uses a single-stage deep learning design in which the detection and tracking modules share model features, reducing the computation of secondary feature extraction. In addition, the technical scheme performs target detection on the fused features of the front and rear frames, which can reduce missed detections to a certain extent. Meanwhile, the scheme uses the speed measurement capability of the millimeter wave radar to obtain the velocity features of the current frame while detecting and tracking targets, enabling a secondary tracking match, and thus has outstanding technical advantages in use.
In this specification, the invention has been described with reference to specific embodiments thereof. It will be apparent, however, that various modifications and changes may be made without departing from the spirit and scope of the invention. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.

Claims (11)

1. The method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model is characterized by comprising the following steps of:
(1) Performing point cloud clipping on the currently acquired front and rear frames of millimeter wave radar point clouds, and taking the result as the input features for target detection, tracking and speed measurement;
(2) Extracting features from the selected input features by using a feature pyramid structure;
(3) Based on the currently extracted features, extracting target key points by using the detection head of the anchor-free CenterNet, and obtaining the actual position information of the target points;
(4) Performing cross-attention fusion matching on the obtained target key points by using a Transformer attention mechanism, so as to obtain the degree of correlation between the target's front and rear frames;
(5) Obtaining the classification, regression, tracking confidence and tracking position information of target detection through the multi-task detection head, thereby realizing detection, tracking and speed measurement of the target points.
2. The method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model according to claim 1, wherein the step (1) is specifically as follows:
selecting front and rear frames of millimeter wave radar point clouds with a time interval of 100 ms and performing point cloud clipping; voxelizing the clipped point clouds into voxel grids with a height of 1; selecting, for each voxel grid, the center offset of the point cloud, the point cloud density within the voxel, the radial velocity of the point cloud and the compensated point cloud velocity as input features; and projecting the input features into a 2D BEV grid for subsequent processing.
3. The method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model according to claim 2, wherein the step (2) is specifically:
the feature pyramid structure includes a downsampling layer and an upsampling layer, wherein
the downsampling layer performs feature extraction through a convolution layer and a ResNet-18, and the features extracted by the downsampling layer are downsampled 16 times relative to its input; the features downsampled 2, 4, 8 and 16 times are taken as the input features of the upsampling layer;
the upsampling layer first upsamples the feature layer downsampled 16 times by the downsampling layer to obtain a first upsampling result; the first upsampling result and the 8-times-downsampled features of the downsampling layer undergo channel concatenation, convolution and upsampling to obtain a second upsampling result; the second upsampling result and the 4-times-downsampled features of the downsampling layer undergo channel concatenation, convolution and upsampling to obtain a third upsampling result;
the second upsampling result is directly upsampled to obtain the first-layer output of the feature pyramid structure; the third upsampling result is taken as the second-layer output of the feature pyramid structure; the third upsampling result and the 2-times-downsampled features of the downsampling layer undergo channel concatenation and convolution to obtain the third-layer output of the feature pyramid structure;
and the first-layer output, the second-layer output and the third-layer output are channel-concatenated, and the concatenation result is taken as the final feature extraction result of the feature pyramid structure.
4. The method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model according to claim 3, wherein the step (3) specifically comprises the following steps:
(3.1) applying two 1×1 convolutions to the extracted features to obtain the key-point confidence of the target frame;
(3.2) extracting the 512 key points with the highest confidence as the tracked target key points;
(3.3) extracting the feature map feature position_feature corresponding to the BEV grid position of each tracked target key point, obtaining the bird's-eye-view-to-key-point projection feature;
(3.4) obtaining the actual position information of the target point (position_x, position_y) from the grid location (grid_x, grid_y), the actual metric size of each grid cell (voxel_x, voxel_y) and the minimum point cloud range (x_min, y_min); the specific calculation formula is as follows:

position_x = grid_x × voxel_x + x_min

position_y = grid_y × voxel_y + y_min
5. the method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model as claimed in claim 4, wherein the step (4) specifically comprises the following steps:
(4.1) information on the actual position of the target point
Figure QLYQS_8
And inputting the feature position_feature of the feature map into a transducer attention mechanism;
(4.2) extracting the sequence features of the front and rear frames by using a self-Attention mechanism (Attention), wherein the self-Attention mechanism is specifically as follows: k (Key) =q (Query) =v (Valuse), calculated as follows:
first, the dot product between Q and K is calculated; to prevent the result from being too large, it is divided by √d_k, where d_k is the dimension of K and K^T is the transposed matrix of the matrix K; the result is then normalized into a probability distribution by the softmax normalized exponential function and multiplied by the matrix V to obtain the final weighted summation result:

Attention(Q, K, V) = softmax(Q K^T / √d_k) V
(4.3) using the obtained weighted summation result of the current frame and the weighted summation result of the previous frame as the inputs of a cross-attention mechanism, and using the key point positions as the position embedding of the Transformer attention mechanism, thereby obtaining the degree of correlation between the targets of the preceding and current frames.
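A compact sketch of the scaled dot-product attention written out in step (4.2); the tensor shapes are assumptions, and the helper is reused in the cross-attention illustration after claim 6.

import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
    d_k = k.size(-1)
    scores = torch.matmul(q, k.transpose(-2, -1)) / (d_k ** 0.5)   # dot product scaled by sqrt(d_k)
    weights = F.softmax(scores, dim=-1)                            # normalize into a probability distribution
    return torch.matmul(weights, v)                                # weighted summation over V

def self_attention(frame_feat):
    # Self-attention as used in step (4.2): K = Q = V = the frame's key point features.
    return scaled_dot_product_attention(frame_feat, frame_feat, frame_feat)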
6. The method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model according to claim 5, wherein the step (4.3) is specifically as follows:
taking the current frame weighted summation result as K (Key) and V (Value), taking the previous frame weighted summation result as Q (Query), and calculating according to step (4.2) to obtain the degree of correlation between the targets of the preceding and current frames.
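Continuing the sketch above, one hedged reading of claim 6: the current-frame weighted summation result supplies K and V, the previous-frame result supplies Q, and the key point positions enter through a position embedding; the linear position embedding and the tensor shapes are assumptions.

import torch.nn as nn

def cross_frame_attention(cur_feat, prev_feat, cur_pos, prev_pos):
    # cur_feat / prev_feat: (B, 512, C) self-attention outputs of the current and previous frame.
    # cur_pos / prev_pos:   (B, 512, 2) key point positions used as position embedding.
    pos_embed = nn.Linear(2, cur_feat.size(-1))        # illustrative position embedding
    q = prev_feat + pos_embed(prev_pos)                # previous-frame result as Query
    k = v = cur_feat + pos_embed(cur_pos)              # current-frame result as Key and Value
    return scaled_dot_product_attention(q, k, v)       # degree of correlation between the two frames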
7. The method for realizing target detection, tracking and speed measurement of the 4D millimeter wave radar based on the deep learning network model as claimed in claim 6, wherein the step (5) specifically comprises the following steps:
(5.1) performing channel splicing on the output features of the cross-attention mechanism and the key point position information of the current frame to obtain spliced features;
(5.2) extracting fused features from the spliced features by using 3 convolution layers, and taking the first two dimensions of the fused features as the tracking center point; taking the result obtained by applying two convolution layers to the spliced features as the tracking confidence;
(5.3) normalizing the dimensions of the fused features other than the first two with a normalized exponential function, then performing channel splicing on the normalized features and the key point features of the current frame to obtain spliced features, from which the class confidence, regression box and velocity of target detection are obtained respectively through two convolution layers.
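As a rough sketch of step (5), the head below (PyTorch, with 1×1 Conv1d layers standing in for the convolution layers applied over the key point set) splices the cross-attention output with the current-frame key point positions, derives fused features, a tracking center and a tracking confidence, then feeds the softmax-normalized remaining dimensions, spliced with the current-frame key point features, into separate two-layer heads for class confidence, regression box and velocity; channel widths, box dimensionality and class count are assumptions.

import torch
import torch.nn as nn

class TrackingHeadSketch(nn.Module):
    # Illustrative head for step (5); channel widths and output sizes are assumptions.
    def __init__(self, feat_ch=256, num_classes=3, box_dim=7):
        super().__init__()
        in_ch = feat_ch + 2                                         # cross-attention features + (x, y) position
        self.fuse = nn.Sequential(                                  # three convolution layers -> fused features
            nn.Conv1d(in_ch, feat_ch, 1), nn.ReLU(inplace=True),
            nn.Conv1d(feat_ch, feat_ch, 1), nn.ReLU(inplace=True),
            nn.Conv1d(feat_ch, feat_ch, 1))
        self.track_conf = nn.Sequential(                            # two convolution layers -> tracking confidence
            nn.Conv1d(in_ch, feat_ch, 1), nn.ReLU(inplace=True), nn.Conv1d(feat_ch, 1, 1))
        def head(cout):                                             # two-layer detection heads
            return nn.Sequential(nn.Conv1d(2 * feat_ch - 2, feat_ch, 1),
                                 nn.ReLU(inplace=True), nn.Conv1d(feat_ch, cout, 1))
        self.cls_head, self.box_head, self.vel_head = head(num_classes), head(box_dim), head(2)

    def forward(self, attn_out, keypoint_pos, keypoint_feat):
        # attn_out: (B, C, N) cross-attention output; keypoint_pos: (B, 2, N); keypoint_feat: (B, C, N)
        spliced = torch.cat([attn_out, keypoint_pos], dim=1)        # channel splicing with key point positions
        fused = self.fuse(spliced)
        track_center = fused[:, :2]                                 # first two dimensions as tracking center
        track_score = self.track_conf(spliced)                      # tracking confidence
        rest = torch.softmax(fused[:, 2:], dim=1)                   # normalize the remaining dimensions
        det_in = torch.cat([rest, keypoint_feat], dim=1)            # splice with current-frame key point features
        return (track_center, track_score,
                self.cls_head(det_in), self.box_head(det_in), self.vel_head(det_in))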
8. The method for implementing 4D millimeter wave radar target detection, tracking and speed measurement based on deep learning network model according to claim 7, wherein the cross attention mechanism module further comprises:
predicting, from the fused features, the center point position of each key point of the current frame in the previous frame, flexibly adjusting the center offset threshold of the previous frame, and using this threshold as the matching criterion between the preceding and current frames and as the final tracking matching rule, specifically:
track_id_i = track_id_{i-1}, if |x_pred - x_cur| ≤ x_thresh, |y_pred - y_cur| ≤ y_thresh and track_score_i ≥ score_thresh

wherein track_id_i is the tracking id of the current frame, track_id_{i-1} is the tracking id of the previous frame, (x_pred, y_pred) is the center point predicted from the previous frame, (x_cur, y_cur) is the center point of the target tracked by the current frame, x_thresh and y_thresh are the allowed center offset thresholds, track_score_i is the tracking confidence, and score_thresh is the threshold for tracking.
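Under the definitions stated above, the matching rule can be read as the simple check sketched below; the threshold values and the new_id handling are assumptions added for illustration.

def match_track_id(pred_center, cur_center, prev_id, track_score,
                   x_thresh=1.0, y_thresh=1.0, score_thresh=0.3, new_id=None):
    # Inherit the previous frame's tracking id only when the predicted center lies within
    # the allowed offsets and the tracking confidence clears the threshold (claim-8 reading).
    dx = abs(pred_center[0] - cur_center[0])
    dy = abs(pred_center[1] - cur_center[1])
    if dx <= x_thresh and dy <= y_thresh and track_score >= score_thresh:
        return prev_id
    return new_id                                    # otherwise open a new track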
9. A device for realizing 4D millimeter wave radar target detection, tracking and speed measurement based on a deep learning network model, characterized in that the device comprises:
a processor configured to execute computer-executable instructions;
a memory storing one or more computer-executable instructions which, when executed by the processor, perform the steps of the method for implementing 4D millimeter wave radar target detection, tracking and speed measurement based on a deep learning network model of any one of claims 1 to 8.
10. A processor for implementing 4D millimeter wave radar target detection, tracking and speed measurement based on a deep learning network model, wherein the processor is configured to execute computer executable instructions that, when executed by the processor, implement the steps of the method for implementing 4D millimeter wave radar target detection, tracking and speed measurement based on a deep learning network model as claimed in any one of claims 1 to 8.
11. A computer readable storage medium having stored thereon a computer program executable by a processor to perform the steps of the method of achieving 4D millimeter wave radar target detection, tracking and speed measurement based on a deep learning network model as claimed in any one of claims 1 to 8.
CN202310647626.0A 2023-06-02 2023-06-02 4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning Active CN116403180B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310647626.0A CN116403180B (en) 2023-06-02 2023-06-02 4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310647626.0A CN116403180B (en) 2023-06-02 2023-06-02 4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning

Publications (2)

Publication Number Publication Date
CN116403180A true CN116403180A (en) 2023-07-07
CN116403180B CN116403180B (en) 2023-08-15

Family

ID=87009015

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310647626.0A Active CN116403180B (en) 2023-06-02 2023-06-02 4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning

Country Status (1)

Country Link
CN (1) CN116403180B (en)

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022120901A1 (en) * 2020-12-09 2022-06-16 中国科学院深圳先进技术研究院 Image detection model training method based on feature pyramid, medium, and device
CN112990050A (en) * 2021-03-26 2021-06-18 清华大学 Monocular 3D target detection method based on lightweight characteristic pyramid structure
US20220391621A1 (en) * 2021-06-04 2022-12-08 Microsoft Technology Licensing, Llc Occlusion-aware multi-object tracking
CN114067292A (en) * 2021-11-25 2022-02-18 纵目科技(上海)股份有限公司 Image processing method and device for intelligent driving
CN115035565A (en) * 2022-05-06 2022-09-09 中国兵器工业计算机应用技术研究所 Visual cortex imitated multi-scale small target detection method, device and equipment
CN114898403A (en) * 2022-05-16 2022-08-12 北京联合大学 Pedestrian multi-target tracking method based on Attention-JDE network
CN115327529A (en) * 2022-09-05 2022-11-11 中国科学技术大学 3D target detection and tracking method fusing millimeter wave radar and laser radar
CN115690072A (en) * 2022-11-11 2023-02-03 楚雄师范学院 Chest radiography feature extraction and disease classification method based on multi-mode deep learning
CN115861884A (en) * 2022-12-06 2023-03-28 中南大学 Video multi-target tracking method, system, device and medium in complex scene

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ASHISH PANDHARIPANDE et al.: "Sensing and Machine Learning for Automotive Perception: A Review", IEEE SENSORS JOURNAL, vol. 23, no. 11 *
ZHOU Yan et al., Journal of Frontiers of Computer Science and Technology, pages 2695-2717 *

Also Published As

Publication number Publication date
CN116403180B (en) 2023-08-15

Similar Documents

Publication Publication Date Title
Lim et al. Radar and camera early fusion for vehicle detection in advanced driver assistance systems
Liang et al. Multi-task multi-sensor fusion for 3d object detection
CN111201451B (en) Method and device for detecting object in scene based on laser data and radar data of scene
CN113159151B (en) Multi-sensor depth fusion 3D target detection method for automatic driving
Dreher et al. Radar-based 2D car detection using deep neural networks
Deng et al. MLOD: A multi-view 3D object detection based on robust feature fusion method
US20210018615A1 (en) Methods and systems for object detection
Ulrich et al. Improved orientation estimation and detection with hybrid object detection networks for automotive radar
Li et al. A feature pyramid fusion detection algorithm based on radar and camera sensor
Song et al. End-to-end learning for inter-vehicle distance and relative velocity estimation in ADAS with a monocular camera
Dimitrievski et al. Weakly supervised deep learning method for vulnerable road user detection in FMCW radar
CN116681730A (en) Target tracking method, device, computer equipment and storage medium
CN116310673A (en) Three-dimensional target detection method based on fusion of point cloud and image features
CN116246119A (en) 3D target detection method, electronic device and storage medium
Li et al. Vehicle object detection based on rgb-camera and radar sensor fusion
Dimitrievski et al. Semantically aware multilateral filter for depth upsampling in automotive lidar point clouds
EP4152274A1 (en) System and method for predicting an occupancy probability of a point in an environment, and training method thereof
CN116403180B (en) 4D millimeter wave radar target detection, tracking and speed measurement method based on deep learning
CN117274036A (en) Parking scene detection method based on multi-view and time sequence fusion
Gu et al. Radar-enhanced image fusion-based object detection for autonomous driving
US20230281877A1 (en) Systems and methods for 3d point cloud densification
Kim et al. Rcm-fusion: Radar-camera multi-level fusion for 3d object detection
Cai et al. 3D vehicle detection based on LiDAR and camera fusion
Ma et al. Disparity estimation based on fusion of vision and LiDAR
Li et al. Attention-based radar and camera fusion for object detection in severe conditions

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant