CN110991534A - Point cloud data processing method, device, equipment and computer readable storage medium

Info

Publication number
CN110991534A
CN110991534A
Authority
CN
China
Prior art keywords
cloud data
point cloud
anchor
data processing
point
Prior art date
Legal status
Pending
Application number
CN201911221610.3A
Other languages
Chinese (zh)
Inventor
周康明
郭义波
Current Assignee
Shanghai Eye Control Technology Co Ltd
Original Assignee
Shanghai Eye Control Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Shanghai Eye Control Technology Co Ltd filed Critical Shanghai Eye Control Technology Co Ltd
Priority to CN201911221610.3A priority Critical patent/CN110991534A/en
Publication of CN110991534A publication Critical patent/CN110991534A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/23 Clustering techniques
    • G06F18/232 Non-hierarchical techniques
    • G06F18/2321 Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G06F18/253 Fusion techniques of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00 Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07 Target detection
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00 Indexing scheme relating to image or video recognition or understanding
    • G06V2201/08 Detecting or categorising vehicles

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
  • Traffic Control Systems (AREA)

Abstract

The application provides a point cloud data processing method, a point cloud data processing device, point cloud data processing equipment, and a computer readable storage medium. The scheme includes the following steps: acquiring point cloud data in a target area; clustering the point cloud data according to a plurality of anchor boxes with different yaw angles; extracting data features of the point cloud data in each anchor box; and generating a feature map according to the extracted data features. Because a plurality of anchor boxes are used to cluster the point cloud data, and these anchor boxes all have different yaw angles, the clustered point cloud data are guaranteed to be diverse, and the extracted low-dimensional data features are richer in kind. A corresponding feature map can be generated from the extracted data features of the point cloud data, and a deep convolutional network can then directly process it to obtain a vehicle detection result.

Description

Point cloud data processing method, device, equipment and computer readable storage medium
Technical Field
The present disclosure relates to a target detection technology, and in particular, to a method, an apparatus, a device, and a computer-readable storage medium for processing point cloud data.
Background
Deep learning has developed rapidly and been studied intensively in recent years. Since the rise of deep learning, convolutional neural networks have been widely used in image recognition and target detection. Unlike target detection algorithms that take images as input, three-dimensional target detection mainly takes laser point cloud data as input. However, laser point clouds are sparse and irregular, and it is generally difficult to process point cloud data directly with a deep convolutional network, which restricts the application of deep convolutional networks. Enabling such processing is therefore an urgent technical problem for those skilled in the art.
Disclosure of Invention
The application provides a point cloud data processing method, a point cloud data processing device, point cloud data processing equipment, and a computer readable storage medium, aiming to solve the prior-art technical problem that point cloud data cannot be processed directly by a deep convolutional network.
A first aspect of the present application provides a point cloud data processing method, including:
acquiring point cloud data in a target area;
clustering the point cloud data according to a plurality of anchor boxes with different yaw angles;
extracting data features of the point cloud data in each anchor box;
and generating a feature map according to the extracted data features.
Another aspect of the present application provides a point cloud data processing apparatus, including:
the acquisition module is used for acquiring point cloud data in a target area;
the clustering module is used for clustering the point cloud data according to a plurality of anchor boxes with different yaw angles;
the extraction module is used for extracting the data features of the point cloud data in each anchor box;
and the generating module is used for generating a feature map according to the extracted data features.
Yet another aspect of the present application provides a point cloud data processing apparatus including:
a memory;
a processor; and
a computer program;
wherein the computer program is stored in the memory and configured to be executed by the processor to implement the point cloud data processing method.
Yet another aspect of the present application is to provide a computer-readable storage medium, having stored thereon a computer program,
the computer program is executed by a processor to implement the point cloud data processing method.
The technical effects of the point cloud data processing method, the point cloud data processing device, the point cloud data processing equipment and the computer readable storage medium are as follows:
the application provides a point cloud data processing method, a point cloud data processing device and a computer readable storage medium, wherein the point cloud data processing device comprises the following steps: acquiring point cloud data in a target area; clustering the point cloud data according to a plurality of anchor point frames with different yaw angles; extracting data characteristics of the point cloud data in each anchor point frame; and generating a feature map according to the extracted data features. According to the point cloud data processing method, the point cloud data processing device, the point cloud data processing equipment and the computer readable storage medium, the point cloud data are clustered by adopting the anchor point frames, and the anchor point frames used for clustering the point cloud data all have different yaw angles, so that the point cloud data obtained by clustering have diversity, and the extracted low-dimensional data feature types are more abundant. Corresponding feature maps can be generated by extracting the data features of the cloud data, so that the deep convolutional network can be directly utilized for processing and obtaining a vehicle detection result, and the applicability of the deep convolutional network is improved.
Drawings
Fig. 1 is a flowchart illustrating a point cloud data processing method according to an exemplary embodiment of the present application;
FIG. 2 is a flow chart of a point cloud data processing method according to another exemplary embodiment of the present application;
FIG. 3 is a schematic spatial view of a three-dimensional point cloud as shown in an exemplary embodiment of the present application;
FIG. 4 is a bird's eye view of a three-dimensional point cloud as shown in an exemplary embodiment of the present application;
FIG. 5 is a schematic diagram of a feature map generation process shown in an exemplary embodiment of the present application;
FIG. 6 is a schematic diagram illustrating a subsequent processing of a feature map according to an exemplary embodiment of the present application;
FIG. 7 is a block diagram of a point cloud data processing apparatus according to an exemplary embodiment of the present application;
FIG. 8 is a block diagram of a point cloud data processing apparatus according to another exemplary embodiment of the present application;
FIG. 9 is a block diagram of a point cloud data processing apparatus according to yet another exemplary embodiment of the present application;
fig. 10 is a block diagram of a point cloud data processing apparatus according to an exemplary embodiment of the present application.
Detailed Description
To process point cloud data with a conventional deep convolutional network, one common practice is to voxelize the point cloud, as in Voxel-net (a three-dimensional target detection algorithm): the three-dimensional space is cut into many small voxels, and 3d convolution is performed over the whole target space. This method, however, has the disadvantage that 3d convolution is slow. Furthermore, unlike the anchor-boxes (anchor point boxes) in two-dimensional image target detection algorithms, which are all axis-aligned rectangular boxes, the yaw angle of a target (e.g., a vehicle) in three-dimensional target detection has many possibilities.
In two-dimensional target detection, the axis-aligned rectangular anchor-box matches well with the axis-aligned sliding of the convolution operation. In three-dimensional target detection, however, the anchor-box generally may take any of various yaw angles, and a 3d convolution that moves along axis-aligned directions can hardly match well with anchor boxes of so many different yaw angles.
In another approach, Frustum-pointnet (a frustum point cloud network, also a three-dimensional target detection algorithm) processes point cloud data directly through Pointnet. This method extracts the data features of the point cloud within a frustum-shaped target region through Pointnet, then divides the point cloud into foreground and background, and feeds the data features of the foreground points into an Rpn network for detection and classification. However, this method requires the target to be detected on the image in advance by a two-dimensional target detection algorithm; if the two-dimensional detection misses the target, the method fails.
To address these problems, the scheme provided by the embodiments of the application clusters the point cloud data with a plurality of anchor boxes, all of which have different yaw angles. This ensures that the clustered point cloud data are diverse and that the extracted low-dimensional data features are richer in kind. A corresponding feature map can be generated from the extracted data features of the point cloud data, and a deep convolutional network can then directly process it to obtain a vehicle detection result.
Fig. 1 is a flowchart illustrating a point cloud data processing method according to an exemplary embodiment of the present application.
As shown in fig. 1, the method for processing point cloud data provided by this embodiment includes:
Step 101, point cloud data in a target area is obtained.
The point cloud data processing method provided by this embodiment may be executed by an electronic device with computing capability, for example a computer. The method may be provided on the electronic device in the form of software.
Specifically, the electronic device may store the point cloud data to be processed in advance; alternatively, the point cloud data may be stored on another device, in which case the electronic device accesses that device in order to read the point cloud data.
Further, the point cloud data may be acquired directly and in real time by a laser radar. The laser radar may be installed near the target area to be detected, and acquires the point cloud data in the target area in real time as a target, such as a vehicle, moves within it.
Referring to fig. 3 and 4, the point cloud data, in the form (x, y, z, r), represent the position of the vehicle in the three-dimensional space of the target area through the x-axis, y-axis, and z-axis coordinates, where r represents the reflectivity. For example, the x-axis range (0, 70.4) corresponds to the forward direction, the y-axis range (-40, 40) corresponds to the left-to-right direction, and the z-axis range (0, 4) corresponds to the direction from the ground up toward the sky. Similar coordinates can be read by analogy with these examples and are not described in detail herein.
Step 102, the point cloud data are clustered according to a plurality of anchor boxes with different yaw angles.
After the point cloud data are clustered through anchor boxes with different yaw angles, the clustered point cloud data are guaranteed to be diverse, so the kinds of extracted low-dimensional data features are richer.
When a feature map is generated from the data features of the point cloud data, the resulting feature map can be processed directly by a deep convolutional network, and a vehicle detection result can be obtained effectively after that processing, which expands the applicability of deep convolutional networks.
Referring to fig. 5, in one embodiment, the anchor boxes are generated as follows: an image grid is generated in the x-y plane of the target area, and each grid cell generates a predetermined number of anchor boxes. For example, an image grid corresponding to the feature map may be generated, or divided, within the x-y plane of the target area, and a predetermined number of anchor boxes are then generated in each grid cell.
Of course, a person skilled in the art may also generate the anchor boxes according to other similar rules. It is only required that, after generation, the anchor boxes are uniformly distributed over the x-y plane of the target area and that each position has a plurality of anchor boxes with different yaw angles, which is not limited herein.
Continuing with the illustration of fig. 5, by way of example, an image grid corresponding to the Feature-map (feature map) may be divided on the x-y plane, i.e., the bird's-eye-view plane, at intervals of 0.4 meters, giving a grid cell size of 0.4 x 0.4. The y-axis coordinate range may be set to (-40, 40) in meters, and the x-axis coordinate range to (0, 70.4) in meters. Thus there are 80/0.4 = 200 intervals (cells) along the y-axis and 70.4/0.4 = 176 intervals (cells) along the x-axis, so the image grid corresponding to the feature map contains 200 x 176 cells.
In addition, when dividing the x-y plane of the target area, or when generating the image grid in that plane, a person skilled in the art may also divide the grid according to other sizes to obtain cells of an appropriate size, which is not limited herein.
In a preferred embodiment, with continued reference to FIG. 5, 4 anchor boxes with different yaw angles may be generated at the center of each grid cell. Anchor boxes are thus generated at the center of every cell, and the spatial shape of the feature map is 200 x 176. From the above, the corresponding ranges on the x-y plane are (-40, 40) and (0, 70.4). Therefore, 200 coordinate points are taken uniformly in the range -40 to 40 and 176 coordinate points uniformly in the range 0 to 70.4; these x-axis and y-axis coordinates serve as the centers of the anchor boxes and combine into a coordinate-point matrix of shape 200x176. Meanwhile, the z-axis coordinate may preferably be set to 0.5, that is, the center points of the vehicle targets are all assumed to be 0.5 m above the ground.
For example, the length, width, and height of each anchor box may be set to (3.9, 1.6, 1.56), and the yaw angles of the 4 different anchor boxes to (0, 0.79, 1.57, 2.37), respectively. In this embodiment the number of anchor boxes is therefore 200x176x4 = 140800, and the generated anchor box matrix has shape 200x176x4x7. Note that the "7" in the last dimension represents the center coordinates (x, y, z) of the anchor box, the width, length, and height (w, l, h) of the anchor box, and the yaw angle (θ) of the anchor box.
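As a concrete illustration, the following is a minimal sketch of this anchor generation in Python. The grid sizes, anchor dimensions, and yaw angles are those of this embodiment; the function name generate_anchors and the cell-center placement via linspace are illustrative assumptions rather than text of the patent.

```python
import numpy as np

def generate_anchors():
    # 200 cell centers along y in (-40, 40) and 176 along x in (0, 70.4),
    # spaced 0.4 m apart; the z coordinate of every center is taken as 0.5 m.
    ys = np.linspace(-40 + 0.2, 40 - 0.2, 200)
    xs = np.linspace(0 + 0.2, 70.4 - 0.2, 176)
    yaws = [0.0, 0.79, 1.57, 2.37]      # 4 yaw angles per grid cell
    w, l, h = 1.6, 3.9, 1.56            # anchor width, length, height

    anchors = np.zeros((200, 176, 4, 7), dtype=np.float32)
    for i, y in enumerate(ys):
        for j, x in enumerate(xs):
            for k, theta in enumerate(yaws):
                # Last dimension: (x, y, z, w, l, h, theta)
                anchors[i, j, k] = (x, y, 0.5, w, l, h, theta)
    return anchors

anchors = generate_anchors()        # shape (200, 176, 4, 7)
assert anchors.size == 140800 * 7   # 200 x 176 x 4 = 140800 anchors
```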
Additionally, in one embodiment, clustering the point cloud data according to anchor boxes with different yaw angles comprises:
projecting the point cloud data onto the x-y plane of the target area, and grouping the point cloud data projected into the same anchor box into one cluster.
That is, when clustering the point cloud data, projection onto the x-y plane can be adopted: the point cloud data of the target area are projected onto the x-y plane, and the points projected into the same anchor box are grouped into one cluster, which completes the clustering of the point cloud data.
This clustering mode gathers varied point cloud data into anchor boxes with different yaw angles, ensuring rich data features when the features of the point cloud data are subsequently extracted, as in the sketch below.
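A minimal sketch of the projection test, assuming each point's bird's-eye-view offset from the anchor center is rotated into the anchor's local frame and compared with the box footprint; the helper name points_in_anchor is illustrative.

```python
import numpy as np

def points_in_anchor(points, anchor):
    """points: (N, 4) array of (x, y, z, r); anchor: (x, y, z, w, l, h, theta)."""
    ax, ay, _, w, l, _, theta = anchor
    dx = points[:, 0] - ax
    dy = points[:, 1] - ay
    c, s = np.cos(theta), np.sin(theta)
    # Rotate the bird's-eye-view offsets into the anchor's local frame.
    local_x = c * dx + s * dy     # along the anchor's length axis
    local_y = -s * dx + c * dy    # along the anchor's width axis
    inside = (np.abs(local_x) <= l / 2) & (np.abs(local_y) <= w / 2)
    return points[inside]         # the cluster belonging to this anchor box
```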
Step 103, the data features of the point cloud data in each anchor box are extracted.
Next, features are extracted from the clustered point cloud data so that a feature map can be generated from the extracted data features. When extracting the data features, a predetermined number of points may be selected within each anchor box.
For example, 128 points can be randomly selected from each anchor box, and the data features of these 128 points are extracted one by one. The data features are then placed at the corresponding positions of the feature map, thereby generating the required feature map.
Because the point cloud data are clustered through a plurality of anchor boxes with different yaw angles before the data features are extracted one by one, the feature map contains rich data types and is suitable for processing with a deep convolutional network.
In one embodiment, extracting the data features of the point cloud data in each of the anchor boxes comprises:
selecting a predetermined number of points of the point cloud data in each anchor box, and extracting the data features of the selected points according to a Pointnet network.
That is, after a predetermined number of points are randomly selected in each anchor box, the features of the randomly selected points may be extracted one by one using a Pointnet network.
With continued reference to fig. 5, for example, if 128 points are randomly selected from each anchor box as described above, a feature map of size 200x176x128 is obtained. Meanwhile, if 4 anchor boxes with different yaw angles are generated at the center of each grid cell, the feature map finally obtained has shape 4x200x176x128.
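A minimal sketch of this sampling and extraction step, assuming a heavily simplified Pointnet-style extractor (a shared per-point MLP followed by max pooling); the class TinyPointNet and the helper sample_points are illustrative stand-ins for the full Pointnet.

```python
import torch
import torch.nn as nn

class TinyPointNet(nn.Module):
    """Shared per-point MLP + symmetric max pool, one feature per anchor."""
    def __init__(self, out_dim=128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(4, 64), nn.ReLU(),        # per-point (x, y, z, r) -> 64
            nn.Linear(64, out_dim), nn.ReLU(),
        )

    def forward(self, pts):                      # pts: (n_points, 4)
        return self.mlp(pts).max(dim=0).values   # (out_dim,) anchor feature

def sample_points(cluster, n=128):
    # Sample with replacement so that sparse anchors still yield n points.
    if cluster.shape[0] == 0:
        return torch.zeros(n, 4)                 # empty anchors padded with zeros
    idx = torch.randint(0, cluster.shape[0], (n,))
    return cluster[idx]
```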
Step 104, a feature map is generated according to the extracted data features.
After features are extracted from the point cloud data clustered in the anchor boxes at each position of the feature map, the corresponding features are placed at the corresponding positions on the feature map, forming the required feature map. The feature map may also be generated in other manners; a person skilled in the art may select an appropriate way, according to requirements, to form the required feature map from the extracted data features, which is not limited herein.
The method provided by this embodiment may be performed by a device on which it is provided, and such a device is typically implemented in hardware and/or software.
The point cloud data processing method provided by this embodiment includes: acquiring point cloud data in a target area; clustering the point cloud data according to a plurality of anchor boxes with different yaw angles; extracting data features of the point cloud data in each anchor box; and generating a feature map according to the extracted data features. Because a plurality of anchor boxes are used to cluster the point cloud data, and these anchor boxes all have different yaw angles, the clustered point cloud data are guaranteed to be diverse, and the extracted low-dimensional data features are richer in kind. A corresponding feature map can be generated from the extracted data features of the point cloud data, so that a deep convolutional network can directly process it to obtain a vehicle detection result, improving the applicability of deep convolutional networks.
Fig. 2 is a flowchart illustrating a point cloud data processing method according to another exemplary embodiment of the present application.
As shown in fig. 2, the point cloud data processing method provided in this embodiment includes:
step 201, point cloud data in a target area is obtained.
Step 202, the point cloud data are clustered according to a plurality of anchor boxes with different yaw angles.
Step 203, the data features of the point cloud data in each anchor box are extracted.
Step 204, a feature map is generated according to the extracted data features.
The specific principle and implementation of steps 201 to 204 are similar to those of steps 101 to 104, and reference may be made to the related descriptions above, which are not repeated herein.
Step 205, feature fusion processing is performed on the feature map, and the processed feature map undergoes regression processing and classification processing to generate a vehicle detection result.
Referring to fig. 2 and 6, after the feature map is obtained in the above manner, the feature map may be subjected to feature fusion processing, and the processed feature map may be subjected to regression processing and classification processing to generate a vehicle detection result.
Specifically, as described above, a corresponding feature map is obtained after the point cloud data are clustered and the data features extracted. As shown in fig. 6, the feature map may optionally undergo feature fusion processing according to the Hrnet network structure; high-resolution features are obtained after fusion through the Hrnet structure, that is, the size of the feature map is not changed.
Alternatively, a resnet or vgg network structure may be used to fuse the feature map. Since the feature map becomes smaller after fusion through a resnet or vgg structure, it then needs to be restored to its original size by deconvolution or upsampling.
Taking the feature fusion according to the Hrnet network structure as an example: when the obtained feature map is fused through the Hrnet structure, high-resolution data features can be extracted, and the size of the feature map after the fusion processing is unchanged.
As shown in fig. 6, for example, if each feature map obtained before fusion has size 200x176x128, a feature stitching step is performed first, concatenating the four per-yaw-angle maps along the channel dimension into a 200x176x512 feature map. After the fusion processing of the Hrnet network structure, the scale of the feature map becomes 200x176x128 again. The highest-resolution data features obtained in this way are then input into the Rpn for regression processing and classification processing.
Before that, after the anchor boxes are generated according to the foregoing rule, the anchor boxes and the Ground-truth (the real values of the vehicle targets in the target area) produced by the above process may be encoded.
The Ground-truth is organized as follows: if there are 3 vehicles in the target area, the Ground-truth matrix is 3x7, where the last dimension "7" has the same meaning as the last dimension "7" of the anchor box matrix, i.e., the center coordinates (x, y, z), the width, length, and height (w, l, h), and the yaw angle (θ).
First, the score of each anchor box needs to be calculated against the Ground-truth, with the following rule:
and calculating the iou (intersection ratio) of all anchor point frames and the group-truth in the x-y axis plane. The "intersection" in the intersection ratio refers to the overlapping area of the two regions, and refers to the sum of the areas of the two regions minus the overlapping area. For example, if the intersection area of the two regions is 20 and the union area is 100, the intersection ratio is 0.2.
The specific rule is as follows: if the iou of an anchor box with a Ground-truth is greater than 0.6, the classification label of the anchor box is marked as a positive class, and the Ground-truth with the largest iou is selected for calculating the regression label. If the iou of the anchor box with every Ground-truth is less than 0.45, the classification label of the anchor box is marked as a negative class. If the iou of the anchor box with all Ground-truths lies between 0.45 and 0.6, the anchor box does not participate in the calculation.
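A minimal sketch of this label assignment, assuming axis-aligned rectangles (x1, y1, x2, y2) for the overlap test; the anchors in this embodiment are rotated, so a full implementation would need a rotated-box intersection. The names iou_2d and assign_label are illustrative.

```python
import numpy as np

def iou_2d(a, b):
    """a, b: axis-aligned boxes (x1, y1, x2, y2) in the x-y plane."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0

def assign_label(anchor_box, gt_boxes):
    if len(gt_boxes) == 0:
        return 0, None               # no ground truth: negative
    ious = [iou_2d(anchor_box, gt) for gt in gt_boxes]
    best = int(np.argmax(ious))
    if ious[best] > 0.6:
        return 1, best               # positive: regress against best gt
    if ious[best] < 0.45:
        return 0, None               # negative
    return -1, None                  # 0.45..0.6: excluded from training
```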
For an anchor box whose classification label is a positive class, its regression label is obtained by encoding as follows:

Δx = (x^g - x^a) / d^a
Δy = (y^g - y^a) / d^a
Δz = (z^g - z^a) / h^a
Δl = (l^g - l^a) / l^a
Δw = (w^g - w^a) / w^a
Δh = (h^g - h^a) / h^a
Δθ = θ^g - θ^a

wherein Δx, Δy, Δz, Δl, Δw, Δh, Δθ are the encoded regression label of the anchor box, the superscript g in each formula denotes the value of the Ground-truth matched to the positive anchor box, the superscript a denotes the anchor box, and d^a = sqrt((l^a)^2 + (w^a)^2) is the bird's-eye-view diagonal of the anchor box. For each anchor box labeled as a positive class, the regression label is calculated with the formulas above.
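Under the same shape conventions, a minimal sketch of the encoding; the diagonal d^a used to normalize the x and y offsets follows the common anchor-based 3D detection convention and should be read as an assumption.

```python
import numpy as np

def encode(gt, anchor):
    """gt and anchor as (x, y, z, w, l, h, theta); returns the regression label."""
    xg, yg, zg, wg, lg, hg, tg = gt
    xa, ya, za, wa, la, ha, ta = anchor
    da = np.sqrt(la ** 2 + wa ** 2)   # bird's-eye-view diagonal d^a
    return np.array([
        (xg - xa) / da,               # Δx
        (yg - ya) / da,               # Δy
        (zg - za) / ha,               # Δz
        (lg - la) / la,               # Δl
        (wg - wa) / wa,               # Δw
        (hg - ha) / ha,               # Δh
        tg - ta,                      # Δθ
    ])
```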
In addition, for an anchor box marked as a positive class, the class of its yaw angle is encoded at the same time: when the yaw angle of the Ground-truth corresponding to the positive anchor box is greater than 0, the yaw-angle label is marked as a positive class, and when it is not greater than 0, the yaw-angle label is marked as a negative class.
The encoded regression labels of the anchor boxes obtained through the above formulas have shape 200x176x4x7. The encoded classification labels have shape 200x176x4x2, where the "2" in the last dimension is the one-hot encoding of the 0/1 classes. The encoded yaw-angle classification labels likewise have shape 200x176x4x2, with the "2" in the last dimension again being the one-hot encoding of the 0/1 classes.
The feature map is then input into the Rpn network for regression and classification processing. The Rpn network consists of a regression branch and classification branches, each composed of multiple convolutional layers. The regression branch is responsible for predicting the deviation between the anchor box and the Ground-truth, so it outputs the deviation relative to the anchor box, namely Δx, Δy, Δz, Δw, Δl, Δh, Δθ. With continued reference to fig. 6, in this embodiment the matrix output here has shape 200x176x4x7, matching the regression label obtained above.
The classification branch comprises a vehicle classification branch and a yaw angle classification branch.
The vehicle classification branch is responsible for predicting whether each anchor box is a vehicle, yielding the probability of each class. Vehicle classification has two categories, i.e., vehicle or not. The matrix output by the vehicle classification branch has shape 200x176x4x2, matching the classification label obtained above.
The yaw-angle classification branch is responsible for predicting the yaw angle class, which has two categories, positive or negative. The matrix output by the yaw-angle classification branch has shape 200x176x4x2, matching the yaw-angle classification label described above.
Therefore, three kinds of labels are finally obtained: a regression label, a vehicle classification label, and a yaw-angle classification label. The header of the Rpn network correspondingly has three outputs, namely:
car_cls_header, header_cls_header, and reg_header.
The matrix shapes of the three outputs are 200x176x4x2, 200x176x4x2, and 200x176x4x7, respectively. Focal-loss is selected as the loss function for car_cls_header and for header_cls_header, and smooth-l1 loss for reg_header. Adam may be selected as the learning strategy, with an initial learning rate of 0.001 that is gradually reduced as the iterative training proceeds, becoming 0.5 times the previous rate every 10 epochs. Training runs for 100 epochs, and the trained model is finally saved.
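A minimal sketch of this training schedule in PyTorch, with a placeholder module standing in for the full network; focal loss has no built-in PyTorch implementation and is only indicated in a comment.

```python
import torch

model = torch.nn.Linear(128, 7 + 2 + 2)   # placeholder for the Rpn heads
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
# Halve the learning rate every 10 epochs, as described above.
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.5)

for epoch in range(100):
    # ... forward pass; focal loss on car_cls_header and header_cls_header,
    # smooth-L1 loss (torch.nn.functional.smooth_l1_loss) on reg_header;
    # loss.backward(); optimizer.step(); optimizer.zero_grad() ...
    scheduler.step()
```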
Finally, the point cloud data are input into the trained model, and the output is decoded and screened. That is, the point cloud is fed to the model to obtain a classification result and a regression result; the classification results are first sorted in descending order of vehicle score, and the classification and regression results of the preset number of highest-scoring anchor boxes are retained. For example, the 1000 highest-scoring anchor boxes may be retained, or another number may be retained according to actual requirements. The retained classification results and regression results are then decoded, with the following decoding formulas:
x^g = Δx * d^a + x^a
y^g = Δy * d^a + y^a
z^g = Δz * h^a + z^a
l^g = Δl * l^a + l^a
w^g = Δw * w^a + w^a
h^g = Δh * h^a + h^a
θ^g = Δθ + θ^a
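A minimal sketch of the decoding step, inverting encode() under the same assumptions:

```python
def decode(delta, anchor):
    """delta: (Δx, Δy, Δz, Δl, Δw, Δh, Δθ); anchor: (x, y, z, w, l, h, theta)."""
    dx, dy, dz, dl, dw, dh, dt = delta
    xa, ya, za, wa, la, ha, ta = anchor
    da = (la ** 2 + wa ** 2) ** 0.5   # same diagonal as in encode()
    return (dx * da + xa, dy * da + ya, dz * ha + za,
            dw * wa + wa, dl * la + la, dh * ha + ha, dt + ta)
```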
through the decoding, real output can be obtained by combining the yaw angle classification result, and the optimal result is screened through nms by using the 1000 real outputs and the scores thereof, and the vehicle detection result is output as a final result.
Here nms is the non-maximum suppression algorithm. For example, given 1000 three-dimensional output boxes and their corresponding scores, the box with the highest classification score is first selected and stored. The 3d iou between this box and each of the remaining 999 boxes is then computed.
If a computed 3d iou is greater than a predetermined threshold, for example greater than 0.1, the two compared three-dimensional output boxes are considered to overlap too much, and the lower-scoring one of the two is discarded. If the computed 3d iou is less than 0.1, the corresponding three-dimensional output box is retained. The predetermined threshold may be set according to actual conditions and is not limited herein.
After this round, the calculation is repeated on the remaining boxes in the same way. For example, if 500 three-dimensional output boxes were retained in the previous round, the highest-scoring box is taken out of the 500 and stored, and the 3d iou between it and the remaining 499 boxes is computed. Still using 0.1 as the preset threshold, a box whose 3d iou is smaller than 0.1 is retained, and one whose 3d iou is larger than 0.1 is discarded. These operations are repeated, and the final result is the set of stored three-dimensional output boxes. Those skilled in the art can apply this algorithm to compute the final result, which is not described further herein.
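A minimal sketch of this greedy loop; the rotated 3D IoU is abstracted behind a caller-supplied iou_3d function, since its exact computation is beyond this sketch, and the name nms_3d is illustrative.

```python
def nms_3d(boxes, scores, iou_3d, thresh=0.1, top_k=1000):
    """Greedy score-sorted NMS; iou_3d(a, b) must return the 3D IoU of two boxes."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    order = order[:top_k]   # keep the 1000 highest-scoring boxes
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        # Discard remaining boxes that overlap the kept one by more than thresh.
        order = [i for i in order if iou_3d(boxes[best], boxes[i]) <= thresh]
    return keep
```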
Fig. 7 is a block diagram of a point cloud data processing apparatus according to an exemplary embodiment of the present application.
As shown in fig. 7, the present embodiment provides a point cloud data processing apparatus, including:
an obtaining module 31, configured to obtain point cloud data in a target area;
a clustering module 32, configured to cluster the point cloud data according to a plurality of anchor boxes with different yaw angles;
an extracting module 33, configured to extract data features of the point cloud data in each anchor box;
and a generating module 34, configured to generate a feature map according to the extracted data features.
The present embodiment provides a point cloud data processing apparatus, including: an obtaining module 31, configured to obtain point cloud data in a target area; a clustering module 32, configured to cluster the point cloud data according to a plurality of anchor boxes with different yaw angles; an extracting module 33, configured to extract data features of the point cloud data in each anchor box; and a generating module 34, configured to generate a feature map according to the extracted data features.
In the apparatus provided by this embodiment, a plurality of anchor boxes are used to cluster the point cloud data, and these anchor boxes all have different yaw angles, which guarantees that the clustered point cloud data are diverse and that the extracted low-dimensional data features are richer in kind. A corresponding feature map can be generated from the extracted data features of the point cloud data, and a deep convolutional network can then directly process it to obtain a vehicle detection result.
The specific principle and implementation of the point cloud data processing apparatus provided in this embodiment are similar to those of the related technical contents in the method embodiment shown in fig. 1, and the related technical contents may refer to the foregoing descriptions and are not described herein again.
Fig. 8 is a block diagram of a point cloud data processing apparatus according to another exemplary embodiment of the present application.
As shown in fig. 8, on the basis of the above embodiment, optionally, the clustering module 32 includes a point cloud projection unit 321, configured to project the point cloud data onto the x-y plane of the target area and group the point cloud data projected into the same anchor box into one cluster.
Optionally, the clustering module 32 includes a grid generating unit 322, configured to generate an image grid in the x-y plane of the target area, each grid cell generating a predetermined number of anchor boxes.
Optionally, the center of each grid cell generates 4 anchor boxes with different yaw angles.
Optionally, the extracting module 33 is specifically configured to:
and selecting a predetermined amount of point cloud data in each anchor point frame, and extracting the data characteristics of the selected point cloud data according to a Pointernet network.
The specific principle and implementation of the point cloud data processing apparatus provided in this embodiment are similar to those of the embodiment shown in fig. 7, and are not described here again.
Fig. 9 is a block diagram of a point cloud data processing apparatus according to still another exemplary embodiment of the present application.
As shown in fig. 9, the present embodiment provides a point cloud data processing apparatus, including:
an obtaining module 41, configured to obtain point cloud data in a target area;
a clustering module 42, configured to cluster the point cloud data according to a plurality of anchor boxes with different yaw angles;
an extracting module 43, configured to extract data features of the point cloud data in each anchor box;
a generating module 44, configured to generate a feature map according to the extracted data features;
and the processing module 45 is configured to perform feature fusion processing on the feature map, and generate a vehicle detection result after performing regression processing and classification processing on the processed feature map.
The present embodiment provides a point cloud data processing apparatus, including: an obtaining module 41, configured to obtain point cloud data in a target area; a clustering module 42, configured to cluster the point cloud data according to a plurality of anchor boxes with different yaw angles; an extracting module 43, configured to extract data features of the point cloud data in each anchor box; a generating module 44, configured to generate a feature map according to the extracted data features; and a processing module 45, configured to perform feature fusion processing on the feature map and to generate a vehicle detection result after regression processing and classification processing of the processed feature map.
In the apparatus provided by this embodiment, a plurality of anchor boxes are used to cluster the point cloud data, and these anchor boxes all have different yaw angles, which guarantees that the clustered point cloud data are diverse and that the extracted low-dimensional data features are richer in kind. A corresponding feature map can be generated from the extracted data features of the point cloud data and processed directly by a deep convolutional network; the vehicle detection result is then finally obtained through the subsequent fusion network and classification network.
The specific principle and implementation of the point cloud data processing apparatus provided in this embodiment are similar to those of the related technical contents in the method embodiment shown in fig. 2, and the related technical contents may refer to the foregoing descriptions and are not described herein again.
On the basis of the foregoing embodiment, in the point cloud data processing apparatus provided in this embodiment, optionally, the processing module 45 is specifically configured to:
and performing feature fusion processing on the feature map, and generating a vehicle detection result after regression processing and classification processing on the processed feature map.
Optionally, the feature fusion processing is performed on the feature map according to the Hrnet network structure.
The specific principle and implementation of the apparatus provided in this embodiment are similar to those of the embodiment shown in fig. 9, and are not described herein again.
Fig. 10 is a block diagram of a point cloud data processing apparatus according to an exemplary embodiment of the present application.
As shown in fig. 10, the present embodiment provides a point cloud data processing apparatus, including:
a memory 51;
a processor 52; and
a computer program;
wherein the computer program is stored in the memory 51 and configured to be executed by the processor 52 to implement any one of the point cloud data processing methods as described above.
The present embodiments also provide a computer-readable storage medium, having stored thereon a computer program,
the computer program is executed by the processor 52 to implement the point cloud data processing method as described above.
The present embodiment also provides a computer program comprising a program code for executing any one of the point cloud data processing methods described above when the computer program is executed by a computer.
Those of ordinary skill in the art will understand that: all or a portion of the steps of implementing the above-described method embodiments may be performed by hardware associated with program instructions. The program may be stored in a computer-readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
Finally, it should be noted that: the above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present application.

Claims (10)

1. A point cloud data processing method is characterized by comprising the following steps:
acquiring point cloud data in a target area;
clustering the point cloud data according to a plurality of anchor boxes with different yaw angles;
extracting data features of the point cloud data in each anchor box;
and generating a feature map according to the extracted data features.
2. The point cloud data processing method of claim 1, wherein the feature map is subjected to feature fusion processing, and the processed feature map is subjected to regression processing and classification processing to generate a vehicle detection result.
3. The point cloud data processing method of claim 2, wherein the feature map is subjected to feature fusion processing according to an Hrnet network structure.
4. The point cloud data processing method of any of claims 1-3, wherein the clustering the point cloud data according to anchor boxes with different yaw angles comprises:
projecting the point cloud data onto an x-y plane of the target area, and grouping the point cloud data projected into the same anchor box into one cluster.
5. The point cloud data processing method of claim 4, wherein the anchor boxes are generated in a manner that includes:
generating an image grid in an x-y plane of the target area, each grid cell generating a predetermined number of the anchor boxes.
6. The point cloud data processing method of claim 5, wherein the center of each grid cell generates 4 anchor boxes with different yaw angles.
7. The point cloud data processing method of any of claims 1-3, wherein the extracting data features of the point cloud data in each of the anchor boxes comprises:
and selecting a predetermined amount of point cloud data in each anchor point frame, and extracting the data characteristics of the selected point cloud data according to a Pointernet network.
8. A point cloud data processing apparatus, comprising:
the acquisition module is used for acquiring point cloud data in a target area;
the clustering module is used for clustering the point cloud data according to a plurality of anchor boxes with different yaw angles;
the extraction module is used for extracting the data features of the point cloud data in each anchor box;
and the generating module is used for generating a feature map according to the extracted data features.
9. A point cloud data processing apparatus, characterized by comprising:
a memory;
a processor; and
a computer program;
wherein the computer program is stored in the memory and configured to be executed by the processor to implement the point cloud data processing method of any of claims 1-7.
10. A computer-readable storage medium, having stored thereon a computer program,
the computer program is executed by a processor to implement the point cloud data processing method of any of claims 1-7.
CN201911221610.3A 2019-12-03 2019-12-03 Point cloud data processing method, device, equipment and computer readable storage medium Pending CN110991534A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911221610.3A CN110991534A (en) 2019-12-03 2019-12-03 Point cloud data processing method, device, equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911221610.3A CN110991534A (en) 2019-12-03 2019-12-03 Point cloud data processing method, device, equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN110991534A true CN110991534A (en) 2020-04-10

Family

ID=70089613

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911221610.3A Pending CN110991534A (en) 2019-12-03 2019-12-03 Point cloud data processing method, device, equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110991534A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111539949A (en) * 2020-05-12 2020-08-14 河北工业大学 Point cloud data-based lithium battery pole piece surface defect detection method
CN111667522A (en) * 2020-06-04 2020-09-15 上海眼控科技股份有限公司 Three-dimensional laser point cloud densification method and equipment
CN111860493A (en) * 2020-06-12 2020-10-30 北京图森智途科技有限公司 Target detection method and device based on point cloud data

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018133851A1 (en) * 2017-01-22 2018-07-26 腾讯科技(深圳)有限公司 Point cloud data processing method and apparatus, and computer storage medium
CN109559300A (en) * 2018-11-19 2019-04-02 上海商汤智能科技有限公司 Image processing method, electronic equipment and computer readable storage medium
CN109948661A (en) * 2019-02-27 2019-06-28 江苏大学 A kind of 3D vehicle checking method based on Multi-sensor Fusion
CN109993103A (en) * 2019-03-29 2019-07-09 华南理工大学 A kind of Human bodys' response method based on point cloud data
CN110472534A (en) * 2019-07-31 2019-11-19 厦门理工学院 3D object detection method, device, equipment and storage medium based on RGB-D data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018133851A1 (en) * 2017-01-22 2018-07-26 腾讯科技(深圳)有限公司 Point cloud data processing method and apparatus, and computer storage medium
CN109559300A (en) * 2018-11-19 2019-04-02 上海商汤智能科技有限公司 Image processing method, electronic equipment and computer readable storage medium
CN109948661A (en) * 2019-02-27 2019-06-28 江苏大学 A kind of 3D vehicle checking method based on Multi-sensor Fusion
CN109993103A (en) * 2019-03-29 2019-07-09 华南理工大学 A kind of Human bodys' response method based on point cloud data
CN110472534A (en) * 2019-07-31 2019-11-19 厦门理工学院 3D object detection method, device, equipment and storage medium based on RGB-D data

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
WALEED ALI ET AL.: "YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud" *
WEN HAO ET AL.: "Slice-based building facade reconstruction from 3D point clouds" *
李艳红 (LI Yanhong): "Research on key technologies of data registration and classification recognition for vehicle-mounted mobile measurement systems" *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111539949A (en) * 2020-05-12 2020-08-14 河北工业大学 Point cloud data-based lithium battery pole piece surface defect detection method
CN111539949B (en) * 2020-05-12 2022-05-13 河北工业大学 Point cloud data-based lithium battery pole piece surface defect detection method
CN111667522A (en) * 2020-06-04 2020-09-15 上海眼控科技股份有限公司 Three-dimensional laser point cloud densification method and equipment
CN111860493A (en) * 2020-06-12 2020-10-30 北京图森智途科技有限公司 Target detection method and device based on point cloud data
CN111860493B (en) * 2020-06-12 2024-02-09 北京图森智途科技有限公司 Target detection method and device based on point cloud data

Similar Documents

Publication Publication Date Title
US11295517B2 (en) Generating realistic point clouds
CN111443359B (en) Positioning method, device and equipment
CN108416266B (en) Method for rapidly identifying video behaviors by extracting moving object through optical flow
CN110991534A (en) Point cloud data processing method, device, equipment and computer readable storage medium
CN112613378B (en) 3D target detection method, system, medium and terminal
CN111144304A (en) Vehicle target detection model generation method, vehicle target detection method and device
JP4567660B2 (en) A method for determining a segment of an object in an electronic image.
CN111476242A (en) Laser point cloud semantic segmentation method and device
CN110930503B (en) Clothing three-dimensional model building method, system, storage medium and electronic equipment
CN113012063B (en) Dynamic point cloud repairing method and device and computer equipment
CN111754618B (en) Object-oriented live-action three-dimensional model multi-level interpretation method and system
CN112418165B (en) Small-size target detection method and device based on improved cascade neural network
CN114648640B (en) Target object monomer method, device, equipment and storage medium
CN113344110B (en) Fuzzy image classification method based on super-resolution reconstruction
CN112154448A (en) Target detection method and device and movable platform
CN111783722B (en) Lane line extraction method of laser point cloud and electronic equipment
CN111783721A (en) Lane line extraction method of laser point cloud and electronic equipment
CN108090455B (en) Cascade mechanism-based parking space line vertex positioning method, system, terminal and medium
CN116030445A (en) Automatic driving real-time three-dimensional target detection method combining point cloud shape characteristics
CN113343987B (en) Text detection processing method and device, electronic equipment and storage medium
Li et al. Primitive fitting using deep geometric segmentation
Liu Robust segmentation of raw point clouds into consistent surfaces
CN116091887A (en) Weak supervision-based target detection model training method and device
JP3944034B2 (en) Partition data creation method and apparatus
CN106909936B (en) Vehicle detection method based on double-vehicle deformable component model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned
AD01 Patent right deemed abandoned

Effective date of abandoning: 20240517