CN115474047A - LiDAR point cloud encoding method and decoding method based on enhanced map correlation - Google Patents
- Publication number
- CN115474047A CN115474047A CN202211112366.9A CN202211112366A CN115474047A CN 115474047 A CN115474047 A CN 115474047A CN 202211112366 A CN202211112366 A CN 202211112366A CN 115474047 A CN115474047 A CN 115474047A
- Authority
- CN
- China
- Prior art keywords
- frame
- point cloud
- map
- frames
- distance map
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/30—Determination of transform parameters for the alignment of images, i.e. image registration
- G06T7/32—Determination of transform parameters for the alignment of images, i.e. image registration using correlation-based methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10028—Range image; Depth image; 3D point clouds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20021—Dividing image into blocks, subimages or windows
Abstract
The invention relates to a LiDAR point cloud encoding method and decoding method based on enhanced map correlation, wherein the encoding method comprises the following steps: acquiring the point cloud to be encoded, taking part of the point cloud frames as I frames and the remaining point cloud frames as P frames; mapping the geometric information of the I and P frames into a 2D distance map and dividing it into blocks of resolution N1 × N2; converting each block of the I frame and the P frame into a three-dimensional point cloud, and registering each block of the P frame against the corresponding block of the I frame using the ICP (iterative closest point) algorithm to obtain motion information H; converting each blocked three-dimensional point cloud of the P' frame into a 2D distance map; mapping the attribute information of the I frame and the P frame into a 2D attribute map; encoding the 2D distance maps and the 2D attribute map into a video stream, and encoding the video stream into a binary code stream; and quantizing the motion information H into a 2D distance map and compressing it into a binary code stream. The invention reduces spatial and temporal redundancy, and the exploration and research of an efficient point cloud compression scheme has important strategic significance for the development of related applications such as automatic driving and mobile robots.
Description
Technical Field
The invention relates to a LiDAR point cloud encoding method and a LiDAR point cloud decoding method based on enhanced map correlation, and belongs to the technical field of point cloud processing.
Background
The development of three-dimensional sensor data acquisition creates new opportunities and challenges in industries such as virtual reality, smart cities, intelligent robots, and automatic driving. Automatic driving systems are widely applied in fields such as reducing the workload of road traffic and improving the natural environment of travel, and using laser radar to identify the surrounding environment and the state of the vehicle is one of the foundations of automatic driving.
Because three-dimensional point cloud acquisition has the advantages of high efficiency, high precision, high resolution, and digital acquisition, many automatic driving systems, including those of Google and Uber, have been developed and applied in practice on the basis of 3D point cloud data acquired by vehicle-mounted laser radar. The point cloud model represents a visual scene by a series of three-dimensional points, describing topological information of the object surface and the associated attributes of the points. However, the non-uniform sampling and disordered spatial distribution of the point cloud make point cloud compression and processing quite challenging; for example, the 64-line radar sensor produced by Velodyne collects more than one hundred thousand points per rotation, and under normal settings the sensor produces more than 50 GB of data in half an hour. The amount of three-dimensional data generated by point cloud acquisition devices therefore poses a serious challenge to the storage and transmission of data. In recent years, three-dimensional point cloud data compression has been researched in increasing depth, and the inventors found that three-dimensional point clouds have the problems of a wide distribution range and inconsistent density.
Disclosure of Invention
In order to overcome these problems, the invention provides a LiDAR point cloud encoding method and decoding method based on enhanced map correlation. The encoding method reduces spatial and temporal redundancy, and the exploration and research of an efficient point cloud compression scheme has important strategic significance for the development of related applications such as automatic driving and mobile robots.
The technical scheme of the invention is as follows:
first aspect of the invention
A LiDAR point cloud coding method based on enhanced map correlation comprises the following steps:
acquiring a point cloud to be encoded, taking a part of point cloud frames as I frames, and taking the rest point cloud frames as P frames;
mapping the geometric information of the I frame and the P frame into a 2D distance map, and dividing the 2D distance map into blocks of resolution N1 × N2, where N1 is the number of lines of the laser radar; converting the blocks of the I frame and the P frame into three-dimensional point clouds, and registering each block three-dimensional point cloud of the P frame against the corresponding block three-dimensional point cloud of the I frame by using the ICP (iterative closest point) algorithm, to obtain the block three-dimensional point clouds of the registered P' frame and the motion information H of the registration of each block three-dimensional point cloud of the P frame onto the corresponding block three-dimensional point cloud of the I frame; the motion information H comprises a rotation matrix and a translation matrix;
Converting the partitioned three-dimensional point clouds of the P' frame into a 2D distance map;
mapping the attribute information of the I frame and the P frame into a 2D attribute graph;
respectively encoding the 2D distance map of the I frame, the 2D distance map of the P' frame and the 2D attribute map into a video stream, and encoding the video stream into a binary code stream through HEVC;
the motion information H is quantized into a 2D distance map and compressed into a binary code stream by entropy coding.
Further, N2 is equal to N1.
Further, the geometric information is mapped into a 2D distance map, specifically:
calculating, from the three-dimensional point cloud data, the yaw angle ω_i of point i, the pitch angle θ_i, and the distance r_i of point i from the center, as follows:
ω_i = arctan(y_i / x_i);
θ_i = arctan(z_i / √(x_i² + y_i²));
r_i = √(x_i² + y_i² + z_i²);
wherein x_i is the x-axis coordinate value of point i in Cartesian coordinates, y_i is the y-axis coordinate value of point i, and z_i is the z-axis coordinate value of point i;
mapping the point (r_i, θ_i, ω_i) to the pixel P(px_i, py_i) with the formula:
px_i = x_quan(ω_i), py_i = y_quan(θ_i), P(px_i, py_i) = round(r_i);
wherein x_quan and y_quan respectively quantize the yaw angle ω_i and the pitch angle θ_i into integers;
obtaining the 2D distance map according to the pixels P(px_i, py_i).
Further, the geometric information is mapped into a 2D distance map of 16-bit depth values.
Further, the ICP algorithm is an ICP improvement algorithm, and the ICP improvement algorithm specifically comprises:
estimating the motion information between point cloud frames according to the ICP improvement algorithm based on a translation model, with the formula:
f(R, t) = (1/n) Σ_{i=1}^{n} || q_i - (R·p_i + t) ||²;
f_min(R, t) = min_{R,t} f(R, t);
wherein f(R, t) is the error function value of the unregistered point clouds, f_min(R, t) is the error function of the motion information after registration, n is the total number of points in the source point set, p_i is the ICP source point set, t is the translation matrix, q_i is the ICP target point set, and R is the rotation matrix.
Further, the ICP improvement algorithm further comprises:
after the ICP improvement algorithm has iterated N times, the rotation matrix is set equal to the identity matrix, i.e., R = I;
the translation matrix is then solved, and the ICP algorithm is applied using the newly obtained translation matrix and the identity matrix.
Further, the I-frame 2D distance map, the P' frame 2D distance map, and the 2D attribute map are respectively encoded into a video stream, and the video stream is encoded into a binary code stream by HEVC, specifically:
and coding the I frame 2D distance map, the P' frame 2D distance map and the 2D attribute map into a video stream, and compressing the video stream by using a video coding method to obtain a binary code stream.
Further, quantizing the motion information H into a 2D distance map specifically includes:
the motion information H is quantized in a uniform (min-max) quantization manner, with the formula:
P_m(i, j) = round((H(i, j) - H_min) / (H_max - H_min) × Q_max);
wherein P_m(i, j) is the quantized pixel, H(i, j) is the value of the rigid transformation matrix H at row i and column j, H_min is the minimum value in the rigid transformation matrix H, H_max is the maximum value in the rigid transformation matrix H, and Q_max is the maximum quantized pixel value.
Further, the compressing into a binary code stream by entropy coding specifically includes:
performing 4-level 5/3 wavelet transform on the 2D distance map, and calculating the optimal quantization step size of each sub-band;
removing the correlation redundancy between pixels of the lowest-frequency sub-band data using the JPEG-LS algorithm, and coding with zero-run-length coding to obtain a zero-run-length coding result;
performing context modeling on the zero-run-length coding result, updating the order of the adaptive exponential Golomb code, and obtaining the binary code stream using the exponential Golomb code of the corresponding order.
Second aspect of the invention
A decoding method based on enhanced mapping map correlation LiDAR point cloud coding is used for decoding and recovering point clouds coded by the enhanced mapping map correlation LiDAR point cloud coding method in the first aspect into original point clouds, and comprises the following steps:
decoding motion information, namely performing entropy decoding on the binary code stream obtained by entropy coding to obtain a 2D distance map of the P 'frame, and converting registration information in the 2D distance map of the P' frame into motion information H;
Decoding geometric information, namely performing HEVC decoding on a binary code stream obtained by HEVC coding to obtain a 2D distance map of an I frame and a 2D distance map of a P 'frame containing the geometric information, converting the 2D distance maps of the I frame and the P' frame into three-dimensional point clouds of the I frame and the P 'frame, and restoring the three-dimensional point clouds of the P' frame by using decoded motion information H to obtain the geometric information of the P frame;
decoding attribute information, namely performing HEVC decoding on a binary code stream obtained by HEVC coding to obtain a 2D attribute graph containing the attribute information, and converting the 2D attribute graphs of I frames and P frames into the attribute information of the I frames and P' frames;
and recovering the three-dimensional point clouds of the I frame and the P frame according to the 2D distance map, the 2D attribute map, the geometric information and the attribute information.
The invention has the following beneficial effects:
The invention adopts a method of mapping the point cloud data to a distance map and an attribute map; to improve the temporal correlation of the distance maps of the point cloud, a point cloud blocking and registration method is proposed; to improve the accuracy of registration, a translation model is proposed to improve the ICP registration method; the registered point cloud is converted into a distance map again, the distance maps and the attribute maps form a video sequence coded with HEVC, and the motion information required for registration is mapped into a motion map and compressed with entropy coding.
Drawings
FIG. 1 is a flow chart of the encoding method of the present invention.
Fig. 2 is a flow chart of the ICP improvement algorithm of the present invention.
FIG. 3 is a flow chart of the adaptive exponential Golomb coding algorithm of the present invention.
FIG. 4 is a flowchart of a decoding method according to the present invention.
Detailed Description
The invention is described in detail below with reference to the figures and the specific embodiments.
First aspect
Example one
Referring to FIG. 1, a LiDAR point cloud encoding method based on enhanced map correlation includes the following steps:
acquiring a point cloud to be encoded, taking a part of point cloud frames as I frames, and taking the rest point cloud frames as P frames;
mapping the geometric information of the I frame and the P frame into a 2D distance map, and dividing the 2D distance map into blocks of resolution N1 × N2, where N1 is the number of lines of the laser radar; converting the blocks of the I frame and the P frame into three-dimensional point clouds, and registering each block three-dimensional point cloud of the P frame against the corresponding block three-dimensional point cloud of the I frame by using the ICP (iterative closest point) algorithm, to obtain the block three-dimensional point clouds of the registered P' frame and the motion information H of the registration of each block three-dimensional point cloud of the P frame onto the corresponding block three-dimensional point cloud of the I frame; the motion information comprises a rotation matrix and a translation matrix;
converting the partitioned three-dimensional point clouds of the P' frame into a 2D distance map;
Mapping the attribute information of the I frame and the P frame into a 2D attribute graph;
respectively encoding the 2D distance map of the I frame, the 2D distance map of the P' frame and the 2D attribute map into video streams, and encoding the video streams into binary code streams through HEVC;
and quantizing the motion information H into a 2D distance map, and compressing the distance map into a binary code stream through entropy coding.
The invention designates some of the point cloud frames as reference frames, also called I frames (intra-coded frames), which are key frames, and uses them to predict the other, non-key point cloud frames, called P frames (inter-predicted frames).
In one embodiment of the invention, in order to keep the number of points used for point cloud registration at a reasonable and stable value, N2 is, based on experimental experience, chosen equal to N1.
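The blocking step can be sketched as follows; the 64-line sensor size, the use of NumPy, and the truncation of edges not divisible by the block size are illustrative assumptions, not details fixed by the invention:

```python
import numpy as np

def split_into_blocks(range_map, n1, n2):
    """Split a 2D range map (H x W) into blocks of resolution n1 x n2.

    n1 is assumed to equal the number of LiDAR scan lines, so each block
    spans the full vertical extent; edges not divisible by the block size
    are truncated (an assumption, not specified in the patent).
    """
    h, w = range_map.shape
    blocks = []
    for r in range(0, h - n1 + 1, n1):
        for c in range(0, w - n2 + 1, n2):
            blocks.append(range_map[r:r + n1, c:c + n2])
    return blocks

# A 64-line scan unrolled to 64 x 2048 pixels splits into 32 blocks of 64 x 64.
blocks = split_into_blocks(np.zeros((64, 2048)), 64, 64)
```

With N1 = N2 = 64 every block is square and contains the full vertical sweep, which keeps the per-block point count stable as suggested above.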
Example two
On the basis of the first embodiment, the geometric information is mapped into a 2D distance map, specifically:
calculating, from the three-dimensional point cloud data, the yaw angle ω_i of point i, the pitch angle θ_i, and the distance r_i of point i from the center, as follows:
ω_i = arctan(y_i / x_i);
θ_i = arctan(z_i / √(x_i² + y_i²));
r_i = √(x_i² + y_i² + z_i²);
wherein x_i is the x-axis coordinate value of point i in Cartesian coordinates, y_i is the y-axis coordinate value of point i, and z_i is the z-axis coordinate value of point i;
mapping the point (r_i, θ_i, ω_i) to the pixel P(px_i, py_i) with the formula:
px_i = x_quan(ω_i), py_i = y_quan(θ_i), P(px_i, py_i) = round(r_i);
wherein x_quan and y_quan respectively quantize the yaw angle ω_i and the pitch angle θ_i into integers;
obtaining the 2D distance map according to the pixels P(px_i, py_i).
The pitch angle is determined by the inherent resolution of the vehicle-mounted laser radar, for example a 16-line, 32-line, or 64-line laser radar, and the yaw angle generally covers 0 to 360 degrees; the effective sampling distance of current laser radars is generally no more than 130 meters. Because the original data are obtained by measuring the radial distance from the center of the laser radar to an object, an index with the pitch angle as the row and the yaw angle as the column is obtained, and the distance value corresponding to a given pitch angle and yaw angle is the pixel value of a grayscale image. Compressing the originally disordered, scattered point cloud is thus converted into compressing a distance image (grayscale image) with much stronger correlation.
In one embodiment of the invention, the geometric information is mapped into a 2D distance map of 16-bit depth values.
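The mapping described in this embodiment can be sketched as follows; the 64 × 2048 resolution, the vertical field of view, the concrete x_quan/y_quan quantizers, and the 130-meter maximum range are illustrative assumptions consistent with the text, not values fixed by the invention:

```python
import numpy as np

def point_cloud_to_range_map(points, n_rows=64, n_cols=2048,
                             fov_up=2.0, fov_down=-24.8, max_range=130.0):
    """Project an (N, 3) Cartesian point cloud to a 16-bit 2D range map.

    n_rows/n_cols and the vertical field of view are illustrative values
    for a 64-line sensor; the patent itself does not fix them.
    """
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    r = np.sqrt(x ** 2 + y ** 2 + z ** 2)          # distance to sensor center
    yaw = np.arctan2(y, x)                         # omega_i in [-pi, pi]
    pitch = np.arcsin(np.clip(z / np.maximum(r, 1e-9), -1, 1))  # theta_i

    # x_quan / y_quan: quantize the angles to integer pixel indices.
    cols = ((yaw / np.pi + 1.0) / 2.0 * n_cols).astype(int) % n_cols
    fov = np.radians(fov_up) - np.radians(fov_down)
    rows = (np.radians(fov_up) - pitch) / fov * (n_rows - 1)
    rows = np.clip(np.round(rows), 0, n_rows - 1).astype(int)

    # 16-bit depth values: scale the range into [0, 65535].
    range_map = np.zeros((n_rows, n_cols), dtype=np.uint16)
    range_map[rows, cols] = np.round(
        np.clip(r, 0, max_range) / max_range * 65535).astype(np.uint16)
    return range_map

pts = np.array([[10.0, 0.0, 0.0]])
rm = point_cloud_to_range_map(pts)
```

A point 10 meters straight ahead lands in the middle column at pixel value round(10 / 130 × 65535), illustrating how the radial distance becomes the grayscale value.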
EXAMPLE III
On the basis of the first embodiment, the ICP algorithm is an ICP improvement algorithm, and the ICP improvement algorithm specifically comprises the following steps:
estimating the motion information between point cloud frames according to the ICP improvement algorithm based on a translation model, with the formula:
f(R, t) = (1/n) Σ_{i=1}^{n} || q_i - (R·p_i + t) ||²;
f_min(R, t) = min_{R,t} f(R, t);
wherein f(R, t) is the error function value of the unregistered point clouds, f_min(R, t) is the error function of the motion information after registration, n is the total number of points in the source point set, p_i is the ICP source point set, t is the translation matrix, q_i is the ICP target point set, and R is the rotation matrix.
The ICP algorithm is a rigid transformation algorithm widely used in point cloud registration, and the present invention estimates the frame-to-frame motion information of the point cloud by using an ICP improvement algorithm based on a translation model.
Partitioning the point cloud and registering the corresponding blocks can further improve registration accuracy and reduce computation time. Because the laser radar point cloud is sparse and the acquisition unit rotates one full turn perpendicular to the vehicle, octree partitioning would leave a large number of macroblocks without data, and encoding and indexing those macroblocks for the point cloud geometric data of every frame would waste a large amount of coded data and time.
Referring to fig. 2, in one embodiment of the present invention, the ICP improvement algorithm further comprises:
after the ICP improvement algorithm has iterated N times, the rotation matrix is set equal to the identity matrix, i.e., R = I;
the translation matrix is then solved, and the ICP algorithm is applied using the newly obtained translation matrix and the identity matrix.
This follows from the characteristics of the data collected by the vehicle-mounted laser radar: the acquisition interval between frames is short and the road surface is smooth, so the inter-frame motion consists mainly of a large translation vector and a small rotation vector. The method therefore both stays close to the true value and improves coding efficiency.
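Under the translation-model assumption, the registration step can be sketched as follows; the brute-force nearest-neighbour search and the fixed iteration count are implementation choices for illustration, not part of the invention:

```python
import numpy as np

def translation_icp(source, target, n_iters=10):
    """Translation-model ICP sketch: the rotation is fixed to the identity
    and only a translation vector t is estimated, per the observation that
    inter-frame motion is mostly translational.

    source, target: (N, 3) and (M, 3) arrays. Nearest neighbours are found
    by brute force here for clarity (an implementation choice, not part of
    the patent).
    """
    t = np.zeros(3)
    for _ in range(n_iters):
        moved = source + t
        # For each moved source point, find its closest target point.
        d = np.linalg.norm(moved[:, None, :] - target[None, :, :], axis=2)
        nn = target[np.argmin(d, axis=1)]
        # With R = I, the least-squares optimum is the mean residual.
        t = t + (nn - moved).mean(axis=0)
    R = np.eye(3)  # rotation matrix fixed to the identity
    return R, t

# Points on a sparse grid shifted by (1, 2, 0): the shift is recovered
# because the grid spacing keeps the nearest-neighbour matches correct.
src = np.array([[i * 10.0, j * 10.0, 0.0] for i in range(5) for j in range(5)])
R, t = translation_icp(src, src + np.array([1.0, 2.0, 0.0]))
```

Fixing R to the identity reduces each iteration to averaging the residuals, which is the efficiency gain the translation model is claimed to bring.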
Example four
On the basis of the first embodiment, the I-frame 2D distance map, the P' frame 2D distance map, and the 2D attribute map are respectively encoded into a video stream, and the video stream is encoded into a binary code stream by HEVC, specifically:
and coding the I frame 2D distance map, the P' frame 2D distance map and the 2D attribute map into a video stream, and compressing the video stream by using a video coding method to obtain a binary code stream.
EXAMPLE five
A LiDAR point cloud coding method based on enhanced map correlation is disclosed, and on the basis of the first embodiment, the quantization of the motion information H into a 2D distance map specifically comprises the following steps:
the motion information H is quantized in a uniform (min-max) quantization manner, with the formula:
P_m(i, j) = round((H(i, j) - H_min) / (H_max - H_min) × Q_max);
wherein P_m(i, j) is the quantized pixel, H(i, j) is the value of the rigid transformation matrix H at row i and column j, H_min is the minimum value in the rigid transformation matrix H, H_max is the maximum value in the rigid transformation matrix H, and Q_max is the maximum quantized pixel value.
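The uniform quantization of the motion information H can be sketched as follows; the 16-bit depth mirrors the 16-bit distance maps used elsewhere in the description but is an assumption for the motion map itself:

```python
import numpy as np

def quantize_motion(H, bit_depth=16):
    """Uniformly quantize a rigid-transformation matrix H into integer
    pixels, min-max style. The 16-bit depth is an assumption mirroring
    the 16-bit distance maps, not a value fixed by the patent."""
    h_min, h_max = H.min(), H.max()
    q_max = 2 ** bit_depth - 1
    P = np.round((H - h_min) / (h_max - h_min) * q_max).astype(np.uint16)
    return P, h_min, h_max  # min/max must be transmitted for dequantization

def dequantize_motion(P, h_min, h_max, bit_depth=16):
    return P.astype(np.float64) / (2 ** bit_depth - 1) * (h_max - h_min) + h_min

# A 4x4 rigid transform (identity rotation plus a translation column).
H = np.array([[1.0, 0.0, 0.0, 2.5],
              [0.0, 1.0, 0.0, -0.3],
              [0.0, 0.0, 1.0, 0.1],
              [0.0, 0.0, 0.0, 1.0]])
P, h_lo, h_hi = quantize_motion(H)
H_rec = dequantize_motion(P, h_lo, h_hi)
```

The quantization error is bounded by half a step, (H_max - H_min) / (2 × Q_max), so a 16-bit map keeps the recovered transform very close to the original.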
EXAMPLE six
Referring to fig. 3, a LiDAR point cloud encoding method based on enhanced map correlation is, on the basis of the first embodiment, compressed into a binary code stream by entropy encoding, specifically:
performing 4-level 5/3 wavelet transform on the 2D distance map, and calculating the optimal quantization step size of each sub-band;
removing the correlation redundancy between pixels of the lowest-frequency sub-band data using the JPEG-LS algorithm, and coding with zero-run-length coding to obtain a zero-run-length coding result;
performing context modeling on the zero-run-length coding result, updating the order of the adaptive exponential Golomb code, and obtaining the binary code stream using the exponential Golomb code of the corresponding order.
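The exponential Golomb step can be sketched as follows; this shows only the order-k code itself, while the context modeling that adaptively updates the order is omitted:

```python
def exp_golomb_encode(n, k=0):
    """Order-k exponential Golomb code for a non-negative integer n,
    returned as a bit string. A minimal sketch of the entropy-coding
    step; the adaptive order selection is not shown."""
    value = n + (1 << k)            # shift n into the order-k code range
    bits = value.bit_length()
    prefix = '0' * (bits - 1 - k)   # unary prefix of leading zeros
    return prefix + format(value, 'b')

def exp_golomb_decode(bitstring, k=0):
    """Invert exp_golomb_encode for a single code word."""
    zeros = len(bitstring) - len(bitstring.lstrip('0'))
    value = int(bitstring[zeros:], 2)
    return value - (1 << k)
```

For order 0 this yields the familiar codes 0 → "1", 1 → "010", 4 → "00101"; raising the order shortens the codes for larger values, which is what the context-driven order update exploits.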
Second aspect of the invention
EXAMPLE seven
Referring to fig. 4, a method for decoding LiDAR point cloud encoding based on enhanced map correlation, which decodes the point cloud encoded by the LiDAR point cloud encoding based on enhanced map correlation according to the first aspect to recover the original point cloud, and includes the following steps:
decoding motion information, namely performing entropy decoding on the binary code stream obtained by entropy coding to obtain a 2D distance graph of the P 'frame, and converting registration information in the 2D distance graph of the P' frame into motion information H;
Decoding geometric information, namely performing HEVC decoding on binary code stream obtained by HEVC coding to obtain a 2D distance map of an I frame and a 2D distance map of a P 'frame containing the geometric information, converting the 2D distance maps of the I frame and the P' frame into three-dimensional point clouds of the I frame and the P 'frame, and restoring the three-dimensional point clouds of the P' frame by using decoded motion information H to obtain the geometric information of the P frame;
decoding attribute information, namely performing HEVC decoding on a binary code stream obtained by HEVC coding to obtain a 2D attribute graph containing the attribute information, and converting the 2D attribute graphs of I frames and P frames into the attribute information of the I frames and P' frames;
and recovering the three-dimensional point clouds of the I frame and the P frame according to the 2D distance map, the 2D attribute map, the geometric information and the attribute information.
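The decoder-side recovery of geometry from a 2D distance map can be sketched as follows; the field of view, resolution, and 130-meter maximum range mirror the encoder-side assumptions and are illustrative, not fixed by the invention:

```python
import numpy as np

def range_map_to_point_cloud(range_map, fov_up=2.0, fov_down=-24.8,
                             max_range=130.0):
    """Inverse of the encoder-side mapping: convert a 16-bit 2D range map
    back to Cartesian points. Only occupied (non-zero) pixels are kept."""
    n_rows, n_cols = range_map.shape
    rows, cols = np.nonzero(range_map)
    r = range_map[rows, cols].astype(np.float64) / 65535.0 * max_range
    yaw = (cols / n_cols * 2.0 - 1.0) * np.pi         # invert column quantization
    fov = np.radians(fov_up) - np.radians(fov_down)
    pitch = np.radians(fov_up) - rows / (n_rows - 1) * fov
    x = r * np.cos(pitch) * np.cos(yaw)
    y = r * np.cos(pitch) * np.sin(yaw)
    z = r * np.sin(pitch)
    return np.stack([x, y, z], axis=1)

# A single occupied pixel decodes back to roughly (10, 0, 0), up to the
# angular and 16-bit range quantization error.
rm = np.zeros((64, 2048), dtype=np.uint16)
rm[5, 1024] = 5041
pts = range_map_to_point_cloud(rm)
```

For P' frames the decoded motion information H would additionally be inverted and applied to these points to recover the P-frame geometry.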
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent structures made by using the contents of the specification and the drawings of the present invention or directly or indirectly applied to other related technical fields are included in the scope of the present invention.
Claims (10)
1. A LiDAR point cloud coding method based on enhanced map correlation is characterized by comprising the following steps:
acquiring a point cloud to be encoded, taking a part of point cloud frames as I frames, and taking the rest point cloud frames as P frames;
Mapping the geometric information of the I frame and the P frame into a 2D distance map, and dividing the 2D distance map into blocks of resolution N1 × N2, where N1 is the number of lines of the laser radar; converting the blocks of the I frame and the P frame into three-dimensional point clouds, and registering each block three-dimensional point cloud of the P frame against the corresponding block three-dimensional point cloud of the I frame by using the ICP (iterative closest point) algorithm, to obtain each block point cloud of the registered P' frame and the motion information H of the registration of each block three-dimensional point cloud of the P frame onto the corresponding block three-dimensional point cloud of the I frame; the motion information H comprises a rotation matrix and a translation matrix;
converting the partitioned three-dimensional point clouds of the P' frame into a 2D distance map;
mapping the attribute information of the I frame and the P frame into a 2D attribute graph;
respectively encoding the 2D distance map of the I frame, the 2D distance map of the P' frame and the 2D attribute map into a video stream, and encoding the video stream into a binary code stream through HEVC;
the motion information H is quantized into a 2D distance map and compressed into a binary code stream by entropy coding.
2. The LiDAR point cloud encoding method based on enhanced map correlation of claim 1, wherein N2 is equal to N1.
3. The LiDAR point cloud encoding method based on enhanced map correlation according to claim 1, wherein the geometric information is mapped as a 2D distance map, specifically:
Calculating the yaw angle omega of the point i according to the three-dimensional point cloud data i Pitch angle θ i Distance r of point i from the center i The publication is as follows:
wherein x is i Is the left value of the x axis of the point i Cartesian coordinate, y i Is the left value of the point i Cartesian coordinate y axis; z is a radical of formula i Is the value of the left side of the z axis of the point i Cartesian coordinate;
will point (r) i ,θ i ,ω i ) Mapping to pixel P (px) i ,py i ) The formula is as follows:
P(px i ,py i )=round(r i );
wherein x _ quan and y _ quan are the yaw angle ω respectively i And the pitch angle theta i Quantized into integers;
according to the pixel P (px) i ,py i ) A 2D distance map is obtained.
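The projection of claim 3 can be sketched as below. The uniform angular quantizers and the sensor field-of-view values (`fov_up_deg`, `fov_down_deg`) are assumptions for illustration; the claim states only that x_quan and y_quan quantize the angles into integers, and that the pixel value is round(r_i).

```python
import numpy as np

def project_to_range_image(points, h=64, w=2048,
                           fov_up_deg=2.0, fov_down_deg=-24.8):
    """Spherical projection of an (N, 3) point cloud to a 2D distance
    map as in claim 3: pixel column from the yaw angle, pixel row from
    the pitch angle, pixel value round(r_i). FOV values are assumed."""
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    r = np.sqrt(x * x + y * y + z * z)                              # r_i
    yaw = np.arctan2(y, x)                                          # omega_i
    pitch = np.arcsin(np.clip(z / np.maximum(r, 1e-9), -1.0, 1.0))  # theta_i
    px = np.clip((w * (yaw / (2 * np.pi) + 0.5)).astype(int), 0, w - 1)
    fu, fd = np.deg2rad(fov_up_deg), np.deg2rad(fov_down_deg)
    py = np.clip(((fu - pitch) / (fu - fd) * h).astype(int), 0, h - 1)
    img = np.zeros((h, w))
    img[py, px] = np.round(r)
    return img
```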
4. The enhanced map correlation-based LiDAR point cloud encoding method of claim 3, wherein the geometric information is mapped to a 2D distance map of 16-bit depth values.
5. The LiDAR point cloud encoding method based on enhanced map correlation according to claim 1, wherein the ICP algorithm is an improved ICP algorithm, the improved ICP algorithm being specifically:
estimating the motion information between point cloud frames with the improved ICP algorithm based on a translation model; the formula is as follows:

f(R, t) = (1/n) · Σ_{i=1..n} ‖ q_i − (R·p_i + t) ‖²
f_min(R, t) = min_{R, t} f(R, t)

wherein f(R, t) is the error function value of the unregistered point clouds, f_min(R, t) is the error function value of the motion information after registration, n is the total number of points in the source point set, p_i is the ICP source point set, t is the translation matrix, q_i is the ICP target point set, and R is the rotation matrix.
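The error function f(R, t) of claim 5 can be evaluated as below. This sketch assumes `src[i]` already corresponds to `dst[i]`; a full ICP loop would re-estimate correspondences at every iteration.

```python
import numpy as np

def icp_error(R, t, src, dst):
    """Evaluate the registration error f(R, t) of claim 5 for given
    point correspondences: the mean squared distance between the
    transformed source points R*p_i + t and the target points q_i."""
    transformed = src @ R.T + t
    return np.mean(np.sum((dst - transformed) ** 2, axis=1))
```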
6. The enhanced map correlation-based LiDAR point cloud encoding method of claim 5, wherein the improved ICP algorithm further comprises:
after the improved ICP algorithm has iterated N times, setting the rotation matrix equal to the identity matrix, i.e., R = R0;
solving for the translation matrix, and applying the ICP algorithm to the newly obtained translation matrix and the identity matrix.
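One translation-only refinement step in the spirit of claim 6 can be sketched as follows: with the rotation fixed to the identity, the least-squares optimal translation for the current nearest-neighbour correspondences is the mean displacement. The brute-force neighbour search is an illustrative choice, not the patent's procedure.

```python
import numpy as np

def translation_icp_step(src, dst):
    """One translation-only ICP step: rotation fixed to the identity
    (claim 6), translation solved in closed form as the mean
    displacement over nearest-neighbour correspondences."""
    d = np.linalg.norm(src[:, None, :] - dst[None, :, :], axis=2)
    nn = dst[np.argmin(d, axis=1)]   # nearest neighbour of each source point
    t = (nn - src).mean(axis=0)      # closed-form translation update
    return src + t, t
```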
7. The enhanced map correlation-based LiDAR point cloud encoding method of claim 1, wherein the I-frame 2D distance map, the P' frame 2D distance map, and the 2D attribute map are encoded into video streams, respectively, and the video streams are encoded into binary code streams by HEVC, specifically:
encoding the 2D distance map of the I frame, the 2D distance map of the P' frame and the 2D attribute map into a video stream, and compressing the video stream with a video encoding method to obtain a binary code stream.
8. The LiDAR point cloud encoding method based on enhanced map correlation according to claim 1, wherein quantizing the motion information H into a 2D distance map specifically is:
quantizing the motion information H in a uniform quantization manner; the formula is as follows:

P_m(i, j) = round( (H(i, j) − H_min) / (H_max − H_min) · (2^b − 1) )

wherein P_m(i, j) is the quantized pixel, H(i, j) is the value of the rigid transformation matrix H at row i and column j, H_min is the minimum value in the rigid transformation matrix H, H_max is the maximum value in the rigid transformation matrix H, and b is the bit depth of the 2D distance map.
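The uniform quantization of claim 8 and its decoder-side inverse can be sketched as below. The 8-bit depth is an assumption for illustration; the claim states only that the quantization is uniform over [H_min, H_max].

```python
import numpy as np

def quantize_motion(H, bits=8):
    """Uniform quantization of the rigid transformation matrix H over
    [H_min, H_max] as in claim 8. Bit depth is an assumption."""
    h_min, h_max = H.min(), H.max()
    scale = (2 ** bits - 1) / max(h_max - h_min, 1e-12)
    return np.round((H - h_min) * scale).astype(np.uint16), h_min, h_max

def dequantize_motion(P, h_min, h_max, bits=8):
    """Inverse mapping used on the decoder side."""
    return P * (max(h_max - h_min, 1e-12) / (2 ** bits - 1)) + h_min
```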
9. The enhanced map correlation-based LiDAR point cloud encoding method of claim 7, wherein compressing into a binary code stream by entropy coding specifically comprises:
performing 4-level 5/3 wavelet transform on the 2D distance map, and calculating the optimal quantization step size of each sub-band;
removing the correlated redundancy among pixels of the lowest-frequency sub-band data by means of a JPEG-LS algorithm, and applying zero run-length coding to obtain a zero run-length coding result;
performing context modeling on the zero run-length coding result, updating the order of the Golomb coding, and using Golomb coding of the corresponding order to obtain a binary code stream.
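The zero run-length coding stage of claim 9 can be sketched as below: each nonzero symbol is emitted with the length of the zero run preceding it, and a trailing zero run is flushed with a sentinel value. This covers only the run coding; the context modeling and Golomb coding stages are not included.

```python
def zero_run_encode(seq):
    """Zero run-length coding as used in claim 9: emit (run, value)
    pairs, where run is the count of zeros before each nonzero value.
    A trailing zero run is flushed with sentinel value 0."""
    out, run = [], 0
    for v in seq:
        if v == 0:
            run += 1
        else:
            out.append((run, v))
            run = 0
    if run:
        out.append((run, 0))
    return out
```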
10. A decoding method based on the enhanced map correlation LiDAR point cloud encoding, which decodes a point cloud encoded by the enhanced map correlation-based LiDAR point cloud encoding method according to any one of claims 1 to 8 to recover the original point cloud, comprising the following steps:
decoding the motion information: performing entropy decoding on the binary code stream obtained by entropy coding to obtain the 2D distance map carrying the registration information, and converting the registration information in the 2D distance map into the motion information H;
decoding the geometric information: performing HEVC decoding on the binary code stream obtained by HEVC encoding to obtain the 2D distance map of the I frame and the 2D distance map of the P' frame containing the geometric information; converting the 2D distance maps of the I frame and the P' frame into the three-dimensional point clouds of the I frame and the P' frame, and restoring the three-dimensional point cloud of the P' frame with the decoded motion information H to obtain the geometric information of the P frame;
decoding the attribute information: performing HEVC decoding on the binary code stream obtained by HEVC encoding to obtain the 2D attribute maps containing the attribute information, and converting the 2D attribute maps of the I frame and the P frame into the attribute information of the I frame and the P frame;
recovering the three-dimensional point clouds of the I frame and the P frame according to the 2D distance maps, the 2D attribute maps, the geometric information and the attribute information.
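The inverse spherical projection needed in the decoding steps of claim 10 can be sketched as below. Nonzero pixels are treated as valid returns; the sensor field-of-view values are assumptions matching the forward projection, not values stated in the patent.

```python
import numpy as np

def range_image_to_points(img, fov_up_deg=2.0, fov_down_deg=-24.8):
    """Invert the spherical projection: recover 3D points from a
    decoded 2D distance map (claim 10). FOV values are assumed."""
    h, w = img.shape
    py, px = np.nonzero(img)
    r = img[py, px]
    yaw = (px / w - 0.5) * 2.0 * np.pi
    fu, fd = np.deg2rad(fov_up_deg), np.deg2rad(fov_down_deg)
    pitch = fu - (py + 0.5) / h * (fu - fd)   # pixel-centre pitch
    return np.stack([r * np.cos(pitch) * np.cos(yaw),
                     r * np.cos(pitch) * np.sin(yaw),
                     r * np.sin(pitch)], axis=1)
```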
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211112366.9A CN115474047A (en) | 2022-09-13 | 2022-09-13 | LiDAR point cloud encoding method and decoding method based on enhanced map correlation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115474047A true CN115474047A (en) | 2022-12-13 |
Family
ID=84332785
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110688905A (en) * | 2019-08-30 | 2020-01-14 | 中山大学 | Three-dimensional object detection and tracking method based on key frame |
CN110832856A (en) * | 2017-11-30 | 2020-02-21 | 深圳市大疆创新科技有限公司 | System and method for reducing video coding fluctuations |
CN112334949A (en) * | 2018-10-09 | 2021-02-05 | 松下电器(美国)知识产权公司 | Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device |
CN113382242A (en) * | 2021-05-20 | 2021-09-10 | 北京大学深圳研究生院 | Code rate control method, device, equipment and computer readable storage medium |
CN113557741A (en) * | 2019-03-08 | 2021-10-26 | Vid拓展公司 | Method and apparatus for adaptive streaming of point clouds |
CN113573068A (en) * | 2021-07-28 | 2021-10-29 | 福州大学 | Improved V-PCC (V-PCC) inter-frame prediction method and system based on registration |
US20210407149A1 (en) * | 2019-03-25 | 2021-12-30 | Panasonic Intellectual Property Corporation Of America | Three-dimensional data encoding method, three-dimensional data decoding method, three-dimensional data encoding device, and three-dimensional data decoding device |
WO2022042538A1 (en) * | 2020-08-24 | 2022-03-03 | 北京大学深圳研究生院 | Block-based point cloud geometric inter-frame prediction method and decoding method |
CN114332259A (en) * | 2021-12-29 | 2022-04-12 | 福州大学 | Point cloud coding and decoding method based on vehicle-mounted laser radar |
CN114663600A (en) * | 2022-04-18 | 2022-06-24 | 南京大学 | Point cloud reconstruction method and system based on self-encoder |
Non-Patent Citations (6)
Title |
---|
"Information Optics: Optical Information Processing and Image Recognition", Chinese Optics and Applied Optics Abstracts, no. 05, 25 October 2005 (2005-10-25) *
SHI-CHIUAN WANG et al.: "Dynamic detection technology for moving objects using 3D LiDAR information and RGB camera", 2017 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-TW), 27 July 2017 (2017-07-27) *
常友渠; 赵巧; 粘永健: "A 3D Wavelet Video Coding Method", Computer Engineering and Applications, no. 14, 11 May 2010 (2010-05-11) *
徐鹏: "Research on the Organization and Visualization of Massive 3D Point Cloud Data", China Masters' Theses Full-text Database, no. 2, 15 February 2014 (2014-02-15) *
郑宗安: "Research on Solutions Based on AR Technology and 3D Digitized Point Cloud Reconstruction", China Informatization, no. 11, 20 November 2019 (2019-11-20) *
陈元相 et al.: "A LiDAR Point Cloud Compression Method with Non-uniform Sparse Sampling", Journal of Fuzhou University (Natural Science Edition), vol. 49, no. 3, 30 June 2021 (2021-06-30) *
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||