CN111899152A

CN111899152A - Point cloud data compression method and system based on projection and video stitching

Info

Publication number: CN111899152A
Application number: CN201910372758.0A
Authority: CN
Inventors: 徐异凌; 李哲; 朱文婕; 管云峰
Original assignee: Shanghai Jiaotong University
Current assignee: Shanghai Jiaotong University
Priority date: 2019-05-06
Filing date: 2019-05-06
Publication date: 2020-11-06

Abstract

The invention provides a point cloud data compression method and system based on projection and video splicing, which comprises the following steps: analyzing the geometric attribute characteristics of the point cloud data to determine a corresponding projection strategy and determine an optimal projection angle; and projecting the projection area of the same projection angle for multiple times to obtain a group of two-dimensional pictures with spatial correlation at the same angle. The invention firstly carries out coordinate conversion on the point cloud data to adapt to a corresponding projection mode, and solves the problem of shielding of dense point cloud in a mode of multiple projections at the same angle. The invention also converts the projection picture splicing into a video form, converts the single space correlation among the pictures into the time correlation among the video frames, reduces the dimension of the point cloud data of the three-dimensional space to a two-dimensional plane through a projection mode, splices the projected pictures into a video format, and further fully utilizes the existing coding tool to carry out compression coding, thereby not only improving the compression performance, but also greatly improving the efficiency by adopting the existing coding tool.

Description

Point cloud data compression method and system based on projection and video stitching

Technical Field

The invention relates to the field of point cloud data compression based on projection and video splicing, in particular to a point cloud data compression method and system based on projection and video splicing.

Background

In recent decades, three-dimensional scanning technology and systems have become mature, the manufacturing cost of 3D scanners is reduced, the precision is higher and higher, the applications are wider and wider, and three-dimensional coordinate information of the surfaces of actual objects can be rapidly and accurately acquired and stored, so that point cloud data can be widely applied to the related fields of image processing in scientific research and industry.

The point cloud data is data information of three-dimensional coordinates of the object after three-dimensional scanning, and information such as RGB (red, green, blue), depth and the like can be recorded. With the improvement of the precision and the speed of a three-dimensional scanning system, the amount of scanned point cloud data reaches the order of magnitude of millions or even more, and at present, the massive point cloud data increases heavy burden for computer storage, processing and transmission.

Point cloud compression algorithms have been studied more systematically. Most of the static point cloud compression methods are realized based on octree space decomposition. And decomposing the three-dimensional space in which the point clouds are positioned by using an octree structure, wherein each node represents a cube of a specific area in the space. The approximated point space coordinates can be calculated from the octree structure and the corresponding node information. Therefore, static point cloud compression can be realized by carrying out serialization coding on the octree structure. While a typical compression scheme for dynamic point clouds is to convert 3D point cloud data into 2D pictures for processing by projection mapping. This arrangement allows the point cloud to be compressed and transmitted by conventional processing means. However, when performing projection, due to the irregularity and density of the point cloud, there usually occurs a problem that projected points are overlapped on a two-dimensional plane to cause data loss, that is, a so-called occlusion problem. How to design reasonable projection and post-processing modes aiming at the problem of cloud occlusion, and retain original cloud point information as much as possible so as to realize efficient compression is a key problem to be solved urgently.

Disclosure of Invention

Aiming at the defects in the prior art, the invention aims to provide a point cloud data compression method and system based on projection and video splicing.

The invention provides a point cloud data compression method based on projection and video splicing, which comprises the following steps:

analyzing the geometric attribute characteristics of the point cloud data to determine a corresponding projection strategy and determine an optimal projection angle;

projecting the projection area of the same projection angle for multiple times to obtain a group of two-dimensional pictures with spatial correlation at the same angle;

and splicing a group of two-dimensional pictures into a video file.

Preferably, the method further comprises the following steps:

and analyzing the point cloud data to obtain geometric and attribute characteristics so as to determine a specific projection strategy.

Preferably, the projection strategy comprises:

the optimal projection axis and the optimal projection surface, and different projection modes are selected and set.

Preferably, the projection means comprises any one or more of:

based on the polyhedral projection method, the spherical projection method, the cylindrical projection method, and the conical projection method.

Preferably, the projection method adopts a cylindrical projection scheme:

analyzing the point cloud data, and judging a projection axis which furthest retains the original distribution characteristics; assume a sample point of x_iAnd assuming that the new coordinate system after projective transformation is W ═ W₁,w₂,w₃In which w₁,w₂,w₃For the basis vector of the new coordinate system, x can be obtained_iThe projection in the hyperplane is W^Tx_iWith the optimization objective of

s.tW^TW＝I

Using the lagrange multiplier method for the above equation, one can obtain:

XX^TW＝λW

wherein the content of the first and second substances,

i represents a unit array;

lambda represents a matrix of undetermined eigenvalues;

X＝(x₁,x₂,…,x_n) The method comprises the steps of (1) a point cloud data point set, wherein n is the total number of points contained in the point cloud data, and i is 1, 2, …, n;

for covariance matrix XX^TDecomposing the eigenvalues to obtain the largest eigenvectors of the eigenvalues, which are the normal orthogonal basis vectors of the space where the sample is located after dimensionality reduction,

with the corresponding feature vector w₁,w₂,w₃Establishing a new space rectangular coordinate system for the X, Y, Z axes, and determining the optimal projection axis.

Preferably, the method further comprises the following steps:

analyzing the point cloud data, and judging a projection axis which furthest retains the original distribution characteristics to obtain a determined optimal projection axis;

selecting a projection radius, and projecting and expanding points on the original point cloud data to a plane according to the principle of projection at the same angle;

determining an optimal projection area;

each point in the optimal projection area can be projected only once;

repeating the above process can obtain a group of two-dimensional pictures projected at the same angle, and splicing the two-dimensional pictures into a video sequence.

Preferably, the method further comprises the following steps:

determining the number of multiple projections by:

and setting a projection acquisition threshold value of the acquired projection points accounting for the total points of the original point cloud data, and accumulating to acquire a plurality of two-dimensional pictures without continuing projection when the projected points reach the projection acquisition threshold value after multiple projections.

Preferably, the method further comprises the following steps:

after the projection angle is determined, only one point is preferentially selected from coincident points of the same pixel in the two-dimensional image for each projection among multiple projections according to the geometric attribute characteristics for reservation, so that each point can be projected only once, and the optimal point is selected from a series of points with the shielding relation according to the geometric and attribute characteristics for projection.

Preferably, the method further comprises the following steps:

and determining an optimal projection area according to the distance and the geometric attributes in a series of points with the occlusion relation.

Preferably, the method further comprises the following steps:

and carrying out compression coding on the spliced video file.

Preferably, the method further comprises the following steps:

the video splicing is in any one or more of the following formats: YUV, MP4, RMVB, MKV and AVI formats, and the compression mode adopts HEVC/MPEG1/MPEG2/MPEG4/H264 mode.

The invention provides a point cloud data compression system based on projection and video splicing, which comprises:

an analysis module: analyzing the geometric attribute characteristics of the point cloud data to determine a corresponding projection strategy and determine an optimal projection angle;

a projection module: projecting the projection area of the same projection angle for multiple times to obtain a group of two-dimensional pictures with spatial correlation at the same angle;

splicing modules: and splicing a group of two-dimensional pictures into a video file.

Preferably, the method further comprises the following steps:

the analysis module is used for analyzing the point cloud data to obtain geometric and attribute characteristics and determining a specific projection strategy.

Preferably, the method further comprises the following steps:

the projection strategy comprises: the optimal projection axis and the optimal projection surface, and different projection modes are selected and set.

Preferably, the method further comprises the following steps:

the projection mode comprises any one or more of the following modes: based on a polyhedral projection method, a spherical projection method, a cylindrical projection method, or a conical projection method.

Preferably, the method further comprises the following steps:

determining an optimal projection area;

each point in the optimal projection area can be projected only once;

Preferably, the method further comprises the following steps:

determining the number of multiple projections by:

Preferably, the method further comprises the following steps:

and the compression coding module is used for carrying out compression coding on the spliced video file.

Compared with the prior art, the invention has the following beneficial effects:

the invention firstly carries out coordinate conversion on the point cloud data to adapt to a corresponding projection mode, and solves the problem of shielding of dense point cloud in a mode of multiple projections at the same angle. In addition, projection pictures are spliced and converted into a video form, single space correlation among the pictures is converted into time correlation among video frames, point cloud data of a three-dimensional space is reduced to a two-dimensional plane in a projection mode, the projected pictures are spliced into a video format, then the existing coding tools are fully utilized for compression coding, not only is the compression performance improved, but also the efficiency can be greatly improved by adopting the existing coding tools.

Drawings

Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:

FIG. 1 is a schematic diagram of original point cloud data under cylindrical projection and a cylindrical projection plane thereof according to an embodiment of the present invention;

FIG. 2 is a schematic spatial diagram of a point cloud projected onto a cylindrical surface under cylindrical projection according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of a two-dimensional image of a cylindrical projection surface after expansion under cylindrical projection according to an embodiment of the present invention;

FIG. 4 is a schematic diagram of a group of pictures, i.e., a group of video frames after stitching, obtained by multiple projections at the same angle in the embodiment of the present invention;

fig. 5 is a schematic flowchart of a point cloud data compression method based on projection and video stitching according to an embodiment of the present invention.

Detailed Description

The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that it would be obvious to those skilled in the art that various changes and modifications can be made without departing from the spirit of the invention. All falling within the scope of the present invention.

and splicing a group of two-dimensional pictures into a video file.

Preferably, the method further comprises the following steps:

Preferably, the projection strategy comprises:

Preferably, the projection means comprises any one or more of:

Preferably, the projection method adopts a cylindrical projection scheme:

s.tW^TW＝I

Using the lagrange multiplier method for the above equation, one can obtain:

XX^TW＝λW

wherein the content of the first and second substances,

i represents a unit array;

lambda represents a matrix of undetermined eigenvalues;

coordinating partyDifference matrix XX^TDecomposing the eigenvalues to obtain the largest eigenvectors of the eigenvalues, which are the normal orthogonal basis vectors of the space where the sample is located after dimensionality reduction,

Preferably, the method further comprises the following steps:

determining an optimal projection area;

each point in the optimal projection area can be projected only once;

Preferably, the method further comprises the following steps:

determining the number of multiple projections by:

Preferably, the method further comprises the following steps:

and carrying out compression coding on the spliced video file.

Preferably, the method further comprises the following steps:

determining an optimal projection area;

each point in the optimal projection area can be projected only once;

Preferably, the method further comprises the following steps:

determining the number of multiple projections by:

Preferably, the method further comprises the following steps:

The present invention will be described more specifically below with reference to preferred examples.

Preferred example 1:

the mainstream processing method of the point cloud is to convert the point cloud into a two-dimensional plane for processing, but the precision, irregularity and density of the point cloud collection cause the problem of point superposition, namely the problem of occlusion during projection. The common processing mode is multiple projections at different angles, but a large number of black dispersion points and even stripes appear on a two-dimensional picture obtained after multiple projections, which is extremely not beneficial to compression coding. People are further upgrading the consumption of media nowadays, and point clouds are indispensable as an important representation form of immersive media. In the fields of automatic driving, security monitoring and the like, efficient point cloud compression is also needed to optimize a solution. In order to realize higher performance and efficiency of point cloud coding based on two-dimensional plane compression, the invention provides a point cloud data compression method based on projection and video splicing, wherein point clouds are subjected to multiple projections at the same angle and are spliced to form a video file, and the time-space domain correlation is fully utilized to realize high-efficiency compression coding performance.

The method of projecting a projection line through a point or other object onto a selected projection surface and obtaining a pattern on that surface is called projection. Including but not limited to projections based on polyhedrons, spheres, cylinders, cones, etc.

In particular, when a cylindrical projection scheme is applied, the invention is implemented as follows:

firstly, analyzing original point cloud data, and judging a projection axis which furthest retains original distribution characteristics by adopting a principal component analysis method; assume a sample point of x_iAnd assuming that the new coordinate system after projective transformation is W ═ W₁,w₂,w₃In which w₁,w₂,w₃For the basis vector of the new coordinate system, x can be obtained_iThe projection in the hyperplane is W^Tx_i. The optimization objective is

s.tW^TW＝I

Using Lagrange multiplier method for the above equation

XX^TW＝λW

Then, for the covariance matrix XX^TAnd (4) decomposing the eigenvalues, wherein the obtained maximum eigenvectors of a plurality of (the first 3 adopted in the embodiment) eigenvalues are the orthonormal basis vectors of the space where the sample is located after dimensionality reduction. With the corresponding feature vector w₁,w₂,w₃And establishing a new space rectangular coordinate system for the X, Y, Z axis, thereby realizing the analysis of the shape of the point cloud data and determining a projection axis, a projection plane and a most appropriate projection mode.

Secondly, selecting a projection radius, and projecting points on the original point cloud shown in the attached figure 1 onto a space cylinder according to the principle of equal-angle projection. The projected effect is shown in fig. 2, and the spatial cylinder is unfolded to a two-dimensional plane, as shown in fig. 3. Generally, the point cloud data has a serious occlusion in a region with a large density, so that when the point cloud data is projected to a specific direction, a plurality of points correspond to the same pixel position on a projection plane. In order to solve the problem, each point is specified to be projected only once, and the optimal point is selected from a series of points with occlusion relation according to geometric and attribute characteristics for projection. In addition, a percentage threshold value of the number of projection points in the total number of point cloud data (i.e. a projection acquisition threshold value) is set, a group of pictures projected at the same angle can be obtained by repeating the above process (as shown in fig. 4), and when the number of projection points reaches the threshold value after multiple projections, the process is not continued.

And splicing the projected two-dimensional pictures as a group of video frames into a yuv video sequence, and then performing coding compression by using an HEVC coding mode. The spliced video sequence can be in MP4/RMVB/MKV/AVI format, and the compression modes such as MPEG1/MPEG2/MPEG4/H264 are also optional. In the above implementation in the cylindrical projection manner, the geometric features of the point cloud data are more emphasized. It is noted that other attribute features, such as color, are also important references in the projection process and are included within the scope of the present invention.

As shown in fig. 5, a detailed flow chart of the method is briefly described as follows: after the original point cloud data is obtained, the geometric and attribute characteristics of the original point cloud data are analyzed to confirm the optimal projection mode and projection plane/axis. In order to solve the shielding problem, after the projection angle is determined, only one point is preferentially selected from coincident points of the same pixel in the two-dimensional image for retention in each projection according to geometric and attribute characteristics, and a plurality of two-dimensional images at the same angle are obtained through multiple projections. When the percentage of the projection points reaches a set threshold (i.e., a projection acquisition threshold), the projection is not continued, for example, if the projection acquisition threshold in this embodiment in fig. 5 is 99%, which means that the acquired projection points account for 99% of the total points of the original point cloud data, then the projection is not continued for 6 times in fig. 5, two-dimensional pictures of 6 frames (e.g., Frame1 to Frame6 in fig. 5) are obtained, the obtained group of two-dimensional pictures are spliced into a video sequence, and a video compression tool is used to perform coding compression. The above description is only an embodiment, and the description of the projection acquisition threshold and the number of projections is not a number limitation of the present invention.

Omitted from the drawings in the specification, the embodiment also provides a point cloud data compression system based on projection and video stitching, including: the device comprises an analysis module, a projection module, a splicing module and a compression coding module.

And the analysis module is used for analyzing the geometric attribute characteristics of the point cloud data to determine a corresponding projection strategy and determine an optimal projection angle.

And the projection module is used for projecting the projection area with the same projection angle for multiple times to obtain a group of two-dimensional pictures with spatial correlation at the same angle.

And the splicing module is used for splicing the group of two-dimensional pictures into a video file.

Optionally, the projection and video stitching based point cloud data compression system may further include: and the compression coding module is used for carrying out compression coding on the spliced video file.

In summary, the point cloud data compression method and system based on projection and video stitching can analyze according to the geometrical attribute characteristics of the point cloud data, optimize the projection axis and the projection plane, and select and set different projection modes (such as a plurality of modes including a polyhedron, a spherical surface, a cylinder, a conical surface and the like), thereby improving the projection efficiency. Particularly, after the projection angle is determined for the occlusion problem, only one point is preferentially selected from coincident points of the same pixel in the two-dimensional image for each projection according to geometric and attribute characteristics to be reserved, and a plurality of two-dimensional images at the same angle are obtained through multiple projections. The method comprises the steps of setting a percentage threshold value of projection points in the total points of point cloud data, splicing a plurality of projected two-dimensional pictures into a video format when the number of projected points reaches the threshold value after multiple times of projection, converting the spatial correlation among the pictures into the time correlation among video frames, and realizing high-efficiency compression.

In the invention, the method for analyzing the point cloud data, the projection method, the splicing mode and the compression tool are various, and preferably, the method can be realized by taking the following group of alternatives as an example:

the data analysis method comprises the following steps: principal Component Analysis (PCA) can be used to determine the optimal projection axis and the optimal projection plane. PCA is characterized in that a certain hyperplane is searched, so that all sample points are spread in the hyperplane as much as possible, namely, the maximum variance of a discrete point set is calculated, and the original distribution characteristics of the discrete point set are kept to the maximum extent.

The projection method comprises the following steps: projection methods from a three-dimensional space to a two-dimensional plane are many and have advantages and disadvantages, and projection methods based on polyhedrons, spherical surfaces, cylinders, conical surfaces and the like are taken as examples.

Splicing mode and compression tool: in order to improve the intra-frame/inter-frame coding efficiency to the maximum extent, the video is proposed to be spliced into a YUV format, and an HEVC coding compression tool is adopted. The modes of MPEG1/MPEG2/MPEG4/H264 and the like are also optional. Compression may be performed using conventional video coding tools.

It should be noted that, in the invention, the projection and splicing scheme is described only by taking the above method as an example, and is not limited to the above data analysis method, projection method, video splicing and encoding method.

In the point cloud data compression system based on projection and video stitching provided in this embodiment, each functional module corresponds to the point cloud data compression method based on projection and video stitching in the above embodiment, so that the structure and technical elements in the device can be formed by corresponding conversion of the generation method, and description is omitted here and will not be repeated.

Although the present invention has been described with reference to the preferred embodiments, it is not intended to limit the present invention, and those skilled in the art can make variations and modifications of the present invention without departing from the spirit and scope of the present invention by using the methods and technical contents disclosed above.

In the description of the present application, it is to be understood that the terms "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", and the like indicate orientations or positional relationships based on those shown in the drawings, and are only for convenience in describing the present application and simplifying the description, but do not indicate or imply that the referred device or element must have a specific orientation, be constructed in a specific orientation, and be operated, and thus, should not be construed as limiting the present application.

Those skilled in the art will appreciate that, in addition to implementing the systems, apparatus, and various modules thereof provided by the present invention in purely computer readable program code, the same procedures can be implemented entirely by logically programming method steps such that the systems, apparatus, and various modules thereof are provided in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, embedded microcontrollers and the like. Therefore, the system, the device and the modules thereof provided by the present invention can be considered as a hardware component, and the modules included in the system, the device and the modules thereof for implementing various programs can also be considered as structures in the hardware component; modules for performing various functions may also be considered to be both software programs for performing the methods and structures within hardware components.

The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes or modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention. The embodiments and features of the embodiments of the present application may be combined with each other arbitrarily without conflict.

Claims

1. A point cloud data compression method based on projection and video splicing is characterized by comprising the following steps:

and splicing a group of two-dimensional pictures into a video file.

2. The projection and video stitching-based point cloud data compression method as claimed in claim 1, further comprising:

3. The projection and video stitching-based point cloud data compression method as claimed in claim 1, wherein the projection strategy comprises:

4. The projection and video stitching-based point cloud data compression method according to claim 3, wherein the projection mode comprises any one or more of the following modes:

5. The projection and video stitching-based point cloud data compression method according to claim 4, wherein the point cloud data compression method is characterized in that a cylindrical projection scheme is adopted:

s.tW^TW＝I

Using the lagrange multiplier method for the above equation, one can obtain:

XX^TW＝λW

wherein the content of the first and second substances,

i represents a unit array;

lambda represents a matrix of undetermined eigenvalues;

6. The projection and video stitching-based point cloud data compression method as claimed in claim 1, further comprising:

determining an optimal projection area;

each point in the optimal projection area can be projected only once;

7. The projection and video stitching-based point cloud data compression method as claimed in claim 1, further comprising:

determining the number of multiple projections by:

8. The projection and video stitching-based point cloud data compression method as claimed in claim 6, further comprising:

9. The projection and video stitching-based point cloud data compression method as claimed in claim 1, further comprising:

10. The projection and video stitching-based point cloud data compression method as claimed in claim 1, further comprising:

and carrying out compression coding on the spliced video file.

11. The projection and video stitching-based point cloud data compression method as claimed in claim 7, further comprising:

12. A point cloud data compression system based on projection and video stitching is characterized by comprising:

13. The projection and video stitching-based point cloud data compression system of claim 12, further comprising:

14. The projection and video stitching-based point cloud data compression system of claim 12, further comprising:

15. The projection and video stitching-based point cloud data compression system of claim 14, further comprising:

16. The projection and video stitching-based point cloud data compression system of claim 12, further comprising:

determining an optimal projection area;

each point in the optimal projection area can be projected only once;

17. The projection and video stitching-based point cloud data compression system of claim 12, further comprising:

determining the number of multiple projections by:

18. The projection and video stitching-based point cloud data compression system of claim 17, further comprising:

19. The projection and video stitching-based point cloud data compression system of claim 12, further comprising:

20. The projection and video stitching-based point cloud data compression system of claim 12, further comprising: