WO2024065271A1

WO2024065271A1 - Point cloud encoding/decoding method and apparatus, and device and storage medium

Info

Publication number: WO2024065271A1
Application number: PCT/CN2022/122116
Authority: WO
Inventors: 孙泽星
Original assignee: Oppo广东移动通信有限公司
Priority date: 2022-09-28
Filing date: 2022-09-28
Publication date: 2024-04-04

Abstract

Provided in the present application are a point cloud encoding/decoding method and apparatus, and a device and a storage medium. The point cloud encoding/decoding method comprises: during L3C2-based point cloud encoding/decoding, first determining, according to the total number of points comprised in a point cloud and the total number of nodes of L3C2 of the point cloud, the total number of duplicate points comprised in the point cloud; and during such encoding/decoding, recording in real time the number of duplicate points that have been encoded/decoded, and comparing the number of duplicate points that have been encoded/decoded with the total number of duplicate points comprised in the point cloud, so as to determine whether to encode/decode information of duplicate points of the current node when encoding/decoding the node. For example, when the current node is encoded/decoded, it being determined that the number of duplicate points that have been encoded/decoded is equal to the total number of duplicate points in the point cloud indicates that the encoding/decoding of duplicate points in the point cloud has ended, and that none of the remaining nodes comprises duplicate points and thus it is not necessary to encode/decode information of duplicate points of the subsequent nodes, thereby reducing the complexity in encoding/decoding the point cloud, saving on encoding/decoding time, and increasing the encoding/decoding efficiency.

Description

Point cloud encoding and decoding method, device, equipment and storage medium

Technical Field

The present application relates to the field of point cloud technology, and in particular to a point cloud encoding and decoding method, device, equipment and storage medium.

Background

An acquisition device scans the surface of an object to form point cloud data, which includes hundreds of thousands of points or more. In the video production process, the point cloud data is transmitted between the point cloud encoding device and the point cloud decoding device in the form of point cloud media files. However, such a large number of points makes transmission challenging, so the point cloud encoding device needs to compress the point cloud data before transmission.

Point cloud compression is also called point cloud encoding. In the point cloud encoding process, different encoding models map the points in the point cloud to nodes and encode the nodes, some of which include duplicate points. Current encoding and decoding methods, such as the low-latency, low-complexity coding model (L3C2), encode and decode duplicate point information for every node in L3C2 when encoding and decoding that node, which reduces the encoding and decoding efficiency of the point cloud.

Summary of the invention

The embodiments of the present application provide a point cloud encoding and decoding method, apparatus, device and storage medium to reduce the complexity of encoding and decoding, save encoding and decoding time, and thereby improve the encoding and decoding efficiency of the point cloud.

In a first aspect, an embodiment of the present application provides a point cloud decoding method, comprising:

decoding the geometric code stream of the point cloud to obtain the total number of nodes of the low-latency, low-complexity coding model (L3C2) of the point cloud and the total number of points of the point cloud;

determining the total number of duplicate points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2;

when decoding the current node in the L3C2, determining the number of decoded duplicate points; and

decoding the current node according to the number of decoded duplicate points and the total number of duplicate points of the point cloud.

In a second aspect, the present application provides a point cloud encoding method, comprising:

determining the low-latency, low-complexity coding model (L3C2) structure of the point cloud;

determining the total number of nodes of the L3C2 of the point cloud and the total number of points of the point cloud;

when encoding the current node in the L3C2, determining the number of encoded duplicate points; and

encoding the current node according to the number of encoded duplicate points and the total number of duplicate points of the point cloud.

In a third aspect, the present application provides a point cloud decoding device for executing the method in the first aspect or its respective implementations. Specifically, the device includes a functional unit for executing the method in the first aspect or its respective implementations.

In a fourth aspect, the present application provides a point cloud encoding device for executing the method in the second aspect or its respective implementations. Specifically, the device includes a functional unit for executing the method in the second aspect or its respective implementations.

In a fifth aspect, a point cloud decoder is provided, comprising a processor and a memory. The memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory to execute the method in the first aspect or its implementation manners.

In a sixth aspect, a point cloud encoder is provided, comprising a processor and a memory. The memory is used to store a computer program, and the processor is used to call and run the computer program stored in the memory to execute the method in the second aspect or its respective implementations.

In a seventh aspect, a point cloud encoding and decoding system is provided, comprising a point cloud encoder and a point cloud decoder. The point cloud decoder is used to execute the method in the first aspect or its respective implementations, and the point cloud encoder is used to execute the method in the second aspect or its respective implementations.

In an eighth aspect, a chip is provided for implementing the method in any one of the first to second aspects or their respective implementations. Specifically, the chip includes: a processor for calling and running a computer program from a memory, so that a device equipped with the chip executes the method in any one of the first to second aspects or their respective implementations.

In a ninth aspect, a computer-readable storage medium is provided for storing a computer program, wherein the computer program enables a computer to execute the method of any one of the first to second aspects or any of their implementations.

In a tenth aspect, a computer program product is provided, comprising computer program instructions, which enable a computer to execute the method in any one of the first to second aspects or their respective implementations.

In an eleventh aspect, a computer program is provided, which, when executed on a computer, enables the computer to execute the method in any one of the first to second aspects or in each of their implementations.

In a twelfth aspect, a code stream is provided, which is generated based on the method of the second aspect. Optionally, the code stream includes at least one of the first parameter and the second parameter.

Based on the above technical solution, in L3C2-based point cloud encoding and decoding, the total number of duplicate points included in the point cloud is first determined from the total number of points included in the point cloud and the total number of L3C2 nodes of the point cloud. During encoding and decoding, the number of duplicate points that have already been encoded or decoded is recorded in real time and compared with the total number of duplicate points included in the point cloud, in order to determine whether the duplicate point information of the current node needs to be encoded or decoded. For example, if, when encoding or decoding the current node, the number of duplicate points already encoded or decoded equals the total number of duplicate points in the point cloud, then all duplicate points in the point cloud have been processed and none of the remaining nodes includes duplicate points. The duplicate point information of subsequent nodes therefore does not need to be encoded or decoded, which reduces the encoding and decoding complexity of the point cloud, saves encoding and decoding time, and improves encoding and decoding efficiency.
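The early-termination idea above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the codec's actual syntax: it assumes that every L3C2 node carries one point plus a per-node duplicate count, so that the total number of duplicate points equals the total number of points minus the total number of nodes. The function name `decode_point_counts` and the callback `read_dup_count` (a stand-in for entropy-decoding one duplicate count from the bitstream) are hypothetical.

```python
def decode_point_counts(total_points, total_nodes, read_dup_count):
    """Return the number of points represented by each node.

    Assumption: total duplicates = total points - total nodes.
    read_dup_count is a hypothetical callback that parses one
    per-node duplicate count from the bitstream.
    """
    total_dups = total_points - total_nodes
    decoded_dups = 0
    points_per_node = []
    for _ in range(total_nodes):
        if decoded_dups < total_dups:
            # Duplicates may still be pending: parse this node's count.
            dup = read_dup_count()
            decoded_dups += dup
        else:
            # All duplicates are accounted for: skip parsing entirely.
            dup = 0
        points_per_node.append(1 + dup)
    return points_per_node


# Usage: six nodes, nine points, so three duplicates in total.
stream = iter([0, 2, 1])          # only three counts are ever read
reads = []

def read_dup_count():
    value = next(stream)
    reads.append(value)
    return value

counts = decode_point_counts(9, 6, read_dup_count)
print(counts, len(reads))
```

In this example the duplicate count is parsed for the first three nodes only; the remaining three nodes are decoded without touching the duplicate-point syntax. An encoder following the same rule would skip writing the count for those nodes.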

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A is a schematic diagram of a point cloud;

FIG. 1B is a partial enlarged view of the point cloud;

FIG. 2 is a schematic diagram of six viewing angles of a point cloud image;

FIG. 3 is a schematic block diagram of a point cloud encoding and decoding system according to an embodiment of the present application;

FIG. 4A is a schematic block diagram of a point cloud encoder provided in an embodiment of the present application;

FIG. 4B is a schematic block diagram of a point cloud decoder provided in an embodiment of the present application;

FIGS. 5A to 5C are schematic diagrams of geometric information encoding based on triangular facets;

FIG. 6 is a schematic diagram of a decoding framework of L3C2;

FIG. 7A is a schematic diagram of a single-chain structure;

FIG. 7B is a schematic diagram of a single chain after the single-chain structure shown in FIG. 7A is regularized;

FIG. 8 is a schematic diagram of the scanning principle of a laser scanner;

FIG. 9 is a schematic diagram of a prediction structure;

FIG. 10 is a schematic diagram of a determination principle of a quantization factor;

FIG. 11 is a schematic diagram of another principle for determining a quantization factor;

FIG. 12 is a schematic diagram of a decoding framework of L3C2;

FIG. 13 is a schematic flow chart of a point cloud decoding method provided in an embodiment of the present application;

FIG. 14 is a schematic flow chart of a point cloud encoding method provided in an embodiment of the present application;

FIG. 15 is a schematic block diagram of a point cloud decoding device provided in an embodiment of the present application;

FIG. 16 is a schematic block diagram of a point cloud encoding device provided in an embodiment of the present application;

FIG. 17 is a schematic block diagram of an electronic device provided in an embodiment of the present application;

FIG. 18 is a schematic block diagram of the point cloud encoding and decoding system provided in an embodiment of the present application.

Detailed Description

The present application can be applied to the field of point cloud upsampling technology and, for example, to the field of point cloud compression technology.

In order to facilitate understanding of the embodiments of the present application, the relevant concepts involved in the embodiments of the present application are briefly introduced as follows:

Point cloud refers to a set of irregularly distributed discrete points in space that express the spatial structure and surface properties of a three-dimensional object or three-dimensional scene. Figure 1A is a schematic diagram of a three-dimensional point cloud image, and Figure 1B is a partial enlarged view of Figure 1A. It can be seen from Figures 1A and 1B that the point cloud surface is composed of densely distributed points.

Two-dimensional images have information expressed at each pixel point, and the distribution is regular, so there is no need to record its position information; however, the distribution of points in the point cloud in three-dimensional space is random and irregular, so it is necessary to record the position of each point in space to fully express a point cloud. Similar to two-dimensional images, each position has corresponding attribute information during the acquisition process.

Point cloud data is a specific record form of a point cloud. A point in the point cloud may include the location information of the point and the attribute information of the point. For example, the location information of the point may be the three-dimensional coordinate information of the point, and may also be called the geometric information of the point. The attribute information of the point may include color information, reflectance information, normal vector information, etc. Color information reflects the color of an object, and reflectance information reflects the surface material of an object. The color information may be information in any color space. For example, the color information may be RGB information. For another example, the color information may be luminance and chrominance information (YCbCr, YUV), where Y represents luminance (Luma), Cb (U) represents the blue color difference, Cr (V) represents the red color difference, and U and V represent chrominance (Chroma), which describes color difference information. For example, in a point cloud obtained by the laser measurement principle, the points may include the three-dimensional coordinate information of the point and the laser reflection intensity (reflectance) of the point. For another example, in a point cloud obtained by the photogrammetry principle, the points may include the three-dimensional coordinate information of the point and the color information of the point. For another example, in a point cloud obtained by combining the principles of laser measurement and photogrammetry, the points may include the three-dimensional coordinate information of the point, the laser reflection intensity (reflectance) of the point, and the color information of the point. FIG. 2 shows six viewing angles of a point cloud image.

Table 1 shows the point cloud data storage format composed of a file header information part and a data part:

Table 1

In Table 1, the header information includes the data format, data representation type, the total number of point cloud points, and the content represented by the point cloud. For example, the point cloud in this example is in the ".ply" format, represented by ASCII code, with a total number of 207242 points, and each point has three-dimensional position information XYZ and three-dimensional color information RGB.
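To make the header fields concrete, the following Python sketch parses a PLY-style header and extracts the fields described above. The header text used here is a hypothetical reconstruction that matches the description (ASCII ".ply", 207242 points, XYZ plus RGB); the contents of Table 1 are not reproduced in this text, so the property types (`float`, `uchar`) and the function name `parse_ply_header` are assumptions.

```python
def parse_ply_header(text):
    """Extract format, point count and per-point properties from a PLY header."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "ply":
        raise ValueError("not a PLY header")
    info = {"format": None, "num_points": 0, "properties": []}
    in_vertex = False
    for line in lines[1:]:
        tokens = line.split()
        if not tokens:
            continue
        if tokens[0] == "end_header":
            break
        if tokens[0] == "format":
            info["format"] = tokens[1]          # e.g. "ascii"
        elif tokens[0] == "element":
            in_vertex = tokens[1] == "vertex"
            if in_vertex:
                info["num_points"] = int(tokens[2])
        elif tokens[0] == "property" and in_vertex:
            # Store (type, name), e.g. ("float", "x").
            info["properties"].append((tokens[1], tokens[-1]))
    return info


# Hypothetical header consistent with the description of Table 1.
HEADER = """ply
format ascii 1.0
element vertex 207242
property float x
property float y
property float z
property uchar red
property uchar green
property uchar blue
end_header
"""

info = parse_ply_header(HEADER)
print(info["format"], info["num_points"], [name for _, name in info["properties"]])
```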

Point clouds can flexibly and conveniently express the spatial structure and surface properties of three-dimensional objects or scenes. Point clouds are obtained by directly sampling real objects, so they can provide a strong sense of reality while ensuring accuracy. Therefore, they are widely used, including virtual reality games, computer-aided design, geographic information systems, automatic navigation systems, digital cultural heritage, free viewpoint broadcasting, three-dimensional immersive remote presentation, and three-dimensional reconstruction of biological tissues and organs.

Point cloud data can be obtained in at least one of the following ways: (1) Generation by computer equipment. Computer equipment can generate point cloud data based on virtual three-dimensional objects and virtual three-dimensional scenes. (2) 3D (3-Dimension) laser scanning. 3D laser scanning can be used to obtain point cloud data of static real-world three-dimensional objects or three-dimensional scenes, and millions of points can be obtained per second. (3) 3D photogrammetry. The visual scene of the real world is captured by 3D photography equipment (i.e., a group of cameras, or camera equipment with multiple lenses and sensors) to obtain point cloud data of the visual scene of the real world. 3D photography can be used to obtain point cloud data of dynamic real-world three-dimensional objects or three-dimensional scenes. (4) Acquisition by medical equipment. In the medical field, point cloud data of biological tissues and organs can be obtained by medical equipment such as magnetic resonance imaging (MRI), computed tomography (CT), and electromagnetic positioning information.

Point clouds can be divided into dense point clouds and sparse point clouds according to the way they are acquired.

Point clouds are divided into the following types according to the time series of the data:

Type 1, static point cloud: the object is stationary, and the device that acquires the point cloud is also stationary;

Type 2, dynamic point cloud: the object is moving, but the device that acquires the point cloud is stationary;

Type 3, dynamically acquired point cloud: the device that acquires the point cloud is moving.

Point clouds can be divided into two categories according to their uses:

Category 1: Machine perception point cloud, which can be used in autonomous navigation systems, real-time inspection systems, geographic information systems, visual sorting robots, emergency rescue robots, etc.

Category 2: Point cloud perceived by the human eye, which can be used in point cloud application scenarios such as digital cultural heritage, free viewpoint broadcasting, 3D immersive communication, and 3D immersive interaction.

The above point cloud acquisition technology reduces the cost and time of point cloud data acquisition and improves the accuracy of data. The change in the point cloud data acquisition method makes it possible to acquire a large amount of point cloud data. With the growth of application demand, the processing of massive 3D point cloud data encounters bottlenecks of storage space and transmission bandwidth.

Taking a point cloud video with a frame rate of 30 fps (frames per second) as an example, suppose each point cloud frame has 700,000 points and each point has coordinate information xyz (float) and color information RGB (uchar). The data volume of a 10 s point cloud video is then approximately 0.7 million × (4 Byte × 3 + 1 Byte × 3) × 30 fps × 10 s = 3.15 GB. By comparison, the data volume of a 10 s 1280 × 720 two-dimensional video with a YUV sampling format of 4:2:0 and a frame rate of 24 fps is approximately 1280 × 720 × 12 bit × 24 fps × 10 s ≈ 0.33 GB, and the data volume of a 10 s two-view 3D video is approximately 0.33 × 2 = 0.66 GB. It can be seen that the data volume of a point cloud video far exceeds that of a two-dimensional video and a three-dimensional video of the same length. Therefore, in order to better realize data management, save server storage space, and reduce the transmission traffic and transmission time between the server and the client, point cloud compression has become a key issue in promoting the development of the point cloud industry.
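The arithmetic in the comparison above can be checked with a short Python calculation. The variable names are illustrative only, and 1 GB is taken as 10^9 bytes, which is the convention that reproduces the quoted figures.

```python
SECONDS = 10
GB = 10**9  # decimal gigabyte, assumed here to match the quoted figures

# Point cloud video: 700,000 points per frame, xyz as 4-byte floats, RGB as 1-byte uchars.
POINTS_PER_FRAME = 700_000
BYTES_PER_POINT = 4 * 3 + 1 * 3
pc_bytes = POINTS_PER_FRAME * BYTES_PER_POINT * 30 * SECONDS

# 1280 x 720 video, YUV 4:2:0 (12 bits per pixel on average), 24 fps.
video_2d_bytes = 1280 * 720 * 12 * 24 * SECONDS // 8

# Two-view 3D video: two such 2D streams.
video_3d_bytes = 2 * video_2d_bytes

print(f"point cloud: {pc_bytes / GB:.2f} GB")
print(f"2D video:    {video_2d_bytes / GB:.2f} GB")
print(f"3D video:    {video_3d_bytes / GB:.2f} GB")
```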

The following is an introduction to the relevant knowledge of point cloud encoding and decoding.

FIG. 3 is a schematic block diagram of a point cloud encoding and decoding system involved in an embodiment of the present application. It should be noted that FIG. 3 is only an example, and the point cloud encoding and decoding system of the embodiment of the present application includes but is not limited to that shown in FIG. 3. As shown in FIG. 3, the point cloud encoding and decoding system 100 includes an encoding device 110 and a decoding device 120. The encoding device is used to encode (which can be understood as compression) the point cloud data to generate a code stream, and transmit the code stream to the decoding device. The decoding device decodes the code stream generated by the encoding device to obtain decoded point cloud data.

The encoding device 110 of the embodiment of the present application can be understood as a device with a point cloud encoding function, and the decoding device 120 can be understood as a device with a point cloud decoding function. That is, the encoding device 110 and the decoding device 120 of the embodiments of the present application cover a wide range of devices, such as smartphones, desktop computers, mobile computing devices, notebook (e.g., laptop) computers, tablet computers, set-top boxes, televisions, cameras, display devices, digital media players, point cloud game consoles, vehicle-mounted computers, etc.

In some embodiments, the encoding device 110 may transmit the encoded point cloud data (such as a code stream) to the decoding device 120 via the channel 130. The channel 130 may include one or more media and/or devices capable of transmitting the encoded point cloud data from the encoding device 110 to the decoding device 120.

In one example, the channel 130 includes one or more communication media that enable the encoding device 110 to transmit the encoded point cloud data directly to the decoding device 120 in real time. In this example, the encoding device 110 can modulate the encoded point cloud data according to the communication standard and transmit the modulated point cloud data to the decoding device 120. The communication medium includes a wireless communication medium, such as a radio frequency spectrum, and optionally, the communication medium may also include a wired communication medium, such as one or more physical transmission lines.

In another example, the channel 130 includes a storage medium, which can store the point cloud data encoded by the encoding device 110. The storage medium includes a variety of locally accessible data storage media, such as optical disks, DVDs, flash memories, etc. In this example, the decoding device 120 can obtain the encoded point cloud data from the storage medium.

In another example, the channel 130 may include a storage server that can store the point cloud data encoded by the encoding device 110. In this example, the decoding device 120 can download the stored encoded point cloud data from the storage server. Optionally, the storage server can store the encoded point cloud data and transmit the encoded point cloud data to the decoding device 120, such as a web server (e.g., for a website), a file transfer protocol (FTP) server, etc.

In some embodiments, the encoding device 110 includes a point cloud encoder 112 and an output interface 113. The output interface 113 may include a modulator/demodulator (modem) and/or a transmitter.

In some embodiments, the encoding device 110 may further include a point cloud source 111 in addition to the point cloud encoder 112 and the output interface 113.

The point cloud source 111 may include at least one of a point cloud acquisition device (e.g., a scanner), a point cloud archive, a point cloud input interface, and a computer graphics system, wherein the point cloud input interface is used to receive point cloud data from a point cloud content provider, and the computer graphics system is used to generate point cloud data.

The point cloud encoder 112 encodes the point cloud data from the point cloud source 111 to generate a code stream. The point cloud encoder 112 transmits the encoded point cloud data directly to the decoding device 120 via the output interface 113. The encoded point cloud data can also be stored in a storage medium or a storage server for subsequent reading by the decoding device 120.

In some embodiments, the decoding device 120 includes an input interface 121 and a point cloud decoder 122.

In some embodiments, the decoding device 120 may further include a display device 123 in addition to the input interface 121 and the point cloud decoder 122.

The input interface 121 includes a receiver and/or a modem. The input interface 121 can receive the encoded point cloud data through the channel 130.

The point cloud decoder 122 is used to decode the encoded point cloud data to obtain decoded point cloud data, and transmit the decoded point cloud data to the display device 123.

The decoded point cloud data is displayed on the display device 123. The display device 123 may be integrated with the decoding device 120 or may be external to the decoding device 120. The display device 123 may include a variety of display devices, such as a liquid crystal display (LCD), a plasma display, an organic light emitting diode (OLED) display, or other types of display devices.

In addition, Figure 3 is only an example, and the technical solution of the embodiment of the present application is not limited to Figure 3. For example, the technology of the present application can also be applied to unilateral point cloud encoding or unilateral point cloud decoding.

The current point cloud encoder can adopt two point cloud compression coding technology routes proposed by the International Standards Organization Moving Picture Experts Group (MPEG), namely Video-based Point Cloud Compression (VPCC) and Geometry-based Point Cloud Compression (GPCC). VPCC projects the three-dimensional point cloud into two dimensions and uses the existing two-dimensional coding tools to encode the projected two-dimensional image. GPCC uses a hierarchical structure to divide the point cloud into multiple units step by step, and encodes the entire point cloud by encoding the division process.

The following uses the GPCC encoding and decoding framework as an example to explain the point cloud encoder and point cloud decoder applicable to the embodiments of the present application.

FIG. 4A is a schematic block diagram of a point cloud encoder provided in an embodiment of the present application.

From the above, we can know that the points in the point cloud can include the location information of the points and the attribute information of the points. Therefore, the encoding of the points in the point cloud mainly includes location encoding and attribute encoding. In some examples, the location information of the points in the point cloud is also called geometric information, and the corresponding location encoding of the points in the point cloud can also be called geometric encoding.

In the GPCC coding framework, the geometric information of the point cloud and the corresponding attribute information are encoded separately.

As shown in FIG. 4A below, the current geometric coding and decoding of G-PCC can be divided into octree-based geometric coding and decoding and prediction tree-based geometric coding and decoding.

The process of position coding includes: preprocessing the points in the point cloud, such as coordinate transformation, quantization, and removal of duplicate points; then, geometric coding the preprocessed point cloud, such as constructing an octree, or constructing a prediction tree, and geometric coding based on the constructed octree or prediction tree to form a geometric code stream. At the same time, based on the position information output by the constructed octree or prediction tree, the position information of each point in the point cloud data is reconstructed to obtain the reconstructed value of the position information of each point.

The attribute encoding process includes: given the reconstruction information of the input point cloud position information and the original value of the attribute information, selecting one of the three prediction modes for point cloud prediction, quantizing the predicted result, and performing arithmetic coding to form an attribute code stream.

As shown in Figure 4A, position encoding can be achieved by the following units:

A coordinate transformation (Transform coordinates) unit 201, a voxel (Voxelize) unit 202, an octree division (Analyze octree) unit 203, a geometry reconstruction (Reconstruct geometry) unit 204, a first arithmetic encoding (Arithmetic encode) unit 205, a surface fitting (Analyze surface approximation) unit 206 and a prediction tree construction unit 207.

The coordinate transformation unit 201 can be used to convert the world coordinates of the points in the point cloud into relative coordinates. For example, the minimum value along each of the x, y and z coordinate axes is subtracted from the corresponding geometric coordinate of each point, which is equivalent to a DC removal operation, so as to convert the coordinates of the points in the point cloud from world coordinates to relative coordinates.
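The world-to-relative conversion described above can be sketched in Python as follows. The function name `to_relative` is hypothetical; the sketch simply subtracts the per-axis minimum from every point and returns that minimum so the shift can be undone after decoding.

```python
def to_relative(points):
    """Shift points so that the minimum along each axis becomes 0.

    Returns the shifted points and the per-axis minimum (the removed offset).
    """
    mins = tuple(min(p[axis] for p in points) for axis in range(3))
    shifted = [tuple(p[axis] - mins[axis] for axis in range(3)) for p in points]
    return shifted, mins


world = [(10, 20, 30), (12, 20, 35), (11, 25, 30)]
relative, offset = to_relative(world)
print(relative, offset)
```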

The voxel unit 202 is also called a quantize and remove points unit, which can reduce the number of distinct coordinates by quantization. After quantization, originally different points may be assigned the same coordinates; based on this, duplicate points can be deleted by a deduplication operation. For example, multiple points with the same quantized position and different attribute information can be merged into one point by attribute conversion. In some embodiments of the present application, the voxel unit 202 is an optional unit module.
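A minimal Python sketch of quantization followed by duplicate merging is given below. It is illustrative only: the function name `voxelize`, the uniform quantization step, and the choice to merge attributes by averaging are assumptions, since the text does not specify how the attribute conversion is performed.

```python
import math

def voxelize(points, colors, step):
    """Quantize positions to a grid and merge points that collide.

    Assumption: colliding points are merged by averaging their colors.
    Returns unique quantized positions, merged colors, and the number
    of duplicate points that were removed.
    """
    merged = {}  # quantized position -> colors of all points that landed there
    for pos, color in zip(points, colors):
        key = tuple(math.floor(c / step) for c in pos)
        merged.setdefault(key, []).append(color)
    positions = list(merged)
    attributes = [
        tuple(sum(channel) // len(group) for channel in zip(*group))
        for group in merged.values()
    ]
    removed = len(points) - len(positions)
    return positions, attributes, removed


pts = [(0.1, 0.2, 0.3), (0.4, 0.4, 0.4), (1.2, 0.0, 0.0)]
cols = [(10, 20, 30), (30, 40, 50), (0, 0, 0)]
positions, attributes, removed = voxelize(pts, cols, step=1.0)
print(positions, attributes, removed)
```

Here the first two points quantize to the same position and are merged, so one duplicate point is removed.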

The octree division unit 203 may use an octree encoding method to encode the position information of the quantized points. For example, the point cloud is divided in the form of an octree, so that the positions of the points correspond one-to-one to positions in the octree; the positions in the octree that contain points are counted and their flags are recorded as 1 to perform geometric encoding.
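The occupancy flags described above can be illustrated with a small breadth-first octree sketch in Python. It assumes integer coordinates inside a cube whose side is a power of two, and it packs the eight child flags of each node into one byte using an x/y/z bit order chosen for this example; the function name `encode_octree` and that bit order are assumptions, not the codec's defined syntax.

```python
def encode_octree(points, size):
    """Return one 8-bit occupancy code per internal node, in breadth-first order.

    Bit i of a code is 1 when child i of that node contains at least one point.
    Assumed child index: (x_high << 2) | (y_high << 1) | z_high.
    """
    codes = []
    queue = [((0, 0, 0), size, sorted(set(points)))]
    while queue:
        (ox, oy, oz), node_size, pts = queue.pop(0)
        if node_size == 1:
            continue  # unit cube reached: nothing left to divide
        half = node_size // 2
        children = {}
        for x, y, z in pts:
            idx = (int(x - ox >= half) << 2) | (int(y - oy >= half) << 1) | int(z - oz >= half)
            children.setdefault(idx, []).append((x, y, z))
        code = 0
        for idx in children:
            code |= 1 << idx  # flag the occupied child as 1
        codes.append(code)
        for idx in sorted(children):
            child_origin = (
                ox + half * ((idx >> 2) & 1),
                oy + half * ((idx >> 1) & 1),
                oz + half * (idx & 1),
            )
            queue.append((child_origin, half, children[idx]))
    return codes


codes = encode_octree([(0, 0, 0), (3, 3, 3)], size=4)
print(codes)
```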

In some embodiments, in the process of geometric information encoding based on triangle soup (trisoup), the point cloud is also divided into octrees by the octree division unit 203. However, different from the geometric information encoding based on the octree, the trisoup does not need to divide the point cloud into unit cubes with a side length of 1x1x1 step by step, but stops dividing when the block (sub-block) has a side length of W. Based on the surface formed by the distribution of the point cloud in each block, at most twelve vertices (intersections) generated by the surface and the twelve edges of the block are obtained, and the intersections are surface fitted by the surface fitting unit 206, and the fitted intersections are geometrically encoded.

The prediction tree construction unit 207 can use the prediction tree encoding method to encode the position information of the quantized points. For example, the point cloud is divided in the form of a prediction tree, so that the position of the point can correspond to the position of the node in the prediction tree one by one. By counting the positions of the points in the prediction tree, the geometric position information of the node is predicted by selecting different prediction modes to obtain the prediction residual, and the geometric prediction residual is quantized using the quantization parameter. Finally, through continuous iteration, the prediction residual of the prediction tree node position information, the prediction tree structure and the quantization parameter are encoded to generate a binary code stream.
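The predict, subtract and quantize loop described above can be sketched as follows. This is a simplified illustration under several assumptions: the tree is reduced to a single chain in which each node's parent is the previous node, only three illustrative prediction modes are offered (none, parent, linear extrapolation from parent and grandparent), the mode with the smallest absolute residual is chosen, and the quantizer is a uniform step `qstep`. All function names are hypothetical.

```python
def predict(mode, recon):
    """Predict a position from already reconstructed ancestors (nearest last)."""
    zero = (0, 0, 0)
    p0 = recon[-1] if len(recon) >= 1 else zero   # parent
    p1 = recon[-2] if len(recon) >= 2 else zero   # grandparent
    if mode == 0:
        return zero                                # no prediction
    if mode == 1:
        return p0                                  # delta from parent
    return tuple(2 * a - b for a, b in zip(p0, p1))  # linear extrapolation

def encode_chain(points, qstep):
    """Return (mode, quantized residual) symbols and the reconstructed points."""
    recon, symbols = [], []
    for pt in points:
        best = None
        for mode in (0, 1, 2):
            pred = predict(mode, recon)
            residual = tuple(c - p for c, p in zip(pt, pred))
            cost = sum(abs(r) for r in residual)
            if best is None or cost < best[0]:
                best = (cost, mode, pred, residual)
        _, mode, pred, residual = best
        q = tuple(int(round(r / qstep)) for r in residual)
        symbols.append((mode, q))
        # Predict from reconstructed values so encoder and decoder stay in sync.
        recon.append(tuple(p + qi * qstep for p, qi in zip(pred, q)))
    return symbols, recon

def decode_chain(symbols, qstep):
    recon = []
    for mode, q in symbols:
        pred = predict(mode, recon)
        recon.append(tuple(p + qi * qstep for p, qi in zip(pred, q)))
    return recon


pts = [(10, 10, 10), (12, 11, 10), (14, 12, 10)]
symbols, recon = encode_chain(pts, qstep=1)
print(symbols)
print(decode_chain(symbols, qstep=1))
```

With a step of 1 the chain is reconstructed exactly; with a larger step the decoder still reproduces the encoder's reconstructed values, because both sides predict from the same reconstructed ancestors.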

The geometric reconstruction unit 204 can perform position reconstruction based on the position information output by the octree division unit 203 or the intersection points fitted by the surface fitting unit 206 to obtain the reconstructed value of the position information of each point in the point cloud data. Alternatively, the position reconstruction can be performed based on the position information output by the prediction tree construction unit 207 to obtain the reconstructed value of the position information of each point in the point cloud data.

The first arithmetic encoding unit 205 can use entropy coding to perform arithmetic coding on the position information output by the octree division unit 203, the intersection points fitted by the surface fitting unit 206, or the geometric prediction residual values output by the prediction tree construction unit 207 to generate a geometric code stream; the geometric code stream can also be called a geometry bitstream.

Attribute encoding can be achieved through the following units:

A color conversion (Transform colors) unit 210, a recoloring (Transfer attributes) unit 211, a Region Adaptive Hierarchical Transform (RAHT) unit 212, an LOD generation (Generate LOD) unit 213, a lifting (lifting transform) unit 214, a quantization (Quantize coefficients) unit 215 and an arithmetic coding unit 216.

It should be noted that the point cloud encoder 200 may include more, fewer or different functional components than those shown in FIG. 4A .

The color conversion unit 210 may be used to convert the RGB color space of the points in the point cloud into a YCbCr format or other formats.

The recoloring unit 211 recolors the color information using the reconstructed geometric information so that the uncoded attribute information corresponds to the reconstructed geometric information.

After the original value of the attribute information of the point is converted by the recoloring unit 211, either transformation unit can be selected to transform the points in the point cloud. The transformation units may include the RAHT transformation unit 212 and the lifting (lifting transform) unit 214. Among them, the lifting transformation depends on generating a level of detail (LOD).

Any of the RAHT transformation and the lifting transformation can be understood as being used to predict the attribute information of a point in a point cloud to obtain a predicted value of the attribute information of the point, and then obtain a residual value of the attribute information of the point based on the predicted value of the attribute information of the point. For example, the residual value of the attribute information of the point can be the original value of the attribute information of the point minus the predicted value of the attribute information of the point.

In one embodiment of the present application, the process of generating LOD by the LOD generating unit includes: obtaining the Euclidean distance between points according to the position information of the points in the point cloud; and dividing the points into different detail expression layers according to the Euclidean distance. In one embodiment, the Euclidean distances can be sorted and the Euclidean distances in different ranges can be divided into different detail expression layers. For example, a point can be randomly selected as the first detail expression layer. Then the Euclidean distances between the remaining points and the point are calculated, and the points whose Euclidean distances meet the first threshold requirement are classified as the second detail expression layer. The centroid of the points in the second detail expression layer is obtained, and the Euclidean distances between the points other than the first and second detail expression layers and the centroid are calculated, and the points whose Euclidean distances meet the second threshold are classified as the third detail expression layer. By analogy, all points are classified into the detail expression layer. By adjusting the threshold of the Euclidean distance, the number of points in each LOD layer can be increased. It should be understood that the LOD division method can also be adopted in other ways, and the present application does not limit this.

It should be noted that the point cloud may be directly divided into one or more detail expression layers, or the point cloud may be first divided into a plurality of point cloud slices, and then each point cloud slice may be divided into one or more LOD layers.

For example, the point cloud can be divided into multiple point cloud blocks, and the number of points in each point cloud block can be between 550,000 and 1.1 million. Each point cloud block can be regarded as a separate point cloud. Each point cloud block can be divided into multiple detail expression layers, and each detail expression layer includes multiple points. In one embodiment, the detail expression layer can be divided according to the Euclidean distance between points.

The quantization unit 215 may be used to quantize the residual value of the attribute information of the point. For example, if the quantization unit 215 is connected to the RAHT transformation unit 212, the quantization unit 215 may be used to quantize the residual value of the attribute information of the point output by the RAHT transformation unit 212.

The arithmetic coding unit 216 may use zero run length coding to perform entropy coding on the residual value of the attribute information of the point to obtain an attribute code stream. The attribute code stream may be bit stream information.
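The zero run length idea mentioned above can be sketched as follows. This is only an illustration of the run-length layer, under the assumption that each run of zeros is represented by a count (called zero_cnt here, following the name used later in this description) followed by the next non-zero residual; the function names are hypothetical, and the entropy coding of the resulting symbols is not shown.

```python
from typing import List, Tuple


def zero_run_encode(residuals: List[int]) -> List[Tuple[int, int]]:
    """Turn a residual sequence into (zero_cnt, value) pairs.

    zero_cnt is the number of zeros preceding a non-zero value.
    A trailing run of zeros is emitted as (zero_cnt, 0).
    """
    pairs = []
    zero_cnt = 0
    for value in residuals:
        if value == 0:
            zero_cnt += 1
        else:
            pairs.append((zero_cnt, value))
            zero_cnt = 0
    if zero_cnt:
        pairs.append((zero_cnt, 0))
    return pairs


def zero_run_decode(pairs: List[Tuple[int, int]]) -> List[int]:
    """Inverse of zero_run_encode."""
    residuals = []
    for zero_cnt, value in pairs:
        residuals.extend([0] * zero_cnt)
        if value != 0:
            residuals.append(value)
    return residuals
```

For instance, a residual sequence dominated by zeros collapses to a few pairs, which is why this representation suits well-predicted attributes.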

FIG4B is a schematic block diagram of a point cloud decoder provided in an embodiment of the present application.

As shown in Fig. 4B, the decoder 300 can obtain the point cloud code stream from the encoding device, and obtain the position information and attribute information of the points in the point cloud by parsing the code. The decoding of the point cloud includes position decoding and attribute decoding.

The process of position decoding includes: performing arithmetic decoding on the geometric code stream; merging after building the octree, reconstructing the position information of the point to obtain the reconstructed information of the point position information; performing coordinate transformation on the reconstructed information of the point position information to obtain the point position information. The point position information can also be called the geometric information of the point.

The attribute decoding process includes: obtaining the residual value of the attribute information of the point in the point cloud by parsing the attribute code stream; obtaining the residual value of the attribute information of the point after dequantization by dequantizing the residual value of the attribute information of the point; based on the reconstruction information of the point position information obtained in the position decoding process, selecting one of the following RAHT inverse transform and lifting inverse transform to predict the point cloud to obtain the predicted value, and adding the predicted value to the residual value to obtain the reconstructed value of the attribute information of the point; performing color space inverse conversion on the reconstructed value of the attribute information of the point to obtain the decoded point cloud.

As shown in FIG4B , position decoding can be achieved by the following units:

Arithmetic decoding unit 301, octree synthesis (synthesize octree) unit 302, surface fitting (Synthesize surface approximation) unit 303, geometry reconstruction (Reconstruct geometry) unit 304, inverse transform (inverse transform coordinates) unit 305 and prediction tree reconstruction unit 306.

Attribute decoding can be achieved through the following units:

Arithmetic decoding unit 310, inverse quantize unit 311, RAHT inverse transform unit 312, generate LOD unit 313, inverse lifting unit 314 and inverse transform colors unit 315.

It should be noted that decompression is the inverse process of compression. Similarly, the functions of each unit in the decoder 300 can refer to the functions of the corresponding units in the encoder 200. In addition, the point cloud decoder 300 may include more, fewer or different functional components than those shown in FIG. 4B.

For example, the decoder 300 can divide the point cloud into multiple LODs according to the Euclidean distance between points in the point cloud; then, the attribute information of the points in the LODs is decoded in sequence. For example, the number of zeros (zero_cnt) in the zero-run-length coding technique is obtained, and the residual is decoded based on zero_cnt; then, the decoder 300 can perform inverse quantization on the decoded residual value, and add the inverse-quantized residual value to the predicted value of the current point to obtain the reconstructed value of the point, until all points are decoded. The current point will be used as a nearest neighbor of points in subsequent LODs, and the reconstructed value of the current point will be used to predict the attribute information of subsequent points.

The above is the basic process of the point cloud codec based on the GPCC codec framework. With the development of technology, some modules or steps of the framework or process may be optimized. This application is applicable to the basic process of the point cloud codec based on the GPCC codec framework, but is not limited to the framework and process.

The following introduces octree-based geometric coding and prediction tree-based geometric coding.

Octree-based geometric encoding includes the following. First, the geometric information undergoes coordinate transformation so that all points of the point cloud are contained in a bounding box. Then quantization is performed; this step mainly plays a scaling role. Due to quantization rounding, the geometric information of some points becomes identical, and whether to remove such duplicate points is determined based on parameters. The process of quantization and removal of duplicate points is also called voxelization. Next, the bounding box is continuously divided into trees (octree/quadtree/binary tree) in the order of breadth-first traversal, and the occupancy code of each node is encoded. In an implicit geometric division method, the bounding box of the point cloud is first calculated.

Assume that the bounding box satisfies d_x > d_y > d_z, so that the bounding box corresponds to a cuboid. During geometric partitioning, binary tree partitioning is first performed along the x-axis to obtain two child nodes. When the condition d_x = d_y > d_z is met, quadtree partitioning is performed along the x-axis and y-axis to obtain four child nodes. When the condition d_x = d_y = d_z is finally met, octree partitioning is performed until the leaf node obtained by partitioning is a 1x1x1 unit cube. The partitioning then stops, and the points in the leaf node are encoded to generate a binary code stream.

In the process of binary tree/quadtree/octree partitioning, two parameters are introduced: K and M. Parameter K indicates the maximum number of binary tree/quadtree partitions before octree partitioning; parameter M indicates that the minimum block side length corresponding to binary tree/quadtree partitioning is 2^M. K and M must meet the following conditions: assuming d_max = max(d_x, d_y, d_z) and d_min = min(d_x, d_y, d_z), parameter K satisfies K >= d_max - d_min, and parameter M satisfies M >= d_min. The reason why parameters K and M meet the above conditions is that, in the implicit geometric partitioning of G-PCC, the partitioning priority is binary tree, then quadtree, then octree. When the node block size does not meet the conditions for binary tree/quadtree partitioning, the node is partitioned by octree until the minimum leaf node unit of 1x1x1 is reached.

However, the octree-based geometric information coding mode achieves an efficient compression rate only for points that are spatially correlated. For points at isolated positions in the geometric space, using the direct coding mode (Direct Coding Mode, DCM) can greatly reduce the complexity. For all nodes in the octree, the use of DCM is not signaled by flag information, but is inferred from the parent node and neighbor information of the current node. There are two ways to determine whether the current node is eligible for DCM encoding:

(1) The current node has only one occupied child node, and the parent node of the current node's parent node has only two occupied child nodes, that is, the current node has at most one neighbor node.

(2) The parent node of the current node has only one child node, the current node. At the same time, the six neighbor nodes that share a face with the current node are also empty nodes.

If the current node is not eligible for DCM coding, it is divided by octree partitioning. If it is eligible for DCM coding, the number of points contained in the node is further determined. When the number of points is less than the threshold 2, the node is DCM-encoded; otherwise octree division continues. When the DCM coding mode is applied, the X, Y, and Z components of the geometric coordinates of the points contained in the current node are directly and independently encoded. When the side length of a node is (2^dx, 2^dy, 2^dz), dx, dy, and dz bits are required to encode the x, y, and z components of the geometric coordinates of a point in the node, respectively, and this bit information is directly written into the bitstream.
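The direct coding of coordinates described above can be illustrated with a minimal sketch. It assumes that each point is expressed as an offset relative to the origin of its node and that the bits of each component are written most significant bit first; the function names and the bit order are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple


def dcm_encode_point(offset: Sequence[int], log2_size: Sequence[int]) -> List[int]:
    """Write the x, y, z offsets of a point with dx, dy, dz bits respectively.

    offset    -- (x, y, z) position relative to the node origin
    log2_size -- (dx, dy, dz), so that the node side lengths are 2**dx, 2**dy, 2**dz
    """
    bits = []
    for value, nbits in zip(offset, log2_size):
        if not 0 <= value < (1 << nbits):
            raise ValueError("offset does not fit inside the node")
        # Most significant bit first (an assumed convention).
        bits.extend((value >> shift) & 1 for shift in range(nbits - 1, -1, -1))
    return bits


def dcm_decode_point(bits: Sequence[int], log2_size: Sequence[int]) -> Tuple[int, ...]:
    """Read back the x, y, z offsets written by dcm_encode_point."""
    coords = []
    pos = 0
    for nbits in log2_size:
        value = 0
        for bit in bits[pos:pos + nbits]:
            value = (value << 1) | bit
        coords.append(value)
        pos += nbits
    return tuple(coords)
```

With a node of side lengths (8, 4, 2), a point costs 3 + 2 + 1 = 6 bits, independent of how deep the remaining octree would otherwise have been.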

It should be noted that when nodes are divided down to leaf nodes, in the case of geometric lossless coding, the number of repeated points in the leaf nodes needs to be encoded. Finally, the occupancy information of all nodes is encoded to generate a binary code stream. In addition, G-PCC currently introduces a plane coding mode. In the process of geometric division, it is determined whether the child nodes of the current node are in the same plane. If the child nodes of the current node meet the same-plane condition, the child nodes of the current node are represented by the plane.

In octree-based geometric decoding, the decoder obtains the occupancy code of each node by continuously parsing in the order of breadth-first traversal, and divides the nodes in sequence until a 1x1x1 unit cube is obtained. The number of points contained in each leaf node is parsed, and finally the geometrically reconstructed point cloud information is restored.
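The breadth-first encoding and parsing of per-node occupancy (placeholder) codes can be sketched as follows. This is a simplified illustration that uses pure octree division on a cubic bounding box of side 2**depth, drops duplicate points, and omits entropy coding, DCM and plane coding; the child indexing (x as the most significant of the three bits) and the function names are assumptions.

```python
from collections import deque
from typing import Iterable, List, Tuple

Point = Tuple[int, int, int]


def _child_origin(origin: Point, half: int, idx: int) -> Point:
    return (origin[0] + half * ((idx >> 2) & 1),
            origin[1] + half * ((idx >> 1) & 1),
            origin[2] + half * (idx & 1))


def encode_octree_occupancy(points: Iterable[Point], depth: int) -> List[int]:
    """Return 8-bit occupancy codes in breadth-first order."""
    codes = []
    queue = deque([((0, 0, 0), depth, list(set(points)))])
    while queue:
        origin, level, pts = queue.popleft()
        if level == 0:
            continue  # 1x1x1 leaf: nothing left to split
        half = 1 << (level - 1)
        children = {}
        for p in pts:
            idx = (((p[0] - origin[0]) >= half) << 2
                   | ((p[1] - origin[1]) >= half) << 1
                   | ((p[2] - origin[2]) >= half))
            children.setdefault(idx, []).append(p)
        code = 0
        for idx in sorted(children):
            code |= 1 << idx
            queue.append((_child_origin(origin, half, idx), level - 1, children[idx]))
        codes.append(code)
    return codes


def decode_octree_occupancy(codes: Iterable[int], depth: int) -> List[Point]:
    """Rebuild the occupied unit cubes from breadth-first occupancy codes."""
    points = []
    code_iter = iter(codes)
    queue = deque([((0, 0, 0), depth)])
    while queue:
        origin, level = queue.popleft()
        if level == 0:
            points.append(origin)
            continue
        code = next(code_iter)
        half = 1 << (level - 1)
        for idx in range(8):
            if code & (1 << idx):
                queue.append((_child_origin(origin, half, idx), level - 1))
    return points
```

Because both sides visit nodes in the same first-in, first-out order, the decoder can consume the codes without any explicit node addresses.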

In the geometric information coding framework based on trisoup (triangle soup, triangle patch set), geometric division must also be performed first, but different from the geometric information coding based on binary tree/quadtree/octree, this method does not need to divide the point cloud into unit cubes with a side length of 1x1x1 step by step, but stops dividing when the block (sub-block) has a side length of W. Based on the surface formed by the distribution of the point cloud in each block, at most twelve vertices (intersection points) generated by the surface and the twelve edges of the block are obtained. The vertex coordinates of each block are encoded in turn to generate a binary code stream.

When reconstructing the point cloud geometry information based on trisoup, the vertex coordinates are first decoded to complete the reconstruction of the triangle facets at the decoding end. The process is shown in Figures 5A to 5C. There are three vertices (v1, v2, v3) in the block shown in Figure 5A. The triangle facet set formed by these three vertices in a certain order is called triangle soup, or trisoup, as shown in Figure 5B. After that, sampling is performed on the triangle facet set, and the obtained sampling points are used as the reconstructed point cloud in the block, as shown in Figure 5C.

The geometric coding based on the prediction tree includes: first, sorting the input point cloud. The currently used sorting methods include unordered, Morton order, azimuth order and radial distance order. At the encoding end, the prediction tree structure is established by using two different methods, including: KD-Tree (high-latency slow mode) and using the laser radar calibration information to divide each point into different Lasers, and establish a prediction structure according to different Lasers (low-latency fast mode). Next, based on the structure of the prediction tree, traverse each node in the prediction tree, predict the geometric position information of the node by selecting different prediction modes to obtain the prediction residual, and quantize the geometric prediction residual using the quantization parameter. Finally, through continuous iteration, the prediction residual of the prediction tree node position information, the prediction tree structure and the quantization parameters are encoded to generate a binary code stream.

Based on the geometric decoding of the prediction tree, the decoding end reconstructs the prediction tree structure by continuously parsing the bit stream, and then obtains the geometric position prediction residual information and quantization parameters of each prediction node through parsing, and dequantizes the prediction residual to recover the reconstructed geometric position information of each node, and finally completes the geometric reconstruction of the decoding end.

After the geometric encoding is completed, the geometric information is reconstructed. At present, attribute encoding is mainly performed on color information. First, the color information is converted from the RGB color space to the YUV color space. Then, the point cloud is recolored using the reconstructed geometric information so that the unencoded attribute information corresponds to the reconstructed geometric information. In color information encoding, there are two main transformation methods. One is the distance-based lifting transformation that relies on LOD (Level of Detail) division, and the other is to directly perform the RAHT (Region Adaptive Hierarchical Transform) transformation. Both methods convert color information from the spatial domain to the frequency domain, obtain high-frequency coefficients and low-frequency coefficients through transformation, and finally quantize and encode the coefficients to generate a binary code stream.

When using geometric information to predict attribute information, Morton codes can be used to search for nearest neighbors. The Morton code corresponding to each point in the point cloud can be obtained from the geometric coordinates of the point. The specific method for calculating the Morton code is as follows. For a three-dimensional coordinate in which each component is represented by a d-bit binary number, the three components can be expressed as formula (1):

$x = \sum_{l=1}^{d} 2^{d-l} x_l,\quad y = \sum_{l=1}^{d} 2^{d-l} y_l,\quad z = \sum_{l=1}^{d} 2^{d-l} z_l$ (1)

where $x_l, y_l, z_l \in \{0, 1\}$ are the binary values corresponding to the highest bit ($l = 1$) to the lowest bit ($l = d$) of x, y, and z, respectively. The Morton code M is obtained by arranging the bits of x, y, and z alternately in sequence, starting from the highest bit and ending at the lowest bit. The calculation formula of M is shown in the following formula (2):

$M = \sum_{l=1}^{d} 2^{3(d-l)} (4x_l + 2y_l + z_l) = \sum_{l'=1}^{3d} 2^{3d-l'} m_{l'}$ (2)

where $m_{l'} \in \{0, 1\}$ are the values of the highest bit ($l' = 1$) to the lowest bit ($l' = 3d$) of M.

After obtaining the Morton code M of each point in the point cloud, the points in the point cloud are arranged in order from small to large Morton codes, and the weight w of each point is set to 1.
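The bit interleaving described above can be sketched in a few lines. The sketch assumes non-negative integer coordinates of d bits each, with x contributing the most significant bit of every 3-bit group; the function names are illustrative.

```python
from typing import List, Tuple


def morton_code(x: int, y: int, z: int, d: int) -> int:
    """Interleave the d-bit coordinates x, y, z from the highest bit to the lowest."""
    m = 0
    for bit in range(d - 1, -1, -1):
        m = (m << 3) | (((x >> bit) & 1) << 2) | (((y >> bit) & 1) << 1) | ((z >> bit) & 1)
    return m


def sort_by_morton(points: List[Tuple[int, int, int]], d: int) -> List[Tuple[int, int, int]]:
    """Arrange points in ascending Morton order."""
    return sorted(points, key=lambda p: morton_code(p[0], p[1], p[2], d))
```

Points that are close in space tend to be close in this ordering, which is what makes the Morton code usable for nearest-neighbor search.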

There are 4 general test conditions for GPCC:

Condition 1: The geometric position is limitedly lossy and the attributes are lossy;

Condition 2: The geometric position is lossless, but the attributes are lossy;

Condition 3: The geometric position is lossless, and the attributes are limitedly lossy;

Condition 4: The geometric position and attributes are lossless.

GPCC's general test sequences include four categories: Cat1A, Cat1B, Cat3-fused, and Cat3-frame. Among them, Cat3-frame point clouds only contain reflectance attribute information, Cat1A and Cat1B point clouds only contain color attribute information, and Cat3-fused point clouds contain both color and reflectance attribute information.

GPCC's technical routes: There are 2 types in total, differentiated by the algorithms used for geometric compression.

Technical route 1: Octree encoding branch:

At the encoding end, the bounding box is divided into sub-cubes in sequence, and the non-empty sub-cubes (containing points in the point cloud) are continued to be divided until the leaf node obtained by division is a 1X1X1 unit cube. In the case of geometric lossless coding, it is necessary to encode the number of points contained in the leaf node, and finally complete the encoding of the geometric octree to generate a binary code stream.

At the decoding end, the occupancy code of each node is obtained by continuously parsing in the order of breadth-first traversal, and the nodes are divided in turn until a 1x1x1 unit cube is obtained. In the case of geometric lossless decoding, it is necessary to parse the number of points contained in each leaf node and finally restore the geometrically reconstructed point cloud information.

Technical route 2: prediction tree encoding branch:

At the encoding end, the prediction tree structure is established by using two different methods, including: KD-Tree (high-latency slow mode) and using the laser radar calibration information to divide each point into different Lasers and establish a prediction structure according to different Lasers (low-latency fast mode). Next, based on the structure of the prediction tree, each node in the prediction tree is traversed, and the geometric position information of the node is predicted by selecting different prediction modes to obtain the prediction residual, and the geometric prediction residual is quantized using the quantization parameter. Finally, through continuous iteration, the prediction residual of the prediction tree node position information, the prediction tree structure, and the quantization parameters are encoded to generate a binary code stream.

At the decoding end, the decoding end reconstructs the prediction tree structure by continuously parsing the bit stream, and then obtains the geometric position prediction residual information and quantization parameters of each prediction node through parsing, and dequantizes the prediction residual to restore the reconstructed geometric position information of each node, and finally completes the geometric reconstruction at the decoding end.

The following is an introduction to the L3C2 encoding and decoding framework.

The L3C2 encoding algorithm is shown in Figure 6:

Input for constructing a single chain structure: voxelized point cloud and prior information of rotating lidar.

The output of constructing a single chain structure includes: the geometric prediction value and prediction residual of the current point, the prediction mode adopted by the current point, the quantization parameter of the current point, the number of repeated points, and the number of skipped points corresponding to each mode.

As shown in Figure 6, the construction of L3C2 is specifically divided into: reordering, coordinate transformation, establishing a single chain structure, selecting a prediction mode, generating a prediction value, encoding the number of repeated points, quantizing the prediction residual, inverse coordinate transformation, and encoding the coordinate transformation residual.

The following are the steps of the L3C2 encoding process:

1) Reorder

The voxelized point cloud is reordered to construct a more efficient single chain structure. The default sorting method is to sort according to the scanning order of the lidar.

Exemplarily, the Cartesian coordinates (x, y, z) of each point are converted into polar coordinates (r, φ, tanθ), and the points are sorted in turn according to the elevation tangent value tanθ, the azimuth angle φ, and the radius r.
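The reordering step can be sketched as follows. The sketch assumes that r is the radius in the horizontal plane, sqrt(x^2 + y^2), that φ is computed with atan2, and that tanθ = z / r; these definitions are not spelled out in this passage and are assumptions, as are the function names.

```python
import math
from typing import List, Tuple


def scan_order_key(point: Tuple[float, float, float]) -> Tuple[float, float, float]:
    """Return (tan_theta, phi, r) for sorting in the assumed scanning order."""
    x, y, z = point
    r = math.hypot(x, y)          # horizontal radius (assumed definition)
    phi = math.atan2(y, x)        # azimuth angle
    if r > 0:
        tan_theta = z / r         # tangent of the elevation angle
    elif z != 0:
        tan_theta = math.copysign(math.inf, z)
    else:
        tan_theta = 0.0
    return (tan_theta, phi, r)


def reorder(points: List[Tuple[float, float, float]]) -> List[Tuple[float, float, float]]:
    """Sort points by elevation tangent, then azimuth, then radius."""
    return sorted(points, key=scan_order_key)
```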

2) Coordinate transformation

According to the sorted results, the point cloud is traversed and the points are converted from Cartesian coordinates (x, y, z) to cylindrical coordinates (r, φ, i) according to the following formula (4) and stored.

Among them, i is the LaserID corresponding to the point (a typical laser radar system may have 16, 32 or 64 Laser Scanners, and the prior information of each laser is different, that is, the elevation angle θ and the height zLaser in the vertical direction are different). i is determined by looping the prior information of different Lasers. In each loop, the z component of the point is calculated using the r, prior information and the above conversion formula of the point, and the deviation between the converted z component and the original z component of the point is calculated, and then the point with the smallest deviation is selected from different LaserIDs as the i of the point. This process processes the non-uniform distribution of the point cloud in the vertical direction in space and makes it regular.
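The LaserID search described above can be sketched as a loop over the prior information of each laser. The sketch assumes that the z component predicted for laser i is r·tanθ(i) − zLaser(i); the sign convention of the height offset is an assumption and may differ from the conversion formula used in the embodiment. The function and parameter names are hypothetical.

```python
import math
from typing import Sequence, Tuple


def select_laser_id(x: float, y: float, z: float,
                    tan_thetas: Sequence[float],
                    z_lasers: Sequence[float]) -> int:
    """Pick the laser whose predicted z deviates least from the original z."""
    r = math.hypot(x, y)
    best_id, best_dev = 0, math.inf
    for laser_id, (tan_theta, z_laser) in enumerate(zip(tan_thetas, z_lasers)):
        z_est = r * tan_theta - z_laser  # assumed conversion, see lead-in
        dev = abs(z - z_est)
        if dev < best_dev:
            best_id, best_dev = laser_id, dev
    return best_id


def cartesian_to_cylindrical(x: float, y: float, z: float,
                             tan_thetas: Sequence[float],
                             z_lasers: Sequence[float]) -> Tuple[float, float, int]:
    """Convert (x, y, z) to (r, phi, i) using the laser prior information."""
    return (math.hypot(x, y), math.atan2(y, x),
            select_laser_id(x, y, z, tan_thetas, z_lasers))
```

Replacing the continuous elevation by an integer index i is what makes the vertical distribution of the points regular.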

3) Establish a single chain structure

FIG. 7A is a single chain structure obtained by sorting the points in the point cloud based on the cylindrical coordinates (r, φ, i).

Since the lidar point cloud is obtained by scanning the object with a laser scanner (the scanning diagram is shown in Figure 8), the rotation interval of each Laser can be obtained from the acquisition parameters of the laser scanner. By using this rotation interval to regularize the single-chain structure shown in FIG. 7A in the vertical direction (i.e., the azimuth φ direction), the regularized single-chain structure shown in FIG. 7B is obtained.

For example, the single-chain structure shown in FIG. 7A can be regularized in the vertical direction by the following formula (5):

Among them, round() is the rounding function.

In the embodiment of the present application, the entire encoding process is performed using the regularized single-chain structure as an example.

After obtaining the regularized structure shown in FIG. 7B, the order o(P) of each point in the single chain structure is calculated.

Exemplarily, the o(P) corresponding to each point in the single-chain structure is calculated by the following formula (6):

Among them, N_laser represents the number of Lasers, i represents the LaserID corresponding to the current point, and the remaining variable represents the regularized azimuth of the current point.

After obtaining the order of each point in the single chain structure, i.e., o(P), o(P) is encoded.
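The regularization and ordering steps can be sketched as follows. Both functions are assumptions made for illustration: the regularized azimuth is taken to be the azimuth divided by the rotation interval and rounded to the nearest integer, and the order is taken to be the regularized azimuth multiplied by N_laser plus the LaserID, which is one natural way to combine the quantities named above. The names phi_step, regularize_azimuth and order_in_chain are hypothetical.

```python
import math


def regularize_azimuth(phi: float, phi_step: float) -> int:
    """Round phi to the nearest multiple of the rotation interval (as an index)."""
    # floor(x + 0.5) is used instead of round() to avoid round-half-to-even.
    return math.floor(phi / phi_step + 0.5)


def order_in_chain(phi_index: int, laser_id: int, n_laser: int) -> int:
    """Assumed order of a point: azimuth index first, LaserID second."""
    return phi_index * n_laser + laser_id
```

Under this assumption, all points captured at the same rotation step occupy consecutive positions in the chain, one slot per laser.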

4) Prediction mode selection

Exemplarily, each point is predicted and encoded according to the structure shown in FIG9. A prediction list is established for each point, which includes N (assuming N=5) reference values. The best prediction mode predMode is selected by using the rate-distortion optimization criterion at the encoding end, and the prediction mode needs to be encoded.

After encoding, the values of the prediction list need to be updated. The update criteria are as follows:

Criterion 1: When the prediction residual is greater than a certain threshold (Th), the current point is updated to the prediction list and the last prediction value is deleted.

Criterion 2: When the prediction residual is less than or equal to a certain threshold (Th), the selected prediction value is deleted and the current point is updated to the prediction list.
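The two update criteria can be sketched as follows. The sketch assumes that the current point is inserted at the front of the prediction list and that the residual is compared by magnitude; neither detail is stated above, and the function name is hypothetical.

```python
from typing import List


def update_prediction_list(pred_list: List[float], selected_index: int,
                           current: float, residual: float,
                           threshold: float) -> List[float]:
    """Return the prediction list after coding one point."""
    updated = list(pred_list)
    if abs(residual) > threshold:
        # Criterion 1: drop the last reference value.
        updated.pop()
    else:
        # Criterion 2: drop the reference value that was selected as predictor.
        updated.pop(selected_index)
    # Assumed position: the current point becomes the first reference value.
    updated.insert(0, current)
    return updated
```

In both cases the list length stays at N, so the decoder can mirror the update without any extra signaling.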

5) Quantization of prediction residuals

Based on the prediction mode and prediction list determined above, the prediction value of the current point is obtained; for example, the predicted radius is denoted r_pred.

Based on the predicted value and the original value of the current point, the prediction residual value of the current point is determined; for example, the radius residual is r_res = r − r_pred.

Next, the prediction residual value of the current point is quantized.

For example, the determination principle of the quantization factors corresponding to x and y is shown in FIG. 10, and the determination principle of the quantization factors corresponding to r and φ is shown in FIG. 11.

For example, the quantization factors Δr and Δφ corresponding to r and φ are calculated according to the following formula (7) and formula (8), respectively:

Δr = ΔQ / (|cos φ_c| + |sin φ_c|) (7)

Δφ = Δr / r (8)

After determining the quantization factors Δr and Δφ, the prediction residual value of the current point is quantized, for example,

Qr_res = round(r_res / Δr)

and the quantized prediction residual value is encoded.
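Formulas (7) and (8) and the rounding step can be put together in a short sketch. The function names are illustrative; the values passed for φ_c and r are treated as plain inputs because their exact choice is governed by FIG. 10 and FIG. 11, and Python's built-in round() resolves exact ties to the nearest even integer, which may differ from the rounding function of the embodiment.

```python
import math
from typing import Tuple


def quantization_factors(delta_q: float, phi_c: float, r: float) -> Tuple[float, float]:
    """Return (delta_r, delta_phi) following formulas (7) and (8)."""
    delta_r = delta_q / (abs(math.cos(phi_c)) + abs(math.sin(phi_c)))
    delta_phi = delta_r / r
    return delta_r, delta_phi


def quantize_residual(residual: float, step: float) -> int:
    """Quantize a prediction residual with the given quantization factor."""
    return round(residual / step)


def reconstruct(pred: float, q_residual: int, step: float) -> float:
    """Dequantize the residual and add it back to the prediction."""
    return pred + q_residual * step
```

For example, with ΔQ = 1 and φ_c = 0 the radius step is 1, so a residual of 2.3 is coded as 2 and the reconstruction error stays within half a step.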

6) Encoding of repeated points

The number of duplicate points of the current point in Cartesian coordinates is encoded.

7) Coordinate inverse transformation

The reconstructed cylindrical coordinates of the current point are converted to Cartesian coordinates according to formula (9), which gives the predicted Cartesian coordinates of the current point. The reconstructed radius is obtained as:

r_rec = r_pred + Qr_res × Δr

Among them, i is the LaserID corresponding to the point, and the prior information of each laser is different, that is, the elevation angle θ and the height zLaser in the vertical direction are different. Therefore, the elevation angle corresponding to the i-th Laser is θ(i), and the height in the vertical direction is zLaser(i).

8) Coding of coordinate transformation residuals

The coordinate transformation residual (Δx, Δy, Δz) is obtained from the coordinates (x, y, z) of the current point and the coordinates obtained by the inverse coordinate transformation. In the case of lossy geometry coding, the Δx and Δy residual components are not encoded, and the Δz residual is quantized and encoded. In the case of lossless encoding, the coordinate transformation residual needs to be encoded.

The decoding process of L3C2 is introduced below.

Input for reconstructing the single-chain structure: decoded data (including the prediction mode adopted by the current point, the prediction residual of the current point, the quantization parameter of the current point, the number of repeated points, the order of each point), and the prior information of the rotating lidar.

Output of reconstructing a single chain structure: reconstructed voxelized point cloud.

As shown in Figure 12, the reconstruction of the single chain structure is divided into five steps: generating prediction values, reconstructing the single chain structure, decoding the number of repeated points, inverse coordinate transformation, and reconstructing the geometric point cloud. These are introduced below:

1) Generate predicted values

According to the prediction mode adopted by the current point obtained by decoding and the reconstructed cylindrical coordinates of its ancestors, the cylindrical coordinates of the current point are predicted and the corresponding prediction values are generated. The specific process is as follows:

For example, predictive decoding is performed on each point according to the structure shown in FIG. 9. A prediction list is established for each point, which contains N (assuming N=5) reference values. The prediction mode predMode of the current point is obtained by parsing the bitstream at the decoding end, and then the cylindrical coordinate prediction value of the current point is obtained from the prediction list using the prediction mode predMode.

The predicted value of the Cartesian coordinates is obtained by inverse transformation of the reconstructed cylindrical coordinates of the point.

The prediction residual components in the cylindrical coordinate system are dequantized and added to the predicted value to obtain the reconstructed cylindrical coordinates of the current point.

2) Reconstruction of L3C2 structure

The predicted value of the cylindrical coordinates of the current point is obtained through the previous step. Next, according to the following formula (10), the cylindrical coordinate residual obtained by decoding and the predicted cylindrical coordinates of the current point are used to calculate the reconstructed cylindrical coordinates of the current point:

Δr = ΔQ / (|cos φ_c| + |sin φ_c|)

Δφ = Δr / r

r_rec = r_pred + Qr_res × Δr

By parsing the order o(P) of the current point in the single chain structure, the position of the current point in the L3C2 structure can be determined, and the cylindrical coordinates (r, φ, i) can be reconstructed so as to reconstruct the L3C2 structure.

3) Decoding the number of repeated points

The number of duplicate points of the current point is decoded.

4) Coordinate inverse transformation

The reconstructed cylindrical coordinates (r, φ, i) of the current point, that is, the reconstructed cylindrical coordinates mentioned above, are converted to Cartesian coordinates according to formula (11), which gives the predicted Cartesian coordinates of the current point.

5) Reconstruct geometric point cloud

Exemplarily, according to formula (12), the decoded Cartesian coordinate residual (r_x, r_y, r_z) and the predicted Cartesian coordinates of the current point are used to compute the reconstructed Cartesian coordinates (x, y, z) of the current point.

Finally, the reconstructed voxelized point cloud is obtained.

The above introduces the encoding and decoding method based on L3C2. It can be seen from the above that in the encoding and decoding based on L3C2, for each node in L3C2, the repeated point information of the node needs to be encoded and decoded, but not every node has repeated points. Therefore, encoding and decoding repeated point information for each node increases the complexity of encoding and decoding, wastes a lot of encoding and decoding time, and makes the encoding and decoding efficiency low.

In order to solve the above technical problem, in the embodiment of the present application, during L3C2-based point cloud encoding and decoding, the total number of repeated points in the point cloud is first determined from the total number of points in the point cloud and the total number of L3C2 nodes of the point cloud. During encoding and decoding, the number of repeated points that have already been encoded or decoded is recorded in real time and compared with the total number of repeated points in the point cloud, in order to determine whether the repeated point information of the current node needs to be encoded or decoded. For example, if, when encoding or decoding the current node, the number of repeated points already encoded or decoded is equal to the total number of repeated points in the point cloud, all repeated points in the point cloud have been processed and none of the remaining nodes includes repeated points. The repeated point information of subsequent nodes therefore does not need to be encoded or decoded, which reduces the encoding and decoding complexity of the point cloud, saves encoding and decoding time, and improves encoding and decoding efficiency.

The point cloud encoding and decoding method involved in the embodiments of the present application is introduced below in conjunction with specific embodiments.

First, taking the decoding end as an example, the point cloud decoding method provided in the embodiment of the present application is introduced.

Fig. 13 is a schematic diagram of a point cloud decoding method according to an embodiment of the present application. The point cloud decoding method according to the embodiment of the present application can be implemented by the point cloud decoding device shown in Fig. 3 or Fig. 5 above.

As shown in FIG. 13, the point cloud decoding method of the embodiment of the present application includes:

S101, decoding the geometric code stream of the point cloud to obtain the total number of L3C2 nodes of the point cloud and the total number of points of the point cloud.

As can be seen from the above, the point cloud includes geometric information and attribute information, and the decoding of the point cloud includes geometric decoding and attribute decoding. The embodiment of the present application relates to geometric decoding of point clouds.

In some embodiments, the geometric information of the point cloud is also referred to as the position information of the point cloud. Therefore, the geometric decoding of the point cloud is also referred to as the position decoding of the point cloud.

In the L3C2-based encoding method, the encoder constructs the L3C2 structure of the point cloud based on the geometric information of the point cloud. The L3C2 structure is a chain structure composed of at least one single chain structure, and each single chain structure includes at least one node. A node includes at least one point in the point cloud, that is, in the L3C2 encoding, the points in the point cloud are divided into nodes in L3C2.

In some embodiments, since some points in the point cloud have the same coordinate information, these points with the same coordinates are divided into the same node of L3C2, so that the node includes repeated points.

In some embodiments, when constructing L3C2 of a point cloud, it is necessary to perform coordinate conversion on the points in the point cloud. For example, when converting the coordinates of the points in the point cloud in the second coordinate system to the coordinates in the first coordinate system, the coordinates of the points with different coordinates in the second coordinate system may become the same when converted to the first coordinate system. In this way, when constructing L3C2 based on the coordinates of the points in the first coordinate system, the points with the same coordinates in the first coordinate system will be divided into one node, so that the node includes duplicate points.
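To illustrate how coordinate conversion can produce duplicate points, the following minimal sketch converts Cartesian points to quantized cylindrical coordinates and groups points whose converted coordinates coincide. The quantization steps `r_step` and `phi_step`, the helper names, and the sample points are assumptions made purely for illustration; the application does not specify them.

```python
import math
from collections import defaultdict

def to_quantized_cylindrical(x, y, laser_index, r_step=1.0, phi_step=0.1):
    """Convert (x, y) to quantized (r, phi) and keep the laser index i.

    r_step and phi_step are hypothetical quantization steps.
    """
    r = math.hypot(x, y)
    phi = math.atan2(y, x)
    return (round(r / r_step), round(phi / phi_step), laser_index)

def group_into_nodes(points):
    """Group points with identical converted coordinates into one node."""
    nodes = defaultdict(list)
    for x, y, laser_index in points:
        key = to_quantized_cylindrical(x, y, laser_index)
        nodes[key].append((x, y, laser_index))
    return dict(nodes)

# Two distinct Cartesian points that fall into the same cylindrical cell,
# plus one point that stays in a node of its own.
points = [(100.0, 0.0, 3), (100.0, 1.0, 3), (50.0, 50.0, 3)]
nodes = group_into_nodes(points)
duplicate_points = len(points) - len(nodes)
```

In this sketch the first two points differ in the Cartesian system but share one node after conversion, so the node holds one duplicate point.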

Of course, there may be other situations in which nodes in the L3C2 of the point cloud include duplicate points.

At present, when encoding each node in L3C2, the encoder needs to encode the repeated point information of each node. Correspondingly, when decoding, the decoder decodes the repeated point information of each node. This will increase the complexity of encoding and decoding, waste encoding and decoding time, and reduce encoding and decoding efficiency.

From the above, it can be seen that in the L3C2 of the point cloud, not every node has duplicate points; typically only a few nodes include duplicate points. Based on this, in the embodiment of the present application, the decoding end first determines the total number of duplicate points included in the point cloud and counts the number of decoded duplicate points during the decoding process. Before decoding each node, it first determines whether the number of currently decoded duplicate points is equal to the total number of duplicate points included in the point cloud. If the two are equal, all duplicate points in the point cloud have been decoded, and the remaining nodes to be decoded in L3C2 do not include duplicate points. When these nodes are subsequently decoded, their duplicate point information is no longer decoded, thereby reducing the decoding complexity of the point cloud, saving decoding time, and improving decoding efficiency.

The embodiment of the present application does not limit the specific method for the decoding end to obtain the total number of L3C2 nodes of the point cloud and the total number of points of the point cloud.

In some embodiments, the encoder writes the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud into the geometry stream. In this way, the decoder obtains the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud by decoding the geometry stream.

In some embodiments, the encoding end writes the number of single chains included in the point cloud and the number of nodes included in each single chain into the geometric code stream. In this way, the decoding end decodes the geometric code stream of the point cloud to obtain the number of single chains headsCount included in the point cloud and the number of nodes included in each single chain, and then obtains the total number of nodes of L3C2 based on the number of single chains and the number of nodes included in each single chain. For example, the sum of the number of nodes included in each single chain is determined as the total number of nodes of L3C2.

In an example of this embodiment, the nodes on each single chain included in the point cloud can be added together to obtain the total number of nodes nodeCount of L3C2 through the following instructions:

nodeCount = 0;
for (index = 0; index < headsCount; index++)
    nodeCount += nodeCnt;  // nodeCnt: number of nodes on the index-th single chain
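The summation above can be sketched as follows. The function name and the list `nodes_per_chain` are hypothetical; the sketch assumes that the per-chain node counts have already been decoded from the geometric code stream.

```python
def total_l3c2_nodes(nodes_per_chain):
    """Return nodeCount as the sum of the node counts of all single chains.

    len(nodes_per_chain) plays the role of headsCount.
    """
    node_count = 0
    for node_cnt in nodes_per_chain:
        node_count += node_cnt
    return node_count

# Hypothetical decoded values: 3 single chains with 4, 2 and 5 nodes.
node_count = total_l3c2_nodes([4, 2, 5])
```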

In some embodiments, the encoding end writes the total number of points of the point cloud into the geometric data unit of the point cloud. In this way, the decoding end obtains the total number of points of the point cloud by decoding the geometric data unit of the point cloud.

In some embodiments, since the point cloud includes at least one point, in order to save code words, when encoding the total number of points in the point cloud, the encoder subtracts one from the total number of points in the point cloud to obtain a first value, and writes the first value into the geometry data unit, that is, the first value is represented as slice_num_points_minus1. Correspondingly, when decoding, the decoder decodes the geometry data unit to obtain the first value, and then adds 1 to the first value to obtain the total number of points in the point cloud.

In one example, the footer syntax of the geometry data unit (Geometry data unit footer syntax) is shown in Table 2:

Table 2

In Table 2, slice_num_points_minus1 plus 1 is used to specify the total number of points in the point cloud. Bitstream conformance requires that slice_num_points_minus1 plus 1 should be equal to the number of decodable points. Decoders should not rely on bitstream conformance to prevent implementation buffer overflows.

That is, the decoding end obtains the first value slice_num_points_minus1 by decoding the geometric data unit shown in Table 2, and adds 1 to the first value to obtain the total number of points in the point cloud.
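The minus-one convention can be sketched as follows. The function names are hypothetical, and the sketch only models the arithmetic, not the actual bitstream syntax or entropy coding.

```python
def encode_point_count(total_points):
    """Encoder side: map the total number of points to slice_num_points_minus1."""
    if total_points < 1:
        raise ValueError("a point cloud includes at least one point")
    return total_points - 1

def decode_point_count(slice_num_points_minus1):
    """Decoder side: recover the total number of points from the first value."""
    return slice_num_points_minus1 + 1

first_value = encode_point_count(1000)
total_points = decode_point_count(first_value)
```

Because the total is never zero, shifting the value by one lets the smallest valid point cloud be signalled with the value 0.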

Based on the above method, the decoding end obtains the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud, and then executes the following step S102.

S102. Determine the total number of repeated points in the point cloud according to the total number of points in the point cloud and the total number of nodes in L3C2.

After the decoding end determines the total number of L3C2 nodes of the point cloud and the total number of points in the point cloud, it determines the total number of repeated points in the point cloud based on the total number of points in the point cloud and the total number of L3C2 nodes, and then uses the total number of repeated points in the point cloud as supervision for subsequent decoding.

The embodiment of the present application does not limit the specific method in which the decoding end determines the total number of repeated points in the point cloud based on the total number of nodes of L3C2 of the point cloud and the total number of points in the point cloud.

In a possible implementation, some point information may be lost or damaged during data transmission or encoding. The decoding end may perform shallow decoding on the geometry stream to determine the number of nodes included in the geometry stream, and then determine the total number of repeated points in the point cloud based on the number of nodes included in the geometry stream, the total number of points in the point cloud determined in the above steps, and the total number of nodes in L3C2.

In another possible implementation, the difference between the total number of points in the point cloud and the total number of nodes in L3C2 is directly determined as the total number of repeated points in the point cloud.

Exemplarily, the decoding end determines the total number of repeated points in the point cloud by the following formula (13):

dupSumNum = (slice_num_points_minus1 + 1) - nodeCount (13)

where dupSumNum is the total number of duplicate points in the point cloud, slice_num_points_minus1 + 1 is the total number of points in the point cloud, and nodeCount is the total number of nodes in L3C2.
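As a worked example of formula (13), the following sketch computes the total number of duplicate points from the two decoded quantities. The function name, the consistency check, and the sample values are assumptions for illustration.

```python
def total_duplicate_points(slice_num_points_minus1, node_count):
    """Apply formula (13): dupSumNum = (slice_num_points_minus1 + 1) - nodeCount."""
    total_points = slice_num_points_minus1 + 1
    dup_sum_num = total_points - node_count
    if dup_sum_num < 0:
        # Each node holds at least one point, so this would be inconsistent input.
        raise ValueError("node count exceeds total number of points")
    return dup_sum_num

# Hypothetical values: 1000 points spread over 990 nodes.
dup_sum_num = total_duplicate_points(slice_num_points_minus1=999, node_count=990)
```

Each node contributes exactly one non-duplicate point, which is why the difference between the two totals counts the duplicates.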

Based on the above steps, after the decoding end determines the total number of repeated points in the point cloud, it starts decoding each node in L3C2, as shown in S103 below.

S103. When decoding the current node in L3C2, determine the number of decoded duplicate points.

In the embodiment of the present application, the decoding end decodes each node in L3C2 in the same manner. For ease of description, the current node in L3C2 is taken as an example for illustration.

In one example, dupCount represents the number of duplicate points currently decoded, and dupCount is initialized to 0. When decoding the first node in L3C2, dupCount=0, which is less than the total number of duplicate points dupSumNum in the point cloud. When decoding the first node, it is necessary to decode the duplicate point information of the first node. Optionally, the duplicate point information of the first node includes the number of duplicate points included in the first node. Assuming that the number of duplicate points included in the first node is a, dupCount is updated to a. Next, decode the second node in L3C2. Before decoding the second node, first determine whether dupCount=a is equal to the total number of duplicate points dupSumNum in the point cloud. If not, continue to decode the duplicate point information of the second node, and use the number of duplicate points included in the second node to update the current dupCount, and so on.

As can be seen from the above, when the decoding end decodes the current node in L3C2, it first determines the number of duplicate points decoded before decoding the current node, that is, the number of decoded duplicate points dupCount, and then determines whether to decode the duplicate point information of the current node based on the number of decoded duplicate points dupCount.

In one example, the decoding end records the number of decoded duplicate points through the following instructions:

dupCount = 0;
for (index = 0; index < nodeCount; index++)
    dupCount += dupPointNum;  // dupPointNum: duplicate points in the index-th node

where dupPointNum represents the number of duplicate points included in the node currently being decoded.
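The running count can be sketched as follows. The function name and the input list are hypothetical; the sketch records the value of dupCount that the decoder would see before decoding each node, which is the value used for the comparison with dupSumNum.

```python
def running_duplicate_counts(dup_point_nums):
    """Return dupCount as seen before each node, and its final value."""
    dup_count = 0  # dupCount is initialized to 0
    seen_before = []
    for dup_point_num in dup_point_nums:
        seen_before.append(dup_count)
        dup_count += dup_point_num
    return seen_before, dup_count

# Hypothetical per-node duplicate counts for four nodes.
before, final = running_duplicate_counts([2, 0, 1, 0])
```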

S104. Decode the current node according to the number of decoded duplicate points and the total number of duplicate points in the point cloud.

When decoding the current node, the decoding end compares the number of decoded duplicate points with the total number of duplicate points in the point cloud to decode the current node.

Specifically, the implementation process of the above S104 includes the following situations:

Case 1: If the number of decoded duplicate points is equal to the total number of duplicate points in the point cloud, then the above S104 includes S104-A1 and S104-A2:

S104-A1, skip decoding repeated point information of the current node;

S104-A2. Determine the geometric reconstruction value of the first point included in the current node.

In case 1, when decoding the current node, if it is determined that the number of decoded duplicate points is equal to the total number of duplicate points in the point cloud, it means that all duplicate points have been decoded and the remaining nodes of L3C2 do not include duplicate points. Based on this, when decoding the current node, the decoding of duplicate point information is skipped, thereby reducing decoding complexity, saving decoding time, and improving decoding efficiency.

At this point, it can be determined that the current node includes only one point, which is recorded as the first point. Then, the geometric reconstruction value of the first point is determined.

It should be noted that the process of determining the geometric reconstruction value of the first point may specifically refer to the description of the following steps 11 to 13, which will not be repeated here.

The above describes the process of skipping decoding the duplicate point information of the current node and determining the geometric reconstruction value of the first point included in the current node for case 1, if the number of decoded duplicate points is equal to the total number of duplicate points in the point cloud.

Case 2: If the duplicate point information includes the number of duplicate points included in the current node, the number of decoded duplicate points is less than the total number of duplicate points in the point cloud, and the current node is the last node in L3C2, the above S104 includes S104-C1 and S104-C2:

S104-C1, skip decoding repeated point information of the current node;

S104-C2. Determine the difference between the total number of repeated points in the point cloud and the number of decoded repeated points as the number of repeated points included in the current node.

In this case 2, if the number of decoded duplicate points corresponding to the current node is less than the total number of duplicate points in the point cloud, it means that the current node may include duplicate points. Further, if the decoding end determines that the current node is the last node of L3C2, it is determined that the current node must include duplicate points, and the number of duplicate points included in the current node is the difference between the total number of duplicate points in the point cloud and the number of decoded duplicate points. Therefore, in this case 2, if it is determined that the number of decoded duplicate points corresponding to the current node is less than the total number of duplicate points in the point cloud, and the current node is the last node of L3C2, the decoding of the duplicate point information of the current node is skipped, and the difference between the total number of duplicate points in the point cloud and the number of decoded duplicate points is determined as the number of duplicate points included in the current node, thereby reducing the decoding complexity, saving decoding time, and improving decoding efficiency.

In case 2, the decoding end also needs to determine the geometric reconstruction value of the first point included in the current node. For the process of determining the geometric reconstruction value of the first point, refer to the description of steps 11 to 13 below, which is not repeated here.
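The inference used in case 2 can be sketched as follows. The function name and the error handling are assumptions; the sketch only shows that the duplicate count of the last node follows from the two counters without reading the bitstream.

```python
def infer_last_node_duplicates(dup_sum_num, dup_count):
    """Infer the duplicate count of the last L3C2 node without decoding it.

    dup_sum_num: total number of duplicate points in the point cloud.
    dup_count:   number of duplicate points decoded before the last node.
    """
    if dup_count > dup_sum_num:
        raise ValueError("decoded more duplicate points than the cloud contains")
    return dup_sum_num - dup_count

# Hypothetical state: 5 duplicates in total, 3 decoded before the last node.
last_node_dups = infer_last_node_duplicates(dup_sum_num=5, dup_count=3)
```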

Case 3: If the number of decoded duplicate points is less than the total number of duplicate points in the point cloud, and the current node is not the last node in L3C2, the above S104 includes S104-B1 and S104-B2:

S104-B1, decoding repeated point information of the current node;

S104-B2. Based on the repeated point information, determine the geometric reconstruction values of N repeated points included in the current node, where N is an integer.

In the embodiment of the present application, when the encoder determines that the number of repeated points already encoded before the current node is less than the total number of repeated points of the point cloud, and that the current node is not the last node in L3C2, it writes the repeated point information of the current node into the geometric code stream. Correspondingly, when the decoder determines that the number of repeated points already decoded before the current node is less than the total number of repeated points of the point cloud, and that the current node is not the last node in L3C2, it decodes the geometric code stream to obtain the repeated point information of the current node and, based on the repeated point information, determines the geometric reconstruction values of the N repeated points included in the current node.

The embodiment of the present application does not limit the specific content included in the repeated point information.

In some embodiments, the repeated point information includes the number of repeated points included in the current node.

In some embodiments, if N is greater than 0, the repeated point information includes the number of repeated points included in the current node and at least one first flag, where the first flag is used to indicate whether the coordinates of the current point in the current node are the same as those of the previous point in the second coordinate system.

For example, the current node includes three points, which are recorded as point 1, point 2 and point 3 respectively. Point 2 and point 3 each correspond to a first flag, wherein the first flag of point 2 is used to indicate whether the coordinates of point 2 and point 1 in the second coordinate system are the same, and the first flag of point 3 is used to indicate whether the coordinates of point 3 and point 2 in the second coordinate system are the same.

After the decoding end determines the repeated point information of the current node, it executes step S104-B2 to determine the geometric reconstruction value of each of the N repeated points included in the current node based on the repeated point information.

In some embodiments, if the duplicate point information indicates that the coordinate values of the points included in the current node are the same, the geometric reconstruction values of all the points included in the current node are determined with reference to the above method of determining the geometric reconstruction value of the first point.

In some embodiments, the above S104-B2 includes the following steps:

S104-B21, determining the geometric reconstruction value of the first point in the current node;

S104-B22. Determine the geometric reconstruction values of N repeated points based on the repeated point information and the geometric reconstruction value of the first point.

Specifically, when the encoder encodes the current node, the current node includes N+1 points, of which N points are repeated with the first point, and thus it is determined that the current node includes N repeated points. When encoding, the encoder first encodes the first point in the current node, for example, the coordinate residual value of the first point in the first coordinate system, the coordinate residual value of the first point in the second coordinate system, and the prediction mode corresponding to the first point and the order o(P) in the single chain are encoded. At the same time, the repeated point information corresponding to the current node is determined, and the repeated point information is encoded into the bitstream.

Based on this, the decoding end first determines the geometric reconstruction value of the first point in the current node. The process of the decoding end determining the geometric reconstruction value of the first point in the current node can refer to the detailed description of the following steps 11 to 13, which will not be repeated here.

After the decoding end determines the geometric reconstruction value of the first point in the current node, the geometric reconstruction value of each of the N repeated points is determined based on the repeated point information of the current node and the geometric reconstruction value of the first point.

In some embodiments, the repeated point information is parsed, and the first flag corresponding to each repeated point is obtained from the repeated point information. If the first flag of every repeated point indicates that the coordinate value of the repeated point in the second coordinate system is the same as that of the previous point (for example, the value of the first flag of each repeated point is 1), every repeated point also has the same coordinate value as the first point in the second coordinate system. In this case, the decoding end can directly determine the geometric reconstruction value of the first point as the geometric reconstruction value of each of the N repeated points.

In some embodiments, the decoding end decodes each of the N repeated points one by one. In this case, the above S104-B22 includes the following steps:

S104-B22-1. For the i-th repeated point among the N repeated points, parse the repeated point information to obtain a first flag corresponding to the i-th repeated point, where the first flag is used to indicate whether the coordinates of the i-th repeated point and the i-1-th repeated point in the second coordinate system are the same, where i is a positive integer less than or equal to N, and if i is 1, the i-1-th repeated point is the first point;

S104-B22-2. Determine the geometric reconstruction value of the i-th repeated point based on the first flag corresponding to the i-th repeated point.

In this implementation, the method for determining each of the N repeated points is the same. For ease of description, the i-th repeated point among the N repeated points is used as an example for explanation. For the i-th repeated point among the N repeated points, the first flag corresponding to the i-th repeated point is first obtained from the repeated point information of the current node. The first flag indicates whether the i-th repeated point and the previous point, i.e., the i-1-th repeated point, have the same coordinates in the second coordinate system. Exemplarily, if the i-th repeated point is the first repeated point among the N repeated points, i.e., i=1, then the i-1-th repeated point is the first point in the current node.

Next, the decoding end determines the geometric reconstruction value of the ith repeated point based on the first flag corresponding to the ith repeated point.

In some embodiments, if the first flag indicates that the coordinates of the i-th repeated point and the i-1-th repeated point in the second coordinate system are the same, for example, the value of the first flag is 1, then the geometric reconstruction value of the i-1-th repeated point is determined as the geometric reconstruction value of the i-th repeated point.

In some embodiments, if the first flag indicates that the coordinates of the i-th repeated point and the i-1-th repeated point in the second coordinate system are different, the decoding end needs to re-determine the geometric reconstruction value of the i-th repeated point. For example, the decoding end decodes the geometric code stream to obtain the coordinate residual value of the i-th repeated point in the second coordinate system, the coordinate residual value of the i-th repeated point in the first coordinate system, and the prediction mode corresponding to the i-th repeated point. It determines the coordinate prediction value of the i-th repeated point in the first coordinate system based on the prediction mode, and determines the coordinate reconstruction value of the i-th repeated point in the first coordinate system based on that coordinate prediction value and the coordinate residual value in the first coordinate system. It then performs coordinate conversion on the coordinate reconstruction value in the first coordinate system and adds the result to the coordinate residual value of the i-th repeated point in the second coordinate system to obtain the coordinate reconstruction value of the i-th repeated point in the second coordinate system, which is determined as the geometric reconstruction value of the i-th repeated point.

It should be noted that, for the above process of determining the geometric reconstruction value of the i-th repeated point, reference may be made to the description of steps 11 to 13 below, which is not repeated here.

Based on the above steps, the decoding end can determine the geometric reconstruction value of each of the N repeated points.
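The flag-driven reconstruction of steps S104-B22-1 and S104-B22-2 can be sketched as follows. The function name, the tuple representation of a geometric reconstruction value, and the `decode_point` callback (standing in for the full decoding of steps 11 to 13) are all hypothetical.

```python
def reconstruct_duplicates(first_point, first_flags, decode_point):
    """Reconstruct the N repeated points of a node from their first flags.

    first_point:  geometric reconstruction value of the first point.
    first_flags:  one flag per repeated point; 1 means "same coordinates in
                  the second coordinate system as the previous point".
    decode_point: hypothetical callback that fully decodes one point when
                  the flag indicates different coordinates.
    """
    reconstructed = []
    previous = first_point  # for i = 1 the previous point is the first point
    for flag in first_flags:
        current = previous if flag == 1 else decode_point()
        reconstructed.append(current)
        previous = current
    return reconstructed

# Hypothetical node: the second repeated point differs from its predecessor.
decoded_values = iter([(10, 20, 31)])
repeated = reconstruct_duplicates(
    (10, 20, 30), [1, 0, 1], lambda: next(decoded_values)
)
```

When every flag is 1, the callback is never invoked and all repeated points reuse the reconstruction value of the first point.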

In case 3, in addition to determining the geometric reconstruction values of the first point and the N repeated points included in the current node through the above steps, the number of decoded repeated points is also updated based on the number N of repeated points included in the current node. For example, the sum of the number of repeated points included in the current node and the number of decoded repeated points is determined as the new number of decoded repeated points, and the next node is decoded based on this new number.
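The three cases of S104 can be sketched together as a toy encoder and decoder that exchange only the per-node duplicate counts. This is a simplified illustration under stated assumptions: the bitstream is modelled as a plain Python list of integers, entropy coding and first flags are omitted, and all function names are hypothetical.

```python
def encode_duplicate_counts(dup_point_nums):
    """Emit a node's duplicate count only when the decoder cannot infer it."""
    dup_sum_num = sum(dup_point_nums)
    dup_count = 0
    symbols = []
    last = len(dup_point_nums) - 1
    for index, dup_point_num in enumerate(dup_point_nums):
        if dup_count < dup_sum_num and index != last:
            symbols.append(dup_point_num)  # case 3: information is written
        dup_count += dup_point_num
    return symbols

def decode_duplicate_counts(symbols, node_count, dup_sum_num):
    """Recover the per-node duplicate counts following cases 1 to 3."""
    dup_count = 0
    position = 0
    result = []
    for index in range(node_count):
        if dup_count == dup_sum_num:
            dup_point_num = 0  # case 1: skip, node holds a single point
        elif index == node_count - 1:
            dup_point_num = dup_sum_num - dup_count  # case 2: infer
        else:
            dup_point_num = symbols[position]  # case 3: decode
            position += 1
        dup_count += dup_point_num
        result.append(dup_point_num)
    return result

per_node = [0, 2, 0, 1, 0, 0]
symbols = encode_duplicate_counts(per_node)
decoded = decode_duplicate_counts(symbols, len(per_node), sum(per_node))
```

In this example only four of the six nodes carry a symbol, because the counter reaches the total after the fourth node and the remaining nodes fall under case 1.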

The following describes the process of determining the geometric reconstruction value of the single point included in the current node (the first point) in case 1 above, the geometric reconstruction value of the first point when the current node includes multiple points in cases 2 and 3, and the geometric reconstruction value of the i-th repeated point among the N repeated points included in the current node in case 3. That is, the target point in the following steps 11 to 13 can be understood as the first point in case 1, the first point in cases 2 and 3, or the i-th repeated point in case 3.

The specific process of determining the geometric reconstruction value of the target point is introduced below.

Step 11, determine the predicted coordinate value of the target point in the first coordinate system.

In a possible implementation, the coordinate prediction value of the target point in the first coordinate system is determined based on the coordinate reconstruction value of the current decoded point in the first coordinate system. For example, the arithmetic mean or weighted mean of the coordinate reconstruction values of one or more decoded points before the target point in the first coordinate system is determined as the coordinate prediction value of the target point in the first coordinate system.
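The mean-based prediction described above can be sketched as follows. The function name, the tuple representation of coordinates, and the weights are assumptions for illustration; the application does not specify how many decoded points are used or how they are weighted.

```python
def predict_from_decoded(reconstructed, weights=None):
    """Predict coordinates as the arithmetic or weighted mean of decoded points.

    reconstructed: reconstructed first-coordinate-system coordinates of one or
                   more previously decoded points, as equal-length tuples.
    weights:       optional per-point weights; equal weights give the
                   arithmetic mean.
    """
    if not reconstructed:
        raise ValueError("at least one decoded point is required")
    if weights is None:
        weights = [1.0] * len(reconstructed)
    total = sum(weights)
    dims = len(reconstructed[0])
    return tuple(
        sum(w * point[d] for w, point in zip(weights, reconstructed)) / total
        for d in range(dims)
    )

history = [(10.0, 2.0), (12.0, 4.0)]
mean_prediction = predict_from_decoded(history)
weighted_prediction = predict_from_decoded(history, weights=[1.0, 3.0])
```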

In another possible implementation, the above step 11 includes the following steps: step 11-1 and step 11-2:

Step 11-1, decoding the geometric code stream to obtain the prediction mode corresponding to the target point;

Step 11-2: Based on the prediction mode, determine the coordinate prediction value of the target point in the first coordinate system.

When encoding, the encoder encodes the prediction mode corresponding to the target point into the geometric code stream, so that the decoder can decode the geometric code stream to obtain the prediction mode corresponding to the target point, and then determine the coordinate prediction value of the target point in the first coordinate system based on the prediction mode.

The embodiment of the present application does not limit the specific type of prediction mode corresponding to the target point.

In some embodiments, when L3C2 is encoded, it is assumed that the laser radar used to scan the point cloud includes N lasers, each laser corresponds to a prediction list, and the L3C2 accordingly corresponds to N prediction lists. For the target point, it is assumed that the coordinates of the target point in the first coordinate system are (r, φ, i), where i represents the laser index corresponding to the target point, so that the prediction list corresponding to the i of the target point can be determined as the prediction list corresponding to the target point. Assuming that the prediction list corresponding to the target point includes M prediction values, the target point is predicted using each of these M prediction values, the cost corresponding to each of the M prediction values is determined, and the index of the prediction value with the lowest cost in the prediction list is determined as the prediction mode corresponding to the target point and then encoded. In other words, the prediction mode corresponding to the target point is the index of the prediction value with the lowest cost in the prediction list corresponding to the target point. Then, the encoding end determines the order of the target point in the L3C2 single chain, that is, o(P), based on the i of the target point, and encodes o(P) and the prediction mode corresponding to the target point into the bitstream. Correspondingly, the decoding end decodes the bitstream to obtain o(P) and the prediction mode corresponding to the target point, determines the i component of the target point based on o(P), denoted i_rec, and then determines the prediction list corresponding to the target point based on i_rec. Then, the prediction value in the prediction list whose index equals the prediction mode corresponding to the target point is determined as the coordinate prediction value of the target point in the first coordinate system. Optionally, the predicted value includes r_pred and
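The selection of the prediction mode can be sketched as follows. The cost function (sum of absolute residuals), the dictionary of per-laser prediction lists, and all names and sample values are assumptions for illustration; the application does not define how the cost is computed.

```python
def choose_prediction_mode(prediction_list, actual):
    """Encoder side: return the index of the lowest-cost prediction value.

    The cost is assumed here to be the sum of absolute residuals.
    """
    costs = [
        sum(abs(a - p) for a, p in zip(actual, candidate))
        for candidate in prediction_list
    ]
    return min(range(len(costs)), key=costs.__getitem__)

def lookup_prediction(prediction_lists, i_rec, pred_mode):
    """Decoder side: pick the list by laser index i_rec, the entry by mode."""
    return prediction_lists[i_rec][pred_mode]

# Hypothetical prediction lists for two lasers, holding (r, phi) candidates.
prediction_lists = {
    0: [(100, 10), (104, 12), (90, 8)],
    1: [(200, 40), (205, 41)],
}
actual = (103, 12)  # target point scanned by laser 0
pred_mode = choose_prediction_mode(prediction_lists[0], actual)
prediction = lookup_prediction(prediction_lists, 0, pred_mode)
```

Only the index is signalled, so the decoder must hold prediction lists identical to those of the encoder for the lookup to return the same value.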

The embodiment of the present application does not limit the specific type of the first coordinate system.

In one example, the first coordinate system is cylindrical coordinates.
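The prediction-mode selection described above can be sketched as follows. This is a minimal illustration rather than the codec's actual procedure: the function names, the `(r_pred, phi_pred)` tuple layout, and in particular the cost measure (sum of absolute residuals) are assumptions, since the text does not define how the cost is computed.

```python
from typing import Sequence, Tuple

Pred = Tuple[int, int]  # hypothetical (r_pred, phi_pred) pair


def select_prediction_mode(pred_list: Sequence[Pred], r: int, phi: int) -> int:
    """Encoder side: return the index of the lowest-cost entry of one laser's list."""
    # Assumed cost: sum of absolute residuals. The source does not define the cost.
    costs = [abs(r - rp) + abs(phi - pp) for rp, pp in pred_list]
    # On ties, the smallest index wins.
    return min(range(len(costs)), key=costs.__getitem__)


def predictor_for_mode(pred_lists: Sequence[Sequence[Pred]],
                       i_rec: int, mode: int) -> Pred:
    """Decoder side: pick the list of laser i_rec, then the entry indexed by the mode."""
    return pred_lists[i_rec][mode]
```

Because both ends index the same per-laser list, only the index (the prediction mode) needs to be carried in the bitstream.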

Based on the above steps, after the predicted coordinate value of the target point in the first coordinate system is determined, the following step 12 is executed.

Step 12: Determine the coordinate reconstruction value of the target point in the first coordinate system based on the coordinate prediction value of the target point in the first coordinate system.

Based on the above steps, after the predicted coordinate value of the target point in the first coordinate system is determined, the reconstructed coordinate value of the target point in the first coordinate system is determined based on the predicted coordinate value of the target point in the first coordinate system.

In one example, the predicted coordinate value of the target point in the first coordinate system is determined as the reconstructed coordinate value of the target point in the first coordinate system.

In another example, the above step 12 includes the following steps 12-1 to 12-3:

Step 12-1: Decode the geometric code stream to obtain the quantized coordinate residual value of the target point in the first coordinate system;

Step 12-2: Dequantize the quantized coordinate residual value to obtain the coordinate residual value of the target point in the first coordinate system;

Step 12-3: Based on the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system, obtain the coordinate reconstruction value of the target point in the first coordinate system.

In this implementation, when encoding the target point, the encoder determines the coordinate residual value of the target point in the first coordinate system based on the coordinate prediction value of the target point in the first coordinate system, for example, the difference between the coordinate value of the target point in the first coordinate system and the coordinate prediction value is determined as the coordinate residual value of the target point in the first coordinate system. Then, the encoder quantizes the coordinate residual value of the target point in the first coordinate system and encodes it into the bitstream.

Based on this, the decoding end decodes the geometric code stream to obtain the quantized coordinate residual value of the target point in the first coordinate system, and dequantizes the quantized coordinate residual value to obtain the coordinate residual value of the target point in the first coordinate system.

In some embodiments, before dequantizing the quantized coordinate residual value of the target point in the first coordinate system, the decoding end first determines a quantization factor, and then dequantizes the quantized coordinate residual value based on the determined quantization factor.

Assume that decoding the geometric code stream yields the quantized coordinate residual values Q(r_res) and Q(φ_res) of the target point in the first coordinate system.

Based on the above steps, the predicted coordinate value of the target point in the first coordinate system is (r_prd, φ_prd).

Exemplarily, the decoding end determines the quantization factor Δr corresponding to Q(r_res) by the following formula (14):

where ΔQ is determined based on a preset quantization parameter.

Based on the quantization factor Δr determined by the above formula (14), the quantized coordinate residual value Q(r_res) of the target point in the first coordinate system is dequantized to obtain the coordinate residual value r_res of the target point in the first coordinate system.

Next, the decoding end obtains the coordinate reconstruction value r_rec of the target point in the first coordinate system based on the coordinate prediction value r_prd and the coordinate residual value r_res of the target point in the first coordinate system.

Exemplarily, the decoding end determines the sum of the coordinate prediction value r_prd and the coordinate residual value r_res of the target point in the first coordinate system as the coordinate reconstruction value r_rec of the target point in the first coordinate system.

After the decoding end determines the coordinate reconstruction value r_rec of the target point in the first coordinate system according to the above method, the decoding end determines, based on r_rec, the quantization factor Δφ corresponding to Q(φ_res).

In one example, the decoding end determines the quantization factor Δφ corresponding to Q(φ_res) by the following formula (15):

Then, based on the quantization factor Δφ determined by the above method, Q(φ_res) is dequantized to obtain the coordinate residual value φ_res of the target point in the first coordinate system.

Next, the decoding end obtains the coordinate reconstruction value φ_rec of the target point in the first coordinate system based on the coordinate prediction value φ_prd and the coordinate residual value φ_res of the target point in the first coordinate system.

Exemplarily, the decoding end determines the sum of the coordinate prediction value φ_prd and the coordinate residual value φ_res of the target point in the first coordinate system as the coordinate reconstruction value φ_rec.

Based on the above method, the coordinate reconstruction value of the target point in the first coordinate system can be determined.
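The order of operations in steps 12-1 to 12-3 (reconstruct the radius first, then derive the second quantization factor from the reconstructed radius) can be sketched as follows. Several things here are assumptions: the second component is written `phi`, dequantization is modelled as plain multiplication by the quantization factor, and `delta_phi_of` is a hypothetical caller-supplied stand-in for formula (15), which is not reproduced in this sketch.

```python
from typing import Callable, Tuple


def dequantize(q_res: int, step: int) -> int:
    """Uniform scalar dequantization (assumed; the text only says 'dequantize')."""
    return q_res * step


def reconstruct_first_system(
    r_prd: int, phi_prd: int,
    q_r_res: int, q_phi_res: int,
    delta_r: int,
    delta_phi_of: Callable[[int], int],
) -> Tuple[int, int]:
    """Return (r_rec, phi_rec) from predictions and quantized residuals."""
    r_res = dequantize(q_r_res, delta_r)
    r_rec = r_prd + r_res                 # prediction + residual
    delta_phi = delta_phi_of(r_rec)       # depends on the reconstructed radius
    phi_res = dequantize(q_phi_res, delta_phi)
    phi_rec = phi_prd + phi_res
    return r_rec, phi_rec
```

The dependency of the second factor on r_rec is why the radius has to be reconstructed before the second residual can be dequantized.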

Next, the decoding end executes the following step 13.

Step 13: Determine the coordinate reconstruction value of the target point in the second coordinate system based on the coordinate reconstruction value of the target point in the first coordinate system.

The embodiment of the present application does not limit the specific types of the first coordinate system and the second coordinate system.

The embodiment of the present application does not limit the specific method of determining the coordinate reconstruction value of the target point in the second coordinate system based on the coordinate reconstruction value of the target point in the first coordinate system.

Mode 1: The decoding end converts the coordinate reconstruction value of the target point in the first coordinate system into the coordinate reconstruction value of the target point in the second coordinate system based on the conversion relationship between the first coordinate system and the second coordinate system.

Mode 2: The decoding end determines the coordinate reconstruction value of the target point in the second coordinate system through the following steps 13-1 to 13-3:

Step 13-1, performing coordinate transformation on the coordinate reconstruction value of the target point in the first coordinate system to obtain the coordinate prediction value of the target point in the second coordinate system.

Specifically, based on the conversion relationship between the first coordinate system and the second coordinate system, the coordinate reconstruction value of the target point in the first coordinate system is transformed to obtain the coordinate prediction value of the target point in the second coordinate system.

Exemplarily, assuming that the first coordinate system is a cylindrical coordinate system and the second coordinate system is a Cartesian coordinate system, the coordinate prediction value of the target point in the second coordinate system is determined based on the following formula (16):

where θ is the elevation angle of the laser corresponding to i_rec, and zLaser is the height of the laser corresponding to i_rec in the vertical direction.

Step 13-2: Decode the geometric code stream to obtain the coordinate residual value of the target point in the second coordinate system.

In the embodiment of the present application, when encoding the target point, the encoding end determines the coordinate reconstruction value of the target point in the first coordinate system based on the coordinate prediction value of the target point in the first coordinate system, and transforms the coordinate reconstruction value of the target point in the first coordinate system to obtain the coordinate prediction value of the target point in the second coordinate system. Then, the encoding end determines the coordinate residual value of the target point in the second coordinate system based on the coordinate prediction value and the coordinate value of the target point in the second coordinate system. The encoding end writes the coordinate residual value of the target point in the second coordinate system into the geometric code stream.

Based on this, the decoding end decodes the geometric code stream to obtain the coordinate residual value of the target point in the second coordinate system.

In some embodiments, if the encoder quantizes the coordinate residual value of the target point in the second coordinate system before writing it into the geometric code stream, the decoder parses the quantized coordinate residual value of the target point in the second coordinate system from the geometric code stream and then dequantizes it to obtain the coordinate residual value of the target point in the second coordinate system.

Step 13-3: Based on the predicted coordinate value and the residual coordinate value of the target point in the second coordinate system, obtain the reconstructed coordinate value of the target point in the second coordinate system.

For example, the sum of the predicted coordinate value and the residual coordinate value of the target point in the second coordinate system is determined as the reconstructed coordinate value of the target point in the second coordinate system.

Exemplarily, based on the following formula (17), the coordinate reconstruction value (x, y, z) of the target point in the second coordinate system is determined:

where (r_x, r_y, r_z) is the coordinate residual value of the target point in the second coordinate system.
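Steps 13-1 and 13-3 can be illustrated with the short sketch below, assuming a cylindrical first coordinate system and a Cartesian second coordinate system. It is not a reproduction of formulas (16) and (17): the floating-point angle in radians, the rounding to integers, the per-laser tables `tan_theta` and `z_laser`, and the sign with which the laser height enters z are all assumptions made for illustration.

```python
import math
from typing import Sequence, Tuple


def predict_cartesian(r_rec: float, phi_rec: float, i_rec: int,
                      tan_theta: Sequence[float],
                      z_laser: Sequence[float]) -> Tuple[int, int, int]:
    """Step 13-1: transform (r_rec, phi_rec, i_rec) into a Cartesian prediction."""
    x = r_rec * math.cos(phi_rec)
    y = r_rec * math.sin(phi_rec)
    # Assumed sign convention: r * tan(theta) minus the laser's height offset.
    z = r_rec * tan_theta[i_rec] - z_laser[i_rec]
    return round(x), round(y), round(z)


def reconstruct_cartesian(pred: Tuple[int, int, int],
                          residual: Tuple[int, int, int]) -> Tuple[int, int, int]:
    """Step 13-3: add the decoded residual (r_x, r_y, r_z) to the prediction."""
    return tuple(p + r for p, r in zip(pred, residual))
```

The residual decoded in step 13-2 absorbs the rounding error introduced by the coordinate transform, which is what allows the Cartesian coordinates to be recovered exactly.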

The above embodiment is described by taking the geometric decoding process of the current node in L3C2 as an example. The decoding process of other nodes in L3C2 can refer to the decoding process of the current node, and then the reconstructed point cloud geometric information can be obtained.

The point cloud decoding method provided in the embodiment of the present application decodes the geometric code stream of the point cloud to obtain the total number of L3C2 nodes of the point cloud and the total number of points of the point cloud. Based on the total number of points of the point cloud and the total number of L3C2 nodes of the point cloud, the total number of repeated points included in the point cloud is determined. During decoding, the number of decoded repeated points is recorded in real time and compared with the total number of repeated points included in the point cloud, so as to determine, when decoding the current node, whether to decode the repeated point information of that node. For example, if it is determined when decoding the current node that the number of decoded repeated points is equal to the total number of repeated points in the point cloud, the repeated points in the point cloud have all been decoded and the remaining nodes do not include repeated points, so there is no need to decode the repeated point information of subsequent nodes. This reduces the decoding complexity of the point cloud, saves decoding time, and improves decoding efficiency.
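The decoder-side counting logic summarized above can be sketched as follows. The iterator `symbols` is a hypothetical stand-in for the values an entropy decoder would yield, the function name is illustrative, and any special handling of the last node is deliberately left out of this sketch.

```python
from typing import Iterator, List


def decode_dup_counts(symbols: Iterator[int], node_count: int,
                      point_count: int) -> List[int]:
    """Return the number of repeated points recovered for each node."""
    dup_sum_num = point_count - node_count  # total repeated points in the cloud
    dup_count = 0                           # repeated points decoded so far
    per_node = []
    for _ in range(node_count):
        if dup_count == dup_sum_num:
            dup = 0                # all repeated points seen: skip parsing
        else:
            dup = next(symbols)    # parse this node's repeated-point count
            dup_count += dup
        per_node.append(dup)
    return per_node
```

Once the running count reaches the total, no further symbols are consumed, which is where the saving in decoding time comes from.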

The above takes the decoding end as an example to introduce in detail the point cloud decoding method provided in the embodiment of the present application. The following takes the encoding end as an example to introduce the point cloud encoding method provided in the embodiment of the present application.

Fig. 14 is a schematic diagram of a point cloud coding method according to an embodiment of the present application. The point cloud coding method according to the embodiment of the present application can be implemented by the point cloud coding device shown in Fig. 3 or Fig. 4 above.

As shown in FIG. 14 , the point cloud encoding method of the embodiment of the present application includes:

S201. Determine the L3C2 structure of the point cloud.

Specifically, the encoding end reorders the points in the point cloud based on the coordinate information of the points in the point cloud, constructs a single chain structure based on the coordinate information of the reordered points, and then obtains the L3C2 structure of the point cloud.

The embodiment of the present application does not limit the method of reordering the point cloud.

In some embodiments, the voxelized point cloud is reordered to construct a more efficient single chain structure, and the default sorting method is to sort according to the scanning order of the lidar.

Exemplarily, the coordinate value of each point in the point cloud in the second coordinate system, such as the coordinate value (x, y, z) in the Cartesian coordinate system, is converted into a coordinate value in the third coordinate system. For example, if the third coordinate system is the polar coordinate system, the corresponding coordinate value is (r, φ, tanθ). Then, the points are sorted according to the elevation tangent value tanθ, the azimuth angle φ, and the radius r in the polar coordinate system.
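The reordering described above amounts to a lexicographic sort on (tanθ, φ, r). The sketch below assumes that r is the horizontal radius sqrt(x² + y²), that φ is atan2(y, x), and that tanθ is z / r. These definitions, and the placeholder used for points lying on the sensor axis, are assumptions not taken from the text.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float, float]


def sort_key(p: Point) -> Tuple[float, float, float]:
    """Map (x, y, z) to the assumed polar triple, ordered as (tan_theta, phi, r)."""
    x, y, z = p
    r = math.hypot(x, y)
    phi = math.atan2(y, x)
    # Points on the sensor axis (r == 0) get 0.0 as an arbitrary placeholder.
    tan_theta = z / r if r > 0 else 0.0
    return (tan_theta, phi, r)


def reorder(points: List[Point]) -> List[Point]:
    """Sort by elevation tangent, then azimuth, then radius."""
    return sorted(points, key=sort_key)
```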

It should be noted that the embodiment of the present application does not limit the specific type of the third coordinate system. The third coordinate system includes a cylindrical coordinate system, a polar coordinate system, and may also include other coordinate systems.

Next, coordinate transformation is performed on the sorted point cloud. Specifically, the point cloud is traversed according to the sorted result, and the coordinate values of the points in the point cloud in the second coordinate system are transformed into coordinate values in the first coordinate system and stored.

Exemplarily, assuming that the first coordinate system is a cylindrical coordinate system and the second coordinate system is a Cartesian coordinate system, based on the above formula (4), the coordinate value (x, y, z) of the point in the point cloud in the Cartesian coordinate system is converted to the coordinate value (r, φ, i) in the cylindrical coordinate system.

A single chain structure is constructed based on the coordinate values of the points in the point cloud in the first coordinate system. In one example, assuming that the first coordinate system is a cylindrical coordinate system, the single chain structure constructed from the point cloud is shown in FIG. 7A.

In some embodiments, the single-chain structure shown in FIG. 7A is regularized in the vertical direction by using the rotation interval of each Laser, to obtain the single-chain structure shown in FIG. 7B. Optionally, the above formula (5) may be used for regularization.

In the embodiment of the present application, encoding is performed based on the regularized single-chain structure.

S202. Determine the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud.

In the embodiment of the present application, in the encoding method based on L3C2, the encoding end constructs the L3C2 structure of the point cloud based on the geometric information of the point cloud. The L3C2 structure is a chain structure, which is composed of at least one single chain structure, and each single chain structure includes at least one node. A node includes at least one point in the point cloud, that is, in the L3C2 encoding, the points in the point cloud are divided into nodes in L3C2.

In some embodiments, it can be seen from the above that when constructing L3C2 of a point cloud, it is necessary to perform coordinate conversion on the points in the point cloud. For example, when converting the coordinates of the points in the point cloud in the second coordinate system to the coordinates in the first coordinate system, the coordinates of the points with different coordinates in the second coordinate system may become the same when converted to the first coordinate system. In this way, when constructing L3C2 based on the coordinates of the points in the first coordinate system, the points with the same coordinates in the first coordinate system will be divided into one node, so that the node includes duplicate points.
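A small sketch of how duplicate points arise: points that differ in the second coordinate system but map to the same coordinates in the first coordinate system end up in one node. The callable `to_first_system` is a hypothetical stand-in for the actual conversion (formula (4) in this document), and grouping through a dictionary is an illustrative simplification of the chain construction.

```python
from typing import Callable, Dict, Hashable, List, Sequence, Tuple

Point = Tuple[int, ...]


def build_nodes(points: Sequence[Point],
                to_first_system: Callable[[Point], Hashable]) -> List[List[Point]]:
    """Group points whose converted coordinates coincide into one node."""
    nodes: Dict[Hashable, List[Point]] = {}
    for p in points:
        nodes.setdefault(to_first_system(p), []).append(p)
    # Each node holds one first point plus (len - 1) duplicate points.
    return list(nodes.values())
```

With a lossy toy conversion such as integer halving, two distinct points can collapse into the same node, so that node carries one duplicate point.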

At present, when encoding each node in L3C2, the encoding end needs to encode the repeated point information of each node, which will increase the complexity of encoding, waste encoding time and reduce encoding efficiency.

From the above, it can be seen that in the L3C2 of the point cloud, not every node has duplicate points, and it can even be understood that only a few nodes in the L3C2 of the point cloud include duplicate points. Based on this, in the embodiment of the present application, the total number of duplicate points included in the point cloud is determined first, and during the encoding process the number of encoded duplicate points is counted. Before encoding each node, it is first determined whether the number of currently encoded duplicate points is equal to the total number of duplicate points included in the point cloud. If the two are equal, the duplicate points in the point cloud have all been encoded, and the remaining nodes to be encoded in L3C2 do not include duplicate points. In this way, when these nodes are subsequently encoded, their duplicate point information is no longer encoded, thereby reducing the encoding complexity of the point cloud, saving encoding time, and improving encoding efficiency.

The embodiment of the present application does not limit the specific method for the encoder to obtain the total number of L3C2 nodes of the point cloud and the total number of points of the point cloud.

In some embodiments, when constructing L3C2, the encoder counts the number of nodes included in L3C2.

In some embodiments, the encoding end determines the number of single chains included in L3C2 and the number of nodes included in each single chain, and obtains the total number of nodes of L3C2 based on the number of single chains and the number of nodes included in each single chain. For example, the sum of the number of nodes included in each single chain is determined as the total number of nodes of L3C2.

for (index = 0; index < headsCount; index++)
    nodeCount += nodeCnt[index];  // nodeCnt[index]: number of nodes in the index-th single chain

In this implementation, the encoding method further includes: writing the number of single chains included in L3C2 and the number of nodes included in each single chain into the geometric code stream of the point cloud.

In addition, the point cloud file includes the total number of points of the point cloud, so that the encoding end obtains the total number of points of the point cloud by receiving the point cloud file.

Furthermore, the encoder writes the total number of points in the point cloud into the geometry code stream.

In some embodiments, the encoding end writes the total number of points of the point cloud into the geometric data unit.

In some embodiments, the encoder subtracts 1 from the total number of points in the point cloud to obtain a first value, and writes the first value into the geometric data unit. For example, as shown in Table 2 above.

Based on the above method, the encoder obtains the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud, and then executes the following step S203.

S203. Determine the total number of repeated points in the point cloud according to the total number of points in the point cloud and the total number of nodes in L3C2.

After the encoder determines the total number of L3C2 nodes and the total number of points of the point cloud, it determines the total number of repeated points in the point cloud based on these two totals, and then uses the total number of repeated points in the point cloud to control the subsequent encoding.

The embodiment of the present application does not limit the specific method in which the encoder determines the total number of repeated points in the point cloud based on the total number of nodes of L3C2 of the point cloud and the total number of points in the point cloud.

In a possible implementation, the difference between the total number of points in the point cloud and the total number of nodes in L3C2 is directly determined as the total number of repeated points in the point cloud.

Exemplarily, the encoder determines the total number of repeated points in the point cloud through the above formula (13).
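The arithmetic of S202 and S203 is small enough to state directly. The sketch below sums the per-chain node counts and subtracts the result from the point count. The function names are illustrative, and the sanity check is an addition that is not part of the described method.

```python
from typing import Sequence


def total_node_count(nodes_per_chain: Sequence[int]) -> int:
    """Sum the node counts of all single chains to get the L3C2 node total."""
    return sum(nodes_per_chain)


def total_duplicate_points(point_count: int,
                           nodes_per_chain: Sequence[int]) -> int:
    """Total repeated points = total points - total L3C2 nodes."""
    node_count = total_node_count(nodes_per_chain)
    if point_count < node_count:
        # Every node holds at least one point, so this would be inconsistent.
        raise ValueError("point count is smaller than node count")
    return point_count - node_count
```

For example, a cloud of 10 points whose two chains hold 3 and 4 nodes has 7 nodes and therefore 3 repeated points.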

Based on the above steps, after the encoder determines the total number of repeated points in the point cloud, it starts to encode each node in L3C2, as shown in S204 below.

S204. When encoding the current node in L3C2, determine the number of encoded repeated points.

In the embodiment of the present application, the encoding end encodes each node in L3C2 in the same way. For the convenience of description, the current node in L3C2 is taken as an example for illustration.

In one example, dupCount is used to represent the number of duplicate points currently encoded, and dupCount is initialized to 0. When encoding the first node in L3C2, dupCount = 0, which is less than the total number of duplicate points dupSumNum in the point cloud. In this case, when encoding the first node, the duplicate point information of the first node needs to be encoded. Optionally, the duplicate point information of the first node includes the number of duplicate points included in the first node. Assuming that the number of duplicate points included in the first node is a, dupCount is updated to a. Next, encode the second node in L3C2. Before encoding the second node, first determine whether dupCount = a is equal to the total number of duplicate points dupSumNum in the point cloud. If not, continue to encode the duplicate point information of the second node, and use the number of duplicate points included in the second node to update the current dupCount, and so on.

As can be seen from the above, when encoding the current node in L3C2, the encoding end first determines the number of repeated points that have been encoded before encoding the current node, that is, the number of encoded repeated points dupCount, and then determines whether to encode the repeated point information of the current node based on the number of encoded repeated points dupCount.

In one example, the encoder records the number of encoded duplicate points through the following instructions:

for (index = 0; index < nodeCount; index++)
    dupCount += dupPointNum[index];  // dupPointNum[index]: number of duplicate points in the index-th node

S205 . Encode the current node according to the number of encoded duplicate points and the total number of duplicate points in the point cloud.

When encoding the current node, the encoder compares the number of encoded duplicate points with the total number of duplicate points in the point cloud to encode the current node.

Specifically, the implementation process of the above S205 includes the following situations:

Case 1: If the number of encoded duplicate points is equal to the total number of duplicate points in the point cloud, then the above S205 includes the following S205-A1 and S205-A2:

S205-A1, skip determining and encoding the repeated point information of the current node.

S205-A2. Determine the coordinate residual value of the first point included in the current node, and write the coordinate residual value of the first point into the geometric code stream of the point cloud.

In case 1, when encoding the current node, if it is determined that the number of coded repeated points is equal to the total number of repeated points in the point cloud, it means that all repeated points have been encoded and the remaining nodes of L3C2 do not include repeated points. Based on this, when encoding the current node, the encoding of repeated point information is skipped, thereby reducing the encoding complexity, saving encoding time, and improving encoding efficiency.

At this point, it can be determined that the current node includes only one point, which is recorded as the first point. Then, the coordinate residual value of the first point is determined, and the coordinate residual value of the first point is written into the geometric code stream of the point cloud.

It should be noted that the process of determining the coordinate residual value of the first point and writing the coordinate residual value of the first point into the geometric code stream of the point cloud can be specifically referred to the description of the following steps 21 to 25, which will not be repeated here.

The above describes case 1: if the number of encoded duplicate points is equal to the total number of duplicate points in the point cloud, the encoding of the duplicate point information of the current node is skipped, and geometric encoding is performed on the first point included in the current node.

Case 2: If the number of encoded duplicate points is less than the total number of duplicate points in the point cloud, and the current node is not the last node in L3C2, then the above S205 includes the following steps:

S205-B1. Determine the duplicate point information of the current node.

In an embodiment of the present application, when encoding the current node, the encoding end determines the number of encoded duplicate points corresponding to the current node. If the number of encoded duplicate points corresponding to the current node is less than the total number of duplicate points in the point cloud, it means that the current node may include duplicate points. At this time, in order to accurately encode, it is necessary to determine the duplicate point information of the current node and encode the duplicate point information of the current node.

In some embodiments, the repeated point information includes the number of repeated points included in the current node. Thus, by determining the number of repeated points included in the current node, the repeated point information of the current node can be determined.

In some embodiments, the repeated point information includes the number of repeated points included in the current node and at least one first flag, where the first flag is used to indicate whether the coordinates of the current point in the current node are the same as those of the previous point in the second coordinate system.

In some embodiments, if the number N of repeated points included in the current node is greater than 0 and the repeated point information includes the first flag, then the above S205-B1 includes the following S205-B11 to S205-B13:

S205-B11. Determine the number N of repeated points included in the current node;

S205-B12. Based on the coordinate value of the first point included in the current node in the second coordinate system and the coordinate values of the N repeated points in the second coordinate system, determine the first flags respectively corresponding to the N repeated points;

S205-B13. Determine the repeated point information of the current node based on the number N of repeated points included in the current node and the first flags respectively corresponding to the N repeated points.

Specifically, among the points in the current node except the first point, the points that are repeated with the first point are determined. For example, if the current node includes 3 points, it is determined that the current node includes 2 repeated points.

Next, based on the coordinate value of the first point included in the current node in the second coordinate system and the coordinate values of the N repeated points in the second coordinate system, the first flags corresponding to the N repeated points are determined.

In a possible implementation, for each of the N repeated points, the coordinate value of the repeated point in the second coordinate system is compared with the coordinate value of the first point in the second coordinate system to determine the first flag corresponding to the repeated point. For example, if the coordinate value of the repeated point in the second coordinate system is the same as the coordinate value of the first point in the second coordinate system, the value of the first flag corresponding to the repeated point is set to the first value; if the two coordinate values are different, the value of the first flag corresponding to the repeated point is set to the second value.

The embodiment of the present application does not limit the specific values of the first numerical value and the second numerical value.

Optionally, the first value is 1.

Optionally, the second value is 0.

In another possible implementation, for the i-th repeated point among the N repeated points, the first flag corresponding to the i-th repeated point is determined based on the coordinate value of the (i-1)-th repeated point in the second coordinate system and the coordinate value of the i-th repeated point in the second coordinate system, where i is a positive integer less than or equal to N. If i is 1, the (i-1)-th repeated point is the first point.

For example, if the coordinate value of the i-th repeated point in the second coordinate system is the same as the coordinate value of the (i-1)-th repeated point in the second coordinate system, the value of the first flag is set to the first value, which indicates that the two coordinate values are the same.

For another example, if the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the (i-1)-th repeated point in the second coordinate system, the value of the first flag is set to the second value, which indicates that the two coordinate values are different.

In some embodiments, if it is determined that the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the (i-1)-th repeated point in the second coordinate system, that is, the value of the first flag corresponding to the i-th repeated point is the second value, then the embodiment of the present application further includes: determining the coordinate residual value of the i-th repeated point, and writing the coordinate residual value of the i-th repeated point into the geometric code stream. For the determination of the coordinate residual value of the i-th repeated point, refer to the description of the following steps 21 to 25, which will not be repeated here.

After the number N of repeated points included in the current node and the first flag corresponding to each repeated point are determined as described above, they are together determined as the repeated point information of the current node.
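The second implementation above (compare each repeated point with the point immediately before it) can be sketched as follows. The default flag values follow the optional values given in the text (1 for same, 0 for different). The function names and the dictionary layout of the repeated point information are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Point = Tuple[int, int, int]


def first_flags(first_point: Point, repeated: Sequence[Point],
                same_value: int = 1, different_value: int = 0) -> List[int]:
    """One flag per repeated point: does it equal the previous point in Cartesian space?"""
    flags = []
    prev = first_point               # for i == 1 the previous point is the first point
    for p in repeated:
        flags.append(same_value if p == prev else different_value)
        prev = p
    return flags


def duplicate_point_info(first_point: Point,
                         repeated: Sequence[Point]) -> Dict[str, object]:
    """Bundle the count N and the N flags as the node's repeated point information."""
    return {"num_dup": len(repeated), "flags": first_flags(first_point, repeated)}
```

A flag equal to the second value signals that a coordinate residual for that repeated point still has to be coded, whereas the first value means the previous point's coordinates can simply be reused.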

Furthermore, the encoding end uses the number N of repeated points included in the current node to update the number of encoded repeated points, for example, the sum of the number of repeated points included in the current node and the number of encoded repeated points is determined as the new number of encoded repeated points.
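The determination of the first mark for each repeated point, and the update of the number of encoded repeated points, can be sketched as follows. This is a minimal illustration only: the function names and the concrete values 1 and 0 for the first and second numerical values are assumptions and are not specified by the present application.

```python
from typing import List, Tuple

# Coordinates in the second coordinate system (assumed here to be integer x, y, z).
Point = Tuple[int, int, int]

SAME = 1       # assumed encoding of the "first numerical value"
DIFFERENT = 0  # assumed encoding of the "second numerical value"


def first_marks(first_point: Point, repeated: List[Point]) -> List[int]:
    """Return one first mark per repeated point of the current node."""
    marks = []
    previous = first_point  # for i == 1, the (i-1)-th point is the node's first point
    for point in repeated:
        marks.append(SAME if point == previous else DIFFERENT)
        previous = point
    return marks


def update_encoded_count(encoded_so_far: int, n_in_node: int) -> int:
    """New number of encoded repeated points after the current node."""
    return encoded_so_far + n_in_node
```

Only a repeated point whose mark is `DIFFERENT` would additionally need a coordinate residual value.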

Based on the above steps, after the duplicate point information of the current node is determined, step S205-B2 is executed.

S205-B2. Determine the coordinate residual value of the first point in the current node.

For example, the coordinate prediction value of the first point in the current node is determined, and the coordinate residual value of the first point is determined based on the coordinate prediction value and the coordinate value.

The specific implementation process of the above S205-B2 refers to the description of the following steps 21 to 25, which will not be repeated here.

S205-B3. Write the coordinate residual value of the first point and the duplicate point information of the current node into the geometry code stream.

In case 2, based on the above steps, after determining the duplicate point information of the current node and the coordinate residual value of the first point of the current node, the coordinate residual value of the first point and the duplicate point information of the current node are written into the geometry code stream.

Case 3: If the duplicate point information includes the number of duplicate points included in the current node, the number of encoded duplicate points is less than the total number of duplicate points in the point cloud, and the current node is the last node in L3C2, then the above S205 includes the following steps:

S205-C1, skip encoding the number of repeated points included in the current node;

S205-C2, determine the coordinate residual value of the first point in the current node;

S205-C3, write the coordinate residual value of the first point and the duplicate point information of the current node into the geometry code stream.

In this case 3, the number of encoded duplicate points corresponding to the current node is less than the total number of duplicate points in the point cloud, which means that the current node may include duplicate points. If the encoder further determines that the current node is the last node of L3C2, then the current node must include duplicate points, and the number of duplicate points it includes is the difference between the total number of duplicate points in the point cloud and the number of encoded duplicate points. Because the decoder can derive this number as the same difference, the encoder skips encoding the number of duplicate points included in the current node, thereby reducing encoding complexity, saving encoding time, and improving encoding efficiency.

In this case 3, the encoding end also needs to determine the coordinate residual value of the first point included in the current node, and write the coordinate residual value of the first point into the geometric code stream. The process of determining the coordinate residual value of the first point can be specifically referred to the description of steps 21 to 25 below, which will not be repeated here.
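The choice among the three cases can be sketched as a small decision function. The enum and function names below are hypothetical and are introduced only for illustration.

```python
from enum import Enum


class NodeCase(Enum):
    SKIP_DUPLICATE_INFO = 1    # case 1: all duplicate points are already encoded
    ENCODE_DUPLICATE_INFO = 2  # case 2: duplicates may remain, node is not the last
    INFER_COUNT = 3            # case 3: last node, count is derivable by the decoder


def classify_node(encoded_dups: int, total_dups: int, is_last_node: bool) -> NodeCase:
    """Pick the encoding case for the current node."""
    if encoded_dups == total_dups:
        return NodeCase.SKIP_DUPLICATE_INFO
    if not is_last_node:
        return NodeCase.ENCODE_DUPLICATE_INFO
    return NodeCase.INFER_COUNT


def inferred_count(encoded_dups: int, total_dups: int) -> int:
    """Number of duplicate points in the last node under case 3."""
    return total_dups - encoded_dups
```

In all three cases the coordinate residual value of the first point is still determined and written; only the handling of the duplicate point information differs.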

The following describes how the coordinate residual value is determined for the single point (i.e., the first point) included in the current node in case 1; for the first point of the current node, and for the i-th repeated point among the N repeated points included in the current node, in case 2; and for the first point of the current node in case 3. That is, the target point in the following steps 21 to 25 can be understood as the first point in case 1, as the first point or the i-th repeated point in case 2, or as the first point in case 3.

The specific process of determining the coordinate residual value of the target point is introduced below.

Step 21: Determine the predicted coordinate value of the target point in the first coordinate system.

In a possible implementation, the predicted coordinate value of the target point in the first coordinate system is determined based on the coordinate values of already-encoded points in the first coordinate system. For example, the arithmetic mean or weighted mean of the coordinate values of one or more encoded points before the target point in the first coordinate system is determined as the predicted coordinate value of the target point in the first coordinate system.

In a possible implementation, step 21 includes the following steps 21-1 and 21-2:

Step 21-1, determining the prediction mode corresponding to the target point;

Step 21-2: Based on the prediction mode, determine the coordinate prediction value of the target point in the first coordinate system.

In some embodiments, the prediction mode corresponding to the target point is a default mode.

Here, i represents the laser mark corresponding to the target point. In this way, the prediction list corresponding to the laser i of the target point can be determined as the prediction list corresponding to the target point. Assuming that the prediction list corresponding to the target point includes M prediction values, the target point is predicted using each of these M prediction values, and the cost corresponding to each of the M prediction values is determined. Then, the index of the prediction value with the lowest cost in the prediction list is determined as the prediction mode corresponding to the target point. In other words, the prediction mode corresponding to the target point is the index of the prediction value with the lowest cost in the prediction list corresponding to the target point.

Next, based on the prediction mode, the coordinate prediction value of the target point in the first coordinate system is determined. For example, the prediction value located at the index given by the prediction mode, in the prediction list corresponding to the laser i of the target point, is determined as the coordinate prediction value of the target point in the first coordinate system. Optionally, the prediction value includes r _pred and

In some embodiments, the encoder writes the prediction mode corresponding to the target point into the geometry bitstream.
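Selecting the prediction mode as the index of the lowest-cost entry in the prediction list can be sketched as below. The cost measure (sum of absolute component differences) and the two-component tuple layout are assumptions made for illustration; the present application does not fix how the cost is computed.

```python
from typing import List, Sequence, Tuple


def choose_prediction_mode(
    target: Sequence[float],
    prediction_list: List[Sequence[float]],
) -> Tuple[int, Sequence[float]]:
    """Return (prediction mode, chosen prediction value) for the target point.

    The prediction mode is the index of the lowest-cost prediction value.
    """
    if not prediction_list:
        raise ValueError("prediction list must contain at least one prediction value")
    # Assumed cost: sum of absolute residual components.
    costs = [
        sum(abs(t - p) for t, p in zip(target, candidate))
        for candidate in prediction_list
    ]
    mode = min(range(len(costs)), key=costs.__getitem__)
    return mode, prediction_list[mode]
```

When several candidates have the same cost, this sketch picks the smallest index, which is one possible tie-breaking rule.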

After the encoder determines the predicted coordinate value of the target point in the first coordinate system based on the above steps, it executes the following step 22.

Step 22: Determine a residual value of the target point in the first coordinate system based on the predicted coordinate value of the target point in the first coordinate system and the coordinate value of the target point in the first coordinate system.

For example, the difference between the coordinate value of the target point in the first coordinate system and the predicted coordinate value is determined as the residual value of the target point in the first coordinate system.

In some embodiments, the encoding end quantizes the residual value of the target point in the first coordinate system to obtain the quantized residual value of the target point in the first coordinate system.

In some embodiments, when the residual value of the target point in the first coordinate system is quantized, the quantization factors need to be determined first. For example, the quantization factors, including Δr, are determined based on the above formulas (7) and (8).

Then, the residual value r_res of the target point in the first coordinate system is quantized based on the quantization factor Δr, and the residual value of the other component of the target point in the first coordinate system is quantized based on its corresponding quantization factor.

In the embodiment of the present application, the coordinate residual value of the target point includes the coordinate residual value of the target point in the first coordinate system and the coordinate residual value of the target point in the second coordinate system. Based on the above method, the coordinate residual value of the target point in the first coordinate system is determined. Then, based on steps 23 to 25, the coordinate residual value of the target point in the second coordinate system is determined.

Step 23: Based on the predicted coordinate value of the target point in the first coordinate system, obtain the reconstructed coordinate value of the target point in the first coordinate system.

In some embodiments, the predicted coordinate value of the target point in the first coordinate system is used as the reconstructed coordinate value of the target point in the first coordinate system.

In some embodiments, if the residual value of the target point in the first coordinate system is not quantized, the sum of the coordinate prediction value and the residual value of the target point in the first coordinate system is determined as the coordinate reconstruction value of the target point in the first coordinate system.

In some embodiments, if the residual value of the target point in the first coordinate system is quantized, the encoder performs inverse quantization on the quantized residual value of the coordinate of the target point in the first coordinate system to obtain the residual value of the coordinate of the target point in the first coordinate system; based on the predicted value of the coordinate of the target point in the first coordinate system and the residual value of the coordinate, obtain the reconstructed value of the coordinate of the target point in the first coordinate system. For example, the sum of the predicted value of the coordinate of the target point in the first coordinate system and the residual value of the coordinate is determined as the reconstructed value of the coordinate of the target point in the first coordinate system.

Step 24: performing coordinate transformation on the coordinate reconstruction value of the target point in the first coordinate system to obtain the coordinate reconstruction value of the target point in the second coordinate system.

The conversion relationship used depends on which first coordinate system and which second coordinate system are involved.

Exemplarily, assuming that the first coordinate system is a cylindrical coordinate system and the second coordinate system is a Cartesian coordinate system, the coordinate reconstruction value of the target point in the first coordinate system is transformed with reference to the above formula (9) to obtain the coordinate reconstruction value of the target point in the second coordinate system.

Step 25: Determine the coordinate residual value of the target point in the second coordinate system based on the coordinate reconstruction value and the coordinate value of the target point in the second coordinate system.

For example, the difference between the coordinate value of the target point in the second coordinate system and the coordinate reconstruction value of the target point in the second coordinate system is determined as the coordinate residual value of the target point in the second coordinate system.
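Steps 22 to 25 can be sketched end to end as follows. Several details are assumptions made only for illustration: the first coordinate system is treated as cylindrical with components (r, phi, z), quantization is a uniform rounding with step sizes `step_r` and `step_phi` (formulas (7) and (8) are not reproduced here), and the conversion uses the textbook relation x = r·cos(phi), y = r·sin(phi) with z passed through, which may differ from formula (9).

```python
import math
from typing import Tuple

Vec3 = Tuple[float, float, float]


def quantize(value: float, step: float) -> int:
    return round(value / step)


def dequantize(level: int, step: float) -> float:
    return level * step


def cylindrical_to_cartesian(r: float, phi: float, z: float) -> Vec3:
    # Assumed conversion; stands in for formula (9).
    return (r * math.cos(phi), r * math.sin(phi), z)


def encode_target(
    actual_cyl: Vec3,                # (r, phi, z) of the target point
    pred_cyl: Tuple[float, float],   # (r_pred, phi_pred) from step 21
    actual_xyz: Vec3,                # coordinate value in the second coordinate system
    step_r: float,
    step_phi: float,
):
    r, phi, z = actual_cyl
    r_pred, phi_pred = pred_cyl
    # Step 22: residual in the first coordinate system, then quantization.
    q_r = quantize(r - r_pred, step_r)
    q_phi = quantize(phi - phi_pred, step_phi)
    # Step 23: reconstruction = prediction + dequantized residual.
    r_rec = r_pred + dequantize(q_r, step_r)
    phi_rec = phi_pred + dequantize(q_phi, step_phi)
    # Step 24: transform the reconstruction into the second coordinate system.
    xyz_rec = cylindrical_to_cartesian(r_rec, phi_rec, z)
    # Step 25: residual in the second coordinate system.
    res_xyz = tuple(a - b for a, b in zip(actual_xyz, xyz_rec))
    return (q_r, q_phi), res_xyz, xyz_rec
```

Because the residual in the second coordinate system is taken against the reconstruction, adding it back to the reconstruction recovers the original coordinate value, whatever error the quantization in the first coordinate system introduced.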

In some embodiments, the encoder writes the coordinate residual value of the target point in the first coordinate system and the coordinate residual value of the target point in the second coordinate system into the geometric code stream.

In some embodiments, the encoder quantizes at least one of a coordinate residual value of the target point in the first coordinate system and a coordinate residual value of the target point in the second coordinate system, and then writes the quantized value into a geometric code stream.

In some embodiments, the encoding end further determines the order of the target point in the single-chain structure, that is, o(P), based on the above formula (6), and then encodes o(P).

The above embodiment is described by taking the geometric encoding process of the current node in L3C2 as an example. The encoding process of other nodes in L3C2 can refer to the encoding process of the current node, and then the reconstructed point cloud geometric information can be obtained.

The point cloud encoding method provided in the embodiment of the present application determines the L3C2 structure of the point cloud, determines the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud, and determines the total number of repeated points included in the point cloud based on the total number of points of the point cloud and the total number of nodes of L3C2 of the point cloud. In this way, during encoding, the number of encoded repeated points is recorded in real time and compared with the total number of repeated points included in the point cloud, so as to determine, when encoding the current node, whether to encode the repeated point information of that node. For example, if it is determined when encoding the current node that the number of encoded repeated points is equal to the total number of repeated points in the point cloud, this indicates that all repeated points in the point cloud have been encoded and that the remaining nodes do not include repeated points. There is then no need to encode the repeated point information of subsequent nodes, which reduces the encoding complexity of the point cloud, saves encoding time, and improves encoding efficiency.

It should be understood that FIGS. 13 to 14 are merely examples of the present application and should not be construed as limitations to the present application.

The preferred embodiments of the present application are described in detail above in conjunction with the accompanying drawings. However, the present application is not limited to the specific details in the above embodiments. Within the technical concept of the present application, the technical solution of the present application can be subjected to a variety of simple modifications, and these simple modifications all belong to the protection scope of the present application. For example, the various specific technical features described in the above specific embodiments can be combined in any suitable manner without contradiction. In order to avoid unnecessary repetition, the present application will not further explain various possible combinations. For another example, the various different embodiments of the present application can also be arbitrarily combined, as long as they do not violate the ideas of the present application, they should also be regarded as the contents disclosed in the present application.

It should also be understood that in the various method embodiments of the present application, the size of the sequence number of each process does not mean the order of execution, and the execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present application. In addition, in the embodiments of the present application, the term "and/or" is merely a description of the association relationship of associated objects, indicating that three relationships may exist. Specifically, A and/or B can represent: A exists alone, A and B exist at the same time, and B exists alone. In addition, the character "/" in this article generally indicates that the objects associated before and after are in an "or" relationship.

The above text, in combination with FIGS. 13 to 14, describes in detail a method embodiment of the present application. The following text, in combination with FIGS. 15 to 18, describes in detail a device embodiment of the present application.

FIG. 15 is a schematic block diagram of a point cloud decoding device provided in an embodiment of the present application.

As shown in FIG. 15, the point cloud decoding device 10 may include:

The first decoding unit 11 is used to decode the geometric code stream of the point cloud to obtain the total number of nodes of the low-delay and low-complexity coding model L3C2 of the point cloud and the total number of points of the point cloud;

A point number determination unit 12, configured to determine the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2;

A repeated point determination unit 13, used to determine the number of decoded repeated points when decoding the current node in the L3C2;

The second decoding unit 14 is used to decode the current node according to the number of the decoded repeated points and the total number of repeated points in the point cloud.

In some embodiments, the point number determination unit 12 is specifically configured to determine the difference between the total number of points of the point cloud and the total number of nodes of the L3C2 as the total number of repeated points of the point cloud.

In some embodiments, the second decoding unit 14 is specifically used to skip decoding the duplicate point information of the current node if the number of decoded duplicate points is equal to the total number of duplicate points in the point cloud, and determine the geometric reconstruction value of the first point included in the current node.

In some embodiments, if the duplicate point information includes the number of duplicate points included in the current node, the second decoding unit 14 is specifically used to skip decoding the duplicate point information of the current node if the number of decoded duplicate points is less than the total number of duplicate points in the point cloud and the current node is the last node in the L3C2; and determine the difference between the total number of duplicate points in the point cloud and the number of decoded duplicate points as the number of duplicate points included in the current node.

In some embodiments, the second decoding unit 14 is specifically used to decode the duplicate point information of the current node if the number of decoded duplicate points is less than the total number of duplicate points in the point cloud and the current node is not the last node in the L3C2; based on the duplicate point information, determine the geometric reconstruction values of the N duplicate points included in the current node, where N is an integer.

In some embodiments, the second decoding unit 14 is specifically used to determine the geometric reconstruction value of the first point in the current node; based on the repeated point information and the geometric reconstruction value of the first point, determine the geometric reconstruction values of the N repeated points.

In some embodiments, if N is greater than 0, and the repeated point information includes a first flag, and the first flag is used to indicate whether the coordinates of the current point in the current node are the same as those of the previous point in the second coordinate system, then the second decoding unit 14 is specifically used to parse the repeated point information for the i-th repeated point among the N repeated points, and obtain the first flag corresponding to the i-th repeated point, where i is a positive integer less than or equal to N, and if i is 1, the i-1-th repeated point is the first point; based on the first flag corresponding to the i-th repeated point, determine the geometric reconstruction value of the i-th repeated point.

In some embodiments, the second decoding unit 14 is specifically used to determine the geometric reconstruction value of the i-1-th repeated point as the geometric reconstruction value of the i-th repeated point if the first flag indicates that the coordinates of the i-th repeated point and the i-1-th repeated point in the second coordinate system are the same.

In some embodiments, the second decoding unit 14 is specifically used to determine the geometric reconstruction value of the i-th repeated point if the first flag indicates that the coordinates of the i-th repeated point and the i-1-th repeated point in the second coordinate system are different.

In some embodiments, the second decoding unit 14 is specifically configured to determine the sum of the number of repeated points included in the current node and the number of decoded repeated points as the new number of decoded repeated points.
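The decoder-side bookkeeping described for the second decoding unit 14 can be sketched as below. To keep the example self-contained, the geometric code stream is simulated by a list `signalled_counts` holding only the per-node repeated point counts that would actually be present in the stream; this list and the function name are hypothetical.

```python
from typing import Iterable, List


def decode_duplicate_counts(
    signalled_counts: Iterable[int],
    total_points: int,
    total_nodes: int,
) -> List[int]:
    """Return the number of repeated points of every node in decoding order."""
    total_dups = total_points - total_nodes
    decoded_dups = 0
    stream = iter(signalled_counts)
    per_node = []
    for index in range(total_nodes):
        is_last = index == total_nodes - 1
        if decoded_dups == total_dups:
            n = 0                          # skip decoding: no repeated points remain
        elif is_last:
            n = total_dups - decoded_dups  # skip decoding: the count is inferred
        else:
            n = next(stream)               # decode the count from the stream
        decoded_dups += n                  # update the number of decoded repeated points
        per_node.append(n)
    return per_node
```

For instance, with 10 points and 5 nodes, the counts 2, 0 and 3 for the first three nodes already account for all 5 repeated points, so nothing is read for the last two nodes.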

In some embodiments, the second decoding unit 14 is specifically used to determine a coordinate prediction value of a target point in a first coordinate system, where the target point is the first point included in the current node, or the i-th repeated point among N repeated points included in the current node; based on the coordinate prediction value of the target point in the first coordinate system, determine a coordinate reconstruction value of the target point in the first coordinate system; based on the coordinate reconstruction value of the target point in the first coordinate system, determine a coordinate reconstruction value of the target point in a second coordinate system; and determine the coordinate reconstruction value of the target point in the second coordinate system as the geometric reconstruction value of the target point.

In some embodiments, the second decoding unit 14 is specifically used to decode the geometric code stream to obtain a prediction mode corresponding to the target point; based on the prediction mode, determine a coordinate prediction value of the target point in the first coordinate system.

In some embodiments, the second decoding unit 14 is specifically used to decode the geometric code stream to obtain the quantized coordinate residual value of the target point in the first coordinate system; dequantize the quantized coordinate residual value to obtain the coordinate residual value of the target point in the first coordinate system; based on the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system, obtain the coordinate reconstruction value of the target point in the first coordinate system.

In some embodiments, the second decoding unit 14 is specifically configured to determine the sum of the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system as the coordinate reconstruction value of the target point in the first coordinate system.

In some embodiments, the second decoding unit 14 is specifically used to perform coordinate conversion on the coordinate reconstruction value of the target point in the first coordinate system to obtain a coordinate prediction value of the target point in the second coordinate system; decode the geometric code stream to obtain a coordinate residual value of the target point in the second coordinate system; and obtain a coordinate reconstruction value of the target point in the second coordinate system based on the coordinate prediction value and the coordinate residual value of the target point in the second coordinate system.

In some embodiments, the second decoding unit 14 is specifically configured to determine the sum of the coordinate prediction value and the coordinate residual value of the target point in the second coordinate system as the coordinate reconstruction value of the target point in the second coordinate system.
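The reconstruction of a target point at the decoding end can be sketched as follows. As assumptions for illustration only, the first coordinate system is treated as cylindrical, dequantization is a multiplication by the step sizes `step_r` and `step_phi`, the height `z` is taken as already known, and the coordinate conversion uses x = r·cos(phi), y = r·sin(phi).

```python
import math
from typing import Tuple

Vec3 = Tuple[float, float, float]


def decode_target(
    pred_cyl: Tuple[float, float],  # (r_pred, phi_pred) obtained from the prediction mode
    q_levels: Tuple[int, int],      # quantized residuals decoded from the stream
    res_xyz: Vec3,                  # residual in the second coordinate system
    z: float,                       # assumed known height component
    step_r: float,
    step_phi: float,
) -> Vec3:
    # Reconstruction in the first coordinate system: prediction + dequantized residual.
    r_rec = pred_cyl[0] + q_levels[0] * step_r
    phi_rec = pred_cyl[1] + q_levels[1] * step_phi
    # The converted value serves as the prediction in the second coordinate system.
    pred_xyz = (r_rec * math.cos(phi_rec), r_rec * math.sin(phi_rec), z)
    # Reconstruction in the second coordinate system: prediction + residual.
    return tuple(p + d for p, d in zip(pred_xyz, res_xyz))
```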

In some embodiments, the first decoding unit 11 is specifically used to decode the geometric code stream of the point cloud to obtain the number of single chains included in the L3C2 and the number of nodes included in each single chain; based on the number of single chains and the number of nodes included in each single chain, the total number of nodes of the L3C2 is obtained.

In some embodiments, the first decoding unit 11 is specifically configured to decode the geometric data unit of the point cloud to obtain the total number of points of the point cloud.

In some embodiments, the first decoding unit 11 is specifically used to decode the geometric data unit to obtain a first value, where the first value is the number of points in the point cloud minus one; and add one to the first value to obtain the total number of points in the point cloud.
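The quantities derived by the first decoding unit 11 and the point number determination unit 12 reduce to simple arithmetic, sketched below with hypothetical function and parameter names.

```python
from typing import Sequence


def total_nodes(nodes_per_chain: Sequence[int]) -> int:
    """Total number of L3C2 nodes: sum of the node counts of all single chains."""
    return sum(nodes_per_chain)


def total_points(points_minus_one: int) -> int:
    """The stream carries (number of points - 1); add one to recover the total."""
    return points_minus_one + 1


def total_repeated_points(n_points: int, n_nodes: int) -> int:
    """Total repeated points: total number of points minus total number of nodes."""
    return n_points - n_nodes
```

For example, three single chains with 4, 3 and 5 nodes give 12 nodes; a decoded first value of 14 gives 15 points, so the point cloud contains 3 repeated points.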

It should be understood that the device embodiment and the method embodiment may correspond to each other, and similar descriptions may refer to the method embodiment. To avoid repetition, no further description is given here. Specifically, the point cloud decoding device 10 shown in FIG. 15 may correspond to the corresponding subject in the point cloud decoding method of the embodiment of the present application, and the aforementioned and other operations and/or functions of each unit in the point cloud decoding device 10 are respectively for implementing the corresponding processes in the point cloud decoding method, and for the sake of brevity, no further description is given here.

FIG. 16 is a schematic block diagram of a point cloud encoding device provided in an embodiment of the present application.

As shown in FIG. 16, the point cloud encoding device 20 includes:

A structure determination unit 21, used to determine a low-delay and low-complexity coding model L3C2 structure of the point cloud;

A point number determination unit 22, used to determine the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud;

A repeated point determination unit 23, used to determine the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2;

A calculation unit 24, configured to determine the number of encoded repeated points when encoding the current node in the L3C2;

The encoding unit 25 is used to encode the current node according to the number of the encoded repeated points and the total number of repeated points in the point cloud.

In some embodiments, the repeated point determination unit 23 is specifically configured to determine the difference between the total number of points of the point cloud and the total number of nodes of the L3C2 as the total number of repeated points of the point cloud.

In some embodiments, the encoding unit 25 is specifically used to skip determining and encoding the duplicate point information of the current node if the number of encoded duplicate points is equal to the total number of duplicate points of the point cloud; determine the coordinate residual value of the first point included in the current node, and write the coordinate residual value of the first point into the geometric code stream of the point cloud.

In some embodiments, if the duplicate point information includes the number of duplicate points included in the current node, the encoding unit 25 is specifically used to skip encoding the number of duplicate points included in the current node if the number of encoded duplicate points is less than the total number of duplicate points in the point cloud and the current node is the last node in the L3C2; determine the coordinate residual value of the first point in the current node; and write the coordinate residual value of the first point and the duplicate point information of the current node into the geometric code stream.

In some embodiments, the encoding unit 25 is specifically used to determine the duplicate point information of the current node if the number of the encoded duplicate points is less than the total number of duplicate points in the point cloud and the current node is not the last node in the L3C2; determine the coordinate residual value of the first point in the current node; and write the coordinate residual value of the first point and the duplicate point information of the current node into the geometric code stream.

In some embodiments, if N is greater than 0, and the repeated point information includes a first flag, and the first flag is used to indicate whether the coordinates of the current point in the current node are the same as those of the previous point in the second coordinate system, then the encoding unit 25 is specifically used to determine the number N of repeated points included in the current node; based on the coordinate value of the first point included in the current node in the second coordinate system, and the coordinate values of the N repeated points in the second coordinate system, determine the first flags corresponding to the N repeated points respectively; based on the number N of repeated points included in the current node and the first flags corresponding to the N repeated points respectively, determine the repeated point information of the current node.

In some embodiments, the encoding unit 25 is specifically used to determine, for the i-th repeated point among the N repeated points, a first mark corresponding to the i-th repeated point based on the coordinate value of the i-1-th repeated point in the second coordinate system and the coordinate value of the i-th repeated point in the second coordinate system, where i is a positive integer greater than 0 and less than or equal to N, and if i is 1, the i-1-th repeated point is the first point.

In some embodiments, the encoding unit 25 is specifically used to set the value of the first mark to a first numerical value if the coordinate value of the i-th repeated point in the second coordinate system is the same as the coordinate value of the i-1-th repeated point in the second coordinate system, and the first numerical value indicates that the coordinate value of the i-th repeated point in the second coordinate system is the same as the coordinate value of the i-1-th repeated point in the second coordinate system.

In some embodiments, the encoding unit 25 is specifically used to set the value of the first mark to a second numerical value if the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the i-1-th repeated point in the second coordinate system, and the second numerical value indicates that the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the i-1-th repeated point in the second coordinate system.

In some embodiments, the encoding unit 25 is also used to determine the coordinate residual value of the i-th repeated point if the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the i-1-th repeated point in the second coordinate system; and write the coordinate residual value of the i-th repeated point into the geometric code stream.

In some embodiments, the encoding unit 25 is specifically configured to determine the sum of the number of repeated points included in the current node and the number of encoded repeated points as the new number of encoded repeated points.

In some embodiments, the encoding unit 25 is specifically used to determine a coordinate prediction value of a target point in a first coordinate system, where the target point is the first point included in the current node, or the i-th repeated point among N repeated points included in the current node; based on the coordinate prediction value of the target point in the first coordinate system and the coordinate value of the target point in the first coordinate system, determine the residual value of the target point in the first coordinate system; based on the coordinate prediction value of the target point in the first coordinate system, obtain the coordinate reconstruction value of the target point in the first coordinate system; perform coordinate transformation on the coordinate reconstruction value of the target point in the first coordinate system to obtain the coordinate reconstruction value of the target point in the second coordinate system; based on the coordinate reconstruction value and the coordinate value of the target point in the second coordinate system, determine the coordinate residual value of the target point in the second coordinate system.

In some embodiments, the encoding unit 25 is specifically configured to quantize at least one of a coordinate residual value of the target point in the first coordinate system and a coordinate residual value of the target point in the second coordinate system, and then write the quantized value into the geometric code stream.

In some embodiments, the encoding unit 25 is specifically configured to determine a prediction mode corresponding to the target point; and based on the prediction mode, determine a coordinate prediction value of the target point in the first coordinate system.

In some embodiments, the encoding unit 25 is specifically configured to write the prediction mode corresponding to the target point into the geometric code stream.

In some embodiments, the encoding unit 25 is specifically used to dequantize the quantized coordinate residual value of the target point in the first coordinate system to obtain the coordinate residual value of the target point in the first coordinate system; based on the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system, obtain the coordinate reconstruction value of the target point in the first coordinate system.

In some embodiments, the encoding unit 25 is specifically configured to determine the sum of the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system as the coordinate reconstruction value of the target point in the first coordinate system.
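The quantize, dequantize and reconstruct steps can be sketched for a single coordinate component as follows. The application does not state which quantizer is used here, so uniform scalar quantization with an integer step size is assumed purely for illustration, and the function names are hypothetical.

```python
def quantize(residual: int, step: int) -> int:
    """Uniform quantization, rounding to the nearest level (assumed scheme)."""
    sign = -1 if residual < 0 else 1
    return sign * ((abs(residual) + step // 2) // step)


def dequantize(level: int, step: int) -> int:
    """Inverse of quantize, up to the quantization error."""
    return level * step


def reconstruct(prediction: int, level: int, step: int) -> int:
    """Reconstruction = prediction + dequantized residual."""
    return prediction + dequantize(level, step)
```

Because the encoder reconstructs from the dequantized residual rather than from the original one, it obtains the same reconstruction value that a decoder would obtain from the code stream. With a step size of 1 the round trip is lossless.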

In some embodiments, the point determination unit 22 is specifically used to determine the number of single chains included in the L3C2 and the number of nodes included in each single chain; based on the number of single chains and the number of nodes included in each single chain, the total number of nodes of the L3C2 is obtained.
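As a small worked example, assuming the total number of nodes is simply the sum of the per-chain node counts (the passage does not spell out the formula) and using hypothetical function names, the node total and the resulting total number of repeated points could be computed like this:

```python
from typing import List


def total_l3c2_nodes(num_chains: int, nodes_per_chain: List[int]) -> int:
    """Sum the node counts of all single chains (assumed combination rule)."""
    if len(nodes_per_chain) != num_chains:
        raise ValueError("one node count is expected per single chain")
    return sum(nodes_per_chain)


def total_repeated_points(total_points: int, total_nodes: int) -> int:
    """Total repeated points = total points - total nodes."""
    if total_points < total_nodes:
        raise ValueError("a point cloud cannot have fewer points than nodes")
    return total_points - total_nodes
```

With three single chains holding 4, 5 and 2 nodes and a point cloud of 14 points, there are 11 nodes and therefore 3 repeated points.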

In some embodiments, the encoding unit 25 is further used to write the number of single chains included in the L3C2 and the number of nodes included in each single chain into the geometric code stream of the point cloud.

In some embodiments, the encoding unit 25 is further configured to write the total number of points of the point cloud into the geometric data unit.

In some embodiments, the encoding unit 25 is further used to subtract 1 from the total number of points in the point cloud to obtain a first value; and write the first value into the geometric data unit.
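The minus-one convention can be illustrated with a short round trip. The function names are hypothetical, and the check that a point cloud contains at least one point is an assumption added for the example.

```python
def to_first_value(total_points: int) -> int:
    """Encoder side: the first value is the total number of points minus one."""
    if total_points < 1:
        raise ValueError("assumed: a point cloud contains at least one point")
    return total_points - 1


def from_first_value(first_value: int) -> int:
    """Decoder side: add one to the first value to recover the total."""
    return first_value + 1
```

Under that assumption, the first value 0 stands for a point cloud with exactly one point, so no code value is spent on an empty point cloud.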

It should be understood that the device embodiments and the method embodiments correspond to each other, and similar descriptions may be found in the method embodiments. Specifically, the point cloud coding device 20 shown in Figure 16 may correspond to the subject that performs the point cloud coding method of the embodiments of the present application, and the aforementioned and other operations and/or functions of each unit in the point cloud coding device 20 respectively implement the corresponding processes of the point cloud coding method. For the sake of brevity, they are not repeated here.

The above describes the devices and systems of the embodiments of the present application from the perspective of functional units, in conjunction with the accompanying drawings. It should be understood that a functional unit can be implemented in hardware, by instructions in software, or by a combination of hardware and software units. Specifically, the steps of the method embodiments of the present application can be completed by integrated logic circuits of hardware in a processor and/or by instructions in software form, and the steps of the methods disclosed in the embodiments of the present application can be performed directly by a hardware decoding processor, or by a combination of hardware and software units in a decoding processor. Optionally, the software unit can be located in a storage medium that is mature in the field, such as a random access memory, a flash memory, a read-only memory, a programmable read-only memory, an electrically erasable programmable memory, or a register. The storage medium is located in a memory, and the processor reads the information in the memory and completes the steps of the above method embodiments in conjunction with its hardware.

FIG. 17 is a schematic block diagram of an electronic device provided in an embodiment of the present application.

As shown in FIG. 17 , the electronic device 30 may be a point cloud decoding device or a point cloud encoding device as described in an embodiment of the present application, and the electronic device 30 may include:

A memory 33 and a processor 32. The memory 33 is used to store the computer program 34 and to transmit the program code of the computer program 34 to the processor 32. In other words, the processor 32 can call and run the computer program 34 from the memory 33 to implement the method in the embodiments of the present application.

For example, the processor 32 may be configured to execute the steps in the method 200 according to the instructions in the computer program 34 .

In some embodiments of the present application, the processor 32 may include but is not limited to:

General-purpose processor, digital signal processor (DSP), application-specific integrated circuit (ASIC), field programmable gate array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.

In some embodiments of the present application, the memory 33 includes but is not limited to:

Volatile memory and/or non-volatile memory. The non-volatile memory can be read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM) or flash memory. The volatile memory can be random access memory (RAM), which is used as an external cache. By way of example and not limitation, many forms of RAM are available, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate synchronous DRAM (DDR SDRAM), enhanced synchronous DRAM (ESDRAM), synchronous link DRAM (SLDRAM) and direct Rambus RAM (DR RAM).

In some embodiments of the present application, the computer program 34 may be divided into one or more units, which are stored in the memory 33 and executed by the processor 32 to complete the method provided by the present application. The one or more units may be a series of computer program instruction segments capable of completing specific functions, and the instruction segments are used to describe the execution process of the computer program 34 in the electronic device 30.

As shown in FIG. 17 , the electronic device 30 may further include:

The transceiver 33 may be connected to the processor 32 or the memory 33 .

The processor 32 may control the transceiver 33 to communicate with other devices, specifically, to send information or data to other devices, or to receive information or data sent by other devices. The transceiver 33 may include a transmitter and a receiver. The transceiver 33 may further include an antenna, and the number of antennas may be one or more.

It should be understood that the various components in the electronic device 30 are connected via a bus system, wherein the bus system includes not only a data bus but also a power bus, a control bus and a status signal bus.

As shown in Figure 18, the point cloud encoding and decoding system 40 may include: a point cloud encoder 41 and a point cloud decoder 42, wherein the point cloud encoder 41 is used to execute the point cloud encoding method involved in the embodiment of the present application, and the point cloud decoder 42 is used to execute the point cloud decoding method involved in the embodiment of the present application.

The present application also provides a code stream, which is generated according to the above encoding method.

The present application also provides a computer storage medium on which a computer program is stored, and when the computer program is executed by a computer, the computer can perform the method of the above method embodiment. In other words, the present application embodiment also provides a computer program product containing instructions, and when the instructions are executed by a computer, the computer can perform the method of the above method embodiment.

When software is used for implementation, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions according to the embodiments of the present application are generated in whole or in part. The computer can be a general-purpose computer, a special-purpose computer, a computer network, or another programmable device. The computer instructions can be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another. For example, the computer instructions can be transmitted from one website, computer, server or data center to another website, computer, server or data center by wired means (e.g., coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless means (e.g., infrared, radio, microwave). The computer-readable storage medium can be any available medium that a computer can access, or a data storage device, such as a server or data center, that integrates one or more available media. The available medium can be a magnetic medium (e.g., a floppy disk, a hard disk, a magnetic tape), an optical medium (e.g., a digital video disc (DVD)), or a semiconductor medium (e.g., a solid state drive (SSD)).

Those of ordinary skill in the art will appreciate that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Professional and technical personnel can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of this application.

In the several embodiments provided in the present application, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the device embodiments described above are only schematic: the division into units is only a division by logical function, and other divisions are possible in actual implementation. For instance, multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. In addition, the mutual coupling, direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices or units, and may be electrical, mechanical or of another form.

The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the scheme of this embodiment. For example, each functional unit in each embodiment of the present application may be integrated into a processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.

The above are only specific implementations of the present application, but the protection scope of the present application is not limited thereto. Any change or substitution that a person skilled in the art can readily conceive within the technical scope disclosed in the present application shall fall within the protection scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims

A point cloud decoding method, characterized by comprising:

Decoding the geometric code stream of a point cloud to obtain the total number of nodes of the low-latency and low-complexity coding model L3C2 of the point cloud and the total number of points of the point cloud;

Determining the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2;

When decoding the current node in the L3C2, determining the number of decoded repeated points;

Performing geometric decoding on the current node according to the number of the decoded repeated points and the total number of repeated points in the point cloud.
The method according to claim 1, characterized in that the determining the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2 comprises:

The difference between the total number of points of the point cloud and the total number of nodes of the L3C2 is determined as the total number of repeated points of the point cloud.
The method according to claim 1, characterized in that decoding the current node according to the number of the decoded repeated points and the total number of repeated points in the point cloud comprises:

If the number of the decoded repeated points is equal to the total number of repeated points in the point cloud, then the decoding of the repeated point information of the current node is skipped, and the geometric reconstruction value of the first point included in the current node is determined.
The method according to claim 1, characterized in that if the repeated point information includes the number of repeated points included in the current node, then decoding the current node according to the number of decoded repeated points and the total number of repeated points in the point cloud comprises:

If the number of the decoded repeated points is less than the total number of repeated points in the point cloud, and the current node is the last node in the L3C2, then the decoding of the repeated point information of the current node is skipped;

The difference between the total number of repeated points in the point cloud and the number of decoded repeated points is determined as the number of repeated points included in the current node.
The method according to claim 1, characterized in that decoding the current node according to the number of the decoded repeated points and the total number of repeated points in the point cloud comprises:

If the number of the decoded repeated points is less than the total number of repeated points in the point cloud, and the current node is not the last node in the L3C2, then the repeated point information of the current node is decoded;

Based on the repeated point information, the geometric reconstruction values of N repeated points included in the current node are determined, where N is an integer.
The method according to claim 5, characterized in that the step of determining the geometric reconstruction values of the N repeated points included in the current node based on the repeated point information comprises:

Determine a geometric reconstruction value of a first point in the current node;

Based on the repeated point information and the geometric reconstruction value of the first point, the geometric reconstruction values of the N repeated points are determined.
The method according to claim 6, characterized in that, if N is greater than 0 and the repeated point information includes a first flag, the first flag being used to indicate whether the coordinates of the current point in the current node are the same as those of the previous point in the second coordinate system, then determining the geometric reconstruction values of the N repeated points based on the repeated point information and the geometric reconstruction value of the first point comprises:

For the i-th repeated point among the N repeated points, the repeated point information is parsed to obtain a first flag corresponding to the i-th repeated point, where i is a positive integer less than or equal to N. If i is 1, the i-1-th repeated point is the first point;

Based on the first flag corresponding to the i-th repeated point, a geometric reconstruction value of the i-th repeated point is determined.
The method according to claim 7, characterized in that the step of determining the geometric reconstruction value of the i-th repeated point based on the first flag corresponding to the i-th repeated point comprises:

If the first flag indicates that the coordinates of the i-th repeated point and the i-1-th repeated point in the second coordinate system are the same, then the geometric reconstruction value of the i-1-th repeated point is determined as the geometric reconstruction value of the i-th repeated point.
The method according to claim 7, characterized in that the step of determining the geometric reconstruction value of the i-th repeated point based on the first flag corresponding to the i-th repeated point comprises:

If the first flag indicates that the coordinates of the i-th repeated point and the i-1-th repeated point in the second coordinate system are different, then the geometric reconstruction value of the i-th repeated point is determined.
The method according to any one of claims 5 to 9, characterized in that the method further comprises:

The sum of the number of repeated points included in the current node and the number of decoded repeated points is determined as the new number of decoded repeated points.
The method according to claim 3, 6 or 9, characterized in that the method further comprises:

Determine a coordinate prediction value of a target point in a first coordinate system, wherein the target point is the first point in the current node, or the first point included in the current node, or the i-th repeated point among N repeated points included in the current node;

Determining a coordinate reconstruction value of the target point in the first coordinate system based on the predicted coordinate value of the target point in the first coordinate system;

Determine a coordinate reconstruction value of the target point in a second coordinate system based on the coordinate reconstruction value of the target point in the first coordinate system;

The coordinate reconstruction value of the target point in the second coordinate system is determined as the geometric reconstruction value of the target point.
The method according to claim 11, characterized in that determining the predicted coordinate value of the target point in the first coordinate system comprises:

Decoding the geometric code stream to obtain a prediction mode corresponding to the target point;

Based on the prediction mode, a coordinate prediction value of the target point in the first coordinate system is determined.
The method according to claim 11, characterized in that the step of determining the coordinate reconstruction value of the target point in the first coordinate system based on the coordinate prediction value of the target point in the first coordinate system comprises:

Decoding the geometric code stream to obtain a quantized coordinate residual value of the target point in the first coordinate system;

Dequantizing the quantized coordinate residual value to obtain the coordinate residual value of the target point in the first coordinate system;

Based on the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system, a coordinate reconstruction value of the target point in the first coordinate system is obtained.
The method according to claim 13, characterized in that the step of obtaining the coordinate reconstruction value of the target point in the first coordinate system based on the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system comprises:

The sum of the predicted coordinate value and the residual coordinate value of the target point in the first coordinate system is determined as the reconstructed coordinate value of the target point in the first coordinate system.
The method according to claim 11, characterized in that the step of determining the coordinate reconstruction value of the target point in the second coordinate system based on the coordinate reconstruction value of the target point in the first coordinate system comprises:

Performing coordinate transformation on the coordinate reconstruction value of the target point in the first coordinate system to obtain the coordinate prediction value of the target point in the second coordinate system;

Decoding the geometric code stream to obtain a coordinate residual value of the target point in the second coordinate system;

Based on the coordinate prediction value and the coordinate residual value of the target point in the second coordinate system, the coordinate reconstruction value of the target point in the second coordinate system is obtained.
The method according to claim 15, characterized in that obtaining the coordinate reconstruction value of the target point in the second coordinate system based on the coordinate prediction value and the coordinate residual value of the target point in the second coordinate system comprises:

The sum of the predicted coordinate value and the residual coordinate value of the target point in the second coordinate system is determined as the reconstructed coordinate value of the target point in the second coordinate system.
The method according to any one of claims 1 to 9, characterized in that the decoding of the geometric code stream of the point cloud to obtain the total number of nodes of the low-latency and low-complexity coding model L3C2 of the point cloud comprises:

Decode the geometric code stream of the point cloud to obtain the number of single chains included in the L3C2 and the number of nodes included in each single chain;

Based on the number of single chains and the number of nodes included in each single chain, the total number of nodes of the L3C2 is obtained.
The method according to any one of claims 1 to 9, characterized in that decoding the geometric code stream of the point cloud to obtain the total number of points of the point cloud comprises:

Decode the geometric data unit of the point cloud to obtain the total number of points of the point cloud.
The method according to claim 18, characterized in that the decoding of the geometric data unit to obtain the total number of points of the point cloud comprises:

Decoding the geometric data unit to obtain a first value, where the first value is the number of points in the point cloud minus one;

One is added to the first value to obtain the total number of points in the point cloud.
A point cloud encoding method, characterized by comprising:

Determine the structure of the low-latency and low-complexity coding model L3C2 of the point cloud;

Determine the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud;

Determine the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2;

When encoding the current node in the L3C2, determining the number of encoded repeated points;

The current node is encoded according to the number of the encoded repeated points and the total number of repeated points in the point cloud.
The method according to claim 20, characterized in that the determining the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2 comprises:

The difference between the total number of points of the point cloud and the total number of nodes of the L3C2 is determined as the total number of repeated points of the point cloud.
The method according to claim 20, characterized in that encoding the current node according to the number of the encoded repeated points and the total number of repeated points in the point cloud comprises:

If the number of the encoded repeated points is equal to the total number of repeated points in the point cloud, then determining and encoding the repeated point information of the current node is skipped;

Determine a coordinate residual value of a first point included in the current node, and write the coordinate residual value of the first point into a geometric code stream of the point cloud.
The method according to claim 20, characterized in that if the repeated point information includes the number of repeated points included in the current node, then encoding the current node according to the number of encoded repeated points and the total number of repeated points in the point cloud comprises:

If the number of the encoded repeated points is less than the total number of repeated points in the point cloud, and the current node is the last node in the L3C2, then skip encoding the number of repeated points included in the current node;

Determine the coordinate residual value of the first point in the current node;

The coordinate residual value of the first point and the repeated point information of the current node are written into the geometric code stream.
The method according to claim 20, characterized in that encoding the current node according to the number of the encoded repeated points and the total number of repeated points in the point cloud comprises:

If the number of the encoded repeated points is less than the total number of repeated points in the point cloud, and the current node is not the last node in the L3C2, the repeated point information of the current node is determined;

Determine the coordinate residual value of the first point in the current node;

The coordinate residual value of the first point and the repeated point information of the current node are written into the geometric code stream of the point cloud.
The method according to claim 24, characterized in that, if N is greater than 0 and the repeated point information includes a first flag, the first flag being used to indicate whether the coordinates of the current point in the current node are the same as those of the previous point in the second coordinate system, then determining the repeated point information of the current node comprises:

Determine the number N of repeated points included in the current node, where N is an integer;

Determine the first flags respectively corresponding to the N repeated points based on the coordinate value of the first point included in the current node in the second coordinate system and the coordinate values of the N repeated points in the second coordinate system;

Based on the number N of repeated points included in the current node and the first flags respectively corresponding to the N repeated points, the repeated point information of the current node is determined.
The method according to claim 25, characterized in that the determining the first flags respectively corresponding to the N repeated points based on the coordinate value of the first point included in the current node in the second coordinate system and the coordinate values of the N repeated points in the second coordinate system comprises:

For the i-th repeated point among the N repeated points, based on the coordinate value of the i-1-th repeated point in the second coordinate system and the coordinate value of the i-th repeated point in the second coordinate system, determine the first flag corresponding to the i-th repeated point, where i is a positive integer less than or equal to N. If i is 1, the i-1-th repeated point is the first point.
The method according to claim 26, characterized in that the determining the first flag corresponding to the i-th repeated point based on the coordinate value of the i-1-th repeated point in the second coordinate system and the coordinate value of the i-th repeated point in the second coordinate system comprises:

If the coordinate value of the i-th repeated point in the second coordinate system is the same as the coordinate value of the i-1-th repeated point in the second coordinate system, the value of the first flag is set to a first numerical value, and the first numerical value indicates that the coordinate value of the i-th repeated point in the second coordinate system is the same as the coordinate value of the i-1-th repeated point in the second coordinate system.
The method according to claim 26, characterized in that the determining the first flag corresponding to the i-th repeated point based on the coordinate value of the i-1-th repeated point in the second coordinate system and the coordinate value of the i-th repeated point in the second coordinate system comprises:

If the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the i-1-th repeated point in the second coordinate system, the value of the first flag is set to a second numerical value, and the second numerical value indicates that the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the i-1-th repeated point in the second coordinate system.
The method according to claim 28, characterized in that if the coordinate value of the i-th repeated point in the second coordinate system is different from the coordinate value of the i-1-th repeated point in the second coordinate system, the method further comprises:

Determine the coordinate residual value of the i-th repeated point;

The coordinate residual value of the i-th repeated point is written into the geometric code stream.
The method according to any one of claims 24 to 29, characterized in that the method further comprises:

The sum of the number of repeated points included in the current node and the number of encoded repeated points is determined as the new number of encoded repeated points.
The method according to claim 22, 25 or 29, characterized in that the method further comprises:

Determine a coordinate prediction value of a target point in a first coordinate system, wherein the target point is the first point in the current node, or the first point included in the current node, or the i-th repeated point among N repeated points included in the current node;

Determine a residual value of the target point in the first coordinate system based on a predicted value of the coordinates of the target point in the first coordinate system and a coordinate value of the target point in the first coordinate system;

Obtaining a coordinate reconstruction value of the target point in the first coordinate system based on the predicted coordinate value of the target point in the first coordinate system;

Performing coordinate transformation on the coordinate reconstruction value of the target point in the first coordinate system to obtain the coordinate reconstruction value of the target point in the second coordinate system;

Based on the coordinate reconstruction value and the coordinate value of the target point in the second coordinate system, a coordinate residual value of the target point in the second coordinate system is determined.
The method according to claim 31, characterized in that the method further comprises:

Quantize at least one of the coordinate residual value of the target point in the first coordinate system and the coordinate residual value of the target point in the second coordinate system, and write the quantized coordinate residual value into the geometric code stream.
The method according to claim 31, characterized in that determining the coordinate prediction value of the target point in the first coordinate system comprises:

Determining a prediction mode corresponding to the target point;

Based on the prediction mode, a coordinate prediction value of the target point in the first coordinate system is determined.
The method according to claim 33, characterized in that the method further comprises:

The prediction mode corresponding to the target point is written into the geometric code stream.
The method according to claim 32, characterized in that the step of determining the coordinate reconstruction value of the target point in the first coordinate system based on the coordinate prediction value of the target point in the first coordinate system comprises:

Dequantizing the quantized coordinate residual value of the target point in the first coordinate system to obtain the coordinate residual value of the target point in the first coordinate system;

Based on the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system, a coordinate reconstruction value of the target point in the first coordinate system is obtained.
The method according to claim 35, characterized in that the step of obtaining the coordinate reconstruction value of the target point in the first coordinate system based on the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system comprises:

The sum of the coordinate prediction value and the coordinate residual value of the target point in the first coordinate system is determined as the coordinate reconstruction value of the target point in the first coordinate system.
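The dequantize-then-add reconstruction of claims 35 and 36 can be sketched for a single coordinate component. The uniform scalar quantizer with round-to-nearest and the step size `step` are assumptions for illustration; the claims do not specify the quantizer, and the function names are hypothetical.

```python
def quantize(residual: int, step: int) -> int:
    """Assumed uniform quantizer: round |residual| / step to the nearest level."""
    sign = -1 if residual < 0 else 1
    return sign * ((abs(residual) + step // 2) // step)


def dequantize(level: int, step: int) -> int:
    """Inverse of the assumed quantizer: scale the level back by the step."""
    return level * step


def reconstruct(prediction: int, level: int, step: int) -> int:
    """Reconstruction = prediction + dequantized residual."""
    return prediction + dequantize(level, step)


level = quantize(7, 4)                  # residual 7 with step 4
recon = reconstruct(100, level, 4)      # prediction 100
```

Because the encoder reconstructs from the dequantized residual rather than the original one, it tracks the same reconstructed value the decoder will obtain.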
The method according to any one of claims 20 to 29, characterized in that determining the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud comprises:

Determine the number of single chains included in the L3C2 and the number of nodes included in each single chain;

Based on the number of single chains and the number of nodes included in each single chain, the total number of nodes of the L3C2 is obtained.
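One plausible reading of this step is that the total number of nodes is the sum of the per-chain node counts, and that the total number of repeated points is the total number of points minus the total number of nodes. Both formulas are assumptions here, since the claims only state which quantities the results are based on, and the function names are hypothetical.

```python
def total_l3c2_nodes(num_chains: int, nodes_per_chain: list) -> int:
    """Assumed rule: total nodes = sum of node counts over all single chains."""
    if num_chains != len(nodes_per_chain):
        raise ValueError("expected one node count per single chain")
    return sum(nodes_per_chain)


def total_repeated_points(total_points: int, total_nodes: int) -> int:
    """Assumed rule: every node holds one point plus its repeated points."""
    if total_points < total_nodes:
        raise ValueError("a point cloud cannot have fewer points than nodes")
    return total_points - total_nodes


nodes = total_l3c2_nodes(3, [4, 5, 1])
repeats = total_repeated_points(13, nodes)
```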
The method according to claim 37, characterized in that the method further comprises:

The number of single chains included in the L3C2 and the number of nodes included in each single chain are written into the geometric code stream of the point cloud.
The method according to any one of claims 20 to 29, characterized in that the method further comprises:

The total number of points of the point cloud is written into the geometry data unit.
The method according to claim 39, characterized in that writing the total number of points of the point cloud into the geometry data unit comprises:

Subtract 1 from the total number of points in the point cloud to obtain a first value;

The first value is written into the geometry data unit.
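The minus-one signalling in this claim can be sketched as a pair of helpers. The function names are hypothetical, and the decoder-side step of adding 1 back is the natural inverse rather than something stated in this claim.

```python
def encode_point_count(total_points: int) -> int:
    """Return the first value to be written: total number of points minus 1."""
    if total_points < 1:
        raise ValueError("a point cloud must contain at least one point")
    return total_points - 1


def decode_point_count(first_value: int) -> int:
    """Assumed inverse on the decoder side: add 1 back to the parsed value."""
    return first_value + 1


first_value = encode_point_count(1000)
restored = decode_point_count(first_value)
```

A likely motivation is that a non-empty point cloud never has zero points, so offsetting by one avoids spending a codeword on a value that cannot occur.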
A point cloud decoding device, characterized in that it comprises:

A first decoding unit, used to decode the geometric code stream of the point cloud to obtain the total number of nodes of the low-latency and low-complexity coding model L3C2 of the point cloud and the total number of points of the point cloud;

A point number determination unit, used to determine the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2;

A repeated point determination unit, used to determine the number of decoded repeated points when decoding the current node in the L3C2;

A second decoding unit, used to decode the current node according to the number of the decoded repeated points and the total number of repeated points in the point cloud.
A point cloud encoding device, characterized in that it comprises:

A structure determination unit, used to determine the low-latency and low-complexity coding model L3C2 structure of the point cloud;

A point number determination unit, used to determine the total number of nodes of L3C2 of the point cloud and the total number of points of the point cloud;

A repeated point determination unit, used to determine the total number of repeated points of the point cloud according to the total number of points of the point cloud and the total number of nodes of the L3C2;

A calculation unit, configured to determine the number of encoded repeated points when encoding the current node in the L3C2;

An encoding unit, used to encode the current node according to the number of the encoded repeated points and the total number of repeated points in the point cloud.
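The interplay of the units above can be sketched as a loop that stops signalling repeated-point information once every repeated point in the point cloud has been coded. This is a simplified model under stated assumptions: each node is reduced to its repeated-point count, the "code stream" is a plain list, the total number of repeated points is taken as total points minus total nodes, and the names `encode_nodes` and `decode_nodes` are hypothetical.

```python
def encode_nodes(node_repeat_counts, total_points):
    """Emit (node_index, repeat_count) pairs; repeat_count is None when skipped."""
    total_repeats = total_points - len(node_repeat_counts)  # assumed rule
    encoded_repeats = 0
    stream = []
    for idx, n in enumerate(node_repeat_counts):
        if encoded_repeats < total_repeats:
            # Repeated points remain, so this node's count must be signalled.
            stream.append((idx, n))
            encoded_repeats += n
        else:
            # All repeated points are already coded; nothing to signal.
            stream.append((idx, None))
    return stream


def decode_nodes(stream, total_points):
    """Mirror of encode_nodes: recover the per-node repeated-point counts."""
    total_repeats = total_points - len(stream)
    decoded_repeats = 0
    counts = []
    for _, n in stream:
        if decoded_repeats < total_repeats:
            counts.append(n)
            decoded_repeats += n
        else:
            counts.append(0)  # inferred without reading the stream
    return counts


counts = [0, 2, 1, 0, 0]
stream = encode_nodes(counts, total_points=8)
decoded = decode_nodes(stream, total_points=8)
```

In this example the last two nodes carry no repeated-point information, because the three repeated points have all been coded by the third node.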
An electronic device, characterized in that it comprises: a processor and a memory;

The memory is used to store computer programs;

The processor is used to call and run the computer program stored in the memory to execute the method according to any one of claims 1 to 19 or 20 to 40.
A computer-readable storage medium, characterized in that it is used to store a computer program, wherein the computer program enables a computer to execute the method according to any one of claims 1 to 19 or 20 to 30.