CN112565764B

CN112565764B - A method for inter-frame encoding and decoding of point cloud geometric information

Info

Publication number: CN112565764B
Application number: CN202011396370.3A
Authority: CN
Inventors: 张伟; 杨付正; 聂佳明; 陈天
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2020-12-03
Filing date: 2020-12-03
Publication date: 2022-10-04
Anticipated expiration: 2040-12-03
Also published as: CN112565764A

Abstract

The embodiment of the invention provides a point cloud geometric information interframe coding and decoding method, which comprises the steps of obtaining a frame point cloud before a current frame point cloud as a reference frame point cloud; respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, then dividing the divided nodes, and obtaining a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud after iterative division to leaf nodes; acquiring the occupancy condition of each reference node, and acquiring the occupancy condition of a first neighbor node of the reference node when the occupancy condition of the reference node corresponding to the coding node is occupied; and determining a context model of the occupation code of the node to be coded based on the occupation situation and carrying out entropy coding according to the context model to obtain a binary code stream. The invention uses the context model of the interframe to carry out entropy coding when the interframe prediction occupies, otherwise uses the context model of the intraframe to carry out entropy coding without motion compensation, thereby reducing the complexity of interframe coding and decoding and improving the performance of geometric entropy coding.

Description

A method for inter-frame encoding and decoding of point cloud geometric information

技术领域technical field

本发明属于编解码技术领域，具体涉及一种点云几何信息帧间编码及解码方法。The invention belongs to the technical field of encoding and decoding, and in particular relates to a method for inter-frame encoding and decoding of point cloud geometric information.

背景技术Background technique

随着点云技术的不断发展，点云数据的压缩编码成为重要的研究问题。目前，国内数字音视频编解码技术标准工作组(AVS，Audio Video coding Standard Workgroup ofChina)和国际标准化组织中的运动图像专家组(MPEG，Moving Picture Experts Group)均在制订点云编码的标准。现阶段，AVS的点云编码过程如图1所示，首先对几何信息进行坐标转换，使点云全都包含在一个bounding box(包围盒)中。然后再进行量化，这一步量化主要起到缩放的作用，由于量化取整，使得一部分点的几何信息相同，根据参数来决定是否移除重复点，量化和移除重复点这一过程又被称为体素化。将包围盒称为根节点，对根节点进行八叉树划分即将包围盒八等分为8个子立方体，各子立方体称为根节点的子节点，对这八个子节点各以1位比特来表示其是否被占据即是否包含点云中的点，以0表示子节点不被占据，以1表示为子节点被占据，该比特被称为占位码，并经过熵编码生成二进制码流。对非空的(包含点云中的点)的子立方体继续进行八叉树划分，直到划分得到的叶子结点为1x1x1的单位立方体时停止划分，整个过程采用广度优先的遍历顺序，对叶子结点中的点进行编码生成二进制码流。几何编码完成后对几何信息进行重建。属性信息的编码主要针对颜色信息进行。首先，将颜色信息从RGB颜色空间转换到YUV颜色空间。然后利用原始点云对几何重建点云进行重新着色，使得未编码的属性信息与重建的几何信息对应起来。在颜色信息编码中，用莫顿码对点云进行排序后，利用几何空间关系搜索待预测点的最近邻，并利用所找到邻居的重建属性值对待预测点进行插值预测得到预测属性值，然后将真实属性值和预测属性值进行差分得到预测残差，最后对预测残差进行量化并编码，生成二进制码流。解码过程如图2所示，解码过程与编码过程互逆。With the continuous development of point cloud technology, the compression coding of point cloud data has become an important research problem. At present, both the Audio Video coding Standard Workgroup of China (AVS, Audio Video coding Standard Workgroup of China) and the Moving Picture Experts Group (MPEG, Moving Picture Experts Group) in the International Organization for Standardization are formulating standards for point cloud coding. At this stage, the point cloud encoding process of AVS is shown in Figure 1. First, coordinate transformation is performed on the geometric information, so that all point clouds are contained in a bounding box (bounding box). Then quantization is carried out. This step of quantization mainly plays the role of scaling. Due to the rounding of quantization, the geometric information of some points is the same, and whether to remove duplicate points is determined according to the parameters. The process of quantization and removal of duplicate points is also called for voxelization. The bounding box is called the root node, and the octree division of the root node is to divide the bounding box into eight sub-cubes. Whether it is occupied means whether it contains a point in the point cloud. 0 means that the child node is not occupied, and 1 means that the child node is occupied. This bit is called the occupancy code, and is entropy encoded to generate a binary code stream. Continue to perform octree division on non-empty sub-cubes (including points in the point cloud) until the divided leaf nodes are 1x1x1 unit cubes, and stop dividing. The point in the point is encoded to generate a binary code stream. After the geometry encoding is completed, the geometry information is reconstructed. The encoding of attribute information is mainly carried out for color information. First, convert the color information from RGB color space to YUV color space. The geometrically reconstructed point cloud is then recolored using the original point cloud so that the unencoded attribute information corresponds to the reconstructed geometric information. In the color information coding, after the point cloud is sorted by Morton code, the nearest neighbor of the point to be predicted is searched by using the geometric space relationship, and the predicted attribute value is obtained by interpolation prediction of the point to be predicted by using the reconstructed attribute value of the found neighbor, and then the predicted attribute value is obtained. The prediction residual is obtained by differentiating the real attribute value and the predicted attribute value, and finally the prediction residual is quantized and encoded to generate a binary code stream. The decoding process is shown in Figure 2, and the decoding process and the encoding process are inverse to each other.

在现阶段的AVS点云编码平台中还没有帧间技术，只有上述的帧内技术，因此只能利用空间上的相关性去除冗余信息，无法利用时域上的相关性去除冗余信息。In the current AVS point cloud coding platform, there is no inter-frame technology, only the above-mentioned intra-frame technology, so only the spatial correlation can be used to remove redundant information, and the temporal correlation cannot be used to remove redundant information.

发明内容SUMMARY OF THE INVENTION

为了解决现有技术中存在的上述问题，本发明提供了一种点云几何信息帧间编码及解码方法。本发明要解决的技术问题通过以下技术方案实现：In order to solve the above problems existing in the prior art, the present invention provides an inter-frame encoding and decoding method for point cloud geometric information. The technical problem to be solved by the present invention is realized by the following technical solutions:

第一方面，本发明提供的一种点云几何信息帧间编码方法包括：In a first aspect, a method for inter-frame coding of point cloud geometric information provided by the present invention includes:

获取当前帧点云前的帧点云作为当前帧点云的参考帧点云；Obtain the frame point cloud before the current frame point cloud as the reference frame point cloud of the current frame point cloud;

对当前帧点云进行体素化处理，获得体素化处理后的当前帧点云；Perform voxelization on the point cloud of the current frame to obtain the point cloud of the current frame after voxelization;

对当前帧点云以及参考帧点云分别进行八叉树划分，对划分得到的子节点继续进行八叉树划分，直到划分到叶子节点，得到当前帧点云的第一八叉树结构以及参考帧点云的第二八叉树结构；The current frame point cloud and the reference frame point cloud are divided into octrees respectively, and the divided sub-nodes are divided into octrees until they are divided into leaf nodes, and the first octree structure and reference of the current frame point cloud are obtained. The second octree structure of the frame point cloud;

其中，所述第一八叉树结构包括多个节点，每个节点对应一个在第二八叉树结构中的参考节点；Wherein, the first octree structure includes a plurality of nodes, and each node corresponds to a reference node in the second octree structure;

针对第一八叉树结构的每个待编码节点，获取第二八叉树结构中对应参考节点的占位情况；For each node to be encoded in the first octree structure, obtain the occupancy situation of the corresponding reference node in the second octree structure;

针对第一八叉树结构的每个待编码节点，当与该待编码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况；For each node to be coded in the first octree structure, when the occupancy situation of the reference node corresponding to the node to be coded is occupied, obtain the occupancy situation of the first neighbor node adjacent to the reference node;

基于所述第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型；Based on the occupancy situation of the first neighbor node, determine the context model of the occupancy code of the node to be encoded;

对确定上下文模型之后的待编码节点占位码进行熵编码，获得二进制码流。Entropy encoding is performed on the node occupancy code to be encoded after the context model is determined to obtain a binary code stream.

可选的，所述针对第一八叉树结构的每个待编码节点，当与该待编码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况的步骤包括：Optionally, for each node to be encoded in the first octree structure, when the occupancy status of the reference node corresponding to the node to be encoded is occupied, obtain the first neighbor node adjacent to the reference node. The steps for a placeholder situation include:

针对第一八叉树结构的每个待编码节点，当与该待编码节点对应的参考节点的占位情况为被占据时，获取与参考节点共面的第一邻居节点的占位情况。For each node to be coded in the first octree structure, when the occupancy status of the reference node corresponding to the to-be-coded node is occupied, the occupancy status of the first neighbor node coplanar with the reference node is acquired.

可选的，在所述针对第一八叉树结构的每个待编码节点，当与该待编码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的步骤之前，所述点云几何信息帧间编码方法还包括：Optionally, at each node to be encoded for the first octree structure, when the occupancy status of the reference node corresponding to the node to be encoded is occupied, obtain the first neighbor adjacent to the reference node. Before the step of the node, the method for inter-frame encoding of point cloud geometric information further includes:

针对第一八叉树结构的每个待编码节点，判断待编码节点对应的参考节点是否被占据，如果否，则获取与该待编码节点共面、共线、共点以及与待编码节点的父节点共面中任一或者组合的第二邻居节点的占位情况；For each node to be encoded in the first octree structure, determine whether the reference node corresponding to the node to be encoded is occupied; The occupancy of any or combination of the second neighbor nodes in the parent node coplanarity;

基于第二邻居节点的占位情况，确定所述待编码节点占位码的上下文模型。Based on the occupancy situation of the second neighbor node, the context model of the occupancy code of the node to be encoded is determined.

可选的，在所述基于所述第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型的步骤之前，所述点云几何信息帧间编码方法还包括：Optionally, before the step of determining the context model of the occupancy code of the node to be encoded based on the occupancy situation of the first neighbor node, the method for inter-frame coding of point cloud geometric information further includes:

获取与该待编码节点共面、共线、共点以及与待编码节点的父节点共面中任一或者组合的第二邻居节点的占位情况；Obtain the occupancy status of the second neighbor node that is coplanar, collinear, co-point with the node to be encoded, and coplanar with the parent node of the node to be encoded or combined;

所述基于所述第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型的步骤包括：The step of determining the context model of the occupancy code of the node to be encoded based on the occupancy situation of the first neighbor node includes:

基于第一邻居节点以及第二邻居节点的占位情况，确定所述待编码节点占位码的上下文模型。Based on the occupancy conditions of the first neighbor node and the second neighbor node, a context model of the occupancy code of the node to be encoded is determined.

获取与待编码节点对应的参考节点共面、共线、共点的第三邻居节点的占位情况；Obtain the occupancy status of the third neighbor node that is coplanar, colinear, and copointed with the reference node corresponding to the node to be encoded;

将所述第三邻居节点的占位情况组成集合；forming a set of the occupancy conditions of the third neighbor nodes;

在所述集合中任选第一预设数量个元素作为目标元素；Selecting a first preset number of elements in the set as target elements;

所述基于所述第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型步骤包括：The step of determining the context model of the occupancy code of the node to be encoded based on the occupancy situation of the first neighbor node includes:

基于所述目标元素，确定所述待编码节点占位码的上下文模型。Based on the target element, a context model of the placeholder code of the node to be encoded is determined.

可选的，在所述集合中任选预设数量个元素作为目标元素的步骤之后，所述点云几何信息帧间编码方法还包括：Optionally, after the step of selecting a preset number of elements in the set as target elements, the method for inter-frame coding of point cloud geometric information further includes:

在所述目标元素中任选第二预设数量个元素，将第二预设数量个元素作为确定同一上下文模型的元素；Optionally select a second preset number of elements in the target elements, and use the second preset number of elements as elements for determining the same context model;

其中，第一预设数量大于第二预设数量；Wherein, the first preset number is greater than the second preset number;

基于所述目标元素，确定所述待编码节点占位码的上下文模型的步骤包括：Based on the target element, the step of determining the context model of the placeholder code of the node to be encoded includes:

基于目标元素中的第二数量个元素，确定所述待编码节点占位码的上下文模型。Based on the second number of elements in the target element, a context model of the placeholder code of the node to be encoded is determined.

可选的，在所述获取与参考节点共面的第一邻居节点的占位情况的步骤之后，所述点云几何信息帧间编码方法还包括：Optionally, after the step of acquiring the occupancy status of the first neighbor node coplanar with the reference node, the method for inter-frame encoding of point cloud geometric information further includes:

获取所述第一邻居节点的位置信息；obtaining the location information of the first neighbor node;

基于所述第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型的步骤包括：Based on the occupancy situation of the first neighbor node, the step of determining the context model of the occupancy code of the node to be encoded includes:

基于所述第一邻居节点的位置信息以及占位情况，确定所述待编码节点占位码的上下文模型。Based on the location information and occupancy status of the first neighbor node, a context model of the occupancy code of the node to be encoded is determined.

第二方面，本发明提供的一种点云几何信息帧间解码方法包括：In a second aspect, a method for inter-frame decoding of point cloud geometric information provided by the present invention includes:

接收二进制码流；receive binary stream;

其中，二进制码流包括待解码节点占位码；Wherein, the binary code stream includes the node occupancy code to be decoded;

针对所述每个待解码节点，当与该待解码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况；For each node to be decoded, when the occupancy situation of the reference node corresponding to the node to be decoded is occupied, obtain the occupancy situation of the first neighbor node adjacent to the reference node;

基于所述第一邻居节点的占位情况，确定所述待解码节点占位码的上下文模型；determining the context model of the occupancy code of the node to be decoded based on the occupancy situation of the first neighbor node;

对确定上下文模型之后的待解码节点占位码进行熵解码。Entropy decoding is performed on the node occupancy code to be decoded after the context model is determined.

可选的，所述针对第一八叉树结构的每个待解码节点，当与该待解码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况的步骤包括：Optionally, for each node to be decoded in the first octree structure, when the occupancy status of the reference node corresponding to the node to be decoded is occupied, obtain the first neighbor node adjacent to the reference node. The steps for a placeholder situation include:

针对第一八叉树结构的每个待解码节点，当与该待解码节点对应的参考节点的占位情况为被占据时，获取与参考节点共面的第一邻居节点的占位情况。For each node to be decoded in the first octree structure, when the occupancy status of the reference node corresponding to the to-be-decoded node is occupied, the occupancy status of the first neighbor node coplanar with the reference node is acquired.

可选的，在所述针对第一八叉树结构的每个待解码节点，当与该待解码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的步骤之前，所述点云几何信息帧间解码方法还包括：Optionally, at each node to be decoded for the first octree structure, when the occupancy status of the reference node corresponding to the node to be decoded is occupied, obtain the first neighbor adjacent to the reference node. Before the step of the node, the method for inter-frame decoding of point cloud geometric information further includes:

判断待解码节点对应的参考节点是否被占据，如果否，则获取与该待解码节点共面、共线、共点以及与待解码节点的父节点共面中任一或者组合的第二邻居节点的占位情况；Determine whether the reference node corresponding to the node to be decoded is occupied, if not, obtain the second neighbor node that is coplanar, collinear, co-point with the node to be decoded, and coplanar with the parent node of the node to be decoded or combined occupancy;

基于第二邻居节点的占位情况，确定所述待解码节点占位码的上下文模型。Based on the occupancy situation of the second neighbor node, the context model of the occupancy code of the node to be decoded is determined.

本发明实施例提供的一种点云几何信息帧间编码方法，通过获取当前帧点云前的帧点云作为参考帧点云；对当前帧点云及参考帧点云分别进行八叉树划分，然后对划分后的节点进行划分，迭代划分到叶子节点结束，得到当前帧点云的第一八叉树结构及参考帧点云的第二八叉树结构；获取每个参考节点的占位情况；当与编码节点对应的参考节点的占位情况为被占据时，获取参考节点的第一邻居节点的占位情况；基于所述占位情况，确定待编码节点占位码的上下文模型并按该上下文模型进行熵编码，获得二进制码流。本发明在帧间预测为占据时用帧间的上下文模型进行熵编码，否则用帧内的上下文模型进行熵编码，无需运动补偿，因此可以降低帧间编解码复杂度，并提高几何熵编码性能。An embodiment of the present invention provides a method for inter-frame encoding of point cloud geometric information. By acquiring the frame point cloud before the current frame point cloud as the reference frame point cloud; the current frame point cloud and the reference frame point cloud are divided into octrees respectively. , and then divide the divided nodes, iteratively divide to the end of the leaf node, and obtain the first octree structure of the point cloud of the current frame and the second octree structure of the point cloud of the reference frame; obtain the occupancy of each reference node situation; when the occupancy situation of the reference node corresponding to the coding node is occupied, obtain the occupancy situation of the first neighbor node of the reference node; based on the occupancy situation, determine the context model of the occupancy code of the node to be encoded and Entropy coding is performed according to the context model to obtain a binary code stream. In the present invention, the inter-frame context model is used for entropy coding when the inter-frame prediction is occupied, otherwise, the intra-frame context model is used for entropy coding without motion compensation, so the complexity of inter-frame coding and decoding can be reduced and the geometric entropy coding performance can be improved. .

以下将结合附图及实施例对本发明做进一步详细说明。The present invention will be further described in detail below with reference to the accompanying drawings and embodiments.

附图说明Description of drawings

图1是本发明实施例提供的AVS编码过程的流程示意图；1 is a schematic flowchart of an AVS encoding process provided by an embodiment of the present invention;

图2是本发明实施例提供的AVS解码过程示意图；2 is a schematic diagram of an AVS decoding process provided by an embodiment of the present invention;

图3为本发明实施例提供的帧间技术应用于AVS编码过程的流程图；FIG. 3 is a flowchart of the application of the inter-frame technology provided by an embodiment of the present invention to an AVS encoding process;

图4为本发明实施例提供的帧间技术应用于AVS解码过程的流程图；FIG. 4 is a flowchart of the application of the inter-frame technology provided by an embodiment of the present invention to an AVS decoding process;

图5为本发明实施例提供的一种点云几何信息帧间编码方法的流程图；5 is a flowchart of a method for inter-frame encoding of point cloud geometric information provided by an embodiment of the present invention;

图6为参考帧对应节点的邻居节点的示意图；6 is a schematic diagram of a neighbor node of a reference frame corresponding node;

图7为本发明实施例提供的AVS帧间预测流程图；7 is a flowchart of an AVS inter-frame prediction provided by an embodiment of the present invention;

图8为本发明实施例提供的参考节点存在6个共面邻居节点的示意图；8 is a schematic diagram of 6 coplanar neighbor nodes existing in a reference node according to an embodiment of the present invention;

图9为本发明实施例提供的参考节点存在12个共线邻居节点的示意图；9 is a schematic diagram of 12 collinear neighbor nodes existing in a reference node provided by an embodiment of the present invention;

图10为本发明实施例提供的参考节点存在8个共点邻居节点的示意图；10 is a schematic diagram of 8 common-point neighbor nodes existing in a reference node provided by an embodiment of the present invention;

图11为本发明实施例提供的一种点云几何信息帧间解码方法的流程图。FIG. 11 is a flowchart of a method for inter-frame decoding of point cloud geometric information according to an embodiment of the present invention.

具体实施方式Detailed ways

下面结合具体实施例对本发明做进一步详细的描述，但本发明的实施方式不限于此。The present invention will be described in further detail below with reference to specific embodiments, but the embodiments of the present invention are not limited thereto.

实施例一Example 1

参考图3，在AVS编码过程中帧间技术，从而的带本发明实施例提供的一种点云几何信息帧间编码方法，进行熵编码之前可以帧间预测当前帧点云，之后进行熵编码。Referring to FIG. 3, in the process of AVS encoding, the inter-frame technology is used, so that a method for inter-frame encoding of point cloud geometric information provided by an embodiment of the present invention is used. .

如图4所示，本发明实施例提供的一种点云几何信息帧间编码方法包括：As shown in FIG. 4 , a method for inter-frame coding of point cloud geometric information provided by an embodiment of the present invention includes:

S1a，获取当前帧点云前的帧点云作为当前帧点云的参考帧点云；S1a, obtaining the frame point cloud before the current frame point cloud as the reference frame point cloud of the current frame point cloud;

其中，当前帧点云前的帧点云可以是当前帧点云的前一帧点云，或者前N帧点云中的一帧，本发明在此处不做限制。The frame point cloud before the current frame point cloud may be the previous frame point cloud of the current frame point cloud, or a frame in the previous N frame point clouds, which is not limited in the present invention.

S2a，对当前帧点云进行体素化处理，获得体素化处理后的当前帧点云；S2a, perform voxelization processing on the point cloud of the current frame to obtain the point cloud of the current frame after voxelization processing;

S3a，对当前帧点云以及参考帧点云分别进行八叉树划分，对划分得到的子节点继续进行八叉树划分，直到划分到叶子节点，得到当前帧点云的第一八叉树结构以及参考帧点云的第二八叉树结构；S3a, perform octree division on the point cloud of the current frame and the point cloud of the reference frame respectively, and continue to perform octree division on the sub-nodes obtained by division until the leaf nodes are divided to obtain the first octree structure of the point cloud of the current frame and the second octree structure of the reference frame point cloud;

S4a，针对第一八叉树结构的每个待编码节点，获取第二八叉树结构中对应参考节点的占位情况；S4a, for each node to be encoded in the first octree structure, obtain the occupancy situation of the corresponding reference node in the second octree structure;

S5a，针对第一八叉树结构的每个待编码节点，当与该待编码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况；S5a, for each node to be coded in the first octree structure, when the occupancy situation of the reference node corresponding to the node to be coded is occupied, obtain the occupancy situation of the first neighbor node adjacent to the reference node ;

参考图5，图5左侧为参考帧，图5右侧为当前帧，当前帧中待编码节点为灰色块，参考帧中与待编码节点对应的参考节点也为灰色块，空白块为第一邻居节点。Referring to Figure 5, the left side of Figure 5 is the reference frame, the right side of Figure 5 is the current frame, the node to be encoded in the current frame is a gray block, the reference node corresponding to the node to be encoded in the reference frame is also a gray block, and the blank block is the first. a neighbor node.

S6a，基于第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型；S6a, based on the occupancy situation of the first neighbor node, determine the context model of the occupancy code of the node to be encoded;

S7a，对确定上下文模型之后的待编码节点占位码进行熵编码，获得二进制码流。S7a, entropy encoding is performed on the node occupancy code to be encoded after the context model is determined to obtain a binary code stream.

作为本发明一种可选的实施方式，在获得二进制码流之后，可以对待编码节点占位码对分配的上下文模型对应的概率分布进行更新。As an optional implementation manner of the present invention, after the binary code stream is obtained, the probability distribution corresponding to the allocated context model of the occupancy code of the node to be encoded may be updated.

可以理解，每次分配上下文后，上下文对应的概率分布可能发生变化，因此对上下文的概率分布进行更新，可以提高对待编码节点占位码分配上下文的准确性，从而提高编码的性能。It can be understood that the probability distribution corresponding to the context may change after each context is allocated. Therefore, updating the probability distribution of the context can improve the accuracy of allocating the context to the node occupancy code to be coded, thereby improving the coding performance.

参考图6，执行过程可以是：Referring to Figure 6, the execution process can be:

(1)首先可以开辟一部分内存保存前一帧得到的重建点云作为当前帧点云的参考帧记为pointcloudPred；(1) First, a part of the memory can be opened to save the reconstructed point cloud obtained from the previous frame as the reference frame of the point cloud of the current frame, which is recorded as pointcloudPred;

(2)对当前帧点云pointcloud进行量化得到当前帧量化点云记为pointcloudQua；(2) Quantize the current frame point cloud pointcloud to obtain the current frame quantized point cloud and record it as pointcloudQua;

(3)对当前帧量化点云pointcloudQua和参考帧点云pointcloudPred分别进行八叉树划分，得到八叉树的占据码分别记为occupancy和occupancyPred，8位占据码的每一位分别记为Oi和OPi，并分别将当前点云和参考点云的8位占据码存放在哈希表中，便于后面为节点寻找邻居信息。(3) The quantized point cloud pointcloudQua of the current frame and the point cloudPred of the reference frame are divided into octrees respectively, and the occupancy codes of the octrees are respectively recorded as occupancy and occupancyPred, and each bit of the 8-bit occupancy code is respectively recorded as Oi and OPi, and store the 8-bit occupancy codes of the current point cloud and the reference point cloud in the hash table, which is convenient for finding neighbor information for the node later.

(4)判断当前节点是否满足孤立点编码方案，如果满足孤立点编码方案则直接令occupancy为0，进入(5)编码occupancy，然后编码孤立点的相对坐标，如果不满足孤立点编码方案则直接进入(5)。(4) Determine whether the current node satisfies the outlier coding scheme. If it satisfies the outlier coding scheme, directly set the occupancy to 0, enter (5) coding the occupancy, and then encode the relative coordinates of the outliers. If the outlier coding scheme is not satisfied, directly Enter (5).

(5)如果OPi等于1，即参考帧当前块的第i个子节点被占据，那么寻找当前子节点Oi已编码共面邻居节点对应的参考帧节点(如图4所示)的占据信息，计算被占据邻居节点的和为0—3共4种情况，对应4个上下文假设为ctx_interPredict[4]，每个上下文对应一个概率分布，利用邻居节点的占位情况选择上下文对应的概率分布，基于所选择的概率分布对当前帧当前块的第i个子节点占位码Oi进行熵编码，基于Oi对所选择的上下文对应的概率分布进行更新；如果OPi等于0，即参考帧当前块的第i个子节点没被占据，那么利用原始帧内的191个上下文对当前帧当前块的第i个子节点占位码Oi进行熵编码，其中一个上下文对应一个概率分布，基于Oi对所选择的上下文对应的概率分布进行更新。(5) If OPi is equal to 1, that is, the ith child node of the current block of the reference frame is occupied, then find the occupancy information of the reference frame node (as shown in Figure 4) corresponding to the coded coplanar neighbor node of the current child node Oi, and calculate The sum of the occupied neighbor nodes is 0-3, a total of 4 cases, the corresponding 4 contexts are assumed to be ctx_interPredict[4], each context corresponds to a probability distribution, and the probability distribution corresponding to the context is selected by using the occupancy of the neighbor nodes. The selected probability distribution performs entropy encoding on the occupancy code Oi of the ith child node of the current block of the current frame, and updates the probability distribution corresponding to the selected context based on Oi; if OPi is equal to 0, that is, the ith child of the current block of the reference frame is used. If the node is not occupied, then use the 191 contexts in the original frame to entropy encode the occupancy code Oi of the i-th child node of the current block of the current frame. One of the contexts corresponds to a probability distribution, and the probability corresponding to the selected context is based on Oi. The distribution is updated.

实施例二Embodiment 2

作为本发明可选的一种实施例，所述针对第一八叉树结构的每个待编码节点，当与该待编码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况的步骤包括：As an optional embodiment of the present invention, for each to-be-coded node of the first octree structure, when the occupancy status of the reference node corresponding to the to-be-coded node is occupied, obtain and reference node The steps of the occupancy situation of the adjacent first neighbor nodes include:

实施例三Embodiment 3

作为本发明可选的一种实施例，在所述针对第一八叉树结构的每个待编码节点，当与该待编码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的步骤之前，所述点云几何信息帧间编码方法还包括：As an optional embodiment of the present invention, for each to-be-coded node of the first octree structure, when the occupancy status of the reference node corresponding to the to-be-coded node is occupied, obtain and reference Before the step of the first neighbor node adjacent to the node, the method for inter-frame coding of point cloud geometric information further includes:

步骤a：针对第一八叉树结构的每个待编码节点，判断待编码节点对应的参考节点是否被占据，如果否，则获取与该待编码节点共面、共线、共点以及与待编码节点的父节点共面中任一或者组合的第二邻居节点的占位情况；Step a: for each node to be encoded in the first octree structure, determine whether the reference node corresponding to the node to be encoded is occupied, if not, obtain coplanar, collinear, common point and the node to be encoded The occupancy situation of any one or a combination of the second neighbor nodes in the coplanar parent node of the encoding node;

步骤b：基于第二邻居节点的占位情况，确定所述待编码节点占位码的上下文模型。Step b: Determine the context model of the occupancy code of the node to be encoded based on the occupancy situation of the second neighbor node.

实施例四Embodiment 4

作为本发明可选的一种实施例，在所述基于所述第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型的步骤之前，所述点云几何信息帧间编码方法还包括：As an optional embodiment of the present invention, before the step of determining the context model of the occupancy code of the node to be encoded based on the occupancy status of the first neighbor node, the point cloud geometric information Coding methods also include:

可以理解，本实施例不再将帧内帧间分开单独使用，而是将帧内帧间的上下文结合使用，可以组合成context[intracontext][intercontext]或者context[intercontext][intracontext]，既考虑帧内信息也考虑帧间信息。在上述帧间方案中帧间有4种上下文，帧内有191种上下文，那么就有4*191＝764种上下文。It can be understood that in this embodiment, the intra-frame and inter-frame are no longer used separately, but the context of the intra-frame and inter-frame is used in combination, which can be combined into context[intracontext][intercontext] or context[intercontext][intracontext], considering both Intra-frame information also considers inter-frame information. In the above inter-frame scheme, there are 4 contexts between frames and 191 contexts within a frame, so there are 4*191=764 contexts.

实施例五Embodiment 5

步骤a：获取与待编码节点对应的参考节点共面、共线、共点的第三邻居节点的占位情况；Step a: Obtain the occupancy status of the third neighbor node that is coplanar, colinear, and co-pointed with the reference node corresponding to the node to be encoded;

步骤b：将所述第三邻居节点的占位情况组成集合；Step b: forming a set of the occupancy conditions of the third neighbor nodes;

步骤c：在所述集合中任选预设数量个元素作为目标元素；Step c: Optionally select a preset number of elements as target elements in the set;

步骤d：所述基于所述第一邻居节点的占位情况，确定所述待编码节点占位码的上下文模型步骤包括：Step d: The step of determining the context model of the occupancy code of the node to be encoded based on the occupancy situation of the first neighbor node includes:

步骤e：基于所述目标元素，确定所述待编码节点占位码的上下文模型。Step e: Based on the target element, determine the context model of the placeholder code of the node to be encoded.

如图7至图9所示，参考节点存在6个共面邻居，12个共线邻居以及8个共点邻居本实施例改变寻找邻居的方式，不只是寻找当前子节点Oi已编码共面节点对应的参考帧节点。由于帧间的所有信息在编码和解码端都能得到，因此可以寻找Oi节点对应的OPi的所有共面共线和共点一共26个邻居节点或者这26个邻居节点的子集。As shown in Fig. 7 to Fig. 9 , the reference node has 6 coplanar neighbors, 12 collinear neighbors and 8 coplanar neighbors. This embodiment changes the way of finding neighbors, not just finding the current child node Oi that has been coded coplanar nodes The corresponding reference frame node. Since all the information between frames can be obtained at the encoding and decoding ends, it is possible to find all the coplanar, collinear and common points of OPi corresponding to the Oi node, a total of 26 neighbor nodes or a subset of these 26 neighbor nodes.

实施例六Embodiment 6

作为本发明可选的一种实施例，在所述集合中任选预设数量个元素作为目标元素的步骤之后，所述点云几何信息帧间编码方法还包括：As an optional embodiment of the present invention, after the step of selecting a preset number of elements in the set as target elements, the method for inter-frame encoding of point cloud geometric information further includes:

可以理解，对实施例中的第(5)步进行修改，将节点占据个数和进行映射，不再是每一个占据个数和对应一个上下文，而是经过映射可以将几个占据个数和对应一个上下文，比如将主方案中的占据邻居个数为1和2时用同一个上下文编码，那么就可以减少一个上下文，只使用3个上下文。It can be understood that the step (5) in the embodiment is modified, and the number of nodes occupied and mapped are no longer the number of occupied nodes and a corresponding context, but the number of occupied nodes can be mapped to the sum of the corresponding context. Corresponding to a context, for example, when the number of occupied neighbors in the main scheme is 1 and 2, the same context is used, then one context can be reduced and only 3 contexts are used.

实施例七Embodiment 7

作为本发明可选的一种实施例，在所述获取与参考节点共面的第一邻居节点的占位情况的步骤之后，所述点云几何信息帧间编码方法还包括：As an optional embodiment of the present invention, after the step of acquiring the occupancy status of the first neighbor node that is coplanar with the reference node, the method for inter-frame encoding of point cloud geometric information further includes:

可以理解，占位情况为被占据或者未占据，位置信息包括第一邻居节点相对于参考节点的位置，将占位情况与位置信息进行排列组合，其上下文模型个数则扩展到几何倍数。It can be understood that the occupancy status is occupied or unoccupied, and the position information includes the position of the first neighbor node relative to the reference node. The occupancy status and the position information are arranged and combined, and the number of context models is expanded to a geometric multiple.

下面以试验数据验证本发明提供的编码方法的优点。在不改变几何PSNR(PSNR是一种图像评价的客观标准，PSNR越大则图像的质量越好。)的情况下，可以降低编码码流大小，如下表所示，重建点云在C1(几何有损，属性有损)情况下的BD-rate(BD-rate是用来衡量性能的好坏的参数，BD-rate为负时表示性能变好，在此基础上BD-rate的绝对值越大，则性能的增益越大。)性能变好，在C4(几何无损，属性无损)情况下的bpp(bpp是用于衡量无损情况下的码流大小，小于100％表示性能变好，在此基础上绝对值越小增益越大)性能变好。在AVS平台上对部分多帧点云序列的几何信息在C1(几何有损，属性有损)和C4(几何无损，属性无损)条件下进行测试以后结果如表1和表2所示：The advantages of the encoding method provided by the present invention are verified below with experimental data. Without changing the geometric PSNR (PSNR is an objective standard for image evaluation, the larger the PSNR, the better the image quality.), the encoding code stream size can be reduced, as shown in the table below, the reconstructed point cloud is in C1 (geometric BD-rate in the case of lossy, attribute lossy) (BD-rate is a parameter used to measure the quality of performance. When BD-rate is negative, it means that the performance becomes better. On this basis, the absolute value of BD-rate is higher. Larger, the greater the gain in performance.) The performance becomes better, in the case of C4 (geometric lossless, attribute lossless) bpp (bpp is used to measure the size of the code stream in the lossless case, less than 100% means the performance becomes better, in On this basis, the smaller the absolute value, the larger the gain) and the better the performance. On the AVS platform, the geometric information of some multi-frame point cloud sequences is tested under the conditions of C1 (geometric lossy, attribute lossy) and C4 (geometric lossless, attribute lossless), and the results are shown in Table 1 and Table 2:

表1本发明的编码方法与现有AVS点云编码技术在C1情况下的性能对比结果Table 1 The performance comparison result of the coding method of the present invention and the existing AVS point cloud coding technology in the case of C1

表2本发明的编码方法与现有AVS点云编码技术在C4情况下的性能对比结果Table 2 The performance comparison results of the coding method of the present invention and the existing AVS point cloud coding technology in the case of C4

序列名sequence name bppbpp ford_01ford_01 97.1％97.1% ford_02ford_02 98.0％98.0% ford_03ford_03 98.7％98.7%

实施例八Embodiment 8

参考图10，在AVS解码过程中帧间技术，从而得到本发明实施例提供的一种点云几何信息帧间解码方法。Referring to FIG. 10 , in the process of AVS decoding, the inter-frame technology is used to obtain an inter-frame decoding method for point cloud geometric information provided by an embodiment of the present invention.

如图11所示，本发明实施例提供的一种点云几何信息帧间解码方法包括：As shown in FIG. 11 , an inter-frame decoding method for point cloud geometric information provided by an embodiment of the present invention includes:

S1b，接收二进制码流；S1b, receive binary code stream;

S2b，针对所述每个待解码节点，当与该待解码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况；S2b, for each node to be decoded, when the occupancy situation of the reference node corresponding to the node to be decoded is occupied, obtain the occupancy situation of the first neighbor node adjacent to the reference node;

S3b，基于所述第一邻居节点的占位情况，确定所述待解码节点占位码的上下文模型；S3b, based on the occupancy situation of the first neighbor node, determine the context model of the occupancy code of the node to be decoded;

S4b，对确定上下文模型之后的待解码节点占位码进行熵解码。S4b, entropy decoding is performed on the node occupancy code to be decoded after the context model is determined.

实施例九Embodiment 9

作为本发明可选的一种实施例，所述针对第一八叉树结构的每个待解码节点，当与该待解码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的第一邻居节点的占位情况的步骤包括：As an optional embodiment of the present invention, for each node to be decoded in the first octree structure, when the occupancy status of the reference node corresponding to the node to be decoded is occupied, obtain the reference node The steps of the occupancy situation of the adjacent first neighbor nodes include:

实施例十Embodiment ten

作为本发明可选的一种实施例，在所述针对第一八叉树结构的每个待解码节点，当与该待解码节点对应的参考节点的占位情况为被占据时，获取与参考节点相邻的邻居节点的步骤之前，所述点云几何信息帧间解码方法还包括：As an optional embodiment of the present invention, for each node to be decoded in the first octree structure, when the occupancy status of the reference node corresponding to the node to be decoded is occupied, obtain and reference Before the step of the neighbor node adjacent to the node, the method for inter-frame decoding of point cloud geometric information further includes:

步骤a：判断待解码节点对应的参考节点是否被占据，如果否，则获取与该待解码节点共面、共线、共点以及与待解码节点的父节点共面中任一或者组合的第二邻居节点的占位情况；Step a: Determine whether the reference node corresponding to the node to be decoded is occupied, and if not, obtain any or a combination of the nodes that are coplanar, collinear, co-point with the node to be decoded, and coplanar with the parent node of the node to be decoded. Occupancy of two neighbor nodes;

步骤b：基于第二邻居节点的占位情况，确定所述待解码节点占位码的上下文模型。Step b: Determine the context model of the occupancy code of the node to be decoded based on the occupancy situation of the second neighbor node.

此外，术语“第一”、“第二”仅用于描述目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此，限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。在本发明的描述中，“多个”的含义是两个或两个以上，除非另有明确具体的限定。In addition, the terms "first" and "second" are only used for descriptive purposes, and should not be construed as indicating or implying relative importance or implying the number of indicated technical features. Thus, a feature defined as "first" or "second" may expressly or implicitly include one or more of that feature. In the description of the present invention, "plurality" means two or more, unless otherwise expressly and specifically defined.

以上内容是结合具体的优选实施方式对本发明所作的进一步详细说明，不能认定本发明的具体实施只局限于这些说明。对于本发明所属技术领域的普通技术人员来说，在不脱离本发明构思的前提下，还可以做出若干简单推演或替换，都应当视为属于本发明的保护范围。The above content is a further detailed description of the present invention in combination with specific preferred embodiments, and it cannot be considered that the specific implementation of the present invention is limited to these descriptions. For those of ordinary skill in the technical field of the present invention, without departing from the concept of the present invention, some simple deductions or substitutions can be made, which should be regarded as belonging to the protection scope of the present invention.

Claims

1. A point cloud geometric information interframe coding method is characterized by comprising the following steps:

acquiring a frame point cloud before a current frame point cloud as a reference frame point cloud of the current frame point cloud;

performing voxelization processing on the current frame point cloud to obtain the voxelized current frame point cloud;

respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, and continuously carrying out octree division on sub nodes obtained by division until leaf nodes are divided to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud;

wherein the first octree structure comprises a plurality of nodes, each node corresponding to a reference node in the second octree structure;

aiming at each node to be coded in the first octree structure, acquiring the occupation situation of a corresponding reference node in the second octree structure;

for each node to be coded of the first octree structure, when the occupation situation of a reference node corresponding to the node to be coded is occupied, acquiring the occupation situation of a first neighbor node adjacent to the reference node;

determining a context model of the bit occupying code of the node to be coded based on the bit occupying situation of the first neighbor node;

and entropy coding the placeholder code of the node to be coded after the context model is determined to obtain a binary code stream.

2. The point cloud geometric information interframe coding method according to claim 1, wherein the step of acquiring, for each node to be coded of the first octree structure, an occupancy of a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be coded is occupied comprises:

for each node to be coded of the first octree structure, when the occupancy condition of a reference node corresponding to the node to be coded is occupied, the occupancy condition of a first neighbor node coplanar with the reference node is obtained.

3. The point cloud geometric information interframe coding method according to claim 1, wherein before the step of obtaining, for each node to be coded of the first octree structure, a first neighboring node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be coded is occupied, the point cloud geometric information interframe coding method further comprises:

judging whether a reference node corresponding to a node to be coded is occupied or not aiming at each node to be coded of the first octree structure, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanarity, collinearity and coplanarity with the node to be coded and coplanarity with a father node of the node to be coded;

and determining a context model of the bit occupying code of the node to be coded based on the bit occupying situation of the second neighbor node.

4. The point cloud geometric information interframe coding method according to claim 1, wherein before the step of determining the context model of the node-to-be-coded space-occupying code based on the space-occupying condition of the first neighboring node, the point cloud geometric information interframe coding method further comprises:

acquiring the occupation situation of a second neighbor node which is coplanar, collinear or concurrent with the node to be coded and is coplanar or combined with a father node of the node to be coded;

the step of determining the context model of the node occupation code to be coded based on the occupation situation of the first neighbor node comprises:

and determining a context model of the node occupation code to be coded based on occupation conditions of the first neighbor node and the second neighbor node.

5. The point cloud geometry information interframe coding method of claim 1, wherein prior to the step of determining the context model of the node-to-be-coded placeholder code based on the placeholder of the first neighboring node, the point cloud geometry information interframe coding method further comprises:

acquiring the occupation situation of a third neighbor node which is coplanar, collinear and concurrent with the reference node corresponding to the node to be coded;

forming a set of the occupation conditions of the third neighbor nodes;

optionally selecting a first preset number of elements in the set as target elements;

the step of determining the context model of the node occupancy code to be coded based on the occupancy of the first neighboring node comprises:

and determining a context model of the node occupation code to be coded based on the target element.

6. The point cloud geometric information interframe coding method according to claim 5, wherein after the step of optionally selecting a preset number of elements in the set as target elements, the point cloud geometric information interframe coding method further comprises:

optionally selecting a second preset number of elements from the target elements, and taking the second preset number of elements as elements for determining the same context model;

the first preset quantity is greater than the second preset quantity;

the step of determining the context model of the node occupation code to be coded based on the target element comprises the following steps:

determining a context model of the node-to-be-encoded bit-occupying code based on a second number of target elements.

7. The point cloud geometry information interframe coding method of claim 2, wherein after the step of obtaining an occupancy of a first neighbor node that is coplanar with a reference node, the point cloud geometry information interframe coding method further comprises:

acquiring the position information of the first neighbor node;

the step of determining the context model of the node-occupying code to be coded based on the occupying situation of the first neighboring node comprises:

and determining a context model of the occupation code of the node to be coded based on the position information and the occupation situation of the first neighbor node.

8. A point cloud geometric information interframe decoding method is characterized by comprising the following steps:

receiving a binary code stream;

the binary code stream comprises a node occupation code to be decoded;

for each node to be decoded of the first octree structure, when the occupation situation of a reference node corresponding to the node to be decoded is occupied, acquiring the occupation situation of a first neighbor node adjacent to the reference node;

determining a context model of the node occupation code to be decoded based on the occupation situation of the first neighbor node;

and entropy decoding the placeholder of the node to be decoded after the context model is determined.

9. The point cloud geometric information interframe decoding method of claim 8, wherein the step of obtaining, for each node to be decoded of the first octree structure, an occupancy of a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be decoded is occupied comprises:

for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node coplanar with the reference node.

10. The point cloud geometric information interframe decoding method according to claim 8, wherein before the step of acquiring, for each node to be decoded of the first octree structure, a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be decoded is occupied, the point cloud geometric information interframe decoding method further comprises:

judging whether a reference node corresponding to a node to be decoded is occupied, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanarity, collinearity and coplanarity with the node to be decoded and coplanarity with a father node of the node to be decoded;

and determining a context model of the occupation code of the node to be decoded based on the occupation situation of the second neighbor node.