CN112565764B - Point cloud geometric information interframe coding and decoding method - Google Patents
Point cloud geometric information interframe coding and decoding method
- Publication number
- CN112565764B CN202011396370.3A
- Authority
- CN
- China
- Prior art keywords
- node
- point cloud
- coded
- occupation
- context model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/13—Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/96—Tree coding, e.g. quad-tree coding
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
An embodiment of the invention provides a point cloud geometric information interframe coding and decoding method. The method comprises: acquiring a point cloud frame preceding the current frame point cloud as the reference frame point cloud; performing octree division on the current frame point cloud and the reference frame point cloud respectively and iteratively dividing the resulting nodes down to the leaf nodes, to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud; acquiring the occupancy of each reference node and, when the reference node corresponding to a node to be coded is occupied, acquiring the occupancy of a first neighbor node of that reference node; and determining a context model for the occupancy code of the node to be coded based on that occupancy, then entropy coding according to the context model to obtain a binary code stream. When the inter-frame prediction indicates occupancy, entropy coding uses an inter-frame context model; otherwise it uses an intra-frame context model. No motion compensation is needed, which reduces the complexity of interframe coding and decoding and improves the performance of geometric entropy coding.
Description
Technical Field
The invention belongs to the technical field of coding and decoding, and particularly relates to a point cloud geometric information interframe coding and decoding method.
Background
With the continuing development of point cloud technology, the compression and coding of point cloud data has become an important research problem. At present, both the Audio Video coding Standard (AVS) workgroup of China and the Moving Picture Experts Group (MPEG) of the International Organization for Standardization are developing point cloud coding standards. As shown in fig. 1, point cloud coding in AVS first applies a coordinate transformation to the geometric information so that the whole point cloud is contained in a bounding box. The geometry is then quantized; quantization mainly acts as scaling. Because rounding during quantization makes the geometric information of some points identical, a parameter decides whether the duplicate points are removed; quantization together with duplicate removal is also called voxelization. The bounding box is called the root node. Octree division of the root node splits the bounding box into 8 equal subcubes, each called a child node of the root node. Whether each of the eight child nodes is occupied, that is, whether it contains points of the point cloud, is represented by 1 bit: 0 means unoccupied and 1 means occupied. These bits are called the occupancy code, and entropy coding them generates the binary code stream. Octree division continues on the non-empty subcubes (those containing points of the point cloud) and stops when the resulting leaf nodes are 1x1x1 unit cubes; the points in the leaf nodes are then encoded to generate the binary code stream. The whole process uses a breadth-first traversal order. After geometric coding is finished, the geometric information is reconstructed.
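The octree division described above can be sketched as follows. This is a minimal illustration only, not the AVS reference implementation: the function name `occupancy_codes`, the bit order of the child index, and the assumption that the input is already voxelized to integer coordinates in [0, 2^depth) are all hypothetical choices made for the example.

```python
from collections import deque

def occupancy_codes(points, depth):
    """Return the 8-bit occupancy codes of an octree in breadth-first order.

    points: iterable of integer (x, y, z) tuples, each in [0, 2**depth).
    depth:  number of division levels; leaves are 1x1x1 unit cubes.
    """
    codes = []
    # Each queue entry is (origin of the cube, remaining levels, points inside).
    queue = deque([((0, 0, 0), depth, list(points))])
    while queue:
        (ox, oy, oz), level, pts = queue.popleft()
        if level == 0:  # unit cube reached: nothing left to divide
            continue
        half = 1 << (level - 1)
        children = [[] for _ in range(8)]
        for x, y, z in pts:
            # Child index uses one bit per axis (this bit order is an assumption).
            i = ((x - ox >= half) << 2) | ((y - oy >= half) << 1) | (z - oz >= half)
            children[i].append((x, y, z))
        code = 0
        for i, child in enumerate(children):
            if child:  # occupied child: set its bit and divide it later
                code |= 1 << i
                origin = (ox + half * (i >> 2 & 1),
                          oy + half * (i >> 1 & 1),
                          oz + half * (i & 1))
                queue.append((origin, level - 1, child))
        codes.append(code)
    return codes
```

For two points at opposite corners of a 4x4x4 bounding box, `occupancy_codes([(0, 0, 0), (3, 3, 3)], 2)` yields one code for the root followed by one code for each of the two occupied children.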
The encoding of attribute information is mainly directed at color information. First, the color information is converted from the RGB color space to the YUV color space. The geometrically reconstructed point cloud is then recolored from the original point cloud so that the not-yet-coded attribute information corresponds to the reconstructed geometric information. In color coding, the points are first sorted by Morton code; the nearest neighbors of the point to be predicted are then found using the geometric spatial relationship, and the reconstructed attribute values of those neighbors are interpolated to obtain a predicted attribute value. The difference between the real attribute value and the predicted attribute value gives the prediction residual, which is finally quantized and encoded to generate the binary code stream. The decoding process, shown in fig. 2, is the inverse of the encoding process.
In the current AVS point cloud coding platform there is no inter-frame technique, only intra-frame techniques, so redundant information can be removed only by exploiting spatial correlation and not by exploiting temporal correlation.
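The Morton ordering mentioned above can be illustrated with a short sketch. It covers only the sorting step, not the neighbor search or the interpolation; the function name `morton3d`, the 10-bit coordinate width, and the x-y-z interleaving order are assumptions for the example and are not taken from the AVS platform.

```python
def morton3d(x, y, z, bits=10):
    """Interleave the bits of x, y, z into a single Morton code.

    Bit b of x lands at position 3*b + 2, of y at 3*b + 1, of z at 3*b
    (this axis order is an assumption for illustration).
    """
    code = 0
    for b in range(bits):
        code |= ((x >> b) & 1) << (3 * b + 2)
        code |= ((y >> b) & 1) << (3 * b + 1)
        code |= ((z >> b) & 1) << (3 * b)
    return code

def sort_by_morton(points):
    """Return the points sorted by Morton code, so spatially close points tend to be adjacent."""
    return sorted(points, key=lambda p: morton3d(*p))
```

Sorting by this key places points that share an octree cell next to each other, which is why a nearest-neighbor search for attribute prediction can be restricted to a window around the current point in the sorted order.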
Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a point cloud geometric information interframe coding and decoding method. The technical problem to be solved by the invention is realized by the following technical scheme:
In a first aspect, the invention provides a point cloud geometric information interframe coding method, which includes:
acquiring a point cloud frame preceding the current frame point cloud as the reference frame point cloud of the current frame point cloud;
performing voxelization on the current frame point cloud to obtain the voxelized current frame point cloud;
performing octree division on the current frame point cloud and the reference frame point cloud respectively, and continuing octree division on the resulting child nodes until leaf nodes are reached, to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud;
wherein the first octree structure comprises a plurality of nodes, each node corresponding to a reference node in the second octree structure;
for each node to be coded in the first octree structure, acquiring the occupancy of the corresponding reference node in the second octree structure;
for each node to be coded in the first octree structure, when the reference node corresponding to the node to be coded is occupied, acquiring the occupancy of a first neighbor node adjacent to the reference node;
determining a context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node;
and entropy coding the occupancy code of the node to be coded with the determined context model to obtain a binary code stream.
Optionally, the step of acquiring, for each node to be coded in the first octree structure, the occupancy of the first neighbor node adjacent to the reference node when the reference node corresponding to the node to be coded is occupied includes:
for each node to be coded in the first octree structure, when the reference node corresponding to the node to be coded is occupied, acquiring the occupancy of a first neighbor node coplanar with the reference node.
Optionally, before the step of acquiring, for each node to be coded in the first octree structure, the occupancy of the first neighbor node adjacent to the reference node when the reference node corresponding to the node to be coded is occupied, the point cloud geometric information interframe coding method further includes:
for each node to be coded in the first octree structure, judging whether the reference node corresponding to the node to be coded is occupied and, if not, acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of: a node coplanar, collinear, or concurrent (sharing a vertex) with the node to be coded, and a node coplanar with the parent node of the node to be coded;
and determining a context model for the occupancy code of the node to be coded based on the occupancy of the second neighbor node.
Optionally, before the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node, the point cloud geometric information interframe coding method further includes:
acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of: a node coplanar, collinear, or concurrent with the node to be coded, and a node coplanar with the parent node of the node to be coded;
the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining a context model for the occupancy code of the node to be coded based on the occupancy of both the first neighbor node and the second neighbor node.
Optionally, before the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node, the point cloud geometric information interframe coding method further includes:
acquiring the occupancy of third neighbor nodes that are coplanar, collinear and concurrent with the reference node corresponding to the node to be coded;
forming a set from the occupancy of the third neighbor nodes;
arbitrarily selecting a first preset number of elements of the set as target elements;
the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining a context model for the occupancy code of the node to be coded based on the target elements.
Optionally, after the step of arbitrarily selecting a first preset number of elements of the set as target elements, the point cloud geometric information interframe coding method further includes:
arbitrarily selecting a second preset number of elements from the target elements, and taking the second preset number of elements as elements that determine the same context model;
wherein the first preset number is greater than the second preset number;
the step of determining the context model for the occupancy code of the node to be coded based on the target elements comprises:
determining a context model for the occupancy code of the node to be coded based on the second preset number of target elements.
Optionally, after the step of acquiring the occupancy of the first neighbor node coplanar with the reference node, the point cloud geometric information interframe coding method further includes:
acquiring the position information of the first neighbor node;
the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining a context model for the occupancy code of the node to be coded based on the position information and the occupancy of the first neighbor node.
In a second aspect, the invention provides a point cloud geometric information interframe decoding method, which includes:
receiving a binary code stream;
wherein the binary code stream comprises the occupancy codes of nodes to be decoded;
for each node to be decoded, when the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node adjacent to the reference node;
determining a context model for the occupancy code of the node to be decoded based on the occupancy of the first neighbor node;
and entropy decoding the occupancy code of the node to be decoded with the determined context model.
Optionally, for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, the step of obtaining the occupancy of the first neighbor node adjacent to the reference node includes:
for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, the occupancy of a first neighbor node coplanar with the reference node is acquired.
Optionally, before the step of acquiring, for each node to be decoded of the first octree structure, a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be decoded is occupied, the point cloud geometric information interframe decoding method further includes:
judging whether the reference node corresponding to the node to be decoded is occupied and, if not, acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of: a node coplanar, collinear, or concurrent with the node to be decoded, and a node coplanar with the parent node of the node to be decoded;
and determining a context model for the occupancy code of the node to be decoded based on the occupancy of the second neighbor node.
An embodiment of the invention provides a point cloud geometric information interframe coding method. The method comprises: acquiring a point cloud frame preceding the current frame point cloud as the reference frame point cloud; performing octree division on the current frame point cloud and the reference frame point cloud respectively and iteratively dividing the resulting nodes down to the leaf nodes, to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud; acquiring the occupancy of each reference node; when the reference node corresponding to a node to be coded is occupied, acquiring the occupancy of a first neighbor node of that reference node; and determining a context model for the occupancy code of the node to be coded based on that occupancy, then entropy coding according to the context model to obtain a binary code stream. When the inter-frame prediction indicates occupancy, entropy coding uses an inter-frame context model; otherwise it uses an intra-frame context model. No motion compensation is needed, which reduces the complexity of interframe coding and decoding and improves the performance of geometric entropy coding.
The present invention will be described in further detail with reference to the accompanying drawings and examples.
Drawings
FIG. 1 is a flowchart of the AVS encoding process provided by an embodiment of the present invention;
FIG. 2 is a schematic diagram of the AVS decoding process provided by an embodiment of the present invention;
FIG. 3 is a flowchart of the interframe technique applied to AVS encoding according to an embodiment of the present invention;
FIG. 4 is a flowchart of the interframe technique applied to AVS decoding according to an embodiment of the present invention;
FIG. 5 is a flowchart of a point cloud geometric information interframe coding method according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of the neighbor nodes of the corresponding node in the reference frame;
FIG. 7 is a flowchart of AVS inter-frame prediction according to an embodiment of the present invention;
FIG. 8 is a schematic diagram of the 6 coplanar neighbor nodes of a reference node according to an embodiment of the present invention;
FIG. 9 is a schematic diagram of the 12 collinear neighbor nodes of a reference node according to an embodiment of the present invention;
FIG. 10 is a schematic diagram of the 8 concurrent neighbor nodes of a reference node according to an embodiment of the present invention;
FIG. 11 is a flowchart of a point cloud geometric information interframe decoding method according to an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to specific examples, but the embodiments of the present invention are not limited thereto.
Example one
Referring to fig. 3, which shows the interframe technique within the AVS encoding process, the point cloud geometric information interframe coding method provided by the embodiment of the present invention performs inter-frame prediction on the current frame point cloud before entropy coding, and then performs entropy coding.
As shown in fig. 5, a point cloud geometric information interframe coding method provided in the embodiment of the present invention includes:
S1a, acquiring a point cloud frame preceding the current frame point cloud as the reference frame point cloud of the current frame point cloud;
The point cloud frame preceding the current frame point cloud may be the immediately preceding frame or any one of the preceding N frames, which is not limited herein.
S2a, performing voxelization on the current frame point cloud to obtain the voxelized current frame point cloud;
S3a, performing octree division on the current frame point cloud and the reference frame point cloud respectively, and continuing octree division on the resulting child nodes until leaf nodes are reached, to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud;
wherein the first octree structure comprises a plurality of nodes, each node corresponding to a reference node in the second octree structure;
S4a, for each node to be coded in the first octree structure, acquiring the occupancy of the corresponding reference node in the second octree structure;
S5a, for each node to be coded in the first octree structure, when the reference node corresponding to the node to be coded is occupied, acquiring the occupancy of a first neighbor node adjacent to the reference node;
Referring to fig. 6, the left side of the figure is the reference frame and the right side is the current frame. The node to be coded in the current frame is the gray block, the reference node corresponding to it in the reference frame is also a gray block, and the blank blocks are the first neighbor nodes.
S6a, determining a context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node;
S7a, entropy coding the occupancy code of the node to be coded with the determined context model to obtain a binary code stream.
As an optional implementation of the present invention, after the binary code stream is obtained, the probability distribution corresponding to the context model assigned to the occupancy code of the node to be coded may be updated.
It can be understood that each time a context is assigned, the probability distribution corresponding to that context may change. Updating the probability distribution therefore improves the accuracy of the context assigned to the occupancy code of the node to be coded, and thus the coding performance.
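The idea of a per-context probability distribution that is updated after each coded bit can be sketched as follows. This is a count-based model chosen purely for illustration; the class name `AdaptiveBitModel` and its smoothing are hypothetical, and the actual arithmetic coder used on the AVS platform is not described in this document.

```python
import math

class AdaptiveBitModel:
    """Adaptive probability estimate for the bits coded under one context (illustrative)."""

    def __init__(self):
        # Start from one pseudo-count per symbol so neither probability is zero.
        self.counts = [1, 1]

    def prob(self, bit):
        """Current estimated probability of `bit` (0 or 1)."""
        return self.counts[bit] / (self.counts[0] + self.counts[1])

    def cost_bits(self, bit):
        """Ideal code length, in bits, of coding `bit` with the current estimate."""
        return -math.log2(self.prob(bit))

    def update(self, bit):
        """Adapt the distribution after `bit` has been coded."""
        self.counts[bit] += 1
```

After a context has seen mostly occupied bits, coding another occupied bit under that context costs less than one bit, which is the gain a well-chosen context is meant to deliver.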
Referring to fig. 7, the execution process may be:
(1) First, a block of memory is allocated to store the reconstructed point cloud of the previous frame, which serves as the reference frame of the current frame point cloud and is denoted pointcloudPred;
(2) The current frame point cloud is quantized to obtain the quantized current frame point cloud, denoted pointcloudQua;
(3) Octree division is performed on the quantized current frame point cloud pointcloudQua and on the reference frame point cloud pointcloudPred to obtain their octree occupancy codes, denoted occupancy and occupancyPred respectively. The individual bits of the two 8-bit occupancy codes are denoted Oi and OPi respectively. The 8-bit occupancy codes of the current point cloud and of the reference point cloud are each stored in a hash table so that neighbor information of a node can be looked up later.
(4) Judge whether the current node satisfies the isolated point coding scheme. If so, set occupancy to 0, go to step (5) to code it, and then code the relative coordinates of the isolated point; if not, go directly to step (5).
(5) If OPi equals 1, that is, the i-th child node of the current block in the reference frame is occupied, look up the occupancy of the reference frame nodes (shown in fig. 6) corresponding to the already-coded coplanar neighbor nodes of the current child node Oi. The number of occupied neighbor nodes takes one of 4 values, 0 to 3. Assume the 4 corresponding contexts are ctx_interPredict[4], each corresponding to one probability distribution. The occupancy of the neighbor nodes selects the context and thus its probability distribution; the occupancy bit Oi of the i-th child node of the current block in the current frame is entropy coded with the selected distribution, and the distribution of the selected context is then updated based on Oi. If OPi equals 0, that is, the i-th child node of the current block in the reference frame is not occupied, Oi is entropy coded using the 191 original intra-frame contexts, each corresponding to one probability distribution, and the distribution of the selected context is updated based on Oi.
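The context selection in step (5) can be sketched as below. This is a minimal illustration under stated assumptions: the function name `select_context` and its return convention are hypothetical, the three reference-frame neighbor bits are assumed to have been looked up already, and the intra-frame context index is treated as an input because the rule that derives the 191 intra-frame contexts is not given in this document.

```python
N_INTER_CONTEXTS = 4    # occupied-neighbor count 0..3, as stated in step (5)
N_INTRA_CONTEXTS = 191  # number of intra-frame contexts stated in step (5)

def select_context(op_i, ref_neighbor_bits, intra_ctx):
    """Choose the context used to entropy code occupancy bit O_i.

    op_i:              occupancy bit OP_i of the corresponding reference child node.
    ref_neighbor_bits: occupancy bits of the reference-frame nodes corresponding
                       to the coplanar neighbors (three bits assumed here).
    intra_ctx:         intra-frame context index in [0, 191), computed elsewhere.
    Returns ("inter", k) or ("intra", k).
    """
    if op_i == 1:
        # Reference child occupied: the number of occupied neighbors picks
        # one of the four inter-frame contexts.
        count = sum(ref_neighbor_bits)
        assert 0 <= count < N_INTER_CONTEXTS
        return ("inter", count)
    # Reference child empty: fall back to the existing intra-frame context.
    assert 0 <= intra_ctx < N_INTRA_CONTEXTS
    return ("intra", intra_ctx)
```

Because the decoder holds the same reference frame and the same already-decoded neighbors, it can evaluate the same function and arrive at the same context without any side information.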
Example two
As an optional embodiment of the present invention, the step of acquiring, for each node to be coded in the first octree structure, the occupancy of the first neighbor node adjacent to the reference node when the reference node corresponding to the node to be coded is occupied includes:
for each node to be coded in the first octree structure, when the reference node corresponding to the node to be coded is occupied, acquiring the occupancy of a first neighbor node coplanar with the reference node.
Example three
As an optional embodiment of the present invention, before the step of acquiring, for each node to be coded in the first octree structure, the occupancy of the first neighbor node adjacent to the reference node when the reference node corresponding to the node to be coded is occupied, the point cloud geometric information interframe coding method further includes:
step a: for each node to be coded in the first octree structure, judging whether the reference node corresponding to the node to be coded is occupied and, if not, acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of: a node coplanar, collinear, or concurrent (sharing a vertex) with the node to be coded, and a node coplanar with the parent node of the node to be coded;
step b: determining a context model for the occupancy code of the node to be coded based on the occupancy of the second neighbor node.
Example four
As an optional embodiment of the present invention, before the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node, the point cloud geometric information interframe coding method further includes:
acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of: a node coplanar, collinear, or concurrent with the node to be coded, and a node coplanar with the parent node of the node to be coded;
the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining a context model for the occupancy code of the node to be coded based on the occupancy of both the first neighbor node and the second neighbor node.
It can be understood that in this embodiment the intra-frame and inter-frame contexts are not used separately but jointly, so that both intra-frame information and inter-frame information are taken into account. In the interframe scheme above there are 4 inter-frame contexts and 191 intra-frame contexts, so the combination gives 4 × 191 = 764 contexts.
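One way to realize the combination is to index a single table by the pair of inter-frame and intra-frame contexts. The sketch below is an assumption about how such a joint index could be formed; the function name `joint_context` and the row-major layout are illustrative and not taken from the source.

```python
N_INTER = 4    # inter-frame contexts in the scheme above
N_INTRA = 191  # intra-frame contexts in the scheme above

def joint_context(inter_ctx, intra_ctx):
    """Map an (inter, intra) context pair to a unique index in [0, 764)."""
    if not (0 <= inter_ctx < N_INTER and 0 <= intra_ctx < N_INTRA):
        raise ValueError("context index out of range")
    return inter_ctx * N_INTRA + intra_ctx
```

Every pair maps to a distinct index, so the table holds exactly 4 × 191 = 764 probability distributions.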
Example five
As an optional embodiment of the present invention, before the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node, the point cloud geometric information interframe coding method further includes:
step a: acquiring the occupancy of third neighbor nodes that are coplanar, collinear and concurrent with the reference node corresponding to the node to be coded;
step b: forming a set from the occupancy of the third neighbor nodes;
step c: arbitrarily selecting a first preset number of elements of the set as target elements;
step d: the step of determining the context model for the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
step e: determining a context model for the occupancy code of the node to be coded based on the target elements.
As shown in fig. 8 to 10, a reference node has 6 coplanar neighbors, 12 collinear neighbors and 8 concurrent neighbors. This embodiment changes the way neighbors are found: it no longer looks only at the reference frame nodes corresponding to the already-coded coplanar neighbors of the current child node Oi. Since all inter-frame information is available at both the encoding end and the decoding end, all coplanar, collinear and concurrent neighbors of the OPi node corresponding to the Oi node can be searched, that is, all 26 neighbor nodes or a subset of them.
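The 26 neighbors and their split into 6 coplanar, 12 collinear and 8 concurrent nodes follow directly from how many coordinates of the offset are non-zero. The sketch below enumerates them; the function name `neighbor_offsets` and the dictionary layout are illustrative choices.

```python
from itertools import product

def neighbor_offsets():
    """Group the 26 unit offsets around a node by the kind of contact.

    An offset with one non-zero coordinate shares a face (coplanar),
    two non-zero coordinates share an edge (collinear),
    three non-zero coordinates share only a vertex (concurrent).
    """
    names = {1: "coplanar", 2: "collinear", 3: "concurrent"}
    groups = {name: [] for name in names.values()}
    for offset in product((-1, 0, 1), repeat=3):
        nonzero = sum(1 for c in offset if c != 0)
        if nonzero:  # skip (0, 0, 0), which is the node itself
            groups[names[nonzero]].append(offset)
    return groups
```

Adding an offset to the position of the reference node gives the position of a neighbor whose occupancy can then be looked up, for example in the hash table of reference-frame occupancy codes.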
Example six
As an optional embodiment of the present invention, after the step of arbitrarily selecting a first preset number of elements of the set as target elements, the point cloud geometric information interframe coding method further includes:
arbitrarily selecting a second preset number of elements from the target elements, and taking the second preset number of elements as elements that determine the same context model;
wherein the first preset number is greater than the second preset number;
the step of determining the context model for the occupancy code of the node to be coded based on the target elements comprises:
determining a context model for the occupancy code of the node to be coded based on the second preset number of target elements.
It can be understood that this embodiment modifies the mapping in step (5) of Example one between the number of occupied neighbor nodes and the contexts. Instead of mapping each occupied count to its own context, several counts may be mapped to the same context. For example, if occupied-neighbor counts of 1 and 2 in the main scheme are coded with the same context, one context is saved and only 3 contexts are used.
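The merged mapping in this example can be written as a small lookup table. The names `COUNT_TO_CONTEXT` and `merged_context` are hypothetical; the table itself encodes only the example given in the text, in which counts 1 and 2 share a context.

```python
# Index: number of occupied neighbors (0..3). Value: context index.
# Counts 1 and 2 share context 1, so only 3 contexts remain.
COUNT_TO_CONTEXT = (0, 1, 1, 2)

def merged_context(occupied_count):
    """Return the context index for a given number of occupied neighbors."""
    return COUNT_TO_CONTEXT[occupied_count]
```

Merging contexts trades a little modelling precision for fewer probability distributions to maintain, and each remaining distribution adapts on more samples.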
EXAMPLE seven
As an optional embodiment of the present invention, after the step of acquiring the occupancy of the first neighbor node coplanar with the reference node, the point cloud geometric information interframe coding method further includes:
acquiring the position information of the first neighbor node;
the step of determining the context model of the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining the context model of the occupancy code of the node to be coded based on the position information and the occupancy of the first neighbor node.
It is understood that the occupancy is either occupied or unoccupied, and that the position information includes the position of the first neighbor node relative to the reference node. Combining the occupancy with the position information multiplies the number of context models.
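The growth in the number of contexts can be seen by comparing a count-based index with a position-aware one. The sketch below assumes the six coplanar neighbors are visited in a fixed order and that the position-aware index is a simple bitmask; both the ordering and the bitmask encoding are illustrative assumptions, not the mapping defined by the patent.

```python
def count_context(occupancy):
    """Context from the number of occupied neighbors only (7 values: 0..6)."""
    return sum(occupancy)

def positional_context(occupancy):
    """Context from which neighbors are occupied (64 values: 0..63).

    `occupancy` lists six 0/1 flags in a fixed, assumed neighbor order;
    bit i of the result is set when neighbor i is occupied.
    """
    index = 0
    for i, occupied in enumerate(occupancy):
        if occupied:
            index |= 1 << i
    return index

left_only = [1, 0, 0, 0, 0, 0]
back_only = [0, 0, 0, 0, 0, 1]
print(count_context(left_only), count_context(back_only))
print(positional_context(left_only), positional_context(back_only))
```

Two configurations with the same occupied count fall into the same count-based context but into different position-aware contexts.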
The advantages of the coding method provided by the present invention are verified by experimental data. With the geometric PSNR unchanged (PSNR is an objective image quality metric; a larger PSNR indicates better quality), the size of the coded code stream is reduced. As the following tables show, the method improves the BD-rate of the reconstructed point cloud under the C1 condition (lossy geometry, lossy attributes) and the bpp under the C4 condition (lossless geometry, lossless attributes). BD-rate measures coding performance: a negative BD-rate indicates an improvement, and the larger its absolute value, the larger the gain. bpp measures the size of the code stream under the lossless condition and is reported as a percentage: a value below 100% indicates an improvement, and the smaller the value, the larger the gain. The results of testing the geometric information of some multi-frame point cloud sequences on the AVS platform under the C1 and C4 conditions are shown in Tables 1 and 2:
TABLE 1 Performance comparison of the encoding method of the present invention with the existing AVS point cloud encoding technique under the C1 condition
TABLE 2 Performance comparison of the encoding method of the present invention with the existing AVS point cloud encoding technique under the C4 condition
Sequence name | bpp |
---|---|
ford_01 | 97.1% |
ford_02 | 98.0% |
ford_03 | 98.7% |
Example eight
Referring to fig. 10, the interframe technique is applied in the AVS decoding process, which yields the point cloud geometric information interframe decoding method of this embodiment of the present invention.
As shown in fig. 11, a point cloud geometric information interframe decoding method provided in the embodiment of the present invention includes:
S1b, receiving a binary code stream;
the binary code stream comprises the occupancy code of a node to be decoded;
S2b, for each node to be decoded, when the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node adjacent to the reference node;
S3b, determining a context model of the occupancy code of the node to be decoded based on the occupancy of the first neighbor node;
S4b, entropy decoding the occupancy code of the node to be decoded after the context model is determined.
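Why the choice of context in S3b matters for the entropy decoding in S4b can be sketched with a toy adaptive model. The sketch assumes a Laplace-smoothed bit counter per context and measures the ideal code length in bits; this estimator is a stand-in chosen for illustration, not the probability model of the AVS entropy coder, and the names `AdaptiveBitModel` and `ideal_cost_bits` are hypothetical.

```python
import math
from collections import defaultdict

class AdaptiveBitModel:
    """Laplace-smoothed estimate of P(bit = 1) for one context."""

    def __init__(self):
        self.counts = [1, 1]  # pseudo-counts for bit 0 and bit 1

    def p_one(self):
        return self.counts[1] / (self.counts[0] + self.counts[1])

    def update(self, bit):
        self.counts[bit] += 1

def ideal_cost_bits(bits, contexts):
    """Sum of -log2(p) over all bits, each coded with its context's model.

    Encoder and decoder run the same updates in the same order, so they
    always hold identical probabilities for the next bit.
    """
    models = defaultdict(AdaptiveBitModel)
    total = 0.0
    for bit, ctx in zip(bits, contexts):
        model = models[ctx]
        p = model.p_one() if bit else 1.0 - model.p_one()
        total += -math.log2(p)
        model.update(bit)
    return total

bits = [1, 1, 1, 1, 0, 0, 0, 0]
informative = [1, 1, 1, 1, 0, 0, 0, 0]  # context predicts the bit well
single = [0] * 8                        # one shared context for everything
print(ideal_cost_bits(bits, informative), ideal_cost_bits(bits, single))
```

In this toy input the informative contexts cost about 4.64 bits against about 9.30 bits for a single shared context, which is the effect the reference-frame neighbors are meant to provide.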
Example nine
As an optional embodiment of the present invention, the step of acquiring, for each node to be decoded of the first octree structure, the occupancy of the first neighbor node adjacent to the reference node when the reference node corresponding to the node to be decoded is occupied includes:
for each node to be decoded of the first octree structure, when the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node coplanar with the reference node.
Example ten
As an optional embodiment of the present invention, before the step of acquiring, for each node to be decoded of the first octree structure, the occupancy of the first neighbor node adjacent to the reference node when the reference node corresponding to the node to be decoded is occupied, the point cloud geometric information interframe decoding method further includes:
step a: judging whether the reference node corresponding to the node to be decoded is occupied; if not, acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of nodes coplanar, collinear or concurrent with the node to be decoded and nodes coplanar with the parent node of the node to be decoded;
step b: determining a context model of the occupancy code of the node to be decoded based on the occupancy of the second neighbor node.
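The two branches (reference node occupied or not) can be sketched as a single selection function. The sketch assumes occupied nodes at one octree depth are stored as sets of integer coordinates, that the reference node sits at the same coordinates in the reference frame, and that only coplanar neighbors are counted in both branches; these are simplifying assumptions, and `select_context` is a hypothetical name.

```python
from typing import Set, Tuple

Node = Tuple[int, int, int]

# Offsets of the six face-sharing (coplanar) neighbors.
FACE_OFFSETS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]

def occupied_face_neighbors(node: Node, occupied: Set[Node]) -> int:
    """Count the coplanar neighbors of `node` that appear in `occupied`."""
    x, y, z = node
    return sum((x + dx, y + dy, z + dz) in occupied for dx, dy, dz in FACE_OFFSETS)

def select_context(node: Node, ref_occupied: Set[Node], cur_decoded: Set[Node]):
    """Return (branch, context index) for the node to be decoded.

    Reference node occupied -> use first neighbors in the reference frame.
    Otherwise               -> fall back to second neighbors taken from the
                               already-decoded part of the current frame.
    """
    if node in ref_occupied:
        return ("inter", occupied_face_neighbors(node, ref_occupied))
    return ("intra", occupied_face_neighbors(node, cur_decoded))

ref = {(1, 1, 1), (2, 1, 1)}
cur = {(4, 4, 4), (5, 4, 4), (4, 5, 4)}
print(select_context((1, 1, 1), ref, cur))
print(select_context((5, 5, 4), ref, cur))
```

Both inputs are available to the decoder before the node is decoded, so no side information is needed to signal which branch was taken.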
Furthermore, the terms "first", "second" and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.
The foregoing is a further detailed description of the invention in connection with specific preferred embodiments and it is not intended to limit the invention to the specific embodiments described. For those skilled in the art to which the invention pertains, numerous simple deductions or substitutions may be made without departing from the spirit of the invention, which shall be deemed to belong to the scope of the invention.
Claims (10)
1. A point cloud geometric information interframe coding method, characterized by comprising the following steps:
acquiring the frame of point cloud preceding a current frame point cloud as a reference frame point cloud of the current frame point cloud;
performing voxelization processing on the current frame point cloud to obtain a voxelized current frame point cloud;
performing octree division on the current frame point cloud and the reference frame point cloud respectively, and continuing to perform octree division on the child nodes obtained by division until leaf nodes are reached, to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud;
wherein the first octree structure comprises a plurality of nodes, each node corresponding to a reference node in the second octree structure;
for each node to be coded in the first octree structure, acquiring the occupancy of the corresponding reference node in the second octree structure;
for each node to be coded of the first octree structure, when the reference node corresponding to the node to be coded is occupied, acquiring the occupancy of a first neighbor node adjacent to the reference node;
determining a context model of the occupancy code of the node to be coded based on the occupancy of the first neighbor node;
and entropy coding the occupancy code of the node to be coded after the context model is determined, to obtain a binary code stream.
2. The point cloud geometric information interframe coding method according to claim 1, wherein the step of acquiring, for each node to be coded of the first octree structure, the occupancy of a first neighbor node adjacent to the reference node when the reference node corresponding to the node to be coded is occupied comprises:
for each node to be coded of the first octree structure, when the reference node corresponding to the node to be coded is occupied, acquiring the occupancy of a first neighbor node coplanar with the reference node.
3. The point cloud geometric information interframe coding method according to claim 1, wherein before the step of acquiring, for each node to be coded of the first octree structure, the occupancy of a first neighbor node adjacent to the reference node when the reference node corresponding to the node to be coded is occupied, the point cloud geometric information interframe coding method further comprises:
judging, for each node to be coded of the first octree structure, whether the reference node corresponding to the node to be coded is occupied; if not, acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of nodes coplanar, collinear or concurrent with the node to be coded and nodes coplanar with the parent node of the node to be coded;
and determining a context model of the occupancy code of the node to be coded based on the occupancy of the second neighbor node.
4. The point cloud geometric information interframe coding method according to claim 1, wherein before the step of determining the context model of the occupancy code of the node to be coded based on the occupancy of the first neighbor node, the point cloud geometric information interframe coding method further comprises:
acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of nodes coplanar, collinear or concurrent with the node to be coded and nodes coplanar with the parent node of the node to be coded;
the step of determining the context model of the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining the context model of the occupancy code of the node to be coded based on the occupancies of the first neighbor node and the second neighbor node.
5. The point cloud geometric information interframe coding method according to claim 1, wherein before the step of determining the context model of the occupancy code of the node to be coded based on the occupancy of the first neighbor node, the point cloud geometric information interframe coding method further comprises:
acquiring the occupancy of third neighbor nodes that are coplanar, collinear and concurrent with the reference node corresponding to the node to be coded;
forming the occupancies of the third neighbor nodes into a set;
selecting any first preset number of elements in the set as target elements;
the step of determining the context model of the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining the context model of the occupancy code of the node to be coded based on the target elements.
6. The point cloud geometric information interframe coding method according to claim 5, wherein after the step of selecting any first preset number of elements in the set as target elements, the point cloud geometric information interframe coding method further comprises:
selecting any second preset number of elements from the target elements, and taking the second preset number of elements as elements that determine the same context model;
wherein the first preset number is greater than the second preset number;
the step of determining the context model of the occupancy code of the node to be coded based on the target elements comprises:
determining the context model of the occupancy code of the node to be coded based on the second preset number of target elements.
7. The point cloud geometric information interframe coding method according to claim 2, wherein after the step of acquiring the occupancy of a first neighbor node coplanar with the reference node, the point cloud geometric information interframe coding method further comprises:
acquiring the position information of the first neighbor node;
the step of determining the context model of the occupancy code of the node to be coded based on the occupancy of the first neighbor node comprises:
determining the context model of the occupancy code of the node to be coded based on the position information and the occupancy of the first neighbor node.
8. A point cloud geometric information interframe decoding method, characterized by comprising the following steps:
receiving a binary code stream;
the binary code stream comprises the occupancy code of a node to be decoded;
for each node to be decoded of the first octree structure, when the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node adjacent to the reference node;
determining a context model of the occupancy code of the node to be decoded based on the occupancy of the first neighbor node;
and entropy decoding the occupancy code of the node to be decoded after the context model is determined.
9. The point cloud geometric information interframe decoding method according to claim 8, wherein the step of acquiring, for each node to be decoded of the first octree structure, the occupancy of a first neighbor node adjacent to the reference node when the reference node corresponding to the node to be decoded is occupied comprises:
for each node to be decoded of the first octree structure, when the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node coplanar with the reference node.
10. The point cloud geometric information interframe decoding method according to claim 8, wherein before the step of acquiring, for each node to be decoded of the first octree structure, the occupancy of a first neighbor node adjacent to the reference node when the reference node corresponding to the node to be decoded is occupied, the point cloud geometric information interframe decoding method further comprises:
judging whether the reference node corresponding to the node to be decoded is occupied; if not, acquiring the occupancy of a second neighbor node, the second neighbor node being any one or a combination of nodes coplanar, collinear or concurrent with the node to be decoded and nodes coplanar with the parent node of the node to be decoded;
and determining a context model of the occupancy code of the node to be decoded based on the occupancy of the second neighbor node.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011396370.3A CN112565764B (en) | 2020-12-03 | 2020-12-03 | Point cloud geometric information interframe coding and decoding method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112565764A (en) | 2021-03-26 |
CN112565764B (en) | 2022-10-04 |
Family
ID=75047729
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011396370.3A Active CN112565764B (en) | 2020-12-03 | 2020-12-03 | Point cloud geometric information interframe coding and decoding method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112565764B (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109196559A (en) * | 2016-05-28 | 2019-01-11 | 微软技术许可有限责任公司 | The motion compensation of dynamic voxelization point cloud is compressed |
CN111465964A (en) * | 2017-10-19 | 2020-07-28 | 交互数字Vc控股公司 | Method and apparatus for encoding/decoding geometry of point cloud representing 3D object |
WO2020190090A1 (en) * | 2019-03-20 | 2020-09-24 | 엘지전자 주식회사 | Point cloud data transmission device, point cloud data transmission method, point cloud data reception device and point cloud data reception method |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10499054B2 (en) * | 2017-10-12 | 2019-12-03 | Mitsubishi Electric Research Laboratories, Inc. | System and method for inter-frame predictive compression for point clouds |
US11210813B2 (en) * | 2019-05-30 | 2021-12-28 | Tencent America LLC | Method and apparatus for point cloud compression |
Non-Patent Citations (3)
Title |
---|
A 3D Haar Wavelet Transform for Point Cloud Attribute Compression Based on Local Surface Analysis;Sujun Zhang,Wei Zhang,Fuzheng Yang,Junyan Huo;《2019 Picture Coding Symposium (PCS)》;20200109;全文 * |
Shishir Subramanyam ; Pablo Cesar.Enhancement Layer Inter Frame Coding for 3D Dynamic Point Clouds.《2018 IEEE Games, Entertainment, Media Conference (GEM)》.2018, * |
基于几何重构的LiDAR点云几何无损压缩的研究;魏志文;《中国优秀博硕士学位论文全文数据库(硕士) 工程科技Ⅱ辑》;20200715;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN112565764A (en) | 2021-03-26 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||