CN112565764B - Point cloud geometric information interframe coding and decoding method - Google Patents

Point cloud geometric information interframe coding and decoding method Download PDF

Info

Publication number
CN112565764B
CN112565764B CN202011396370.3A CN202011396370A CN112565764B CN 112565764 B CN112565764 B CN 112565764B CN 202011396370 A CN202011396370 A CN 202011396370A CN 112565764 B CN112565764 B CN 112565764B
Authority
CN
China
Prior art keywords
node
point cloud
coded
occupation
context model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011396370.3A
Other languages
Chinese (zh)
Other versions
CN112565764A (en
Inventor
张伟
杨付正
聂佳明
陈天
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN202011396370.3A priority Critical patent/CN112565764B/en
Publication of CN112565764A publication Critical patent/CN112565764A/en
Application granted granted Critical
Publication of CN112565764B publication Critical patent/CN112565764B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The embodiment of the invention provides a point cloud geometric information interframe coding and decoding method, which comprises the steps of obtaining a frame point cloud before a current frame point cloud as a reference frame point cloud; respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, then dividing the divided nodes, and obtaining a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud after iterative division to leaf nodes; acquiring the occupancy condition of each reference node, and acquiring the occupancy condition of a first neighbor node of the reference node when the occupancy condition of the reference node corresponding to the coding node is occupied; and determining a context model of the occupation code of the node to be coded based on the occupation situation and carrying out entropy coding according to the context model to obtain a binary code stream. The invention uses the context model of the interframe to carry out entropy coding when the interframe prediction occupies, otherwise uses the context model of the intraframe to carry out entropy coding without motion compensation, thereby reducing the complexity of interframe coding and decoding and improving the performance of geometric entropy coding.

Description

Point cloud geometric information interframe coding and decoding method
Technical Field
The invention belongs to the technical field of coding and decoding, and particularly relates to a point cloud geometric information interframe coding and decoding method.
Background
With the continuous development of point cloud technology, the compression and encoding of point cloud data becomes an important research problem. At present, the Standard working Group (AVS) of the domestic digital Audio and Video coding Standard (Standard) of China and the Moving Picture Experts Group (MPEG) in the International organization for standardization both make the Standard of point cloud coding. At present, as shown in fig. 1, the process of encoding the point clouds in AVS first performs coordinate transformation on the geometric information, so that all the point clouds are contained in a bounding box. And then, quantizing, wherein the step of quantizing mainly plays a role of scaling, as the geometric information of a part of points is the same due to quantization rounding, whether to remove the repeated points is determined according to parameters, and the process of quantizing and removing the repeated points is also called voxelization. The bounding box is called a root node, the bounding box is divided into eight equal parts into 8 subcubes by carrying out octree division on the root node, each subcube is called a child node of the root node, the eight child nodes are respectively represented by 1 bit whether the child nodes are occupied or not, namely whether points in the point cloud are contained or not, 0 is represented by 0, 1 is represented by child nodes, the bits are called occupied bit codes, and the binary code stream is generated through entropy coding. And continuously carrying out octree division on the non-empty (including the points in the point cloud) subcubes until the division is stopped when the leaf nodes obtained by division are unit cubes of 1x1x1, and encoding the points in the leaf nodes to generate a binary code stream by adopting a traversal sequence with a preferred breadth in the whole process. And reconstructing the geometric information after the geometric coding is finished. The encoding of the attribute information is mainly performed for color information. First, color information is converted from an RGB color space to a YUV color space. And then recoloring the geometrically reconstructed point cloud by using the original point cloud so as to enable the uncoded attribute information to correspond to the reconstructed geometrical information. In color information coding, after dot clouds are sequenced by Morton codes, the nearest neighbor of a point to be predicted is searched by using a geometric spatial relationship, interpolation prediction is carried out on the point to be predicted by using a reconstruction attribute value of the found neighbor to obtain a prediction attribute value, then difference is carried out on a real attribute value and the prediction attribute value to obtain a prediction residual error, and finally quantization and coding are carried out on the prediction residual error to generate a binary code stream. Decoding process as shown in fig. 2, the decoding process is reciprocal to the encoding process.
In the AVS point cloud coding platform at the present stage, no inter-frame technology exists, and only the intra-frame technology exists, so that redundant information can be removed only by utilizing spatial correlation, and the redundant information cannot be removed by utilizing temporal correlation.
Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a point cloud geometric information interframe coding and decoding method. The technical problem to be solved by the invention is realized by the following technical scheme:
in a first aspect, the invention provides a point cloud geometric information interframe coding method, which includes:
acquiring a frame point cloud before a current frame point cloud as a reference frame point cloud of the current frame point cloud;
performing voxelization processing on the current frame point cloud to obtain the voxelized current frame point cloud;
respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, and continuing octree division on sub-nodes obtained by division until leaf nodes are divided to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud;
wherein the first octree structure comprises a plurality of nodes, each node corresponding to a reference node in the second octree structure;
aiming at each node to be coded in the first octree structure, acquiring the occupation situation of a corresponding reference node in the second octree structure;
for each node to be coded of the first octree structure, when the occupation situation of a reference node corresponding to the node to be coded is occupied, acquiring the occupation situation of a first neighbor node adjacent to the reference node;
determining a context model of the bit occupying code of the node to be coded based on the bit occupying situation of the first neighbor node;
and entropy coding the placeholder code of the node to be coded after the context model is determined to obtain a binary code stream.
Optionally, for each node to be encoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be encoded is occupied, the step of obtaining the occupancy of the first neighbor node adjacent to the reference node includes:
for each node to be coded of the first octree structure, when the occupancy condition of a reference node corresponding to the node to be coded is occupied, the occupancy condition of a first neighbor node coplanar with the reference node is obtained.
Optionally, before the step of acquiring, for each node to be encoded of the first octree structure, a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be encoded is occupied, the point cloud geometric information interframe coding method further includes:
judging whether a reference node corresponding to a node to be coded is occupied or not aiming at each node to be coded of the first octree structure, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanarity, collinearity and coplanarity with the node to be coded and coplanarity with a father node of the node to be coded;
and determining a context model of the bit occupying code of the node to be coded based on the bit occupying situation of the second neighbor node.
Optionally, before the step of determining the context model of the node-to-be-encoded bit-occupying code based on the bit-occupying situation of the first neighboring node, the point cloud geometric information interframe coding method further includes:
acquiring the occupation situation of a second neighbor node which is coplanar, collinear or concurrent with the node to be coded and is coplanar or combined with a father node of the node to be coded;
the step of determining the context model of the node occupation code to be coded based on the occupation situation of the first neighbor node comprises:
and determining a context model of the occupation code of the node to be coded based on the occupation situations of the first neighbor node and the second neighbor node.
Optionally, before the step of determining the context model of the node-to-be-encoded bit-occupying code based on the bit-occupying situation of the first neighboring node, the point cloud geometric information interframe coding method further includes:
acquiring the occupation situation of a third neighbor node which is coplanar, collinear and concurrent with the reference node corresponding to the node to be coded;
forming a set of the occupation conditions of the third neighbor nodes;
optionally selecting a first preset number of elements in the set as target elements;
the step of determining the context model of the occupation code of the node to be coded based on the occupation situation of the first neighbor node comprises the following steps:
and determining a context model of the node occupation code to be coded based on the target element.
Optionally, after the step of optionally selecting a preset number of elements in the set as target elements, the point cloud geometric information interframe coding method further includes:
optionally selecting a second preset number of elements from the target elements, and taking the second preset number of elements as elements for determining the same context model;
the first preset quantity is greater than the second preset quantity;
the step of determining the context model of the node occupation code to be coded based on the target element comprises the following steps:
determining a context model of the node-to-be-encoded bit-occupying code based on a second number of target elements.
Optionally, after the step of obtaining the occupancy of the first neighboring node coplanar with the reference node, the point cloud geometric information interframe coding method further includes:
acquiring the position information of the first neighbor node;
the step of determining the context model of the node-occupying code to be coded based on the occupying situation of the first neighboring node comprises:
and determining a context model of the occupation code of the node to be coded based on the position information and the occupation situation of the first neighbor node.
In a second aspect, the invention provides a point cloud geometric information interframe decoding method, which includes:
receiving a binary code stream;
the binary code stream comprises a node occupation code to be decoded;
for each node to be decoded, when the occupancy of the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node adjacent to the reference node;
determining a context model of the occupation code of the node to be decoded based on the occupation situation of the first neighbor node;
and entropy decoding the placeholder of the node to be decoded after the context model is determined.
Optionally, for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, the step of obtaining the occupancy of the first neighbor node adjacent to the reference node includes:
for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, the occupancy of a first neighbor node coplanar with the reference node is acquired.
Optionally, before the step of acquiring, for each node to be decoded of the first octree structure, a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be decoded is occupied, the point cloud geometric information interframe decoding method further includes:
judging whether a reference node corresponding to a node to be decoded is occupied, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanar, collinear and concurrent with the node to be decoded and coplanar with a father node of the node to be decoded;
and determining a context model of the occupation code of the node to be decoded based on the occupation situation of the second neighbor node.
The embodiment of the invention provides a point cloud geometric information interframe coding method, which comprises the steps of obtaining a frame point cloud before a current frame point cloud as a reference frame point cloud; respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, then dividing the divided nodes, and finishing iterative division until leaf nodes are reached to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud; acquiring the occupation situation of each reference node; when the occupation situation of a reference node corresponding to the coding node is occupied, acquiring the occupation situation of a first neighbor node of the reference node; and determining a context model of the occupation code of the node to be coded based on the occupation situation, and carrying out entropy coding according to the context model to obtain a binary code stream. The invention uses the context model of the interframe to carry out entropy coding when the interframe prediction occupies the space, otherwise uses the context model of the intraframe to carry out entropy coding, and does not need motion compensation, thereby reducing the complexity of interframe coding and decoding and improving the performance of geometric entropy coding.
The present invention will be described in further detail with reference to the accompanying drawings and examples.
Drawings
Fig. 1 is a flowchart illustrating an AVS encoding process according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of an AVS decoding process provided by an embodiment of the present invention;
FIG. 3 is a flowchart of an embodiment of an AVS encoding process using inter-frame techniques;
FIG. 4 is a flowchart of an embodiment of the interframe technique applied to AVS decoding;
FIG. 5 is a flowchart of a method for interframe coding of point cloud geometric information according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of neighboring nodes of a node corresponding to a reference frame;
FIG. 7 is a flowchart of AVS inter-frame prediction according to an embodiment of the present invention;
fig. 8 is a schematic diagram of a reference node according to an embodiment of the present invention having 6 coplanar neighbor nodes;
fig. 9 is a schematic diagram of a reference node according to an embodiment of the present invention, where 12 collinear neighbor nodes exist;
fig. 10 is a schematic diagram of a reference node according to an embodiment of the present invention having 8 co-located neighbor nodes;
fig. 11 is a flowchart of a point cloud geometric information interframe decoding method according to an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to specific examples, but the embodiments of the present invention are not limited thereto.
Example one
Referring to fig. 3, in the interframe technique in the AVS encoding process, with the point cloud geometric information interframe encoding method provided by the embodiment of the present invention, the current frame point cloud can be interframe predicted before entropy encoding, and then entropy encoding is performed.
As shown in fig. 4, a point cloud geometric information interframe coding method provided in the embodiment of the present invention includes:
s1a, acquiring a frame point cloud before a current frame point cloud as a reference frame point cloud of the current frame point cloud;
the frame point cloud before the current frame point cloud may be a previous frame point cloud of the current frame point cloud, or one frame of previous N frame point clouds, which is not limited herein.
S2a, performing voxelization processing on the current frame point cloud to obtain the voxelized current frame point cloud;
s3a, respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, and continuing octree division on the sub-nodes obtained by division until leaf nodes are divided to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud;
wherein the first octree structure comprises a plurality of nodes, each node corresponding to a reference node in the second octree structure;
s4a, aiming at each node to be coded of the first octree structure, acquiring the occupation situation of a corresponding reference node in the second octree structure;
s5a, aiming at each node to be coded of the first octree structure, when the occupation situation of a reference node corresponding to the node to be coded is occupied, acquiring the occupation situation of a first neighbor node adjacent to the reference node;
referring to fig. 5, the left side of fig. 5 is a reference frame, the right side of fig. 5 is a current frame, a node to be encoded in the current frame is a gray block, a reference node corresponding to the node to be encoded in the reference frame is also a gray block, and a blank block is a first neighbor node.
S6a, determining a context model of the bit occupying code of the node to be coded based on the bit occupying situation of the first neighbor node;
and S7a, entropy coding is carried out on the node placeholder code to be coded after the context model is determined, and a binary code stream is obtained.
As an optional implementation manner of the present invention, after obtaining the binary code stream, the probability distribution corresponding to the context model assigned to the placeholder pair of the node to be coded may be updated.
It can be understood that after the context is allocated each time, the probability distribution corresponding to the context may change, so that updating the probability distribution of the context can improve the accuracy of allocating the context to the node bit occupying code to be encoded, thereby improving the performance of encoding.
Referring to fig. 6, the execution process may be:
(1) Firstly, a part of memory can be developed to store reconstructed point cloud obtained from the previous frame as a reference frame of the current frame point cloud and the reference frame is marked as pointcloudPred;
(2) Quantizing the current frame point cloud to obtain a current frame quantized point cloud which is marked as pointcloudQua;
(3) The current frame quantized point cloud pointcloudQua and the reference frame point cloud pointcloudPred are respectively subjected to octree division to obtain occupation codes of the octree which are respectively recorded as occupancy and occupancy Pred, each bit of the 8-bit occupation codes are respectively recorded as Oi and OPi, and the 8-bit occupation codes of the current point cloud and the reference point cloud are respectively stored in a hash table, so that neighbor information can be conveniently searched for nodes later.
(4) Judging whether the current node meets the isolated point coding scheme, if so, directly making the occupancy be 0, entering the coding (5), then coding the relative coordinates of the isolated point, and if not, directly entering the coding (5).
(5) If OPi is equal to 1, namely the ith sub-node of the current block of the reference frame is occupied, searching occupation information of a reference frame node (shown in figure 4) corresponding to a coplanar neighbor node coded by the current sub-node Oi, calculating 4 conditions of 0-3 of the occupied neighbor node, assuming that 4 corresponding contexts are ctx _ interPredict [4], each context corresponds to one probability distribution, selecting the probability distribution corresponding to the context by utilizing the occupation condition of the neighbor node, entropy coding the ith sub-node occupation code Oi of the current block of the current frame based on the selected probability distribution, and updating the probability distribution corresponding to the selected context based on the Oi; if OPi is equal to 0, namely the ith sub-node of the current block of the reference frame is not occupied, entropy coding the ith sub-node occupied bit code Oi of the current block of the current frame by using 191 contexts in the original frame, wherein one context corresponds to one probability distribution, and updating the probability distribution corresponding to the selected context based on Oi.
The embodiment of the invention provides a point cloud geometric information interframe coding method, which comprises the steps of obtaining a frame point cloud before a current frame point cloud as a reference frame point cloud; respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, then dividing the divided nodes, and finishing iterative division until leaf nodes are reached to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud; acquiring the occupation situation of each reference node; when the occupation situation of the reference node corresponding to the coding node is occupied, acquiring the occupation situation of a first neighbor node of the reference node; and determining a context model of the occupation code of the node to be coded based on the occupation situation, and carrying out entropy coding according to the context model to obtain a binary code stream. The invention uses the context model of the interframe to carry out entropy coding when the interframe prediction occupies, otherwise uses the context model of the intraframe to carry out entropy coding, and does not need motion compensation, thereby reducing the complexity of interframe coding and decoding and improving the performance of geometric entropy coding.
Example two
As an optional embodiment of the present invention, for each node to be encoded in the first octree structure, when the occupancy of the reference node corresponding to the node to be encoded is occupied, the step of acquiring the occupancy of the first neighbor node adjacent to the reference node includes:
for each node to be coded of the first octree structure, when the occupancy of the reference node corresponding to the node to be coded is occupied, the occupancy of a first neighbor node coplanar with the reference node is obtained.
EXAMPLE III
As an optional embodiment of the present invention, before the step of acquiring, for each node to be encoded of the first octree structure, a first neighboring node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be encoded is occupied, the point cloud geometric information interframe coding method further includes:
a, step a: judging whether a reference node corresponding to a node to be coded is occupied or not aiming at each node to be coded of the first octree structure, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanarity, collinearity and coplanarity with the node to be coded and coplanarity with a father node of the node to be coded;
step b: and determining a context model of the bit occupying code of the node to be coded based on the bit occupying condition of the second neighbor node.
Example four
As an optional embodiment of the present invention, before the step of determining the context model of the node-to-be-encoded placeholder code based on the placeholder condition of the first neighboring node, the point cloud geometric information interframe coding method further includes:
acquiring the occupation situation of a second neighbor node which is coplanar, collinear or concurrent with the node to be coded and is coplanar or combined with a father node of the node to be coded;
the step of determining the context model of the node occupancy code to be coded based on the occupancy of the first neighboring node comprises:
and determining a context model of the occupation code of the node to be coded based on the occupation situations of the first neighbor node and the second neighbor node.
It is understood that, in this embodiment, intra-frame and inter-frame are not separately used, but context between intra-frame and inter-frame is used in combination, and may be combined into context or context, taking both intra-frame information and inter-frame information into consideration. In the above interframe scheme, there are 4 contexts between frames and 191 contexts in a frame, and then there are 4 × 191=764 contexts.
EXAMPLE five
As an optional embodiment of the present invention, before the step of determining the context model of the node-to-be-encoded placeholder code based on the placeholder condition of the first neighboring node, the point cloud geometric information interframe coding method further includes:
step a: acquiring the occupation situation of a third neighbor node which is coplanar, collinear and concurrent with the reference node corresponding to the node to be coded;
step b: forming a set of the occupation conditions of the third neighbor nodes;
step c: optionally selecting a preset number of elements in the set as target elements;
step d: the step of determining the context model of the occupation code of the node to be coded based on the occupation situation of the first neighbor node comprises the following steps:
step e: and determining a context model of the node occupation code to be coded based on the target element.
As shown in fig. 7 to 9, the embodiment of the present embodiment that the reference node has 6 coplanar neighbors, 12 collinear neighbors and 8 concurrent neighbors changes the way of finding the neighbors, and not only finds the reference frame node corresponding to the coplanar node encoded by the current child node Oi. Since all the information between frames can be obtained at the encoding and decoding ends, all the coplanarity, collinearity and collinearity points of the OPi corresponding to the Oi node can be searched for, and the total 26 neighbor nodes or the subset of the 26 neighbor nodes can be searched for.
EXAMPLE six
As an optional embodiment of the present invention, after the step of optionally selecting a preset number of elements in the set as target elements, the point cloud geometric information interframe coding method further includes:
optionally selecting a second preset number of elements from the target elements, and taking the second preset number of elements as elements for determining the same context model;
wherein the first preset number is greater than the second preset number;
the step of determining the context model of the node occupation code to be coded based on the target element comprises the following steps:
determining a context model of the node-to-be-encoded bit-occupying code based on a second number of target elements.
It can be understood that, the step (5) in the embodiment is modified to map the occupied numbers and the corresponding contexts of the nodes, instead of mapping each occupied number and corresponding to one context, several occupied numbers and corresponding to one context may be mapped, for example, if the occupied neighbors in the main scheme are 1 and 2, the same context is used for encoding, and then one context may be reduced, and only 3 contexts are used.
EXAMPLE seven
As an optional embodiment of the present invention, after the step of obtaining the occupancy of the first neighboring node coplanar with the reference node, the point cloud geometric information interframe coding method further includes:
acquiring the position information of the first neighbor node;
the step of determining the context model of the node-occupying code to be coded based on the occupying situation of the first neighboring node comprises:
and determining a context model of the occupation code of the node to be coded based on the position information and the occupation situation of the first neighbor node.
It is understood that the occupancy is occupied or unoccupied, the position information includes the position of the first neighboring node relative to the reference node, the occupancy and the position information are arranged and combined, and the number of the context models is expanded to a geometric multiple.
The advantages of the coding method provided by the present invention are verified by experimental data. Under the condition of not changing geometric PSNR (PSNR is an objective standard for image evaluation, and the quality of an image is better when PSNR is larger), the size of a coded code stream can be reduced, as shown in the following table, BD-rate of reconstructed point cloud under the condition of C1 (geometric damage and attribute damage) (BD-rate is a parameter for measuring the performance is good, and when BD-rate is negative, the performance is good, on the basis, the larger the absolute value of BD-rate is, the larger the gain of the performance is), the performance is good, and bpp under the condition of C4 (geometric damage and attribute damage) (bpp is used for measuring the size of the code stream under the condition of damage, less than 100% represents that the performance is good, and on the basis, the smaller the absolute value is, the larger the gain is), the performance is good. The results of testing the geometric information of a part of multi-frame point cloud sequences on an AVS platform under the conditions of C1 (geometric lossy, attribute lossy) and C4 (geometric lossless and attribute lossless) are shown in tables 1 and 2:
TABLE 1 comparison of the Performance of the encoding method of the present invention with that of the existing AVS Point cloud encoding technique under C1
Figure GDA0003679610810000131
Figure GDA0003679610810000141
TABLE 2 Performance comparison results of the encoding method of the present invention and the existing AVS point cloud encoding technique under C4 conditions
Sequence name bpp
ford_01 97.1%
ford_02 98.0%
ford_03 98.7%
Example eight
Referring to fig. 10, an interframe technique is performed in the AVS decoding process, thereby obtaining an interframe decoding method for point cloud geometric information according to an embodiment of the present invention.
As shown in fig. 11, a point cloud geometric information interframe decoding method provided in the embodiment of the present invention includes:
s1b, receiving a binary code stream;
the binary code stream comprises a node occupation code to be decoded;
s2b, aiming at each node to be decoded, when the occupation situation of the reference node corresponding to the node to be decoded is occupied, acquiring the occupation situation of a first neighbor node adjacent to the reference node;
s3b, determining a context model of the occupation code of the node to be decoded based on the occupation situation of the first neighbor node;
and S4b, entropy decoding the node placeholder code to be decoded after the context model is determined.
Example nine
As an optional embodiment of the present invention, for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, the step of acquiring the occupancy of the first neighbor node adjacent to the reference node includes:
for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, the occupancy of a first neighbor node coplanar with the reference node is acquired.
Example ten
As an optional embodiment of the present invention, before the step of acquiring, for each node to be decoded of the first octree structure, a neighboring node adjacent to a reference node when an occupation situation of the reference node corresponding to the node to be decoded is occupied, the point cloud geometric information interframe decoding method further includes:
step a: judging whether a reference node corresponding to a node to be decoded is occupied, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanarity, collinearity and coplanarity with the node to be decoded and coplanarity with a father node of the node to be decoded;
step b: and determining a context model of the occupation code of the node to be decoded based on the occupation situation of the second neighbor node.
Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.
The foregoing is a further detailed description of the invention in connection with specific preferred embodiments and it is not intended to limit the invention to the specific embodiments described. For those skilled in the art to which the invention pertains, numerous simple deductions or substitutions may be made without departing from the spirit of the invention, which shall be deemed to belong to the scope of the invention.

Claims (10)

1. A point cloud geometric information interframe coding method is characterized by comprising the following steps:
acquiring a frame point cloud before a current frame point cloud as a reference frame point cloud of the current frame point cloud;
performing voxelization processing on the current frame point cloud to obtain the voxelized current frame point cloud;
respectively carrying out octree division on the current frame point cloud and the reference frame point cloud, and continuously carrying out octree division on sub nodes obtained by division until leaf nodes are divided to obtain a first octree structure of the current frame point cloud and a second octree structure of the reference frame point cloud;
wherein the first octree structure comprises a plurality of nodes, each node corresponding to a reference node in the second octree structure;
aiming at each node to be coded in the first octree structure, acquiring the occupation situation of a corresponding reference node in the second octree structure;
for each node to be coded of the first octree structure, when the occupation situation of a reference node corresponding to the node to be coded is occupied, acquiring the occupation situation of a first neighbor node adjacent to the reference node;
determining a context model of the bit occupying code of the node to be coded based on the bit occupying situation of the first neighbor node;
and entropy coding the placeholder code of the node to be coded after the context model is determined to obtain a binary code stream.
2. The point cloud geometric information interframe coding method according to claim 1, wherein the step of acquiring, for each node to be coded of the first octree structure, an occupancy of a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be coded is occupied comprises:
for each node to be coded of the first octree structure, when the occupancy condition of a reference node corresponding to the node to be coded is occupied, the occupancy condition of a first neighbor node coplanar with the reference node is obtained.
3. The point cloud geometric information interframe coding method according to claim 1, wherein before the step of obtaining, for each node to be coded of the first octree structure, a first neighboring node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be coded is occupied, the point cloud geometric information interframe coding method further comprises:
judging whether a reference node corresponding to a node to be coded is occupied or not aiming at each node to be coded of the first octree structure, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanarity, collinearity and coplanarity with the node to be coded and coplanarity with a father node of the node to be coded;
and determining a context model of the bit occupying code of the node to be coded based on the bit occupying situation of the second neighbor node.
4. The point cloud geometric information interframe coding method according to claim 1, wherein before the step of determining the context model of the node-to-be-coded space-occupying code based on the space-occupying condition of the first neighboring node, the point cloud geometric information interframe coding method further comprises:
acquiring the occupation situation of a second neighbor node which is coplanar, collinear or concurrent with the node to be coded and is coplanar or combined with a father node of the node to be coded;
the step of determining the context model of the node occupation code to be coded based on the occupation situation of the first neighbor node comprises:
and determining a context model of the node occupation code to be coded based on occupation conditions of the first neighbor node and the second neighbor node.
5. The point cloud geometry information interframe coding method of claim 1, wherein prior to the step of determining the context model of the node-to-be-coded placeholder code based on the placeholder of the first neighboring node, the point cloud geometry information interframe coding method further comprises:
acquiring the occupation situation of a third neighbor node which is coplanar, collinear and concurrent with the reference node corresponding to the node to be coded;
forming a set of the occupation conditions of the third neighbor nodes;
optionally selecting a first preset number of elements in the set as target elements;
the step of determining the context model of the node occupancy code to be coded based on the occupancy of the first neighboring node comprises:
and determining a context model of the node occupation code to be coded based on the target element.
6. The point cloud geometric information interframe coding method according to claim 5, wherein after the step of optionally selecting a preset number of elements in the set as target elements, the point cloud geometric information interframe coding method further comprises:
optionally selecting a second preset number of elements from the target elements, and taking the second preset number of elements as elements for determining the same context model;
the first preset quantity is greater than the second preset quantity;
the step of determining the context model of the node occupation code to be coded based on the target element comprises the following steps:
determining a context model of the node-to-be-encoded bit-occupying code based on a second number of target elements.
7. The point cloud geometry information interframe coding method of claim 2, wherein after the step of obtaining an occupancy of a first neighbor node that is coplanar with a reference node, the point cloud geometry information interframe coding method further comprises:
acquiring the position information of the first neighbor node;
the step of determining the context model of the node-occupying code to be coded based on the occupying situation of the first neighboring node comprises:
and determining a context model of the occupation code of the node to be coded based on the position information and the occupation situation of the first neighbor node.
8. A point cloud geometric information interframe decoding method is characterized by comprising the following steps:
receiving a binary code stream;
the binary code stream comprises a node occupation code to be decoded;
for each node to be decoded of the first octree structure, when the occupation situation of a reference node corresponding to the node to be decoded is occupied, acquiring the occupation situation of a first neighbor node adjacent to the reference node;
determining a context model of the node occupation code to be decoded based on the occupation situation of the first neighbor node;
and entropy decoding the placeholder of the node to be decoded after the context model is determined.
9. The point cloud geometric information interframe decoding method of claim 8, wherein the step of obtaining, for each node to be decoded of the first octree structure, an occupancy of a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be decoded is occupied comprises:
for each node to be decoded of the first octree structure, when the occupancy of the reference node corresponding to the node to be decoded is occupied, acquiring the occupancy of a first neighbor node coplanar with the reference node.
10. The point cloud geometric information interframe decoding method according to claim 8, wherein before the step of acquiring, for each node to be decoded of the first octree structure, a first neighbor node adjacent to a reference node when an occupancy of the reference node corresponding to the node to be decoded is occupied, the point cloud geometric information interframe decoding method further comprises:
judging whether a reference node corresponding to a node to be decoded is occupied, if not, acquiring the occupation condition of a second neighbor node which is any one or combination of coplanarity, collinearity and coplanarity with the node to be decoded and coplanarity with a father node of the node to be decoded;
and determining a context model of the occupation code of the node to be decoded based on the occupation situation of the second neighbor node.
CN202011396370.3A 2020-12-03 2020-12-03 Point cloud geometric information interframe coding and decoding method Active CN112565764B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011396370.3A CN112565764B (en) 2020-12-03 2020-12-03 Point cloud geometric information interframe coding and decoding method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011396370.3A CN112565764B (en) 2020-12-03 2020-12-03 Point cloud geometric information interframe coding and decoding method

Publications (2)

Publication Number Publication Date
CN112565764A CN112565764A (en) 2021-03-26
CN112565764B true CN112565764B (en) 2022-10-04

Family

ID=75047729

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011396370.3A Active CN112565764B (en) 2020-12-03 2020-12-03 Point cloud geometric information interframe coding and decoding method

Country Status (1)

Country Link
CN (1) CN112565764B (en)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115471627A (en) * 2021-06-11 2022-12-13 维沃移动通信有限公司 Point cloud geometric information encoding processing method, point cloud geometric information decoding processing method and related equipment
CN115474058A (en) * 2021-06-11 2022-12-13 维沃移动通信有限公司 Point cloud encoding processing method, point cloud decoding processing method and related equipment
CN115474052A (en) * 2021-06-11 2022-12-13 维沃移动通信有限公司 Point cloud encoding processing method, point cloud decoding processing method and related equipment
CN113453009B (en) * 2021-07-04 2023-02-14 西北工业大学 Point cloud space scalable coding geometric reconstruction method based on fitting plane geometric error minimum
WO2023003144A1 (en) * 2021-07-20 2023-01-26 엘지전자 주식회사 Point cloud data transmission device, point cloud data transmission method, point cloud data reception device, and point cloud data reception method
CN113613017B (en) * 2021-07-27 2024-04-19 闽都创新实验室 Method for improving V-PCC inter-frame prediction by three-dimensional inter-frame prediction
CN113676738B (en) * 2021-08-19 2024-03-29 上海交通大学 Geometric coding and decoding method and device of three-dimensional point cloud
EP4425928A1 (en) * 2021-10-27 2024-09-04 LG Electronics Inc. Point cloud data transmission device, point cloud data transmission method, point cloud data reception device, and point cloud data transmission method
CN116055751A (en) * 2021-10-28 2023-05-02 华为技术有限公司 Encoding and decoding method, device, equipment, storage medium and program product of point cloud
CN116233388B (en) * 2021-12-03 2024-08-27 维沃移动通信有限公司 Point cloud coding and decoding processing method and device, coding equipment and decoding equipment
WO2023131126A1 (en) * 2022-01-04 2023-07-13 Beijing Bytedance Network Technology Co., Ltd. Method, apparatus, and medium for point cloud coding
WO2023132331A1 (en) * 2022-01-07 2023-07-13 Kddi株式会社 Point cloud decoding device, point cloud decoding method, and program
EP4220561A1 (en) * 2022-02-01 2023-08-02 Beijing Xiaomi Mobile Software Co., Ltd. Method and apparatus of encoding/decoding a slice of point cloud data
CN116800969A (en) * 2022-03-18 2023-09-22 维沃移动通信有限公司 Encoding and decoding methods, devices and equipment
US20230342987A1 (en) * 2022-04-14 2023-10-26 Qualcomm Incorporated Occupancy coding using inter prediction with octree occupancy coding based on dynamic optimal binary coder with update on the fly (obuf) in geometry-based point cloud compression
WO2024145934A1 (en) * 2023-01-06 2024-07-11 Oppo广东移动通信有限公司 Point cloud coding/decoding method and apparatus, and device and storage medium
WO2024145910A1 (en) * 2023-01-06 2024-07-11 Oppo广东移动通信有限公司 Encoding method, decoding method, bitstream, encoder, decoder and storage medium
WO2024145935A1 (en) * 2023-01-06 2024-07-11 Oppo广东移动通信有限公司 Point cloud encoding method and apparatus, point cloud decoding method and apparatus, device, and storage medium
WO2024145904A1 (en) * 2023-01-06 2024-07-11 Oppo广东移动通信有限公司 Encoding method, decoding method, code stream, encoder, decoder, and storage medium
WO2024145913A1 (en) * 2023-01-06 2024-07-11 Oppo广东移动通信有限公司 Point cloud encoding and decoding method and apparatus, device, and storage medium
WO2024145933A1 (en) * 2023-01-06 2024-07-11 Oppo广东移动通信有限公司 Point cloud coding method and apparatus, point cloud decoding method and apparatus, and devices and storage medium
WO2024197680A1 (en) * 2023-03-29 2024-10-03 Oppo广东移动通信有限公司 Point cloud coding method and apparatus, point cloud decoding method and apparatus, device, and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109196559A (en) * 2016-05-28 2019-01-11 微软技术许可有限责任公司 The motion compensation of dynamic voxelization point cloud is compressed
CN111465964A (en) * 2017-10-19 2020-07-28 交互数字Vc控股公司 Method and apparatus for encoding/decoding geometry of point cloud representing 3D object
WO2020190090A1 (en) * 2019-03-20 2020-09-24 엘지전자 주식회사 Point cloud data transmission device, point cloud data transmission method, point cloud data reception device and point cloud data reception method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10499054B2 (en) * 2017-10-12 2019-12-03 Mitsubishi Electric Research Laboratories, Inc. System and method for inter-frame predictive compression for point clouds
US11210813B2 (en) * 2019-05-30 2021-12-28 Tencent America LLC Method and apparatus for point cloud compression

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109196559A (en) * 2016-05-28 2019-01-11 微软技术许可有限责任公司 The motion compensation of dynamic voxelization point cloud is compressed
CN111465964A (en) * 2017-10-19 2020-07-28 交互数字Vc控股公司 Method and apparatus for encoding/decoding geometry of point cloud representing 3D object
WO2020190090A1 (en) * 2019-03-20 2020-09-24 엘지전자 주식회사 Point cloud data transmission device, point cloud data transmission method, point cloud data reception device and point cloud data reception method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
A 3D Haar Wavelet Transform for Point Cloud Attribute Compression Based on Local Surface Analysis;Sujun Zhang,Wei Zhang,Fuzheng Yang,Junyan Huo;《2019 Picture Coding Symposium (PCS)》;20200109;全文 *
Shishir Subramanyam ; Pablo Cesar.Enhancement Layer Inter Frame Coding for 3D Dynamic Point Clouds.《2018 IEEE Games, Entertainment, Media Conference (GEM)》.2018, *
基于几何重构的LiDAR点云几何无损压缩的研究;魏志文;《中国优秀博硕士学位论文全文数据库(硕士) 工程科技Ⅱ辑》;20200715;全文 *

Also Published As

Publication number Publication date
CN112565764A (en) 2021-03-26

Similar Documents

Publication Publication Date Title
CN112565764B (en) Point cloud geometric information interframe coding and decoding method
CN112565795B (en) Point cloud geometric information encoding and decoding method
CN108322742B (en) A kind of point cloud genera compression method based on intra prediction
CN108781281B (en) Shape adaptive model based codec for lossy and lossless image compression
CN113382252B (en) Encoding and decoding method, device, equipment and storage medium
JP7268598B2 (en) Information processing device and method
US20230065156A1 (en) Point cloud encoding/decoding method, encoder, decoder, and storage medium
CN112565794A (en) Point cloud isolated point encoding and decoding method and device
CN114095735A (en) Point cloud geometric inter-frame prediction method based on block motion estimation and motion compensation
CN102484705B (en) Encoding and decoding a video image sequence by image areas
CN111447452B (en) Data coding method and system
CN115190300A (en) Method for predicting attribute information, encoder, decoder, and storage medium
KR100573507B1 (en) Partition coding method and device
CN109688411B (en) Video coding rate distortion cost estimation method and device
CN113395516A (en) Intra-frame prediction method and device and computer-readable storage medium
CN113453009B (en) Point cloud space scalable coding geometric reconstruction method based on fitting plane geometric error minimum
CN112509107A (en) Point cloud attribute recoloring method, device and encoder
CN107409216B (en) Image encoding and decoding method, encoding and decoding device and corresponding computer program
CN116320488A (en) Intra-frame prediction method, device, encoder, decoder, and storage medium
CN110855991A (en) Graphic data compression method for computer image processing
CN112969067B (en) Video coding method, device, equipment and readable storage medium
CN116781898A (en) HEVC intra-frame prediction mode rapid selection method, device and medium
CN117156115A (en) Point cloud geometric coding and decoding method and device based on multi-neighbor state transition
CN118765499A (en) Encoding method, encoder, and storage medium
CN118525299A (en) Encoding method, decoding method, encoder, decoder, and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant