WO2021043136A1

WO2021043136A1 - Point cloud projection transmission method, and reception processing method and system

Info

Publication number: WO2021043136A1
Application number: PCT/CN2020/112857
Authority: WO
Inventors: 徐异凌; 王丹盈; 徐英展; 朱文婕
Original assignee: 上海交通大学
Priority date: 2019-09-03
Filing date: 2020-09-01
Publication date: 2021-03-11

Abstract

Disclosed are a point cloud projection transmission method based on a user viewing angle, and a reception processing method and system, characterized in that the method comprises: distinguishing, on the basis of a user viewing angle, point cloud fragments of viewing angles needing to be observed and of viewing angles not needing to be observed, preferentially allocating resources to the point cloud fragments of the viewing angles needing to be observed, so as to encode and transmit same, and/or maintaining, on the basis of a spatial association space between an edge fragment and a main fragment, a projection vector of the edge fragment or changing same on the basis of the user viewing angle. In a projection transmission solution where adaptive adjustment is performed according to a user viewing angle, optimization adjustment is performed in an attention characteristic direction of a user when viewing a point cloud object, thereby improving the transmission quality.

Description

Point cloud projection transmission method, receiving processing method and system

Technical field

The invention belongs to the field of point clouds, and in particular relates to a point cloud projection transmission method, reception processing method and system based on a user's perspective.

Background technique

In recent years, media transmission technology has been rapidly developed and improved, and the transmission cost of image and video information has been continuously reduced while accuracy has been improved, which has further promoted the maturity of related application fields. Three-dimensional point cloud data is a new type of image media data, which can provide users with more accurate and vivid image information, and bring higher fidelity and more interactive communication experience.

Point cloud data is the surface information of the object obtained by scanning, including three-dimensional coordinate data, depth information, color information, and so on. The maturity of 3D scanning technology and the perfection of the system architecture have improved the accuracy of point cloud data, while also greatly increasing the amount of point cloud data. Among them, there are different point cloud data compression algorithms for static and dynamic point cloud data, and different types of point cloud data such as machine perception and human eye perception. For example, for dynamic human eye perception point cloud data, a typical point cloud compression algorithm is to convert 3D point cloud data into 2D image data, and then perform data processing, one of which is video-based point cloud compression (Video- based Point Cloud Compression, VPCC) algorithm. This compression method first projects a 3D point cloud onto a 2D plane to obtain occupancy map information, geometric information, attribute information, and auxiliary information. The attribute information usually includes texture information and color information. Therefore, the compressed information is usually divided into four categories. Data is transferred. Through this method, the overall compressed transmission of the point cloud sequence is realized.

In order to solve the computational burden caused by the huge amount of point cloud data, there have been more researches on the point cloud compression coding algorithm, and there are some discussions on the standardization of point cloud compression. At present, a typical compression scheme applied to point cloud coding and transmission is to convert three-dimensional point cloud data into two-dimensional images for processing through segmented projection. This solution uses the existing video coding tools to encode and transmit the point cloud, in which it is necessary to sequentially carry out the division of the point cloud, the subdivision, segment projection, blank filling and other steps to achieve the segment projection of the point cloud. Then, the two-dimensional data obtained by projection can be encoded and transmitted with the help of video encoding technologies such as HEVC. Through this method, the overall compressed transmission of the point cloud sequence is realized.

Patent document CN104778691B (application number: 201510160208.4) discloses a simplified processing method for three-dimensional point cloud data. The invention mainly solves the technical problems of poor retention of three-dimensional object surface features, poor reconstruction effect, and large amount of calculation existing in the existing method for streamlining three-dimensional point clouds. The invention uses the information in the color image synchronized by the Kinect camera to streamline the three-dimensional point cloud when collecting the point cloud data. The method combines the color map and the point cloud for streamlining, which can avoid some curvature of the object surface caused by the collection and streamlining process There is no significant change but the point cloud data of the characteristic information is lost; you can set the corresponding reduction rate according to the gray information of the object itself for reduction; you can also reserve the characteristic points that you think are more important according to subjective factors, and selectively simplify Point cloud.

However, in actual application scenarios, the user cannot observe the entire content of a point cloud object. For example, when the user observes the point cloud object from the front, the user cannot see the media content on the back of the point cloud. The process of cloud processing does not differentiate users' attention to differences in various aspects. In addition, the existing point cloud compression scheme implements point cloud transmission. Since the point cloud fragments that can be observed and the point cloud fragments that cannot be observed are compressed with the same accuracy and provide the same definition of presentation, this will Unnecessary demand for transmission bandwidth, etc. will be put forward. For this unnecessary amount of information caused by the occlusion of the point cloud, the current overall compression transmission scheme for the point cloud sequence does not propose a corresponding solution. How to make distinctions based on the degree of attention of the users' different perspectives, and to reduce this information redundancy without affecting the quality of observations, and adapt it to actual user application scenarios, are key issues that need to be resolved urgently.

Summary of the invention

The purpose of the present invention is to provide a point cloud projection transmission method, reception processing method and system, which introduces the user's perspective to improve the quality of point cloud projection processing, and optionally, projection is performed on the side with high attention that has a greater impact on the user's perception The vector is modified and perfected, or combined with optimized point cloud compression algorithm to achieve.

In order to achieve the above objective, the present invention provides a point cloud projection transmission method based on the user's perspective, which has such features, including: preliminary division of point cloud objects on a standard vector to obtain the point cloud segment to be processed; and determination based on the user's perspective The projection transmission scheme is to process the point cloud segment to be processed.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that the closest standard normal vector is taken as the classification category of the midpoint of the point cloud object; based on six standards The normal vector divides the midpoint of the point cloud object into multiple first-level point cloud fragments; in the same first-level point cloud fragment, the points with the same normal vector and the distance less than the predetermined value are then divided into second-level point cloud fragments as to be processed Cloud fragment.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have such a feature, wherein the projection transmission scheme includes: further dividing the to-be-processed point cloud segment to obtain points corresponding to the perspective category Cloud fragments, including the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed; the point cloud fragments that need to be observed are assigned priority resources for encoding transmission, and the point cloud fragments that do not need to be observed are compared Prioritize the allocation of secondary resources for coding transmission.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have such a feature, wherein the projection transmission scheme includes any one or more of the following: no point cloud modification Projection plan, improve the point cloud projection plan, modify the standard normal vector plan according to the user's perspective.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that the solution of not modifying the point cloud projection method includes: placing the point cloud object in the standard vector The above preliminary division is to associate the parameter list with the standard vector as the projection vector to obtain the point cloud segment to be processed. Considering the user's perspective and the direction of the projection vector, the point cloud segment to be processed is divided into the point cloud segment that needs to be observed and the point cloud segment that needs to be observed. The point cloud fragment that needs to be observed.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature of setting corresponding points for the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed. The encoding and decoding parameters of each point cloud segment are recorded as additional information for encoding and transmission.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that the improved point cloud projection solution includes: further dividing the to-be-processed point cloud fragments to obtain corresponding The point cloud fragments of the perspective category include the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed; the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed are projected onto the projection plane, according to The size of the projection plane needs to be changed to obtain the need to observe pictures and the need not to observe pictures.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have such characteristics, and further include: as the first improved point cloud projection solution, the point cloud segment and the Under the premise that the point cloud fragments that do not need to be observed are misaligned, the pictures that do not need to be observed are merged into the pictures that need to be observed, and a projected picture is obtained, which is compressed and encoded.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that a projected picture obtained forms a set of video sequences, and the point cloud that does not need to be observed is recorded The index parameter index corresponding to the segment and the scale of the size change are used as newly added additional information for encoding transmission.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have such characteristics, and further include: as a second improved point cloud projection solution, there will be a need to observe the picture and the need not to observe the picture. Compress and encode separately.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that the projected images that need to be observed and the images that do not need to be observed respectively form two sets of video sequences, and the two sets are encoded. Video sequence, record the change ratio of the video sequence corresponding to the point cloud segment that does not need to be observed, encode and transmit as new additional information, and further record the position of each group of video sequences in the coded stream.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have such a feature. The third improved point cloud projection solution includes: the point cloud segment to be processed according to six standard methods The vectors are grouped and projected onto six projection planes, and the size of the six projection planes is changed according to the user's perspective based on the user's perspective, and compression coding is performed respectively.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, the point cloud segment corresponding to each standard normal vector can be respectively projected to each group of video sequences, and six groups are encoded. Video sequence, record the change ratio of the video sequence corresponding to the point cloud segment that does not need to be observed, encode and transmit as new additional information, and further record the position of each group of video sequences in the coded stream.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that the modification of the standard normal vector scheme according to the user's perspective includes: determining the perspective according to the user's observation perspective Vector; rotate the view vector with one of the standard normal vectors, and then perform the same rotation mapping processing on the other standard normal vectors, and determine the projection direction after the rotation mapping on the premise that the overall modification amount is less than a predetermined value.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature, wherein the step of reducing the overall modification amount to be less than a predetermined value includes: the perspective vector corresponds to the standard normal vector , And after the rotation, the sum of the dot products of the six standard normal vectors after the rotation mapping including the viewing angle vector and the corresponding original six standard normal vectors takes the maximum value or a larger value within a predetermined range.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that the standard normal vector after the rotation mapping is adjusted will be encoded as the newly added additional information transmission.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have such a feature, wherein the projection transmission scheme includes: the point cloud segment to be processed is distinguished from the main segment and the point cloud segment according to the user's perspective. Multiple edge segments; considering the spatial connection between the edge segment and the main segment, the projection vector of the edge segment is maintained or changed based on the user's perspective; subsequent point cloud processing is performed after projection.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature, in which the judgment basis considering the spatial connection between the edge segment and the main segment includes: The user’s perspective determines the principal vector in the standard normal vector; the principal segment and the edge segment are respectively projected along the principal vector direction to obtain the principal two-dimensional projection map and the edge two-dimensional projection map; based on each edge two-dimensional projection map, and each The main two-dimensional projection map is calculated to obtain the intersection ratio; multiple intersection ratios are compared with the preset threshold value as the basis for judging the spatial connection.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that when multiple intersection ratios are greater than a preset threshold, the edge The projection vector of the segment is changed to the main vector; if none of the multiple intersection ratios is greater than the preset threshold, the projection vector of the edge segment is maintained.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that the preset threshold value depends on the surface characteristics of the point cloud object to distinguish the object point cloud object The determination of the threshold value of the surface characteristics of the character point cloud object and the surface characteristics of the character point cloud object can be further adjusted and determined according to the experimental results.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature, wherein, according to the position where the user views the point cloud object, a point cloud is used to point the user to the point cloud. The vector of the observation point represents the viewing angle of the user, which is called the viewing angle vector. The distance between the viewing angle vector and the six standard normal vectors is calculated, and the standard normal vector with the closest distance to the viewing angle vector is called the principal vector under the viewing angle.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature, wherein, in addition to distinguishing the main segment and multiple edge segments of the point cloud object according to the user's perspective, Including distinguishing irrelevant fragments, among which, the standard normal vector closest to the user's perspective distance is determined as the main vector; among multiple standard normal vectors, the point cloud fragments to be processed that are the same as the main vector are determined as the main fragment; The to-be-processed point cloud segment opposite to the main vector in the standard normal vector is determined as an irrelevant segment; the to-be-processed point cloud segment associated with the main vector in the multiple standard normal vectors is determined as an edge segment.

Further optionally, in the point cloud projection transmission method based on the user's perspective provided by the present invention, it may also have the feature that, in the subsequent point cloud processing after projection, the projection relationship identification field is used to identify that the point cloud The projection vector of the point in the object after processing and adjustment, including the index value of the point cloud segment and the corresponding projection vector value.

In addition, the present invention also provides a point cloud receiving and processing method based on the user's perspective, which is characterized in that the codec parameters determined by the user's perspective are used to obtain a video sequence from the received code stream, and the video sequence is processed to obtain a point cloud. Segment, and decode and reconstruct each point cloud segment to obtain a point cloud object.

Further optionally, in the point cloud receiving and processing method based on the user's perspective provided by the present invention, it may also have the feature, in which the process of encoding, decoding and subsequent reconstruction of each point cloud segment according to the encoding and decoding parameters, Contains any one or more of the following: the encoding and decoding parameters are set at the sending end for each point cloud segment, distinguishing the point cloud segment that needs to be observed and the point cloud segment that does not need to be observed, and performing subsequent point cloud object reconstruction; or According to the received parameter information and size change information, the point cloud segment is restored to its original size, and then the subsequent point cloud object reconstruction is performed; or each video sequence is restored according to the position of each video sequence in the code stream. The received size changes to restore these video sequences to the original size, and the point clouds recovered from multiple sets of video sequences need to be spliced to reconstruct the point cloud objects; or the standard method needs to be adjusted according to the rotation mapping of the sender Vector, the inverse process of point cloud projection is performed, so as to reconstruct a three-dimensional point cloud sequence from a two-dimensional video sequence and perform point cloud object reconstruction.

In addition, the present invention also provides a point cloud projection transmission method based on the user's perspective, which is characterized in that: a segment generation module is used to preliminarily divide the point cloud object on the standard vector to obtain the point cloud segment to be processed; projection coding processing The module determines the projection transmission scheme based on the user's perspective, and processes the point cloud segment to be processed.

In addition, the present invention also provides a point cloud receiving processing system based on the user's perspective, which is characterized in that: the inverse processing module uses the codec parameters determined from the user's perspective to obtain a video sequence from the received code stream, and to obtain a video sequence for the video sequence. The point cloud segment is obtained by processing, and each point cloud segment is decoded and reconstructed to obtain a point cloud object.

In addition, the present invention also provides a point cloud projection transmission method based on the user's perspective, which is characterized in that: the point cloud object is preliminarily divided on the standard vector to obtain the point cloud segment to be processed; the point cloud segment to be processed is further divided to obtain The point cloud segment corresponding to the perspective category includes the point cloud segment that needs to be observed and the point cloud segment that does not need to be observed; priority resources are assigned to the point cloud segment that needs to be observed for encoding and transmission, and for the perspective that does not need to be observed Point cloud fragments are compared with priority allocation of secondary resources for encoding transmission

In addition, the present invention also provides a point cloud projection transmission method based on the user's perspective, which may also have the following characteristics: including: the point cloud segment to be processed is distinguished from the main segment and multiple edge segments according to the user's perspective; The spatial connection between the fragment and the main fragment is to maintain the projection vector of the edge fragment or change it based on the user's perspective; after the projection, the subsequent point cloud processing is performed.

In addition, the present invention also provides a point cloud projection transmission method based on the user's perspective, which may also have such features: including: preliminary division of the point cloud objects on the standard vector to obtain the to-be-processed point cloud segment, which is distinguished according to the user's perspective Generate the main segment and multiple edge segments; consider the spatial connection between the edge segment and the main segment, maintain the projection vector of the edge segment or change it based on the user's perspective; use the adjusted and changed main segment as the point to be observed Cloud fragments, adjusted and changed edge fragments and irrelevant fragments as point cloud fragments that do not need to be observed; and assign priority resources to the point cloud fragments that need to be observed for encoding transmission, and point cloud fragments that do not need to be observed. Compared with priority allocation of secondary resources for coding transmission.

Function and effect of the present invention

According to the point cloud projection transmission method, the receiving processing method and the system of the present invention, the point cloud object is preliminarily divided on the standard vector to obtain the point cloud segment to be processed, the projection transmission scheme is determined based on the user's perspective, and the projection is adaptively adjusted according to the user's perspective The transmission scheme is optimized and adjusted in the direction of the user's attention to the point cloud object to improve the transmission quality.

Description of the drawings

By reading the detailed description of the non-limiting embodiments with reference to the following drawings, other features, purposes and advantages of the present invention will become more apparent, by reading the detailed description of the non-limiting embodiments with reference to the following drawings:

FIG. 1 is a process diagram of processing point cloud fragments from different perspectives without modifying the point cloud projection scheme in the first embodiment;

Figure 2-1 is a process diagram of processing point cloud fragments from the first perspective under the improved point cloud projection scheme in the first embodiment;

Fig. 2-2 is a process diagram of processing point cloud fragments from a second perspective under the improved point cloud projection scheme in the first embodiment;

Figure 2-3 is a process diagram of processing point cloud fragments from the third perspective under the improved point cloud projection scheme in the first embodiment;

FIG. 3 is an example diagram of different aspects that need to be mainly observed under different user perspectives in the first embodiment; FIG.

4 is a schematic block diagram of the system of the sending end in the first embodiment;

5 is a three-dimensional schematic diagram of a three-dimensional edge box of a point cloud object in the second embodiment;

6 is a diagram of the projection adjustment process of the point cloud segment under the projection optimization adjustment scheme in the second embodiment;

FIG. 7 is a corresponding table of the relationship between normal vectors and principal vectors of different point cloud segment types in the second embodiment;

8-1 is a schematic diagram of the relative positions of the three-dimensional edge box of the main segment and the three-dimensional edge box of the edge segment used for projection adjustment according to the main vector direction in the second embodiment;

Figure 8-2 is a schematic diagram of a two-dimensional projection obtained after the three-dimensional edge boxes of the main segment and the edge segment in Figure 11-1 are projected onto a two-dimensional plane;

FIG. 9 is a schematic flowchart of a point cloud projection transmission method at the sending end in the second embodiment;

10 is a schematic flowchart of a point cloud projection receiving method at the receiving end in the second embodiment; and

Fig. 11 is a schematic block diagram of a system at the sending end in the second embodiment.

detailed description

The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be pointed out that for those of ordinary skill in the art, several modifications and improvements can be made without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

The present invention provides a point cloud projection transmission method based on a user's perspective, which includes: preliminarily dividing point cloud objects on a standard vector to obtain a point cloud segment to be processed; and determining a projection transmission scheme based on the user's perspective, and the point cloud segment to be processed To process.

The present invention specifically describes different embodiments of the projection transmission scheme in the following.

{First Embodiment}

A point cloud projection transmission method based on a user's perspective provided according to the present invention includes:

Point cloud segment determination step: determine the corresponding point cloud segment according to the observation perspective, and obtain a series of point cloud segments classified into the corresponding perspective category; and

Perspective transmission coding step: According to the obtained series of point cloud segments that are divided into corresponding perspective categories, the perspective transmission scheme is adopted, and high-definition coding transmission is performed for the perspective that needs to be observed, and for perspectives that do not need to be observed Perform lower-definition coding transmission.

Point cloud segment determination step: segmentation is performed according to the normal vector of the point cloud, and the distance between the normal vector of each point in the point cloud and the 6 standard normal vectors is calculated, and the standard normal vector with the closest distance is taken as the point category. To divide all points into 6 major categories, that is, 6 major categories of point cloud fragments, and then extract the connection components of these 6 major categories of point cloud fragments, and re-divide the 6 major categories of point cloud fragments , To obtain the repartitioned point cloud segment.

The 6 standard normal vectors adopt the existing technology, which are: (1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), and (0, 0, -1).

Extract the connected components of the six preliminarily divided point cloud fragments. The extraction of the connected components includes: the point cloud fragments of the six major categories, according to the spatial distance between points and the adjacency relationship between points , Divide the point cloud fragments where the spatial distance between the points is greater than the preset distance range and the points are not adjacent to each other into smaller point cloud fragments, so that the point cloud fragments after the re-division are between the points The spatial distance of is smaller than the preset distance range and the points are adjacent to each other;

The re-divided point cloud fragments still belong to 6 categories corresponding to the 6 standard normal vectors respectively, and then the obtained re-divided point cloud fragments are projected in different directions according to the normal vector categories;

By finding the corresponding normal vectors for different viewing angles, and then confirming which point cloud fragments correspond to these normal vectors, a series of point cloud fragments classified into corresponding viewing angle categories are obtained.

Projection transmission schemes that perform sub-view transmission coding include any one or more of the following: a scheme that does not modify the point cloud projection method, a scheme that improves the point cloud projection method, and a standard normal vector scheme that is modified according to the user's perspective.

Regarding the solution of not modifying the point cloud projection method, FIG. 1 is a process diagram of processing point cloud fragments from perspectives without modifying the point cloud projection solution in the first embodiment. It can be seen from Fig. 1 that the 3D point cloud object is divided according to 6 standard vectors to obtain point cloud segment 1 to point cloud segment t.

By assigning a parameter list (patch index list), assigning a parameter index to each point cloud segment, and assigning a parameter list patch index list to each block of the picture that needs to be compressed and encoding, to record which point cloud segments this block is related to United.

According to the obtained series of point cloud fragments that are classified into corresponding viewing angle categories, the point cloud fragments that need to be mainly observed and the point cloud fragments that do not need to be mainly observed are divided according to the observation angle of view.

Then, according to the corresponding relationship between the point cloud segment and the 2D segment, that is, the patch index list, confirm which 2D segments are projected to be mainly observed, and which do not need to be mainly observed. In this embodiment, the selection of the standard vector projection plane as 0, 1, and 2 is the one that needs to be mainly observed. Then, in the point cloud segment 1 to the point cloud segment t, the corresponding

point cloud segments

1, 3, 6, 7, 9, etc. are divided into the point cloud segments that need to be mainly observed, and the rest are divided into the point cloud segments that do not need to be mainly observed Point cloud fragment.

For example, Figure 3 is an example diagram of different faces that need to be mainly observed under different user perspectives in the first embodiment; for a point cloud object, among the six faces of its three-dimensional bounding box, Figure 3 shows one There are three situations in which the surface corresponding to the normal vector needs to be mainly observed, the surface corresponding to the two normal vectors needs to be mainly observed, and the surface corresponding to the three normal vectors needs to be mainly observed.

For the two-dimensional block to which the two-dimensional segment that needs to be mainly observed belongs, when using video compression technology for encoding, set compression parameters with higher accuracy and higher resolution;

For the two-dimensional blocks to which the two-dimensional fragments that do not need to be mainly observed belong, when video compression technology is used for encoding, compression parameters of lower accuracy and lower resolution are set.

Regarding the solution for improving the point cloud projection method, this embodiment is described with reference to Figs. 2-1, 2-2, and Fig. 2-3. The point cloud projection transmission method based on the user’s perspective of the present invention includes any one of the following: The 1, 2, and 3 improved point cloud projection schemes include:

Observation steps of main observation fragments: According to the obtained series of point cloud fragments that are classified into corresponding perspective categories, according to the observation perspective, they are divided into the point cloud fragments that need to be mainly observed, and the point cloud fragments that do not need to be mainly observed, which is confirmed Which standard normal vector corresponds to the point cloud segment that needs to be mainly observed.

Figure 2-1 is a process diagram of processing point cloud fragments from the first perspective under the improved point cloud projection scheme in the first embodiment.

The first improved point cloud projection scheme is the same segment division process as in Figure 1. In Figure 2-1 of this embodiment, after confirming which standard normal vector corresponds to the point cloud segment that needs to be mainly observed, The point cloud fragments that need to be mainly observed and the point cloud fragments that do not need to be mainly observed are projected onto different planes as needed. For example, the point cloud fragments that need to be mainly observed are projected into picture A, and they do not need to be projected. The main observation point cloud fragment is projected into picture B, and the size of picture A and picture B are changed, and after the enlargement or reduction process is performed, in this embodiment, picture C is obtained after picture B is reduced, and picture A remains unchanged , Under the premise of misalignment between the point cloud segment that needs to be observed and the point cloud segment that does not need to be observed, the unobserved picture C is merged into the observable picture A to obtain a projected picture, which is then compressed and encoded.

In the present invention, it is within the scope of the present invention to change the size of the picture, such as zooming in picture A while picture B remains the same or zoomed out, or picture A remains unchanged while reducing picture B, which is within the scope of the invention, making it unnecessary It is sufficient if there is a significant size difference between the observation picture and the picture that needs to be observed, and the combination scheme of size change and the magnification and reduction coefficients do not limit the scope of the invention. In addition, in this case, the actual shape of the generated point cloud segment is based on the actual situation. In the figure, only the block diagram represents the point cloud segment for illustration.

The second improved point cloud projection scheme. After confirming which standard normal vectors correspond to the point cloud fragments that need to be mainly observed, it is the same as the first improved point cloud projection scheme above, and the point cloud fragments are projected to On different planes, the point cloud fragments that need to be mainly observed are projected into picture A, and the point cloud fragments that do not need to be mainly observed are projected into picture B. In this case, no stitching is performed, but the pictures that need to be observed are based on requirements. A. There is no need to observe the pictures to be resized separately, for example, after the size is enlarged or reduced, the projected pictures after the resize are respectively subjected to subsequent compression coding.

The third improved point cloud projection scheme is to project point cloud segment 1 to point cloud segment t on six different planes according to their standard normal vector types, corresponding to the six standard vectors, and get the first group of point clouds. Fragments to the sixth group of point cloud fragments. After confirming which standard normal vector corresponds to the point cloud fragments that need to be observed, the size of the planes corresponding to these point cloud fragments is changed according to requirements, as shown in Figure 2-3. According to the user’s perspective, select the point cloud segment on the projection surface of the

standard vectors

0, 1, 2 as the main observation surface, that is, the point cloud segments of the 1-3 groups remain unchanged, and the point cloud segments of the 4-6 groups are reduced. , And then compress and code separately after such processing.

Regarding the modification of the standard normal vector scheme according to the user's perspective, the following steps are included:

Observation angle normal vector calculation steps: Obtain the direction of the user's observation angle, and calculate the normal vector of the observation angle. The specific process is as follows:

When observing a point cloud object, take the observed point cloud object as the origin of the relative coordinate system, provide the user with the relative position coordinates of the observed point cloud object, calculate the position difference between the two, and obtain the relative observation angle of view. The direction normal vector indicates the direction of the user's viewing angle.

The rest of the standard normal vector calculation steps: define the normal vector as one of the standard normal vectors, that is, the normal vector and one of the standard normal vectors are rotated and mapped, and then the other standard normal vectors are subjected to the same rotation mapping processing to make the overall Under the premise that the modification amount of is small, to obtain the other corresponding five sets of standard normal vectors, the calculation process is as follows:

Suppose the general formula of the normal vector of the user observation angle is

Set it to the standard normal vector (1, 0, 0) as a rotation mapping relationship, then the other five standard normal vectors (0, 1, 0), (-1, 0, 0), (0, -1 , 0), (0, 0, 1), (0, 0, -1) corresponding to the normal vector general formula of the rotation map can be expressed as:

The present invention provides a point cloud projection transmission system based on a user's perspective, which includes a segment generation module, a point cloud segment determination module, and a perspective transmission encoding module as a projection encoding processing module.

Among them, the fragment generation module is used to preliminarily divide the point cloud object on the standard vector to obtain the point cloud fragment to be processed.

Point cloud segment determination module: Determine the corresponding point cloud segment according to the observation perspective, and obtain a series of point cloud segments that are classified into the corresponding perspective category;

The perspective transmission coding module is used to determine the projection transmission scheme according to the user perspective, and process the to-be-processed point cloud segment. According to the obtained series of point cloud fragments divided into corresponding viewing angle categories, the perspective transmission scheme is adopted to perform high-definition encoding transmission for the viewing angle that needs to be observed, and lower-definition viewing angle for the viewing angle that does not need to be observed. Encoding transmission.

Preferably, the point cloud segment determination module:

Divide fragments according to the normal vector of the point cloud, calculate the distance between the normal vector of each point in the point cloud and the 6 standard normal vectors, and take the closest standard normal vector as the point category to divide all points For 6 major categories, that is, 6 major categories of point cloud fragments, then these 6 major categories of point cloud fragments are extracted for connection components, and the 6 major categories of point cloud fragments are re-divided to obtain the re-divided point cloud fragments. Point cloud fragment;

The 6 standard normal vectors are:

(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1) ；

The connection component extraction refers to: taking 6 broad categories of point cloud fragments, according to the spatial distance between the points and the adjacency relationship between the points, the spatial distance between the points is greater than the preset distance range And the point cloud fragments that are not adjacent to each other are divided into smaller point cloud fragments, so that the spatial distance between the points of the re-divided point cloud fragment is less than the preset distance range and the points are adjacent to each other ；

By searching for the corresponding normal vectors for different perspectives, and then confirming which point cloud fragments these normal vectors correspond to, a series of point cloud fragments classified into corresponding perspective categories are obtained;

The sub-view transmission coding module:

The perspective transmission solution includes any one or more of the following: a solution that does not modify the point cloud projection method, a solution that improves the point cloud projection method, and a standard normal vector solution that is modified according to the user's perspective. The solution of not modifying the point cloud projection method, the solution of improving the point cloud projection method, and the solution of modifying the standard normal vector according to the user's perspective are the same as the above, and the repeated description is omitted here.

In addition, the present invention provides a point cloud receiving and processing system based on the user's perspective, including:

The inverse processing module uses the codec parameters determined from the user's perspective to obtain a video sequence from the received bitstream, process the video sequence to obtain a point cloud segment, and decode and reconstruct each point cloud segment to obtain a point cloud object.

The point cloud projection transmission system based on the user's perspective and the point cloud receiving and processing system based on the user's perspective provided by the present invention can pass the point cloud projection transmission system method based on the user's perspective and the point cloud receiving processing method based on the user's perspective provided by the present invention The steps of the process are realized.

Hereinafter, the present invention will be explained more specifically through preferred examples.

Preferred example 1:

The purpose of the present invention is to provide a point cloud projection transmission scheme based on the user's perspective, which is modified and perfected on the basis of the existing typical point cloud compression scheme, and provides different degrees of observation at different viewing angles according to the needs of users. The rendering effect.

The present invention is implemented based on the current classic point cloud sequence coding transmission scheme. The classic point cloud sequence coding method needs to be carried out sequentially. Segments are divided according to normal vectors, segment is subdivided according to segment connections, and projections in different directions are performed according to normal vectors. Fill the projection plane with blanks, and use a two-dimensional coding tool such as HEVC to compress and encode the video sequence obtained by the projection.

In order to achieve different rendering effects from different perspectives, there can be many implementation methods, for example:

(1) Only the data corresponding to the viewing angle that needs to be observed can be encoded and transmitted.

(2) It can be encoded and transmitted in different viewing angles and definitions. That is, the high-definition coding transmission is performed for the viewing angle that needs to be observed, and the lower-definition coding transmission is performed for the viewing angle that does not need to be observed.

(3) When bandwidth is allocated for transmission, the data corresponding to the viewing angle that needs to be observed can be preferentially allocated to ensure that the data corresponding to the viewing angle that needs to be observed has sufficient bandwidth for transmission.

(4) When bandwidth is allocated for transmission, the data corresponding to the viewing angle that needs to be observed can be allocated according to a higher ratio, and the data corresponding to the viewing angle that does not need to be observed according to a lower ratio.

Next, we will take the perspective and definition transmission as an example to illustrate how to realize the perspective transmission. In addition to the perspective and definition, there are many different implementations. In order to achieve the above-mentioned objectives, the present invention provides the following two technical solutions to be implemented from three perspectives:

The first solution is based on the classic point cloud sequence coding, after obtaining the video sequence after projection, without changing the structure of the video sequence, it is achieved by modifying the coding parameters of the video coding tool. Specifically, because the point cloud sequence is divided into different fragments according to the normal vector, we can find the normal vector corresponding to the corresponding perspective according to the perspective, and further find the corresponding fragments. For these fragments, locate in the two-dimensional picture The corresponding projected segments can then be assigned different compression parameters to these segments in the video encoding tool. Different compression parameters can achieve different compression coding effects. For example, for the compression parameter quantization step sequence number, QP value for short, a low QP value can achieve higher compression quality, and a high QP value can achieve lower compression quality. .

Perform high-resolution and high-precision compression for the two-dimensional image fragments corresponding to the viewing angles we need, and perform low-resolution and low-precision compression for the two-dimensional image fragments corresponding to the unwanted viewing angles.

In the second solution, the projection mode needs to be modified, so the structure of the video sequence will be modified accordingly, but the two-dimensional video sequence is still obtained and compressed using a two-dimensional coding tool. For example, after segment division and projection are performed on the point cloud, down-sampling processing can be performed on the two-dimensional segment obtained from the point cloud segment of the unwanted perspective. After such processing, we can obtain a video sequence with composite resolution, and then use a two-dimensional coding tool to compress with the same resolution and accuracy. In addition, we also provide some other ideas.

In the third scheme, the standard normal vector can be modified according to the user's perspective. The current classic compression algorithm of dynamic point cloud is to divide the divided point cloud fragments into categories according to the relationship between the six standard normal vectors, and then select one of them as the main projection direction and project it onto the plane. Based on this projection method, the standard normal vector can be modified according to the user's perspective, so that the user's perspective corresponds to the front of the projection.

It should be noted that the three schemes are not opposed to each other and can be used in combination. For example, the standard normal vector can be modified according to the user's perspective, and then the projection mode or coding parameters can be modified.

According to the foregoing, in order to achieve "different viewing angles presenting different rendering effects", the present invention adopts three solutions. Among them, the three schemes implement split-view coding in terms of different strategies, and they can all be combined with each other, or the three can be used in combination.

The following table 1 summarizes the three schemes in terms of different strategies to achieve sub-view coding.

Table 1

Preferred example 2:

The three coding schemes proposed by the present invention for realizing the transmission of point cloud by viewing angle and resolution, the specific implementation schemes are as follows:

First of all, no matter which coding scheme is adopted, the corresponding point cloud segment needs to be determined according to the observation angle of view. It can be implemented relatively simply on the basis of the current point cloud sequence coding method. After the step of determining the point cloud segment corresponding to the perspective, a series of point cloud segments classified into the corresponding perspective category can be obtained. This is the premise of the three technical solutions in the point cloud sub-perspective transmission strategy proposed in this patent. And foundation. The specific implementation algorithm is as follows.

The current classic coding method of point cloud sequence is to divide the segment according to the normal vector of the point cloud. By calculating the distance between the normal vector of each point in the point cloud and the 6 standard normal vectors:

(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)

Take the nearest standard normal vector to divide all points into 6 categories, namely 6 large point cloud segments. Then, these 6 large point cloud segments are extracted by connecting components and divided into smaller segments. However, these smaller fragments still belong to the 6 categories for the 6 standard normal vectors, and then the projections are performed in different directions according to the normal vector categories. Connection component extraction processing, that is, points that have been divided into six categories, according to the spatial distance and the adjacency relationship between points, are divided into smaller point cloud segments to prevent points from appearing in the same segment. The distance is too large and not adjacent.

For example, for each divided segment, a parameter index can be assigned to indicate which normal vector plane they belong to. The corresponding relationship is shown in Table 2 below. Table 1 is the patch index list in the first embodiment. ):

参数IndexParameter Index	标准投影向量PlaneStandard Projection Vector Plane
00	(1，0，0)(1, 0, 0)
11	(-1，0，0)(-1, 0, 0)
22	(0，1，0)(0, 1, 0)
33	(0，-1，0)(0, -1, 0)
44	(0，0，1)(0, 0, 1)
55	(0，0，-1)(0, 0, -1)

Table 2

Since the normal vector projection of the point cloud is performed, we can find the corresponding normal vectors from different perspectives, and then confirm which point cloud fragments these normal vectors correspond to. The method of confirming the corresponding normal vector from different perspectives can include the following:

For example, the standard normal vector (0, 1, 0) corresponds to the perspective from the front. When the user observes the point cloud object from the front, it can be confirmed that the normal vector belongs to (0, 1). , 0) Those point cloud fragments of this category. If it is not observed from a standard forward perspective, there are more types of point cloud fragments that are mainly observed.

1. Plan without modifying the point cloud projection method:

For the current video compression technology, such as HEVC video compression technology, the two-dimensional pictures that need to be compressed are divided into blocks, that is, the two-dimensional segments are divided evenly, and then the QP offset values and other codes are set for these blocks respectively. The parameters are encoded.

Therefore, on the basis of obtaining the user's perspective, it is possible to confirm which point cloud segments need to be mainly observed. First, a parameter index can be assigned to each point cloud segment, and then a parameter list patch index list can be assigned to each block of the picture that needs to be compressed and encoded to record which point cloud segments this block is associated with. In the aforementioned division process, these point cloud segments have been divided into point cloud segments that need to be primarily observed and point cloud segments that do not need to be primarily observed according to the observation angle of view. Therefore, according to the relationship between the point cloud segment and the two-dimensional segment, that is, the patch index list, it is possible to confirm which of the projected two-dimensional segments need to be mainly observed and which do not need to be mainly observed.

(1) For the two-dimensional block to which the two-dimensional segment that needs to be mainly observed belongs, when we use video compression technology for encoding, we set compression parameters with higher accuracy and higher resolution. For example, when using HEVC encoding, set a lower QP offset value.

(2) For the two-dimensional blocks to which the two-dimensional segments that do not need to be mainly observed belong, when we use video compression technology for encoding, we set compression parameters with lower accuracy and lower resolution for them. For example, when using HEVC encoding, set a higher QP offset value.

Regarding the sending end and the receiving end, for method one: the method of not modifying the projection scheme, the coding and decoding parameters of each segment need to be recorded at the sending end, and this part of the parameters will be encoded and transmitted as new additional information; at the receiving end, it is required According to the received encoding and decoding parameters of each segment, each segment is encoded and decoded.

2. Scheme to improve the point cloud projection method:

In addition to setting different coding parameters for different two-dimensional clips, the point cloud projection method can also be modified to achieve the coding effect of the point cloud by viewing angle and resolution.

In the current classic point cloud sequence coding scheme, the method for projecting each point cloud segment to a two-dimensional picture is as follows: Projection is performed according to the standard normal vector category of each point cloud segment, for example, the standard method of a point cloud segment The vector category is (1, 0, 0), then when projecting, the (y, z) coordinates in the three-dimensional coordinates (x, y, z) of this point cloud segment are used as plane coordinates, and the (x) coordinates are used Make the projection depth. As for where the projection position is on the picture, this is mainly based on the way to save the overall occupied space, starting from the upper left corner, filling into the two-dimensional picture in turn, so that the final total occupied plane size is the most economical.

However, in this point cloud projection scheme, the need for perspective is not considered. No matter which kind of standard normal vector point cloud fragments are projected onto the two-dimensional picture with the same resolution, it means that no matter which The point cloud segments observed from one angle of view are all projected and encoded with the same resolution.

To solve this problem, the present invention proposes to optimize the way of projection of the point cloud from different perspectives. And based on this idea, the following three optimization ideas are proposed. The three optimization ideas can be performed in parallel, that is, you can choose one:

(1) After confirming which point cloud fragments corresponding to standard normal vectors need to be mainly observed, the point cloud fragments are projected onto different planes as needed, processed, and then spliced onto a picture.

For example, the point cloud fragments that need to be mainly observed are projected onto picture A according to the classic method, and the size is 1280×1280, and the point cloud fragments that do not need to be mainly observed are projected onto picture B according to the classic method, and the size is 1280×1280 . Then, the picture B is reduced to obtain the picture C, and the size is 640×640. Since the projection according to the classic method ensures the minimum total occupied space, the projected position on the picture A is likely to be no more than 1/4, so the picture C can be pasted in the area of the picture A that is not occupied by the projection.

Comparing before and after optimization, the picture occupies a size of 1280×1280 before the optimization, and the picture occupies a size of 1280×1280 after the optimization, but the plane space occupied by the point cloud fragments not mainly observed is smaller. Take the same video The coding technology can achieve the purpose of dividing the definition coding presentation according to the viewing angle requirements.

(2) After confirming which standard normal vectors correspond to point cloud segments that need to be mainly observed, the point cloud segments are projected onto different planes as required, and these planes are processed according to requirements before compression and encoding.

For example, point cloud fragments that need to be mainly observed are projected onto picture A according to the classic method, point cloud fragments that do not need to be mainly observed are projected onto picture B according to the classic method, and then picture B is reduced in size. After that, both the picture A and the reduced picture B are compressed and encoded for presentation.

Before and after optimization, the video sequence presented before optimization is a set of video sequences, and the video sequence presented after optimization is two groups of video sequences. However, the point cloud fragments that are not mainly observed are reduced due to the reduction process, which reduces the difficulty and cost of compression encoding, and adopts the same video The coding technology can achieve the purpose of dividing the definition coding presentation according to the viewing angle requirements.

(3) After confirming which point cloud fragments corresponding to standard normal vectors need to be mainly observed, project the point cloud fragments onto different planes according to their standard normal vector categories, and process these planes according to requirements Re-compress and encode.

For example, each point cloud segment is projected onto planes A, B, C, D, E, and F according to their standard normal vector categories, respectively, according to the classic method. Since it is possible to determine which standard normal vector point cloud fragments need to be mainly observed according to the user's perspective, it is possible to determine which of the six planes do not need to be mainly observed, and can be reduced in size. After performing this processing on each frame of the point cloud sequence, six groups of video sequences can be obtained, and the same video coding technology is used for coding processing.

Before and after optimization, the video sequence presented before optimization is a group of video sequences, and the video sequence presented after optimization is multiple groups of video sequences. However, the point cloud fragments that are not mainly observed are reduced due to the reduction process, which reduces the difficulty and cost of compression encoding. Video coding technology can achieve the purpose of dividing definition coding and presentation according to viewing angle requirements.

Regarding the sending end and the receiving end, the second method: the method of modifying the point cloud projection method is realized by the following. Solution 1: A solution in which segments corresponding to the user's perspective and segments corresponding to the non-user's perspective are spliced on a video sequence. It is necessary to record the index of all segments corresponding to the non-user perspective and the size reduction ratio at the sending end. This information will be encoded and transmitted as new additional information; at the receiving end, it needs to be based on the received index information and the reduction ratio. Restore these fragments to the original size, and then perform the subsequent 3D point cloud reconstruction steps. Solution 2: A solution of projecting segments corresponding to the user's perspective and segments corresponding to the non-user's perspective to two video sequences respectively. Two sets of video sequences need to be encoded at the sending end, and the reduction ratio of the video sequence corresponding to the non-user perspective needs to be recorded at the sending end. This information will be encoded and transmitted as new additional information. In addition, the encoded code of each video sequence needs to be recorded. The position in the stream; at the receiving end, each video sequence needs to be restored according to the position of each video sequence in the code stream, and then these video sequences need to be restored to the original size according to the received reduction ratio, and then the subsequent steps The step of 3D point cloud reconstruction, and the point clouds recovered from the two sets of video sequences need to be stitched together. Solution 3: Project the fragments corresponding to each standard normal vector to each video sequence. Six groups of video sequences need to be encoded at the sending end, and the reduction ratio of the video sequence corresponding to the non-user perspective needs to be recorded at the sending end. This information will be encoded and transmitted as new additional information. In addition, the encoded code of each video sequence needs to be recorded. The position in the stream; at the receiving end, each video sequence needs to be restored according to the position of each video sequence in the code stream, and then these video sequences need to be restored to the original size according to the received reduction ratio, and then the subsequent steps Three-dimensional point cloud reconstruction steps, and need to stitch together the point clouds recovered from the six groups of video sequences.

3. Modify the standard normal vector according to the user's perspective:

In the current classic coding method for point cloud sequences, the standard normal vectors for dividing point cloud segments are the following six normal vectors:

(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)

However, when the user modifies the viewing angle, the main observing projection direction may be too much. For example, when the user's perspective is obliquely above the observed object, the main projected surface to be observed needs to include the front, side, and top of the observed object, that is, there are at least three sides. In the ideal sub-view transmission, it is better to have only one projection surface that is mainly observed. Therefore, the above-mentioned situation cannot achieve a good sub-view transmission effect. In this regard, the present invention proposes to adaptively adjust the direction of the standard normal vector according to the user's observation angle of view.

First, the direction of the user's observation angle can be obtained, and the normal vector of the observation angle can be calculated. For example, when observing a point cloud object, take the observed point cloud object as the origin of the relative coordinate system, provide the user with the relative position coordinates of the observed point cloud object, and calculate the position difference between the two to obtain the relative position. The normal vector of the direction of the viewing angle, which represents the direction of the user's viewing angle.

Then the normal vector is defined as one of the standard normal vectors, that is, the normal vector and one of the standard normal vectors are rotated and mapped, and then the other standard normal vectors are subjected to the same rotation mapping processing, so that the overall modification amount is small Under the premise, to obtain the other corresponding five sets of standard normal vectors. The calculation formula i is as follows:

If the general formula of the normal vector of the user's observation angle is

It should be noted that the normal vector has been normalized, that is, the sum of the squares of the three coordinates is 1. Set it to the standard normal vector (1, 0, 0) as a rotation mapping relationship, then the other five standard normal vectors (0, 1, 0), (-1, 0, 0), (0, -1 , 0), (0, 0, 1), (0, 0, -1) corresponding to the normal vector general formula of the rotation map can be expressed as:

For example, if the normal vector of the observation angle is

Then the corresponding set of orthogonal standard normal vectors can be obtained as:

(0, 0, 1), (0, 0, -1)

Among them, under the premise that the overall modification amount is small, the other corresponding five sets of standard normal vectors are obtained. The calculation process of the overall modification amount is as follows:

The method to ensure that the overall amount of modification is small is to make the new six standard normal vectors, including the user’s perspective normal vector, correspond to the corresponding six standard methods after the user’s perspective normal vector corresponds to the standard normal vector and is rotated. The sum of the dot products of the vectors is as large as possible. In this case, suppose the general formula of the normal vector of the user's viewing angle

Set it to the standard normal vector (1, 0, 0) as a rotation mapping relationship, then the other five standard normal vectors (0, 1, 0), (-1, 0, 0), (0, -1 , 0), (0, 0, 1), (0, 0, -1) corresponding to the normal vector general formula of the rotation map can be obtained by calculation:

Later, when the point cloud segment is divided, the subsequent steps such as point cloud segmentation, segment projection, and video encoding will be performed according to these new standard normal vectors. Among them, segment projection and video coding can adopt the other two modification schemes mentioned in the present invention.

Regarding the sending end and the receiving end, for method three: the method of modifying the standard normal vector. The new standard normal vector needs to be recorded at the sending end, and this part of the information will be encoded and transmitted as additional additional information; at the receiving end, the inverse process of point cloud projection needs to be performed according to the new standard normal vector, so as to start from the two-dimensional video The sequence reconstructs a three-dimensional point cloud sequence.

Functions and effects of the first embodiment of the present invention

According to the transmission strategy of the present invention adaptively adjusted according to the user's perspective, there is no need to make major modifications to the current existing point cloud transmission strategy, and the user's attention to various aspects is also different, distinguishing between the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed. The observed point cloud fragments can be better combined with the current high compression rate encoding algorithm, without losing the experience quality of the user's main perspective, and without affecting the compression rate and the objective quality of the point cloud, saving transmission consumption The resources to achieve the purpose of sub-view transmission.

{Second Embodiment}

In the prior art solution, the processing steps of the point cloud sequence encoding method include: dividing the point cloud object into multiple point cloud segments according to the normal vector, subdividing the segments according to the connection relationship of the point cloud segments, and dividing the subdivided point cloud The segments are projected according to the normal vector, the projection plane is blank filled, and the video sequence obtained by the projection is compressed and encoded using a two-dimensional coding tool such as HEVC.

In the embodiment of the present invention, it is implemented by improving on the existing point cloud sequence coding transmission scheme, and the user’s viewing angle is introduced in the process of compressing the three-dimensional point cloud into a two-dimensional image to optimize and adjust the point cloud projection processing process. Thereby improving compression performance.

The projection vector of the point cloud segment is selected by comprehensively considering the two factors of the user’s viewing angle and the normal vector of the segment. This embodiment provides a point cloud projection processing method based on the user’s perspective, including: distinguishing points based on the user’s perspective The main segment and multiple edge segments of the cloud object; considering the spatial connection between the edge segment and the main segment, the projection vector of the edge segment is maintained or changed based on the user's perspective; subsequent point cloud processing is performed after projection.

When observing a point cloud object from a specific direction, such as a user's perspective, as proposed in the present invention, as shown in FIG. 5, the point cloud object is projected by using a three-dimensional bounding box of the point cloud object to include the point cloud object. Generally, a three-dimensional bounding box of a point cloud object is a rectangular cube set for a specific point cloud object, and the specific size and shape are not limited by this embodiment.

Not shown in the figure, with the point cloud object as the origin (0, 0, 0), first define the standard normal vectors (x, y, z) of the 6 projection surfaces corresponding to the point cloud object: (1, 0, 0) ), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1).

According to the position where the user views the point cloud object, a vector pointing from the point cloud to the user's observation point is used to represent the user's view of the user's observation, which is called the view vector. Then calculate the distance between the viewing angle vector and the six standard normal vectors, and the standard normal vector with the closest distance to the viewing angle vector is called the principal vector under the viewing angle.

Usually 6 different Index index values are used to indicate 6 different normal vectors. Table 3 is a patch index list of index parameter values and projection vector values in the second embodiment, and their parameter assignments and corresponding The standard projection vector surface is the same as Table 2.

In this embodiment, 0 is used to refer to the direction of the first standard normal vector, which is perpendicular to the (1, 0, 0) plane, and 1 is used to refer to the direction of the sixth standard normal vector, which is perpendicular to the (-1, 0, 0) plane, use 2 to refer to the direction of the second standard normal vector, perpendicular to the (0, 1, 0) plane, use 3 to refer to the direction of the third standard normal vector, perpendicular to the (0, -1, 0) ) Plane, using 4 to refer to the direction of the fourth standard normal vector, perpendicular to the (0, 0, 1) plane, and using 5 to refer to the direction of the fifth standard normal vector, perpendicular to the (0, 0, -1) surface.

In this embodiment, the main vector is the first standard normal vector (1, 0, 0).

It can be seen that the six standard normal vectors have a one-to-one correspondence with the six faces of the three-dimensional edge box, and the standard normal vectors are perpendicular to their corresponding faces.

索引参数值(Index)Index parameter value (Index)	投影向量面NormalProjection vector surface Normal
00	(1，0，0)(1, 0, 0)
11	(-1，0，0)(-1, 0, 0)
22	(0，1，0)(0, 1, 0)
33	(1，-1，0)(1, -1, 0)
44	(0，0，1)(0, 0, 1)
55	(0，0，-1)(0, 0, -1)

table 3

Combine the neighbor point information of the point cloud object to calculate the normal vector of each point, and then calculate the distance between the normal vector of each point and the above-mentioned first standard normal vector to the sixth standard normal vector. The closest standard normal vector is the The standard normal vector corresponding to the point. Through this step, the points in all the point cloud objects are divided into 6 first-level point cloud segments. Through the first-level division, the points with the same standard normal vector belong to the same first-level point cloud segment. The 6 first-level point cloud segments are subdivided, and the points that belong to the same first-level point cloud segment and are relatively close are divided into a second-level point cloud segment. The method of calculating the distance between points can be selected according to the specific situation. Euclidean distance is usually used. According to this method, the 6 first-level point cloud segments are subdivided, and the normal vectors of the second-level point cloud segments obtained after subdivision all correspond to one of the above-mentioned 6 standard normal vectors.

In this embodiment, the secondary point cloud segment is used as the cloud segment to be processed to continue processing, and the cloud segments to be processed on the six standard normal vector planes are further distinguished according to the direction of the view angle vector. Calculate the normal vector of the secondary point cloud segment. Then the main direction of the projection is determined according to the user's viewing angle. According to the relationship between the normal vector of the secondary point cloud segment and the main direction, the secondary point cloud segment is divided into the main segment, the edge segment, and the irrelevant segment.

As shown in Figure 6, in the second embodiment, point cloud segment 1 to point cloud segment t are projected onto six different planes according to their standard normal vector types, corresponding to the six standard vectors, and the first 1 set of point cloud fragments to the 6th group of point cloud fragments, and then determine the main direction of projection according to the user’s viewing angle. According to the relationship between the normal vector of the secondary point cloud fragment and the main direction, the secondary point cloud fragments are these 6 groups Point cloud fragments are divided into main fragments, edge fragments, and irrelevant fragments. In this embodiment, the first group is divided into main segments, the second group is divided into irrelevant segments, and the 3-6 groups are divided into edge segments.

After further determining the spatial association between the edge segments and the main segment, projection is performed according to the optimized and adjusted projection direction. As shown in Figure 6, the main segment and the irrelevant segment keep the projection direction unchanged, adjust the edge segment according to the large or small spatial correlation, and obtain the first group of point cloud segments to the sixth group of point clouds after the projection direction is optimized and adjusted. Fragments, it is obvious that the first group of point cloud fragments towards the user's perspective are more concentrated.

Fig. 7 is a corresponding table of the relationship between normal vectors and principal vectors of different point cloud segment types in the embodiment of the present invention; Fig. 7 shows the distinguishing criteria for different segment types.

As shown in Figure 7, the secondary point cloud fragments with the same normal vector and the main vector are defined as the main fragments. These main fragments are also called the first standard fragments. These main fragments form the main fragment set, the first standard fragment The set (ie, the main fragment set) is gathered on the main fragment projection gathering surface of the point cloud 3D edge box.

Define the secondary point cloud fragments whose normal vector is opposite to the main vector as irrelevant fragments. These irrelevant fragments are also called second standard fragments. These irrelevant fragments form a set of irrelevant fragments, and the second standard fragment set (ie, irrelevant point cloud fragments) Set) is gathered on the projection gathering surface of the irrelevant fragments of the 3D edge box of the point cloud.

The secondary point cloud fragments whose normal vector is perpendicular to the main vector are defined as edge fragments. These edge fragments are distributed on the projection gathering surface of the four edge fragments perpendicular to the main vector. Figure 5 shows only four edge fragments. One of the projection surfaces is projected, and the other three surfaces are not marked.

Among them, these secondary point cloud segments located on the projection gathering surface of the first edge segment, that is, these edge segments form a set of the first edge segment, and the first edge segment is derived from the third standard segment. By analogy, the second-level point cloud segments located on the gathering surface of the other three edge segments are called the second edge segment set, the third edge segment set, and the fourth edge segment set. The above forms a collection, from the 4th to 6th standard fragments.

Next, the projection vector of the main segment is projected with its standard normal vector, that is, the main vector. The projection vector of the irrelevant segment is projected with its standard normal vector, that is, the opposite of the principal vector.

The key difference between this embodiment and the prior art is that considering the spatial connection between the edge segment and the main segment, it is necessary to consider whether the projection vector of the above edge segment needs to be further adjusted and optimized.

In short, the size of the spatial connection is judged by sequentially calculating the intersection ratio (IOU) of each edge segment with each main segment. If there is an IOU greater than the threshold, it means that the edge point cloud segment has a greater spatial connection with the main point cloud segment, and the main vector is selected as the projection vector of the edge point cloud segment; if there is no IOU greater than the threshold, it means the edge point The cloud segment has a small spatial connection with the main point cloud segment, and the normal vector of the edge point cloud segment is selected as its projection vector.

Figure 8-1 is a schematic diagram of the relative positions of the three-dimensional edge box of the main segment and the three-dimensional edge box of the edge segment used for projection adjustment according to the main vector direction in the second embodiment; Figure 8-2 is the main segment and the three-dimensional edge box in Figure 8-1 A schematic diagram of the two-dimensional projection obtained after the three-dimensional edge box of the edge segment is projected onto the two-dimensional plane; the IOU judgment process of one of the edge segments and a certain main segment of each main segment is described in detail.

Fig. 8-1 is a schematic diagram of the relative positions of the three-dimensional edge box of the main segment and the three-dimensional edge box of the edge segment for projection adjustment according to the main vector direction in the embodiment of the present invention. The three-dimensional edge box (called edge box 1) shown by the solid line corresponds to the main segment, and the three-dimensional edge box (called edge box 2) shown by the dashed line corresponds to the edge segment.

Figure 8-1 and Figure 8-2 are only a brief enumeration of the three-dimensional edge boxes of a certain main segment and a certain edge segment, showing the projection orientation between the three-dimensional boxes and the overlap of the two-dimensional projection map. Regarding the case of one-by-one overlapping calculation spatial associations between a certain edge segment and other main segments, or between other edge segments and other main segments, it will not be listed one by one, but only an example.

Fig. 8-2 is a rectangular two-dimensional projection diagram obtained after the three-dimensional edge boxes of the main segment and the edge segment are projected onto a two-dimensional plane.

The three-dimensional edge box of the main segment is projected along the main vector direction to obtain the main two-dimensional projection image, and the three-dimensional edge box of the edge segment is projected along the main vector direction to obtain the edge two-dimensional projection image. In Figure 8-2, the pattern filling part is projected along the principal vector direction of the edge box 1 of the main segment (called rectangular area 1), and the dot filling part is projected along the principal vector direction of the edge box 2 of the edge segment (called rectangular). Area 2), the grid point filling part is the common part where the rectangular area 1 and the rectangular area 2 overlap (referred to as the rectangular area 3).

For each edge segment, calculate its intersection over Union (IOU) with each main segment in turn. The specific calculation method will be described with reference to FIG. 1 and FIG. 2. Calculated as follows:

S ₁ =(y _i1 -y _i0 )×(z _i1 -z _i0 ) (2)

S ₂ = (y _j1 -y _j0 )×(z _j1 -z _j0 ) (3)

S ₃ ＝(y _i1 -y _j0 )×(z _i1 -z _j0 ) (4)

S ₃ corresponds to the area of the rectangular area 3 in Figure 4, that is, the overlapping common part of the main two-dimensional projection image and the edge two-dimensional projection image

S ₁ corresponds to the main two-dimensional projection map in Figure 4 excluding the area of the common part,

S ₂ corresponds to the two-dimensional projection of the edge in Fig. 4 excluding the area of the common part.

As shown in the solid three-dimensional edge box in Figure 8-1, A _i and B _i are the three-dimensional edge boxes of the main segment, that is, the vertices closest to the origin O on the edge frame 1; as shown in Figure 8-1 As shown by the dashed three-dimensional edge box (edge box 2), A _j and B _j are respectively the three-dimensional edge box of the edge segment, that is, the vertices of the edge box 2 closest to the origin O and the farthest.

As shown in the figures _{8-2, A 'i, A'} j, B 'i, B' j are _{_{_{A i, A j, B i}}} , B _j of the resultant vector along the main direction of the projection point. Their coordinates in the XYZ coordinate system with O as the origin are: A _i (x _i0 ,y _i0 ,z _i0 ),B _i (x _i1 ,y _i1 ,z _i1 ),A _j (x _j0 ,y _j0 ,z _j0 ),B _j (x _j1 ,y _j1 ,z _j1 ),A′ _i (y _i0 ,z _i0 ),B′ _i (y _i1 ,z _i1 ),A′ _j (y _j0 ,z _j0 ) ,B′ _j (y _j1 ,z _j1 ).

(4) Based on the two-dimensional projection map, for each edge segment, multiple IOUs are calculated with multiple main segments, and the number of IOUs is equal to the number of main segments.

For an edge segment, compare its respective IOUs with an intersection ratio threshold. If there is an IOU greater than the intersection ratio threshold, it means that the edge segment has a greater spatial connection with the main segment, in order to optimize the point of the main vector direction Cloud visual quality, change the projection direction of the edge segment, and select the main vector as the projection vector of the edge segment; if there is no IOU greater than the intersection ratio threshold, it means that the edge segment has a small spatial connection with the main segment, then keep this The normal vector of the edge segment is used as its projection vector.

It can be seen that the final projection vector of the edge segment is determined by the normal vector of the edge segment and the spatial position relationship between the edge segment and the main segment.

Among them, the selection of the intersection ratio threshold depends on the surface characteristics of the point cloud object to be processed. For example, if the surface characteristics of the object point cloud object and the character point cloud object are different, then the intersection ratio threshold value selected when comparing It's different. For common character point cloud objects, the threshold value is generally selected to be 0.2-0.3, and further optionally, the threshold value is determined mainly through adjustment and selection of experimental results.

Through the above process, the adjustment, optimization or maintenance of the projection vector is completed for the first edge segment, and the same processing of the above process is used to discriminate and adjust each edge segment one by one, until the second edge segment is set and the third edge segment is set. Set, all edge fragments in the set of 4th edge fragments have been processed.

(5) For the main segment, its projection vector is the main vector; for the irrelevant segment, its projection vector is the normal vector of the irrelevant segment.

After the above process, all the secondary point cloud segments, that is, the projection vectors of the cloud segments to be processed, have been optimized and adjusted. In the subsequent point cloud processing after projection, the projection relationship identification field is used to identify: the projection vector of the point in the point cloud object after processing and adjustment, including the optimized and adjusted projection vector coordinate value and the corresponding index parameter value. .

Table 4 is an example table of the corresponding relationship before and after adjustment and optimization of the projection vectors of multiple edge segments.

For example, in Table 4, for the first edge segment, before the projection adjustment optimization, its original standard normal vector: index parameter value 2, (0, 1, 0), after the above optimization adjustment, the projection direction is changed Main vector, namely index parameter value 0, (1, 0, 0). Whether other edge segments change or not depends on the size of the spatial correlation between each edge segment and the main segment. In short, for the 1st to kth edge segments, either adjust and optimize to 0 or keep.

The main segment is: the index parameter value is 0, the standard normal vector coordinate value is (1, 0, 0) without optimization adjustment processing, and the main vector is maintained for projection. Irrelevant edge fragments are: index parameter value 1, standard normal vector coordinate value (-1, 0, 0) without optimization and adjustment processing, keeping the original standard normal vector direction for projection.

类型Types of	原索引参数值Original index parameter value	原投影向量Original projection vector	与主片段空间关联Associate with the main segment space	优化调整后索引参数值Optimized index parameter value after adjustment	优化调整后投影向量Optimize the adjusted projection vector
主片段Main fragment	00	(1，0，0)(1, 0, 0)	————	00	(1，0，0)(1, 0, 0)
无关片段Irrelevant fragments	11	(-1，0，0)(-1, 0, 0)	保持maintain	11	(-1，0，0)(-1, 0, 0)
第1边缘片段1st edge segment	22	(0，1，0)(0, 1, 0)	大Big	00	(1，0，0)(1, 0, 0)

第2边缘片段2nd edge segment	22	(0，1，0)(0, 1, 0)	不大Not big	22	(0，1，0)(0, 1, 0)
第3边缘片段3rd edge segment	44	(0，0，1)(0, 0, 1)	大Big	00	(1，0，0)(1, 0, 0)
第4边缘片段4th edge segment	55	(0，0，-1)(0, 0, -1)	不大Not big	55	(0，0，-1)(0, 0, -1)
第5边缘片段5th edge segment	33	(0，-1，0)(0, -1, 0)	大Big	00	(1，0，0)(1, 0, 0)
第k边缘片段K-th edge segment	……	……	……	……	……

Table 4

Continue to complete subsequent projection processing, including:

For the main segment, the edge segment that has undergone projection processing, and the irrelevant segment, project them according to the adjusted projection vector to generate a compressible two-dimensional image, and further process to obtain a two-dimensional video sequence, and use a video codec for the two-dimensional video sequence. The video sequence is compressed.

FIG. 9 is a schematic flowchart of a point cloud projection transmission method based on a user's perspective in an embodiment of the present invention; the steps of a point cloud projection processing method based on a user's perspective include:

Input point cloud object;

Generate point cloud fragments, and calculate the normal vector of each point in turn;

For the preliminary division of each point, the point cloud object is divided to generate first-level point cloud fragments. At this time, the first-level point cloud fragments are preliminarily divided according to the difference of the six standard vectors;

Generate a second-level point cloud segment, where the first-level point cloud segment is further subdivided. Since the vectors between the first-level point cloud segments of the same standard vector are the same, the first-level point is further based on the distance between the points The points in the cloud segment are further clustered and divided. It is worth noting that in this way, the process of generating a secondary point cloud segment from the primary point cloud segment can be completed by using the usual technical means in the existing point cloud projection step, and the secondary point cloud segment is used as a cloud segment to be processed and the following steps are performed. In the present invention, obtaining cloud fragments to be processed is not limited to the above-mentioned steps such as the first-level point cloud fragment and the second-level point cloud fragment. The preprocessing steps of any point cloud fragment in the prior art are all within the scope of the present invention. .

Calculate the main vector according to the user's viewing angle, and use the standard vector that is closest to the angle of view vector as the main vector;

The cloud fragments to be processed are distinguished. According to the relationship between the normal vector of the secondary point cloud fragment and the main vector, the secondary point cloud fragment is divided into the main fragment, the edge fragment, and the irrelevant fragment. Among them, the normal vector and the main vector are the same. Fragment, normal vector and main vector opposite are irrelevant fragments, normal vector and main vector are perpendicular to edge fragments;

The projection direction of the secondary point cloud segment is calculated, and the projection direction of the edge segment in the secondary point cloud segment is optimized and adjusted. The specific optimization and adjustment process is as described above;

Project the secondary point cloud segment, and project the optimized and adjusted secondary point cloud segment, that is, the main segment, edge segment, and irrelevant segment according to the adjusted projection direction;

After projection, a compressible two-dimensional image is generated;

Generate a two-dimensional video sequence;

Compress two-dimensional video sequences; and

Finally, the compressed two-dimensional video is output in code stream.

FIG. 10 is a schematic flowchart of a point cloud receiving and processing method based on a user's perspective in an embodiment of the present invention;

The point cloud receiving and processing method based on the user's perspective in this embodiment is the inverse process of the above-mentioned point cloud projection processing method based on the user's perspective, and the decompression on the receiving end is the inverse process of compression on the transmitting end.

As shown in Figure 10, first input the compressed code stream;

Generate a two-dimensional video sequence after decompression;

Then obtain a two-dimensional image (usually including three categories: geometric map, attribute map, occupancy map);

Use two-dimensional image information to generate secondary point cloud fragments;

Then use the second-level point cloud segment to generate the first-level point cloud segment;

The first-level point cloud fragment is combined with auxiliary information to restore and generate a point cloud object, and the above-mentioned projection relationship identification field is used for back projection processing to restore the two-dimensional point cloud fragment to a three-dimensional point cloud object;

Finally, after processing, a complete point cloud object is obtained for output.

In addition, in this embodiment, located on the side of the transmitting end, a point cloud projection processing system based on the user's perspective is also provided. FIG. 11 is a schematic block diagram of the point cloud projection processing system based on the user's perspective in an embodiment of the present invention.

As shown in FIG. 11, the point cloud projection processing system based on the user's perspective includes a point cloud input module, a segment generation module, a segment division module, a projection vector adjustment module, a projection module and a compression module, and a code stream output module.

Point cloud input module, used to input point cloud objects;

The fragment generation module is used to generate cloud fragments to be processed from the point cloud object according to predetermined rules, and perform subsequent projection optimization adjustments;

Fragment division module, used to divide the cloud fragment to be processed into main fragments, edge fragments and irrelevant fragments;

The projection vector adjustment module is used to take into account the spatial connection between the edge segment and the main segment, and to maintain the projection vector of the edge segment according to the above projection vector adjustment rule or change it based on the user's perspective;

Projection module, used for projecting the main segment, edge segment and irrelevant segment after projection adjustment;

The compression module and the code stream output module are used to compress the two-dimensional image after projection; and the subsequent code stream output.

On the sender side, the projection module uses the optimized and adjusted projection vector for projection, and is identified by the projection relationship identification field. The projection relationship identification field contains the index parameter value of the optimized and adjusted projection vector and the corresponding projection vector value .

In addition, in this embodiment, it is located on the side of the receiving end, and a point cloud receiving and processing system based on the user's perspective is also provided. The point cloud receiving and processing system based on the user's perspective includes: a code stream processing module and a back projection module, a code stream processing module, used to receive the input of the code stream; a back projection module, used to back project two-dimensional point cloud fragments, according to The indication of the projection relationship identification field restores the two-dimensional point cloud segment to a point cloud object. The projection relationship identification field contains the index parameter value of the optimized and adjusted projection vector and the corresponding projection vector value.

In the present invention, the functions of each module in the point cloud projection processing system based on the user's perspective and the point cloud receiving and processing system based on the user's perspective are the same as the above-mentioned point cloud projection processing method based on the user's perspective and the point cloud receiving based on the user's perspective. The optimization and adjustment process in the processing method corresponds to the corresponding, which can be deduced by analogy. Then the structure and technical elements of the device can be formed by the corresponding conversion of the generation method and the receiving method. The repeated description is omitted here, and the details are not repeated here.

Functions and effects of the second embodiment of the present invention

In the existing projection processing method, since the user's perspective is not considered, of course, the distance relationship between the edge segment and the main segment is not considered. If the point cloud object is only divided into multiple points according to multiple standard vectors Cloud fragments, there will be such a situation:

The point cloud segment corresponding to the same standard vector may have a large spatial distance between the points contained. Such a simple rough division operation has nothing to do with the user's perspective, and the point cloud segment corresponding to the same standard vector is geometrically spatially related Small and fragmented. In other words, the associated points in the point cloud object that are very close in space are segmented and projected into different standard vectors because of the difference in standard vectors.

According to the point cloud projection processing method, receiving processing method and system based on the user's perspective of the present invention, when the point cloud object is divided into multiple point cloud fragments according to multiple standard vectors, the factor of the user's perspective is introduced to perform the multiple point cloud fragments. Distinguish, divided into a main segment and multiple edge segments. Among them, the user has a high degree of attention to the front content of the point cloud object, then the main segment is determined based on the user's perspective. In addition, the edge segments associated with the main segment, such as the left and right sides and the top and bottom sides Point cloud segments are used as edge segments. The projection direction of these edge segments further considers the distance relationship between these edge segments and the main segment. The projection vector of the points in the edge segment is adjusted and optimized. If When the spatial connection is large, the projection vector of the point in the edge segment is changed based on the user's perspective, so that it is projected according to the main vector direction consistent with the main segment. In addition, if the spatial connection is small, keep the edge segment The projection vector of the point. After completing the optimization and adjustment of the above-mentioned projection vector, according to the optimized and adjusted projection vector, perform projection operations on multiple point cloud fragments of the point cloud object, including the main fragment and multiple edge fragments, and the subsequent point cloud compression and encoding transmission Processing etc.

The above optimization and adjustment of the projection direction of the edge segment takes into account the user's perspective and the spatial correlation with the main vector, so that those points in the edge segment that are largely related to the main segment are changed and projected according to the main vector of the main segment, which pays more attention to the user. The direction of higher degree of concentration is concentrated, and the adjusted and concentrated points are originally highly related and close in space with the main segment of higher attention. This avoids the scattered fragmentation and overcomes the problem of point cloud objects. The associated points with very close spatial distance are divided and projected into different standard vectors because of the difference in standard vectors.

{Third Embodiment}

In addition, as a third embodiment, the present invention also provides a point cloud projection transmission method based on the user's perspective, a point cloud receiving processing method based on the user's perspective, and a point cloud projection transmission system based on the user's perspective, and a point cloud projection transmission system based on the user's perspective. Point cloud receiving and processing system.

The above-mentioned first embodiment and the second embodiment are combined as the technical solution of the third embodiment. There are three types in this embodiment, and the specific content of the projection adjustment optimization and the multi-view coding transmission in the first and second embodiments are repeated, and the description is omitted here. In the third embodiment, the third embodiment corresponds to the content in the first embodiment and the second embodiment, and can be obtained from the above-mentioned first embodiment and second embodiment respectively.

In the third embodiment, the point cloud projection transmission method based on the user's perspective includes the following steps:

Preliminarily divide the point cloud object on the standard vector to obtain the point cloud segment to be processed, and distinguish the main segment and multiple edge segments according to the user's perspective;

Considering the spatial connection between the edge segment and the main segment, maintain the projection vector of the edge segment or change it based on the user's perspective;

Take the adjusted and changed main segment as the point cloud segment that needs to be observed, and adjust and change the edge segment and irrelevant segments as the point cloud segment that does not need to be observed; and

The point cloud segments that need to be observed are allocated with priority resources for encoding transmission, and the point cloud segments with a perspective that do not need to be observed are allocated for encoding and transmission compared to the priority allocation of secondary resources.

In the third embodiment, the point cloud projection transmission system based on the user's perspective as the sending end, the point cloud receiving processing method based on the user's perspective as the receiving end, and the point cloud projection and transmission system based on the user's perspective as the receiving end can be determined by the point cloud projection transmission system based on the user's perspective. The cloud projection transmission method can be deduced by analogy, and also includes the content of the combination of the above-mentioned first embodiment and the second embodiment respectively, and the overlapping content of the above-mentioned first embodiment and the second embodiment is omitted here.

{Fourth Embodiment}

The present invention corresponds to the above-mentioned first embodiment, and also provides a point cloud projection transmission method based on the user's perspective, which is characterized in that it includes: preliminarily dividing the point cloud object on the standard vector to obtain the point cloud segment to be processed; The point cloud fragments to be processed are further divided to obtain the point cloud fragments of the corresponding perspective category, including the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed; the point cloud fragments that need to be observed are assigned priority resources for encoding transmission, and The point cloud segment of the perspective that does not need to be observed is preferentially allocated to secondary resources for encoding transmission.

In the fourth embodiment, the point cloud projection transmission system based on the user's perspective as the sending end, the point cloud receiving processing method based on the user's perspective as the receiving end, and the point cloud projection and transmission system based on the user's perspective as the receiving end can be determined by the point cloud projection transmission system based on the user's perspective. The cloud projection transmission method can be deduced by analogy, and it also includes the content of the above-mentioned first embodiment, and the overlapping content of the above-mentioned first embodiment is omitted here.

{Fifth Embodiment}

The present invention corresponds to the above-mentioned second embodiment, and also provides a point cloud projection transmission method based on a user's perspective, which is characterized in that it includes: a point cloud segment to be processed, distinguishing a main segment and a plurality of edge segments according to the user's perspective; Considering the spatial connection between the edge segment and the main segment, the projection vector of the edge segment is maintained or changed based on the user's perspective; subsequent point cloud processing is performed after projection.

In the fifth embodiment, the point cloud projection transmission system based on the user's perspective as the sending end, the point cloud receiving processing method based on the user's perspective as the receiving end, and the point cloud projection and transmission system based on the user's perspective as the receiving end can be determined by the point cloud projection transmission system based on the user's perspective. The cloud projection transmission method can be deduced by analogy, and it also includes the content of the above-mentioned second embodiment, and the overlapping content of the above-mentioned second embodiment is omitted here.

In the description of this application, it should be understood that the terms "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", The orientation or positional relationship indicated by "bottom", "inner", "outer", etc. is based on the orientation or positional relationship shown in the drawings, and is only for the convenience of describing the application and simplifying the description, rather than indicating or implying the pointed device Or the element must have a specific orientation, be constructed and operated in a specific orientation, and therefore cannot be understood as a limitation of the present application.

Those skilled in the art know that, in addition to implementing the system, device and various modules provided by the present invention in a purely computer-readable program code manner, it is completely possible to make the system, device and various modules provided by the present invention by logically programming method steps The same program is implemented in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, and embedded microcontrollers. Therefore, the system, device and various modules provided by the present invention can be regarded as a kind of hardware component, and the modules included in it for implementing various programs can also be regarded as the structure within the hardware component; Modules for realizing various functions can be regarded as both software programs for realizing methods and structures within hardware components.

The specific embodiments of the present invention have been described above. It should be understood that the present invention is not limited to the above specific embodiments, and those skilled in the art can make various changes or modifications within the scope of the claims, which does not affect the essence of the present invention. In the case of no conflict, the embodiments of the application and the features in the embodiments can be combined with each other arbitrarily.

A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through a computer program. The program can be stored in a computer readable storage medium, and the program can be stored in a computer readable storage medium. During execution, it may include the procedures of the above-mentioned method embodiments. Wherein, the storage medium may be a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM), etc.

The embodiment of the present invention provides a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, it realizes the point cloud projection processing method based on the user's perspective as provided by the embodiment of the present invention. Point cloud receiving and processing method.

Any combination of one or more computer-readable media may be used. The computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. The computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or a combination of any of the above. More specific examples (non-exhaustive list) of computer-readable storage media include: electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), Erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In this document, the computer-readable storage medium can be any tangible medium that contains or stores a program, and the program can be used by or in combination with an instruction execution system, apparatus, or device.

The computer-readable signal medium may include a data signal propagated in baseband or as a part of a carrier wave, and computer-readable program code is carried therein. This propagated data signal can take many forms, including, but not limited to, electromagnetic signals, optical signals, or any suitable combination of the foregoing. The computer-readable signal medium may also be any computer-readable medium other than the computer-readable storage medium. The computer-readable medium may send, propagate, or transmit the program for use by or in combination with the instruction execution system, apparatus, or device .

The program code contained on the computer-readable medium can be transmitted by any suitable medium, including but not limited to wireless, wire, optical cable, RF, etc., or any suitable combination of the above. The computer program code used to perform the operations of the present invention can be written in one or more programming languages or a combination thereof. The programming languages include object-oriented programming languages—such as Java, Smalltalk, C++, and also conventional Procedural programming language-such as "C" language or similar programming language. The program code can be executed entirely on the user's computer, partly on the user's computer, executed as an independent software package, partly on the user's computer and partly executed on a remote computer, or entirely executed on the remote computer or server. In the case of a remote computer, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, using an Internet service provider to pass Internet connection).

Although the present invention has been disclosed as above in preferred embodiments, it is not intended to limit the present invention. Any person skilled in the art can use the methods and technical content disclosed above to improve the present invention without departing from the spirit and scope of the present invention. The technical solution makes possible changes and modifications. Therefore, any simple modifications, equivalent changes and modifications made to the above embodiments based on the technical essence of the present invention without departing from the content of the technical solution of the present invention belong to the technical solution of the present invention. protected range.

Claims

A point cloud projection transmission method based on the user's perspective, which is characterized in that it includes:

Preliminarily divide the point cloud object on the standard vector to obtain the point cloud segment to be processed;

The projection transmission scheme is determined based on the user's perspective, and the point cloud segment to be processed is processed.
The point cloud projection transmission method based on the user's perspective according to claim 1, characterized in that:

Take the nearest standard normal vector as the division category of the midpoint of the point cloud object;

Divide the midpoint of the point cloud object into multiple first-level point cloud fragments based on six standard normal vectors;

In the same first-level point cloud segment, points with the same normal vector and a distance less than a predetermined value are further divided into second-level point cloud segments as cloud segments to be processed.
The point cloud projection transmission method based on the user's perspective according to claim 1, wherein the projection transmission scheme comprises:

The point cloud fragments to be processed are further divided to obtain the point cloud fragments of the corresponding viewing angle category, including the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed;

The point cloud segments that need to be observed are allocated with priority resources for encoding transmission, and the point cloud segments with a perspective that do not need to be observed are allocated for encoding and transmission compared to the priority allocation of secondary resources.
The point cloud projection transmission method based on the user's perspective according to claim 1, wherein the projection transmission scheme includes any one or more of the following:

Do not modify the point cloud projection scheme, improve the point cloud projection scheme, and modify the standard normal vector scheme according to the user's perspective.
The point cloud projection transmission method based on the user's perspective according to claim 4, wherein the solution of not modifying the point cloud projection method comprises:

The point cloud object is preliminarily divided on the standard vector, and the parameter list is associated with the standard vector as the projection vector to obtain the point cloud fragment to be processed,

Considering the user's perspective and the direction of the projection vector, the point cloud segment to be processed is divided into the point cloud segment that needs to be observed and the point cloud segment that does not need to be observed.
The point cloud projection transmission method based on the user's perspective according to claim 5, characterized in that,

Set the corresponding codec parameters for the point cloud segments that need to be observed and the point cloud segments that do not need to be observed,

Record the coding and decoding parameters of each point cloud segment, as new additional information for coding transmission.
The point cloud projection transmission method based on the user's perspective according to claim 4, wherein the improved point cloud projection solution includes:

The point cloud fragments to be processed are further divided to obtain the point cloud fragments of the corresponding viewing angle category, including the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed;

Project the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed on the projection plane, and change the size of the projection plane as needed to obtain the pictures that need to be observed and the pictures that do not need to be observed.
The point cloud projection transmission method based on the user's perspective according to claim 7, characterized in that it further comprises:

As the first improved point cloud projection solution, under the premise of dislocation between the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed, the pictures that do not need to be observed are merged into the pictures that need to be observed, and a projected picture is obtained. Compression coding.
The point cloud projection transmission method based on the user's perspective according to claim 8, wherein the obtained one projection picture forms a group of video sequences,

Record the index parameter index corresponding to the point cloud segment that does not need to be observed and the scale of the size change, as new additional information for encoding transmission.
The point cloud projection transmission method based on the user's perspective according to claim 7, characterized in that it further comprises:

As the second improved point cloud projection solution, the need to observe pictures and the unneeded pictures are compressed and coded separately.
The point cloud projection transmission method based on the user's perspective according to claim 10, characterized in that:

The projected pictures that need to be observed and the pictures that do not need to be observed form two sets of video sequences respectively, and the two sets of video sequences are encoded,

Record the change ratio of the video sequence corresponding to the point cloud segment that does not need to be observed, and encode it for transmission as newly added additional information, and further record the position of each group of video sequences in the coded stream.
The point cloud projection transmission method based on the user's perspective according to claim 4, wherein the third improved point cloud projection solution includes:

The point cloud fragments to be processed are grouped according to six standard normal vectors and projected onto six projection planes respectively. Based on the user's perspective, the size of the six projection planes is changed according to requirements, and compression coding is performed respectively.
The point cloud projection transmission method based on the user's perspective according to claim 12, characterized in that,

Project the point cloud segments corresponding to each standard normal vector to each group of video sequences, and encode six groups of video sequences,

Record the change ratio of the video sequence corresponding to the point cloud segment that does not need to be observed, and encode and transmit it as new additional information, and further record the position of each group of video sequences in the coded stream.
The point cloud projection transmission method based on the user's perspective according to claim 4, wherein the scheme for modifying the standard normal vector according to the user's perspective includes:

Determine the angle of view vector according to the user's observation angle;

Perform rotation mapping on the view vector and one of the standard normal vectors, and then perform the same rotation mapping processing on the other standard normal vectors, and determine the projection direction after the rotation mapping on the premise that the overall modification amount is less than a predetermined value.
The point cloud projection transmission method based on the user's perspective according to claim 14, characterized in that:

Among them, the step of making the overall modification amount less than a predetermined value includes: after the view angle vector corresponds to the standard normal vector and rotates, the six standard normal vectors after the rotation mapping including the view angle vector correspond to the original six The sum of the dot products of the standard normal vector takes the maximum value or a larger value within a predetermined range.
The point cloud projection transmission method based on the user's perspective according to claim 15, characterized in that,

The standard normal vector after the rotation mapping is adjusted will be encoded and transmitted as newly added additional information.
The point cloud projection transmission method based on the user's perspective according to claim 1, characterized in that it comprises:

Among them, the projection transmission scheme includes:

For the point cloud segment to be processed, distinguish the main segment and multiple edge segments according to the user's perspective;

Considering the spatial connection between the edge segment and the main segment, maintain the projection vector of the edge segment or change it based on the user's perspective;

After projection, the subsequent point cloud processing is performed.
The point cloud projection transmission method based on the user's perspective according to claim 17, characterized in that:

Among them, taking into account the judgment basis of the spatial connection between the edge segment and the main segment, including:

Determine the main vector in the standard normal vector according to the user's perspective;

The main segment and the edge segment are respectively projected along the main vector direction to obtain the main two-dimensional projection map and the edge two-dimensional projection map;

Based on the two-dimensional projection map of each edge, calculate the intersection and ratio with each main two-dimensional projection map respectively;

The multiple intersection ratios are compared with the preset threshold value as the basis for judging the spatial connection.
The point cloud projection transmission method based on the user's perspective according to claim 18, characterized in that:

Among them, if there are multiple intersection ratios greater than the preset threshold, change the projection vector of the edge segment to the main vector; if none of the multiple intersection ratios is greater than the preset threshold, keep The projection vector of this edge segment.
The point cloud projection transmission method based on the user's perspective according to claim 19, characterized in that:

Among them, the preset threshold value depends on the surface characteristics of the point cloud object, distinguishing the surface characteristics of the object point cloud object and the surface characteristics of the character point cloud object.

The determination of the threshold value can be further adjusted and determined based on the experimental results.
The point cloud projection transmission method based on the user's perspective according to claim 17, characterized in that,

According to the position of the user when viewing the point cloud object, a vector from the point cloud to the user’s observation point is used to represent the user’s observation perspective, which is called the perspective vector.

Calculate the distance between the viewing angle vector and the six standard normal vectors, and the standard normal vector with the closest distance to the viewing angle vector is called the principal vector under the viewing angle.
The point cloud projection transmission method based on the user's perspective according to claim 17, characterized in that,

In addition to distinguishing the main segment and multiple edge segments of the point cloud object according to the user’s perspective, it also includes distinguishing irrelevant segments.

Among them, the standard normal vector closest to the user's viewing angle is determined as the main vector;

Determine the point cloud segment to be processed that is the same as the main vector among the multiple standard normal vectors as the main segment;

Determine the to-be-processed point cloud segment opposite to the main vector among the multiple standard normal vectors as irrelevant segments;

The to-be-processed point cloud segment associated with the main vector among the multiple standard normal vectors is determined as an edge segment.
The point cloud projection transmission method based on the user's perspective according to claim 1, characterized in that:

In the subsequent point cloud processing after projection, the projection relationship identification field is used to identify the projection vector of the point in the point cloud object after processing and adjustment, including the point cloud fragment index value and the corresponding projection vector value.
A point cloud receiving and processing method based on the user's perspective, which is characterized in:

Use the codec parameters determined by the user's perspective to obtain a video sequence from the received bitstream, process the video sequence to obtain a point cloud segment, and decode and reconstruct each point cloud segment to obtain a point cloud object.
The method for receiving and processing a point cloud based on a user's perspective according to claim 24, wherein:

The process of encoding, decoding and subsequent reconstruction of each point cloud segment according to the encoding and decoding parameters includes any one or more of the following:

The coding and decoding parameters are set at the sending end for each point cloud segment to distinguish between the point cloud segment that needs to be observed and the point cloud segment that does not need to be observed, and the subsequent reconstruction of the point cloud object is performed; or

It is necessary to restore the point cloud segment to its original size according to the received parameter information and size change information, and then perform subsequent point cloud object reconstruction; or

According to the position of each video sequence in the code stream, each video sequence is restored, and these video sequences are restored to the original size according to the received change size, and the point clouds recovered from multiple sets of video sequences need to be spliced. Reconstruction of point cloud objects; or

It is necessary to perform the inverse process of point cloud projection according to the standard normal vector adjusted by the rotation mapping of the transmitting end, so as to reconstruct a three-dimensional point cloud sequence from the two-dimensional video sequence and perform the reconstruction of the point cloud object.
A point cloud projection transmission system based on the user's perspective is characterized by:

The fragment generation module is used to preliminarily divide the point cloud object on the standard vector to obtain the point cloud fragment to be processed;

The projection coding processing module determines the projection transmission scheme based on the user's perspective, and processes the point cloud segment to be processed.
A point cloud receiving and processing system based on the user's perspective is characterized by:

The inverse processing module uses the codec parameters determined from the user's perspective to obtain a video sequence from the received bitstream, process the video sequence to obtain a point cloud segment, and decode and reconstruct each point cloud segment to obtain a point cloud object.
A point cloud projection transmission method based on the user's perspective, which is characterized in that it includes:

Preliminarily divide the point cloud object on the standard vector to obtain the point cloud segment to be processed;

The point cloud fragments to be processed are further divided to obtain the point cloud fragments of the corresponding viewing angle category, including the point cloud fragments that need to be observed and the point cloud fragments that do not need to be observed;

The point cloud fragments that need to be observed are allocated with priority resources for encoding transmission, and the point cloud fragments that do not need to be observed are compared with the priority allocation of secondary resources for encoding transmission.
A point cloud projection transmission method based on the user's perspective, which is characterized in that it includes:

For the point cloud segment to be processed, distinguish the main segment and multiple edge segments according to the user's perspective;

Considering the spatial connection between the edge segment and the main segment, maintain the projection vector of the edge segment or change it based on the user's perspective;

After projection, the subsequent point cloud processing is performed.
A point cloud projection transmission method based on the user's perspective, which is characterized in that it includes:

Preliminarily divide the point cloud object on the standard vector to obtain the point cloud segment to be processed, and distinguish the main segment and multiple edge segments according to the user's perspective;

Considering the spatial connection between the edge segment and the main segment, maintain the projection vector of the edge segment or change it based on the user's perspective;

Take the adjusted and changed main segment as the point cloud segment that needs to be observed, and adjust and change the edge segment and irrelevant segments as the point cloud segment that does not need to be observed; and

The point cloud segments that need to be observed are allocated with priority resources for encoding transmission, and the point cloud segments with a perspective that do not need to be observed are allocated for encoding and transmission compared to the priority allocation of secondary resources.