CN116485634B - Point cloud display diagram generation method and device, electronic equipment and storage medium


Info

Publication number
CN116485634B
CN116485634B (application CN202310377559.5A)
Authority
CN
China
Prior art keywords
space
area
panorama
nth
pixel points
Prior art date
Legal status
Active
Application number
CN202310377559.5A
Other languages
Chinese (zh)
Other versions
CN116485634A (en)
Inventor
Name not published at the inventor's request
Current Assignee
Beijing Chengshi Wanglin Information Technology Co Ltd
Original Assignee
Beijing Chengshi Wanglin Information Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Beijing Chengshi Wanglin Information Technology Co Ltd
Priority to CN202310377559.5A
Publication of CN116485634A
Application granted
Publication of CN116485634B

Classifications

    • G06T 3/06 Topological mapping of higher dimensional structures onto lower dimensional surfaces
    • G06N 3/0455 Auto-encoder networks; Encoder-decoder networks
    • G06N 3/0464 Convolutional networks [CNN, ConvNet]
    • G06T 3/4038 Image mosaicing, e.g. composing plane images from plane sub-images
    • G06T 7/337 Image registration using feature-based methods involving reference images or patches
    • G06T 7/593 Depth or shape recovery from multiple images, from stereo images
    • G06V 10/26 Segmentation of patterns in the image field; cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; detection of occlusion
    • G06V 10/82 Image or video recognition or understanding using pattern recognition or machine learning, using neural networks
    • G06V 20/70 Labelling scene content, e.g. deriving syntactic or semantic representations
    • G06T 2207/10028 Range image; depth image; 3D point clouds
    • G06T 2207/20084 Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Processing Or Creating Images (AREA)
  • Image Processing (AREA)

Abstract

The application provides a point cloud display diagram generation method and device, an electronic device, and a storage medium, wherein the method comprises the following steps: step 1, generating an Nth planar point cloud display diagram according to an Nth panorama of an Nth space; step 2, when an (N+1)th panorama of an (N+1)th space is acquired, generating an (N+1)th planar point cloud display diagram, wherein the (N+1)th space and the Nth space are associated within a target space based on spatial structure; step 3, stitching the two planar point cloud display diagrams based on image registration to obtain a target point cloud display diagram; step 4, judging whether the target space contains other spaces that have not participated in stitching; if so, executing step 5, otherwise ending the processing; and step 5, merging the Nth space and the (N+1)th space as the Nth space, taking the target point cloud display diagram as the Nth planar point cloud display diagram, and returning to step 2. The method and device improve the processing efficiency of point cloud display diagrams, present the point cloud effect in a new image display form, and improve the visual experience of users.

Description

Point cloud display diagram generation method and device, electronic equipment and storage medium
Technical Field
The present disclosure relates to the field of computer data processing technologies, and in particular, to a point cloud display diagram generation method and apparatus, an electronic device, and a storage medium.
Background
When indoor visual positioning is performed, a point cloud display diagram in 3D form is generally adopted as an aid so that the user can conveniently check whether the current positioning is accurate. At present, when generating such a diagram, a corresponding 3D point cloud image is usually generated for each space, and the 3D point cloud images of the different spaces are then combined to obtain the required 3D point cloud display diagram.
In this processing mode, the required 3D point cloud display diagram can only be obtained by combination after the 3D point cloud images corresponding to all the different spaces have been acquired, so the processing period is long and the processing efficiency is low.
Existing schemes generate a 3D point cloud image mainly in one of two ways: one is to visualize the sparse three-dimensional points generated during visual positioning; the other is to directly visualize the point cloud acquired by a depth device.
In the first way, the sparse three-dimensional points generated during visual positioning are few and can hardly represent a complete indoor room structure, so it is difficult to judge whether the positioning information of the current point is correct. In the second way, visualizing a point cloud from a depth device depends on professional equipment; most indoor positioning devices are equipped only with image acquisition hardware, and adopting depth devices would greatly increase the processing cost.
Therefore, existing point cloud images have a single display form, it is difficult to judge from them whether positioning information is correct, and generating the required 3D point cloud display diagram suffers from low processing efficiency, strong equipment dependence, and high cost.
Disclosure of Invention
In view of the foregoing, embodiments of the present application provide a point cloud display diagram generating method, apparatus, electronic device, and storage medium that overcome or at least partially solve the foregoing problems.
In a first aspect, an embodiment of the present application provides a method for generating a point cloud display diagram, including:
step 1, generating an Nth planar point cloud display diagram according to an Nth panorama obtained by panoramic shooting of an Nth space of a target space;
step 2, when an (N+1)th panorama of an (N+1)th space is acquired based on panoramic shooting, generating an (N+1)th planar point cloud display diagram according to the (N+1)th panorama, wherein the (N+1)th space is a space in the target space that is determined, based on spatial structure, to be associated with the Nth space; the planar point cloud display diagram corresponding to each of the Nth space and the (N+1)th space is an image generated by projecting some pixel points carrying color characteristic values, based on the three-dimensional coordinates corresponding to those pixel points in the corresponding panorama; the three-dimensional coordinates corresponding to the pixel points are determined based on a depth image and a panoramic segmentation image, both of which are generated from the panorama;
step 3, stitching the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram based on image registration to obtain a target point cloud display diagram;
step 4, judging whether the target space contains other spaces that have not participated in stitching; if so, executing step 5, and if not, ending the processing;
and step 5, merging the Nth space and the (N+1)th space as the Nth space, taking the target point cloud display diagram as the Nth planar point cloud display diagram, and returning to step 2.
In a second aspect, an embodiment of the present application provides a point cloud display diagram generating device, including:
a first generation module, used for generating an Nth planar point cloud display diagram according to an Nth panorama obtained by panoramic shooting of an Nth space of the target space;
a second generation module, used for generating an (N+1)th planar point cloud display diagram according to an (N+1)th panorama when the (N+1)th panorama of an (N+1)th space is acquired based on panoramic shooting, wherein the (N+1)th space is a space in the target space that is determined, based on spatial structure, to be associated with the Nth space; the planar point cloud display diagram corresponding to each of the Nth space and the (N+1)th space is an image generated by projecting some pixel points carrying color characteristic values, based on the three-dimensional coordinates corresponding to those pixel points in the corresponding panorama; the three-dimensional coordinates corresponding to the pixel points are determined based on a depth image and a panoramic segmentation image, both of which are generated from the panorama;
a stitching acquisition module, used for stitching the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram based on image registration to obtain a target point cloud display diagram;
a judging module, used for judging whether the target space contains other spaces that have not participated in stitching, the processing being ended when no such space exists;
and a processing module, used for merging the Nth space and the (N+1)th space as the Nth space when such spaces exist, taking the target point cloud display diagram as the Nth planar point cloud display diagram, and returning to the second generation module to continue processing.
In a third aspect, an embodiment of the present application provides an electronic device, including a processor, a memory, and a computer program stored on the memory and executable on the processor, wherein the computer program, when executed by the processor, implements the steps of the point cloud display diagram generation method according to the first aspect.
In a fourth aspect, embodiments of the present application provide a computer-readable storage medium on which a computer program is stored, the computer program implementing the steps of the point cloud display diagram generation method according to the first aspect when executed by a processor.
According to the technical scheme, once the Nth planar point cloud display diagram corresponding to the Nth space and the (N+1)th planar point cloud display diagram corresponding to the (N+1)th space (associated with the Nth space in spatial structure) have been generated, the two are stitched based on image registration to obtain a target point cloud display diagram; when other spaces that have not participated in stitching are detected, panoramas continue to be acquired and planar point cloud display diagrams generated from them to participate in stitching. A matching planar point cloud display diagram can thus be generated as soon as each new panorama is obtained and stitched with the associated planar point cloud display diagram. While the next space is being panoramically shot, the previous space's panorama can be converted into a planar point cloud display diagram; and while planar point cloud display diagrams are being stitched, the next space can be shot. Different operations are executed in parallel at the same time, shortening the processing cycle and improving the processing efficiency of the point cloud display diagram.
By calculating the three-dimensional coordinates of pixel points based on the panoramic segmentation image and the depth image, the calculation accuracy of the three-dimensional coordinates of pixel points at scene edges is improved, a planar point cloud display diagram of better quality is obtained, the accuracy of positioning judgment is improved, and the display effect of the planar point cloud display diagram is optimized. No professional equipment is needed, which reduces equipment dependence and saves cost while preserving the point cloud display effect; and by acquiring the planar point cloud display diagram, the point cloud effect is presented in a new image display form, improving the user's visual experience.
Drawings
Fig. 1 is a schematic diagram of a point cloud display diagram generating method according to an embodiment of the present application;
fig. 2 is a schematic diagram of a method for stitching two planar point cloud display views based on image registration according to an embodiment of the present application;
fig. 3 is a schematic diagram of a method for generating a planar point cloud display diagram based on a panorama according to an embodiment of the present application;
fig. 4 shows a specific example of calculating three-dimensional coordinates of a wall pixel according to an embodiment of the present application;
FIG. 5 shows a specific illustration of a planar point cloud display diagram provided by an embodiment of the present application;
fig. 6 shows a schematic diagram of a point cloud display diagram generating device provided in an embodiment of the present application;
fig. 7 shows a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all, of the embodiments of the present application. All other embodiments, which can be made by one of ordinary skill in the art based on the embodiments herein without making any inventive effort, are intended to be within the scope of the present application.
It should be appreciated that reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present application. Thus, the appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. "A plurality of embodiments" herein means two or more embodiments.
In various embodiments of the present application, it should be understood that the sequence numbers of the following processes do not mean the order of execution, and the order of execution of the processes should be determined by the functions and internal logic thereof, and should not constitute any limitation on the implementation process of the embodiments of the present application.
The embodiment of the application provides a point cloud display diagram generating method, which is shown in fig. 1 and includes:
Step 1, generate an Nth planar point cloud display diagram according to an Nth panorama obtained by panoramic shooting of an Nth space of the target space.
The target space in this embodiment is a real physical space, which may be a building such as a house, a mall, an office building, or a gym; for example, an office building, a commercial building, or a residential building. The Nth space may be a unit of single spatial structure within the physical space, such as a room in a house.
After panoramic shooting of the Nth space of the target space to obtain the Nth panorama corresponding to that space, the Nth planar point cloud display diagram corresponding to the Nth space is generated based on the Nth panorama. The panorama used for generating the planar point cloud display diagram in this embodiment is a panorama that has undergone vertical correction; the panorama shown in a Virtual Reality (VR) scene is a VR panorama.
Step 2, when an (N+1)th panorama of an (N+1)th space is acquired based on panoramic shooting, generate an (N+1)th planar point cloud display diagram according to the (N+1)th panorama, wherein the (N+1)th space is a space in the target space that is determined, based on spatial structure, to be associated with the Nth space; the planar point cloud display diagram corresponding to each of the Nth space and the (N+1)th space is an image generated by projecting some pixel points carrying color characteristic values, based on the three-dimensional coordinates corresponding to those pixel points in the corresponding panorama; the three-dimensional coordinates corresponding to the pixel points are determined based on a depth image and a panoramic segmentation image, both of which are generated from the panorama.
When the (N+1)th panorama corresponding to the (N+1)th space of the target space is acquired based on panoramic shooting, the (N+1)th planar point cloud display diagram may be generated based on it, the (N+1)th space being spatially associated with the Nth space in the target space; both the (N+1)th space and the Nth space may be regarded as subspaces of the target space.
The planar point cloud display diagrams corresponding to the Nth space and the (N+1)th space are generated in the same way. The diagram corresponding to the Nth space is an image generated by projecting some pixel points carrying color characteristic values, based on the three-dimensional coordinates corresponding to those pixel points in the Nth panorama; correspondingly, the diagram corresponding to the (N+1)th space is generated likewise from the (N+1)th panorama. The three-dimensional coordinates corresponding to the pixel points are determined based on a depth image and a panoramic segmentation image, both generated from the panorama. Using both images to determine the three-dimensional coordinates avoids, to a certain extent, the large deviations at scene edges that arise when the coordinates are computed from the depth image alone.
The color characteristic value carried by a pixel point is its pixel value in the panorama, determined by the values of the three components R (red), G (green), and B (blue), each between 0 and 255. Projecting pixel points carrying color characteristic values can be understood as determining the plane coordinates corresponding to the pixel points based on their three-dimensional coordinates, so as to obtain the planar point cloud display diagram by plane projection.
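As a hedged illustration of this plane projection, the sketch below drops the height component of each three-dimensional coordinate and rasterizes the remaining horizontal coordinates into an RGB image. The function name, resolution parameter, and bounds handling are assumptions for illustration, not values fixed by this application.

```python
import numpy as np

def project_to_planar_cloud(points_xyz: np.ndarray, colors_rgb: np.ndarray,
                            pixels_per_meter: float = 50.0) -> np.ndarray:
    """Top-down plane projection of colored 3D points (sketch; resolution is assumed).

    points_xyz: (M, 3) three-dimensional coordinates of the projected pixel points.
    colors_rgb: (M, 3) uint8 RGB color characteristic values from the panorama.
    """
    xy = points_xyz[:, :2]                            # drop the height: plane coordinates
    idx = ((xy - xy.min(axis=0)) * pixels_per_meter).astype(int)
    h, w = idx[:, 1].max() + 1, idx[:, 0].max() + 1
    image = np.zeros((h, w, 3), dtype=np.uint8)       # empty (black) canvas
    image[idx[:, 1], idx[:, 0]] = colors_rgb          # paint each point with its RGB value
    return image
```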
Because the three-dimensional coordinates of pixel points are calculated based on the panoramic segmentation image and the depth image, the calculation accuracy of the three-dimensional coordinates of pixel points at scene edges is improved; after plane projection, a planar point cloud display diagram of better quality can therefore be obtained, optimizing its display effect.
Step 3, based on image registration, stitch the (N+1)th planar point cloud display diagram with the Nth planar point cloud display diagram to obtain a target point cloud display diagram.
Because the (N+1)th space is associated with the Nth space in spatial structure, once the Nth planar point cloud display diagram corresponding to the Nth space and the (N+1)th planar point cloud display diagram corresponding to the (N+1)th space have been acquired, the two can be stitched based on image registration, and the target point cloud display diagram is obtained through this stitching.
When stitching the two planar point cloud display diagrams based on image registration, the (N+1)th panorama and the Nth panorama can be registered against each other, and the two planar point cloud display diagrams stitched based on the registration result to obtain the target point cloud display diagram.
Step 4, judge whether the target space contains other spaces that have not participated in stitching; if so, execute step 5, and if not, end the processing.
After the (N+1)th and Nth planar point cloud display diagrams are stitched to obtain the target point cloud display diagram, whether the target space contains other spaces that have not participated in stitching can be judged. If not, the planar point cloud display diagrams corresponding to all spaces of the target space have been stitched; if so, step 5 is performed.
Step 5, merge the Nth space and the (N+1)th space as the Nth space, take the target point cloud display diagram as the Nth planar point cloud display diagram, and return to step 2.
When it is determined that the target space still contains spaces whose planar point cloud display diagrams have not participated in stitching, the already-stitched Nth and (N+1)th spaces are merged as the Nth space, the target point cloud display diagram corresponding to them is taken as the Nth planar point cloud display diagram, and processing returns to step 2 to continue stitching.
After returning to step 2, another space that is spatially associated with the updated Nth space (formed by merging the original Nth and (N+1)th spaces) and has not participated in stitching may be determined and taken as the updated (N+1)th space. If a planar point cloud display diagram already exists for the updated (N+1)th space, the planar point cloud display diagram corresponding to the updated Nth space (the target point cloud display diagram of the original Nth and (N+1)th spaces) can be directly stitched with that of the updated (N+1)th space based on image registration. If no such diagram exists yet, a planar point cloud display diagram can first be generated based on the panorama corresponding to the updated (N+1)th space, and the stitching then performed.
After this stitching is completed, whether any space of the target space has still not participated in stitching is judged again; if so, the Nth space continues to be updated until the planar point cloud display diagrams corresponding to all spaces of the target space have been obtained and the stitching is complete.
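To make the control flow of steps 1-5 concrete, the following is a minimal sketch in Python. The helper callables `capture_panorama`, `panorama_to_cloud`, and `register_and_stitch` are hypothetical stand-ins for the operations described above (panoramic shooting, depth/segmentation-based conversion, and registration-based stitching); they are not interfaces defined by this application.

```python
from typing import Any, Callable, List

def build_target_point_cloud(
    spaces: List[Any],
    capture_panorama: Callable[[Any], Any],   # hypothetical: panoramic shooting of one space
    panorama_to_cloud: Callable[[Any], Any],  # hypothetical: depth/segmentation-based conversion
    register_and_stitch: Callable,            # hypothetical: panorama registration + stitching
) -> Any:
    """Iterative stitching loop of steps 1-5 (sketch only, not the patented code)."""
    pano_n = capture_panorama(spaces[0])            # step 1: Nth panorama
    cloud_n = panorama_to_cloud(pano_n)             # step 1: Nth planar point cloud display
    for space in spaces[1:]:                        # spaces associated by spatial structure
        pano_next = capture_panorama(space)         # step 2: (N+1)th panorama
        cloud_next = panorama_to_cloud(pano_next)   # step 2: (N+1)th planar point cloud display
        # step 3: register the two panoramas and stitch the two planar point clouds
        cloud_n = register_and_stitch(cloud_n, cloud_next, pano_n, pano_next)
        pano_n = pano_next                          # step 5: merged result becomes the new Nth
    return cloud_n                                  # step 4: no un-stitched space remains
```

The loop terminates exactly when no space that has not participated in stitching remains (step 4), and each iteration folds one more space into the merged Nth space (step 5).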
Through the above implementation process, once the Nth planar point cloud display diagram corresponding to the Nth space and the (N+1)th planar point cloud display diagram corresponding to the (N+1)th space (associated with the Nth space in spatial structure) have been generated, the two are stitched based on image registration to obtain the target point cloud display diagram; when other spaces that have not participated in stitching are detected, panoramas continue to be acquired and planar point cloud display diagrams generated from them to participate in stitching. A matching planar point cloud display diagram can thus be generated as soon as each new panorama is obtained and stitched with the associated planar point cloud display diagram. While the next space is being panoramically shot, the previous space's panorama can be converted into a planar point cloud display diagram; and while planar point cloud display diagrams are being stitched, the next space can be shot. Different operations are executed in parallel at the same time, shortening the processing cycle and improving the processing efficiency of the point cloud display diagram.
By calculating the three-dimensional coordinates of pixel points based on the panoramic segmentation image and the depth image, the calculation accuracy of the three-dimensional coordinates of pixel points at scene edges is improved, a planar point cloud display diagram of better quality is obtained, the accuracy of positioning judgment is improved, and the display effect of the planar point cloud display diagram is optimized. No professional equipment is needed, which reduces equipment dependence and saves cost while preserving the point cloud display effect; and by acquiring the planar point cloud display diagram, the point cloud effect is presented in a new image display form, improving the user's visual experience.
The process of stitching planar point cloud display diagrams based on image registration is described below. The spaces associated based on spatial structure are adjacent spaces. Referring to fig. 2, stitching the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram based on image registration to obtain a target point cloud display diagram includes the following steps:
Step 31, perform image registration on the (N+1)th panorama and the Nth panorama, and obtain the relative pose between the two.
Step 32, according to the relative pose, stitch the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram to obtain the target point cloud display diagram.
In this embodiment, the spaces associated based on spatial structure in the target space are adjacent spaces; that is, the Nth space and the (N+1)th space are adjacent spaces in the target space. For example, the target space is a target house, the Nth space is a living-room space, and the (N+1)th space is a bedroom space adjacent to the living-room space.
It should be noted that the Nth and (N+1)th spaces associated based on spatial structure may also be associated based on the panorama shooting order, i.e., the two are consecutive in shooting order; alternatively, they may not be so associated, i.e., the two are not consecutive in shooting order.
When stitching the (N+1)th and Nth planar point cloud display diagrams based on image registration, the (N+1)th panorama and the Nth panorama can be registered, and the relative pose between them acquired from this panorama-based registration. Once the relative pose of the two panoramas is determined, the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram are stitched based on it, and the target point cloud display diagram corresponding to the Nth and (N+1)th spaces is acquired.
Stitching the (N+1)th planar point cloud display diagram with the Nth planar point cloud display diagram can be understood as merging the two and placing them together in a common frame.
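A minimal sketch of this "merge and place" operation follows. It assumes, purely for illustration, that the relative pose from panorama registration reduces to a rotation angle and a translation in the floor plane; the reduction and the function interface are assumptions, not the application's definition.

```python
import numpy as np

def stitch_planar_clouds(points_n: np.ndarray, points_next: np.ndarray,
                         yaw: float, t_xy: np.ndarray) -> np.ndarray:
    """Place the (N+1)th planar cloud in the Nth frame and merge the two (sketch).

    yaw and t_xy stand in for the relative pose obtained from panorama registration,
    reduced here (by assumption) to a planar rotation and translation.
    """
    c, s = np.cos(yaw), np.sin(yaw)
    rot = np.array([[c, -s], [s, c]])             # 2D rotation from the relative pose
    placed = points_next[:, :2] @ rot.T + t_xy    # rotate, then translate into the Nth frame
    return np.vstack([points_n[:, :2], placed])   # union of the two planar point sets
```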
In the above embodiment, the relative pose is acquired from the registration of the two spaces' panoramas, and the two spaces' planar point cloud display diagrams are stitched based on that relative pose; the panoramas thus serve as the registration objects, and the target point cloud display diagram is obtained by panorama-registration-based stitching of the planar point cloud display diagrams.
As an optional embodiment, after taking the target point cloud display diagram as the Nth planar point cloud display diagram and returning to step 2, the method further includes:
acquiring the target point cloud display diagram corresponding to the target space when it is determined that the planar point cloud display diagrams corresponding to all spaces of the target space have been stitched.
After the original Nth and (N+1)th spaces are merged as the Nth space and their target point cloud display diagram is taken as the Nth planar point cloud display diagram, another space that is associated in spatial structure with the updated Nth space (formed by merging the original Nth and (N+1)th spaces) and has not participated in stitching can be determined and taken as the updated (N+1)th space. Stitching of planar point cloud display diagrams then continues, updating the target point cloud display diagram.
When it is determined that all planar point cloud display diagrams corresponding to the target space have participated in stitching and the stitching is complete, the target point cloud display diagram corresponding to the target space, i.e., the overall planar point cloud display diagram of the target space, is acquired.
The foregoing mainly describes the stitching of planar point cloud display diagrams; the following describes the process of generating a planar point cloud display diagram from a panorama. Since generating the Nth planar point cloud display diagram from the Nth panorama is similar to generating the (N+1)th diagram from the (N+1)th panorama, the process is described below taking the Nth case as an example.
As an alternative embodiment, referring to fig. 3, generating the Nth planar point cloud display diagram according to the Nth panorama acquired by panoramic shooting of the Nth space of the target space includes the following steps:
Step 11, based on the Nth panorama, acquire the depth image and the panoramic segmentation image corresponding to the Nth panorama.
After the Nth panorama corresponding to the Nth space of the target space is acquired, the depth image and the panoramic segmentation image corresponding to it can be obtained by processing the Nth panorama. A depth image, also known as a range image, is an image whose pixel values are the distances (depths) from the image collector to points in the scene; it directly reflects the geometry of the scene's visible surfaces. The panoramic segmentation image is determined by dividing the panorama into areas and setting category labels for the pixel points of each area; the category label corresponding to each pixel point in the panorama can be obtained from it.
Step 12, according to the depth image and the panoramic segmentation image corresponding to the Nth panorama, acquire the three-dimensional coordinates corresponding to each target pixel point in a target set, the target set comprising at least some of the pixel points in the Nth panorama.
After the depth image and the panoramic segmentation image corresponding to the Nth panorama are acquired, the three-dimensional coordinates corresponding to at least some pixel points in the Nth panorama can be acquired from them; that is, the depth image is combined with the panoramic segmentation image to calculate the three-dimensional coordinates corresponding to each target pixel point in the target set of the Nth panorama.
The target set is a set of pixel points comprising at least some of the pixel points in the Nth panorama; through the cooperation of the depth image and the panoramic segmentation image, the three-dimensional coordinates corresponding to each target pixel point in the target set are calculated, yielding the three-dimensional coordinates of at least some pixel points in the Nth panorama.
Step 13, based on the three-dimensional coordinates of some of the target pixel points in the target set, project those target pixel points carrying color characteristic values to obtain the planar point cloud display diagram corresponding to the Nth space, the projected target pixel points being those that meet the projection requirements.
After the three-dimensional coordinates corresponding to each target pixel point in the target set are obtained, the target pixel points meeting the projection requirements can be determined within the set, and those pixel points, carrying color characteristic values, are projected based on their three-dimensional coordinates, so as to obtain the planar point cloud display diagram corresponding to the Nth space through plane projection.
In this embodiment, the three-dimensional coordinates of pixel points are calculated by combining the depth image with the panoramic segmentation image, which improves the calculation accuracy of the three-dimensional coordinates of pixel points at scene edges; after plane projection, a planar point cloud display diagram of better quality can therefore be obtained, improving the accuracy of positioning judgment and optimizing the display effect of the planar point cloud display diagram.
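Putting steps 11-13 together, the per-panorama conversion can be sketched as the composition below. Every helper name here (`predict_depth_and_labels`, `compute_target_coordinates`, `meets_projection_requirements`, `project_to_planar_cloud`) is a hypothetical stand-in for an operation described in this text, not an interface defined by the application.

```python
import numpy as np

def panorama_to_planar_cloud(panorama: np.ndarray,
                             predict_depth_and_labels,       # step 11 (hypothetical helper)
                             compute_target_coordinates,     # step 12 (hypothetical helper)
                             meets_projection_requirements,  # projection filter (assumed)
                             project_to_planar_cloud):       # step 13 (hypothetical helper)
    """Steps 11-13 for one panorama (sketch)."""
    depth, labels = predict_depth_and_labels(panorama)          # depth + segmentation images
    coords, pixels = compute_target_coordinates(depth, labels)  # 3D coords of the target set
    keep = meets_projection_requirements(coords, labels)        # boolean mask over target set
    colors = panorama[pixels[keep, 0], pixels[keep, 1]]         # RGB color characteristic values
    return project_to_planar_cloud(coords[keep], colors)       # plane projection
```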
For step 11, acquiring the depth image and the panoramic segmentation image corresponding to the Nth panorama based on the Nth panorama includes:
predicting the depth values of the pixel points in the Nth panorama based on a depth map model, and acquiring the depth image;
performing category prediction on the pixel points in the Nth panorama based on a semantic segmentation model, and obtaining the panoramic segmentation image carrying the category label corresponding to each pixel point in the Nth panorama.
When determining the depth image corresponding to the Nth space based on the Nth panorama, a depth map model can be used to process the Nth panorama and obtain the depth value of each pixel point, thereby obtaining the depth image.
The depth map model is trained on open-source images and real depth map data, and may adopt a common encoder-decoder network, such as a U-Net structure. By inputting the Nth panorama into the depth map model, the depth value of each pixel point can be predicted, yielding a depth image that represents the depth value corresponding to each pixel point in the Nth panorama.
When determining the panoramic segmentation image corresponding to the Nth space based on the Nth panorama, a semantic segmentation model can be used to process the Nth panorama and obtain the category label corresponding to each pixel point, thereby obtaining the panoramic segmentation image. Because the panoramic segmentation image includes the category label of every pixel point in the Nth panorama, the pixel points can be classified based on these labels. The "segmentation" here is not a true image segmentation but rather a division of the Nth panorama into areas based on the category labels of its pixel points, where pixel points within the same area share the same label.
The segmentation labels of the semantic segmentation model mainly comprise ceiling, wall surface, and floor, and may also include other object types, such as indoor furniture (tables, chairs, beds, and the like). The semantic segmentation model is trained on open-source images with real per-pixel category labels, and its structure may likewise adopt a common encoder-decoder network. By inputting the Nth panorama into the semantic segmentation model, the category label of each pixel point can be predicted, yielding the panoramic segmentation image carrying the category labels corresponding to the pixel points of the Nth panorama.
In this embodiment, the Nth panorama can be input into the depth map model and the semantic segmentation model respectively: the depth values of its pixel points are predicted by the depth map model to obtain the depth image, and category prediction is performed on its pixel points by the semantic segmentation model to obtain the panoramic segmentation image carrying the per-pixel category labels. The depth image and panoramic segmentation image are thus acquired from trained, mature models, ensuring quality while improving processing efficiency.
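Both predictions take a single forward pass per model. The sketch below assumes two pretrained encoder-decoder networks exposed as PyTorch modules, which is consistent with the "encoder-decoder network" structure mentioned above; the exact tensor shapes and interfaces are assumptions.

```python
import torch

@torch.no_grad()
def predict_depth_and_labels(panorama: torch.Tensor,
                             depth_model: torch.nn.Module,
                             seg_model: torch.nn.Module):
    """Run both encoder-decoder networks on one panorama (sketch; interfaces assumed).

    panorama: (1, 3, H, W) float tensor of the Nth panorama.
    """
    depth = depth_model(panorama)        # (1, 1, H, W): per-pixel depth values
    logits = seg_model(panorama)         # (1, C, H, W): scores for C category labels
    labels = logits.argmax(dim=1)        # (1, H, W): ceiling / wall / floor / furniture ...
    return depth.squeeze(0), labels.squeeze(0)
```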
In an optional embodiment, step 12 of acquiring the three-dimensional coordinates corresponding to each target pixel point in the target set according to the depth image and the panoramic segmentation image corresponding to the Nth panorama includes:
determining, according to the panoramic segmentation image, all pixel points in the Nth panorama corresponding to a first category label and all pixel points corresponding to a second category label, wherein the panoramic segmentation image comprises the category label corresponding to each pixel point in the Nth panorama, the pixel points corresponding to the first category label are located in a first area, and the pixel points corresponding to the second category label are located in a second area;
determining, based on the depth image, a first coordinate set comprising the three-dimensional coordinates corresponding to the pixel points of the first area, and a second coordinate set comprising the three-dimensional coordinates corresponding to the pixel points of the second area;
acquiring, according to at least one of the first coordinate set and the second coordinate set, a third coordinate set corresponding to at least some pixel points in a third area of the Nth panorama, and a fourth coordinate set corresponding to at least some pixel points in a fourth area of the Nth panorama;
wherein the pixel points corresponding to the first, second, third, and fourth coordinate sets are all target pixel points;
and the first, second, and third areas are respectively the top area, ground area, and wall area of the Nth space as they appear in the Nth panorama, while the fourth area is the area of the Nth panorama distinct from the first, second, and third areas.
After the depth image and the panoramic segmentation image corresponding to the Nth panorama are generated, the three-dimensional coordinates corresponding to each target pixel point in the target set can be obtained through their cooperation.
Since the panoramic segmentation image includes the category label of each pixel point in the Nth panorama, all pixel points corresponding to the first category label and all pixel points corresponding to the second category label can be determined from it. The first category label corresponds to the first area of the Nth panorama and the second category label to the second area; that is, the pixel points corresponding to the first category label are located in the first area, and those corresponding to the second category label in the second area. The first area is the top area of the Nth space in the Nth panorama, such as a ceiling area; the second area is the ground area of the Nth space in the Nth panorama.
After all pixel points in the Nth panorama located in the first area and corresponding to the first category label are acquired, their respective three-dimensional coordinates can be determined based on the depth image, and the first coordinate set acquired from the determined coordinates; after all pixel points located in the second area and corresponding to the second category label are acquired, their respective three-dimensional coordinates can likewise be determined based on the depth image, and the second coordinate set acquired from them.
Once the first and second coordinate sets are determined, the third coordinate set, corresponding to at least some pixel points located in the third area of the Nth panorama, and the fourth coordinate set, corresponding to at least some pixel points located in the fourth area, may be acquired according to at least one of the first and second coordinate sets.
The third area is the wall area of the Nth space in the Nth panorama, and the fourth area is the area of the Nth panorama distinct from the first, second, and third areas. The fourth area may be regarded as what remains of the Nth panorama after the first, second, and third areas are removed, i.e., the area enclosed by them.
The third coordinate set comprises the three-dimensional coordinates corresponding to at least some pixel points of the third area, and the three-dimensional coordinate of any pixel point in the third area needs to be determined based on the three-dimensional coordinate of the corresponding pixel point in the first or second coordinate set; likewise, the fourth coordinate set comprises the three-dimensional coordinates corresponding to at least some pixel points of the fourth area, determined in the same manner.
The pixel points corresponding to the first, second, third, and fourth coordinate sets are all target pixel points; because the third coordinate set covers at least some pixel points of the third area and the fourth coordinate set at least some of the fourth area, the target set comprises at least some of the pixel points in the Nth panorama.
According to this embodiment, all pixel points corresponding to the first category label and all those corresponding to the second can be obtained from the panoramic segmentation image; these pixel points are then processed with the depth image to determine the three-dimensional coordinates corresponding to the pixel points of the first and second areas, so the first and second coordinate sets are determined through the cooperation of the panoramic segmentation image and the depth image. After the first and second coordinate sets are determined, the third coordinate set of the third area and the fourth coordinate set of the fourth area are obtained from at least one of them; the three-dimensional coordinates of the pixel points of the remaining areas are thus computed from the already-existing coordinate sets, simplifying the processing flow and improving processing efficiency.
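For instance, the first-area and second-area pixel sets can be recovered directly from the panoramic segmentation image by masking on the category labels. The numeric label indices below are assumptions for illustration; the application does not fix numeric values for the labels.

```python
import numpy as np

# Assumed label indices; the application does not define numeric category label values.
CEILING_LABEL, FLOOR_LABEL = 0, 2

def region_pixels(labels: np.ndarray, class_id: int) -> np.ndarray:
    """Return the (row, col) panorama pixel coordinates of one segmented area.

    labels: (H, W) panoramic segmentation image of per-pixel category labels.
    """
    rows, cols = np.nonzero(labels == class_id)
    return np.stack([rows, cols], axis=1)

# e.g. pixel points of the first (top) area and the second (ground) area:
# first_area_px = region_pixels(labels, CEILING_LABEL)
# second_area_px = region_pixels(labels, FLOOR_LABEL)
```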
The process of determining the first set of coordinates and the second set of coordinates is described below. When determining, based on the depth image, a first coordinate set including three-dimensional coordinates corresponding to the pixel points of the first region and a second coordinate set including three-dimensional coordinates corresponding to the pixel points of the second region, the method includes:
acquiring a first height value corresponding to each pixel point in the first area and a second height value corresponding to each pixel point in the second area based on the depth image, wherein the height value corresponding to each pixel point is a vertical component corresponding to a reference three-dimensional coordinate determined by the pixel point based on the depth image in the height direction;
determining a first average height value according to the first height values respectively corresponding to the pixel points in the first area;
determining a second average height value according to second height values respectively corresponding to the pixel points in the second area;
according to the first average height value, panoramic pixel coordinates corresponding to each pixel point in the first area and a conversion formula, determining three-dimensional coordinates corresponding to each pixel point in the first area, and determining the first coordinate set based on the three-dimensional coordinates corresponding to each pixel point in the first area;
Determining three-dimensional coordinates corresponding to each pixel point in the second area according to the second average height value, the panoramic pixel coordinates corresponding to each pixel point in the second area and a conversion formula, and determining the second coordinate set based on the three-dimensional coordinates corresponding to each pixel point in the second area;
the conversion formula is used for converting panoramic pixel coordinates into three-dimensional coordinates.
The reference three-dimensional coordinates of the pixel points may be determined based on the depth image, and precisely, the reference three-dimensional coordinates directly determined based on the depth image are not on one 3D plane. For the first region and the second region, the first region corresponds to a plane in the three-dimensional space, and the second region corresponds to a plane in the three-dimensional space, so that the depth image is only used for determining the height of the region.
After determining the pixel points corresponding to the first class labels and located in the first area, a reference three-dimensional coordinate corresponding to each pixel point in the first area may be acquired based on the depth image, and calculation may be performed based on the reference three-dimensional coordinate to determine a three-dimensional coordinate (final three-dimensional coordinate) corresponding to the pixel point.
For each pixel point in the first area, after the reference three-dimensional coordinate corresponding to the current pixel point is acquired, a vertical component corresponding to the reference three-dimensional coordinate in the height direction may be acquired, and the vertical component corresponding to the reference three-dimensional coordinate in the height direction is determined as a first height value corresponding to the pixel point. Since the first region is the top region of the nth space corresponding to the nth panorama, the vertical component corresponding to the reference three-dimensional coordinate in the height direction is the component in the direction parallel to the height of the wall surface.
After the first height value corresponding to each pixel point in the first area is obtained, average value calculation is performed based on the first height value corresponding to each pixel point in the first area, a first average height value is obtained, and the first average height value is used as the height h1 of the first area.
And then, determining the real coordinate position (three-dimensional coordinate) of the pixel point in the three-dimensional space based on the first average height value h1, the panoramic pixel coordinate corresponding to the pixel point and a conversion formula for each pixel point in the first area, wherein the conversion formula is a calculation formula for converting the panoramic pixel point into a 3d coordinate point. After the three-dimensional coordinates (final three-dimensional coordinates) corresponding to each pixel point in the first area are obtained, the three-dimensional coordinates corresponding to each pixel point in the first area are aggregated, and a first coordinate set corresponding to the first area is obtained.
After determining the pixel points corresponding to the second class labels and located in the second area, a reference three-dimensional coordinate corresponding to each pixel point in the second area may be acquired based on the depth image, and calculation may be performed based on the reference three-dimensional coordinate to determine a three-dimensional coordinate (final three-dimensional coordinate) corresponding to the pixel point.
For each pixel point in the second area, after the reference three-dimensional coordinate corresponding to the current pixel point is acquired, a vertical component corresponding to the reference three-dimensional coordinate in the height direction may be acquired, and the vertical component corresponding to the reference three-dimensional coordinate in the height direction is determined as a second height value corresponding to the pixel point. Because the second area is the ground area corresponding to the nth space on the nth panorama, the vertical component corresponding to the reference three-dimensional coordinate in the height direction is the component in the direction parallel to the height of the wall surface.
After the second height value corresponding to each pixel point in the second area is obtained, average value calculation is performed based on the second height value corresponding to each pixel point in the second area, a second average height value is obtained, and the second average height value is used as the height h2 of the second area.
Then, for each pixel point in the second area, a real coordinate position (final three-dimensional coordinate) of the pixel point in the three-dimensional space is determined based on the second average height value h2, the panoramic pixel coordinate corresponding to the pixel point, and a conversion formula. After the three-dimensional coordinates (final three-dimensional coordinates) corresponding to each pixel point in the second area are obtained, the three-dimensional coordinates corresponding to each pixel point in the second area are aggregated, and a second coordinate set corresponding to the second area is obtained.
In the above embodiment, the first and second average height values may be determined from the depth image; the three-dimensional coordinate of each pixel point in the first area is then determined from the first average height value, its panoramic pixel coordinate and the conversion formula, and the three-dimensional coordinate of each pixel point in the second area is determined analogously from the second average height value. In this way, the first coordinate set corresponding to the first area and the second coordinate set corresponding to the second area are both obtained based on the depth image.
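To make the conversion formula concrete, the following Python sketch back-projects an equirectangular panorama pixel onto a horizontal plane of known height. It is an illustrative assumption based on the standard equirectangular mapping, not the claimed implementation; the function name, image-size parameters and axis conventions are all hypothetical.

```python
import math

def pixel_to_3d(u, v, width, height, plane_height):
    """Hypothetical conversion formula: map panoramic pixel coordinates
    (u, v) to a 3D point on the plane z = plane_height (e.g. h1 for the
    top area or h2 for the ground area), with the camera at the origin."""
    lon = (u / width) * 2.0 * math.pi - math.pi        # longitude in [-pi, pi)
    lat = math.pi / 2.0 - (v / height) * math.pi       # latitude in [-pi/2, pi/2]
    # Scale the viewing ray so that its vertical component reaches the plane.
    s = plane_height / math.sin(lat)                   # assumes sin(lat) != 0
    x = s * math.cos(lat) * math.cos(lon)
    y = s * math.cos(lat) * math.sin(lon)
    return (x, y, plane_height)
```

Applying such a formula to every pixel point of the first area with h1, and of the second area with h2, would yield the first and second coordinate sets described above.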
The process of determining the third and fourth coordinate sets is described below. The obtaining, according to at least one of the first coordinate set and the second coordinate set, a third coordinate set corresponding to at least a part of pixels located in a third area of the nth panorama and a fourth coordinate set corresponding to at least a part of pixels located in a fourth area of the nth panorama includes:
for each pixel point in the third area and the fourth area, searching for a first pixel point that is associated with the current pixel point in the column direction and is located in the first area or the second area, wherein the first pixel point intersects a second pixel point corresponding to the current pixel point, and the second pixel point is in the same column as the current pixel point and intersects the first area or the second area;
under the condition that the first pixel point is found, based on the first coordinate set or the second coordinate set, acquiring a three-dimensional coordinate corresponding to the first pixel point;
according to the three-dimensional coordinate corresponding to the first pixel point, acquiring a first distance between a projection point corresponding to a virtual camera in a three-dimensional live-action space model and the first pixel point associated with the current pixel point, wherein the three-dimensional live-action space model is the three-dimensional space model corresponding to the Nth space, and in the three-dimensional live-action space model the line connecting the projection point and the first pixel point is perpendicular to the column direction in which the current pixel point lies;
determining a depth value of the current pixel point based on the first distance and the panorama latitude angle corresponding to the current pixel point, and determining the three-dimensional coordinate of the current pixel point based on that depth value;
determining the third coordinate set according to the three-dimensional coordinates corresponding to at least part of the pixel points in the third region, and determining the fourth coordinate set according to the three-dimensional coordinates corresponding to at least part of the pixel points in the fourth region;
wherein the pixel points in the third region correspond to a third category label; the pixel points in the fourth region correspond to at least one fourth category label.
For each pixel point in the third and fourth areas, a first pixel point associated with the current pixel point in the column direction can be searched for in the first or second area; once it is found, the three-dimensional coordinate corresponding to the first pixel point is obtained from the first or second coordinate set.
When searching for the first pixel point associated with the current pixel point, the second pixel point that is in the same column as the current pixel point and intersects the first or second area can be found first. Specifically, a second pixel point in the same column as the current pixel point and intersecting the first area may be searched for first; if none is found, a second pixel point in the same column and intersecting the second area is searched for; and if that is not found either, it is determined that no second pixel point corresponds to the current pixel point.
After the second pixel point corresponding to the current pixel point is found, if it intersects the first area, the pixel point in the first area that intersects it is determined as the first pixel point; if it intersects the second area, the pixel point in the second area that intersects it is determined as the first pixel point. The first pixel point associated with the current pixel point is thereby obtained. When the first pixel point is located in the first area, its three-dimensional coordinate can be looked up directly in the first coordinate set corresponding to the first area; when it is located in the second area, its three-dimensional coordinate can be looked up directly in the second coordinate set corresponding to the second area.
After the three-dimensional coordinate corresponding to the first pixel point is obtained, the first distance between the projection point of the virtual camera on the target plane in the three-dimensional live-action space model and the first pixel point associated with the current pixel point in the column direction can be computed from that coordinate. The target plane may be the end surface of the three-dimensional live-action space model corresponding to the top of the Nth space or to the ground; the projection point corresponding to the virtual camera and the first pixel point both lie on the target plane, and in the model the line connecting the projection point of the virtual camera and the first pixel point is perpendicular to the column direction (height direction) in which the current pixel point lies. The three-dimensional live-action space model is the three-dimensional space model corresponding to the Nth space. The virtual camera may be placed at the coordinate origin of the model, but is not limited to that position: it may be spaced at some distance from the ground, the top end face and the wall surfaces, and may be set at any position.
After the first distance corresponding to the current pixel point is obtained, the depth value of the current pixel point is determined based on that first distance and the panorama latitude angle corresponding to the current pixel point, and the three-dimensional coordinate of the current pixel point is then determined based on the depth value; the three-dimensional coordinate of the current pixel point is thus computed from the three-dimensional coordinate of its matched first pixel point. When determining the three-dimensional coordinate of a pixel point from its depth value, the calculation may be performed using the depth value, the panoramic pixel coordinate of the pixel point, and the conversion formula.
When searching the first or second area for the first pixel point associated with the current pixel point in the column direction, the first area may be searched first; if no associated first pixel point is found there, the search continues in the second area, and if none is found there either, the computation of the three-dimensional coordinate of the current pixel point is simply abandoned. Of course, it is also possible to search the second area first and then the first area; this embodiment imposes no particular limitation on the order.
Because other objects may occlude the junction between the ground and the wall surface, or between the top and the wall surface, an associated first pixel point may not be found in either the second or the first area for some pixel points in the third area; likewise, because objects may be placed on the ground or suspended from the top, an associated first pixel point may not be found for some pixel points in the fourth area.
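As an illustration of the column search just described, the sketch below scans one panorama column of the segmentation label image for the boundary pixel adjoining the top area first, then the ground area. It is a sketch under assumed conventions: the label image is a 2D numpy-style array with row 0 at the top of the panorama, and the label ids and function name are hypothetical.

```python
CEILING_LABEL, FLOOR_LABEL = 0, 1   # hypothetical first/second class label ids

def find_first_pixel(labels, row, col):
    """Scan the column `col` for the first pixel point associated with the
    current pixel point at (row, col); returns (row, area) or None when
    neither the first nor the second area is reached (e.g. occlusion)."""
    # Search upward toward the top area (first area) first.
    for r in range(row, -1, -1):
        if labels[r, col] == CEILING_LABEL:
            return (r, "first")
    # Otherwise search downward toward the ground area (second area).
    for r in range(row, len(labels)):
        if labels[r, col] == FLOOR_LABEL:
            return (r, "second")
    return None   # abandon the 3D coordinate of the current pixel point
```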
The third area is the wall surface area of the Nth panorama corresponding to the Nth space, and its pixel points correspond to the third class label. The pixel points in the fourth area correspond to fourth class labels, of which there is at least one: since the fourth area may contain one or more pieces of furniture, different pieces of furniture may correspond to the same class label, or each piece of furniture may correspond to its own fourth class label.
A corresponding three-dimensional coordinate is calculated for each pixel point in the third area, and the third coordinate set is determined once the three-dimensional coordinates of at least part of those pixel points are obtained; likewise, after the three-dimensional coordinates of at least part of the pixel points in the fourth area are calculated, the fourth coordinate set may be determined.
In the above embodiment, for the pixel points in the third and fourth areas, the associated first pixel point may be searched for, and the three-dimensional coordinate of the current pixel point may be calculated from the three-dimensional coordinate of that first pixel point, so that the third and fourth coordinate sets are determined with the aid of the first and/or second coordinate sets.
As an optional embodiment, the determining the depth value of the current pixel point based on the first distance and the panorama latitude angle corresponding to the current pixel point includes:
determining the distance between the virtual camera and the current pixel point in the three-dimensional live-action space model based on the ratio of the first distance to the cosine value of the latitude angle of the panoramic image corresponding to the current pixel point;
and determining the distance between the virtual camera and the current pixel point as the depth value of the current pixel point.
The first distance is the distance between the projection point of the virtual camera on the target plane and the first pixel point associated with the current pixel point in the column direction; the panorama latitude angle a corresponding to the current pixel point is illustrated in fig. 4, in which d denotes the first distance, P the current pixel point, and Q the first pixel point associated with it. The distance between the virtual camera and the current pixel point in the three-dimensional live-action space model can be determined as the ratio of the first distance to the cosine of the panorama latitude angle corresponding to the current pixel point, and that distance is taken as the depth value of the current pixel point, so that the depth value is obtained by computation. Fig. 4 illustrates the case where the current pixel point lies in the wall surface area.
Taking fig. 4 as an example, the process of searching for the first pixel point associated with the current pixel point and calculating the depth value of the current pixel point is as follows. The current pixel point P lies in a wall area. The junction of the wall pixel points and the ground pixel points in the panorama column containing P is located, and the 3D coordinate Xq of the ground pixel point Q at that junction is taken; from it, the distance d between the projection of the virtual camera on the ground and the ground pixel point Q is obtained, where the line connecting that projection and Q is perpendicular to the straight line corresponding to the column direction of the current pixel point P.
From the panorama latitude angle a of point P and the distance d, the depth value of the current pixel point P is obtained by a trigonometric operation, and from it the 3D point coordinate Xp of P. If no junction with a ground pixel point is found, a junction with a ceiling pixel point is tried instead; if neither is found, the computation of the 3D point for the current pixel point is abandoned.
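In code, the trigonometric step reduces to a single division; the sketch below also shows how the depth value then gives the 3D coordinate Xp. The function names are hypothetical, and it assumes P does not lie on the camera's horizontal plane, so cos(a) is nonzero.

```python
import math

def depth_of_p(d, a):
    """Depth value of the current pixel point P: d is the horizontal
    distance from the camera to the wall column through P and Q, and a
    is the panorama latitude (elevation) angle of P, so the
    camera-to-P distance is d / cos(a)."""
    return d / math.cos(a)

def coordinate_of_p(depth, lon, lat):
    """3D point Xp: P's viewing direction scaled by its depth value."""
    return (depth * math.cos(lat) * math.cos(lon),
            depth * math.cos(lat) * math.sin(lon),
            depth * math.sin(lat))
```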
In the above embodiment, a trigonometric operation may be performed on the first distance and the panorama latitude angle corresponding to the current pixel point; the depth value of the current pixel point is determined from this operation, and the three-dimensional coordinate is then determined from the depth value.
The following describes the plane projection process. Projecting, based on the three-dimensional coordinates of part of the target pixel points in the target set, the part of the target pixel points carrying color feature values to obtain the planar point cloud display diagram corresponding to the Nth space includes:
screening out partial target pixel points meeting projection requirements from the target set based on a preset rule;
based on the three-dimensional coordinates of the partial target pixel points, projecting the partial target pixel points carrying color characteristic values to a preset plane, and obtaining a plane point cloud display diagram;
and determining the color characteristic value carried by the target pixel point based on the Nth panorama, wherein the pixel point positioned in the first area does not meet the projection requirement.
After the target set is determined, part of the target pixel points meeting the projection requirement can be screened out of the target set based on a preset rule, and those pixel points, carrying color feature values, are projected onto a preset plane based on their three-dimensional coordinates so as to obtain the planar point cloud display diagram through plane projection. Referring to fig. 5, a specific example of a planar point cloud display diagram corresponding to one room is shown; the colors of the different areas are not shown in fig. 5.
The virtual camera may be spaced at some distance from the top end face, the ground and the wall surfaces of the three-dimensional live-action space model, or may be set at any other position. Based on shooting by the virtual camera within the three-dimensional live-action space model, the Nth panorama corresponding to the Nth space can be obtained.
When pixel screening is performed based on the preset rule, the pixel points corresponding to the first area can be filtered out of the target set, thereby realizing the screening and obtaining the pixel points meeting the projection requirement. Since projecting the pixel points of the first area onto the ground area would degrade the projection effect, those pixel points need to be filtered out.
Alternatively, the target pixel points below the horizontal plane on which the virtual camera lies can be selected, thereby screening the target pixel points meeting the projection requirement out of the target set. Other screening strategies may of course be adopted and are not enumerated here. When a target pixel point carrying a color feature value is projected onto the preset plane, it is in effect projected onto the ground of the three-dimensional live-action space model.
To simplify the projection operation, only the necessary target pixel points may be selected from the target set, avoiding the heavy workload that would result from too many pixel points participating in the projection.
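The screening and projection step can be pictured with the following sketch, in which the below-camera-plane screening rule, the array layout and the grid resolution are assumptions made for illustration; the camera is taken as the origin and the z axis as the height direction.

```python
import numpy as np

def project_to_plan(points, colors, resolution=0.02):
    """Screen target pixel points and project them onto the ground plane
    to form a planar point cloud display diagram.

    points: (M, 3) array of target-pixel 3D coordinates (camera at origin);
    colors: (M, 3) RGB values taken from the Nth panorama."""
    keep = points[:, 2] < 0.0                 # keep points below the camera's horizontal plane
    pts, cols = points[keep], colors[keep]    # assumes at least one point survives screening
    # Orthographic projection: drop the height component, quantize to a grid.
    uv = np.round(pts[:, :2] / resolution).astype(int)
    uv -= uv.min(axis=0)                      # shift into non-negative pixel indices
    canvas = np.zeros((uv[:, 1].max() + 1, uv[:, 0].max() + 1, 3), dtype=np.uint8)
    canvas[uv[:, 1], uv[:, 0]] = cols         # later points overwrite earlier ones
    return canvas
```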
In the overall implementation of the point cloud display diagram generation method provided in this embodiment of the application, an Nth planar point cloud display diagram corresponding to the Nth space and an (N+1)th planar point cloud display diagram corresponding to the (N+1)th space, a space associated with the Nth space through the spatial structure, are generated, and the two are stitched based on image registration to obtain a target point cloud display diagram. Whenever another space that has not yet participated in stitching is detected, a further panorama is acquired, a planar point cloud display diagram is generated from it, and that diagram joins the stitching. Because a matched planar point cloud display diagram is generated after each new panorama is acquired and is stitched with the associated planar point cloud display diagram, the panorama of one space can be processed into a planar point cloud display diagram, and stitched, while the next space is being photographed. Executing these different operations in parallel improves the generation efficiency of the point cloud display diagram.
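Schematically, the incremental flow described above can be summarized as follows; every function name here is a placeholder for a step of the method, not an actual API, and in practice the shooting of the next space may run concurrently with the processing of the previous one.

```python
def build_target_point_cloud(spaces):
    """Sketch of the incremental stitching loop over the spaces of the target space."""
    pano = shoot_panorama(spaces[0])                     # Nth panorama
    plan = generate_planar_point_cloud(pano)             # Nth planar point cloud display diagram
    for next_space in spaces[1:]:                        # spaces associated by the spatial structure
        next_pano = shoot_panorama(next_space)           # (N+1)th panorama
        next_plan = generate_planar_point_cloud(next_pano)
        pose = register(next_pano, pano)                 # image registration -> relative pose
        plan = stitch(plan, next_plan, pose)             # target point cloud display diagram
        pano = next_pano                                 # merged spaces become the new "Nth" input
    return plan
```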
By calculating the three-dimensional coordinates of the pixel points based on the panoramic segmentation image and the depth image, the calculation accuracy of the three-dimensional coordinates of pixel points at the edges of the scene can be improved, a planar point cloud display diagram of better quality can be obtained, the accuracy of positioning judgment is improved, and the display effect of the planar point cloud display diagram is optimized. No professional equipment is needed, so dependence on equipment is reduced and cost is saved while the point cloud display effect is ensured. By presenting the point cloud effect through the planar point cloud display diagram, a new form of image display, the visual experience of the user is improved.
By registering the panoramas corresponding to the two spaces against each other, the relative pose of the panoramas is obtained and the planar point cloud display diagrams are stitched according to that relative pose; the panoramas thus serve as the registration objects, and the target point cloud display diagram is obtained by stitching the planar point cloud display diagrams based on the registration of the panoramas.
Acquiring the depth image from a depth map model and the panoramic segmentation image from a semantic segmentation model means that the required images come from maturely trained models, which guarantees quality while improving processing efficiency. Determining the third and fourth coordinate sets based on the first and/or second coordinate sets means that the three-dimensional coordinates of the pixel points in the other areas are computed from already-available coordinate sets, which simplifies the processing flow and improves processing efficiency.
The embodiment of the application provides a point cloud display diagram generating device, as shown in fig. 6, including:
a first generation module 601, configured to generate an Nth planar point cloud display diagram according to an Nth panorama obtained by panoramic shooting of an Nth space of the target space;
a second generating module 602, configured to generate an (N+1)th planar point cloud display diagram according to an (N+1)th panorama when the (N+1)th panorama of an (N+1)th space is acquired based on panorama shooting, wherein the (N+1)th space is a space in the target space which is determined based on the spatial structure and is associated with the Nth space; the planar point cloud display diagrams corresponding to the Nth space and the (N+1)th space are images generated by projecting, based on the three-dimensional coordinates corresponding to part of the pixel points in the corresponding panorama, those pixel points carrying color feature values; the three-dimensional coordinates corresponding to the pixel points are determined based on a depth image and a panoramic segmentation image, and the depth image and the panoramic segmentation image are generated based on the panorama;
the stitching obtaining module 603 is configured to stitch the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram based on image registration, to obtain a target point cloud display diagram;
a determining module 604, configured to determine whether another space that does not participate in the stitching exists in the target space, where the processing is ended if the other space does not exist;
and the processing module 605 is configured to, when another space exists, merge the Nth space and the (N+1)th space as the Nth space, take the target point cloud display diagram as the Nth planar point cloud display diagram, and return to the second generating module 602 to continue processing.
Optionally, the space associated based on the spatial structure is a neighboring space; the splicing acquisition module comprises:
the registration acquisition sub-module is used for carrying out image registration on the (N+1) -th panoramic image and the (N) -th panoramic image to acquire the relative pose of the (N+1) -th panoramic image and the (N) -th panoramic image;
and the splicing and acquiring sub-module is used for splicing the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram according to the relative pose so as to acquire the target point cloud display diagram.
Optionally, the apparatus further comprises:
The acquisition module is used for acquiring the target point cloud display diagram corresponding to the target space under the condition that the planar point cloud display diagrams corresponding to all the spaces of the target space are determined to be spliced.
Optionally, the first generating module includes:
the first acquisition sub-module is used for acquiring a depth image and a panoramic segmentation image corresponding to the Nth panoramic image based on the Nth panoramic image;
the second obtaining submodule is used for obtaining three-dimensional coordinates corresponding to each target pixel point in a target set according to the depth image and the panoramic segmentation image corresponding to the Nth panoramic image, and the target set comprises at least part of pixel points in the Nth panoramic image;
the projection acquisition sub-module is used for projecting part of the target pixel points carrying color feature values based on the three-dimensional coordinates of the part of the target pixel points in the target set, to acquire the planar point cloud display diagram corresponding to the Nth space, wherein the part of the target pixel points are pixel points meeting the projection requirement.
Optionally, the first obtaining submodule includes:
the first acquisition unit is used for predicting the depth value of the pixel point in the Nth panorama based on a depth map model, and acquiring the depth image;
The second obtaining unit is used for carrying out category prediction on the pixel points in the nth panorama based on the semantic segmentation model, and obtaining the panorama segmentation image carrying category labels corresponding to the pixel points in the nth panorama.
Optionally, the second acquisition submodule includes:
a first determining unit, configured to determine, according to the panoramic segmented image, all pixel points in the Nth panorama corresponding to a first class label and all pixel points corresponding to a second class label, wherein the panoramic segmented image includes the class labels corresponding to the pixel points in the Nth panorama, the pixel points corresponding to the first class label are located in a first area, and the pixel points corresponding to the second class label are located in a second area;
a second determining unit, configured to determine, based on the depth image, a first coordinate set including three-dimensional coordinates corresponding to the pixel points of the first area and a second coordinate set including three-dimensional coordinates corresponding to the pixel points of the second area;
a third obtaining unit, configured to obtain, according to at least one of the first coordinate set and the second coordinate set, a third coordinate set corresponding to at least a part of pixels located in a third area of the nth panorama, and obtain a fourth coordinate set corresponding to at least a part of pixels located in a fourth area of the nth panorama;
The pixel points corresponding to the first coordinate set, the second coordinate set, the third coordinate set and the fourth coordinate set are all target pixel points;
the first area, the second area and the third area are respectively a top area, a ground area and a wall area of the nth space corresponding to the nth panorama, and the fourth area is an area which is different from the first area, the second area and the third area in the nth panorama.
Optionally, the second determining unit includes:
the first acquisition subunit is used for acquiring a first height value corresponding to each pixel point in the first area and a second height value corresponding to each pixel point in the second area based on the depth image, wherein the height value corresponding to each pixel point is a vertical component corresponding to a reference three-dimensional coordinate determined by the pixel point based on the depth image in the height direction;
the first determining subunit is used for determining a first average height value according to the first height values respectively corresponding to the pixel points in the first area;
a second determining subunit, configured to determine a second average height value according to second height values corresponding to each pixel point in the second area respectively;
The third determining subunit is configured to determine, according to the first average height value, the panoramic pixel coordinate corresponding to each pixel point in the first area, and a conversion formula, a three-dimensional coordinate corresponding to each pixel point in the first area, and determine the first coordinate set based on the three-dimensional coordinate corresponding to each pixel point in the first area;
a fourth determining subunit, configured to determine, according to the second average height value, the panoramic pixel coordinate and the conversion formula corresponding to each pixel point in the second area, a three-dimensional coordinate corresponding to each pixel point in the second area, and determine the second coordinate set based on the three-dimensional coordinate corresponding to each pixel point in the second area;
the conversion formula is used for converting panoramic pixel coordinates into three-dimensional coordinates.
Optionally, the third obtaining unit includes:
a searching subunit, configured to search, for each pixel point in the third area and the fourth area, a first pixel point associated with a current pixel point in a column direction and located in the first area or the second area, where the first pixel point intersects a second pixel point corresponding to the current pixel point, and the second pixel point intersects the first area or the second area in the same column as the current pixel point;
The second acquisition subunit is used for acquiring the three-dimensional coordinates corresponding to the first pixel point based on the first coordinate set or the second coordinate set under the condition that the first pixel point is found;
the third obtaining subunit is configured to obtain, according to the three-dimensional coordinate corresponding to the first pixel point, a first distance between a projection point corresponding to a virtual camera in a three-dimensional live-action space model and a first pixel point associated with a current pixel point, where the three-dimensional live-action space model is a three-dimensional space model corresponding to the nth space, and in the three-dimensional live-action space model, a connection line between the projection point and the first pixel point is perpendicular to a column direction in which the current pixel point is located;
a fifth determining subunit, configured to determine a depth value of the current pixel point based on the first distance and a panorama latitude angle corresponding to the current pixel point, and determine a three-dimensional coordinate of the current pixel point based on the depth value of the current pixel point;
a sixth determining subunit, configured to determine the third coordinate set according to three-dimensional coordinates corresponding to at least some pixel points in the third area, and determine the fourth coordinate set according to three-dimensional coordinates corresponding to at least some pixel points in the fourth area;
Wherein the pixel points in the third region correspond to a third category label; the pixel points in the fourth region correspond to at least one fourth category label.
Optionally, the fifth determining subunit is further configured to:
determining the distance between the virtual camera and the current pixel point in the three-dimensional live-action space model based on the ratio of the first distance to the cosine value of the latitude angle of the panoramic image corresponding to the current pixel point;
and determining the distance between the virtual camera and the current pixel point as the depth value of the current pixel point.
Optionally, the projection acquisition submodule includes:
the screening unit is used for screening out partial target pixel points meeting projection requirements from the target set based on a preset rule;
the projection acquisition unit is used for projecting the partial target pixel points carrying the color characteristic values to a preset plane based on the three-dimensional coordinates of the partial target pixel points, and acquiring the plane point cloud display diagram;
and determining the color characteristic value carried by the target pixel point based on the Nth panorama, wherein the pixel point positioned in the first area does not meet the projection requirement.
For the device embodiments, since they are substantially similar to the method embodiments, the description is relatively simple, and reference is made to the description of the method embodiments for relevant points.
The embodiment of the application also provides an electronic device, comprising a processor, a memory, and a computer program stored in the memory and executable on the processor. When the computer program is executed by the processor, the processes of the above point cloud display diagram generation method embodiments are realized and the same technical effects can be achieved; to avoid repetition, they are not described here again.
For example, fig. 7 shows a schematic diagram of the physical structure of an electronic device. As shown in fig. 7, the electronic device may include: processor 710, communication interface (Communications Interface) 720, memory 730, and communication bus 740, wherein processor 710, communication interface 720, memory 730 communicate with each other via communication bus 740. The processor 710 may invoke logic instructions in the memory 730, and the processor 710 is configured to perform the steps in the point cloud display diagram generation method according to any of the embodiments described above.
Further, the logic instructions in the memory 730 described above may be implemented in the form of software functional units and may be stored in a computer readable storage medium when sold or used as a stand alone product. Based on such understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the methods described in the embodiments of the present application.
The embodiment of the application also provides a computer readable storage medium, on which a computer program is stored, which when executed by a processor, implements each process of the above embodiment of the point cloud display diagram generating method, and can achieve the same technical effect, so that repetition is avoided, and no further description is provided herein. Wherein the computer readable storage medium is selected from Read-Only Memory (ROM), random access Memory (Random Access Memory, RAM), magnetic disk or optical disk.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
From the above description of the embodiments, it will be clear to those skilled in the art that the above-described embodiment method may be implemented by means of software plus a necessary general hardware platform, but of course may also be implemented by means of hardware, but in many cases the former is a preferred embodiment. Based on such understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art in the form of a software product stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk), including several instructions for causing a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the method described in the embodiments of the present application.
The embodiments of the present application have been described above with reference to the accompanying drawings, but the present application is not limited to the above-described embodiments, which are merely illustrative and not restrictive, and many forms may be made by those of ordinary skill in the art without departing from the spirit of the present application and the scope of the claims, which are also within the protection of the present application.
Those of ordinary skill in the art will appreciate that the elements and algorithm steps described in connection with the embodiments disclosed herein can be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, and are not repeated herein.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of the units is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical or other form.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in each embodiment of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the methods described in the embodiments of the present application. And the aforementioned storage medium includes: a usb disk, a removable hard disk, a ROM, a RAM, a magnetic disk, or an optical disk, etc.
The foregoing is merely specific embodiments of the present application, but the scope of the present application is not limited thereto, and any person skilled in the art can easily think about changes or substitutions within the technical scope of the present application, and the changes and substitutions are intended to be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (12)

1. A point cloud display diagram generation method, characterized by comprising the following steps:
step 1, generating an Nth planar point cloud display diagram according to an Nth panorama obtained by panoramic shooting of an Nth space of a target space;
step 2, generating an (N+1)th planar point cloud display diagram according to the (N+1)th panorama when the (N+1)th panorama of the (N+1)th space is acquired based on panorama shooting, wherein the (N+1)th space is a space in the target space which is determined based on a spatial structure and is associated with the Nth space; the planar point cloud display diagrams corresponding to the Nth space and the (N+1)th space are images generated by projecting, based on the three-dimensional coordinates corresponding to part of the pixel points in the corresponding panorama, those pixel points carrying color feature values; the three-dimensional coordinates corresponding to the pixel points are determined based on a depth image and a panoramic segmentation image, and the depth image and the panoramic segmentation image are generated based on the panorama;
step 3, based on image registration, stitching the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram to obtain a target point cloud display diagram;
step 4, judging whether other spaces which do not participate in splicing exist in the target space, if so, executing step 5, and if not, ending the processing;
step 5, merging the Nth space and the (N+1)th space to be used as the Nth space, taking the target point cloud display diagram as the Nth planar point cloud display diagram, and returning to step 2;
the three-dimensional coordinates corresponding to the pixel points are determined based on the depth image and the panoramic segmentation image, and the three-dimensional coordinates comprise:
determining all pixel points corresponding to a first class label and all pixel points corresponding to a second class label in the Nth panorama according to the panorama segmentation image corresponding to the Nth panorama, wherein the panorama segmentation image comprises class labels respectively corresponding to all pixel points in the Nth panorama, the pixel points corresponding to the first class label are located in a first area, and the pixel points corresponding to the second class label are located in a second area;
determining a first coordinate set comprising three-dimensional coordinates corresponding to the pixel points of the first area and a second coordinate set comprising three-dimensional coordinates corresponding to the pixel points of the second area based on the depth image corresponding to the Nth panorama;
According to at least one of the first coordinate set and the second coordinate set, a third coordinate set corresponding to at least part of pixel points in a third area of the N-th panorama is obtained, and a fourth coordinate set corresponding to at least part of pixel points in a fourth area of the N-th panorama is obtained;
the pixel points corresponding to the first coordinate set, the second coordinate set, the third coordinate set and the fourth coordinate set are all target pixel points;
the first area, the second area and the third area are respectively a top area, a ground area and a wall area of the nth space corresponding to the nth panorama, and the fourth area is an area which is different from the first area, the second area and the third area in the nth panorama.
2. The method of claim 1, wherein the space associated based on the spatial structure is a contiguous space; the stitching, based on image registration, of the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram to obtain a target point cloud display diagram includes:
performing image registration on the (N+1) -th panoramic image and the (N) -th panoramic image to obtain the relative pose of the (N+1) -th panoramic image and the (N) -th panoramic image;
and according to the relative pose, stitching the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram to acquire the target point cloud display diagram.
3. The method of claim 1, wherein after taking the target point cloud representation as the nth plane point cloud representation and returning to step 2, the method further comprises:
and under the condition that the planar point cloud display diagrams corresponding to all the spaces of the target space are determined to be spliced, acquiring the target point cloud display diagrams corresponding to the target space.
4. The method of claim 1, wherein generating the Nth planar point cloud display diagram from the Nth panorama acquired by panoramic shooting of the Nth space of the target space comprises:
acquiring a depth image and a panoramic segmentation image corresponding to the Nth panorama based on the Nth panorama; acquiring three-dimensional coordinates corresponding to each target pixel point in a target set according to the depth image and the panoramic segmentation image corresponding to the Nth panorama, wherein the target set comprises at least part of the pixel points in the Nth panorama; and based on the three-dimensional coordinates of part of the target pixel points in the target set, projecting the part of the target pixel points carrying color feature values to obtain a planar point cloud display diagram corresponding to the Nth space, wherein the part of the target pixel points are pixel points meeting projection requirements.
5. The method of claim 4, wherein acquiring a depth image and a panoramic segmented image corresponding to the nth panorama based on the nth panorama comprises:
Predicting the depth value of a pixel point in the Nth panorama based on a depth map model, and acquiring the depth image;
and carrying out category prediction on the pixel points in the Nth panoramic image based on a semantic segmentation model, and obtaining the panoramic segmentation image carrying category labels corresponding to the pixel points in the Nth panoramic image.
6. The method of claim 1, wherein the determining, based on the depth image, a first set of coordinates including three-dimensional coordinates corresponding to pixels of the first region, a second set of coordinates including three-dimensional coordinates corresponding to pixels of the second region, comprises:
acquiring a first height value corresponding to each pixel point in the first area and a second height value corresponding to each pixel point in the second area based on the depth image, wherein the height value corresponding to each pixel point is a vertical component corresponding to a reference three-dimensional coordinate determined by the pixel point based on the depth image in the height direction;
determining a first average height value according to the first height values respectively corresponding to the pixel points in the first area;
determining a second average height value according to second height values respectively corresponding to the pixel points in the second area;
According to the first average height value, panoramic pixel coordinates corresponding to each pixel point in the first area and a conversion formula, determining three-dimensional coordinates corresponding to each pixel point in the first area, and determining the first coordinate set based on the three-dimensional coordinates corresponding to each pixel point in the first area;
determining three-dimensional coordinates corresponding to each pixel point in the second area according to the second average height value, the panoramic pixel coordinates corresponding to each pixel point in the second area and a conversion formula, and determining the second coordinate set based on the three-dimensional coordinates corresponding to each pixel point in the second area;
the conversion formula is used for converting panoramic pixel coordinates into three-dimensional coordinates.
7. The method of claim 1, wherein the obtaining a third coordinate set corresponding to at least a portion of pixels located in a third region of the nth panorama and a fourth coordinate set corresponding to at least a portion of pixels located in a fourth region of the nth panorama according to at least one of the first coordinate set and the second coordinate set comprises:
for each pixel point in the third area and the fourth area, searching a first pixel point which is associated with the current pixel point in the column direction and is positioned in the first area or the second area, wherein the first pixel point is intersected with a second pixel point corresponding to the current pixel point, and the second pixel point is in the same column with the current pixel point and is intersected with the first area or the second area;
Under the condition that the first pixel point is found, based on the first coordinate set or the second coordinate set, acquiring a three-dimensional coordinate corresponding to the first pixel point;
according to the three-dimensional coordinates corresponding to the first pixel points, a first distance between a projection point corresponding to a virtual camera in a three-dimensional live-action space model and a first pixel point associated with a current pixel point is obtained, the three-dimensional live-action space model is a three-dimensional space model corresponding to the N-th space, and in the three-dimensional live-action space model, a connecting line of the projection point and the first pixel point is perpendicular to the column direction of the current pixel point;
determining a depth value of the current pixel point based on the first distance and the panorama latitude angle corresponding to the current pixel point, and determining a three-dimensional coordinate of the current pixel point based on the depth value of the current pixel point;
determining the third coordinate set according to the three-dimensional coordinates corresponding to at least part of the pixel points in the third region, and determining the fourth coordinate set according to the three-dimensional coordinates corresponding to at least part of the pixel points in the fourth region;
wherein the pixel points in the third region correspond to a third category label; the pixel points in the fourth region correspond to at least one fourth category label.
8. The method of claim 7, wherein determining the depth value of the current pixel based on the first distance and the panorama latitude angle corresponding to the current pixel comprises:
determining the distance between the virtual camera and the current pixel point in the three-dimensional live-action space model based on the ratio of the first distance to the cosine value of the latitude angle of the panoramic image corresponding to the current pixel point;
and determining the distance between the virtual camera and the current pixel point as the depth value of the current pixel point.
9. The method according to claim 4, wherein the projecting the part of the target pixel points carrying color feature values based on the three-dimensional coordinates of the part of the target pixel points in the target set to obtain the planar point cloud display diagram corresponding to the Nth space includes:
screening out partial target pixel points meeting projection requirements from the target set based on a preset rule;
based on the three-dimensional coordinates of the partial target pixel points, projecting the partial target pixel points carrying color characteristic values to a preset plane, and obtaining a plane point cloud display diagram;
and determining the color characteristic value carried by the target pixel point based on the Nth panorama, wherein the pixel point positioned in the first area does not meet the projection requirement.
10. A point cloud display diagram generation apparatus, characterized by comprising:
the first generation module is used for generating an Nth planar point cloud display diagram according to an Nth panorama obtained by panoramic shooting of an Nth space of the target space;
the second generation module is used for generating an (N+1)th planar point cloud display diagram according to the (N+1)th panorama when the (N+1)th panorama of the (N+1)th space is acquired based on panorama shooting, wherein the (N+1)th space is a space in the target space which is determined based on a spatial structure and is associated with the Nth space; the planar point cloud display diagrams corresponding to the Nth space and the (N+1)th space are images generated by projecting, based on the three-dimensional coordinates corresponding to part of the pixel points in the corresponding panorama, those pixel points carrying color feature values; the three-dimensional coordinates corresponding to the pixel points are determined based on a depth image and a panoramic segmentation image, and the depth image and the panoramic segmentation image are generated based on the panorama;
the splicing acquisition module is used for splicing the (N+1)th planar point cloud display diagram and the Nth planar point cloud display diagram based on image registration to acquire a target point cloud display diagram;
the judging module is used for judging whether other spaces which do not participate in splicing exist in the target space or not, wherein the processing is ended under the condition that the other spaces do not exist;
The processing module is used for merging the nth space and the (n+1) th space to be used as the nth space when other spaces exist, taking the target point cloud display diagram as the nth plane point cloud display diagram, and returning to the second generating module to continue processing;
the three-dimensional coordinates corresponding to the pixel points are determined based on the depth image and the panoramic segmentation image, and the three-dimensional coordinates comprise:
determining all pixel points corresponding to a first class label and all pixel points corresponding to a second class label in the Nth panorama according to the panorama segmentation image corresponding to the Nth panorama, wherein the panorama segmentation image comprises class labels respectively corresponding to all pixel points in the Nth panorama, the pixel points corresponding to the first class label are located in a first area, and the pixel points corresponding to the second class label are located in a second area;
determining a first coordinate set comprising three-dimensional coordinates corresponding to the pixel points of the first area and a second coordinate set comprising three-dimensional coordinates corresponding to the pixel points of the second area based on the depth image corresponding to the Nth panorama;
according to at least one of the first coordinate set and the second coordinate set, a third coordinate set corresponding to at least part of pixel points in a third area of the N-th panorama is obtained, and a fourth coordinate set corresponding to at least part of pixel points in a fourth area of the N-th panorama is obtained;
The pixel points corresponding to the first coordinate set, the second coordinate set, the third coordinate set and the fourth coordinate set are all target pixel points;
the first area, the second area and the third area are respectively a top area, a ground area and a wall area of the nth space corresponding to the nth panorama, and the fourth area is an area which is different from the first area, the second area and the third area in the nth panorama.
11. An electronic device comprising a processor, a memory and a computer program stored on the memory and executable on the processor, which when executed by the processor implements the steps of the point cloud display generation method of any of claims 1 to 9.
12. A computer-readable storage medium, on which a computer program is stored, which computer program, when being executed by a processor, implements the steps of the point cloud representation generation method according to any of claims 1 to 9.
CN202310377559.5A 2023-04-10 2023-04-10 Point cloud display diagram generation method and device, electronic equipment and storage medium Active CN116485634B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310377559.5A CN116485634B (en) 2023-04-10 2023-04-10 Point cloud display diagram generation method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310377559.5A CN116485634B (en) 2023-04-10 2023-04-10 Point cloud display diagram generation method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN116485634A CN116485634A (en) 2023-07-25
CN116485634B true CN116485634B (en) 2024-04-02

Family

ID=87222481

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310377559.5A Active CN116485634B (en) 2023-04-10 2023-04-10 Point cloud display diagram generation method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116485634B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9304582B1 (en) * 2013-12-19 2016-04-05 Amazon Technologies, Inc. Object-based color detection and correction
KR101817140B1 (en) * 2016-12-30 2018-01-10 동의대학교 산학협력단 Coding Method and Device for Depth Video Plane Modeling
CN111798402A (en) * 2020-06-09 2020-10-20 同济大学 Power equipment temperature measurement data visualization method and system based on three-dimensional point cloud model
CN112055192A (en) * 2020-08-04 2020-12-08 北京城市网邻信息技术有限公司 Image processing method, image processing apparatus, electronic device, and storage medium
CN113077500A (en) * 2021-03-12 2021-07-06 上海杰图天下网络科技有限公司 Panoramic viewpoint positioning and attitude determining method, system, equipment and medium based on plane graph
CN114399597A (en) * 2022-01-12 2022-04-26 贝壳找房(北京)科技有限公司 Method and device for constructing scene space model and storage medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NZ782222A (en) * 2019-04-12 2023-10-27 Beijing Chengshi Wanglin Information Tech Co Ltd Three-dimensional object modeling method, image processing method, and image processing device
CN111722245B (en) * 2020-06-22 2023-03-10 阿波罗智能技术(北京)有限公司 Positioning method, positioning device and electronic equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9304582B1 (en) * 2013-12-19 2016-04-05 Amazon Technologies, Inc. Object-based color detection and correction
KR101817140B1 (en) * 2016-12-30 2018-01-10 동의대학교 산학협력단 Coding Method and Device for Depth Video Plane Modeling
CN111798402A (en) * 2020-06-09 2020-10-20 同济大学 Power equipment temperature measurement data visualization method and system based on three-dimensional point cloud model
CN112055192A (en) * 2020-08-04 2020-12-08 北京城市网邻信息技术有限公司 Image processing method, image processing apparatus, electronic device, and storage medium
CN113077500A (en) * 2021-03-12 2021-07-06 上海杰图天下网络科技有限公司 Panoramic viewpoint positioning and attitude determining method, system, equipment and medium based on plane graph
CN114399597A (en) * 2022-01-12 2022-04-26 贝壳找房(北京)科技有限公司 Method and device for constructing scene space model and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Design and Development of a Point Cloud Processing System Based on PCL and Qt; Yang Zexin; Peng Lincai; Liu Dingning; Ding Qiong; Journal of Guangdong University of Technology (06); 65-71 *

Also Published As

Publication number Publication date
CN116485634A (en) 2023-07-25

Similar Documents

Publication Publication Date Title
US10872467B2 (en) Method for data collection and model generation of house
CN106971403B (en) Point cloud image processing method and device
US9984177B2 (en) Modeling device, three-dimensional model generation device, modeling method, program and layout simulator
CN104835138B (en) Make foundation drawing picture and Aerial Images alignment
JP3437555B2 (en) Specific point detection method and device
CN116485633A (en) Point cloud display diagram generation method and device, electronic equipment and storage medium
EP2913796A1 (en) Method of generating panorama views on a mobile mapping system
AU2019281667A1 (en) Data collection and model generation method for house
KR101260132B1 (en) Stereo matching process device, stereo matching process method, and recording medium
JP2007183949A (en) Method and apparatus for providing panoramic view with improved image matching speed and blending method
US10733777B2 (en) Annotation generation for an image network
US20200258285A1 (en) Distributed computing systems, graphical user interfaces, and control logic for digital image processing, visualization and measurement derivation
CN113298928A (en) House three-dimensional reconstruction method, device, equipment and storage medium
JP6425511B2 (en) Method of determining feature change and feature change determination apparatus and feature change determination program
JP7241812B2 (en) Information visualization system, information visualization method, and program
CN116485634B (en) Point cloud display diagram generation method and device, electronic equipment and storage medium
JP2006318015A (en) Image processing device, image processing method, image display system, and program
CN116596741B (en) Point cloud display diagram generation method and device, electronic equipment and storage medium
JP2003319388A (en) Image processing method and apparatus
CN110191284B (en) Method and device for collecting data of house, electronic equipment and storage medium
KR101700651B1 (en) Apparatus for tracking object using common route date based on position information
JPH06348815A Method for setting three-dimensional model of building aspect in CG system
CN113836337B (en) BIM display method, device, equipment and storage medium
CN114640800B (en) Camera arrangement method and system
CN116527663B (en) Information processing method, information processing device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant