CN115496818B - Semantic graph compression method and device based on dynamic object segmentation - Google Patents

Semantic graph compression method and device based on dynamic object segmentation Download PDF

Info

Publication number
CN115496818B
CN115496818B CN202211390992.4A CN202211390992A CN115496818B CN 115496818 B CN115496818 B CN 115496818B CN 202211390992 A CN202211390992 A CN 202211390992A CN 115496818 B CN115496818 B CN 115496818B
Authority
CN
China
Prior art keywords
semantic
dynamic object
semantic graph
graph
pixel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202211390992.4A
Other languages
Chinese (zh)
Other versions
CN115496818A (en
Inventor
高健健
华炜
明彬彬
谢天
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Lab
Original Assignee
Zhejiang Lab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Lab filed Critical Zhejiang Lab
Priority to CN202211390992.4A priority Critical patent/CN115496818B/en
Publication of CN115496818A publication Critical patent/CN115496818A/en
Application granted granted Critical
Publication of CN115496818B publication Critical patent/CN115496818B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/70Labelling scene content, e.g. deriving syntactic or semantic representations

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a semantic graph compression method and a device based on dynamic object segmentation, wherein the method comprises the steps of firstly dividing a simulation scene into a static background and a dynamic object, and drawing the simulation scene to obtain a semantic graph; segmenting all dynamic objects in the semantic graph to obtain semantic subgraphs, and filling the static background semantic graph with adjacent pixels of the semantic subgraphs of the dynamic objects; and finally, coding all dynamic object semantic subgraphs and the filled static background semantic graph respectively by using a coding algorithm. The method separates the static background from the dynamic object, and splits a semantic graph into a static background semantic graph and a plurality of dynamic object semantic subgraphs, thereby reducing the pixel mutation in the semantic graph, increasing the continuity of data distribution and obviously improving the compression ratio of semantic graph coding.

Description

Semantic graph compression method and device based on dynamic object segmentation
Technical Field
The invention relates to the field of image data compression, in particular to a semantic graph compression method and device based on dynamic object segmentation.
Background
As a core technology of computer vision and image understanding, an object detection algorithm is an important research direction in the field of computer vision and is also the basis of other complex visual tasks. The application scenes of the target detection algorithm comprise image description, scene understanding, image segmentation, target tracking, event detection and the like, and the target detection algorithm is also widely applied to judgment scenes such as automatic driving, medical images and unmanned aerial vehicle navigation.
The target detection algorithm based on machine learning needs to provide a large amount of truth data in the training process, and the truth data mainly comprises a semantic segmentation graph, namely a semantic graph. The semantic graph divides a picture or a video into a plurality of blocks according to the difference of the categories, so as to realize the classification of the pixel level. The semantic graph truth value is generally generated by adopting manual labeling or a simulation program automatically. The simulation algorithm has a high degree of customization and can generate massive data quickly, so that the simulation algorithm becomes one of important methods for generating a truth value data set of a semantic map. The traditional simulation semantic graph is not generally coded or directly coded by a compression algorithm such as run-length coding, and the characteristics of a simulation scene are not fully considered, so that the compression rate is low, and the efficiency of storage and data transmission is limited.
Disclosure of Invention
In order to solve the defects of the prior art and realize more efficient semantic graph compression, the invention adopts the following technical scheme:
a semantic graph compression method based on dynamic object segmentation comprises the following steps:
s1, initializing a simulation scene, wherein the simulation scene consists of a static background and a dynamic object;
s2, updating and drawing the simulation scene to obtain a semantic graph and two-dimensional bounding volumes of all dynamic objects under a semantic graph coordinate system;
s3, segmenting the semantic data of the dynamic object by using the two-dimensional bounding volume to form a plurality of dynamic object semantic subgraphs; filling the corresponding image area by using the adjacent pixels of each dynamic object semantic sub-image in the rest static background semantic graph;
and S4, respectively coding all dynamic object semantic subgraphs and the filled static background semantic graphs by using a coding algorithm.
Furthermore, all objects of the simulation scene in the S1 are assigned with an ID, and the IDs of the objects with the same semantic meaning are the same; each ID uniquely corresponds to one color, and different IDs correspond to different colors.
Further, updating the simulation scene in S2 includes updating the poses of all dynamic objects and rendering views; each pixel in the semantic graph uniquely corresponds to one object, and the color corresponding to the object ID is used for coloring.
Further, in S2, the two-dimensional bounding volume of the dynamic object in the semantic graph coordinate system is to contain all pixels of the dynamic object on the semantic graph.
Further, in S3, the semantic data of the dynamic object is segmented by using the two-dimensional bounding volume in S2 to form a plurality of semantic subgraphs, which is specifically realized by the following sub-steps:
(1) Initializing corresponding semantic subgraph data according to the size of a two-dimensional bounding volume of each dynamic object, wherein each element of the semantic subgraph is initialized to be (R: 0, G:0, B: 0);
(2) Traversing each pixel of the semantic graph, and for each pixel, performing the following processing:
and judging whether the pixel coordinate of the pixel is positioned in the range of a two-dimensional surrounding body of a certain dynamic object, if so, calculating the relative coordinate of the pixel under the semantic sub-image coordinate system of the dynamic object, writing the relative coordinate into a corresponding element of the semantic sub-image of the dynamic object, otherwise, ignoring the pixel, and continuously traversing the next pixel to obtain the semantic sub-image of the certain dynamic object.
Further, in S3, the remaining static background semantic graphs are obtained by filling corresponding image regions with pixels adjacent to each dynamic object semantic sub-graph, through the following sub-steps:
(1) Searching edge pixels around the dynamic object semantic subgraph, and if the edge pixels exceed the range of the semantic graph, determining the edge pixels to be invalid;
(2) And counting all effective edge pixels, and filling an image area corresponding to the dynamic object semantic sub-image in the semantic graph according to the values of the edge pixels.
Further, in S4, the encoding result of each dynamic object semantic subgraph is accompanied by corresponding two-dimensional bounding volume information.
The device comprises one or more processors and is used for realizing the semantic graph compression method based on the dynamic object segmentation.
A computer-readable storage medium, on which a program is stored which, when executed by a processor, implements a semantic graph compression method based on dynamic object segmentation.
The invention has the following beneficial effects:
the invention discloses a semantic graph compression method based on dynamic object segmentation, which comprises the steps of firstly dividing a simulation scene into a static background and a dynamic object, and drawing the simulation scene to obtain a semantic graph; segmenting all dynamic objects in the semantic graph to obtain semantic subgraphs, and filling the static background semantic graph with adjacent pixels of the semantic subgraphs of the dynamic objects; and finally, coding all dynamic object semantic subgraphs and the filled static background semantic graphs respectively by using a coding algorithm. The method separates the static background from the dynamic object, and splits a semantic graph into a static background semantic graph and a plurality of dynamic object semantic subgraphs, thereby reducing the pixel mutation in the semantic graph, increasing the continuity of data distribution and obviously improving the compression ratio of semantic graph coding.
Drawings
FIG. 1 is a diagram of steps of a semantic graph compression method based on dynamic object segmentation in an exemplary embodiment.
FIG. 2 is a diagram of a static background and dynamic objects of a simulation scene in an exemplary embodiment.
FIG. 3 is a diagram of semantic graph dynamic object segmentation and compression in an exemplary embodiment.
FIG. 4 is a schematic diagram of a semantic graph compression apparatus based on dynamic object segmentation in an exemplary embodiment.
Detailed Description
For the purposes of promoting an understanding of the objects, aspects and advantages of the invention, reference will now be made to the following detailed description of the invention, taken in conjunction with the accompanying drawings and examples, it being understood that the specific embodiments described herein are illustrative of the invention and are not intended to be exhaustive. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the scope of the present invention.
In one embodiment, as shown in fig. 1, a semantic graph compression method and apparatus based on dynamic object segmentation is provided, in which a simulation scene is first divided into two parts, namely a static background and a dynamic object, and the simulation scene is drawn to obtain a semantic graph; segmenting all dynamic objects in the semantic graph to obtain semantic subgraphs, and filling the static background semantic graph with adjacent pixels of the semantic subgraphs of the dynamic objects; and finally, coding all dynamic object semantic subgraphs and the filled static background semantic graph respectively by using a coding algorithm.
The method specifically comprises the following steps:
step 1, initializing a simulation scene, wherein the simulation scene is composed of a static background and a dynamic object.
As shown in fig. 2, in the present embodiment, the simulation scene is a digital twin scene of a certain park, and is composed of several types of models, such as an oblique photography model, an art handmade model, and a procedural generation model, which are classified into two types, namely a static background and a dynamic object. The static background comprises sky, terrain, road surface, road teeth, street lamps, buildings, plants and the like, and the dynamic object comprises a non-motor vehicle, a pedestrian and the like.
The image rendering interface of the simulation program adopts an Open-source Graphics rendering Library OpenGL (Open Graphics Library), which is a cross-language, cross-platform Application Programming Interface (API) for rendering 2D and 3D vector Graphics. The OpenGL interface is typically used to interact with a Graphics Processing Unit (GPU) to implement hardware-accelerated rendering, which includes nearly 350 different function call components to draw from simple graphics bits to complex three-dimensional scenes; openGL is commonly used in CAD, virtual reality, scientific visualization programs, and electronic game development, and contains 7 major functions: establishing a 3D model, graph transformation, a color mode, illumination and material setting, texture mapping, an image enhancement function, an extended function of bitmap display and a double-cache function.
All objects of the simulation scene are assigned with an ID, and the IDs of the objects with the same type of semantics are the same, wherein the same type of semantics refers to the objects belonging to the same type in a target detection algorithm; each ID uniquely corresponds to one color, and different IDs correspond to different colors. The correspondence between the IDs and the colors of the different objects is shown in table 1.
And a semantic graph simulation camera is also placed in the simulation scene, and a semantic graph is generated by a scene drawing method. Setting the field angle of the semantic graph simulation camera to be 72 degrees, the aspect ratio to be 16, and the resolution of the target image to be 1280 × 720. The semantic graph simulation camera also comprises a motion control module which controls the motion logic of the semantic graph simulation camera in a scene: a motion track is preset, each point of the motion track comprises a group of pose data, the semantic graph simulation camera moves at a constant speed according to the preset speed along the motion track, and a semantic graph is generated by drawing at fixed time intervals.
TABLE 1 correspondence of ID and color of different objects
Figure 671642DEST_PATH_IMAGE001
And 2, updating and drawing the simulation scene to obtain a semantic graph and two-dimensional bounding volumes of all dynamic objects under a semantic graph coordinate system.
The two-dimensional enclosure may be a two-dimensional enclosure frame, such as a rectangular frame, or may be other regular shapes, such as a circle, an ellipse, etc., or other irregular shapes.
Semantic segmentation is an important ring in image processing and computer vision techniques with respect to image understanding. The semantic segmentation classifies each pixel point in the image, determines the category of each point, such as the category of the background, the edge or the body, and separates the example segmentation. Semantic segmentation does not separate instances of the same class; it is only the class of each pixel that is of interest, and if there are two objects of the same class in the input object, the segmentation itself will not distinguish them as separate objects. The semantic segmentation algorithm outputs a semantic segmentation image, which is called a semantic image for short.
In the embodiment, the semantic graph divides one picture or video into a plurality of blocks according to the difference of the categories, so as to realize the classification of the pixel level; each pixel in the semantic graph obtained by drawing uniquely corresponds to an object, and the color corresponding to the object ID is used for coloring.
In the embodiment, the updating of the simulation scene comprises the pose updating of all dynamic objects and semantic graph simulation cameras, and the updating frequency is set to be 60HZ; all dynamic objects including non-motor vehicles, motor vehicles and pedestrians are skeleton skin models, and the pose updating comprises the following two steps:
(1) Receiving the state data of the dynamic object from the server, analyzing the position and rotation data in the state data, and setting the state data of the dynamic object to a corresponding dynamic object according to the object identifier character string;
(2) And performing animation skinning calculation based on the GPU according to the current simulation time and the animation state of the dynamic object, and updating each vertex of the dynamic object model.
The semantic graph simulation camera updates the position and the posture of the semantic graph simulation camera by sampling a preset motion track, and the method comprises the following specific steps of:
(1) Simulating the motion speed of a camera according to the current simulation time and a preset semantic graph, and calculating the path length of the current motion;
(2) If the path length is larger than the preset motion track length, stopping the simulation process and exiting the program; otherwise, sampling a preset motion track by using the path length, and setting the obtained position and posture to the semantic graph simulation camera.
And starting to execute the drawing operation after the poses of all the dynamic objects and the semantic graph simulation cameras are updated. Before drawing each scene object, calling a glUniform1i function to transfer the ID of the object as a uniform variable to a pixel shader, and calling a glDrawElements function to draw vertex data of the model. In the pixel shader, the calculation formula of the output pixel color PixelColor and the object ID is as follows:
PixelColor.r=ID*20
PixelColor.g=ID*20
PixelColor.b=ID*20
in addition to obtaining drawn semantic graph data, the embodiment also generates a two-dimensional bounding box of all dynamic objects in a semantic graph coordinate system, and an area covered by the two-dimensional bounding box includes all pixels of the dynamic objects on the semantic graph; for each dynamic object, the specific generation algorithm of the two-dimensional bounding box is as follows:
(1) Traversing all vertexes of the dynamic object, respectively calculating the maximum value and the minimum value in three axial directions, and constructing an axially Aligned three-dimensional Bounding Box (AABB, axis-Aligned Bounding Box), wherein the three-dimensional Bounding Box can contain all vertexes of the dynamic object;
(2) Traversing 8 three-dimensional corner points of the three-dimensional bounding box, and transforming the three-dimensional corner points into two-dimensional corner points under a semantic map coordinate system through a model matrix, a view matrix, a perspective matrix and a viewport transformation matrix in sequence;
(3) Traversing the two-dimensional angular points obtained in the last step, respectively calculating the maximum value and the minimum value in two axial directions, and constructing an axially aligned two-dimensional surrounding frame containing all the two-dimensional angular points, wherein the two-dimensional surrounding frame is the two-dimensional surrounding frame of the dynamic object.
Step 3, segmenting the semantic data of the dynamic object by using the two-dimensional bounding volume in the step 2 to form a plurality of semantic subgraphs; and filling the corresponding image area by using adjacent pixels of each dynamic object semantic sub-image in the rest static background semantic graph.
In this embodiment, as shown in fig. 3, the two-dimensional bounding box of the dynamic object is used to segment semantic data of all dynamic objects from the original semantic graph to form a semantic subgraph; storing the semantic graph data and the semantic subgraph data by using a two-dimensional array, wherein elements of the two-dimensional array are RGB color values of pixels;
initializing corresponding semantic subgraph data according to the size of a two-dimensional bounding volume of each dynamic object, wherein each element of the semantic subgraph is initialized to be (R: 0, G:0, B: 0); traversing each pixel of the original semantic graph, and performing the following processing for each pixel:
judging whether the pixel coordinate of the pixel is located in the range of a two-dimensional surrounding body of a certain dynamic object, if so, calculating the relative coordinate of the pixel under the semantic subgraph coordinate system of the dynamic object, and writing the relative coordinate into the corresponding element of the semantic subgraph of the dynamic object; otherwise, the pixel is ignored, and the next pixel is continuously traversed to obtain the semantic subgraph of a certain dynamic object.
For each dynamic object semantic sub-graph, searching all edge pixels of a circle of the dynamic object semantic sub-graph around the original semantic graph, and if the edge pixels exceed the range of the semantic graph, determining that the edge pixels are invalid; and selecting the mode of all effective edge pixels as specified adjacent pixels, and filling the corresponding image area of the dynamic object semantic subgraph in the original semantic graph with the adjacent pixels. Wherein, the edge pixel can be one circle or a plurality of circles; in the case of specifying adjacent pixels, the average number and the median number of all effective edge pixels may be selected in addition to the mode of all effective edge pixels.
In this embodiment, if no dynamic object exists in the simulation scene, the above steps of dynamic object segmentation and original semantic graph filling are not performed.
And 4, respectively coding all the dynamic object semantic subgraphs and the filled static background semantic graphs by using a coding algorithm.
In the embodiment, all dynamic object semantic subgraphs and the filled static background semantic graph are compressed and encoded by using the stroke coding; run-Length Encoding (RLE) is a form of lossless data compression in which runs of data (sequences of identical data values occurring in many consecutive data elements) are stored as a single data value and count, rather than as original runs. This is most effective for data containing many such runs, such as simple graphical images, like icons, line drawings and animations. For files without too many repeated characters, run length encoding may instead increase the file size.
In addition, the run-length coding is sensitive to transmission errors, and if one bit of the code is wrong, the correctness of the whole coding sequence is affected, so that the run-length coding cannot restore the original data, and therefore, the errors are generally controlled within one row and one column by using a row synchronization and column synchronization method. In summary, run-length coding is well suited for compressing highly repetitive computer-generated images, especially binary images, and is also suitable for image data such as semantic maps.
All dynamic object semantic subgraphs and the filled static background semantic graphs are compressed by using stroke coding, and the coding and compression processes are consistent; here, taking the filled static background semantic graph (background semantic graph for short) as an example, the compression process is described as follows:
creating a variable Header, and initializing the variable Header to be the first pixel of the background semantic graph; creating a variable Count, and initializing to 0; creating an empty one-dimensional array for storing the compressed result data, wherein the array elements are 24-bit integer values; traversing each pixel of the background semantic graph, and executing the following operations for each pixel: judging whether the color value of the current pixel is the same as the Header or not, and if so, adding one to the Count; otherwise, writing the current Count value into the compression result array, combining the RGB three channel values of the color value of the Header into an integer, writing the integer into the compression result array, assigning the color value of the current pixel to the Header, resetting the Count value to 0, and continuously traversing the next pixel.
The compression result of the dynamic object semantic subgraph needs to store two-dimensional bounding volume information of the dynamic object semantic subgraph on the original semantic graph besides the compressed data of the run-length coding. The resulting compression result data includes three parts:
(1) A stroke coding compression array of the background semantic graph;
(2) Compressing arrays of the stroke codes of all dynamic object semantic subgraphs;
(3) And (3) two-dimensional bounding volume information of all dynamic object semantic subgraphs.
In this embodiment, the complete compression result data is transmitted to the algorithm end through the network, and the algorithm end restores the original semantic graph through the corresponding decompression method: firstly, decompressing background semantic graph data according to a stroke coding compression array of a background semantic graph, decompressing all dynamic object semantic sub-graph data according to the stroke coding compression array of all dynamic object semantic sub-graphs, and finally covering all dynamic object semantic sub-graph data to corresponding positions of the background semantic graph data according to two-dimensional bounding volume information of all dynamic object semantic sub-graphs to obtain original semantic graph data;
the specific method for decompressing the original data according to the run length coding compression array is as follows:
(1) Creating a variable Count, and initializing to 0; reading the first array element and assigning a value to the Count;
(2) Reading subsequent Count array elements, splitting each array element into RGB three channel values to form a pixel color value, and writing the pixel color value into original data;
(3) If all the array elements have been traversed, the decompression step is completed, and the program is exited; otherwise, reading the next array element, assigning value to Count, and continuing to execute the step 2.
After the decompression of all the dynamic object semantic subgraphs and the background semantic graph is finished, the final semantic graph splicing step is executed: traversing the two-dimensional bounding volume information of all the dynamic object semantic subgraphs, traversing all the pixels of the dynamic object semantic subgraph corresponding to the current two-dimensional bounding volume, transforming the coordinates of the current pixels in the dynamic object semantic subgraph to the coordinates of the background semantic graph by using the two-dimensional bounding volume, and writing the current pixels into the pixels at the corresponding positions of the background semantic graph.
The application effect of the final semantic graph compression method based on dynamic object segmentation is shown in table 2.
Table 2 shows the compression ratio comparison of image compression using the conventional three methods and the method of the present invention
RLE TRLE FastPFor Ours
Compression ratio 22.4% 14.6% 9.3% 5.9%
Table 2 compares the compression ratios of four Encoding methods, i.e., the conventional Run-Length Encoding (RLE), the Turbo Run-Length Encoding (TRLE), the fast integer sequence compression (FastPFor), and the semantic graph compression based on dynamic object segmentation. It can be seen that, because the characteristics of the simulation scene are fully utilized, the moving objects and the static objects are separated, the continuity of the semantic graph data is maximized so as to facilitate the subsequent compression and encoding, and the compression ratio of the semantic graph compression method based on the dynamic object segmentation is the lowest, thereby embodying the beneficial effects of the invention.
Corresponding to the foregoing embodiment, the present invention further provides an embodiment of a semantic graph compression apparatus based on dynamic object segmentation, as shown in fig. 4, the apparatus includes one or more processors, and is configured to implement the above laser radar point cloud generation method.
The semantic graph compressing device based on dynamic object segmentation of the present invention can be applied to any device with data processing capability, such as a computer or other devices. The apparatus embodiments may be implemented by software, or by hardware, or by a combination of hardware and software.
Figure 277198DEST_PATH_IMAGE002
The software implementation is taken as an example, and as a device in a logical sense, a processor of any device with data processing capability reads corresponding computer program instructions in the nonvolatile memory into the memory for operation. From the hardware layerIn addition to the processor, the memory, the network interface, and the nonvolatile memory, any device with data processing capability in which the apparatus is located in the embodiment may further include other hardware generally according to an actual function of the any device with data processing capability, which is not described herein again.
The implementation process of the functions and actions of each unit in the above device is specifically described in the implementation process of the corresponding step in the above method, and is not described herein again.
For the device embodiment, since it basically corresponds to the method embodiment, reference may be made to the partial description of the method embodiment for relevant points. The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of the present invention. One of ordinary skill in the art can understand and implement it without inventive effort.
The embodiment of the present invention further provides a computer-readable storage medium, on which a program is stored, and when the program is executed by a processor, the method for compressing a semantic graph based on dynamic object segmentation in the foregoing embodiments is implemented.
The computer readable storage medium may be an internal storage unit, such as a hard disk or a memory, of any data processing capability device described in any of the foregoing embodiments. The computer readable storage medium may also be an external storage device such as a plug-in hard disk, a Smart Media Card (SMC), an SD card, a Flash memory card (Flash card), etc. provided on the device. Further, the computer readable storage medium may include both internal storage units and external storage devices of any data processing capable device. The computer readable storage medium is used to store the computer program, and other programs and data needed by any of the data processing capable devices, and may also be used to temporarily store data that has been or will be output.
The above examples are only intended to illustrate the technical solution of the present invention, and not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and these modifications or substitutions do not depart from the scope of the embodiments of the present invention in nature.

Claims (8)

1. A semantic graph compression method based on dynamic object segmentation is characterized by comprising the following steps:
s1, initializing a simulation scene, wherein the simulation scene consists of a static background and a dynamic object;
s2, updating and drawing the simulation scene to obtain a semantic graph and two-dimensional bounding volumes of all dynamic objects under a semantic graph coordinate system;
s3, segmenting the semantic data of the dynamic object by using the two-dimensional bounding volume to form a plurality of dynamic object semantic subgraphs; filling the corresponding image area with the adjacent pixels of each dynamic object semantic subgraph in the rest static background semantic graph;
in the step S3, the semantic data of the dynamic object is segmented by using the two-dimensional bounding volume in the step S2 to form a plurality of semantic subgraphs, and the semantic subgraphs are specifically realized by the following substeps:
(1) Initializing corresponding semantic subgraph data according to the size of a two-dimensional bounding volume of each dynamic object, wherein each element of the semantic subgraph is initialized to be (R: 0, G:0, B: 0);
(2) Traversing each pixel of the semantic graph, and for each pixel, performing the following processing:
judging whether the pixel coordinate of the pixel is located in the range of a two-dimensional surrounding body of a certain dynamic object, if so, calculating the relative coordinate of the pixel under the semantic sub-image coordinate system of the dynamic object, writing the relative coordinate into a corresponding element of the semantic sub-image of the dynamic object, otherwise, ignoring the pixel, and continuously traversing the next pixel to obtain the semantic sub-image of the certain dynamic object;
and S4, respectively coding all the dynamic object semantic subgraphs and the filled static background semantic graph by using a coding algorithm.
2. The semantic graph compression method based on dynamic object segmentation according to claim 1, wherein all objects of the simulation scene in S1 are assigned with an ID, and IDs of objects having the same semantic class are the same; each ID uniquely corresponds to one color, and different IDs correspond to different colors.
3. The semantic graph compression method based on dynamic object segmentation according to claim 1, wherein the updating of the simulation scene in S2 includes pose updating of all dynamic objects and rendering view angles; each pixel in the semantic graph uniquely corresponds to one object, and the color corresponding to the object ID is used for coloring.
4. The method according to claim 1, wherein in S2, the two-dimensional bounding volume of the dynamic object in the semantic map coordinate system contains all pixels of the dynamic object on the semantic map.
5. The semantic map compression method based on dynamic object segmentation according to claim 1, wherein in the step S3, the remaining static background semantic map fills the corresponding image region with the adjacent pixels of each dynamic object semantic sub-map, and is implemented by the following sub-steps:
(1) Searching edge pixels around the dynamic object semantic subgraph, and if the edge pixels exceed the range of the semantic graph, determining the edge pixels to be invalid;
(2) And counting all effective edge pixels, and filling the corresponding image area of the dynamic object semantic subgraph in the semantic graph according to the values of the edge pixels.
6. The semantic graph compression method based on dynamic object segmentation according to claim 1, wherein the encoding result of each dynamic object semantic subgraph in S4 is accompanied by corresponding two-dimensional bounding volume information.
7. A semantic graph compression device based on dynamic object segmentation is characterized by comprising one or more processors and being used for realizing the semantic graph compression method based on dynamic object segmentation in any one of claims 1 to 6.
8. A computer-readable storage medium, on which a program is stored, which, when executed by a processor, implements the semantic graph compression method based on dynamic object segmentation according to any one of claims 1 to 6.
CN202211390992.4A 2022-11-08 2022-11-08 Semantic graph compression method and device based on dynamic object segmentation Active CN115496818B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211390992.4A CN115496818B (en) 2022-11-08 2022-11-08 Semantic graph compression method and device based on dynamic object segmentation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211390992.4A CN115496818B (en) 2022-11-08 2022-11-08 Semantic graph compression method and device based on dynamic object segmentation

Publications (2)

Publication Number Publication Date
CN115496818A CN115496818A (en) 2022-12-20
CN115496818B true CN115496818B (en) 2023-03-10

Family

ID=85115811

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211390992.4A Active CN115496818B (en) 2022-11-08 2022-11-08 Semantic graph compression method and device based on dynamic object segmentation

Country Status (1)

Country Link
CN (1) CN115496818B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111739037A (en) * 2020-07-31 2020-10-02 之江实验室 Semantic segmentation method for indoor scene RGB-D image
CN111918071A (en) * 2020-06-29 2020-11-10 北京大学 Data compression method, device, equipment and storage medium
CN112132897A (en) * 2020-09-17 2020-12-25 中国人民解放军陆军工程大学 Visual SLAM method based on deep learning semantic segmentation
CN114915786A (en) * 2022-04-26 2022-08-16 哈尔滨工业大学(深圳) Asymmetric semantic image compression method for scene of Internet of things
CN115049832A (en) * 2022-06-27 2022-09-13 西安交通大学 Scene semantic segmentation method, system, terminal device and readable storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180161986A1 (en) * 2016-12-12 2018-06-14 The Charles Stark Draper Laboratory, Inc. System and method for semantic simultaneous localization and mapping of static and dynamic objects
EP3660787A4 (en) * 2017-07-25 2021-03-03 Cloudminds (Shenzhen) Robotics Systems Co., Ltd. Training data generation method and generation apparatus, and image semantics segmentation method therefor
DE102020209024A1 (en) * 2020-07-20 2022-01-20 Robert Bosch Gesellschaft mit beschränkter Haftung Method for generating a surveillance image
US12062174B2 (en) * 2020-09-15 2024-08-13 Sri International Fully automated multimodal system architecture for semantic segmentation of large-scale 3D outdoor point cloud data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111918071A (en) * 2020-06-29 2020-11-10 北京大学 Data compression method, device, equipment and storage medium
CN111739037A (en) * 2020-07-31 2020-10-02 之江实验室 Semantic segmentation method for indoor scene RGB-D image
CN112132897A (en) * 2020-09-17 2020-12-25 中国人民解放军陆军工程大学 Visual SLAM method based on deep learning semantic segmentation
CN114915786A (en) * 2022-04-26 2022-08-16 哈尔滨工业大学(深圳) Asymmetric semantic image compression method for scene of Internet of things
CN115049832A (en) * 2022-06-27 2022-09-13 西安交通大学 Scene semantic segmentation method, system, terminal device and readable storage medium

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Real-time Semantic Segmentation of Crop and Weed for Precision Agriculture Robots Leveraging Background Knowledge in CNNs;Andres Milioto,et al.;《IEEE Xplore》;20180913;全文 *
基于环境语义信息的同步定位与地图构建方法综述;李小倩 等;《工程科学学报》;20210630;全文 *
面向内容的工程图识别与理解综述;路通 等;《图学学报》;20121031;全文 *
面向实时视频流分析的边缘计算技术;杨铮 等;《中国科学:信息科学》;20220110;全文 *

Also Published As

Publication number Publication date
CN115496818A (en) 2022-12-20

Similar Documents

Publication Publication Date Title
CN112884875B (en) Image rendering method, device, computer equipment and storage medium
US9852544B2 (en) Methods and systems for providing a preloader animation for image viewers
US7940279B2 (en) System and method for rendering of texel imagery
US20230108967A1 (en) Micro-meshes, a structured geometry for computer graphics
US9928643B2 (en) Hierarchical continuous level of detail for three-dimensional meshes
US11450057B2 (en) Hardware acceleration for ray tracing primitives that share vertices
KR20160033128A (en) Sparse gpu voxelization for 3d surface reconstruction
US20050017971A1 (en) Ray tracing hierarchy
Duguet et al. Flexible point-based rendering on mobile devices
CN1655191A (en) Programmable graphic hardware based rapid voxel method for polygonal grid model
US20150332481A1 (en) Indexed uniform styles for stroke rendering
CN114419099B (en) Method for capturing motion trail of virtual object to be rendered
CN109979013B (en) Three-dimensional face mapping method and terminal equipment
CN113648655B (en) Virtual model rendering method and device, storage medium and electronic equipment
CN114863038A (en) Real-time dynamic free visual angle synthesis method and device based on explicit geometric deformation
CN115428027A (en) Neural opaque point cloud
CN110930492B (en) Model rendering method, device, computer readable medium and electronic equipment
US10347034B2 (en) Out-of-core point rendering with dynamic shapes
WO2019088865A1 (en) Method and system for removing hidden surfaces from a three-dimensional scene
CN115496818B (en) Semantic graph compression method and device based on dynamic object segmentation
CN110136235B (en) Three-dimensional BIM model shell extraction method and device and computer equipment
WO2021155318A1 (en) Method for improved handling of texture data for texturing and other image processing tasks
US20240320903A1 (en) Methods and systems for generating enhanced light texture data
Poux How to represent 3D Data?
WO2023184139A1 (en) Methods and systems for rendering three-dimensional scenes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant