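WO2022222641A1 - Point cloud encoding and decoding method and apparatus, computer-readable medium, and electronic device - Google Patents
Point cloud encoding and decoding method and apparatus, computer-readable medium, and electronic device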
- Publication number
- WO2022222641A1 (PCT/CN2022/080266)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- point cloud
- frame rate
- file
- track
- media
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/61—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
- H04L65/612—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for unicast
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/001—Model-based coding, e.g. wire frame
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/70—Media network packetisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/75—Media network packet handling
- H04L65/762—Media network packet handling at the source
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
- H04N19/463—Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23605—Creation or processing of packetized elementary streams [PES]
Definitions
- The present application belongs to the field of computer and communication technologies, and in particular relates to a point cloud encoding and decoding method, a point cloud encoding and decoding apparatus, a computer-readable medium, and an electronic device.
- A point cloud is a set of discrete points that are randomly distributed in space and express the spatial structure and surface properties of a three-dimensional object or scene. After large-scale point cloud data is acquired through a point cloud acquisition device, the point cloud data can be encoded and encapsulated for transmission and presentation to the user.
- Embodiments of the present application provide a point cloud encoding and decoding method, a point cloud encoding and decoding apparatus, a computer-readable medium, and an electronic device.
- A point cloud decoding method, comprising: receiving a point cloud file transmitted by a data source, the point cloud file including one or more point cloud media tracks having the same point cloud content, some of the point cloud media tracks in the point cloud file having different frame rates; parsing file encapsulation information of the one or more point cloud media tracks to obtain frame rate indication information carried in the file encapsulation information, the frame rate indication information being used to indicate the frame rate of the one or more point cloud media tracks; and, according to the frame rate indication information carried in the file encapsulation information, selecting and decoding a point cloud media track with a specified frame rate from the point cloud file.
- A point cloud decoding apparatus, comprising: a receiving module configured to receive a point cloud file transmitted by a data source, where the point cloud file includes one or more point cloud media tracks having the same point cloud content, and some of the point cloud media tracks in the point cloud file have different frame rates; a parsing module configured to parse file encapsulation information of the one or more point cloud media tracks and obtain frame rate indication information carried in the file encapsulation information, where the frame rate indication information is used to indicate the frame rate of the one or more point cloud media tracks; and a decoding module configured to select and decode, according to the frame rate indication information carried in the file encapsulation information, the point cloud media track with the specified frame rate from the point cloud file.
- The receiving module includes: a signaling receiving unit configured to receive streaming media signaling sent by a data source for transmitting point cloud data; a signaling parsing unit configured to parse the streaming media signaling to obtain a time-domain hierarchical group identifier carried in the streaming media signaling for identifying a track group, where the track group includes one or more point cloud media tracks with the same point cloud content, and some of the point cloud media tracks in the track group have different frame rates; a request sending unit configured to send a first data transmission request to the data source according to the time-domain hierarchical group identifier; and a file receiving unit configured to receive a point cloud file, transmitted by the data source, that corresponds to the first data transmission request.
- The request sending unit includes: a bandwidth acquisition subunit configured to acquire the network bandwidth for data transmission with the data source; a track selection subunit configured to select from the track group, according to the time-domain hierarchical group identifier, one or more target point cloud media tracks with a target frame rate that matches the network bandwidth; and a request sending subunit configured to send to the data source a first data transmission request requesting transmission of the one or more target point cloud media tracks.
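The bandwidth-matched track selection described above can be illustrated with a minimal sketch. The names `TrackInfo`, `select_track`, and the `bitrate_kbps` field are hypothetical and do not appear in the application; the sketch assumes each track advertises a bitrate that can be compared against the measured bandwidth, and it picks the highest frame rate that fits. It is one possible selection policy, not the method defined by the application.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class TrackInfo:
    track_id: int
    group_id: int      # time-domain hierarchical group identifier (hypothetical field name)
    frame_rate: int    # frames per second
    bitrate_kbps: int  # assumption: a bitrate is advertised for each track


def select_track(tracks: List[TrackInfo], group_id: int,
                 bandwidth_kbps: int) -> Optional[TrackInfo]:
    """Pick the highest-frame-rate track of the group that fits the bandwidth."""
    candidates = [t for t in tracks
                  if t.group_id == group_id and t.bitrate_kbps <= bandwidth_kbps]
    if not candidates:
        return None
    # Prefer the higher frame rate; break ties with the lower bitrate.
    return max(candidates, key=lambda t: (t.frame_rate, -t.bitrate_kbps))


tracks = [
    TrackInfo(track_id=1, group_id=100, frame_rate=60, bitrate_kbps=8000),
    TrackInfo(track_id=2, group_id=100, frame_rate=30, bitrate_kbps=4000),
    TrackInfo(track_id=3, group_id=100, frame_rate=30, bitrate_kbps=3500),
]
```

With 5000 kbps available, only the two 30 fps tracks fit, and the tie-break selects the cheaper one; the client would then name that track in the first data transmission request.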
- The parsing module includes: an information parsing unit configured to parse the file encapsulation information of the one or more point cloud media tracks to determine a frame rate indication field corresponding to the frame rates of the one or more point cloud media tracks; and an information determination unit configured to determine the frame rate indication information of the one or more point cloud media tracks according to the value of the frame rate indication field.
- The point cloud decoding apparatus further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a track selection module configured to select from the point cloud file, according to the frame rate indication information carried in the file encapsulation information, another point cloud media track with the same frame rate as the point cloud media track to be displayed; and a first replacement module configured to replace the point cloud media track to be displayed with the other point cloud media track, so as to decode and display the other point cloud media track.
- The point cloud decoding apparatus further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a track selection module configured to select from the point cloud file, according to the frame rate indication information carried in the file encapsulation information, one or more other point cloud media tracks with the same frame rate as the point cloud media track to be displayed; and a first merging module configured to decode the one or more other point cloud media tracks, and to combine and display the one or more other point cloud media tracks together with the point cloud media track to be displayed.
- The point cloud decoding apparatus further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a file acquisition module configured to send a second data transmission request to the data source to receive a supplemental point cloud file transmitted by the data source, the supplemental point cloud file comprising other point cloud media tracks having the same point cloud content and the same frame rate as the point cloud media track to be displayed; and a second replacement module configured to replace the point cloud media track to be displayed with the other point cloud media tracks, so as to decode and display the other point cloud media tracks.
- The point cloud decoding apparatus further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a file acquisition module configured to send a third data transmission request to the data source to receive a supplemental point cloud file transmitted by the data source, the supplemental point cloud file comprising one or more other point cloud media tracks having the same point cloud content and the same frame rate as the point cloud media track to be displayed; and a second merging module configured to decode the one or more other point cloud media tracks, and to combine and display the one or more other point cloud media tracks together with the point cloud media track to be displayed.
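The track selection step shared by the replacement and merging modules above can be sketched as follows. The function name `same_rate_alternatives` and the dictionary representation of the frame rate indication information are assumptions made for illustration; the application does not prescribe a data structure.

```python
from typing import Dict, List


def same_rate_alternatives(frame_rates: Dict[str, int], displayed: str) -> List[str]:
    """Return the other tracks whose frame rate equals that of the displayed track."""
    target = frame_rates[displayed]
    return [track for track, fps in frame_rates.items()
            if track != displayed and fps == target]


# Hypothetical frame rate indication information, keyed by track name.
frame_rates = {"track1": 60, "track2": 30, "track3": 30}
replacement_candidates = same_rate_alternatives(frame_rates, "track2")
```

A replacement module would swap the displayed track for one of the returned candidates, while a merging module would decode the candidates and present them together with the displayed track. If the list is empty, the supplemental point cloud file path (second or third data transmission request) applies.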
- A point cloud encoding method, comprising: encoding point cloud data to be transmitted according to different encoding standards to obtain multiple point cloud code streams having the same point cloud content, some of the multiple point cloud code streams having different frame rates; and encapsulating the multiple point cloud code streams into multiple point cloud media tracks, and filling the multiple point cloud media tracks with frame rate indication information corresponding to the multiple point cloud code streams, where the frame rate indication information is used to indicate the frame rates of the multiple point cloud media tracks.
- A point cloud encoding apparatus, comprising: an encoding module configured to encode point cloud data to be transmitted according to different encoding standards to obtain multiple point cloud code streams having the same point cloud content, some of the multiple point cloud code streams having different frame rates; and an encapsulation module configured to encapsulate the multiple point cloud code streams into multiple point cloud media tracks, and to fill the multiple point cloud media tracks with frame rate indication information corresponding to the multiple point cloud code streams, where the frame rate indication information is used to indicate the frame rates of the multiple point cloud media tracks.
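As a rough illustration of the encapsulation step, the sketch below wraps each code stream in a track and records its frame rate in the track's encapsulation information. The classes `CodeStream` and `MediaTrack`, the function `encapsulate`, and the `"frame_rate"` key are hypothetical names chosen for this example; real file encapsulation would use a container format rather than Python objects.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class CodeStream:
    frame_rate: int  # frames per second of the encoded stream
    payload: bytes   # encoded point cloud data (opaque here)


@dataclass
class MediaTrack:
    track_id: int
    payload: bytes
    # Hypothetical stand-in for the file encapsulation information.
    encapsulation_info: Dict[str, int] = field(default_factory=dict)


def encapsulate(streams: List[CodeStream]) -> List[MediaTrack]:
    """Wrap each code stream in a track and fill in its frame rate indication."""
    tracks = []
    for track_id, stream in enumerate(streams, start=1):
        info = {"frame_rate": stream.frame_rate}
        tracks.append(MediaTrack(track_id, stream.payload, info))
    return tracks


media_tracks = encapsulate([CodeStream(60, b"hi-rate"), CodeStream(30, b"lo-rate")])
```

Because the frame rate lives in the encapsulation information, a receiver can choose a track without decoding any payload.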
- The point cloud encoding apparatus further includes: a signaling generation module configured to generate streaming media signaling for transmitting point cloud data; a signaling filling module configured to fill the streaming media signaling with a time-domain hierarchical group identifier for identifying a track group, where the track group includes one or more point cloud media tracks with the same point cloud content, and some of the point cloud media tracks in the track group have different frame rates; and a signaling sending module configured to send the streaming media signaling to the data receiver.
- The point cloud encoding apparatus further includes: a request receiving module configured to receive a data transmission request that is generated based on the streaming media signaling and sent by the data receiver; and a file transmission module configured to transmit a point cloud file to the data receiver according to the data transmission request, where the point cloud file includes one or more point cloud media tracks with the same point cloud content, and some of the point cloud media tracks in the point cloud file have different frame rates.
- The encapsulation module includes: an information determination unit configured to determine, from the file encapsulation information of the multiple point cloud media tracks, a frame rate indication field corresponding to the frame rate indication information; and an information filling unit configured to fill the frame rate indication field in the file encapsulation information with the frame rate indication information corresponding to the multiple point cloud code streams.
- A computer-readable medium on which a computer program is stored, where the computer program, when executed by a processor, implements the method in the above technical solutions.
- An electronic device, comprising: a processor; and a memory for storing executable instructions of the processor; where the processor is configured to execute the executable instructions to perform the method in the above technical solutions.
- A computer program product or computer program, where the computer program product or computer program includes computer instructions stored in a computer-readable storage medium. The processor of a computer device reads the computer instructions from the computer-readable storage medium and executes them, so that the computer device performs the method in the above technical solutions.
- FIG. 1 shows a schematic diagram of an exemplary system architecture to which the technical solutions of the embodiments of the present application can be applied.
- FIG. 2 shows the placement of the point cloud encoding device and the point cloud decoding device in a streaming environment.
- FIG. 3 shows a flowchart of steps of a point cloud decoding method in an embodiment of the present application.
- FIG. 4 shows a schematic diagram of an alternative group packaged in multiple tracks in one embodiment of the present application.
- FIG. 5 shows a flowchart of steps for receiving a point cloud file from a data source in an embodiment of the present application.
- FIG. 6 shows a flowchart of steps of a point cloud encoding method in an embodiment of the present application.
- FIG. 7 shows a flowchart of steps for encoding and decoding point cloud data in an application scenario according to an embodiment of the present application.
- FIG. 8 schematically shows a structural block diagram of a point cloud decoding apparatus provided by an embodiment of the present application.
- FIG. 9 schematically shows a structural block diagram of a point cloud encoding apparatus provided by an embodiment of the present application.
- FIG. 10 schematically shows a structural block diagram of a computer system suitable for implementing the electronic device of the embodiment of the present application.
- Example embodiments will now be described more fully with reference to the accompanying drawings.
- Example embodiments can be embodied in various forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this application will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art.
- A point cloud is a set of discrete points that are randomly distributed in space and express the spatial structure and surface properties of a three-dimensional object or scene. Each point in the point cloud has at least three-dimensional position information, and may also have color, material, or other information depending on the application scenario. Typically, each point in a point cloud has the same number of additional properties.
- Point cloud media can be further divided into media compressed with Video-based Point Cloud Compression (VPCC), which builds on traditional video encoding methods, and media compressed with Geometry-based Point Cloud Compression (GPCC), which compresses based on geometric features.
- The three-dimensional position information is usually called the geometric component (Geometry Component) of the point cloud file, and the attribute information is called the attribute component (Attribute Component) of the point cloud file.
- A point cloud file may have only one geometric component, but may have one or more attribute components.
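The relationship between the single geometric component and the one or more attribute components can be pictured with a small data structure. `PointCloudFrame` and `add_attribute` are hypothetical names; the check that every attribute has one value per point reflects the statement above that each point typically carries the same number of additional properties.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Position = Tuple[float, float, float]


@dataclass
class PointCloudFrame:
    geometry: List[Position]  # the single geometric component: one (x, y, z) per point
    attributes: Dict[str, list] = field(default_factory=dict)  # zero or more attribute components

    def add_attribute(self, name: str, values: list) -> None:
        """Attach an attribute component holding exactly one value per point."""
        if len(values) != len(self.geometry):
            raise ValueError("attribute must have one value per point")
        self.attributes[name] = list(values)


frame = PointCloudFrame(geometry=[(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)])
frame.add_attribute("color", [(255, 0, 0), (0, 255, 0)])
frame.add_attribute("material", ["metal", "wood"])
```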
- Point clouds can express the spatial structure and surface properties of 3D objects or scenes flexibly and conveniently, so they are widely used. Their main application scenarios fall into two categories.
- Machine-perceived point clouds, used in, for example, autonomous navigation systems, real-time inspection systems, geographic information systems, visual sorting robots, and rescue and disaster relief robots.
- Human-perceived point clouds, used in, for example, digital cultural heritage, free-viewpoint broadcasting, 3D immersive communication, and 3D immersive interaction.
- Point clouds are mainly acquired by computer generation, 3D laser scanning, 3D photogrammetry, and the like.
- Computers can generate point clouds of virtual 3D objects and scenes.
- 3D scanning can obtain point clouds of static real-world 3D objects or scenes, at a rate of millions of points per second.
- 3D cameras can obtain point clouds of dynamic real-world 3D objects or scenes, at a rate of tens of millions of points per second.
- Point clouds of biological tissues and organs can be obtained from MRI, CT, and electromagnetic positioning information.
- After the point cloud media is encoded, the encoded data stream needs to be encapsulated and transmitted to the user.
- Correspondingly, on the receiving side, the point cloud file needs to be decapsulated first, then decoded, and finally the decoded data stream is presented.
- FIG. 1 shows a schematic diagram of an exemplary system architecture to which the technical solutions of the embodiments of the present application can be applied.
- The system architecture 100 includes a plurality of terminal devices that can communicate with each other through, for example, a network 150.
- For example, the system architecture 100 may include a first terminal device 110 and a second terminal device 120 interconnected by the network 150.
- The first terminal device 110 and the second terminal device 120 perform unidirectional data transmission.
- The first terminal device 110 may encode point cloud data (e.g., a point cloud code stream collected by the first terminal device 110) for transmission to the second terminal device 120 through the network 150.
- The second terminal device 120 may receive the encoded point cloud data from the network 150, decode the encoded point cloud data, and display the decoded point cloud data.
- The system architecture 100 may also include a third terminal device 130 and a fourth terminal device 140 that perform bidirectional transmission of encoded point cloud data, such as may occur during a video conference.
- Each of the third terminal device 130 and the fourth terminal device 140 may encode point cloud data (e.g., a point cloud code stream collected by that terminal device) for transmission through the network 150 to the other of the third terminal device 130 and the fourth terminal device 140.
- Each of the third terminal device 130 and the fourth terminal device 140 may also receive encoded point cloud data transmitted by the other of the third terminal device 130 and the fourth terminal device 140, decode the encoded point cloud data to recover the point cloud data, and display the recovered point cloud data on an accessible display device.
- The first terminal device 110, the second terminal device 120, the third terminal device 130, and the fourth terminal device 140 may be servers, personal computers, and smart phones, but the principles disclosed in this application are not limited thereto. Embodiments disclosed herein are also applicable to laptop computers, tablet computers, media players, and/or dedicated videoconferencing equipment.
- The network 150 represents any number of networks that communicate encoded point cloud data between the first terminal device 110, the second terminal device 120, the third terminal device 130, and the fourth terminal device 140, including, for example, wired and/or wireless communication networks.
- The network 150 may exchange data in circuit-switched and/or packet-switched channels.
- The network may include a telecommunications network, a local area network, a wide area network, and/or the Internet. For the purposes of this application, unless explained below, the architecture and topology of the network 150 may be immaterial to the operations disclosed herein.
- FIG. 2 shows the placement of the point cloud encoding device and the point cloud decoding device in a streaming environment.
- The subject matter disclosed herein is equally applicable to other point cloud enabled applications, including, for example, videoconferencing, digital television, and storing compressed point cloud data on digital media such as CDs, DVDs, and memory sticks.
- The streaming system may include an acquisition subsystem 213, which may include a point cloud data source 201 such as a digital camera; the point cloud data source 201 may, for example, create uncompressed point cloud data 202.
- The point cloud data 202 includes samples captured by the digital camera.
- The point cloud data 202 is depicted as a thick line to emphasize its high data volume. The point cloud data 202 can be processed by the electronic device 220, which includes a point cloud encoding device 203 coupled to the point cloud data source 201.
- The point cloud encoding device 203 may include hardware, software, or a combination of hardware and software to implement various aspects of the disclosed subject matter as described in greater detail below.
- The encoded point cloud data 204 (or encoded point cloud code stream 204) is depicted as a thin line to emphasize its lower data volume, and may be stored on the streaming server 205 for future use.
- One or more streaming client subsystems, such as client subsystem 206 and client subsystem 208 in FIG. 2, may access the streaming server 205 to retrieve point cloud data 207 and point cloud data 209, which are copies of the encoded point cloud data 204.
- Client subsystem 206 may include, for example, point cloud decoding device 210 in electronic device 230 .
- The point cloud decoding device 210 decodes the incoming copy 207 of the encoded point cloud data and produces output point cloud data 211 that can be presented on a display 212 (e.g., a display screen) or another presentation device.
- The encoded point cloud data 204, point cloud data 207, and point cloud data 209 may be encoded according to certain point cloud encoding/compression standards. Examples of these standards may include standards developed by MPEG for GPCC.
- The electronic device 220 and the electronic device 230 may include other components not shown in the figures. For example, the electronic device 220 may also include a point cloud decoding device, and the electronic device 230 may also include a point cloud encoding device.
- FIG. 3 shows a flow chart of the steps of a point cloud decoding method in an embodiment of the present application.
- The method can be applied at a server, a client, or an intermediate node of a point cloud media system.
- In this embodiment, the point cloud decoding method is described as performed by a client device on which a point cloud decoding apparatus is installed.
- The point cloud decoding method may mainly include the following steps S310 to S330.
- Step S310: Receive the point cloud file transmitted by the data source, where the point cloud file includes one or more point cloud media tracks with the same point cloud content, and some of the point cloud media tracks in the point cloud file have different frame rates.
- The point cloud file may encapsulate a single point cloud media track corresponding to a specified frame rate, or may encapsulate multiple point cloud media tracks with the same or different frame rates.
- Multiple point cloud media tracks may form a track group. The track group may contain point cloud media tracks (each including geometric components and attribute components) in single-track packaging mode, or geometric component tracks in multi-track packaging mode (the attribute component tracks are indexed by the geometric component tracks).
- Point cloud media tracks of the same frame rate can be substituted for each other during decoding and display, and point cloud media tracks of the same frame rate can be combined for consumption to achieve a better point cloud presentation effect.
- Some point cloud media tracks in the point cloud file have the same frame rate, while other point cloud media tracks may have different frame rates.
- For example, three point cloud media tracks, track1, track2, and track3, are encapsulated in the point cloud file.
- The frame rate of track1 is 60 fps, while the frame rates of track2 and track3 are both 30 fps.
- Point cloud media tracks with the same point cloud content and different point cloud qualities can be divided into the same alternative group.
- Point cloud quality may include various quality parameters corresponding to different standards, such as bit rate, frame rate, and resolution.
- the tracks of the point cloud content of different qualities belong to the same alternative group.
- the geometric component tracks of point cloud contents of different qualities belong to the same alternative group, and the attribute component track can be associated with the geometric component track.
- FIG. 4 shows a schematic diagram of an alternative group packaged in multiple tracks in one embodiment of the present application.
- the first point cloud data 410 and the second point cloud data 420 having the same point cloud content are included in the alternative group 400 .
- the first point cloud data 410 is lossless compressed point cloud data with relatively high point cloud quality (Lossless coded GPCC)
- the second point cloud data 420 is lossy compressed point cloud data with relatively low point cloud quality (Lossy coded GPCC).
- the first point cloud data 410 includes a first geometric component track 411 and a first attribute component track 412 associated with the first geometric component track 411
- the second point cloud data 420 includes a second geometric component track 421 and a second attribute component track 422 associated with the second geometric component track 421.
- Step S320: Parse the file encapsulation information of one or more point cloud media tracks to obtain frame rate indication information carried in the file encapsulation information, where the frame rate indication information is used to indicate the frame rate of one or more point cloud media tracks.
- the method for parsing file encapsulation information to obtain frame rate indication information may include: parsing the file encapsulation information of one or more point cloud media tracks to determine the frame rate indication field corresponding to the frame rate of the one or more point cloud media tracks; and determining the frame rate indication information of the one or more point cloud media tracks according to the value of the frame rate indication field.
- the file encapsulation information is an ISOBMFF (ISO Base Media File Format) data box generated when a point cloud code stream is encapsulated into a point cloud media track.
- the file encapsulation information may specifically be expressed as an expanded track group data box TrackGroupTypeBox, and its syntax is as follows.
- the frame_rate is frame rate indication information used to indicate the frame rate corresponding to the point cloud file, and its value is an unsigned integer with a length of 8 bytes.
- point cloud media tracks with the same content and different frame rates can be associated with each other.
- the point cloud tracks that belong to the same track group meet the following conditions.
- the track is a point cloud track (including geometry and attribute components) in the single-track packaging mode or a geometric component track in the multi-track packaging mode (the attribute component track is obtained from the geometric component track index).
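Reading the frame rate indication field from the extended track group data box can be sketched as follows. This is a minimal sketch under stated assumptions, not the box definition itself: it assumes the standard ISOBMFF TrackGroupTypeBox layout (a FullBox header followed by a 32-bit `track_group_id`) with `frame_rate` appended directly after it. The four-character code `frgp` and the function name are hypothetical, and the width of `frame_rate` is a parameter whose default follows the length stated above.

```python
import struct
from dataclasses import dataclass

# Hypothetical four-character code; the real track group type is not given in this text.
FRAME_RATE_GROUP_TYPE = b"frgp"


@dataclass(frozen=True)
class FrameRateTrackGroupBox:
    track_group_type: bytes
    track_group_id: int
    frame_rate: int


def parse_frame_rate_track_group_box(buf: bytes, frame_rate_size: int = 8) -> FrameRateTrackGroupBox:
    """Parse one extended TrackGroupTypeBox held entirely in `buf`."""
    header_size = 16  # size(4) + type(4) + version/flags(4) + track_group_id(4)
    if len(buf) < header_size + frame_rate_size:
        raise ValueError("buffer too short for the assumed box layout")
    size, box_type = struct.unpack_from(">I4s", buf, 0)
    if size != header_size + frame_rate_size:
        raise ValueError("box size does not match the assumed layout")
    _version_flags, track_group_id = struct.unpack_from(">II", buf, 8)
    # Assumed position: frame_rate immediately follows track_group_id.
    frame_rate = int.from_bytes(buf[header_size:header_size + frame_rate_size], "big")
    return FrameRateTrackGroupBox(box_type, track_group_id, frame_rate)


# Hand-built example box: track group 1, 30 fps.
example_box = struct.pack(">I4sII", 24, FRAME_RATE_GROUP_TYPE, 0, 1) + (30).to_bytes(8, "big")
parsed = parse_frame_rate_track_group_box(example_box)
```

Tracks whose boxes carry the same `track_group_id` would then be treated as one track group, with `frame_rate` distinguishing the members.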
- Step S330: According to the frame rate indication information carried in the file encapsulation information, select and decode the point cloud media track with the specified frame rate from the point cloud file.
- the file encapsulation information corresponding to each point cloud media track carries frame rate indication information of the point cloud media track, and the frame rate indication information identifies the frame rate of the point cloud media track in an explicit way.
- the data receiver can decode the point cloud media track with the specified frame rate according to the device performance and user requirements.
- the device performance of the data receiver can be collected and matched against the frame rate indication information carried in the file encapsulation information to determine the frame rate that matches the device performance of the data receiver; the point cloud media track with that frame rate is then selected from the point cloud file and decoded.
- Device capabilities may include at least one of hardware capabilities, software capabilities, and network capabilities.
- the hardware performance may include, for example, the device model, processor model, memory capacity, display size, etc. of the electronic device.
- the software performance may include, for example, the program version of the point cloud decoder installed by the data receiver.
- the network performance may include, for example, network bandwidth, network transfer status, etc.
- the frame rate selection rule configured by the data receiver can be obtained and matched against the frame rate indication information carried in the file encapsulation information to determine the frame rate that matches the configured rule; the point cloud media track with that frame rate is then selected from the point cloud file and decoded.
- the frame rate selection rule may be a selection rule configured according to user requirements for selecting point cloud data with a specified frame rate, such as selecting point cloud data with a frame rate greater than (or less than) a specified value according to user instructions.
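The two selection strategies described for step S330 can be sketched together. This is an illustrative sketch only: the `Track` type, the `select_track` function, and the idea of summarizing device performance as a single `max_device_fps` value are assumptions made for the example, and the "highest matching frame rate wins" policy is one possible choice rather than one prescribed by the text.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass(frozen=True)
class Track:
    track_id: int
    frame_rate: int  # fps, read from the frame rate indication field


def select_track(
    tracks: Sequence[Track],
    max_device_fps: int,
    rule: Optional[Callable[[int], bool]] = None,
) -> Optional[Track]:
    """Pick the highest-frame-rate track the device can handle and the rule accepts."""
    candidates = [t for t in tracks if t.frame_rate <= max_device_fps]
    if rule is not None:
        # Frame rate selection rule, e.g. "greater than a specified value".
        candidates = [t for t in candidates if rule(t.frame_rate)]
    return max(candidates, key=lambda t: t.frame_rate, default=None)


tracks = [Track(1, 60), Track(2, 30), Track(3, 30)]
low_end_choice = select_track(tracks, max_device_fps=30)
rule_choice = select_track(tracks, max_device_fps=120, rule=lambda fps: fps > 30)
no_choice = select_track(tracks, max_device_fps=20)
```

When several tracks share the best frame rate, `max` keeps the first one; the others remain available as substitutes.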
- FIG. 5 shows a flowchart of steps for receiving a point cloud file from a data source in an embodiment of the present application.
- receiving the point cloud file transmitted by the data source in step S310 may include the following steps S510 to S540.
- Step S510: Receive streaming media signaling sent by the data source for transmitting point cloud data.
- the streaming media signaling for transmitting point cloud data may be Dynamic Adaptive Streaming over HTTP (DASH) signaling.
- DASH is an adaptive bit rate streaming technology that enables high-quality streaming media to be delivered over the Internet through traditional HTTP web servers.
- Step S520: Parse the streaming media signaling to obtain a time-domain hierarchical group identifier carried in the streaming media signaling for identifying the track group, where the track group includes one or more point cloud media tracks with the same point cloud content, and the track group includes some point cloud media tracks with different frame rates.
- the frame rate indication information of the point cloud media track can be stored by using an existing field, for example, the frameRate field in the DASH signaling can be used to indicate the frame rates of various different point cloud media tracks.
- a group ID can be used to identify them in the DASH signaling, for example, the GPCC time-domain hierarchical group identifier (GPCCTemporalScaleGroupId).
- the group ID element is a child element of the AdaptationSet element.
- the GPCCTemporalScaleGroupId element may appear at the adaptation set level, but not at any other level.
- Table 1 shows the semantics and attributes of the GPCC time-domain hierarchical group ID in an embodiment of the present application.
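A client-side reading of this signaling can be sketched as follows. The sketch is hedged: the `frameRate` attribute and the placement of GPCCTemporalScaleGroupId as a child of AdaptationSet follow the description above, but the sample MPD, the use of the element's text content as the group ID, and its placement in the default MPD namespace are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict
from typing import Dict, List, Optional, Tuple

MPD_NS = {"mpd": "urn:mpeg:dash:schema:mpd:2011"}

# Illustrative manifest; element content and namespace placement are assumed.
SAMPLE_MPD = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <AdaptationSet id="1" frameRate="60">
      <GPCCTemporalScaleGroupId>1</GPCCTemporalScaleGroupId>
    </AdaptationSet>
    <AdaptationSet id="2" frameRate="30">
      <GPCCTemporalScaleGroupId>1</GPCCTemporalScaleGroupId>
    </AdaptationSet>
    <AdaptationSet id="3" frameRate="30">
      <GPCCTemporalScaleGroupId>1</GPCCTemporalScaleGroupId>
    </AdaptationSet>
  </Period>
</MPD>"""


def group_frame_rates(mpd_xml: str) -> Dict[str, List[Tuple[Optional[str], Optional[str]]]]:
    """Map each group ID to the (AdaptationSet id, frameRate) pairs that carry it."""
    root = ET.fromstring(mpd_xml)
    groups = defaultdict(list)
    for adaptation_set in root.iterfind(".//mpd:AdaptationSet", MPD_NS):
        group = adaptation_set.find("mpd:GPCCTemporalScaleGroupId", MPD_NS)
        if group is None or group.text is None:
            continue  # adaptation set is not part of any time-domain hierarchical group
        groups[group.text.strip()].append(
            (adaptation_set.get("id"), adaptation_set.get("frameRate"))
        )
    return dict(groups)


track_groups = group_frame_rates(SAMPLE_MPD)
```

The frame rate is kept as a string because DASH allows fractional forms such as `30000/1001`.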
- Step S530: Send a data transmission request to the data source according to the time-domain hierarchical group identifier.
- one or more of the frame rate indication information can be selected as the target frame rate, and a data transmission request corresponding to the target frame rate is further sent to the data source.
- the data transmission request is, for example, the first data transmission request.
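- the method for sending a data transmission request may include: obtaining the network bandwidth for data transmission with the data source; selecting, from the track group according to the time-domain hierarchical group identifier, one or more target point cloud media tracks having a target frame rate that matches the network bandwidth; and sending to the data source a data transmission request for requesting transmission of the target point cloud media tracks.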
- two or more target point cloud media tracks may be selected.
- a target point cloud media track can be selected.
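The bandwidth-based choice of target tracks can be sketched as below. This is one possible policy under stated assumptions, not the method fixed by the text: the `bandwidth_bps` value per track, the bit rates in the example, and the greedy rule (pick the highest affordable frame rate, then add further tracks of that frame rate while the budget allows) are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class TrackInfo:
    track_id: str
    frame_rate: int      # fps, from the streaming media signaling
    bandwidth_bps: int   # assumed per-track bit rate; not specified in the text


def pick_target_tracks(group: Sequence[TrackInfo], available_bps: int) -> List[TrackInfo]:
    """Choose target tracks from one track group that fit the network bandwidth."""
    affordable = [t for t in group if t.bandwidth_bps <= available_bps]
    if not affordable:
        return []
    target_fps = max(t.frame_rate for t in affordable)
    chosen: List[TrackInfo] = []
    budget = available_bps
    same_rate = sorted(
        (t for t in affordable if t.frame_rate == target_fps),
        key=lambda t: t.bandwidth_bps,
    )
    for track in same_rate:
        if track.bandwidth_bps <= budget:
            chosen.append(track)
            budget -= track.bandwidth_bps
    return chosen


GROUP = [
    TrackInfo("Track1", 60, 8_000_000),
    TrackInfo("Track2", 30, 4_000_000),
    TrackInfo("Track3", 30, 4_000_000),
]
high = [t.track_id for t in pick_target_tracks(GROUP, 10_000_000)]
medium = [t.track_id for t in pick_target_tracks(GROUP, 5_000_000)]
none = pick_target_tracks(GROUP, 3_000_000)
```

The returned list is what the data transmission request would name; it may contain one track or several tracks of the same frame rate.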
- Step S540: Receive the point cloud file corresponding to the data transmission request transmitted by the data source.
- the data source may transmit a corresponding one target point cloud media track to the data receiver based on the request.
- the data source may transmit the corresponding multiple target point cloud media tracks, with the same frame rate or different frame rates, to the data receiver based on the request.
- each point cloud media file can be replaced or merged to improve the display effect of the point cloud media files.
- when the point cloud media track to be displayed cannot be selected, fails to decode, or yields poor point cloud quality after decoding, other point cloud media tracks with the same frame rate can be used to perform track replacement.
- two point cloud media tracks with the same point cloud content at 30fps can be replaced with each other.
- when the network environment of the data receiver is good and the network bandwidth is high, multiple point cloud media tracks with the same frame rate can be merged to increase the display frame rate of the point cloud data.
- For example, two point cloud media tracks with the same point cloud content and a frame rate of 30fps can be merged to form a point cloud media track with a frame rate of 60fps, so that a better point cloud media file display effect is obtained by increasing the frame rate.
- the point cloud media track replacement method may include: obtaining the frame rate of the point cloud media track to be displayed; selecting from the point cloud file, according to the frame rate indication information carried in the file encapsulation information, other point cloud media tracks with the same frame rate as the point cloud media track to be displayed; and replacing the point cloud media track to be displayed with the other point cloud media tracks, so as to decode and display the other point cloud media tracks.
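The replacement method can be sketched as a fallback loop. This is a hedged illustration: `Track`, `DecodeError`, `decode_with_replacement`, and the stand-in decoder are hypothetical names, and only the decode-failure trigger is modeled here.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


class DecodeError(Exception):
    """Hypothetical error raised when a track cannot be decoded."""


@dataclass(frozen=True)
class Track:
    track_id: int
    frame_rate: int  # fps, read from the frame rate indication field


def decode_with_replacement(
    current: Track,
    tracks: Sequence[Track],
    decode: Callable[[Track], str],
) -> Tuple[Track, str]:
    """Decode `current`; on failure, fall back to other tracks with the same frame rate."""
    substitutes = [
        t for t in tracks
        if t.track_id != current.track_id and t.frame_rate == current.frame_rate
    ]
    for track in [current, *substitutes]:
        try:
            return track, decode(track)
        except DecodeError:
            continue  # try the next track with the same frame rate
    raise DecodeError(f"no decodable track at {current.frame_rate} fps")


def flaky_decode(track: Track) -> str:
    # Stand-in decoder: pretend track 2 is corrupted.
    if track.track_id == 2:
        raise DecodeError("corrupted track")
    return f"decoded track {track.track_id}"


tracks = [Track(1, 60), Track(2, 30), Track(3, 30)]
used, output = decode_with_replacement(tracks[1], tracks, flaky_decode)
```

Here track 2 fails, so track 3, which has the same 30fps frame rate, is decoded and displayed in its place.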
- the point cloud media track merging method may include: obtaining the frame rate of the point cloud media track to be displayed; selecting from the point cloud file, according to the frame rate indication information carried in the file encapsulation information, one or more other point cloud media tracks with the same frame rate as the point cloud media track to be displayed; and decoding the one or more other point cloud media tracks, and combining and displaying them with the point cloud media track to be displayed.
- the point cloud media track replacement method can also be implemented as follows: obtain the frame rate of the point cloud media track to be displayed; send a data transmission request (for example, the second data transmission request) to the data source to receive the supplementary point cloud file transmitted by the data source, where the supplementary point cloud file includes other point cloud media tracks with the same point cloud content and the same frame rate as the point cloud media track to be displayed; and replace the point cloud media track to be displayed with the other point cloud media tracks, so as to decode and display the other point cloud media tracks.
- the point cloud media track merging method can also be implemented as follows: obtain the frame rate of the point cloud media track to be displayed; send a data transmission request (for example, the third data transmission request) to the data source to receive a supplementary point cloud file transmitted by the data source, where the supplementary point cloud file includes one or more other point cloud media tracks having the same point cloud content and the same frame rate as the point cloud media track to be displayed; and decode the one or more other point cloud media tracks, and combine and display them with the point cloud media track to be displayed.
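Track merging can be illustrated with decoded frames ordered by presentation time. The sketch assumes that the two 30fps tracks carry complementary frames offset by half a frame interval, so that interleaving them yields 60fps; the text does not state how the frames of the tracks are aligned, so this alignment, the frame labels, and the function names are assumptions.

```python
import heapq
from fractions import Fraction
from typing import List, Tuple

Frame = Tuple[Fraction, str]  # (presentation time in seconds, frame label)


def merge_tracks(*tracks: List[Frame]) -> List[Frame]:
    """Interleave already time-ordered frame lists by presentation time."""
    return list(heapq.merge(*tracks, key=lambda frame: frame[0]))


def effective_frame_rate(frames: List[Frame]) -> Fraction:
    """Average frame rate over the span covered by `frames`."""
    if len(frames) < 2:
        raise ValueError("need at least two frames to measure a frame rate")
    span = frames[-1][0] - frames[0][0]
    return (len(frames) - 1) / span


# Two 30 fps tracks; the second is offset by 1/60 s (assumed alignment).
track_a = [(Fraction(i, 30), f"A{i}") for i in range(3)]
track_b = [(Fraction(1, 60) + Fraction(i, 30), f"B{i}") for i in range(3)]

merged = merge_tracks(track_a, track_b)
merged_labels = [label for _, label in merged]
merged_fps = effective_frame_rate(merged)
```

Exact fractions are used for timestamps so that the interleaving order and the resulting frame rate are not affected by floating-point rounding.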
- FIG. 6 shows a flowchart of steps of a point cloud encoding method in an embodiment of the present application.
- the point cloud encoding method can be applied to links such as a server, a client, and an intermediate node of a point cloud media system.
- the point cloud encoding method performed by the server device of the point cloud encoding apparatus is taken as an example.
- the point cloud encoding method may mainly include the following steps S610 to S620.
- Step S610: Encode the point cloud data to be transmitted according to different encoding standards to obtain multiple point cloud code streams with the same point cloud content, where the multiple point cloud code streams include some point cloud code streams with different frame rates.
- the point cloud data of a certain point cloud content can be encoded according to a variety of different encoding standards to obtain multiple point cloud code streams with the same frame rate or different frame rates.
- the coding standard may include parameter values of one or more quality parameters, and multiple different coding standards may be formed by combining different parameter values of various quality parameters. For example, when the quality parameters include code rate and frame rate, the code rate has two different values A1 and A2, and the frame rate has two different values B1 and B2, four encoding standards corresponding to the quality parameter values can be determined: A1B1, A1B2, A2B1, and A2B2.
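The combination of quality parameter values into encoding standards is a Cartesian product, which can be shown in a few lines. The function name is hypothetical; the labels A1, A2, B1, and B2 are the ones used in the example above.

```python
from itertools import product
from typing import List, Sequence


def encoding_standards(code_rates: Sequence[str], frame_rates: Sequence[str]) -> List[str]:
    """Combine every code rate value with every frame rate value."""
    return [f"{code_rate}{frame_rate}" for code_rate, frame_rate in product(code_rates, frame_rates)]


standards = encoding_standards(["A1", "A2"], ["B1", "B2"])
```

With m code rate values and n frame rate values, this yields m × n encoding standards.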
- Step S620: Encapsulate the multiple point cloud code streams into multiple point cloud media tracks, and fill the multiple point cloud media tracks with frame rate indication information corresponding to the multiple point cloud code streams, where the frame rate indication information is used to indicate the frame rates of the multiple point cloud media tracks.
- Each point cloud media track has corresponding file encapsulation information.
- the file encapsulation information can be an ISOBMFF data box generated when the point cloud code stream is encapsulated into a point cloud media track; for example, it can be the extended track group data box TrackGroupTypeBox.
- the frame rate indication field corresponding to the frame rate indication information is determined in the file encapsulation information of the point cloud media track; then, according to the frame rate of the point cloud code stream, the frame rate indication information corresponding to the point cloud code stream is filled into the frame rate indication field in the file encapsulation information.
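Filling the frame rate indication field at encapsulation time can be sketched as serializing one box per code stream. This is a sketch under assumptions: it uses the standard ISOBMFF TrackGroupTypeBox header (FullBox plus a 32-bit `track_group_id`) and appends `frame_rate` after it; the four-character code `frgp`, the function name, and the field width parameter are hypothetical. The stream names and frame rates mirror the application scenario described for FIG. 7.

```python
import struct
from typing import Dict

# Hypothetical four-character code; the real track group type is not given in this text.
FRAME_RATE_GROUP_TYPE = b"frgp"


def build_frame_rate_track_group_box(
    track_group_id: int,
    frame_rate: int,
    frame_rate_size: int = 8,
) -> bytes:
    """Serialize an extended TrackGroupTypeBox carrying a frame_rate field."""
    # FullBox version/flags (all zero), then track_group_id.
    body = struct.pack(">II", 0, track_group_id)
    # Assumed position: frame_rate immediately follows track_group_id.
    body += frame_rate.to_bytes(frame_rate_size, "big")
    # Box header: total size (header included) and four-character type.
    return struct.pack(">I4s", 8 + len(body), FRAME_RATE_GROUP_TYPE) + body


# One box per code stream; all three streams share track group 1.
stream_frame_rates: Dict[str, int] = {"S1": 60, "S2": 30, "S3": 30}
boxes = {
    name: build_frame_rate_track_group_box(track_group_id=1, frame_rate=fps)
    for name, fps in stream_frame_rates.items()
}
```

A real encapsulator would place each box inside the track's TrackGroupBox; that surrounding structure is omitted here.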
- streaming media signaling for transmitting point cloud data can be generated according to data transmission requirements; in an embodiment, the streaming media signaling can be the above DASH signaling.
- the streaming media signaling is filled with a time-domain hierarchical group identifier for identifying a track group, where the track group includes one or more point cloud media tracks with the same point cloud content, and the track group includes some point cloud media tracks with different frame rates.
- streaming media signaling is sent to the data receiver for point cloud data transmission between the data source and the data receiver.
- the data source may receive a data transmission request sent by the data receiver and generated based on the streaming media signaling, and, according to the data transmission request, transmit the point cloud file to the data receiver; the point cloud file includes one or more point cloud media tracks with the same point cloud content, and the point cloud file includes some point cloud media tracks with different frame rates.
- FIG. 7 shows a flowchart of steps for encoding and decoding point cloud data in an application scenario according to an embodiment of the present application.
- the server is used as the data source for producing point cloud data
- the method for transmitting and encoding and decoding point cloud data between the server and the client where the user resides may include the following steps.
- Step S701: Encode the point cloud content A on the server to obtain three point cloud code streams S1, S2 and S3 corresponding to two different frame rates.
- the frame rate of point cloud code stream S1 is 60fps
- the frame rate of point cloud code stream S2 is 30fps
- the frame rate of point cloud code stream S3 is also 30fps.
- Step S703: Use the frameRate field in the DASH signaling to indicate the frame rate of each point cloud media track, use the time-domain hierarchical group identifier GPCCTemporalScaleGroupId in the DASH signaling to indicate a track group composed of multiple point cloud media tracks, and send the DASH signaling to the clients C1 and C2 where the users are located.
- Step S704: The clients C1 and C2 request the point cloud file according to the network bandwidth and the information in the DASH signaling.
- the point cloud file requested by C1 includes the point cloud media track Track1; the point cloud file requested by C2 includes the point cloud media track Track2.
- Step S705: The server transmits the point cloud files to the clients C1 and C2 respectively.
- the point cloud file transmitted by the server to the client C1 includes the point cloud media track Track1; the point cloud file transmitted to the client C2 includes the point cloud media track Track2.
- Step S706: Each client receives its point cloud file, and decodes and displays the corresponding point cloud media track according to the frame rate field information in Track1 and Track2.
- the client C2 can further request the point cloud media track Track3, and display and consume the point cloud media track Track2 and the point cloud media track Track3 together, so as to achieve the point cloud display effect with a frame rate of 60fps.
- FIG. 8 shows a structural block diagram of a point cloud decoding apparatus in an embodiment of the present application.
- the point cloud decoding apparatus 800 may mainly include: a receiving module 810 configured to receive a point cloud file transmitted by a data source, where the point cloud file includes one or more point cloud media tracks with the same point cloud content, and the point cloud file includes some point cloud media tracks with different frame rates; a parsing module 820 configured to parse the file encapsulation information of the one or more point cloud media tracks to obtain the frame rate indication information carried in the file encapsulation information, where the frame rate indication information is used to indicate the frame rate of the one or more point cloud media tracks; and a decoding module 830 configured to select and decode, from the point cloud file, the point cloud media track with the specified frame rate according to the frame rate indication information carried in the file encapsulation information.
- the receiving module 810 includes: a signaling receiving unit configured to receive streaming media signaling sent by a data source for transmitting point cloud data; a signaling parsing unit configured to parse the streaming media signaling to obtain a time-domain hierarchical group identifier carried in the streaming media signaling and used to identify a track group, where the track group includes one or more point cloud media tracks with the same point cloud content, and the track group includes some point cloud media tracks with different frame rates; a request sending unit configured to send a first data transmission request to the data source according to the time-domain hierarchical group identifier; and a file receiving unit configured to receive a point cloud file transmitted by the data source and corresponding to the first data transmission request.
- the request sending unit includes: a bandwidth acquisition subunit configured to acquire the network bandwidth for data transmission with the data source; a track selection subunit configured to select, from the track group according to the time-domain hierarchical group identifier, one or more target point cloud media tracks with a target frame rate matching the network bandwidth; and a request sending subunit configured to send to the data source a first data transmission request for requesting transmission of the one or more target point cloud media tracks.
- the parsing module 820 includes: an information parsing unit configured to parse the file encapsulation information of the one or more point cloud media tracks to determine the frame rate indication field corresponding to the frame rate of the one or more point cloud media tracks; and an information determination unit configured to determine the frame rate indication information of the one or more point cloud media tracks according to the value of the frame rate indication field.
- the point cloud decoding apparatus 800 further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a track selection module configured to select, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, other point cloud media tracks with the same frame rate as the point cloud media track to be displayed; and a first replacement module configured to replace the point cloud media track to be displayed with the other point cloud media tracks, so as to decode and display the other point cloud media tracks.
- the point cloud decoding apparatus 800 further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a track selection module configured to select, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, one or more other point cloud media tracks with the same frame rate as the point cloud media track to be displayed; and a first merging module configured to decode the one or more other point cloud media tracks, and to combine and display the one or more other point cloud media tracks with the point cloud media track to be displayed.
- the point cloud decoding apparatus 800 further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a file acquisition module configured to send a second data transmission request to the data source to receive a supplementary point cloud file transmitted by the data source, where the supplementary point cloud file includes other point cloud media tracks with the same point cloud content and the same frame rate as the point cloud media track to be displayed; and a second replacement module configured to replace the point cloud media track to be displayed with the other point cloud media tracks, so as to decode and display the other point cloud media tracks.
- the point cloud decoding apparatus 800 further includes: a frame rate acquisition module configured to acquire the frame rate of the point cloud media track to be displayed; a file acquisition module configured to send a third data transmission request to the data source to receive a supplementary point cloud file transmitted by the data source, where the supplementary point cloud file includes one or more other point cloud media tracks with the same point cloud content and the same frame rate as the point cloud media track to be displayed; and a second merging module configured to decode the one or more other point cloud media tracks, and to combine and display the one or more other point cloud media tracks with the point cloud media track to be displayed.
- FIG. 9 shows a structural block diagram of a point cloud encoding apparatus in an embodiment of the present application.
- the point cloud encoding apparatus 900 may mainly include: an encoding module 910 configured to encode the point cloud data to be transmitted according to different encoding standards to obtain multiple point cloud code streams with the same point cloud content, where the multiple point cloud code streams include some point cloud code streams with different frame rates; and an encapsulation module 920 configured to encapsulate the multiple point cloud code streams into multiple point cloud media tracks, and to fill the multiple point cloud media tracks with frame rate indication information corresponding to the multiple point cloud code streams, where the frame rate indication information is used to indicate the frame rates of the multiple point cloud media tracks.
- the point cloud encoding apparatus 900 further includes: a signaling generation module configured to generate streaming media signaling for transmitting point cloud data; a signaling filling module configured to fill the streaming media signaling with a time-domain hierarchical group identifier for identifying a track group, where the track group includes one or more point cloud media tracks with the same point cloud content, and the track group includes some point cloud media tracks with different frame rates; and a signaling sending module configured to send the streaming media signaling to the data receiver.
- the point cloud encoding apparatus 900 further includes: a request receiving module configured to receive a data transmission request sent by the data receiver and generated based on the streaming media signaling; and a file transmission module configured to transmit a point cloud file to the data receiver according to the data transmission request, where the point cloud file includes one or more point cloud media tracks with the same point cloud content, and the point cloud file includes some point cloud media tracks with different frame rates.
- the encapsulation module 920 includes: an information determination unit configured to determine, in the file encapsulation information of the multiple point cloud media tracks, a frame rate indication field corresponding to the frame rate indication information; and an information filling unit configured to fill the frame rate indication field in the file encapsulation information with the frame rate indication information corresponding to the multiple point cloud code streams.
- FIG. 10 schematically shows a structural block diagram of a computer system for implementing an electronic device according to an embodiment of the present application.
- the computer system 1000 includes a central processing unit 1001 (Central Processing Unit, CPU), which can perform various appropriate actions and processes according to a program stored in a read-only memory 1002 (Read-Only Memory, ROM) or a program loaded from a storage section 1008 into a random access memory 1003 (Random Access Memory, RAM). In the random access memory 1003, various programs and data necessary for system operation are also stored.
- the central processing unit 1001 , the read-only memory 1002 and the random access memory 1003 are connected to each other through a bus 1004 .
- An input/output interface 1005 (Input/Output interface, ie, I/O interface) is also connected to the bus 1004 .
- the following components are connected to the input/output interface 1005: an input section 1006 including a keyboard, a mouse, etc.; an output section 1007 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and a speaker, etc.; a storage section 1008 including a hard disk, etc.; and a communication section 1009 including a network interface card such as a local area network card, a modem, and the like.
- the communication section 1009 performs communication processing via a network such as the Internet.
- a drive 1010 is also connected to the input/output interface 1005 as required.
- a removable medium 1011 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is mounted on the drive 1010 as needed so that a computer program read therefrom is installed into the storage section 1008 as needed.
- embodiments of the present application include a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
- the computer program may be downloaded and installed from the network via the communication section 1009, and/or installed from the removable medium 1011.
- when the computer program is executed by the central processing unit 1001, various functions defined in the system of the present application are executed.
- the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
- the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
- Computer-readable storage media may include, but are not limited to, an electrical connection with one or more wires, a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), Erasable Programmable Read-Only Memory (EPROM), flash memory, an optical fiber, a portable Compact Disc Read-Only Memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
- a computer-readable storage medium can be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
- a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code therein.
- Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
- a computer-readable signal medium can also be any computer-readable medium, other than a computer-readable storage medium, that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
- Program code embodied on a computer-readable medium may be transmitted using any suitable medium, including but not limited to wireless, wired, etc., or any suitable combination of the foregoing.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Claims (15)
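- A point cloud decoding method, characterized by comprising: receiving a point cloud file transmitted by a data source, the point cloud file comprising one or more point cloud media tracks having the same point cloud content, and the point cloud file comprising point cloud media tracks some of which have different frame rates; parsing file encapsulation information of the one or more point cloud media tracks to obtain frame rate indication information carried in the file encapsulation information, the frame rate indication information being used to indicate the frame rates of the one or more point cloud media tracks; and selecting, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, a point cloud media track having a specified frame rate, and decoding the selected point cloud media track.
- The point cloud decoding method according to claim 1, characterized in that receiving the point cloud file transmitted by the data source comprises: receiving streaming media signaling sent by the data source for transmitting point cloud data; parsing the streaming media signaling to obtain a temporal level group identifier that is carried in the streaming media signaling and identifies a track group, the track group comprising one or more point cloud media tracks having the same point cloud content, and the track group comprising point cloud media tracks some of which have different frame rates; sending a first data transmission request to the data source according to the temporal level group identifier; and receiving the point cloud file that is transmitted by the data source and corresponds to the first data transmission request.
- The point cloud decoding method according to claim 2, characterized in that sending the first data transmission request to the data source according to the temporal level group identifier comprises: obtaining the network bandwidth for data transmission with the data source; selecting, from the track group according to the temporal level group identifier, one or more target point cloud media tracks that have a target frame rate matching the network bandwidth; and sending, to the data source, the first data transmission request for requesting transmission of the one or more target point cloud media tracks.
- The point cloud decoding method according to claim 1, characterized in that parsing the file encapsulation information of the one or more point cloud media tracks to obtain the frame rate indication information carried in the file encapsulation information comprises: parsing the file encapsulation information of the one or more point cloud media tracks to determine a frame rate indication field corresponding to the frame rates of the one or more point cloud media tracks; and determining the frame rate indication information of the one or more point cloud media tracks according to the value of the frame rate indication field.
- The point cloud decoding method according to any one of claims 1 to 4, characterized in that, after selecting and decoding, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, the point cloud media track having the specified frame rate, the method further comprises: obtaining the frame rate of a point cloud media track to be presented; selecting, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, another point cloud media track having the same frame rate as the point cloud media track to be presented; and replacing the point cloud media track to be presented with the other point cloud media track, so as to decode and present the other point cloud media track.
- The point cloud decoding method according to any one of claims 1 to 4, characterized in that, after selecting and decoding, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, the point cloud media track having the specified frame rate, the method further comprises: obtaining the frame rate of a point cloud media track to be presented; selecting, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, one or more other point cloud media tracks having the same frame rate as the point cloud media track to be presented; and decoding the one or more other point cloud media tracks, and presenting the one or more other point cloud media tracks merged with the point cloud media track to be presented.
- The point cloud decoding method according to any one of claims 1 to 4, characterized in that, after selecting and decoding, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, the point cloud media track having the specified frame rate, the method further comprises: obtaining the frame rate of a point cloud media track to be presented; sending a second data transmission request to the data source so as to receive a supplementary point cloud file transmitted by the data source, the supplementary point cloud file comprising another point cloud media track having the same point cloud content and the same frame rate as the point cloud media track to be presented; and replacing the point cloud media track to be presented with the other point cloud media track, so as to decode and present the other point cloud media track.
- The point cloud decoding method according to any one of claims 1 to 4, characterized in that, after selecting and decoding, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, the point cloud media track having the specified frame rate, the method further comprises: obtaining the frame rate of a point cloud media track to be presented; sending a third data transmission request to the data source so as to receive a supplementary point cloud file transmitted by the data source, the supplementary point cloud file comprising one or more other point cloud media tracks having the same point cloud content and the same frame rate as the point cloud media track to be presented; and decoding the one or more other point cloud media tracks, and presenting the one or more other point cloud media tracks merged with the point cloud media track to be presented.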
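- A point cloud encoding method, characterized by comprising: encoding point cloud data to be transmitted according to different encoding standards to obtain a plurality of point cloud bitstreams having the same point cloud content, the plurality of point cloud bitstreams comprising point cloud bitstreams some of which have different frame rates; and encapsulating the plurality of point cloud bitstreams into a plurality of point cloud media tracks, and filling the plurality of point cloud media tracks with frame rate indication information corresponding to the plurality of point cloud bitstreams, the frame rate indication information being used to indicate the frame rates of the plurality of point cloud media tracks.
- The point cloud encoding method according to claim 9, characterized in that, after encapsulating the plurality of point cloud bitstreams into the plurality of point cloud media tracks, the method further comprises: generating streaming media signaling for transmitting point cloud data; filling the streaming media signaling with a temporal level group identifier that identifies a track group, the track group comprising one or more point cloud media tracks having the same point cloud content, and the track group comprising point cloud media tracks some of which have different frame rates; and sending the streaming media signaling to a data receiver.
- The point cloud encoding method according to claim 10, characterized in that, after sending the streaming media signaling to the data receiver, the method further comprises: receiving a data transmission request that is sent by the data receiver and generated based on the streaming media signaling; and transmitting a point cloud file to the data receiver according to the data transmission request, the point cloud file comprising one or more point cloud media tracks having the same point cloud content, and the point cloud file comprising point cloud media tracks some of which have different frame rates.
- The point cloud decoding method according to any one of claims 9 to 11, characterized in that filling the plurality of point cloud media tracks with the frame rate indication information corresponding to the plurality of point cloud bitstreams comprises: determining, in the file encapsulation information of the plurality of point cloud media tracks, a frame rate indication field corresponding to the frame rate indication information; and filling the frame rate indication information corresponding to the plurality of point cloud bitstreams into the frame rate indication field in the file encapsulation information.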
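- A point cloud decoding apparatus, characterized by comprising: a receiving module configured to receive a point cloud file transmitted by a data source, the point cloud file comprising one or more point cloud media tracks having the same point cloud content, and the point cloud file comprising point cloud media tracks some of which have different frame rates; a parsing module configured to parse file encapsulation information of the one or more point cloud media tracks to obtain frame rate indication information carried in the file encapsulation information, the frame rate indication information being used to indicate the frame rates of the one or more point cloud media tracks; and a decoding module configured to select, from the point cloud file according to the frame rate indication information carried in the file encapsulation information, a point cloud media track having a specified frame rate, and to decode the selected point cloud media track.
- A point cloud encoding apparatus, characterized by comprising: an encoding module configured to encode point cloud data to be transmitted according to different encoding standards to obtain a plurality of point cloud bitstreams having the same point cloud content, the plurality of point cloud bitstreams comprising point cloud bitstreams some of which have different frame rates; and an encapsulation module configured to encapsulate the plurality of point cloud bitstreams into a plurality of point cloud media tracks and to fill the plurality of point cloud media tracks with frame rate indication information corresponding to the plurality of point cloud bitstreams, the frame rate indication information being used to indicate the frame rates of the plurality of point cloud media tracks.
- An electronic device, characterized by comprising: a processor; and a memory for storing instructions executable by the processor; wherein the processor is configured to perform, by executing the executable instructions, the method according to any one of claims 1 to 12.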
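The decoder-side selection in claims 1 and 3 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the `PointCloudTrack` class, its field names, the per-track `bitrate_kbps` value, and the rule "pick the highest frame rate that fits the bandwidth" are all assumptions made for illustration, since the claims state only that a frame rate indication field exists and that the target frame rate matches the network bandwidth.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PointCloudTrack:
    """Hypothetical view of one track's file encapsulation information."""
    track_id: int
    frame_rate: int     # value read from the frame rate indication field
    bitrate_kbps: int   # assumed to be known to the client; not part of the claims


def select_by_frame_rate(tracks: List[PointCloudTrack],
                         frame_rate: int) -> List[PointCloudTrack]:
    """Claim 1 style step: keep tracks whose indicated frame rate matches."""
    return [t for t in tracks if t.frame_rate == frame_rate]


def pick_target_frame_rate(tracks: List[PointCloudTrack],
                           bandwidth_kbps: int) -> Optional[int]:
    """Claim 3 style step (assumed rule): highest frame rate that fits the bandwidth."""
    affordable = [t for t in tracks if t.bitrate_kbps <= bandwidth_kbps]
    if not affordable:
        return None  # no track in the group fits the measured bandwidth
    return max(t.frame_rate for t in affordable)


# Tracks with the same point cloud content but partly different frame rates.
tracks = [
    PointCloudTrack(track_id=1, frame_rate=15, bitrate_kbps=4000),
    PointCloudTrack(track_id=2, frame_rate=30, bitrate_kbps=8000),
    PointCloudTrack(track_id=3, frame_rate=60, bitrate_kbps=16000),
    PointCloudTrack(track_id=4, frame_rate=30, bitrate_kbps=9000),
]

target = pick_target_frame_rate(tracks, bandwidth_kbps=10000)
selected = select_by_frame_rate(tracks, target)
print(target, [t.track_id for t in selected])  # prints: 30 [2, 4]
```

With 10000 kbps available, the 60 fps track is excluded and both 30 fps tracks are selected; a client could then decode one of them, or request them from the data source as in claim 3.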
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
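KR1020237031030A KR20230144620A (ko) | 2021-04-22 | 2022-03-11 | Point cloud encoding and decoding method, point cloud encoding and decoding apparatus, computer-readable medium, and electronic device |
JP2023552564A JP2024508865A (ja) | 2021-04-22 | 2022-03-11 | Point cloud encoding and decoding method, apparatus, and electronic device |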
US17/982,927 US20230061573A1 (en) | 2021-04-22 | 2022-11-08 | Point Cloud Encoding and Decoding Method and Apparatus, Computer-Readable Medium, and Electronic Device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110437255.4 | 2021-04-22 | ||
CN202110437255.4A CN115243053B (zh) | 2021-04-22 | 2021-04-22 | Point cloud encoding and decoding method and related device |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/982,927 Continuation US20230061573A1 (en) | 2021-04-22 | 2022-11-08 | Point Cloud Encoding and Decoding Method and Apparatus, Computer-Readable Medium, and Electronic Device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022222641A1 (zh) | 2022-10-27 |
Family ID=83666808
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2022/080266 WO2022222641A1 (zh) | 2021-04-22 | 2022-03-11 | 点云编解码方法、装置、计算机可读介质以及电子设备 |
Country Status (6)
Country | Link |
---|---|
US (1) | US20230061573A1 (zh) |
JP (1) | JP2024508865A (zh) |
KR (1) | KR20230144620A (zh) |
CN (1) | CN115243053B (zh) |
TW (1) | TWI803274B (zh) |
WO (1) | WO2022222641A1 (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20190139266A1 (en) * | 2017-11-09 | 2019-05-09 | Samsung Electronics Co., Ltd. | Point cloud compression using non-orthogonal projection |
WO2020060813A1 (en) * | 2018-09-18 | 2020-03-26 | Vid Scale, Inc. | Methods and apparatus for point cloud compression bitstream format |
US20200132822A1 (en) * | 2018-10-29 | 2020-04-30 | Dji Technology, Inc. | User interface for displaying point clouds generated by a lidar device on a uav |
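WO2020137642A1 (ja) * | 2018-12-28 | 2020-07-02 | Sony Corporation | Information processing device and information processing method |
CN114079781A (zh) * | 2020-08-18 | 2022-02-22 | Tencent Technology (Shenzhen) Co., Ltd. | Data processing method, apparatus, device, and storage medium for point cloud media |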
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110012279B (zh) * | 2018-01-05 | 2020-11-17 | Shanghai Jiao Tong University | View-partitioned compression and transmission method and system based on 3D point cloud data |
US10984541B2 (en) * | 2018-04-12 | 2021-04-20 | Samsung Electronics Co., Ltd. | 3D point cloud compression systems for delivery and access of a subset of a compressed 3D point cloud |
US11049266B2 (en) * | 2018-07-31 | 2021-06-29 | Intel Corporation | Point cloud viewpoint and scalable compression/decompression |
CN113170238B (zh) * | 2018-09-12 | 2023-08-01 | Nokia Technologies Oy | Apparatus, method, and computer program for video encoding and decoding |
WO2020070379A1 (en) * | 2018-10-03 | 2020-04-09 | Nokia Technologies Oy | Method and apparatus for storage and signaling of compressed point clouds |
US11457231B2 (en) * | 2019-03-15 | 2022-09-27 | Mediatek Singapore Pte. Ltd. | Methods and apparatus for signaling spatial relationships for point cloud multimedia data tracks |
US11581022B2 (en) * | 2019-05-29 | 2023-02-14 | Nokia Technologies Oy | Method and apparatus for storage and signaling of compressed point clouds |
US11704867B2 (en) * | 2019-09-27 | 2023-07-18 | Intel Corporation | Methods for timed metadata priority rank signaling for point clouds |
WO2021022266A2 (en) * | 2019-10-07 | 2021-02-04 | Futurewei Technologies, Inc. | Video-based point cloud compression (v-pcc) timing information |
US11302063B2 (en) * | 2020-07-21 | 2022-04-12 | Facebook Technologies, Llc | 3D conversations in an artificial reality environment |
2021
- 2021-04-22: CN application CN202110437255.4A, publication CN115243053B (zh), status: Active
2022
- 2022-03-11: JP application JP2023552564A, publication JP2024508865A (ja), status: Pending
- 2022-03-11: KR application KR1020237031030A, publication KR20230144620A (ko), status: unknown
- 2022-03-11: WO application PCT/CN2022/080266, publication WO2022222641A1 (zh), status: Application Filing
- 2022-04-07: TW application TW111113336A, publication TWI803274B (zh), status: Active
- 2022-11-08: US application US17/982,927, publication US20230061573A1 (en), status: Pending
Also Published As
Publication number | Publication date |
---|---|
JP2024508865A (ja) | 2024-02-28 |
KR20230144620A (ko) | 2023-10-16 |
CN115243053B (zh) | 2024-04-16 |
US20230061573A1 (en) | 2023-03-02 |
TWI803274B (zh) | 2023-05-21 |
TW202243481A (zh) | 2022-11-01 |
CN115243053A (zh) | 2022-10-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20190158933A1 (en) | Method, device, and computer program for improving streaming of virtual reality media content | |
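CN112804256B (zh) | Method, apparatus, medium, and device for processing track data in a multimedia file | |
CN114697668B (zh) | Encoding and decoding method for point cloud media and related products | |
WO2023029858A1 (zh) | Encapsulation and decapsulation method and apparatus for point cloud media files, and storage medium | |
EP3637722A1 (en) | Method and apparatus for processing media information | |
CN115396645A (zh) | Data processing method, apparatus, device, and storage medium for immersive media | |
WO2022206200A1 (zh) | Point cloud encoding and decoding method and apparatus, computer-readable medium, and electronic device | |
WO2023226504A1 (zh) | Media data processing method, apparatus, device, and readable storage medium | |
CN115396647B (zh) | Data processing method, apparatus, device, and storage medium for immersive media | |
WO2022222641A1 (zh) | Point cloud encoding and decoding method and apparatus, computer-readable medium, and electronic device | |
CN115150368B (zh) | Association processing method and apparatus for media files, medium, and electronic device | |
WO2023169003A1 (zh) | Decoding method for point cloud media, and encoding method and apparatus for point cloud media | |
WO2022134962A1 (zh) | Point cloud viewport presentation method and apparatus, computer-readable medium, and electronic device | |
US20240129537A1 (en) | Method and apparatus for signaling cmaf switching sets in isobmff | |
KR102661694B1 (ko) | Media file encapsulation method, media file decapsulation method, and related device | |
WO2023024843A1 (zh) | Media file encapsulation and decapsulation method, device, and storage medium | |
WO2023169004A1 (zh) | Data processing method, apparatus, device, and medium for point cloud media | |
CN116455880A (zh) | Streaming media transmission method and related products | |
CN116347118A (zh) | Data processing method for immersive media and related device |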
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 22790750; Country of ref document: EP; Kind code of ref document: A1 |
| | WWE | WIPO information: entry into national phase | Ref document number: 2023552564; Country of ref document: JP |
| | ENP | Entry into the national phase | Ref document number: 20237031030; Country of ref document: KR; Kind code of ref document: A |
| | WWE | WIPO information: entry into national phase | Ref document number: 1020237031030; Country of ref document: KR |
| | NENP | Non-entry into the national phase | Ref country code: DE |