WO2021087819A1 - Information processing method, terminal device and storage medium - Google Patents
Information processing method, terminal device and storage medium Download PDFInfo
- Publication number
- WO2021087819A1 WO2021087819A1 PCT/CN2019/116055 CN2019116055W WO2021087819A1 WO 2021087819 A1 WO2021087819 A1 WO 2021087819A1 CN 2019116055 W CN2019116055 W CN 2019116055W WO 2021087819 A1 WO2021087819 A1 WO 2021087819A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- depth information
- video image
- original depth
- information
- code stream
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
Definitions
- the video image code stream is obtained by combining and encoding original depth information and video image data, and the original depth information is obtained when the depth information of the target object is obtained through the depth information sensor, so
- the video image data is of the target object acquired by an image sensor, and the original depth information represents a collection state of the depth information collected by the depth information sensor or information other than the collected depth information;
- FIG. 13 is a schematic diagram of an optional structure of an electronic device provided by an embodiment of the present invention.
- the original depth information and image video data are encoded using the video image encoding and decoding protocol
- the depth information collected by the depth information sensor is encoded using the video image encoding and decoding protocol.
- the data carried by the video image information includes: Original depth information, depth information and video image data.
- the encoding end only merges and encodes the original depth information corresponding to the designated image frame and the video image data, and does not encode the original depth information corresponding to non-designated video frames other than the designated image frame in the image frame corresponding to the video image data.
- the original depth information and the spatial correlation or temporal correlation between the image and video data are used to encode the original depth information to obtain the first encoded information, and the video image data is encoded to obtain Second encoding information, and writing the first encoding information into the designated position of the first encoding information to obtain a video image code stream.
- the original depth information and the video image data are combined and coded, which can be executed as S203B: the original depth information and the video image data that have undergone redundancy elimination processing are combined and coded to obtain a video image code stream.
- the specified depth is a range of the scene-sensitive depth where the target object is located
- the original depth information is redundantly eliminated based on the specified depth
- Image processing is performed on the original depth information and the video image by an image processor to obtain a target video image.
- the decoder performs independent decoding or hybrid decoding on the video image stream to obtain the original depth information of the interval viewpoint And the video image of at least one viewpoint; difference the original depth information of the interval viewpoint to obtain the original depth information of other viewpoints in at least one viewpoint except the interval viewpoint; use the original depth information of the interval viewpoint and the original depth information of other viewpoints , Perform image processing on the video image to obtain the target video image.
- the performing image processing on the original depth information and the video image to obtain a target video image includes: The video image is deblurred to obtain the target video image.
- the depth image generator and the video image decoder are independent of each other.
- the depth image generator is integrated in the video image decoder.
- the depth image generator and the image processor are integrated in the video image decoder.
- the video image code stream is input to the video image decoder, and the video image decoder outputs the target video image and the depth image.
- the decoding end decodes the video image code stream to obtain a video image corresponding to the original depth information and the video image data.
- the decoding end performs image processing on the original depth information and the video image to obtain a target video image.
- Using the continuous modulation TOF method under two different transmission signal frequencies, by controlling the integration time, a total of 8 groups of optical signals with different phases are obtained through the TOF sensor sampling, and the 8 groups of optical signals are photoelectrically converted to obtain 8 groups Charge signal, and then perform 10-bit quantization of these 8 groups of charge signals to generate 8 original charge images; the decoding end encodes these 8 original charge images together with the TOF sensor's temperature and other attribute parameters as original depth information; or Eight original charge images are preprocessed to generate two process depth data and one background data, and the two process depth data and one background data are encoded as the original depth information.
- the encoding unit 1103 is further configured to:
- the preprocessing unit is configured as:
- redundancy elimination processing is performed on the original depth information to eliminate redundant information in the original depth information.
- the decoding unit 1202 is configured to decode the video image code stream to obtain the original depth information and the video image corresponding to the video image data;
- processing unit 1203 is further configured to:
- the volatile memory may be a random access memory (RAM, Random Access Memory), which is used as an external cache.
- RAM random access memory
- SRAM static random access memory
- SSRAM synchronous static random access memory
- Synchronous Static Random Access Memory Synchronous Static Random Access Memory
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
Abstract
Disclosed is an information processing method, comprising: in the case that depth information of a target object is obtained by means of a depth information sensor, obtaining original depth information corresponding to the depth information, the original depth information representing an acquisition state of the depth information acquired by the depth information sensor or information other than the acquired depth information; obtaining video image data of the target object by means of an image sensor; and merging and encoding the original depth information and the video image data to obtain a video image code stream, and outputting the video image code stream. Also disclosed are another information processing method, a terminal device and a storage medium.
Description
本发明涉及计算机技术,尤其涉及一种信息处理方法、终端设备及存储介质。The present invention relates to computer technology, in particular to an information processing method, terminal equipment and storage medium.
在当今社会,越来越多的终端上都设置有摄像装置,从而方便用户可以随时随地的拍照或拍视频。在实际运用中,编码端通过现有的摄像装置采用飞行时间技术(Time Of Light,TOF)摄像头、双目摄像头等深度信息传感器获取目标对象的深度信息,在解码端通过深度信息进行目标对象的深度图像的恢复。但深度图像仅提供了目标对象的深度信息,并不能提高目标对象的视频图像的图像质量。In today's society, more and more terminals are equipped with camera devices, so that users can take pictures or videos anytime and anywhere. In actual application, the encoder uses the existing camera device to use Time Of Light (TOF) cameras, binocular cameras and other depth information sensors to obtain the depth information of the target object. The depth information is used on the decoder to perform the target object's depth information. Depth image restoration. However, the depth image only provides the depth information of the target object, and cannot improve the image quality of the video image of the target object.
发明内容Summary of the invention
本发明实施例提供一种信息处理方法、终端设备及存储介质,能够提高目标对象的视频图像的图像质量。The embodiment of the present invention provides an information processing method, terminal device and storage medium, which can improve the image quality of the video image of the target object.
第一方面,本发明实施例提供一种信息处理方法,包括:In the first aspect, an embodiment of the present invention provides an information processing method, including:
通过深度信息传感器获取目标对象的深度信息的情况下,获取所述深度信息对应的原始深度信息,所述原始深度信息表征所述深度信息传感器采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;In the case of acquiring the depth information of the target object through the depth information sensor, the original depth information corresponding to the depth information is acquired, and the original depth information represents the acquisition state of the depth information collected by the depth information sensor or the collected depth information. Information other than in-depth information;
通过图像传感器获取所述目标对象的视频图像数据;Acquiring video image data of the target object through an image sensor;
对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,并输出所述视频图像码流。The original depth information and the video image data are combined and encoded to obtain a video image code stream, and the video image code stream is output.
第二方面,本发明实施例提供一种信息处理方法,包括:In the second aspect, an embodiment of the present invention provides an information processing method, including:
接收视频图像码流,所述视频图像码流为对原始深度信息和视频图像数据进行合并编码得到的,所述原始深度信息是通过深度信息传感器获取目标对象的深度信息的情况下获取的,所述视频图像数据是通过图像传感器获取的所述目标对象的,所述原始深度信息表征所述深度信息传感器采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;Receive a video image code stream, the video image code stream is obtained by combining and encoding original depth information and video image data, and the original depth information is obtained when the depth information of the target object is obtained through the depth information sensor, so The video image data is of the target object acquired by an image sensor, and the original depth information represents a collection state of the depth information collected by the depth information sensor or information other than the collected depth information;
对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;Decoding the video image code stream to obtain the original depth information and the video image corresponding to the video image data;
对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。Image processing is performed on the original depth information and the video image to obtain a target video image.
第三方面,本发明实施例提供一种终端设备,包括:In a third aspect, an embodiment of the present invention provides a terminal device, including:
第一获取单元,配置为通过深度信息传感单元获取目标对象的深度信息的情况下,获取所述深度信息对应的原始深度信息,所述原始深度信息表征所述深度信息传感单元采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;The first acquiring unit is configured to acquire original depth information corresponding to the depth information when the depth information of the target object is acquired through the depth information sensing unit, and the original depth information represents that the depth information sensing unit collects the The collection status of depth information or information other than the collected depth information;
第二获取单元,配置为通过图像传感单元获取所述目标对象的视频图像数据;The second acquiring unit is configured to acquire the video image data of the target object through an image sensing unit;
编码单元,配置为对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流;An encoding unit, configured to merge and encode the original depth information and the video image data to obtain a video image code stream;
输出单元,配置为输出所述视频图像码流。The output unit is configured to output the video image code stream.
第四方面,本发明实施例提供一种终端设备,包括:In a fourth aspect, an embodiment of the present invention provides a terminal device, including:
接收单元,配置为接收视频图像码流,所述视频图像码流为对原始深度信息和视频图像数据进行合并编码得到的,所述原始深度信息是通过深度信息传感单元获取目标对象的深度信息的情况下获取的,所述视频图像数据是通过图像传感单元获取的所述目标对象的;所述原始深度信息表征所述深度信息传感单元采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;The receiving unit is configured to receive a video image code stream, the video image code stream is obtained by combining and encoding original depth information and video image data, and the original depth information obtains the depth information of the target object through the depth information sensing unit In the case of acquiring the video image data, the target object is acquired by the image sensing unit; the original depth information represents the acquisition state of the depth information acquired by the depth information sensing unit or the acquisition state of the depth information acquired by the depth information sensing unit Information other than the depth information;
解码单元,配置为对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;A decoding unit configured to decode the video image code stream to obtain the original depth information and the video image corresponding to the video image data;
图像处理单元,配置为对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。The image processing unit is configured to perform image processing on the original depth information and the video image to obtain a target video image.
第五方面,本发明实施例提供一种终端设备,包括处理器和配置为存储能够在处理器上运行的计算机程序的存储器,其中,所述处理器配置为运行所述计算机程序时,执行上述终端设备执行的信息处理方法的步骤。In a fifth aspect, an embodiment of the present invention provides a terminal device, including a processor and a memory configured to store a computer program that can run on the processor, wherein the processor is configured to execute the above-mentioned computer program when the computer program is run. The steps of the information processing method performed by the terminal device.
第六方面,本发明实施例提供一种存储介质,存储有可执行程序,所述可执行程序被处理器执行时,实现上述终端设备执行的信息处理方法。In a sixth aspect, an embodiment of the present invention provides a storage medium that stores an executable program, and when the executable program is executed by a processor, the above-mentioned information processing method executed by the terminal device is implemented.
本发明实施例提供的信息处理方法,包括:在编码端,通过深度信息传感器获取目标对象的深度信息的情况下,获取所述深度信息对应的原始深度信息;通过图像传感器获取所述目标对象的视频图像数据;对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,并输出所述视频图像码流。在解码端,接收视频图像码流;对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。从而将深度传感器获得的原始深度信息,在编码端直接写入视频图像码流,并在解码端进行解析,解析得到原始深度信息对图像传感器采集的图像数据得到的视频图像,得到目标视频图像,提高视频图像的质量,给用户带来更真实的图像视频体验。The information processing method provided by the embodiment of the present invention includes: acquiring the original depth information corresponding to the depth information in the case of acquiring the depth information of the target object through the depth information sensor at the encoding end; acquiring the information of the target object through the image sensor Video image data; merge and encode the original depth information and the video image data to obtain a video image code stream, and output the video image code stream. At the decoding end, the video image code stream is received; the video image code stream is decoded to obtain the original depth information and the video image corresponding to the video image data; the original depth information and the video image are imaged Process to get the target video image. In this way, the original depth information obtained by the depth sensor is directly written into the video image code stream at the encoding end, and analyzed at the decoding end, and the original depth information is parsed to obtain the video image obtained from the image data collected by the image sensor to obtain the target video image. Improve the quality of video images and bring users a more realistic image and video experience.
图1A为本发明实施例信息处理系统的一种可选的结构示意图;FIG. 1A is a schematic diagram of an optional structure of an information processing system according to an embodiment of the present invention;
图1B为本发明实施例编码端的一种可选的结构示意图;FIG. 1B is a schematic diagram of an optional structure of an encoding end according to an embodiment of the present invention;
图1C为本发明实施例解码端的一种可选的结构示意图FIG. 1C is a schematic diagram of an optional structure of the decoding end in an embodiment of the present invention
图2为本发明实施例信息处理方法的一种可选的处理流程示意图;2 is a schematic diagram of an optional processing flow of an information processing method according to an embodiment of the present invention;
图3为本发明实施例信息处理方法的一种可选的处理流程示意图;3 is a schematic diagram of an optional processing flow of an information processing method according to an embodiment of the present invention;
图4为本发明实施例信息处理方法的一种可选的处理流程示意图;4 is a schematic diagram of an optional processing flow of an information processing method according to an embodiment of the present invention;
图5为本发明实施例信息处理方法的一种可选的处理流程示意图;5 is a schematic diagram of an optional processing flow of an information processing method according to an embodiment of the present invention;
图6为本发明实施例信息处理方法的一种可选的处理流程示意图;6 is a schematic diagram of an optional processing flow of an information processing method according to an embodiment of the present invention;
图7为本发明实施例信息处理方法的一种可选的处理流程示意图;FIG. 7 is a schematic diagram of an optional processing flow of an information processing method according to an embodiment of the present invention;
图8A为本发明实施例信息处理系统的一种可选的框架示意图;8A is a schematic diagram of an optional framework of an information processing system according to an embodiment of the present invention;
图8B为本发明实施例信息处理系统的一种可选的框架示意图;8B is a schematic diagram of an optional framework of an information processing system according to an embodiment of the present invention;
图9A为本发明实施例解码端的一种可选的框架示意图;FIG. 9A is a schematic diagram of an optional framework of a decoding end according to an embodiment of the present invention; FIG.
图9B为本发明实施例解码端的一种可选的框架示意图;FIG. 9B is a schematic diagram of an optional framework of the decoding end according to an embodiment of the present invention; FIG.
图9C为本发明实施例解码端的一种可选的框架示意图;FIG. 9C is a schematic diagram of an optional framework of the decoding end according to an embodiment of the present invention; FIG.
图9D为本发明实施例解码端的一种可选的框架示意图;FIG. 9D is a schematic diagram of an optional framework of the decoding end according to an embodiment of the present invention; FIG.
图10为本发明实施例对原始深度信息进行采样的采样示意图;10 is a schematic diagram of sampling for sampling original depth information according to an embodiment of the present invention;
图11为本发明实施的终端设备的一个可选的结构示意图;FIG. 11 is a schematic diagram of an optional structure of a terminal device implemented in the present invention;
图12是本发明实施例终端设备的一个可选的结构示意图;FIG. 12 is a schematic diagram of an optional structure of a terminal device according to an embodiment of the present invention;
图13是本发明实施例提供的电子设备的一个可选的结构示意图。FIG. 13 is a schematic diagram of an optional structure of an electronic device provided by an embodiment of the present invention.
为了能够更加详尽地了解本发明实施例的特点和技术内容,下面结合附图对本发明实施例的实现进行详细阐述,所附附图仅供参考说明之用,并非用来限定本发明实施例。In order to understand the features and technical content of the embodiments of the present invention in more detail, the implementation of the embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The attached drawings are for reference and explanation purposes only, and are not used to limit the embodiments of the present invention.
在对本发明实施例提供的信息处理方法进行详细说明之前,先对深度图像过程进行介绍。Before describing in detail the information processing method provided by the embodiment of the present invention, the depth image process will be introduced first.
深度图像(depth image)也被称为距离影像(range image),是指将从图像传感器到场景中各点的距离(深度)作为像素值的图像,能够直接反映了目标对象可见表面的几何形状。深度图像经过坐标转换可以计算为点云数据,有规则及必要信息的点云数据也可以反算为深度图像数据。Depth image, also called range image, refers to an image that uses the distance (depth) from the image sensor to each point in the scene as the pixel value, which can directly reflect the geometric shape of the visible surface of the target object . The depth image can be calculated as point cloud data after coordinate conversion, and the point cloud data with rules and necessary information can also be inversely calculated as depth image data.
这里,编码端对深度信息传感器捕获形成的深度图像进行视频编码,得到编码后的深度图像信息,解码器端仅能够根据编码后的深度图像信息将深度图像进行恢复。但是,深度信息传感器接收到的信息量远超深度图像的信息量。这些海量的信息在生成深度图像后,作为冗余进行抛弃。因此,在上述方案中,并未考虑到这些冗余信息的其他作用,如解码端的图像增强等。Here, the encoder end performs video encoding on the depth image captured by the depth information sensor to obtain the encoded depth image information, and the decoder end can only restore the depth image according to the encoded depth image information. However, the amount of information received by the depth information sensor far exceeds that of the depth image. These massive amounts of information are discarded as redundancy after the depth image is generated. Therefore, in the above solution, other functions of the redundant information, such as image enhancement at the decoding end, are not considered.
基于上述问题,本发明实施例提供一种信息处理方法,本发明实施例的信息处理方法可以应信息处理系统,Based on the foregoing problems, the embodiments of the present invention provide an information processing method. The information processing method of the embodiments of the present invention can be applied to an information processing system,
示例性的,本发明实施例应用的信息处理系统100,可为如图1A所示。该信息处理100可以包括编码端101和解码端102。编码端101用于对采集视频图像数据和原始深度信息,并对视频图像数据和原始深度信息进行编码,形成视频图像码流。解码端120用于对图像视频码流进行解码,得到视频图像数据和原始深度信息,并对视频图像数据和原始深度信息进行图像处理,得到目标视频图像。Exemplarily, the information processing system 100 applied in the embodiment of the present invention may be as shown in FIG. 1A. The information processing 100 may include an encoding terminal 101 and a decoding terminal 102. The encoding terminal 101 is used to collect video image data and original depth information, and encode the video image data and original depth information to form a video image code stream. The decoding terminal 120 is used to decode the image video code stream to obtain video image data and original depth information, and perform image processing on the video image data and original depth information to obtain a target video image.
编码端101和解码端102可包括包含台式计算机、移动计算装置、笔记本(例如,膝上型)计算机、平板计算机、机顶盒、智能电话等手持机、电视、相机、显示装置、数字媒体播放器、视频游戏控制台、车载计算机,或其类似者。The encoding terminal 101 and the decoding terminal 102 may include desktop computers, mobile computing devices, notebook (e.g., laptop) computers, tablet computers, set-top boxes, smart phones and other handhelds, televisions, cameras, display devices, digital media players, Video game consoles, on-board computers, or the like.
如图1A所示,解码端102可经由链路103接收来自编码端101编码后的视频图像码流。链路103可包括能够将视频图像码流从编码端101移动到解码端102的一个或多个媒体及/或装置。As shown in FIG. 1A, the decoding end 102 can receive the encoded video image stream from the encoding end 101 via the link 103. The link 103 may include one or more media and/or devices capable of moving the video image stream from the encoding end 101 to the decoding end 102.
在一示例中,链路103可包括使编码端101能够实时地将编码后的视频数据直接发送到解码端102的一个或多个通信媒体。在此实例中,编码端101可根据通信标准(例如,无线通信协议)来调制视频图像码流,且可将调制后的视频图像码流发送到解码端102。In an example, the link 103 may include one or more communication media that enable the encoding end 101 to directly send the encoded video data to the decoding end 102 in real time. In this example, the encoding end 101 can modulate the video image code stream according to a communication standard (for example, a wireless communication protocol), and can send the modulated video image code stream to the decoding end 102.
在一示例中,链路103可包含存储有编码端101形成的视频图像码流的存储媒体。在此示例中,解码端102可经由磁盘存取或卡存取来存取存储媒体。存储媒体可包含多种本地存取式数据存储媒体,例如蓝光光盘、DVD、CD-ROM、快闪存储器,或用于存储视频图像码流的其它合适数字存储媒体。In an example, the link 103 may include a storage medium storing a video image code stream formed by the encoding terminal 101. In this example, the decoder 102 can access the storage medium through disk access or card access. The storage medium may include a variety of locally accessible data storage media, such as Blu-ray discs, DVDs, CD-ROMs, flash memory, or other suitable digital storage media for storing video image streams.
在又一示例中,链路103可包含文件服务器或存储由编码端101形成的视频图像码流的另一中间存储装置。在此示例中,解码端102可经由流式传输或下载来存取存储于文件服务器或其它中间存储装置处的视频图像码流。文件服务器可以是能够存储视频图像码流且将视频图像码流发送到解码端102的服务器类型。文件服务器包含web服务器(例如,用于网站)、文件传送协议服务器、网络附加存储装置,及本地磁盘驱动器等。In another example, the link 103 may include a file server or another intermediate storage device that stores the video image stream formed by the encoding terminal 101. In this example, the decoder 102 can access the video image code stream stored in the file server or other intermediate storage device through streaming or downloading. The file server may be a server type capable of storing video image code streams and sending the video image code streams to the decoder 102. File servers include web servers (for example, for websites), file transfer protocol servers, network attached storage devices, and local disk drives.
解码端102可经由标准数据连接(例如,因特网连接)来存取视频图像码流。数据连接的实例类型包含适合于存取存储于文件服务器上的视频图像码流的无线链路(例如,Wi-Fi连接)、有线连接(例如,DSL、缆线调制解调器等),或两者的组合。The decoder 102 can access the video image stream via a standard data connection (for example, an Internet connection). Examples of data connection types include wireless links (for example, Wi-Fi connections), wired connections (for example, DSL, cable modem, etc.) suitable for accessing the video image stream stored on the file server, or both combination.
如图1B所示,编码端101包括:深度信息传感器1011、图像传感器1012和视频 图像编码器1013,深度信息传感器1011用于获取原始深度信息,图像传感器1012用于获取视频图像数据,视频图像编码器1013用于对原始深度信息和视频图像数据进行编码,形成视频图像码流。As shown in FIG. 1B, the encoding terminal 101 includes a depth information sensor 1011, an image sensor 1012, and a video image encoder 1013. The depth information sensor 1011 is used to obtain original depth information, the image sensor 1012 is used to obtain video image data, and the video image encoding The device 1013 is used to encode the original depth information and video image data to form a video image code stream.
如图1C所示,解码端102包括:视频图像解码器1021和图像处理器1022,视频图像解码器1021用于对视频图像码流进行解码,得到原始深度信息和视频图像数据对应的视频图像,图像处理器1022用于对原始深度信息和视频图像进行处理,得到目标视频图像。这里,原始深度信息作用于视频图像,能够得到清晰度高、噪声低等高质量的视频图像。As shown in FIG. 1C, the decoding terminal 102 includes a video image decoder 1021 and an image processor 1022. The video image decoder 1021 is used to decode a video image stream to obtain the original depth information and the video image corresponding to the video image data. The image processor 1022 is used to process the original depth information and the video image to obtain the target video image. Here, the original depth information acts on the video image, and high-quality video images with high definition and low noise can be obtained.
在一示例中,如图1C所示,解码端102还包括:深度图像生成器1023,用于基于原始深度信息生成深度图像。In an example, as shown in FIG. 1C, the decoding end 102 further includes: a depth image generator 1023, configured to generate a depth image based on the original depth information.
本发明实施例提供的信息处理方法的一种可选处理流程,应用于编码端,如图2所示,包括以下步骤:An optional processing procedure of the information processing method provided by the embodiment of the present invention is applied to the encoding end, as shown in FIG. 2, and includes the following steps:
S201,通过深度信息传感器获取目标对象的深度信息的情况下,获取所述深度信息对应的原始深度信息。S201: In a case where the depth information of the target object is acquired by the depth information sensor, the original depth information corresponding to the depth information is acquired.
所述原始深度信息表征所述深度信息传感器采集所述深度信息的采集状态或采集到的所述深度信息以外的信息。The original depth information represents a collection state of the depth information collected by the depth information sensor or information other than the collected depth information.
深度信息传感器为能够采集目标对象的深度信息的传感器。在一示例中,深度信息传感器为采用TOF测距方法的TOF模组。在一示例中,深度信息传感器为双目摄像头。The depth information sensor is a sensor that can collect the depth information of the target object. In an example, the depth information sensor is a TOF module that uses a TOF ranging method. In an example, the depth information sensor is a binocular camera.
本发明实施例中,编码端在深度信息传感器采集深度信息的情况下,通过深度信息传感器获取原始深度信息,所述原始深度信息包括以下至少之一:电荷信息、相位信息和所述深度信息传感器的属性参数。其中,电荷信息、相位信息为深度信息传感器采集到的所述深度信息以外的信息,深度信息传感器的属性参数表征所述深度信息传感器的采集所述深度信息的采集状态。In the embodiment of the present invention, when the depth information sensor collects depth information, the encoder terminal obtains original depth information through the depth information sensor, and the original depth information includes at least one of the following: charge information, phase information, and the depth information sensor The attribute parameters. Wherein, the charge information and the phase information are information other than the depth information collected by the depth information sensor, and the attribute parameters of the depth information sensor represent the depth information collection state of the depth information sensor.
以原始深度信息为电荷信息为例,一个时间点的电荷信息可体现为一幅电荷图像。这里,获取深度信息传感器采集深度信息时接收的光信号,并将光信号通过光电转换转换为电信号,电信号经过量化后生成电荷图像。Taking the original depth information as charge information as an example, the charge information at a time point can be embodied as a charge image. Here, the optical signal received when the depth information sensor collects the depth information is acquired, and the optical signal is converted into an electrical signal through photoelectric conversion, and the electrical signal is quantized to generate a charge image.
以原始深度信息为相位信息为例,一个时间点的相位信息可体现为一幅相位图像。Taking the original depth information as phase information as an example, the phase information at a time point can be embodied as a phase image.
以原始深度信息为深度信息传感器的属性参数为例,原始深度信息可包括:温度、位姿等属性参数。Taking the original depth information as the attribute parameters of the depth information sensor as an example, the original depth information may include: temperature, pose and other attribute parameters.
S202,通过图像传感器获取所述目标对象的视频图像数据。S202: Obtain video image data of the target object through an image sensor.
编码端在图像预览或视频拍摄场景下通过图像传感器获取目标对象的视频图像数据,这里,视频图像数据包括至少一帧图像帧。The encoding terminal obtains the video image data of the target object through the image sensor in the image preview or video shooting scene, where the video image data includes at least one image frame.
本发明实施例中,原始深度信息与视频帧一一对应。在一示例中,不同的电荷图像或相位图像分别对应不同的图像帧。In the embodiment of the present invention, the original depth information corresponds to the video frame one-to-one. In an example, different charge images or phase images correspond to different image frames.
S203,对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,并输出所述视频图像码流。S203: Perform merge encoding on the original depth information and the video image data to obtain a video image code stream, and output the video image code stream.
编码端通过视频图像编码器对原始深度信息和视频图像数据进行合并编码,视频图像编码器输出视频图像码流,并将视频图像编码器输出的视频图像码流输出至解码端,使得解码端基于原始深度信息对视频图像数据对应的视频图像进行图像处理。The encoding end uses the video image encoder to merge and encode the original depth information and the video image data. The video image encoder outputs the video image code stream, and outputs the video image code stream output by the video image encoder to the decoding end, making the decoding end based on The original depth information performs image processing on the video image corresponding to the video image data.
可选地,视频图像编码器采用视频图像编解码协议,对视频图像帧或原始深度信息进行编码,得到视频图像码流信息;视频编解码协议可以为H.264、H.265、H.266、VP9或AV1等。Optionally, the video image encoder adopts a video image encoding and decoding protocol to encode video image frames or original depth information to obtain video image stream information; the video encoding and decoding protocol can be H.264, H.265, H.266 , VP9 or AV1, etc.
可选地,,采用视频图像编解码协议对原始深度信息和图像视频数据进行编码,此时,视频图像信息携带的数据不包括深度信息。Optionally, the original depth information and the image video data are encoded using a video image coding and decoding protocol. At this time, the data carried by the video image information does not include the depth information.
在本发明实施例中,在视频图像信息携带的数据不包括深度信息的情况下,编码端可仅获取深度信息传感器在获取深度信息时的原始深度信息,不获取深度信息传感器所采集到的深度信息,或将采集的深度信息丢弃。In the embodiment of the present invention, when the data carried by the video image information does not include depth information, the encoder can only obtain the original depth information of the depth information sensor when the depth information is obtained, and not the depth collected by the depth information sensor. Information, or discard the collected depth information.
可选地,采用视频图像编解码协议对原始深度信息和图像视频数据进行编码,并采用视频图像编解码协议对深度信息传感器采集的深度信息进行编码,此时,视频图像信息携带的数据包括:原始深度信息、深度信息和视频图像数据。Optionally, the original depth information and image video data are encoded using the video image encoding and decoding protocol, and the depth information collected by the depth information sensor is encoded using the video image encoding and decoding protocol. At this time, the data carried by the video image information includes: Original depth information, depth information and video image data.
本发明实施例中,对深度信息传感器所采集的深度信息的处理不进行任何限定。In the embodiment of the present invention, the processing of the depth information collected by the depth information sensor is not limited in any way.
可选地,视频图像编码器采用行业标准或特定组织的特定标准,对视频图像帧或原始深度信息进行编码,得到视频图像码流。Optionally, the video image encoder adopts an industry standard or a specific standard of a specific organization to encode video image frames or original depth information to obtain a video image code stream.
编码端可将全部的原始深度信息输入视频图像编码器,以对全部的原始深度信息进行编码,也可仅将部分原始深度信息输入视频图像编码器,以对部分原始深度信息编码。可选地,部分原始深度信息为指定图像帧对应的原始深度信息。可选地,部分原始深度信息为指定图像位置对应的原始深度信息。The encoding end may input all the original depth information into the video image encoder to encode all the original depth information, or only input part of the original depth information into the video image encoder to encode part of the original depth information. Optionally, part of the original depth information is original depth information corresponding to the specified image frame. Optionally, part of the original depth information is the original depth information corresponding to the specified image position.
以部分原始深度信息为指定图像视频对应的原始深度信息为例,所述对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,包括:对所述视频图像数据对应的图像帧中指定图像帧对应的原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流。Taking part of the original depth information as the original depth information corresponding to the specified image video as an example, the combining and encoding the original depth information and the video image data to obtain a video image code stream includes: corresponding to the video image data The original depth information corresponding to the designated image frame and the video image data in the image frames are combined and encoded to obtain a video image code stream.
可选地,指定图像帧为视频图像数据对应的图像帧中的一个图像帧。可选地,指定图像帧包括视频图像数据对应的图像帧中的多个图像帧。Optionally, the designated image frame is one of the image frames corresponding to the video image data. Optionally, the designated image frame includes a plurality of image frames among the image frames corresponding to the video image data.
本发明实施例对指定图像帧的数量不进行任何限制。The embodiment of the present invention does not impose any restriction on the number of designated image frames.
编码端仅对指定图像帧对应的原始深度信息和所述视频图像数据进行合并编码,对视频图像数据对应的图像帧中指定图像帧以外非指定视频帧对应的原始深度信息不进行编码。The encoding end only merges and encodes the original depth information corresponding to the designated image frame and the video image data, and does not encode the original depth information corresponding to non-designated video frames other than the designated image frame in the image frame corresponding to the video image data.
以部分原始深度信息为指定图像位置对应的原始深度信息为例,所述对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,包括:对指定图像位置对应的原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流。Taking part of the original depth information as the original depth information corresponding to the specified image position as an example, the combining and encoding the original depth information and the video image data to obtain the video image code stream includes: the original depth information corresponding to the specified image position The depth information and the video image data are combined and encoded to obtain a video image code stream.
指定图像位置为图像采集范围内指定点所在的位置。可选地,指定图像位置为图像采集范围内指定区域所在的位置。本发明实施例对指定图像位置的范围大小或所在的位置不进行任何限定。The designated image position is the position of the designated point in the image acquisition range. Optionally, the designated image position is the position of the designated area within the image acquisition range. The embodiment of the present invention does not limit the size of the range or the location of the designated image position in any way.
编码端仅对指定图像位置对应的原始深度信息和所述视频图像数据进行合并编码,对图像帧中指定图像位置以外非指定视频位置对应的原始深度信息不进行编码。The encoding end only merges and encodes the original depth information corresponding to the designated image position and the video image data, and does not encode the original depth information corresponding to the non-designated video position other than the designated image position in the image frame.
本发明实施例中,编码端对所述原始深度信息和所述视频图像数据进行合并编码的编码方式包括以下之一:In the embodiment of the present invention, the encoding method for the encoding end to merge and encode the original depth information and the video image data includes one of the following:
编码方式一、根据所述原始深度信息和所述视频图像数据的相关性,对所述原始深度信息和所述视频图像数据进行混合编码,得到视频图像码流;Encoding method 1: Perform mixed encoding on the original depth information and the video image data according to the correlation between the original depth information and the video image data to obtain a video image code stream;
编码方式二、分别对所述原始深度信息和所述视频图像数据进行独立编码,得到包括第一码流和第二码流的图像视频码流,其中,所述第一码流为所述原始深度信息编码后得到的码流,所述第二码流为所述图像视频数据编码后得到的码流。Encoding mode two, separately encoding the original depth information and the video image data to obtain an image video code stream including a first code stream and a second code stream, wherein the first code stream is the original A code stream obtained after encoding the depth information, and the second code stream is a code stream obtained after encoding the image video data.
在编码方式一中,对原始深度信息和视频图像数据进行编码所采用编解码协议相同。In the first encoding method, the encoding and decoding protocols used for encoding the original depth information and the video image data are the same.
可选地,在编码方式一中,视频图像码流中的编码信息为对原始深度信和视频图像数据进行联合编码得到的混合编码信息。其中,视频图像编码器可利用原始深度信 息和图像视频数据的空间相关性或时间相关性等,对原始深度信和视频图像数据进行联合编码。Optionally, in the first encoding method, the encoding information in the video image bitstream is mixed encoding information obtained by jointly encoding the original depth information and the video image data. Among them, the video image encoder can use the spatial correlation or temporal correlation between the original depth information and the image and video data to jointly encode the original depth information and the video image data.
可选地,在编码方式一中,将所述原始深度信息对应的第一编码信息写入所述视频图像数据对应的第二编码信息的指定位置处。可选地,指定位置可以为图像信息头、序列信息头、附加参数集或其他任意位置。Optionally, in the first encoding method, the first encoding information corresponding to the original depth information is written in a designated position of the second encoding information corresponding to the video image data. Optionally, the designated position may be an image information header, a sequence information header, an additional parameter set, or any other position.
可选地,在编码方式一中,利用原始深度信息和图像视频数据的空间相关性或时间相关性等,对原始深度信息进行编码,得到第一编码信息,并对视频图像数据进行编码,得到第二编码信息,并将第一编码信息写入第一编码信息的指定位置处,得到视频图像码流。Optionally, in the first encoding method, the original depth information and the spatial correlation or temporal correlation between the image and video data are used to encode the original depth information to obtain the first encoded information, and the video image data is encoded to obtain Second encoding information, and writing the first encoding information into the designated position of the first encoding information to obtain a video image code stream.
在编码方式二中,对原始深度信息所采用编解码协议与对视频图像数据进行编码所采用编解码协议独立。可选地,对原始深度信息所采用编解码协议与对视频图像数据进行编码所采用编解码协议相同。可选地,对原始深度信息所采用编解码协议与对视频图像数据进行编码所采用编解码协议不同。In the second encoding method, the encoding and decoding protocol used for the original depth information is independent of the encoding and decoding protocol used for encoding the video image data. Optionally, the encoding and decoding protocol used for the original depth information is the same as the encoding and decoding protocol used for encoding the video image data. Optionally, the encoding and decoding protocol used for the original depth information is different from the encoding and decoding protocol used for encoding the video image data.
在一实施例中,如图3所示,在S203之前,包括:In an embodiment, as shown in FIG. 3, before S203, the method includes:
204A,对所述原始深度信息进行预处理。204A, preprocessing the original depth information.
S203中对原始深度信息和视频图像数据进行合并编码,可执行为S203A:对经过预处理的原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流。In S203, the original depth information and the video image data are combined and coded, which can be executed as S203A: the preprocessed original depth information and the video image data are combined and coded to obtain a video image code stream.
本发明是实施例中,预处理可以为滤波、去噪、信号放大等相位校准等方式中的一种或两种处理方式,还可以为其他处理方式,具体的预处理可根据实际情况确定,本发明实施例对此不做限定。In the embodiment of the present invention, the preprocessing can be one or two of phase calibration methods such as filtering, denoising, signal amplification, etc., or other processing methods. The specific preprocessing can be determined according to actual conditions. The embodiment of the present invention does not limit this.
可选地,编码端通过深度信息传感器对原始深度信息进行预处理。Optionally, the encoding end preprocesses the original depth information through the depth information sensor.
在一实施例中,如图4所示,在S203之前,包括:In an embodiment, as shown in FIG. 4, before S203, the method includes:
204B,对所述原始深度信息进行冗余消除处理,以消除所述原始深度信息中的冗余信息。204B. Perform redundancy elimination processing on the original depth information to eliminate redundant information in the original depth information.
S203中对原始深度信息和视频图像数据进行合并编码,可执行为S203B:对经过冗余消除处理的原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流。In S203, the original depth information and the video image data are combined and coded, which can be executed as S203B: the original depth information and the video image data that have undergone redundancy elimination processing are combined and coded to obtain a video image code stream.
编码端通过对原始深度信息进行冗余消除处理,能够消除原始深度信息中的冗余信息,从而压缩原始深度信息的信息量,使得减小视频数据码流的大小。The encoding end can eliminate redundant information in the original depth information by performing redundancy elimination processing on the original depth information, thereby compressing the information amount of the original depth information, and reducing the size of the video data stream.
本发明实施例中,所述根据对所述原始深度信息进行冗余消除处理,包括以下至少之一:In the embodiment of the present invention, the performing redundancy elimination processing on the original depth information according to includes at least one of the following:
基于相位相关性对所述原始深度信息冗余消除处理;Eliminating the redundancy of the original depth information based on phase correlation;
基于空间相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on spatial correlation;
基于时间相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on time correlation;
基于指定深度对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on the specified depth;
基于频域相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on frequency domain correlation;
基于编码二进制数据之间的相关性对所述原始深度信息的编码比特进行冗余消除处理。Perform redundancy elimination processing on the coded bits of the original depth information based on the correlation between the coded binary data.
可选地,将原始深度信息转换到频域,基于频域相关性对转换为频域的所述原始深度信息进行冗余消除处理。Optionally, the original depth information is converted into the frequency domain, and the original depth information converted into the frequency domain is subjected to redundancy elimination processing based on the frequency domain correlation.
可选地,指定深度为目标对象所处的场景敏感的深度的范围,基于指定深度对所述原始深度信息进行冗余消除处理,将场景敏感的深度的范围之外的深度对应的原始深度信息作为冗余进行消除。Optionally, the specified depth is a range of the scene-sensitive depth where the target object is located, the original depth information is redundantly eliminated based on the specified depth, and the original depth information corresponding to the depth outside the range of the scene-sensitive depth Eliminate as redundancy.
可选地,对原始深度信息进行熵编码,对基于编码二进制数据之间的相关性对原 始深度信息的熵编码结果的编码比特进行冗余消除处理。Optionally, perform entropy coding on the original depth information, and perform redundancy elimination processing on the coded bits of the entropy coding result of the original depth information based on the correlation between the coded binary data.
以基于空间相关性对所述原始深度信息进行冗余消除处理为例,编码端的原始深度信息对应至少一个视点;从至少一个视点中确定间隔视点,将与间隔视点对应的原始深度信息,作为间隔原始深度信息;将原始深度信息中间隔原始深度以外的原始深度信息作为冗余消除,对间隔原始深度信息和视频图像数据进行合并编码,得到视频数据码流。Taking the redundant elimination processing of the original depth information based on spatial correlation as an example, the original depth information on the encoding end corresponds to at least one viewpoint; the interval viewpoint is determined from the at least one viewpoint, and the original depth information corresponding to the interval viewpoint is used as the interval Original depth information: The original depth information other than the original depth of the interval in the original depth information is used as redundancy to eliminate, and the original depth information of the interval and the video image data are combined and encoded to obtain the video data stream.
以基于时间相关性对所述原始深度信息进行冗余消除处理为例,编码端获取到一端时间内的原始深度信息,并基于采样间隔对获取到的原始深度信息进行采样,并保留采样后的原始深度信息,将获取的原始深度信息中采样后的原始深度信息以外的原始深度信息作为冗余消除,对采样得到的原始深度信息和视频图像数据进行合并编码,得到视频数据码流。Taking the redundancy elimination processing of the original depth information based on time correlation as an example, the encoding end obtains the original depth information within a certain period of time, and samples the obtained original depth information based on the sampling interval, and retains the sampled original depth information. For the original depth information, the original depth information other than the sampled original depth information in the obtained original depth information is used as redundancy to be eliminated, and the original depth information obtained by the sampling and the video image data are combined and encoded to obtain a video data stream.
本发明实施例提供的信息处理方法的一种可选处理流程,应用于解码端,如图5所示,包括以下步骤:An optional processing flow of the information processing method provided by the embodiment of the present invention, which is applied to the decoding end, as shown in FIG. 5, includes the following steps:
S501,接收视频图像码流。S501: Receive a video image code stream.
解码端通过链路接收编码端发送的视频图像码流。所述视频图像码流为对原始深度信息和视频图像数据进行合并编码得到的,所述原始深度信息是通过深度信息传感器获取目标对象的深度信息的情况下获取的,所述视频图像数据是通过图像传感器获取的所述目标对象的;所述原始深度信息表征所述深度信息传感器采集所述深度信息的采集状态或采集到的所述深度信息以外的信息。The decoding end receives the video image code stream sent by the encoding end through the link. The video image code stream is obtained by combining and encoding original depth information and video image data, the original depth information is obtained when the depth information of the target object is obtained by the depth information sensor, and the video image data is obtained by Of the target object acquired by the image sensor; the original depth information represents a collection state of the depth information collected by the depth information sensor or information other than the collected depth information.
S502,对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像。S502: Decode the video image code stream to obtain a video image corresponding to the original depth information and the video image data.
这里,通过视频图像解码器对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像。Here, the video image code stream is decoded by a video image decoder to obtain the original depth information and the video image corresponding to the video image data.
解码端将接收的视频图像码流发送至视频图像解码器,视频图像解码器对视频图像码流进行解码。The decoder sends the received video image code stream to the video image decoder, and the video image decoder decodes the video image code stream.
可选地,视频图像解码器和编码端的视频编码器所支持的视频图像编解码协议相同。Optionally, the video image decoder and the video encoder on the encoding end support the same video image codec protocol.
可选地,视频图像编码器对原始深度信息和视频图像数据进行混合编码的情况下,Optionally, when the video image encoder performs mixed encoding on the original depth information and the video image data,
视频图像解码器对视频图像码流进行混合解码,得到原始深度信息和所述视频图像数据对应的视频图像。The video image decoder performs hybrid decoding on the video image code stream to obtain the original depth information and the video image corresponding to the video image data.
可选地,视频图像编码器对原始深度信息和视频图像数据独立进行编码的情况下,视频图像解码器对视频图像码流中的第一码流和第二码流进行独立解码,对视频图像数据的第一码流进行解码,得到原始深度信息,并对第二码流进行解码,得到所述视频图像数据对应的视频图像。这里,视频图像数据对应的视频图像也可称为原始视频图像。图像视频码流解码得到的原始视频图像可包括一帧或多帧原始视频图像。Optionally, in the case that the video image encoder encodes the original depth information and the video image data independently, the video image decoder independently decodes the first code stream and the second code stream in the video image code stream, and the video image The first code stream of the data is decoded to obtain the original depth information, and the second code stream is decoded to obtain the video image corresponding to the video image data. Here, the video image corresponding to the video image data may also be referred to as the original video image. The original video image obtained by decoding the image video code stream may include one or more frames of original video image.
S503,对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。S503: Perform image processing on the original depth information and the video image to obtain a target video image.
通过图像处理器对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。Image processing is performed on the original depth information and the video image by an image processor to obtain a target video image.
解码端解码得到原始深度信息和视频图像数据后,通过图像处理器,将原始深度信息作用于视频图像,对视频图像进行图像处理,得到目标视频图像。目标视频图像的图像质量高于原始视频图像。After the decoder obtains the original depth information and video image data through decoding, the image processor applies the original depth information to the video image, performs image processing on the video image, and obtains the target video image. The image quality of the target video image is higher than that of the original video image.
可选地,解码端可基于相位相关性、空间相关性、时间相关性、指定深度、频域相关性、编码二进制数据之间的相关性对解码得到的原始深度信息进行冗余恢复,得 到冗余恢复后的原始深度信息,基于冗余恢复后的原始深度信息对视频图像进行图像处理,得到目标视频图像。Optionally, the decoding end may perform redundancy recovery on the original depth information obtained by decoding based on phase correlation, spatial correlation, time correlation, specified depth, frequency domain correlation, and correlation between encoded binary data, to obtain redundant information. After restoring the original depth information, image processing is performed on the video image based on the original depth information after the redundant restoration to obtain the target video image.
以基于空间相关性对解码得到的原始深度信息进行冗余恢复,得到冗余恢复后的原始深度信息为例,解码端对视频图像码流进行独立解码或混合解码,得到间隔视点的原始深度信息和至少一个视点的视频图像;对间隔视点的原始深度信息进行差值,得到至少一个视点中除了间隔视点以外的其他视点的原始深度信息;利用间隔视点的原始深度信息、其他视点的原始深度信息,对视频图像进行图像处理,得到目标视频图像。Taking the redundant restoration of the original depth information obtained by decoding based on spatial correlation to obtain the original depth information after redundancy restoration as an example, the decoder performs independent decoding or hybrid decoding on the video image stream to obtain the original depth information of the interval viewpoint And the video image of at least one viewpoint; difference the original depth information of the interval viewpoint to obtain the original depth information of other viewpoints in at least one viewpoint except the interval viewpoint; use the original depth information of the interval viewpoint and the original depth information of other viewpoints , Perform image processing on the video image to obtain the target video image.
以基于时间相关性对解码得到的原始深度信息进行冗余恢复,得到冗余恢复后的原始深度信息为例,解码端对视频图像码流进行独立解码或混合解码,得到采样后的原始深度信息,并基于时间相邻的采样后的原始深度信息恢复相邻的采样后的原始深度信息之间的原始深度信息,利用解码得到的原始深度信息和恢复的原始深度信息,对视频图像进行图像处理,得到目标视频图像。Taking the original depth information obtained by decoding redundancy recovery based on time correlation to obtain the original depth information after redundancy recovery as an example, the decoder performs independent decoding or mixed decoding on the video image stream to obtain the original depth information after sampling , And restore the original depth information between the adjacent sampled original depth information based on the time-adjacent original depth information after sampling, and use the original depth information obtained by decoding and the restored original depth information to perform image processing on the video image , Get the target video image.
可选地,所述视频图像解码器和所述图像处理器相互独立。可选地,所述图像处理器集成在所述视频图像解码器内。Optionally, the video image decoder and the image processor are independent of each other. Optionally, the image processor is integrated in the video image decoder.
在一示例中,以所述原始深度信息为电荷信息为例,所述对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像,包括:根据所述原始深度信息对所述视频图像进行去噪处理或白平衡调节,得到所述目标视频图像。In an example, taking the original depth information as charge information as an example, the performing image processing on the original depth information and the video image to obtain a target video image includes: The video image is subjected to denoising processing or white balance adjustment to obtain the target video image.
在一示例中,以所述原始深度信息为相位信息为例,所述对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像,包括:根据所述原始深度信息对所述视频图像进行去模糊处理,得到所述目标视频图像。In an example, taking the original depth information as phase information as an example, the performing image processing on the original depth information and the video image to obtain a target video image includes: The video image is deblurred to obtain the target video image.
解码端中的图像处理器解析每个相位信息,得到解析结果,利用解析结果,对与之对应的视频帧进行去模糊,得到目标视频图像。The image processor in the decoding end analyzes each phase information to obtain the analysis result, and uses the analysis result to deblur the corresponding video frame to obtain the target video image.
在一示例中,在高动态范围(HDR,High Dynamic Range)视频中,每一帧HDR图像是对1幅长曝光图像和1幅短曝光图像进行融合得到的,在当前时刻,针对同一个场景,控制图像传感器拍摄长曝光图像和短曝光图像,并控制深度信息传感器拍摄相位图像,将相位图像作为原始深度信息;对相位图像和长曝光图像进行混合编码或独立编码,对相位图像和短曝光图像进行混合编码或独立编码,得到视频图像码流;并将视频图像码流输出至解码端;解码端解从视频图像码流中解码出长曝光图像、短曝光图像和相位图像;再利用相位图像,对长曝光图像和短曝光图像分别进行去模糊,得到去模糊后的长曝光图像和去模糊后的短曝光图像;对去模糊后的长曝光图像和去模糊后的短曝光图像进行融合,得到一帧更为清晰的HDR图像。In an example, in a High Dynamic Range (HDR) video, each frame of HDR image is obtained by fusing a long exposure image and a short exposure image. At the current moment, for the same scene , Control the image sensor to shoot long exposure images and short exposure images, and control the depth information sensor to shoot phase images, using the phase image as the original depth information; perform mixed encoding or independent encoding on the phase image and the long exposure image, and perform the phase image and short exposure The image is mixed or independently encoded to obtain the video image code stream; the video image code stream is output to the decoding end; the decoding end decodes the long-exposure image, short-exposure image and phase image from the video image code stream; and then uses the phase Image, deblur the long-exposure image and short-exposure image respectively to obtain the deblurred long-exposure image and the deblurred short-exposure image; fuse the deblurred long-exposure image and the deblurred short-exposure image , Get a clearer HDR image.
如图6所示,在S502之后,还包括:As shown in Figure 6, after S502, it also includes:
S504,对所述原始深度信息进行恢复,得到深度图像。S504: Restore the original depth information to obtain a depth image.
可选地,通过深度图像生成器对所述原始深度信息进行恢复,得到所述深度图像。Optionally, the original depth information is restored by a depth image generator to obtain the depth image.
需要说明的是,本发明实施例中,在图6中以S504位于S503之后为例对得到目标视频图像和得到深度图像的先后顺序进行示例性说明,在实际应用中,S504和S503的执行不分先后顺序。It should be noted that, in the embodiment of the present invention, S504 is located after S503 in FIG. 6 as an example to illustrate the sequence of obtaining the target video image and obtaining the depth image. In practical applications, the execution of S504 and S503 is not In order of priority.
可选地,深度图像生成器与视频图像解码器相互独立。可选地,深度图像生成器集成在视频图像解码器内。Optionally, the depth image generator and the video image decoder are independent of each other. Optionally, the depth image generator is integrated in the video image decoder.
在一示例中,视频图像解码器、深度图像生成器和图像处理器相互独立,此时,将视频图像码流输入视频图像解码器,视频图像解码器输出原始深度信息和视频图像,并将原始深度信息和视频图像输入图像处理器,将原始深度信息输入深度图像生成器,图像处理器输出目标视频图像,深度图像生成器输出深度图像。In an example, the video image decoder, the depth image generator, and the image processor are independent of each other. At this time, the video image code stream is input to the video image decoder, and the video image decoder outputs the original depth information and the video image, and the original The depth information and the video image are input to the image processor, the original depth information is input to the depth image generator, the image processor outputs the target video image, and the depth image generator outputs the depth image.
在一示例中,深度图像生成器和图像处理器集成在视频图像解码器中,此时,将 视频图像码流输入视频图像解码器,视频图像解码器输出目标视频图像和深度图像。In an example, the depth image generator and the image processor are integrated in the video image decoder. At this time, the video image code stream is input to the video image decoder, and the video image decoder outputs the target video image and the depth image.
在一示例中,深度图像生成器集成在视频图像解码器内,图像处理器与视频图像解码器相互独立,此时,将视频图像码流输入视频图像解码器,视频图像解码器输出原始深度信息和目标视频图像,并将原始深度信息输入深度图像生成器,深度图像生成器输出深度图像。In an example, the depth image generator is integrated in the video image decoder, and the image processor and the video image decoder are independent of each other. At this time, the video image code stream is input to the video image decoder, and the video image decoder outputs the original depth information And the target video image, and the original depth information is input to the depth image generator, and the depth image generator outputs the depth image.
在一示例中,图像处理器集成在视频图像解码器内,深度图像生成器与视频图像解码器相互独立,此时,将视频图像码流输入视频图像解码器,视频图像解码器输出原始深度信息、视频图像以及深度图像,并将原始深度信息和视频图像输入图像处理器,图像处理器输出目标视频图像。In one example, the image processor is integrated in the video image decoder, and the depth image generator and the video image decoder are independent of each other. At this time, the video image code stream is input to the video image decoder, and the video image decoder outputs the original depth information , Video image and depth image, and input the original depth information and video image to the image processor, and the image processor outputs the target video image.
本发明实施例还提供一种信息处理方法,应用于包括编码端和解码端的信息处理系统,如图7所示,包括:The embodiment of the present invention also provides an information processing method, which is applied to an information processing system including an encoding end and a decoding end, as shown in FIG. 7, including:
S701,编码端在深度信息传感器采集目标对象的深度信息的情况下,获取深度信息对应的原始深度信息。S701: The encoding end acquires original depth information corresponding to the depth information when the depth information sensor collects the depth information of the target object.
所述原始深度信息表征所述深度信息传感器采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;The original depth information represents a collection state of the depth information collected by the depth information sensor or information other than the collected depth information;
S702,编码端通过图像传感器获取目标对象的视频图像数据;S702: The encoding end obtains the video image data of the target object through the image sensor;
S703,编码端对原始深度信息和视频图像数据进行合并编码,得到视频图像码流,并输出视频图像码流。S703: The encoding terminal merges and encodes the original depth information and the video image data to obtain a video image code stream, and output the video image code stream.
S704,解码端接收视频图像码流。S704: The decoding end receives the video image code stream.
S705,解码端对视频图像码流进行解码,得到原始深度信息和视频图像数据对应的视频图像。S705: The decoding end decodes the video image code stream to obtain a video image corresponding to the original depth information and the video image data.
S706,解码端对原始深度信息和视频图像进行图像处理,得到目标视频图像。S706: The decoding end performs image processing on the original depth information and the video image to obtain a target video image.
本发明实施例中,解码端接收包括原始深度信息的编码信息和图像视频信息的编码信息的视频图像码流,如此,解码端可以从视频图像码流中解码出原始深度信息和视频图像,进而,解码端不仅可以利用原始深度信息,恢复得到深度图像,还可以利用原始深度信息,对视频图像进行去噪、白平衡调整和去模糊等优化处理,提高了信息利用率,并且优化处理后得到的目标视频图像相较于原始视频图像,图像质量更高。In the embodiment of the present invention, the decoding end receives a video image code stream including the encoding information of the original depth information and the encoding information of the image and video information. In this way, the decoding end can decode the original depth information and the video image from the video image code stream, and then , The decoding end can not only use the original depth information to recover the depth image, but also use the original depth information to perform optimization processing such as denoising, white balance adjustment and deblurring on the video image, which improves the information utilization rate and obtains the result after optimization. Compared with the original video image, the target video image has higher image quality.
下面,通过场景的示例对本发明实施例提供的信息处理方法进行举例说明。In the following, the information processing method provided by the embodiment of the present invention will be illustrated by using an example of a scenario.
本发明的信息系统的框架如图8A和图8B所示。视频图像编码器1013对深度信息传感器1011获取到的原始深度信息801和图像传感器1012采集到的视频图像数据802进行合并编码,形成视频图像码流803;视频图像解码器1021获取视频图像码流803后,对视频图像码流803解析,得到原始深度信息804和视频图像805,深度图像生成器1023对原始深度信息804恢复得到深度图像806,图像处理器1022通过原始深度信息804对视频图像解码器1021得到的视频图像805进行处理,得到目标视频图像807。其中,深度图像生成器1023、图像处理器1022和视频图像解码器1021可以分别独立,如图8A所示;也可以将深度图像生成器1023和图像处理器1022作为视频图像解码器1021的一个组成部分,如图8B所示。The framework of the information system of the present invention is shown in Fig. 8A and Fig. 8B. The video image encoder 1013 merges and encodes the original depth information 801 obtained by the depth information sensor 1011 and the video image data 802 collected by the image sensor 1012 to form a video image code stream 803; the video image decoder 1021 obtains a video image code stream 803 Then, the video image code stream 803 is parsed to obtain the original depth information 804 and the video image 805, the depth image generator 1023 restores the original depth information 804 to obtain the depth image 806, and the image processor 1022 decodes the video image through the original depth information 804 The video image 805 obtained by 1021 is processed to obtain the target video image 807. Among them, the depth image generator 1023, the image processor 1022, and the video image decoder 1021 can be independent, as shown in FIG. 8A; the depth image generator 1023 and the image processor 1022 can also be used as a component of the video image decoder 1021 Part, as shown in Figure 8B.
深度信息传感器输出的原始深度信息可以为深度信息传感器所获取得到的最初的数据信息即未经预处理的原始深度信息,也可以为最初的数据信息经预处理后所得到的中间数据信息即经过预处理的原始深度信息;当输出的信息为最初的数据信息,输出的信息可以为电荷信息或相位信息等经过光电转换后的电信号;当输出的信息为中间数据信息,输出的信息可以为对最初的数据信进行相位校准或其他方式等处理后的能够生成深度图像的中间视频图像数据。The original depth information output by the depth information sensor can be the original data information obtained by the depth information sensor, that is, the original depth information without preprocessing, or the intermediate data information obtained after the initial data information is preprocessed. Preprocessed original depth information; when the output information is the original data information, the output information can be electrical signals after photoelectric conversion such as charge information or phase information; when the output information is intermediate data information, the output information can be Intermediate video image data that can generate a depth image after processing the initial data signal by phase calibration or other methods.
视频图像编码器对输入的原始深度信息进行编码形成视频图像码流。其中,编码方式包括:The video image encoder encodes the input original depth information to form a video image code stream. Among them, the encoding methods include:
编码方式1、利用视频图像数据和深度原始信息的相关性,将二者混合进行编码; Encoding method 1. Use the correlation between video image data and depth original information to mix the two for encoding;
编码方式2、分别将视频图像数据和深度原始信息进行独立编码。 Encoding method 2. Independently encode the video image data and the original depth information respectively.
在编码方式1中,原始深度信息的编码信息在视频图像数据的编码信息的信息头、序列信息头、附加参数集或者其他等任意位置。In the coding method 1, the coding information of the original depth information is in the information header, the sequence information header, the additional parameter set, or other arbitrary positions of the coding information of the video image data.
在编码方式2中,利用原始深度信息的空间相关性或时间相关性等其他相关性,对原始深度信息自身进行单独的编码。In the coding method 2, the original depth information itself is separately coded by using other correlations such as the spatial correlation or temporal correlation of the original depth information.
在视频图像编码器中,可以对每一幅视频图像对应的原始深度信息都编码,也可以仅对指定图像或指定图像位置对应的原始深度信息进行编码,其他非指定图像或非指定图像位置的对应原始深度信息不进行编码。In the video image encoder, the original depth information corresponding to each video image can be encoded, or only the original depth information corresponding to the specified image or specified image position can be encoded, and other non-specified images or non-specified image positions can be encoded. Corresponding to the original depth information, no coding is performed.
对于图像处理器,在拍照或预览场景中,对于景深的产生,可以直接利用原始深度信息作用于视频图像,形成具有景深的目标视频图像,而不用将深度图像和视频图像叠加处理产生具有景深的目标视频图像。For the image processor, when taking pictures or previewing scenes, for the generation of depth of field, the original depth information can be directly used on the video image to form a target video image with depth of field, instead of superimposing the depth image and the video image to produce a depth of field Target video image.
编码端对原始深度信息进行编码的过程中,为了压缩数据量,可以利用且不限于如下相关性消除冗余:In the process of encoding the original depth information at the encoding end, in order to compress the amount of data, the following correlations can be used, but not limited to, to eliminate redundancy:
1、若原始深度信息包括多个视频图像的相位信息,利用相位之间的相关性消除相位数据冗余;若原始深度信息为其他数据,利用这些数据间的空间相关性等其相关性消除数据冗余;1. If the original depth information includes the phase information of multiple video images, use the correlation between the phases to eliminate phase data redundancy; if the original depth information is other data, use the spatial correlation between these data and other correlations to eliminate the data redundancy;
2、利用原始深度信息的时间相关性消除数据冗余;2. Use the time correlation of the original depth information to eliminate data redundancy;
3、利用指定深度消除基于场景的数据冗余;3. Use the specified depth to eliminate scene-based data redundancy;
4、将原始深度信息转化为频域,利用频域相关性消除频域的数据冗余;4. Convert the original depth information into the frequency domain, and use frequency domain correlation to eliminate data redundancy in the frequency domain;
5、利用编码二进制数据之间的相关性,消除编码的比特冗余;其中,这里的编码可为熵编码。5. Use the correlation between the encoded binary data to eliminate the bit redundancy of the encoding; among them, the encoding here can be entropy encoding.
本发明实施例中,视频图像编码器形成的包含原始深度信息的视频图像码流中,原始深度信息和视频图像数据可独立解码,即视频图像码流具有可解耦性或独立性,使得采用各种视频图像标准编解码协议的视频图像解码器可以从该视频图像码流中仅提取视频图像,而不提取原始深度信息,也可以仅提取原始深度信息,而不提取视频图像。In the embodiment of the present invention, in the video image code stream containing the original depth information formed by the video image encoder, the original depth information and the video image data can be decoded independently, that is, the video image code stream has decoupling or independence, so that the use of Video image decoders of various video image standard encoding and decoding protocols can extract only video images from the video image stream without extracting original depth information, or only extract original depth information without extracting video images.
如图9A至9D所示,对于视频图像解码器、深度图像生成器和图像处理器,三者相互配合将视频图像码流按照视频图像标准编解码协议进行解码,并产生处理后的图像和原始深度信息;该视频图像标准编解码协议可以为厂商定制的私有标准,也可以为行业标准。视频图像解码器、深度图像生成器和图像处理器这三者的组成方式包括:As shown in Figures 9A to 9D, for the video image decoder, depth image generator and image processor, the three cooperate with each other to decode the video image code stream in accordance with the video image standard codec protocol, and generate processed images and original In-depth information; the video image standard encoding and decoding protocol can be a private standard customized by the manufacturer or an industry standard. The three components of video image decoder, depth image generator and image processor include:
组成方式1、如图9A所示,视频图像解码器1021、深度图像生成器1023和图像处理器1022这三者相互独立,视频图像解码器1021解析视频图像码流803得到视频图像805和原始深度信息804后,将原始深度信息804送入深度图像生成器1023产生深度图像806,并将视频图像805和原始深度信息804送入图像处理器1022产生处理后的目标视频图像807; Composition 1. As shown in Figure 9A, the video image decoder 1021, the depth image generator 1023 and the image processor 1022 are independent of each other. The video image decoder 1021 parses the video image code stream 803 to obtain the video image 805 and the original depth After the information 804, the original depth information 804 is sent to the depth image generator 1023 to generate the depth image 806, and the video image 805 and the original depth information 804 are sent to the image processor 1022 to generate the processed target video image 807;
组成方式2、如图9B所示,深度图像生成器1023和图像处理器1022嵌入视频图像解码器1021内部,在视频图像解码器1021内部对视频图像码流803进行处理,直接输出深度图像806和处理后的目标视频图像807。 Composition 2. As shown in Figure 9B, the depth image generator 1023 and the image processor 1022 are embedded in the video image decoder 1021, and the video image code stream 803 is processed inside the video image decoder 1021 to directly output the depth image 806 and The processed target video image 807.
组成方式3、如图9C所示,深度图像生成器1023嵌入视频图像解码器1021内部,先在视频图像解码器1021内部对视频图像码流803进行处理,输出深度图像806和视频图像805,再将视频图像805和原始深度信息804送入图像处理器1022,输出处理后的目标视频图像807; Composition mode 3. As shown in Figure 9C, the depth image generator 1023 is embedded in the video image decoder 1021, and the video image code stream 803 is processed inside the video image decoder 1021, and then the depth image 806 and the video image 805 are output. Send the video image 805 and the original depth information 804 to the image processor 1022, and output the processed target video image 807;
组成方式4、如图9D所示,图像处理器1022嵌入视频图像解码器1021内部,先在视频图像解码器1021内部对图像视频码流803进行处理,输出原始深度信息804和处理后的目标视频图像807,再将原始深度信息804送入深度图像生成器1023,输出深度图像806。 Composition 4, as shown in Figure 9D, the image processor 1022 is embedded in the video image decoder 1021, and the image video code stream 803 is first processed inside the video image decoder 1021, and the original depth information 804 and the processed target video are output Image 807, the original depth information 804 is sent to the depth image generator 1023, and the depth image 806 is output.
本发明实施例提供的信息处理方法,在编码端,将通过深度信息传感器获取得到的原始深度信息,对原始深度信息进行视频图像编码,形成视频图像码流进行传输;在解码端,通过视频图像码流不仅能够恢复得到深度图像,而且可以通过解析得到的原始深度信息,对原始视频图像进行处理,得到图像质量更高的目标视频图像。In the information processing method provided by the embodiment of the present invention, at the encoding end, the original depth information obtained by the depth information sensor is encoded, and the original depth information is encoded to form a video image code stream for transmission; at the decoding end, the video image is transmitted through the video image The code stream can not only recover the depth image, but also process the original video image by analyzing the original depth information to obtain a target video image with higher image quality.
在一示例中,原始深度信息为相位信息,可通过不同时间点采样得到的多幅相位图像恢复得到深度图像,且当因为运动造成原始视频图像模糊时,由于多幅相位图像可携带不同时间点的更多信息,可以基于相位信息通过运动估计将模糊的原始视频图像进行恢复,以得到更为清晰的目标视频图像。In one example, the original depth information is phase information. The depth image can be recovered from multiple phase images sampled at different time points. When the original video image is blurred due to motion, multiple phase images can carry different time points. For more information, the blurred original video image can be restored through motion estimation based on the phase information to obtain a clearer target video image.
在又一示例中,深度信息传感器为TOF架构或模组,原始深度信息为电荷信息,不仅可以生成深度图像,而且可以根据电荷信息来判断拍摄场景的噪声和外部可见光,通过电荷信息进行原始视频图像的去燥和白平衡调节,以得到图像质量更好的视频图像,给用户更美更真实的图像视频体验。In another example, the depth information sensor is a TOF architecture or module, and the original depth information is charge information. Not only can the depth image be generated, but also the noise and external visible light of the shooting scene can be judged based on the charge information, and the original video can be made through the charge information Image desiccation and white balance adjustment to obtain better image quality video images, giving users a more beautiful and realistic image and video experience.
本发明实施例中,原始深度信息的获取方式包括但是不限于以下方式:In the embodiment of the present invention, the method for acquiring the original depth information includes but is not limited to the following methods:
方式一method one
采用连续调制的TOF方法,在两种不同的发射信号频率下,通过控制积分时间,通过TOF传感器采样得到不同相位的共8组光信号,并对这8组光信号进行光电转换,得到8组电荷信号,再将这8组电荷信号进行10比特量化,生成8张原始电荷图像;解码端将这8张原始电荷图像和TOF传感器的温度等属性参数一起作为原始深度信息进行编码;或者对这8张原始电荷图像进行预处理,生成2幅过程深度数据和一幅背景数据,并将这2幅过程深度数据和一幅背景数据作为原始深度信息进行编码。Using the continuous modulation TOF method, under two different transmission signal frequencies, by controlling the integration time, a total of 8 groups of optical signals with different phases are obtained through the TOF sensor sampling, and the 8 groups of optical signals are photoelectrically converted to obtain 8 groups Charge signal, and then perform 10-bit quantization of these 8 groups of charge signals to generate 8 original charge images; the decoding end encodes these 8 original charge images together with the TOF sensor's temperature and other attribute parameters as original depth information; or Eight original charge images are preprocessed to generate two process depth data and one background data, and the two process depth data and one background data are encoded as the original depth information.
方式二Way two
采用双目成像的原理,利用双目摄像头拍摄得到的两幅视频图像,根据两幅视频图像的位姿将计算得到视差等信息,将视差信息和摄像头参数等作为原始深度信息进行编码。Using the principle of binocular imaging, two video images captured by a binocular camera are used to calculate parallax and other information according to the poses of the two video images, and the parallax information and camera parameters are encoded as the original depth information.
本发明实施例中,以编解码协议3维高性能视频编码(3 Dimension High Efficiency Video Coding,3D HEVC)为例,对原始深度信息进行编码时,作为一种可能的实现方式,每个视点以及对应的原始深度信息均编码;作为另一种可能的实现方式,可基于视点对原始深度信息进行间隔编码,即由于在同一时刻的不同视点之间,如相位图或电荷图像等原始深度信息存在很强的相关性,可以利用该相关性减少传输的视频图像码流数据量。在一示例中,对于三个视点的视频编码,在编码端,在视频图像码流中仅需要保留左右两个视点的原始深度数据,在解码端,可以通过对左右两个视点的原始深度信息进行插值处理得到中间视点的原始深度信息。In the embodiment of the present invention, the 3D High Efficiency Video Coding (3D HEVC) of the coding and decoding protocol is taken as an example. When the original depth information is encoded, as a possible implementation method, each view point and The corresponding original depth information is encoded; as another possible implementation, the original depth information can be encoded based on the viewpoint, that is, because the original depth information such as phase map or charge image exists between different viewpoints at the same time Strong correlation, which can be used to reduce the amount of transmitted video image stream data. In one example, for three-view video encoding, at the encoding end, only the original depth data of the left and right viewpoints need to be retained in the video image code stream. On the decoding end, the original depth information of the left and right viewpoints can be obtained. Perform interpolation processing to obtain the original depth information of the intermediate viewpoint.
本发明实施例中,以基于时间相关性对原始深度信息进行冗余消除为例,作为一种可能的实现方式,不需要对所有原始深度信息进行编码,而仅需要采用采样的方式对深度信息传感器采集到的原始深度信息采用固定步长进行采样,并通过视频图像编码器对这些采样信号进行编码;解码端恢复得到这些采样信号后,通过插值等方法恢复得到未被采样的原始深度信息。In the embodiment of the present invention, the redundancy elimination of the original depth information based on time correlation is taken as an example. As a possible implementation method, all the original depth information does not need to be encoded, but only the depth information needs to be sampled. The original depth information collected by the sensor is sampled with a fixed step size, and these sampled signals are encoded by the video image encoder; after the decoder recovers the sampled signals, the original depth information that has not been sampled is restored by interpolation and other methods.
在一示例中,如图10所示,原始深度信息包括:编号分别为信号1、信号2、信号3、信号4……信号N,采用固定步长3对原始深度信息进行采样,得到采样后的原始深度信息包括:信号1、信号4、信号7……信号N,对于采样后的原始深度信息进行编码 并解码,对于解码后的非采样信号,根据其相邻的采样信号进行恢复;如,对信号1和信号4进行插值恢复得到信号2,对信号2和信号4进行插值恢复得到信号3,以此类推。In an example, as shown in Fig. 10, the original depth information includes: the numbers are signal 1, signal 2, signal 3, signal 4...signal N, the original depth information is sampled with a fixed step size 3, and the sampled The original depth information of includes: signal 1, signal 4, signal 7...signal N, the original depth information after sampling is encoded and decoded, and the decoded non-sampled signal is restored based on its adjacent sampled signal; for example, , Interpolate and restore signal 1 and signal 4 to get signal 2, interpolate and restore signal 2 and signal 4 to get signal 3, and so on.
本发明实施例中,在AR场景中,作为一种可能的实现方式,不需要对整幅深度图像对应的原始深度信息进行编码,而仅需要对部分画面进行编码,从而实现指定的局部原始深度信息的编码传输。In the embodiment of the present invention, in the AR scene, as a possible implementation manner, the original depth information corresponding to the entire depth image does not need to be encoded, but only part of the picture needs to be encoded, so as to realize the specified local original depth Coding and transmission of information.
为实现上述信息处理方法,本发明实施例还提供一种终端设备,所述终端设备的组成结构,如图11所示,终端设备1100包括:In order to implement the foregoing information processing method, an embodiment of the present invention further provides a terminal device. The composition structure of the terminal device is as shown in FIG. 11, the terminal device 1100 includes:
第一获取单元1101,配置为通过深度信息传感单元获取目标对象的深度信息的情况下,获取所述深度信息对应的原始深度信息,所述原始深度信息表征所述深度信息传感单元采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;The first acquiring unit 1101 is configured to acquire original depth information corresponding to the depth information in the case of acquiring the depth information of the target object through the depth information sensing unit, and the original depth information represents what the depth information sensing unit collects. The collection status of the depth information or information other than the collected depth information;
第二获取单元1102,配置为通过图像传感单元获取所述目标对象的视频图像数据;The second acquiring unit 1102 is configured to acquire video image data of the target object through an image sensing unit;
编码单元1103,配置为对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流;The encoding unit 1103 is configured to merge and encode the original depth information and the video image data to obtain a video image code stream;
输出单元1104,配置为输出所述视频图像码流。The output unit 1104 is configured to output the video image code stream.
本发明实施例中,编码单元1103,还配置为:In the embodiment of the present invention, the encoding unit 1103 is further configured to:
对所述视频图像数据对应的图像帧中指定图像帧视对应的原始深度信息和所述视频图像数据进行合并编码,得到所述视频图像码流。The original depth information corresponding to the specified image frame in the image frame corresponding to the video image data and the video image data are combined and encoded to obtain the video image code stream.
本发明实施例中,编码单元1103,还配置为:In the embodiment of the present invention, the encoding unit 1103 is further configured to:
对指定图像位置对应的原始深度信息和所述视频图像数据进行合并编码,得到所述视频图像码流。The original depth information corresponding to the designated image position and the video image data are combined and encoded to obtain the video image code stream.
本发明实施例中,编码单元1103,还配置为:In the embodiment of the present invention, the encoding unit 1103 is further configured to:
根据所述原始深度信息和所述视频图像数据的相关性,对所述原始深度信息和所述视频图像数据进行混合编码,得到所述视频图像码流。According to the correlation between the original depth information and the video image data, the original depth information and the video image data are mixed-encoded to obtain the video image code stream.
本发明实施例中,编码单元1103,还配置为:In the embodiment of the present invention, the encoding unit 1103 is further configured to:
对所述原始深度信息的进行编码,得到第一编码信息;Encoding the original depth information to obtain first encoding information;
将所述第一编码信息写入所述视频图像数据的指定位置;Writing the first encoding information into a designated location of the video image data;
对写入所述第一编码信息的视频图像数据进行编码,得到所述视频图像码流。The video image data written in the first encoding information is encoded to obtain the video image code stream.
本发明实施例中,编码单元1103,还配置为:In the embodiment of the present invention, the encoding unit 1103 is further configured to:
对所述原始深度信息的进行编码,得到第一编码信息;Encoding the original depth information to obtain first encoding information;
对视频图像数据进行编码,得到第二编码信息;Encoding the video image data to obtain second encoding information;
将所述第一编码信息和所述第二编码信息进行合并,得到所述视频图像码流。Combining the first coding information and the second coding information to obtain the video image code stream.
本发明实施例中,所述终端设备还包括:In the embodiment of the present invention, the terminal device further includes:
预处理单元,配置为:The preprocessing unit is configured as:
在对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流之前,对所述原始深度信息进行预处理。Before combining and encoding the original depth information and the video image data to obtain a video image code stream, preprocessing the original depth information.
本发明实施例中,所述终端设备还包括:In the embodiment of the present invention, the terminal device further includes:
消除单元,配置为:Elimination unit, configured as:
在对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流之前,对所述原始深度信息进行冗余消除处理,以消除所述原始深度信息中的冗余信息。Before the original depth information and the video image data are combined and encoded to obtain a video image code stream, redundancy elimination processing is performed on the original depth information to eliminate redundant information in the original depth information.
本发明实施例中,所述消除单元,还配置为以下至少之一:In the embodiment of the present invention, the elimination unit is further configured as at least one of the following:
基于相位相关性对所述原始深度信息冗余消除处理;Eliminating the redundancy of the original depth information based on phase correlation;
基于空间相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on spatial correlation;
基于时间相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on time correlation;
基于指定深度对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on the specified depth;
基于频域相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on frequency domain correlation;
基于编码二进制数据之间的相关性对所述原始深度信息的编码比特进行冗余消除处理。Perform redundancy elimination processing on the coded bits of the original depth information based on the correlation between the coded binary data.
本发明实施例中,所述原始深度信息包括以下至少之一:电荷信息、相位信息和所述深度信息传感单元的属性参数。In the embodiment of the present invention, the original depth information includes at least one of the following: charge information, phase information, and attribute parameters of the depth information sensing unit.
本发明实施例还提供一种终端设备,包括处理器和配置为存储能够在处理器上运行的计算机程序的存储器,其中,所述处理器配置为运行所述计算机程序时,执行上述终端设备执行的信息处理方法的步骤。An embodiment of the present invention also provides a terminal device, including a processor and a memory configured to store a computer program that can run on the processor, wherein the processor is configured to execute the above-mentioned terminal device when the computer program is run. The steps of the information processing method.
需要说明的是,本发明实施例中的深度信息传感单元、图像传感单元和视频图像编码单元可分别为深度信息传感器、图像传感器和视频图像编码器。It should be noted that the depth information sensing unit, the image sensing unit, and the video image encoding unit in the embodiment of the present invention may be a depth information sensor, an image sensor, and a video image encoder, respectively.
为实现上述信息处理方法,本发明实施例还提供一种终端设备,所述终端设备的组成结构,如图12所示,终端设备1200包括:In order to implement the foregoing information processing method, an embodiment of the present invention also provides a terminal device. The composition structure of the terminal device is as shown in FIG. 12, the terminal device 1200 includes:
接收单元1201,配置为接收视频图像码流,所述视频图像码流为对原始深度信息和视频图像数据进行合并编码得到的,所述原始深度信息是通过深度信息传感单元获取目标对象的深度信息的情况下获取的,所述视频图像数据是通过图像传感单元获取的所述目标对象的;所述原始深度信息表征所述深度信息传感单元采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;The receiving unit 1201 is configured to receive a video image code stream, the video image code stream is obtained by combining and encoding original depth information and video image data, and the original depth information is obtained by obtaining the depth of the target object through a depth information sensing unit Information, the video image data is the target object acquired by the image sensing unit; the original depth information characterizes the acquisition state of the depth information acquired by the depth information sensing unit or the acquisition status of the depth information acquired by the depth information sensing unit Information other than the said depth information;
解码单元1202,配置为对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;The decoding unit 1202 is configured to decode the video image code stream to obtain the original depth information and the video image corresponding to the video image data;
处理单元1203,配置为对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。The processing unit 1203 is configured to perform image processing on the original depth information and the video image to obtain a target video image.
本发明实施例中,解码单元1202,还配置为通过视频图像解码单元对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;In the embodiment of the present invention, the decoding unit 1202 is further configured to decode the video image code stream through the video image decoding unit to obtain the original depth information and the video image corresponding to the video image data;
处理单元1203,还配置为通过述视频图像解码单元对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。The processing unit 1203 is further configured to perform image processing on the original depth information and the video image through the video image decoding unit to obtain a target video image.
本发明实施例中,所述视频图像解码单元和所述图像处理单元相互独立,或所述图像处理单元集成在所述视频图像解码单元内。In the embodiment of the present invention, the video image decoding unit and the image processing unit are independent of each other, or the image processing unit is integrated in the video image decoding unit.
本发明实施例中,所述原始深度信息包括以下至少之一:电荷信息、相位信息和所述深度信息传感单元的属性参数。In the embodiment of the present invention, the original depth information includes at least one of the following: charge information, phase information, and attribute parameters of the depth information sensing unit.
本发明实施例中,处理单元1203,还配置为:In the embodiment of the present invention, the processing unit 1203 is further configured to:
当所述原始深度信息为电荷信息,根据所述电荷信息对所述视频图像进行去噪处理或白平衡调节,得到所述目标视频图像。When the original depth information is charge information, denoising processing or white balance adjustment is performed on the video image according to the charge information to obtain the target video image.
本发明实施例中,处理单元1203,还配置为:In the embodiment of the present invention, the processing unit 1203 is further configured to:
当所述原始深度信息为相位信息,根据所述相位信息对所述视频图像进行去模糊处理,得到所述目标视频图像。When the original depth information is phase information, the video image is deblurred according to the phase information to obtain the target video image.
本发明实施例中,所述终端设备还包括:In the embodiment of the present invention, the terminal device further includes:
生成单元,配置为对所述原始深度信息进行恢复,得到深度图像。The generating unit is configured to restore the original depth information to obtain a depth image.
本发明实施例中,所述生成单元,还配置为通过深度图像生成单元对所述原始深度信息进行恢复,得到深度图像,得到所述深度图像。In the embodiment of the present invention, the generating unit is further configured to restore the original depth information through the depth image generating unit to obtain a depth image to obtain the depth image.
本发明实施例还提供一种终端设备,包括处理器和配置为存储能够在处理器上运行的计算机程序的存储器,其中,所述处理器配置为运行所述计算机程序时,执行上 述终端设备执行的信息处理方法的步骤。An embodiment of the present invention also provides a terminal device, including a processor and a memory configured to store a computer program that can run on the processor, wherein the processor is configured to execute the above-mentioned terminal device when the computer program is run. The steps of the information processing method.
需要说明的是,本发明实施例中的视频图像解码单元、图像处理单元和深度图像生成单元可分别为视频图像解码器、图像处理器和深度图像生成器。It should be noted that the video image decoding unit, image processing unit, and depth image generating unit in the embodiment of the present invention may be a video image decoder, an image processor, and a depth image generator, respectively.
图13是本发明实施例的电子设备(终端设备)的硬件组成结构示意图,电子设备1300包括:至少一个处理器1301、存储器1302和至少一个网络接口1304。电子设备1300中的各个组件通过总线系统1305耦合在一起。可理解,总线系统1305用于实现这些组件之间的连接通信。总线系统1305除包括数据总线之外,还包括电源总线、控制总线和状态信号总线。但是为了清楚说明起见,在图13中将各种总线都标为总线系统1305。13 is a schematic diagram of the hardware composition structure of an electronic device (terminal device) according to an embodiment of the present invention. The electronic device 1300 includes: at least one processor 1301, a memory 1302, and at least one network interface 1304. The various components in the electronic device 1300 are coupled together through the bus system 1305. It can be understood that the bus system 1305 is used to implement connection and communication between these components. In addition to the data bus, the bus system 1305 also includes a power bus, a control bus, and a status signal bus. However, for the sake of clarity, various buses are marked as the bus system 1305 in FIG. 13.
可以理解,存储器1302可以是易失性存储器或非易失性存储器,也可包括易失性和非易失性存储器两者。其中,非易失性存储器可以是ROM、可编程只读存储器(PROM,Programmable Read-Only Memory)、可擦除可编程只读存储器(EPROM,Erasable Programmable Read-Only Memory)、电可擦除可编程只读存储器(EEPROM,Electrically Erasable Programmable Read-Only Memory)、磁性随机存取存储器(FRAM,ferromagnetic random access memory)、快闪存储器(Flash Memory)、磁表面存储器、光盘、或只读光盘(CD-ROM,Compact Disc Read-Only Memory);磁表面存储器可以是磁盘存储器或磁带存储器。易失性存储器可以是随机存取存储器(RAM,Random Access Memory),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用,例如静态随机存取存储器(SRAM,Static Random Access Memory)、同步静态随机存取存储器(SSRAM,Synchronous Static Random Access Memory)、动态随机存取存储器(DRAM,Dynamic Random Access Memory)、同步动态随机存取存储器(SDRAM,Synchronous Dynamic Random Access Memory)、双倍数据速率同步动态随机存取存储器(DDRSDRAM,Double Data Rate Synchronous Dynamic Random Access Memory)、增强型同步动态随机存取存储器(ESDRAM,Enhanced Synchronous Dynamic Random Access Memory)、同步连接动态随机存取存储器(SLDRAM,SyncLink Dynamic Random Access Memory)、直接内存总线随机存取存储器(DRRAM,Direct Rambus Random Access Memory)。本发明实施例描述的存储器1302旨在包括但不限于这些和任意其它适合类型的存储器。It can be understood that the memory 1302 may be a volatile memory or a non-volatile memory, and may also include both volatile and non-volatile memory. Among them, non-volatile memory can be ROM, Programmable Read-Only Memory (PROM), Erasable Programmable Read-Only Memory (EPROM), and electrically erasable Programmable read-only memory (EEPROM, Electrically Erasable Programmable Read-Only Memory), magnetic random access memory (FRAM, ferromagnetic random access memory), flash memory (Flash Memory), magnetic surface memory, optical disk, or CD-ROM (CD) -ROM, Compact Disc Read-Only Memory); Magnetic surface memory can be disk storage or tape storage. The volatile memory may be a random access memory (RAM, Random Access Memory), which is used as an external cache. By way of exemplary but not restrictive description, many forms of RAM are available, such as static random access memory (SRAM, Static Random Access Memory), synchronous static random access memory (SSRAM, Synchronous Static Random Access Memory), and dynamic random access memory. Memory (DRAM, Dynamic Random Access Memory), Synchronous Dynamic Random Access Memory (SDRAM, Synchronous Dynamic Random Access Memory), Double Data Rate Synchronous Dynamic Random Access Memory (DDRSDRAM, Double Data Rate Synchronous Dynamic Random Access Memory), enhanced Type synchronous dynamic random access memory (ESDRAM, Enhanced Synchronous Dynamic Random Access Memory), synchronous connection dynamic random access memory (SLDRAM, SyncLink Dynamic Random Access Memory), direct memory bus random access memory (DRRAM, Direct Rambus Random Access Memory) ). The memory 1302 described in the embodiment of the present invention is intended to include, but is not limited to, these and any other suitable types of memory.
本发明实施例中的存储器1302用于存储各种类型的数据以支持电子设备1300的操作。这些数据的示例包括:用于在电子设备1300上操作的任何计算机程序,如应用程序13021。实现本发明实施例方法的程序可以包含在应用程序13021中。The memory 1302 in the embodiment of the present invention is used to store various types of data to support the operation of the electronic device 1300. Examples of these data include: any computer program used to operate on the electronic device 1300, such as an application program 13021. The program for implementing the method of the embodiment of the present invention may be included in the application program 13021.
上述本发明实施例揭示的方法可以应用于处理器1301中,或者由处理器1301实现。处理器1301可能是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器1301中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器1301可以是通用处理器、数字信号处理器(DSP,Digital Signal Processor),或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。处理器1301可以实现或者执行本发明实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者任何常规的处理器等。结合本发明实施例所公开的方法的步骤,可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于存储介质中,该存储介质位于存储器1302,处理器1301读取存储器1302中的信息,结合其硬件完成前述方法的步骤。The method disclosed in the foregoing embodiment of the present invention may be applied to the processor 1301 or implemented by the processor 1301. The processor 1301 may be an integrated circuit chip with signal processing capabilities. In the implementation process, the steps of the foregoing method may be completed by an integrated logic circuit of hardware in the processor 1301 or instructions in the form of software. The aforementioned processor 1301 may be a general-purpose processor, a digital signal processor (DSP, Digital Signal Processor), or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, and the like. The processor 1301 may implement or execute various methods, steps, and logical block diagrams disclosed in the embodiments of the present invention. The general-purpose processor may be a microprocessor or any conventional processor or the like. The steps of the method disclosed in the embodiments of the present invention may be directly embodied as being executed and completed by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor. The software module may be located in a storage medium, and the storage medium is located in the memory 1302. The processor 1301 reads the information in the memory 1302 and completes the steps of the foregoing method in combination with its hardware.
在示例性实施例中,电子设备1300可以被一个或多个应用专用集成电路(ASIC,Application Specific Integrated Circuit)、DSP、可编程逻辑器件(PLD,Programmable Logic Device)、复杂可编程逻辑器件(CPLD,Complex Programmable Logic Device)、FPGA、通用处理器、控制器、MCU、MPU、或其他电子元件实现,用于执行前述方法。In an exemplary embodiment, the electronic device 1300 may be configured by one or more application specific integrated circuits (ASIC, Application Specific Integrated Circuit), DSP, programmable logic device (PLD, Programmable Logic Device), and complex programmable logic device (CPLD). , Complex Programmable Logic Device), FPGA, general-purpose processor, controller, MCU, MPU, or other electronic components to implement the foregoing method.
本发明实施例还提供了一种存储介质,用于存储计算机程序。The embodiment of the present invention also provides a storage medium for storing computer programs.
可选的,该存储介质可应用于本发明实施例中的终端设备,并且该计算机程序使得计算机执行本发明实施例的各个方法中的相应流程,为了简洁,在此不再赘述。Optionally, the storage medium can be applied to the terminal device in the embodiment of the present invention, and the computer program causes the computer to execute the corresponding process in each method of the embodiment of the present invention. For brevity, details are not described herein again.
本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生配置为实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present invention. It should be understood that each process and/or block in the flowchart and/or block diagram, and the combination of processes and/or blocks in the flowchart and/or block diagram can be implemented by computer program instructions. These computer program instructions can be provided to the processors of general-purpose computers, special-purpose computers, embedded processors, or other programmable data processing equipment to generate a machine, so that the instructions executed by the processor of the computer or other programmable data processing equipment generate configuration A device for realizing the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions can also be stored in a computer-readable memory that can guide a computer or other programmable data processing equipment to work in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including the instruction device. The device implements the functions specified in one process or multiple processes in the flowchart and/or one block or multiple blocks in the block diagram.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供配置为实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing equipment, so that a series of operation steps are executed on the computer or other programmable equipment to produce computer-implemented processing, so as to execute on the computer or other programmable equipment. The instructions provide steps configured to implement functions specified in a flow or multiple flows in the flowchart and/or a block or multiple blocks in the block diagram.
以上所述,仅为本发明的较佳实施例而已,并非配置为限定本发明的保护范围,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。The above are only the preferred embodiments of the present invention and are not configured to limit the scope of protection of the present invention. Any modification, equivalent replacement and improvement made within the spirit and principle of the present invention shall be included in Within the protection scope of the present invention.
Claims (27)
- 一种信息处理方法,所述方法包括:An information processing method, the method comprising:通过深度信息传感器获取目标对象的深度信息的情况下,获取所述深度信息对应的原始深度信息,所述原始深度信息表征所述深度信息传感器采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;In the case of acquiring the depth information of the target object through the depth information sensor, the original depth information corresponding to the depth information is acquired, and the original depth information represents the acquisition state of the depth information collected by the depth information sensor or the collected depth information. Information other than in-depth information;通过图像传感器获取所述目标对象的视频图像数据;Acquiring video image data of the target object through an image sensor;对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,并输出所述视频图像码流。The original depth information and the video image data are combined and encoded to obtain a video image code stream, and the video image code stream is output.
- 根据权利要求1所述的方法,其中,所述对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,包括:The method according to claim 1, wherein said combining and encoding said original depth information and said video image data to obtain a video image code stream comprises:对所述视频图像数据对应的图像帧中指定图像帧对应的原始深度信息和所述视频图像数据进行合并编码,得到所述视频图像码流。The original depth information corresponding to the designated image frame in the image frame corresponding to the video image data and the video image data are combined and encoded to obtain the video image code stream.
- 根据权利要求1所述的方法,其中,所述对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,包括:The method according to claim 1, wherein said combining and encoding said original depth information and said video image data to obtain a video image code stream comprises:对指定图像位置对应的原始深度信息和所述视频图像数据进行合并编码,得到所述视频图像码流。The original depth information corresponding to the designated image position and the video image data are combined and encoded to obtain the video image code stream.
- 根据权利要求1至3任一项所述的方法,其中,所述对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,包括:The method according to any one of claims 1 to 3, wherein the combining and encoding the original depth information and the video image data to obtain a video image code stream comprises:根据所述原始深度信息和所述视频图像数据的相关性,对所述原始深度信息和所述视频图像数据进行混合编码,得到所述视频图像码流。According to the correlation between the original depth information and the video image data, the original depth information and the video image data are mixed-encoded to obtain the video image code stream.
- 根据权利要求4所述的方法,其中,所述对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,还包括:The method according to claim 4, wherein said combining and encoding said original depth information and said video image data to obtain a video image code stream, further comprising:将所述原始深度信息对应的第一编码信息写入所述视频图像数据对应的第二编码信息的指定位置处。The first encoding information corresponding to the original depth information is written into the designated position of the second encoding information corresponding to the video image data.
- 根据权利要求1至3任一项所述的方法,其中,所述对所述原始深度信息和所述视频图像数据进行合并编码,包括:The method according to any one of claims 1 to 3, wherein the combining and encoding the original depth information and the video image data comprises:分别对所述原始深度信息和所述视频图像数据进行独立编码,得到包括第一码流和第二码流的图像视频码流,所述第一码流为所述原始深度信息编码后得到的码流,所述第二码流为所述图像视频数据编码后得到的码流。The original depth information and the video image data are separately encoded to obtain an image video code stream including a first code stream and a second code stream, and the first code stream is obtained after encoding the original depth information A code stream, and the second code stream is a code stream obtained after encoding the image and video data.
- 根据权利要求1至6任一项所述的方法,其中,在对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流之前,所述方法还包括:The method according to any one of claims 1 to 6, wherein, before combining and encoding the original depth information and the video image data to obtain a video image code stream, the method further comprises:对所述原始深度信息进行预处理;Preprocessing the original depth information;所述对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流,包括:The combining and encoding the original depth information and the video image data to obtain a video image code stream includes:对经过预处理的原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流。The preprocessed original depth information and the video image data are combined and encoded to obtain a video image code stream.
- 根据权利要求1至7任一项所述的方法,其中,在对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流之前,所述方法还包括:The method according to any one of claims 1 to 7, wherein, before combining and encoding the original depth information and the video image data to obtain a video image code stream, the method further comprises:对所述原始深度信息进行冗余消除处理,以消除所述原始深度信息中的冗余信息。Perform redundancy elimination processing on the original depth information to eliminate redundant information in the original depth information.
- 根据权利要求8所述的方法,其中,所述根据对所述原始深度信息进行冗余消除处理,包括以下至少之一:The method according to claim 8, wherein said performing redundancy elimination processing on said original depth information according to said process comprises at least one of the following:基于相位相关性对所述原始深度信息冗余消除处理;Eliminating the redundancy of the original depth information based on phase correlation;基于空间相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on spatial correlation;基于时间相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on time correlation;基于指定深度对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on the specified depth;基于频域相关性对所述原始深度信息进行冗余消除处理;Performing redundancy elimination processing on the original depth information based on frequency domain correlation;基于编码二进制数据之间的相关性对所述原始深度信息的编码比特进行冗余消除处理。Perform redundancy elimination processing on the coded bits of the original depth information based on the correlation between the coded binary data.
- 根据权利要求1至9任一项所述的方法,其中,所述原始深度信息包括以下至少之一:电荷信息、相位信息和所述深度信息传感器的属性参数。The method according to any one of claims 1 to 9, wherein the original depth information includes at least one of the following: charge information, phase information, and attribute parameters of the depth information sensor.
- 一种信息处理方法,所述方法包括:An information processing method, the method comprising:接收视频图像码流,所述视频图像码流为对原始深度信息和视频图像数据进行合并编码得到的,所述原始深度信息是通过深度信息传感器获取目标对象的深度信息的情况下获取的,所述视频图像数据是通过图像传感器获取的所述目标对象的;所述原始深度信息表征所述深度信息传感器采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;Receive a video image code stream, the video image code stream is obtained by combining and encoding original depth information and video image data, and the original depth information is obtained when the depth information of the target object is obtained by the depth information sensor, so The video image data is obtained by the target object through an image sensor; the original depth information represents the collection state of the depth information collected by the depth information sensor or information other than the collected depth information;对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;Decoding the video image code stream to obtain the original depth information and the video image corresponding to the video image data;对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。Image processing is performed on the original depth information and the video image to obtain a target video image.
- 根据权利要求11所述的方法,其中,The method of claim 11, wherein:通过视频图像解码器对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;Decoding the video image code stream by a video image decoder to obtain the original depth information and the video image corresponding to the video image data;通过图像处理器对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。Image processing is performed on the original depth information and the video image by an image processor to obtain a target video image.
- 根据权利要求12所述的方法,其中,所述视频图像解码器和所述图像处理器相互独立,或所述图像处理器集成在所述视频图像解码器内。The method according to claim 12, wherein the video image decoder and the image processor are independent of each other, or the image processor is integrated in the video image decoder.
- 根据权利要求11至13任一项所述的方法,其中,所述原始深度信息包括以下至少之一:电荷信息、相位信息和所述深度信息传感器的属性参数。The method according to any one of claims 11 to 13, wherein the original depth information includes at least one of the following: charge information, phase information, and attribute parameters of the depth information sensor.
- 根据权利要求14所述的方法,其中,当所述原始深度信息为电荷信息,所述对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像,包括:The method according to claim 14, wherein, when the original depth information is charge information, the performing image processing on the original depth information and the video image to obtain a target video image comprises:根据所述电荷信息对所述视频图像进行去噪处理或白平衡调节,得到所述目标视频图像。Performing denoising processing or white balance adjustment on the video image according to the charge information to obtain the target video image.
- 根据权利要求14所述的方法,其中,当所述原始深度信息为相位信息,所述对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像,包括:The method according to claim 14, wherein, when the original depth information is phase information, the performing image processing on the original depth information and the video image to obtain a target video image comprises:根据所述相位信息对所述视频图像进行去模糊处理,得到所述目标视频图像。Performing deblurring processing on the video image according to the phase information to obtain the target video image.
- 根据权利要求11至16任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 11 to 16, wherein the method further comprises:对所述原始深度信息进行恢复,得到深度图像。The original depth information is restored to obtain a depth image.
- 根据权利要求17所述的方法,其中,通过深度图像生成器对所述原始深度信息进行恢复,得到所述深度图像。The method according to claim 17, wherein the original depth information is restored by a depth image generator to obtain the depth image.
- 一种终端设备,所述终端设备包括:A terminal device, the terminal device includes:第一获取单元,配置为通过深度信息传感单元获取目标对象的深度信息的情况下,获取所述深度信息对应的原始深度信息,所述原始深度信息表征所述深度信息传感单元采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;The first acquiring unit is configured to acquire original depth information corresponding to the depth information when the depth information of the target object is acquired through the depth information sensing unit, and the original depth information represents that the depth information sensing unit collects the The collection status of depth information or information other than the collected depth information;第二获取单元,配置为通过图像传感单元获取所述目标对象的视频图像数据;The second acquiring unit is configured to acquire the video image data of the target object through an image sensing unit;编码单元,配置为对所述原始深度信息和所述视频图像数据进行合并编码,得到 视频图像码流;An encoding unit configured to merge and encode the original depth information and the video image data to obtain a video image code stream;输出单元,配置为输出所述视频图像码流。The output unit is configured to output the video image code stream.
- 根据权利要求19所述的终端设备,其中,所述编码单元,还配置为:The terminal device according to claim 19, wherein the encoding unit is further configured to:根据所述原始深度信息和所述视频图像数据的相关性,对所述原始深度信息和所述视频图像数据进行混合编码,得到所述视频图像码流。According to the correlation between the original depth information and the video image data, the original depth information and the video image data are mixed-encoded to obtain the video image code stream.
- 根据权利要求19或20所述的终端设备,其中,所述编码单元,还配置为:The terminal device according to claim 19 or 20, wherein the encoding unit is further configured to:对所述原始深度信息的进行编码,得到第一编码信息;Encoding the original depth information to obtain first encoding information;对视频图像数据进行编码,得到第二编码信息;Encoding the video image data to obtain second encoding information;将所述第一编码信息和所述第二编码信息进行合并,得到所述视频图像码流。Combining the first coding information and the second coding information to obtain the video image code stream.
- 根据权利要求19至21任一项所述的终端设备,其中,所述终端设备还包括:The terminal device according to any one of claims 19 to 21, wherein the terminal device further comprises:预处理单元,配置为:The preprocessing unit is configured as:在对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流之前,对所述原始深度信息进行预处理。Before combining and encoding the original depth information and the video image data to obtain a video image code stream, preprocessing the original depth information.
- 根据权利要求19至22任一项所述的终端设备,其中,所述终端设备还包括:The terminal device according to any one of claims 19 to 22, wherein the terminal device further comprises:消除单元,配置为:Elimination unit, configured as:在对所述原始深度信息和所述视频图像数据进行合并编码,得到视频图像码流之前,对所述原始深度信息进行冗余消除处理,以消除所述原始深度信息中的冗余信息。Before the original depth information and the video image data are combined and encoded to obtain a video image code stream, redundancy elimination processing is performed on the original depth information to eliminate redundant information in the original depth information.
- 一种终端设备,所述终端设备包括:A terminal device, the terminal device includes:接收单元,配置为接收视频图像码流,所述视频图像码流为对原始深度信息和视频图像数据进行合并编码得到的,所述原始深度信息是通过深度信息传感单元获取目标对象的深度信息的情况下获取的,所述视频图像数据是通过图像传感单元获取的所述目标对象的;所述原始深度信息表征所述深度信息传感单元采集所述深度信息的采集状态或采集到的所述深度信息以外的信息;The receiving unit is configured to receive a video image code stream, the video image code stream is obtained by combining and encoding original depth information and video image data, and the original depth information obtains the depth information of the target object through the depth information sensing unit In the case of acquiring the video image data, the target object is acquired by the image sensing unit; the original depth information represents the acquisition state of the depth information acquired by the depth information sensing unit or the acquisition state of the depth information acquired by the depth information sensing unit Information other than the depth information;解码单元,配置为对所述视频图像码流进行解码,得到所述原始深度信息和所述视频图像数据对应的视频图像;A decoding unit configured to decode the video image code stream to obtain the original depth information and the video image corresponding to the video image data;处理单元,配置为对所述原始深度信息和所述视频图像进行图像处理,得到目标视频图像。The processing unit is configured to perform image processing on the original depth information and the video image to obtain a target video image.
- 根据权利要求24所述的终端设备,其中,所述终端设备还包括:The terminal device according to claim 24, wherein the terminal device further comprises:生成单元,配置为对所述深度信息进行恢复,得到深度图像。The generating unit is configured to restore the depth information to obtain a depth image.
- 一种终端设备,包括处理器和配置为存储能够在处理器上运行的计算机程序的存储器,其中,所述处理器配置为运行所述计算机程序时,执行上述权利要求1至10任一项所述的信息处理方法的步骤,或执行上述权利要求11至18任一项所述的信息处理方法的步骤。A terminal device, comprising a processor and a memory configured to store a computer program that can run on the processor, wherein the processor is configured to execute the computer program described in any one of claims 1 to 10 when the processor is configured to run the computer program. The steps of the information processing method described above, or the steps of the information processing method described in any one of claims 11 to 18 are executed.
- 一种存储介质,存储有可执行程序,所述可执行程序被处理器执行时,实现上述权利要求1至10任一项所述的信息处理方法,或实现上述权利要求11至18任一项所述的信息处理方法。A storage medium that stores an executable program that, when executed by a processor, implements the information processing method of any one of claims 1 to 10, or implements any one of claims 11 to 18 The described information processing method.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/116055 WO2021087819A1 (en) | 2019-11-06 | 2019-11-06 | Information processing method, terminal device and storage medium |
CN201980100362.9A CN114391259B (en) | 2019-11-06 | 2019-11-06 | Information processing method, terminal device and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/116055 WO2021087819A1 (en) | 2019-11-06 | 2019-11-06 | Information processing method, terminal device and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021087819A1 true WO2021087819A1 (en) | 2021-05-14 |
Family
ID=75849423
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/116055 WO2021087819A1 (en) | 2019-11-06 | 2019-11-06 | Information processing method, terminal device and storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114391259B (en) |
WO (1) | WO2021087819A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116170581A (en) * | 2023-02-17 | 2023-05-26 | 厦门瑞为信息技术有限公司 | Video information encoding and decoding method based on target perception and electronic equipment |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110189371B (en) * | 2019-05-20 | 2023-06-30 | 东南大学 | Mouse balance state discriminating device and method based on TOF depth camera |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102630026A (en) * | 2011-02-03 | 2012-08-08 | 美国博通公司 | Method and system for processing video |
US20130021446A1 (en) * | 2011-07-20 | 2013-01-24 | GM Global Technology Operations LLC | System and method for enhanced sense of depth video |
CN103440662A (en) * | 2013-09-04 | 2013-12-11 | 清华大学深圳研究生院 | Kinect depth image acquisition method and device |
CN110355758A (en) * | 2019-07-05 | 2019-10-22 | 北京史河科技有限公司 | A kind of machine follower method, equipment and follow robot system |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
MX2020007663A (en) * | 2018-01-19 | 2020-09-14 | Interdigital Vc Holdings Inc | Processing a point cloud. |
CN110335211B (en) * | 2019-06-24 | 2021-07-30 | Oppo广东移动通信有限公司 | Method for correcting depth image, terminal device and computer storage medium |
CN110336973B (en) * | 2019-07-29 | 2021-04-13 | 联想(北京)有限公司 | Information processing method and device, electronic device and medium |
-
2019
- 2019-11-06 CN CN201980100362.9A patent/CN114391259B/en active Active
- 2019-11-06 WO PCT/CN2019/116055 patent/WO2021087819A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102630026A (en) * | 2011-02-03 | 2012-08-08 | 美国博通公司 | Method and system for processing video |
US20130021446A1 (en) * | 2011-07-20 | 2013-01-24 | GM Global Technology Operations LLC | System and method for enhanced sense of depth video |
CN103440662A (en) * | 2013-09-04 | 2013-12-11 | 清华大学深圳研究生院 | Kinect depth image acquisition method and device |
CN110355758A (en) * | 2019-07-05 | 2019-10-22 | 北京史河科技有限公司 | A kind of machine follower method, equipment and follow robot system |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116170581A (en) * | 2023-02-17 | 2023-05-26 | 厦门瑞为信息技术有限公司 | Video information encoding and decoding method based on target perception and electronic equipment |
CN116170581B (en) * | 2023-02-17 | 2024-01-23 | 厦门瑞为信息技术有限公司 | Video information encoding and decoding method based on target perception and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN114391259A (en) | 2022-04-22 |
CN114391259B (en) | 2024-05-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109863754B (en) | Virtual reality 360-degree video camera system for live streaming | |
KR102343700B1 (en) | Video transmission based on independently encoded background updates | |
CN102326391B (en) | Multi-view image coding device, multi-view image decoding method, multi-view image decoding device, multi-view image decoding method | |
US8374444B2 (en) | Method and apparatus for providing higher resolution images in an embedded device | |
US11089214B2 (en) | Generating output video from video streams | |
JP6901468B2 (en) | Methods and equipment for encoding and decoding light field-based images, and corresponding computer program products. | |
JP2019534606A (en) | Method and apparatus for reconstructing a point cloud representing a scene using light field data | |
WO2021057689A1 (en) | Video decoding method and apparatus, video encoding method and apparatus, storage medium, and electronic device | |
US10511766B2 (en) | Video transmission based on independently encoded background updates | |
WO2018223086A1 (en) | Methods for full parallax light field compression | |
WO2021057477A1 (en) | Video encoding and decoding method and related device | |
WO2021087819A1 (en) | Information processing method, terminal device and storage medium | |
KR101763921B1 (en) | Method and system for contents streaming | |
JP2015019326A (en) | Encoding device, encoding method, decoding device, and decoding method | |
WO2015056712A1 (en) | Moving image encoding method, moving image decoding method, moving image encoding device, moving image decoding device, moving image encoding program, and moving image decoding program | |
JP2013150071A (en) | Encoder, encoding method, program and storage medium | |
JP2018033127A (en) | Method and device for encoding signal representative of light-field content | |
JP2019083405A (en) | Decoding device, transmission device, decoding method, control method for transmission device, and program | |
EP3907992A1 (en) | Method for image processing and apparatus for implementing the same | |
WO2021087810A1 (en) | Information processing methods and systems, and encoding apparatus, decoding apparatus and storage medium | |
JP7303930B1 (en) | Image processing method, device, electronic device and readable storage medium | |
US20220230361A1 (en) | Information processing method, and encoding device | |
KR101760760B1 (en) | Method and video transmission server for transmitting motion vector, and method and device for reproducing video | |
JP2021170689A (en) | Image processing device and method | |
WO2015141549A1 (en) | Video encoding device and method and video decoding device and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19951482 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19951482 Country of ref document: EP Kind code of ref document: A1 |