WO2018036112A1 - Method and system for encoding/decoding panoramic video - Google Patents

Publication number
WO2018036112A1
WO2018036112A1 PCT/CN2017/073979 CN2017073979W WO2018036112A1 WO 2018036112 A1 WO2018036112 A1 WO 2018036112A1 CN 2017073979 W CN2017073979 W CN 2017073979W WO 2018036112 A1 WO2018036112 A1 WO 2018036112A1
Authority
WO
WIPO (PCT)
Prior art keywords
panoramic
video
encoding
decoding
user
Prior art date
Application number
PCT/CN2017/073979
Other languages
French (fr)
Chinese (zh)
Inventor
孙其民
李炜
Original Assignee
深圳市掌网科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市掌网科技股份有限公司
Publication of WO2018036112A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20: Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21: Server components or server architectures
    • H04N21/218: Source of audio or video content, e.g. local disk arrays

Definitions

  • the present invention relates to the field of panoramic video codec technology, and more particularly to a method and system for encoding and decoding panoramic video.
  • Panoramic video is the main content carrier for virtual reality image display. It is a video composed of a series of panoramic image frames, so the picture is presented dynamically as a panorama.
  • A panoramic image frame is an image frame formed by recording the scene over a predetermined viewing angle, for example 360° horizontal and 180° vertical, or 360° horizontal and 270° vertical.
  • After the panoramic video is obtained from the panoramic camera, the encoder encodes it and sends the encoded panoramic video to the decoder for decoding; after decoding, the user can watch the panoramic video.
  • In the prior art, the encoder acquires all the panoramic image frames in the panoramic video and encodes each of them, either according to the encoding rules defined by the H.264/AVC video coding standard or according to those defined by the High Efficiency Video Coding standard H.265.
  • Encoding a panoramic image frame yields a set of rectangular regions in units of blocks, which is the encoded data of that image frame.
  • Each encoded panoramic image frame is then sent to the decoder. The decoder in the user equipment decodes the received frames, and the decoded frames together form a complete video for the user to watch.
  • In practice, however, the user's viewing angle is usually around 150°, so only the encoded data covering that roughly 150° of a panoramic image frame is useful. Receiving and decoding the remaining encoded data causes unnecessary decoding overhead.
  • An object of the present invention is to provide a method and system for encoding and decoding panoramic video that address three problems: the redundant decoding overhead for panoramic image frames in existing virtual reality video; how to encode and decode, in real time, the image frames of the panoramic video that correspond to the field of view (FOV); and the difficulty of judging the trend of irregular head movement.
  • To solve this technical problem, the present invention provides a method for encoding and decoding panoramic video that uses a panoramic camera, a communication unit, user equipment, a camera group, an image data buffer unit, an encoder, a positioning device, a pre-processing unit and a decoder, and that includes the following steps:
  • S1: the panoramic video captured by the panoramic camera is sent to the image data buffer unit;
  • S2: the image data buffer unit, by means of the encoder and according to a viewing angle division rule, divides each panoramic image frame of the panoramic video into A regions by horizontal viewing angle, A being an integer, and divides the panoramic video into multiple valid viewing-angle video sequences according to the head motion trend;
  • S3: the encoder encodes the valid viewing-angle video sequences, each composed of the panoramic image frames of a specific viewing angle, to obtain multiple sets of encoded data;
  • S4: the encoded data corresponding to the user's viewing angle is sent to the user equipment through the communication unit, and the user equipment decodes the encoded data with the decoder and plays the decoded viewing-angle video sequence.
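  • Preferably, step S3 comprises the following sub-steps:
  • S31: the panoramic camera captures video through cameras of different orientations;
  • S32: the image frames captured by the camera group at the same moment are stitched into one panoramic image frame;
  • S33: according to the head motion trend data and the viewing angle division rule, the panoramic image frames of the specific viewing angle form a valid panoramic video, which is provided to the encoder.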
  • Preferably, step S4 comprises the following sub-steps:
  • S41: the head motion trend of the user is collected by the positioning device in the user equipment;
  • S42: the encoder determines, according to the head motion trend data, the valid viewing-angle video sequence corresponding to the predetermined region among the A regions;
  • S43: the pre-processing unit determines, according to the user's head motion trend data collected by the positioning device, the length of the display time slot of the next panoramic image frame.
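  • Preferably, step S41 comprises the following sub-steps:
  • S411: a reference position point is determined, whose position is associated with a specific position and direction of the user's field of view in the VR environment;
  • S412: every movement of the user's head in each direction from the reference position, and every rotation, is tracked to determine how to display the panoramic image frames so that the VR environment remains continuous and realistic;
  • S413: the user equipment sends the collected motion trend data to the panoramic camera.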
  • The motion trend data includes the angle and amplitude of the detected rotation of the user's head.
  • The time slot is the length of time for which the panoramic image of a given viewing angle is displayed, and it is determined by the motion trend data of the user's head obtained by the pre-processing unit.
  • The viewing angle division rule is the same for all image frames in one panoramic video, the sum of the horizontal viewing angles of the A regional viewing-angle video sequences equals the total horizontal viewing angle of the panoramic image frame, and the regions within one panoramic image frame do not overlap each other.
  • The invention also provides a system for encoding and decoding panoramic video, including a panoramic camera, a communication unit and user equipment. The panoramic camera includes a camera group, composed of cameras that capture video in each orientation, and an image data buffer unit that stores the panoramic video captured by the camera group. The image data buffer unit includes an encoder for encoding the panoramic image frames of the panoramic video of a predetermined viewing angle.
  • The user equipment includes a positioning device for collecting the user's head motion trend data, a pre-processing unit for determining in advance the length of the time slot for which the next panoramic image frame is displayed on the user equipment, and a decoder for decoding the encoded panoramic image frames.
  • The panoramic camera captures video through cameras of different orientations and sends the captured panoramic video to the image data buffer unit. The encoder in the image data buffer unit divides the panoramic video into a plurality of viewing-angle video sequences according to the viewing angle division rule and encodes each sequence to obtain a plurality of sets of encoded data. Using the motion trend data of the user's head collected by the positioning device, the corresponding panoramic encoded data is sent to the user equipment through the communication unit.
  • The pre-processing unit in the user equipment determines, according to the motion trend data of the user's head collected by the positioning device, the display time slot of the next partial-range panoramic image frame and sends a control instruction to the encoder and the decoder. The encoder and the decoder then encode and decode the panoramic image frames of the specific viewing angle according to the motion trend data and the control instruction.
  • The method and system for encoding and decoding panoramic video of the present invention have the following beneficial effects:
  • The encoder in the image data buffer unit divides the panoramic video into multiple viewing-angle video sequences according to the viewing angle division rule and encodes each viewing-angle video sequence to obtain multiple sets of encoded data.
  • The positioning device collects the motion trend data of the user's head in real time, so the user's motion trend can be determined accurately, and the pre-processing unit ensures that the panoramic video corresponding to the effective field of view (FOV) in front of the viewing angle is decoded in real time, which reduces the decoding overhead.
  • FIG. 1 is a schematic diagram of a system embodiment of a codec panoramic video according to the present invention.
  • FIG. 2 is a logic diagram of a panoramic camera embodiment of a system for encoding and decoding panoramic video according to the present invention
  • FIG. 3 is a schematic diagram of a user equipment of a system for encoding and decoding panoramic video according to the present invention
  • FIG. 4 is a schematic flowchart of a method for encoding and decoding panoramic video according to the present invention.
  • FIG. 5 is a schematic flowchart of step S3 in an embodiment of a method for encoding and decoding panoramic video according to the present invention
  • A further figure is a schematic flowchart of step S4 in an embodiment of a method for encoding and decoding panoramic video according to the present invention.
  • A further figure is a schematic flowchart of step S41 in an embodiment of a method for encoding and decoding panoramic video according to the present invention.
  • FIG. 1 is a logic diagram of a system embodiment of a codec panoramic video according to the present invention.
  • FIG. 2 is a schematic diagram of a panoramic camera embodiment of a system for encoding and decoding panoramic video according to the present invention
  • FIG. 3 is a schematic diagram of a panoramic camera embodiment of the present invention.
  • the system 100 of the present invention includes a panoramic camera 110, a communication unit 120, and a user equipment 130.
  • The panoramic camera 110 includes a camera group 111, composed of cameras that capture video in each orientation, and an image data buffer unit 112 for storing the panoramic video captured by the camera group. The image data buffer unit 112 includes an encoder 112a for encoding the panoramic image frames of the panoramic video of the predetermined viewing angle.
  • The user equipment 130 includes a positioning device 131 for collecting the user's head motion trend data, a pre-processing unit 132 for determining in advance the length of the time slot for which the next panoramic image frame is displayed on the user equipment, and a decoder 133 for decoding the encoded panoramic image frames.
  • In the prior art, the encoder acquires all the panoramic image frames in the panoramic video and encodes each of them, either according to the encoding rules defined by the H.264/AVC video coding standard or according to those defined by the High Efficiency Video Coding standard H.265. The decoder in the user equipment decodes the received encoded panoramic image frames, and the decoded frames together form a complete video for the user to watch.
  • The user's viewing angle is usually around 150°, so only the encoded data covering that roughly 150° of a panoramic image frame is useful. Receiving and decoding the remaining encoded data causes unnecessary decoding overhead.
  • The technical solution proposed by the present invention is as follows. A positioning device 131 is provided in the user equipment 130. The positioning device 131 has a gravity sensor and a gyro sensor that detect the motion trend data of the user's head, specifically the six-degree-of-freedom parameters (three spatial coordinates and three angular coordinates) that describe the head movement, so that the motion of the user's head is tracked accurately.
  • The encoder 112a divides each panoramic image frame in the panoramic video according to a preset viewing angle division rule.
  • A pre-processing unit 132 determines, according to the motion trend data of the user's head collected by the positioning device 131, the display time slot of the next partial-range panoramic image frame and sends a control command to the encoder 112a and the decoder 133.
  • The decoder 133 decodes the received panoramic image frames of the valid viewing-angle video, and the video is presented to the user to watch.
  • The technical solution proposed by the present invention effectively solves two technical problems:
  • 1. Real-time encoding and decoding of the panoramic video corresponding to the effective field of view (FOV) in front of the viewing angle is ensured by the positioning device 131 and the viewing angle division rule of the encoder.
  • 2. Through the positioning device 131 and the pre-processing unit 132, the user's head movement trend can be determined accurately and the video frames to be encoded are selected in advance.
  • The panoramic camera 110 captures video through the camera group 111, whose cameras face different orientations, and transmits the captured panoramic video to the image data buffer unit 112.
  • The encoder 112a in the image data buffer unit 112 divides the panoramic video into a plurality of viewing-angle video sequences according to the viewing angle division rule and encodes each sequence to obtain a plurality of sets of encoded data.
  • Using the motion trend data of the user's head collected by the positioning device 131, the corresponding panoramic encoded data is sent through the communication unit 120 to the user equipment 130.
  • The pre-processing unit 132 in the user equipment 130 determines, according to the motion trend data of the user's head collected by the positioning device 131, the display time slot of the next partial-range panoramic image frame and sends a control command to the encoder 112a and the decoder 133. The encoder 112a and the decoder 133 then encode and decode the panoramic image frames of the specific viewing angle according to the motion trend data and the control command.
  • FIG. 4 is a schematic flowchart of a method for encoding and decoding a panoramic video according to the present invention.
  • FIG. 5 is a schematic flowchart of a step S3 in a method for encoding and decoding a panoramic video according to an embodiment of the present invention.
  • FIG. 7 is a schematic flowchart of step S4 in an embodiment of a method for encoding and decoding panoramic video according to the present invention.
  • The method of the present invention for dynamically selecting, encoding and decoding panoramic video based on the viewing angle includes the following steps:
  • S1: after the panoramic camera captures the panoramic video, it sends the panoramic video to the image data buffer unit 112. The camera group 111 of the panoramic camera may be distributed around the head on the structure of a head-mounted display device, or it may be separate from the user's VR display device. The panoramic video captured by the camera group 111 is stored in the image data buffer unit 112;
  • S2: the image data buffer unit 112 sends the received panoramic video to the encoder 112a, which divides it into a plurality of viewing-angle video sequences according to the viewing angle division rule;
  • S4: the encoded data corresponding to the user's viewing angle is sent to the user equipment 130, and the user equipment 130 decodes the encoded data with the decoder 133 and plays the decoded viewing-angle video sequence.
  • The viewing angle division rule instructs the encoder 112a how to divide a panoramic image frame by viewing angle, and the same rule applies to all panoramic image frames in one panoramic video.
  • The sum of the horizontal viewing angles of the A regional viewing-angle video sequences equals the total horizontal viewing angle of the panoramic image frame, and the regions within one panoramic image frame do not overlap each other.
  • For example, when the total horizontal viewing angle of the panoramic image frame is 360°, the sum of the horizontal viewing angles of the A viewing-angle video sequences is 360°.
  • The horizontal viewing-angle ranges of the regions in a panoramic image frame may be equal or different; this is not limited.
  • The viewing angle division rule can be chosen according to the viewing angles that users commonly use, which improves the accuracy of the region division. For example, if the viewing angle commonly used by users is 20° to 175°, the rule uses 0° to 20° as one horizontal viewing angle, 20° to 175° as a second, and 175° to 360° as a third.
  • Step S3 includes the following sub-steps: S31, the panoramic camera captures video through cameras of different orientations; S32, the image frames captured by all cameras at the same moment are stitched into one panoramic image frame; S33, the panoramic image frames form a panoramic video, which is provided to the encoder 112a.
  • For the first panoramic image frame, the region with a horizontal viewing angle of 0° to 120° is taken as viewing region 1, the region of 120° to 240° as viewing region 2, and the region of 240° to 360° as viewing region 3.
  • For the second panoramic image frame, the region with a horizontal viewing angle of 0° to 120° is likewise taken as viewing region 1, the region of 120° to 240° as viewing region 2, and the region of 240° to 360° as viewing region 3.
  • The same division applies to every subsequent panoramic image frame: 0° to 120° is viewing region 1, 120° to 240° is viewing region 2, and 240° to 360° is viewing region 3.
  • Step S4 includes the following sub-steps: S41, the head motion trend of the user is collected by the positioning device 131 in the user equipment 130; S42, the encoder 112a determines the predetermined region among the A regions according to the head motion trend data; S43, the pre-processing unit 132 determines the length of the display time slot of the next panoramic image frame according to the user's head motion trend data collected by the positioning device 131.
  • The time slot proposed by the present invention is not the scan time used to display one full-screen picture, as the term is generally understood in the field.
  • In the present solution, the length of the time slot is determined by the user equipment 130 from the motion data actually monitored and collected by the positioning device 131. For example, during one interval the user looks at the partial panoramic image on the screen from a fixed viewing angle, and a fixed horizontal viewing angle is encoded, decoded and presented according to the preset viewing angle division rule. During a more active image interaction session, the user's view of the panoramic video changes frequently, and the encoder 112a and the decoder 133 display the images corresponding to the varying horizontal viewing angles.
  • Step S41 includes the following sub-steps: S411, a reference position point is determined, whose position is associated with a specific position and direction of the user's field of view in the VR environment; S412, every movement of the user's head in each direction from the reference position, and every rotation, is tracked to determine how to display the panoramic image frames so that the VR environment remains continuous and realistic; S413, the user equipment 130 sends the collected motion trend data to the panoramic camera.
  • The latest six-degree-of-freedom parameter data (three spatial coordinates and three angular coordinates) is acquired by the gyro sensor and the gravity sensor provided in the positioning device 131, which ensures that the field of view displayed to the user through the graphics data is consistent with the field of view that should actually be displayed.
  • The method and system for encoding and decoding panoramic video of the present invention have the following beneficial effects:
  • The encoder 112a in the image data buffer unit 112 divides the panoramic video into a plurality of viewing-angle video sequences according to the viewing angle division rule and encodes each sequence to obtain a plurality of sets of encoded data. The positioning device 131 collects the motion trend data of the user's head in real time, so the user's motion trend is determined accurately, and the pre-processing unit 132 ensures that the panoramic video corresponding to the effective field of view (FOV) in front of the viewing angle is decoded in real time, which reduces the decoding overhead.

Abstract

Provided are a method and system for encoding/decoding panoramic video, said method comprising: a panoramic camera photographing a panoramic video, then sending the panoramic video to a video-data caching unit; according to a viewing-angle division rule, said video-data caching unit dividing the panoramic video into a plurality of viewing-angle video sequences by means of an encoder; encoding each of the viewing-angle video sequences to obtain multiple sets of encoded data; a user equipment decoding said encoded data by means of a decoder and playing the obtained viewing-angle video sequences; the system comprises a panoramic camera, a communications unit, and a user equipment; the solution provided not only accurately determines the trend of a user's movement, but also ensures, according to a preprocessing unit, real-time decoding of the panoramic video corresponding to the effective field of view (FOV) in front of the viewing angle, reducing the decoding overhead.

Description

Method and system for encoding and decoding panoramic video
Technical field
[0001] The present invention relates to the field of panoramic video codec technology, and more particularly to a method and system for encoding and decoding panoramic video.
Background
[0002] Panoramic video is the main content carrier for virtual reality image display. It is a video composed of a series of panoramic image frames, so the picture is presented dynamically as a panorama. A panoramic image frame is an image frame formed by recording the scene over a predetermined viewing angle, for example 360° horizontal and 180° vertical, or 360° horizontal and 270° vertical. After the panoramic video is obtained from the panoramic camera, the encoder encodes it and sends the encoded panoramic video to the decoder for decoding; after decoding, the user can watch the panoramic video.
[0003] In the prior art, the encoder acquires all the panoramic image frames in the panoramic video and encodes each of them, either according to the encoding rules defined by the H.264/AVC video coding standard or according to those defined by the High Efficiency Video Coding standard H.265. Encoding a panoramic image frame yields a set of rectangular regions in units of blocks, which is the encoded data of that image frame. Each encoded panoramic image frame is then sent to the decoder; the decoder in the user equipment decodes the received frames, and the decoded frames together form a complete video for the user to watch.
[0004] In practice, however, the user's viewing angle is usually around 150°, so only the encoded data covering that roughly 150° of a panoramic image frame is useful. The remaining encoded data is not, and receiving and decoding it causes unnecessary decoding overhead.
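The size of the overhead described in [0004] can be estimated with a short calculation. The sketch below assumes, as a simplification that the source does not state, that the amount of encoded data is proportional to the horizontal angle it covers; the function name `wasted_fraction` is hypothetical.

```python
def wasted_fraction(view_deg: float, total_deg: float = 360.0) -> float:
    """Share of the horizontal panorama that lies outside the user's view.

    Assumption: encoded data volume scales linearly with horizontal angle.
    """
    if not 0 < view_deg <= total_deg:
        raise ValueError("view_deg must be in (0, total_deg]")
    return 1.0 - view_deg / total_deg


# With the 150° viewing angle quoted in [0004] and a 360° panorama:
print(f"{wasted_fraction(150.0):.1%}")  # prints 58.3%
```

Under that assumption, more than half of the data decoded for each 360° frame is never seen by the user.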
Technical problem
[0005] An object of the present invention is to provide a method and system for encoding and decoding panoramic video that address three problems: the redundant decoding overhead for panoramic image frames in existing virtual reality video; how to encode and decode, in real time, the image frames of the panoramic video that correspond to the field of view (FOV); and the difficulty of judging the trend of irregular head movement.
Technical solution
[0006] To solve this technical problem, the present invention provides a method for encoding and decoding panoramic video that uses a panoramic camera, a communication unit, user equipment, a camera group, an image data buffer unit, an encoder, a positioning device, a pre-processing unit and a decoder, and that includes the following steps:
[0007] S1. The panoramic video captured by the panoramic camera is sent to the image data buffer unit;
[0008] S2. The image data buffer unit, by means of the encoder and according to a viewing angle division rule, divides each panoramic image frame of the panoramic video into A regions by horizontal viewing angle, A being an integer, and divides the panoramic video into multiple valid viewing-angle video sequences according to the head motion trend;
[0009] S3. The encoder encodes the valid viewing-angle video sequences, each composed of the panoramic image frames of a specific viewing angle, to obtain multiple sets of encoded data;
[0010] S4. The encoded data corresponding to the user's viewing angle is sent to the user equipment through the communication unit, and the user equipment decodes the encoded data with the decoder and plays the decoded viewing-angle video sequence.
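The selection implied by steps S2 and S4 can be sketched as follows: split the horizontal range into A regions, then pick only the regions that overlap the user's current view. This is a minimal illustration, not the patented implementation. The names `equal_regions` and `regions_in_view` are hypothetical, equal-width regions are only one possible division rule, and the view is assumed to be given as a centre angle and a width in degrees.

```python
from typing import List, Tuple

Region = Tuple[float, float]  # horizontal range [start, end) in degrees


def equal_regions(a: int, total_deg: float = 360.0) -> List[Region]:
    """Split the horizontal range into `a` equal, non-overlapping regions."""
    width = total_deg / a
    return [(i * width, (i + 1) * width) for i in range(a)]


def regions_in_view(regions: List[Region], center_deg: float,
                    fov_deg: float) -> List[int]:
    """Indices of the regions that overlap a view window centred on center_deg."""
    half = fov_deg / 2.0
    lo = (center_deg - half) % 360.0
    hi = (center_deg + half) % 360.0
    # A window that crosses 0° is split into two non-wrapping intervals.
    windows = [(lo, hi)] if lo < hi else [(lo, 360.0), (0.0, hi)]
    return [
        i for i, (start, end) in enumerate(regions)
        if any(start < w_hi and w_lo < end for w_lo, w_hi in windows)
    ]


regions = equal_regions(3)                  # 0-120, 120-240, 240-360
print(regions_in_view(regions, 60.0, 90.0))   # [0]
print(regions_in_view(regions, 0.0, 150.0))   # [0, 2]
```

Only the encoded data for the returned regions would need to be sent to and decoded by the user equipment; the second call shows a view that straddles the 0°/360° seam and therefore needs two regions.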
[0011] Preferably, step S3 includes the following sub-steps:
[0012] S31. The panoramic camera captures video through cameras of different orientations;
[0013] S32. The image frames captured by the camera group at the same moment are stitched into one panoramic image frame;
[0014] S33. According to the head motion trend data and the viewing angle division rule, the panoramic image frames of the specific viewing angle form a valid panoramic video, which is provided to the encoder.
[0015] Preferably, step S4 includes the following sub-steps:
[0016] S41. The head motion trend of the user is collected by the positioning device in the user equipment;
[0017] S42. The encoder determines, according to the head motion trend data, the valid viewing-angle video sequence corresponding to the predetermined region among the A regions;
[0018] S43. The pre-processing unit determines, according to the user's head motion trend data collected by the positioning device, the length of the display time slot of the next panoramic image frame.
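Steps S42 and S43 leave the actual prediction and time-slot formulas open. The sketch below shows one way they could be realised, under loudly stated assumptions: the head motion trend is reduced to a yaw angle and an angular rate, the next view is extrapolated linearly, and the time slot shrinks as the head turns faster. The functions `predict_yaw` and `slot_length`, and all of their parameters and default values, are hypothetical and do not come from the source.

```python
def predict_yaw(yaw_deg: float, rate_deg_per_s: float, slot_s: float) -> float:
    """Linearly extrapolate the yaw angle to the end of the next time slot."""
    return (yaw_deg + rate_deg_per_s * slot_s) % 360.0


def slot_length(rate_deg_per_s: float, base_slot_s: float = 1.0,
                min_slot_s: float = 0.05, ref_rate: float = 30.0) -> float:
    """Hypothetical heuristic: a still head gets the full base slot,
    a fast-turning head gets a shorter one, never below min_slot_s."""
    speed = abs(rate_deg_per_s)
    return max(min_slot_s, base_slot_s / (1.0 + speed / ref_rate))


rate = 30.0                      # degrees per second, turning right
slot = slot_length(rate)         # 0.5 s with the defaults above
print(slot, predict_yaw(350.0, rate, slot))  # 0.5 5.0
```

The predicted yaw can then be mapped to one of the A regions so that the matching viewing-angle video sequence is chosen before the user's head arrives there.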
[0019] Preferably, step S41 includes the following sub-steps:
[0020] S411. A reference position point is determined, whose position is associated with a specific position and direction of the user's field of view in the VR environment;
[0021] S412. Every movement of the user's head in each direction from the reference position, and every rotation, is tracked to determine how to display the panoramic image frames so that the VR environment remains continuous and realistic;
[0022] S413. The user equipment sends the collected motion trend data to the panoramic camera.
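Sub-steps S411 and S412 amount to fixing a reference orientation and then tracking rotation relative to it. The following sketch covers only the horizontal (yaw) component and assumes timestamped yaw samples in degrees; the class `HeadTracker` and the function `signed_delta` are hypothetical names, and a real positioning device would fuse gyroscope and gravity-sensor readings rather than receive clean angles.

```python
from typing import Optional, Tuple


def signed_delta(angle_deg: float, reference_deg: float) -> float:
    """Smallest signed rotation from reference to angle, in (-180, 180]."""
    d = (angle_deg - reference_deg) % 360.0
    return d - 360.0 if d > 180.0 else d


class HeadTracker:
    """Tracks yaw relative to a fixed reference point (S411, S412)."""

    def __init__(self, reference_yaw_deg: float) -> None:
        self.reference = reference_yaw_deg
        self.last: Optional[Tuple[float, float]] = None  # (time_s, yaw_deg)

    def update(self, t_s: float, yaw_deg: float) -> Tuple[float, float]:
        """Return (offset from the reference in degrees, rate in deg/s)."""
        offset = signed_delta(yaw_deg, self.reference)
        rate = 0.0
        if self.last is not None and t_s > self.last[0]:
            rate = signed_delta(yaw_deg, self.last[1]) / (t_s - self.last[0])
        self.last = (t_s, yaw_deg)
        return offset, rate


tracker = HeadTracker(reference_yaw_deg=0.0)
print(tracker.update(0.0, 350.0))  # (-10.0, 0.0)
print(tracker.update(0.5, 10.0))   # (10.0, 40.0)
```

The offset and rate pair is one possible form of the motion trend data that the user equipment sends onward in S413; wrapping the difference into (-180, 180] keeps a turn across the 0°/360° seam from looking like a near-full rotation.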
[0023] In the method for encoding and decoding panoramic video according to the present invention, the motion trend data includes the angle and amplitude of the detected rotation of the user's head.
[0024] In the method for encoding and decoding panoramic video according to the present invention, the time slot is the length of time for which the panoramic image of a given viewing angle is displayed, and it is determined by the motion trend data of the user's head obtained by the pre-processing unit.
[0025] In the method for encoding and decoding panoramic video according to the present invention, the viewing angle division rule is the same for all image frames in one panoramic video, the sum of the horizontal viewing angles of the A regional viewing-angle video sequences equals the total horizontal viewing angle of the panoramic image frame, and the regions within one panoramic image frame do not overlap each other.
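The two constraints in [0025], that the regions do not overlap and that their horizontal angles sum to the total, are easy to check mechanically. The sketch below assumes a rule is given as a list of (start, end) pairs in degrees; the function name `is_valid_rule` and the tolerance parameter are hypothetical.

```python
from typing import List, Tuple


def is_valid_rule(regions: List[Tuple[float, float]],
                  total_deg: float = 360.0, tol: float = 1e-9) -> bool:
    """Check that regions are well-formed, non-overlapping, and sum to total_deg."""
    ordered = sorted(regions)
    # Every region must have positive width.
    if any(end <= start for start, end in ordered):
        return False
    # No overlap: each region must start at or after the previous one ends.
    for (_, prev_end), (start, _) in zip(ordered, ordered[1:]):
        if start < prev_end - tol:
            return False
    # The horizontal angles must add up to the total horizontal viewing angle.
    return abs(sum(end - start for start, end in ordered) - total_deg) <= tol


print(is_valid_rule([(0, 120), (120, 240), (240, 360)]))  # True
print(is_valid_rule([(0, 200), (150, 360)]))              # False (overlap)
print(is_valid_rule([(0, 100), (100, 300)]))              # False (sums to 300)
```

Regions of unequal width pass the check as long as both constraints hold, for example a split at 20° and 175°.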
[0026] A system for encoding and decoding panoramic video is also provided, including a panoramic camera, a communication unit and user equipment. The panoramic camera includes a camera group, composed of cameras that capture video in each orientation, and an image data buffer unit that stores the panoramic video captured by the camera group. The image data buffer unit includes an encoder for encoding the panoramic image frames of the panoramic video of a predetermined viewing angle.
[0027] The user equipment includes a positioning device for collecting the user's head motion trend data, a pre-processing unit for determining in advance the length of the time slot for which the next panoramic image frame is displayed on the user equipment, and a decoder for decoding the encoded panoramic image frames.
[0028] The panoramic camera captures video through cameras of different orientations and sends the captured panoramic video to the image data buffer unit. The encoder in the image data buffer unit divides the panoramic video into a plurality of viewing-angle video sequences according to the viewing angle division rule and encodes each sequence to obtain a plurality of sets of encoded data. Using the motion trend data of the user's head collected by the positioning device, the corresponding panoramic encoded data is sent to the user equipment through the communication unit. The pre-processing unit in the user equipment determines, according to the motion trend data of the user's head collected by the positioning device, the display time slot of the next partial-range panoramic image frame and sends a control instruction to the encoder and the decoder. The encoder and the decoder then encode and decode the panoramic image frames of the specific viewing angle according to the motion trend data and the control instruction.
Advantageous Effects of the Invention
[0029] The method and system for encoding and decoding panoramic video according to the present invention have the following advantageous effects. The encoder in the image data buffer unit divides the panoramic video into multiple view video sequences according to the viewing-angle division rule and encodes each view video sequence to obtain multiple sets of encoded data, while the positioning device collects the motion trend data of the user's head in real time. The user's motion trend can therefore be determined accurately, and the preprocessing unit ensures that only the panoramic video corresponding to the effective field of view (FOV) in front of the current viewing angle is decoded in real time, which reduces decoding overhead.
Brief Description of the Drawings
[0030] FIG. 1 is a logic diagram of an embodiment of a system for encoding and decoding panoramic video according to the present invention;
[0031] FIG. 2 is a logic diagram of an embodiment of the panoramic camera of the system for encoding and decoding panoramic video according to the present invention;
[0032] FIG. 3 is a logic diagram of the user equipment of the system for encoding and decoding panoramic video according to the present invention;
[0033] FIG. 4 is a flowchart of a method for encoding and decoding panoramic video according to the present invention;
[0034] FIG. 5 is a flowchart of step S3 in an embodiment of the method for encoding and decoding panoramic video according to the present invention;
[0035] FIG. 6 is a flowchart of step S4 in an embodiment of the method for encoding and decoding panoramic video according to the present invention;
[0036] FIG. 7 is a flowchart of step S41 in an embodiment of the method for encoding and decoding panoramic video according to the present invention.
Embodiments of the Invention
[0037] The present invention is further explained below with reference to the accompanying drawings and embodiments.
[0038] FIG. 1 is a logic diagram of an embodiment of the system for encoding and decoding panoramic video according to the present invention, FIG. 2 is a logic diagram of an embodiment of the panoramic camera of that system, and FIG. 3 is a logic diagram of the user equipment of that system. As shown in FIGS. 1-3, the system 100 of the present invention includes a panoramic camera 110, a communication unit 120, and a user equipment 130. The panoramic camera 110 includes a camera group 111 made up of cameras that capture video in various orientations, and an image data buffer unit 112 that stores the panoramic video captured by the camera group. The image data buffer unit 112 includes an encoder 112a for encoding the panoramic image frames of the panoramic video at a predetermined viewing angle. The user equipment 130 includes a positioning device 131 for collecting motion trend data of the user's head, a preprocessing unit 132 for determining in advance the length of the time slot during which the next panoramic image frame is displayed on the user equipment, and a decoder 133 for decoding the encoded panoramic image frames. In the prior art, the encoder obtains all panoramic image frames of the panoramic video and encodes every one of them, either according to the coding rules defined by the H.264/AVC video coding standard or according to those defined by the High Efficiency Video Coding standard H.265. The decoder in the user equipment decodes all of the received encoded panoramic image frames to form a complete video for the user to watch. In practice, however, the user's field of view is usually about 150°, so only the encoded data covering about 150° of each panoramic image frame is useful. The remaining encoded data is not, and receiving, encoding, and decoding it incurs unnecessary overhead.
[0039] The technical solution proposed by the present invention is as follows. A positioning device 131 is provided in the user equipment 130. The positioning device 131 contains a gravity sensor and a gyroscope sensor that detect the motion trend data of the user's head, specifically the parameters of the six degrees of freedom of head motion (three spatial coordinates and three angular coordinates), so that the motion of the user's head is tracked accurately. The encoder 112a divides each panoramic image frame of the panoramic video into A regions according to a preset viewing-angle division rule, determines the effective view video sequence among the A regions from the head motion trend data sent by the positioning device 131, and encodes that sequence. A preprocessing unit 132 is provided in the user equipment. It uses the motion trend data of the user's head collected by the positioning device 131 to determine the display time slot of the next partial-range panoramic image frame and sends a control instruction to the encoder 112a and the decoder 133. The decoder 133 decodes the received panoramic image frames of the effective view video and presents the video to the user.
[0040] The technical solution proposed by the present invention effectively solves two technical problems: 1. the positioning device 131 and the encoder's viewing-angle division rule ensure that encoding and decoding are performed in real time only on the panoramic video corresponding to the effective field of view (FOV) in front of the current viewing angle; 2. the positioning device 131 and the preprocessing unit 132 make it possible to determine the head motion trend accurately and to select the video frames to be encoded in advance.
[0041] Specifically, the panoramic camera 110 captures video through the camera group 111, whose cameras face different orientations, and sends the captured panoramic video to the image data buffer unit 112. The encoder 112a in the image data buffer unit 112 divides the panoramic video into multiple view video sequences according to the viewing-angle division rule and encodes each view video sequence to obtain multiple sets of encoded data. Based on the motion trend data of the user's head collected by the positioning device 131, the corresponding encoded panoramic data is sent to the user equipment 130 through the communication unit 120. The preprocessing unit 132 in the user equipment 130 uses the motion trend data collected by the positioning device 131 to determine the display time slot of the next partial-range panoramic image frame and sends a control instruction to the encoder 112a and the decoder 133, which encode and decode the panoramic image frames of the specific viewing angle according to the motion trend data and the control instruction.
[0042] FIG. 4 is a flowchart of a method for encoding and decoding panoramic video according to the present invention, FIG. 5 is a flowchart of step S3 in an embodiment of the method, FIG. 6 is a flowchart of step S4 in an embodiment of the method, and FIG. 7 is a flowchart of step S41 in an embodiment of the method.
[0043] Referring to FIGS. 4-7, the method provided by the present invention for dynamically and selectively encoding and decoding panoramic video based on the real-time viewing angle includes the following steps:
[0044] S1. After capturing the panoramic video, the panoramic camera sends it to the image data buffer unit 112. The camera group 111 of the panoramic camera may be distributed on the mechanism that fastens the head-mounted display device around the head, or it may be separate from the user's VR display device. The panoramic video captured by the camera group 111 is stored in the image data buffer unit 112;
[0045] S2. The image data buffer unit 112, through the encoder 112a, divides the panoramic video into multiple view video sequences according to the viewing-angle division rule. The image data buffer unit 112 passes the received panoramic video to the encoder 112a for processing;
[0046] S3. Each view video sequence is encoded to obtain multiple sets of encoded data. The encoder 112a encodes each panoramic image frame of the panoramic video;
[0047] S4. The encoded data corresponding to the user's viewing angle is sent to the user equipment 130, which decodes it with the decoder 133 and plays the decoded view video sequence.
[0048] Specifically, the viewing-angle division rule instructs the encoder 112a how to divide a panoramic image frame into regions by horizontal viewing angle, and the same rule applies to all panoramic image frames in a panoramic video. The horizontal viewing angles of the A view video sequences sum to the total horizontal viewing angle of the panoramic image frame, and the regions within one panoramic image frame do not overlap one another. For example, when the total horizontal viewing angle of the panoramic image frame is 360°, the horizontal viewing angles of the A view video sequences sum to 360°. The horizontal ranges of the regions in a panoramic image frame may be equal or unequal, without limitation. If the rule divides the frame evenly into four horizontal viewing angles, each spans 90°: 0°~90°, 90°~180°, 180°~270°, and 270°~360°. Alternatively, in an implementation, the division rule can be derived from the viewing angle the user commonly uses, to improve the accuracy of the region division. For example, if the user's usual viewing angle is 20°~175°, the rule takes 0°~20° as one horizontal viewing angle, 20°~175° as another, and 175°~360° as a third.
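As a minimal sketch of the division rule in paragraph [0048], assuming each region is represented as a (start, end) pair of horizontal angles in degrees, the two examples above (four equal 90° regions, and cuts at 20° and 175°) could be built and checked as follows. The function names `equal_regions`, `regions_from_cuts`, and `is_valid_division` are hypothetical and do not come from the patent.

```python
import math


def equal_regions(a, total=360.0):
    """Split [0, total) into `a` equal horizontal regions, as (start, end) pairs."""
    if a < 1:
        raise ValueError("a must be a positive integer")
    width = total / a
    return [(i * width, (i + 1) * width) for i in range(a)]


def regions_from_cuts(cuts, total=360.0):
    """Build regions from interior cut angles, e.g. [20, 175] for a 360-degree frame."""
    edges = [0.0] + sorted(float(c) for c in cuts) + [float(total)]
    if any(lo >= hi for lo, hi in zip(edges, edges[1:])):
        raise ValueError("cut angles must be distinct and lie strictly inside (0, total)")
    return list(zip(edges[:-1], edges[1:]))


def is_valid_division(regions, total=360.0):
    """True if the regions do not overlap and their widths sum to the total angle."""
    ordered = sorted(regions)
    no_overlap = all(prev[1] <= nxt[0] for prev, nxt in zip(ordered, ordered[1:]))
    width_sum = sum(end - start for start, end in ordered)
    return no_overlap and math.isclose(width_sum, total)


print(equal_regions(4))              # four 90-degree regions
print(regions_from_cuts([20, 175]))  # regions 0-20, 20-175, 175-360
```

The validity check mirrors the two constraints stated in the text: the regions must not overlap, and their horizontal angles must add up to the total horizontal viewing angle.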
[0049] Preferably, step S3 includes the following sub-steps: S31. The panoramic camera captures video through cameras in different orientations; S32. The image frames captured by all cameras at the same moment are stitched into a panoramic image frame; S33. The panoramic image frames make up the panoramic video, which is provided to the encoder 112a.
[0050] When encoding, the region number of each viewing-angle range must be determined for every image frame. For example, suppose the viewing-angle division rule divides the frame evenly into three horizontal viewing angles. For the first panoramic image frame, the region with horizontal viewing angle 0°~120° is view region 1, the region 120°~240° is view region 2, and the region 240°~360° is view region 3. For the second panoramic image frame, the region 0°~120° is again view region 1, the region 120°~240° is view region 2, and the region 240°~360° is view region 3. The same holds, by analogy, for the N-th panoramic image frame.
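The region numbering in paragraph [0050] can be illustrated with a short sketch. It assumes the three-way division from the example (cuts at 120° and 240°) and, as an additional assumption not stated in the patent, treats each region as half-open so that a boundary angle such as 120° belongs to the higher-numbered region. The names `view_region_number` and `frame_layout` are hypothetical.

```python
import bisect


def view_region_number(angle_deg, cuts=(120.0, 240.0), total=360.0):
    """Return the 1-based view-region number that contains a horizontal angle."""
    angle = angle_deg % total  # fold any angle into [0, total)
    return bisect.bisect_right(cuts, angle) + 1


def frame_layout(cuts=(120.0, 240.0), total=360.0):
    """Return (region_number, start, end) triples; the layout is the same for every frame."""
    edges = [0.0, *cuts, total]
    return [(n, lo, hi) for n, (lo, hi) in enumerate(zip(edges, edges[1:]), start=1)]


print(frame_layout())
print(view_region_number(45.0), view_region_number(130.0), view_region_number(300.0))
```

Because the layout does not depend on the frame index, the same table applies to the first frame, the second frame, and the N-th frame, which is the point the paragraph makes by enumeration.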
[0051] Preferably, step S4 includes the following sub-steps: S41. The positioning device 131 in the user equipment 130 collects the user's head motion trend; S42. The encoder 112a uses the head motion trend data to determine the effective view video sequence corresponding to the predetermined regions among the A regions; S43. The preprocessing unit 132 uses the head motion trend data collected by the positioning device 131 to determine the length of the display time slot of the next panoramic image frame.
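Sub-step S42 can be sketched as an interval-overlap test on a circle. The sketch below is an illustration under stated assumptions, not the patent's procedure: it assumes the head motion data has already been reduced to a single horizontal gaze angle (`yaw_deg`), uses the roughly 150° field of view mentioned in paragraph [0038] as a default, and represents regions as (start, end) pairs in degrees. The function name `visible_regions` is hypothetical.

```python
def visible_regions(yaw_deg, regions, fov_deg=150.0, total=360.0):
    """Return the 1-based numbers of the regions that overlap the field of view.

    The field of view is the arc of width `fov_deg` centred on `yaw_deg`.
    """
    if not 0.0 < fov_deg <= total:
        raise ValueError("fov_deg must be in (0, total]")
    lo = (yaw_deg - fov_deg / 2.0) % total  # left edge, folded into [0, total)
    hi = lo + fov_deg                       # right edge, may exceed total
    selected = []
    for number, (start, end) in enumerate(regions, start=1):
        direct = start < hi and lo < end    # overlap without crossing the seam
        wrapped = start + total < hi        # overlap after the arc crosses 0 degrees
        if direct or wrapped:
            selected.append(number)
    return selected


quarters = [(0.0, 90.0), (90.0, 180.0), (180.0, 270.0), (270.0, 360.0)]
print(visible_regions(90.0, quarters))  # gaze at 90 degrees
print(visible_regions(0.0, quarters))   # gaze at the 0/360-degree seam
```

With four 90° regions, a gaze centred at 90° touches only regions 1 and 2, so only those two sequences would need to be encoded, sent, and decoded. A gaze at 0° straddles the seam and selects regions 1 and 4.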
[0052] It should be noted that the time slot proposed in the present invention is not the drive scanning time needed to display one full-screen picture, as the term is commonly understood in the display field. Here, the length of the time slot is determined by the user equipment 130 from the motion data that the positioning device 131 monitors and collects in real time. For example, when the user looks at the partial panoramic image on the screen from one fixed viewing direction for a period of time, encoding and decoding present one fixed horizontal viewing angle according to the preset viewing-angle division rule. In a more active interactive scene, where the user perceives the panoramic video in a constantly changing way, the encoder 112a and the decoder 133 likewise present the images corresponding to the changing horizontal viewing angles.
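The patent does not give a formula for the time slot, so the following is only a hypothetical heuristic that matches the qualitative behaviour described in paragraph [0052]: a steady gaze yields a long slot and fast head rotation yields a short one. It assumes the slot is the time the gaze needs to sweep an angular margin (`margin_deg`) at the current yaw rate, clamped between a minimum and a maximum. All names and default values are assumptions.

```python
def display_time_slot(yaw_rate_deg_s, margin_deg, min_slot_s=1.0 / 30.0, max_slot_s=2.0):
    """Estimate how long (in seconds) the current view can be shown.

    Hypothetical heuristic: the time needed to rotate through `margin_deg`
    at the current yaw rate, clamped to [min_slot_s, max_slot_s].
    """
    if margin_deg < 0.0:
        raise ValueError("margin_deg must be non-negative")
    speed = abs(yaw_rate_deg_s)
    if speed == 0.0:
        return max_slot_s  # head is still: keep the same view as long as allowed
    return min(max_slot_s, max(min_slot_s, margin_deg / speed))


print(display_time_slot(0.0, 15.0))   # steady gaze
print(display_time_slot(30.0, 15.0))  # moderate rotation
```

A real system would likely also account for network latency and encoder delay. Those factors are outside what the text describes and are omitted here.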
[0053] Step S41 includes the following sub-steps: S411. A reference position point is determined, and this point is associated with a specific position and direction of the user's field of view in the VR environment; S412. Every movement of the user's head in every direction from the reference position, and every rotation, is tracked in order to determine how the panoramic image frames should be displayed so that the VR environment remains continuous and realistic; S413. The user equipment 130 sends the collected motion trend data to the panoramic camera.
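Sub-steps S411 and S412 can be sketched as computing the head pose relative to a stored reference pose. The sketch assumes a pose holds the six degrees of freedom named in the description (three spatial coordinates and three angles, here in degrees) and that angular differences are wrapped into [-180°, 180°) so that a turn from 350° to 10° reads as +20° and not -340°. The `Pose` type and the function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pose:
    """Six degrees of freedom: position (x, y, z) and orientation (yaw, pitch, roll)."""
    x: float
    y: float
    z: float
    yaw: float
    pitch: float
    roll: float


def wrap_angle(deg):
    """Fold an angle difference into the range [-180, 180)."""
    return (deg + 180.0) % 360.0 - 180.0


def relative_pose(reference, current):
    """Return the motion of `current` measured from the `reference` pose."""
    return Pose(
        current.x - reference.x,
        current.y - reference.y,
        current.z - reference.z,
        wrap_angle(current.yaw - reference.yaw),
        wrap_angle(current.pitch - reference.pitch),
        wrap_angle(current.roll - reference.roll),
    )


reference = Pose(0.0, 0.0, 0.0, 350.0, 0.0, 0.0)
current = Pose(0.0, 0.0, 0.0, 10.0, 5.0, 0.0)
print(relative_pose(reference, current))
```

The relative yaw is the quantity that sub-step S413 would send to the camera side, where it can drive the region selection.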
[0054] In this embodiment, the gyroscope sensor and the gravity sensor in the positioning device 131 collect the latest parameter data for the six degrees of freedom (three spatial coordinates and three angular coordinates), which ensures that the field of view shown to the user through the graphics data is consistent with the field of view that should actually be shown.
[0055] The method and system for encoding and decoding panoramic video according to the present invention have the following advantageous effects. The encoder 112a in the image data buffer unit 112 divides the panoramic video into multiple view video sequences according to the viewing-angle division rule and encodes each view video sequence to obtain multiple sets of encoded data, while the positioning device 131 collects the motion trend data of the user's head in real time. The user's motion trend can therefore be determined accurately, and the preprocessing unit 132 ensures that only the panoramic video corresponding to the effective field of view (FOV) in front of the current viewing angle is decoded in real time, which reduces decoding overhead.
[0056] The above are only preferred embodiments of the present invention and are not intended to limit it. Those skilled in the art may make various modifications and changes to the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of its claims.

Claims

[Claim 1] A method for encoding and decoding panoramic video, using a panoramic camera (110), a communication unit (120), a user equipment (130), a camera group (111), an image data buffer unit (112), an encoder (112a), a positioning device (131), a preprocessing unit (132), and a decoder (133), characterized by comprising the following steps:
S1. sending the panoramic video captured by the panoramic camera (110) to the image data buffer unit (112);
S2. the image data buffer unit (112), through the encoder (112a), dividing each panoramic image frame of the panoramic video into A regions by horizontal viewing angle according to a viewing-angle division rule, A being an integer, and dividing the panoramic video into multiple effective view video sequences according to the head motion trend;
S3. the encoder (112a) encoding the effective view video sequences, which are composed of panoramic image frames of specific viewing angles, to obtain multiple sets of encoded data;
S4. sending the encoded data corresponding to the user's viewing angle to the user equipment (130) through the communication unit (120), the user equipment (130) decoding the encoded data with the decoder (133) and playing the decoded view video sequence.
[Claim 2] The method for encoding and decoding panoramic video according to claim 1, characterized in that step S3 comprises the following sub-steps:
S31. the panoramic camera (110) capturing video through cameras in different orientations;
S32. stitching the image frames captured by the camera group (111) at the same moment into a panoramic image frame;
S33. according to the head motion trend data and the viewing-angle division rule, composing an effective panoramic video from the panoramic image frames of the specific viewing angle and providing it to the encoder (112a).
[Claim 3] The method for encoding and decoding panoramic video according to claim 1, characterized in that step S4 comprises the following sub-steps:
S41. collecting the user's head motion trend with the positioning device in the user equipment (130);
S42. the encoder (112a) determining, from the head motion trend data, the effective view video sequence corresponding to the predetermined regions among the A regions;
S43. the preprocessing unit (132) determining, from the user's head motion trend data collected by the positioning device (131), the length of the display time slot of the next panoramic image frame.
[Claim 4] The method for encoding and decoding panoramic video according to claim 3, characterized in that step S41 comprises the following sub-steps:
S411. determining a reference position point, the point being associated with a specific position and direction of the user's field of view in the VR environment;
S412. tracking every movement of the user's head in every direction from the reference position, and every rotation, to determine how to display the panoramic image frames so that the VR environment remains continuous and realistic;
S413. the user equipment (130) sending the collected motion trend data to the panoramic camera.
[Claim 5] The method for encoding and decoding panoramic video according to claim 1, characterized in that the motion trend data includes the angle and amplitude of the detected rotation of the user's head.
[Claim 6] The method for encoding and decoding panoramic video according to claim 1, characterized in that the time slot is the duration for which the panoramic image of a given viewing angle is displayed, and is determined by the motion trend data of the user's head collected by the preprocessing unit.
[Claim 7] The method for encoding and decoding panoramic video according to claim 1, characterized in that the same viewing-angle division rule applies to all image frames in a panoramic video, the horizontal viewing angles of the A regional view video sequences sum to the total horizontal viewing angle of the panoramic image frame, and the regions within one panoramic image frame do not overlap one another.
[Claim 8] A system for encoding and decoding panoramic video, characterized by comprising a panoramic camera (110), a communication unit (120), and a user equipment (130); the panoramic camera (110) includes a camera group (111) made up of cameras that capture video in various orientations, and an image data buffer unit (112) that stores the panoramic video captured by the camera group; the image data buffer unit (112) includes an encoder (112a) for encoding the panoramic image frames of the panoramic video at a predetermined viewing angle.
[Claim 9] The system for encoding and decoding panoramic video according to claim 8, characterized in that the user equipment (130) includes a positioning device (131) for collecting motion trend data of the user's head, a preprocessing unit (132) for determining in advance the length of the time slot during which the next panoramic image frame is displayed on the user equipment, and a decoder (133) for decoding the encoded panoramic image frames.
[Claim 10] The system for encoding and decoding panoramic video according to claim 9, characterized in that the panoramic camera captures video through cameras in different orientations and sends the captured panoramic video to the image data buffer unit (112); the encoder (112a) in the image data buffer unit (112) divides the panoramic video into multiple view video sequences according to the viewing-angle division rule and encodes each view video sequence to obtain multiple sets of encoded data; based on the motion trend data of the user's head collected by the positioning device (131), the corresponding encoded panoramic data is sent to the user equipment (130) through the communication unit (120); the preprocessing unit (132) in the user equipment (130) uses the motion trend data of the user's head collected by the positioning device (131) to determine the display time slot of the next partial-range panoramic image frame and sends a control instruction to the encoder (112a) and the decoder (133); and the encoder (112a) and the decoder (133) encode and decode the panoramic image frames of the specific viewing angle according to the motion trend data and the control instruction.
PCT/CN2017/073979 2016-08-23 2017-02-17 Method and system for encoding/decoding panoramic video WO2018036112A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610708955.1A CN108322763A (en) 2016-08-23 2016-08-23 A kind of method and system of encoding and decoding panoramic video
CN201610708955.1 2016-08-23

Publications (1)

Publication Number Publication Date
WO2018036112A1 true WO2018036112A1 (en) 2018-03-01

Family

ID=61246073

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/073979 WO2018036112A1 (en) 2016-08-23 2017-02-17 Method and system for encoding/decoding panoramic video

Country Status (2)

Country Link
CN (1) CN108322763A (en)
WO (1) WO2018036112A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108419268A (en) * 2017-02-09 2018-08-17 中国移动通信有限公司研究院 A kind of virtual reality method for processing business and wireless access network element device

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110769260A (en) * 2018-07-27 2020-02-07 晨星半导体股份有限公司 Video decoding device and video decoding method
CN112218158B (en) * 2019-07-12 2021-12-28 华为技术有限公司 Video processing method and device
CN111208966B (en) * 2019-12-31 2021-07-16 华为技术有限公司 Display method and device
CN111698520A (en) * 2020-06-24 2020-09-22 北京奇艺世纪科技有限公司 Multi-view video playing method, device, terminal and storage medium
CN112188219B (en) * 2020-09-29 2022-12-06 北京达佳互联信息技术有限公司 Video receiving method and device and video transmitting method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012090061A1 (en) * 2010-12-30 2012-07-05 Advanced Digital Broadcast S.A. Coding and decoding of multiview videos
CN104735464A (en) * 2015-03-31 2015-06-24 华为技术有限公司 Panorama video interactive transmission method, server and client end
CN105072393A (en) * 2015-07-31 2015-11-18 深圳英飞拓科技股份有限公司 Multi-lens panoramic network camera and method
CN105704501A (en) * 2016-02-06 2016-06-22 普宙飞行器科技(深圳)有限公司 Unmanned plane panorama video-based virtual reality live broadcast system
CN105791882A (en) * 2016-03-22 2016-07-20 腾讯科技(深圳)有限公司 Video coding method and device

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR3004565B1 (en) * 2013-04-12 2016-11-11 Kolor FUSION OF SEVERAL VIDEO STREAMS
US9063330B2 (en) * 2013-05-30 2015-06-23 Oculus Vr, Llc Perception based predictive tracking for head mounted displays
CN103561261B (en) * 2013-10-12 2016-10-26 重庆邮电大学 Panoramic locatable video coding method based on visual attention
CN104539929B (en) * 2015-01-20 2016-12-07 深圳威阿科技有限公司 Stereo-image coding method and code device with motion prediction
CN105323552B (en) * 2015-10-26 2019-03-12 北京时代拓灵科技有限公司 Panoramic video playback method and system
CN105323503B (en) * 2015-11-02 2019-07-09 Tcl集团股份有限公司 Panoramic video transmission method and system
CN105869215B (en) * 2016-03-28 2019-03-12 上海米影信息科技有限公司 Virtual reality imaging system

Also Published As

Publication number Publication date
CN108322763A (en) 2018-07-24

Similar Documents

Publication Publication Date Title
WO2018036112A1 (en) Method and system for encoding/decoding panoramic video
US11924394B2 (en) Methods and apparatus for receiving and/or using reduced resolution images
US11575876B2 (en) Stereo viewing
EP3065049A2 (en) Interactive video display method, device, and system
US11218683B2 (en) Method and an apparatus and a computer program product for adaptive streaming
CN106658011A (en) Panoramic video coding and decoding methods and devices
KR20170008725A (en) Methods and apparatus for streaming content
US9654762B2 (en) Apparatus and method for stereoscopic video with motion sensors
EP3434021B1 (en) Method, apparatus and stream of formatting an immersive video for legacy and immersive rendering devices
WO2014094537A1 (en) Immersion communication client and server, and method for obtaining content view
US10404964B2 (en) Method for processing media content and technical equipment for the same
US20230328329A1 (en) User-chosen, object guided region of interest (roi) enabled digital video
US20200252585A1 (en) Systems, Algorithms, and Designs for See-through Experiences With Wide-Angle Cameras
KR20200076529A (en) Indexing of tiles for region of interest in virtual reality video streaming
WO2017220851A1 (en) Image compression method and technical equipment for the same
CN117041518A (en) 3D projection system and 3D video communication system
TW201822536A (en) Viewing angle control system and method for playing panoramic image through set-top box capable of dynamically presenting a predictive picture that rotates according to the rotating speed of the viewing angle

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application
Ref document number: 17842538
Country of ref document: EP
Kind code of ref document: A1

NENP Non-entry into the national phase
Ref country code: DE

122 EP: PCT application non-entry in European phase
Ref document number: 17842538
Country of ref document: EP
Kind code of ref document: A1