TWI637356B - Method and apparatus for mapping omnidirectional image to a layout output format - Google Patents

Method and apparatus for mapping omnidirectional image to a layout output format Download PDF

Info

Publication number
TWI637356B
TWI637356B TW106121387A TW106121387A TWI637356B TW I637356 B TWI637356 B TW I637356B TW 106121387 A TW106121387 A TW 106121387A TW 106121387 A TW106121387 A TW 106121387A TW I637356 B TWI637356 B TW I637356B
Authority
TW
Taiwan
Prior art keywords
layout
current
omnidirectional image
cubemap
format
Prior art date
Application number
TW106121387A
Other languages
Chinese (zh)
Other versions
TW201802771A (en
Inventor
林建良
林鴻志
李佳盈
張勝凱
Original Assignee
聯發科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US15/585,224 external-priority patent/US10249019B2/en
Application filed by 聯發科技股份有限公司 filed Critical 聯發科技股份有限公司
Publication of TW201802771A publication Critical patent/TW201802771A/en
Application granted granted Critical
Publication of TWI637356B publication Critical patent/TWI637356B/en

Links

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

公開了處理全向圖像的方法和裝置。根據一種方法,接收使用所選擇的投影格式從360度全景視訊序列中的每個球面圖像轉換的當前一組全向圖像,其中所選擇的投影格式屬於包括立方面格式的投影格式組,以及具有該立方面格式的該當前全方向圖像集合由六個立方面組成。如果該所選擇的投影格式對應於該立方面格式,將該當前全方向圖像集合映射到屬於輸出佈局格式集合的當前立方體貼圖圖像的一個或多個映射語法元素被標識。提供位元流中的編解碼資料,所述位元流包括用於該當前全方向圖像集合的該一個或多個映射語法元素。 Methods and apparatus for processing omnidirectional images are disclosed. According to one method, a current set of omnidirectional images converted from each of the 360 degree panoramic video sequences using the selected projection format is received, wherein the selected projection format belongs to a projection format group including an aspect format, And the current omnidirectional image set having the format of the aspect is composed of six aspects. If the selected projection format corresponds to the aspect format, one or more mapping syntax elements that map the current omnidirectional image collection to the current cubemap image belonging to the output layout format set are identified. A codec material is provided in the bitstream, the bitstream including the one or more mapping syntax elements for the current omnidirectional image set.

Description

將全向圖像映射到佈局輸出格式的方法和裝置 Method and apparatus for mapping an omnidirectional image to a layout output format 【交叉引用】【cross reference】

本申請要求2016.5.6提交的美國臨時申請號為62/332,505的美國臨時申請案和2016.6.30提交的美國臨時申請號為62/356,571的美國臨時申請案的優先權,其全部內容也一併引用於此。 The present application claims priority to U.S. Provisional Application No. 62/332,505, filed on Jan. 6, s., and U.S. Provisional Application Serial No. 62/356,571, filed on Jun. Quoted here.

本發明涉及360度視訊。特別地,本發明涉及使用所選擇的投影格式將360度全景視訊序列中的每個球面圖像(spherical image)轉換的一組全向圖像(omnidirectional image)映射成輸出格式。 The present invention relates to 360 degree video. In particular, the present invention relates to mapping an omnidirectional image converted from each spherical image in a 360 degree panoramic video sequence into an output format using the selected projection format.

數位視訊編碼標準的進步導致了過去十年來諸如智能電話、數位電視和數位照相機等多媒體系統的成功。在H.261、MPEG-1、MPEG-2、H.263、MPEG-4和H.264/AVC的標準化活動之後,由於需要更大的圖像分辨率、更高的訊框率(frame rate)、更好的視訊品質,對視頻壓縮性能的提高的需求依然強勁。因此,開發新的,比H.264/AVC更好的編解碼效率的視訊編解碼技術一直都沒有結束。HEVC基於基於混合 塊的運動補償變換編解碼架構。 Advances in digital video coding standards have led to the success of multimedia systems such as smart phones, digital televisions and digital cameras over the past decade. After the standardization activities of H.261, MPEG-1, MPEG-2, H.263, MPEG-4 and H.264/AVC, due to the need for larger image resolution and higher frame rate (frame rate) ), better video quality, the demand for improved video compression performance is still strong. Therefore, the development of new video codec technology with better codec efficiency than H.264/AVC has not ended. HEVC is based on mixing The motion compensation transform codec architecture of the block.

三維(3D)電視機近年來一直是技術趨勢,旨在使觀眾聳人聽聞的觀看體驗。已經開發出各種技術來實現3D觀看,而多視角視訊是3D電視應用的關鍵技術之一。例如,視訊可以是二維(2D)介質,其僅從攝像機的角度向觀看者提供場景的單一視圖。然而,多視圖視訊能夠提供動態場景的任意觀點(viewpoint),並為觀眾提供現實感的感覺。3D視訊格式還可以包括與相應紋理圖片(texture picture)相關聯的深度圖(depth map)。深度圖也必須進行編解碼,以呈現三維視圖或多視圖。由於編解碼多視圖資料,較大圖像分辨率和更好質量的要求,提高了3D和多視圖視訊編碼效率的強烈需求,提出了各種技術。 Three-dimensional (3D) televisions have been a technology trend in recent years, aiming to make the audience a sensational viewing experience. Various technologies have been developed to achieve 3D viewing, and multi-view video is one of the key technologies for 3D TV applications. For example, the video can be a two-dimensional (2D) medium that provides a single view of the scene to the viewer only from the perspective of the camera. However, multi-view video can provide any viewpoint of a dynamic scene and provide a realistic feeling to the viewer. The 3D video format may also include a depth map associated with the corresponding texture picture. The depth map must also be coded to render a 3D view or multiple views. Due to the strong need for encoding and decoding multi-view data, large image resolution and better quality, and increasing the efficiency of 3D and multi-view video coding efficiency, various techniques have been proposed.

作為對HEVC和下一代3D視訊編解碼標準的擴展,3D-HEVC視頻編解碼標準的標準化由3D視訊編解碼擴展開發(Joint Collaborative Team on 3D video coding Extension Development,簡寫為JCT-3V)聯合協作小組於2012年7月份正式推出,並於2015年2月舉行的第11屆JCT-3V會議後確定。為了更實際地支持自動立體多視圖顯示,多視圖視訊加深度(multi-view video plus depth,簡寫為MVD)格式被引入為新的3D視訊格式,用於3D-HEVC。MVD格式由紋理圖片及其相關的深度圖組成。與表示物體的亮度和色度訊息的紋理圖像不同,深度圖是包含與攝像機拍攝平面的物體的距離相關的訊息的圖像,並且通常用於作為非視覺訊息(non-visual information)的虛擬視圖呈現。 As an extension of HEVC and next-generation 3D video codec standards, the standardization of 3D-HEVC video codec standards is developed by the Joint Collaborative Team on 3D video coding extension development (JCT-3V) joint collaboration group. It was officially launched in July 2012 and was confirmed after the 11th JCT-3V meeting in February 2015. In order to more practically support autostereoscopic multi-view display, a multi-view video plus depth (MVD) format is introduced as a new 3D video format for 3D-HEVC. The MVD format consists of a texture image and its associated depth map. Unlike texture images that represent the brightness and chrominance information of an object, a depth map is an image of a message that contains the distance from the object of the camera's shooting plane, and is typically used as a virtual non-visual information. View rendering.

具有頭戴式顯示器(head-mounted display,簡寫為HMD)的虛擬現實(Virtual reality,簡寫為VR)與各種應用相關聯。可以使用向用戶顯示寬視野內容的能力來提供沉浸式的視覺體驗。必須在所有方向上捕獲真實世界的環境,從而產生對應於觀察球(viewing sphere)的全方位視訊。隨著相機托架(camera rigs)和HMD的進步,由于代表此類內容所需的高比特率,VR內容的傳遞很快將成為瓶頸。由於全向視訊通常具有4K或更高的分辨率,壓縮對於降低比特率至關重要。提供的全向視訊是等距長方投影(equirectangular projection)。圖1示出了等距長方投影格式的全向視訊(稱為“Hangpai_2”)的圖像的示例。當原始圖像為全彩色時,黑白版本如圖1所示(因為黑白圖像足以說明本發明)。 Virtual reality (VR) with a head-mounted display (HMD) is associated with various applications. The ability to display wide field of view content to the user can be used to provide an immersive visual experience. The real world environment must be captured in all directions to produce omnidirectional video corresponding to the viewing sphere. With the advancement of camera rigs and HMDs, the delivery of VR content will quickly become a bottleneck due to the high bit rate required to represent such content. Since omnidirectional video usually has a resolution of 4K or higher, compression is critical to reducing the bit rate. The omnidirectional video provided is an equitctangular projection. FIG. 1 shows an example of an image of an omnidirectional video (referred to as "Hangpai_2") of an equidistant rectangular projection format. When the original image is full color, the black and white version is shown in Figure 1 (because the black and white image is sufficient to illustrate the invention).

等距長方投影格式可以轉換為例如第2a圖到第2k圖中所示的不同格式:(a)立方體貼圖(cubemap)、(b)Cubemap_32、(c)Cubemap_180、(d)Plane_poles、(e)P1ane_poles_6、(f)Plane_poles_cubemap、(g)Plane_cubemap、(h)Plane_cubemap_32、(i)Flat_fixed、(j)180degree 3D視訊(即,180-3D)以及(k)圓柱圖/圓柱形(Cylindermap/Cylindrical)。第2a圖到第2i圖中的圖像基於第1圖的圖像。 The equidistant rectangular projection format can be converted to different formats such as shown in Figures 2a through 2k: (a) cubemap, (b) Cubemap_32, (c) Cubemap_180, (d) Plane_poles, (e ) P1ane_poles_6, (f) Plane_poles_cubemap, (g) Plane_cubemap, (h) Plane_cubemap_32, (i) Flat_fixed, (j) 180degree 3D video (ie, 180-3D), and (k) Cylindermap/Cylindrical . The images in FIGS. 2a to 2i are based on the image of FIG. 1.

第3圖示出了使用投影轉換(projection conversion)310將等距長方投影格式轉換為立方格式(cubic format)的示例,其中標記為1至6的圖像對應於用於表示360度視訊的立方體的六個面上的圖像。第4圖中示出了四個常用佈 局(即1x6-佈局410,2×3-佈局420,3×2-佈局430和6×1-佈局440)。在每個佈局中,來自6個面的圖像被組裝成一個單個矩形圖像(rectangular image)。第5圖示出了等距長方格式和立方格式之間的幾何比較。第5圖中示出了等距長方幾何510和立方幾何520。圖像512是等距長方格式的示例,圖像522是立方格式的示例。 FIG. 3 shows an example of converting an equidistant rectangular projection format into a cubic format using a projection conversion 310, wherein the images labeled 1 to 6 correspond to those for representing 360-degree video. An image of the six faces of the cube. Four common cloths are shown in Figure 4. Bureau (ie 1x6-layout 410, 2x3-layout 420, 3x2-layout 430 and 6x1-layout 440). In each layout, images from 6 faces are assembled into a single rectangular image. Figure 5 shows the geometric comparison between the equidistant rectangular format and the cubic format. An equidistant rectangular geometry 510 and a cubic geometry 520 are shown in FIG. Image 512 is an example of an equidistant rectangular format, and image 522 is an example of a cubic format.

在用於將立方面轉換成輸出格式的現有方法中,總是使用相同的選擇的輸出佈局格式,並且以固定的方式將六個面分配給輸出佈局格式。雖然固定的映射很簡單,但它阻止用戶使用其他佈局格式來滿足用戶的需求。此外,在將立方面轉換為輸出佈局格式之後,經轉換的輸出圖像經常被壓縮以減少所需的空間。選擇的輸出佈局格式和固定映射對於壓縮可能不是有效的。 In existing methods for converting an aspect to an output format, the same selected output layout format is always used, and six faces are assigned to the output layout format in a fixed manner. Although the fixed mapping is simple, it prevents users from using other layout formats to meet the needs of the user. Furthermore, after converting the aspect to the output layout format, the converted output image is often compressed to reduce the space required. The selected output layout format and fixed mapping may not be valid for compression.

公開了處理全向圖像的方法和裝置。根據一種方法,接收使用所選擇的投影格式從360度全景視訊序列中的每個球面圖像轉換的當前一組全向圖像,其中所選擇的投影格式屬於包括立方面格式的投影格式組,以及具有該立方面格式的該當前全方向圖像集合由六個立方面組成。如果該所選擇的投影格式對應於該立方面格式,將該當前全方向圖像集合映射到屬於輸出佈局格式集合的當前立方體貼圖圖像的一個或多個映射語法元素被標識。提供位元流中的編解碼資料,所述位元流包括用於該當前全方向圖像集合的該一個或多個映射語法元素。 Methods and apparatus for processing omnidirectional images are disclosed. According to one method, a current set of omnidirectional images converted from each of the 360 degree panoramic video sequences using the selected projection format is received, wherein the selected projection format belongs to a projection format group including an aspect format, And the current omnidirectional image set having the format of the aspect is composed of six aspects. If the selected projection format corresponds to the aspect format, one or more mapping syntax elements that map the current omnidirectional image collection to the current cubemap image belonging to the output layout format set are identified. A codec material is provided in the bitstream, the bitstream including the one or more mapping syntax elements for the current omnidirectional image set.

投影格式組可還包括等距長方格式、180-3D格式和圓柱圖格式。如果該當前全方向圖像集合是該等距長方投影格式,則將該當前全方向圖像集合轉換成該立方面格式,並且通過將轉換的當前全方向圖像集合視為具有該立方面格式,該一個或多個映射語法元素被標識用於該轉換的當前全方向圖像集合。 The projection format group may further include an equidistant rectangular format, a 180-3D format, and a cylindrical chart format. If the current omnidirectional image set is the equidistant rectangular projection format, converting the current omnidirectional image set into the vertical aspect format, and by considering the converted current omnidirectional image set as having the aspect Format, the one or more mapping syntax elements are identified for the current omnidirectional image collection of the transformation.

映射語法元素可包括與該當前立方體貼圖圖像相關聯的當前立方體類型,並且該當前立方體類型屬於由1x6立方體貼圖佈局、2x3立方體貼圖佈局、3x2立方體貼圖佈局和6x1立方體貼圖佈局組成的當前輸出佈局格式集合。映射語法元素可還包括佈局映射索引,其中每個佈局映射索引將該當前全方向圖像集合的一個立方面與該當前立方體貼圖圖像的一個位置相關聯。對於該當前全方向圖像集合的每個立方面,除了該當前全方向圖像集合的最後一個立方面之外,標識一個佈局映射索引。每個佈局映射索引使用具有等於要映射的剩餘立方面數的條目數的代碼表進行編解碼。在另一實施例中,每個佈局映射索引使用具有等於要映射的立方面數的條目數的代碼表進行編解碼。 The mapping syntax element can include a current cube type associated with the current cubemap image, and the current cube type belongs to a current output layout consisting of a 1x6 cubemap layout, a 2x3 cubemap layout, a 3x2 cubemap layout, and a 6x1 cubemap layout. Format collection. The mapping syntax element can further include a layout mapping index, wherein each layout mapping index associates an aspect of the current omnidirectional image collection with a location of the current cubemap image. For each aspect of the current omnidirectional image collection, a layout map index is identified in addition to the last aspect of the current omnidirectional image collection. Each layout map index is coded using a code table having an entry number equal to the number of remaining facades to be mapped. In another embodiment, each layout map index is coded using a code table having an entry number equal to the number of entries to be mapped.

在另一實施例中,映射語法元素還包括旋轉索引,其中每個旋轉索引指示該當前全方向圖像集合的一個立方面在該當前立方體貼圖圖像的所述一個位置處的旋轉角度。針對該當前全方向圖像集合的每個立方面,標識一個旋轉索引。每個旋轉索引使用代碼表進行編解碼,以指示從對應於{0°和90°},{0°,+90°,-90°和180°}或{0°,90°,180°和270°}選轉角度 集合中選擇的一個旋轉角度。 In another embodiment, the mapping syntax element further includes a rotation index, wherein each rotation index indicates a rotation angle of an aspect of the current omnidirectional image collection at the one location of the current cubemap image. A rotation index is identified for each aspect of the current omnidirectional image collection. Each rotation index is coded using a code table to indicate from {0° and 90°}, {0°, +90°, -90° and 180°} or {0°, 90°, 180° and 270°}selection angle A rotation angle selected in the collection.

映射語法元素還包括默認佈局標誌,用於指示具有該當前立方體類型的該當前全方向圖像集合是否使用默認立方體貼圖圖像,並且其中僅當該默認佈局標誌指示該默認立方體貼圖圖像未被用於該當前全方向圖像集合時,佈局映射索引和旋轉索引被標識用於該當前全方向圖像集合。若該默認佈局標誌指示該默認立方體貼圖圖像被用於該當前全方向圖像集合時,默認佈局映射索引和默認旋轉索引被用於該當前全方向圖像集合。 The mapping syntax element also includes a default layout flag for indicating whether the current omnidirectional image collection having the current cube type uses a default cubemap image, and wherein only the default layout flag indicates that the default cubemap image is not When used for the current omnidirectional image collection, the layout map index and the rotation index are identified for the current omnidirectional image collection. If the default layout flag indicates that the default cubemap image is used for the current omnidirectional image collection, the default layout mapping index and the default rotation index are used for the current omnidirectional image collection.

用於該當前全方向圖像集合的該輸出佈局格式集合可以以序列級別、視圖級別、圖像級別、切片級別、序列參數集合、視訊參數集合、或應用參數集合標識在360度全景視訊序列的位元流中。映射語法元素以序列級別、視圖級別、圖像級別、切片級別、序列參數集、視訊參數集、或應用參數集標識在360度全景視訊序列的位元流中。 The output layout format set for the current omnidirectional image collection may be identified in a 360 degree panoramic video sequence at a sequence level, a view level, an image level, a slice level, a sequence parameter set, a video parameter set, or an application parameter set. In the bit stream. The mapping syntax elements are identified in a bit stream of a 360 degree panoramic video sequence at a sequence level, view level, image level, slice level, sequence parameter set, video parameter set, or application parameter set.

映射語法元素可基於一個或多個參考映射語法元素被預測地標識。在一個實施例中,一個或多個參考映射語法元素的多個集合以序列級別、視圖級別、或圖像級別標識在用於360度全景視訊序列的位元流中,並且在切片級別或圖像級別中標識標誌,以從該當前全方向圖像集合的一個或多個參考映射語法元素的該多個集合中選擇所述一個或多個映射語法元素。在另一個實施例中,參考映射語法元素由來自先前圖片、切片或訊框的一個或多個第一映射語法元素預測。 The mapping syntax elements may be predictively identified based on one or more reference mapping syntax elements. In one embodiment, multiple sets of one or more reference mapping syntax elements are identified in a bitstream for a 360 degree panoramic video sequence at a sequence level, view level, or image level, and at a slice level or graph Like the level identification flag, the one or more mapping syntax elements are selected from the plurality of sets of one or more reference mapping syntax elements of the current omnidirectional image collection. In another embodiment, the reference mapping syntax element is predicted by one or more first mapping syntax elements from a previous picture, slice or frame.

410、1210、1510‧‧‧1x6-佈局 410, 1210, 1510‧‧1x6-layout

420、1220、1520‧‧‧2×3-佈局 420, 1220, 1520‧‧‧2×3-layout

430、1230、1530‧‧‧3×2-佈局 430, 1230, 1530‧‧3×2-layout

440、1240、1540‧‧‧6×1-佈局 440, 1240, 1540‧‧‧6×1-layout

510‧‧‧等距長方幾何 510‧‧‧isometric rectangular geometry

520‧‧‧立方幾何 520‧‧‧Cubic Geometry

512、522、1010、1020、1030‧‧‧圖像 512, 522, 1010, 1020, 1030‧‧‧ images

1310‧‧‧立方面的默認佈局 1310‧‧ ‧ default layout

1320‧‧‧3×2立方體佈局 1320‧‧‧3×2 cube layout

1410、1110‧‧‧立方面索引 1410, 1110‧‧‧ Index

1420、1120‧‧‧等距長方 1420, 1120‧‧‧ equidistant rectangular

1610、1611‧‧‧2x3立方體佈局 1610, 1611‧‧2x3 cube layout

1620~1625‧‧‧1x6立方體佈局 1620~1625‧‧1x6 cube layout

1710、1720、1730、1740‧‧‧步驟 1710, 1720, 1730, 1740‧‧ steps

第1圖示出了等距長方投影格式中的全向視訊(稱為“Hangpai_2”)的圖像的示例。 Figure 1 shows an example of an image of omnidirectional video (referred to as "Hangpai_2") in an equidistant rectangular projection format.

第2a-2k圖展示了包含(第2a圖)立方體貼圖、(第2b圖)Cubemap_32、(第2c圖)Cubemap_180、(第2d圖)Plane_poles、(第2e圖)Plane_poles_6、(第2f圖)Plane_poles_cubemap、(第2g圖)Plane_cubemap、(第2h圖)Plane_cubemap_32、(第2i圖)Flat_fixed、(第2j圖)180 degree 3D視訊(即,180-3D)以及(第2k圖)圓柱圖/圓柱形的不同格式的輸出佈局格式。 Figure 2a-2k shows the inclusion of (Fig. 2a) cube map, (2b) Cubemap_32, (2c) Cubemap_180, (2d) Plane_poles, (2e) Plane_poles_6, (2f) Plane_poles_cubemap (2g) Plane_cubemap, (2h) Plane_cubemap_32, (2i) Flat_fixed, (2j) 180 degree 3D video (ie, 180-3D) and (2k) cylindrical/cylindrical Output layout format for different formats.

第3圖示出了使用投影轉換將等距長方投影格式轉換為立方格式的示例,其中標記為1至6的圖像對應於用於表示360度視訊的立方體的六個面上的圖像。 Figure 3 shows an example of converting an equidistant rectangular projection format to a cubic format using projection transformation, where images labeled 1 through 6 correspond to images on six faces of a cube representing 360 degree video. .

第4圖中示出了四個常用佈局:1x6-佈局410,2×3-佈局420,3×2-佈局430和6×1-佈局440。 Four common layouts are shown in FIG. 4: 1x6-layout 410, 2x3-layout 420, 3x2-layout 430, and 6x1-layout 440.

第5圖中示出了等距長方格式和立方體格式的幾何比較。 A geometric comparison of the equidistant rectangular format and the cubic format is shown in Figure 5.

第6圖示出了將六個立方面映射到1x6佈局的示例。 Figure 6 shows an example of mapping six vertical aspects to a 1x6 layout.

第7圖示出了將六個立方面映射到2x3佈局的示例。 Figure 7 shows an example of mapping six vertical aspects to a 2x3 layout.

第8a-8c圖示出了將六個立方面分配到1x6佈局的示例,其中(第8a圖)步驟1中面# 5分配到佈局的第一位置,(第8b圖)步驟2中面# 4分配到佈局的第二位置,(第8c圖)步驟3中面# 6分配到佈局的第三位置。 Figures 8a-8c illustrate an example of assigning six vertical aspects to a 1x6 layout, where (Fig. 8a), the face #5 in step 1 is assigned to the first position of the layout, (Fig. 8b), step 2, face# 4 is assigned to the second position of the layout, (Fig. 8c), in step 3, face #6 is assigned to the third position of the layout.

第9圖示出了將六個立方面分配到1x6佈局的示例,其中將具有(-90度旋轉)的立方面# 1分配給佈局中的第四位置的示例。 Fig. 9 shows an example of assigning six vertical aspects to a 1x6 layout in which an aspect #1 having a (-90 degree rotation) is assigned to an example of a fourth position in the layout.

第10圖示出了旋轉預測的示例,其中參考佈局用於預測目標輸出佈局。 Figure 10 shows an example of rotation prediction in which a reference layout is used to predict a target output layout.

第11圖示出了立方面索引的六個立方面的表示和相應的等距長方圖像的示例。 Figure 11 shows an example of the representation of six vertical aspects of the facade index and the corresponding equidistant rectangular image.

第12圖中示出了用於1x6佈局、2x3佈局、3x2佈局和6x1佈局中的每個立方體佈局中的位置順序。 The positional order in each of the cube layouts for the 1x6 layout, the 2x3 layout, the 3x2 layout, and the 6x1 layout is shown in FIG.

第13圖示出了六個立方面和目標3×2立方體佈局的默認佈局的示例。 Figure 13 shows an example of a default layout for six vertical and target 3x2 cube layouts.

第14圖示出了立方面的默認相對位置及立方面索引的索引和對應的等距長方。 Figure 14 shows the default relative position of the facade and the index of the aspect index and the corresponding equidistant rectangle.

第15圖中示出了1x6佈局、2x3佈局、3x2佈局和6x1佈局中的每個立方體佈局中的位置的示例性順序。 An exemplary sequence of locations in each of the 1x6 layout, 2x3 layout, 3x2 layout, and 6x1 layout is shown in FIG.

第16a-16d圖示出了(第16a圖)不同的6×1立方體佈局,(第16b圖)不同的3x2立方體佈局,(第16c圖)不同的2x3立方體佈局,和(第16d圖)不同的1x6立方體佈局的其他預定義佈局。 Figures 16a-16d show (6a) different 6x1 cube layouts, (16b) different 3x2 cube layouts, (16c) different 2x3 cube layouts, and (16d) different Other predefined layouts for the 1x6 cube layout.

第17圖示出了根據本發明的實施例的系統處理全向圖像的示例性流程圖。 Figure 17 shows an exemplary flow chart of a system processing an omnidirectional image in accordance with an embodiment of the present invention.

以下描述是實現本發明的最佳設想模式。這一描述是為了說明本發明的一般原理,而不應被認為是限制性的。本發明的範圍最好通過所附權利要求書來確定。 The following description is the best mode of concept for carrying out the invention. This description is made to illustrate the general principles of the invention and should not be considered as limiting. The scope of the invention is preferably determined by the appended claims.

在本發明的一個方面,預定義了一組不同的輸出佈局,並且在位元流中以序列級、視圖級、畫面級、切片(slice) 級別、序列參數集(sequence parameter set,簡寫為SPS)、視訊參數集(video parameter set,簡寫為VPS)或應用參數集(application parameter set,簡寫為APS)發送顯式標誌(explicit flag),以從不同的輸出佈局集中選擇輸出佈局。例如,不同輸出佈局的集合可以包括從包括立體圖佈局、cubemap_32佈局、cubemap_180佈局、plane_poles佈局、plane_poles_6佈局、plane_poles_cubemap佈局、plane_cubemap佈局、plane_cubemap_32佈局、flat_fixed佈局、cubemap_1x6佈局、cubemap_2x3佈局、cubemap_3x2佈局、和cubmap_6x1佈局的組中選擇的至少兩種輸出佈局格式。 In one aspect of the invention, a different set of output layouts is predefined, and in the bitstream, at the sequence level, view level, picture level, slice Level, sequence parameter set (short for SPS), video parameter set (VPS) or application parameter set (abstract for APS) to send an explicit flag to Select the output layout from a different set of output layouts. For example, a collection of different output layouts may include from stereoscopic layout, cubemap_32 layout, cubemap_180 layout, plane_poles layout, plane_poles_6 layout, plane_poles_cubemap layout, plane_cubemap layout, plane_cubemap_32 layout, flat_fixed layout, cubemap_1x6 layout, cubemap_2x3 layout, cubemap_3x2 layout, and cubmap_6x1 layout At least two output layout formats selected in the group.

在另一示例中,輸出佈局格式的集合僅包括cubemap_1x6佈局、cubemap_2x3佈局、cubemap_3x2佈局和cubmap_6x1佈局。傳輸標誌以選擇cubemap_1x6佈局、cubemap_2x3佈局、cubemap_3x2佈局和cubmap_6x1佈局中的輸出佈局。 In another example, the set of output layout formats includes only the cubemap_1x6 layout, the cubemap_2x3 layout, the cubemap_3x2 layout, and the cubmap_6x1 layout. Transfer flags to select the output layout in the cubemap_1x6 layout, cubemap_2x3 layout, cubemap_3x2 layout, and cubmap_6x1 layout.

第6圖和第7圖示出了將六個立方面映射到兩個可能的佈局的示例,其中六個立方面映射到第6圖中的1x6佈局,且六個立方面映射到第7圖中的2x3佈局。 Figures 6 and 7 show an example of mapping six vertical aspects to two possible layouts, where six vertical aspects map to the 1x6 layout in Figure 6, and six vertical aspects map to Figure 7. 2x3 layout in the middle.

根據本發明的一種方法,可以根據以下步驟將六個立方面分配給可能的佈局: According to one method of the present invention, six aspects can be assigned to possible layouts according to the following steps:

步驟1:發送(在編碼器側)/接收(在解碼器側)標誌,以將六個立方面中的一個分配給佈局中的第一位置。第8a圖示出了將面# 5分配到1x6佈局的第一位置的示例。 Step 1: Send (on the encoder side) / receive (on the decoder side) flag to assign one of the six vertical aspects to the first position in the layout. Figure 8a shows an example of assigning face #5 to the first position of the 1x6 layout.

步驟2:發送/接收標誌以將剩餘的五個立方面中的 一個分配給佈局中的第二位置。第8b圖示出了將面# 4分配給1x6佈局的第二位置的示例。 Step 2: Send/receive the flag to the remaining five aspects One is assigned to the second position in the layout. Figure 8b shows an example of assigning face #4 to the second position of the 1x6 layout.

步驟3:發送/接收標誌以將剩餘的四個立方面中的一個分配給佈局中的第三位置。第8c圖示出了將面# 6分配給1x6佈局的第三位置的示例。 Step 3: Send/receive flags to assign one of the remaining four aspects to the third position in the layout. Figure 8c shows an example of assigning face #6 to the third position of the 1x6 layout.

步驟4:發送/接收標誌以將剩餘的三個立方面中的一個分配給佈局中的第四位置。 Step 4: Send/receive flags to assign one of the remaining three aspects to the fourth position in the layout.

步驟5:發送/接收標誌以將剩餘的兩個立方面中的一個分配給佈局中的第五位置。 Step 5: Send/receive flags to assign one of the remaining two aspects to the fifth position in the layout.

步驟6:將最後剩餘的面分配給佈局中的最終位置。由於只有一個剩餘的立方面,所以不需要用標識最後一個標誌。 Step 6: Assign the last remaining faces to the final position in the layout. Since there is only one remaining aspect, there is no need to identify the last one.

在上述提出的方法中,可以使用如表1所示的截斷的一元代碼(Truncated Unary code)來發送標誌。在步驟1中,使用尺寸為6的截斷的一元代碼對第一標記進行編碼,以選擇六個立方面中的一個。在步驟2中,使用具有大小為5的截斷的一元代碼對第二標記進行編碼,以選擇剩下的五個立方體中的一個,等等。 In the method proposed above, the truncated Unary code as shown in Table 1 can be used to transmit the flag. In step 1, the first marker is encoded using a truncated unary code of size 6 to select one of the six aspects. In step 2, the second token is encoded using a truncated unary code having a size of 5 to select one of the remaining five cubes, and so on.

另一個實施例中,面的旋轉也被定義用於將立方面映射到一個輸出佈局。例如,根據以下步驟將六個立方面分配給可能的佈局:步驟1:發送(在編碼器側)/接收(在解碼器側)標誌,以將六個立方面中的一個分配給佈局中的第一位置。發送/接收另一個標誌來定義所選面的旋轉。 In another embodiment, the rotation of the face is also defined to map the aspect to an output layout. For example, assign six vertical aspects to a possible layout according to the following steps: Step 1: Send (on the encoder side) / Receive (on the decoder side) flags to assign one of the six vertical aspects to the layout First position. Send/receive another flag to define the rotation of the selected face.

步驟n(n從2到5):發送/接收標誌以將剩餘的(6-n+1)個立方面中的一個分配給佈局中的第n個位置。發送/接收另一個標誌以定義所選面的旋轉。 Step n (n from 2 to 5): Send/receive flags to assign one of the remaining (6-n+1) aspects to the nth position in the layout. Send/receive another flag to define the rotation of the selected face.

步驟6:將最後剩餘的面分配給佈局中的最終位置。發送/接收另一個標誌來定義所選面的旋轉。 Step 6: Assign the last remaining faces to the final position in the layout. Send/receive another flag to define the rotation of the selected face.

這六個面的旋轉也可以在六個面被分配之後被傳送,如下面的步驟所示: 步驟1-a:發送(在編碼器側)/接收(在解碼器側)標誌,以將六個立方面中的一個分配給佈局中的第一位置。 The rotation of these six faces can also be transmitted after the six faces are assigned, as shown in the following steps: Step 1-a: Transmit (on the encoder side) / receive (on the decoder side) flag to assign one of the six vertical aspects to the first position in the layout.

步驟n-a(n從2到5):發送/接收標誌以將剩餘的(6-n+1)個立方面中的一個分配給佈局中的第n個位置。 Step n-a (n from 2 to 5): Send/receive flags to assign one of the remaining (6-n+1) aspects to the nth position in the layout.

步驟6-a:將最後剩餘的面分配給佈局中的最終位置。 Step 6-a: Assign the last remaining faces to the final position in the layout.

步驟n-b(n從1到6):發送/接收標誌以定義第n個面的旋轉。 Step n-b (n from 1 to 6): Send/receive flags to define the rotation of the nth face.

另一方面,面的分配和該面的旋轉也可以組合為 一個標誌,如以下步驟所示:步驟1:發送(在編碼器側)/接收(在解碼器側)標誌以分配六個立方面中的一個,及其旋轉(方向)到佈局中的第一位置。 On the other hand, the assignment of the face and the rotation of the face can also be combined into A flag, as shown in the following steps: Step 1: Send (on the encoder side) / Receive (on the decoder side) flag to assign one of the six vertical aspects, and its rotation (direction) to the first in the layout position.

步驟n(n從2到5):發送/接收標誌以將餘下的(6-n+1)個立方面中的一個及其旋轉(方向)分配給它的佈局中的第n個位置。 Step n (n from 2 to 5): Send/receive flags to assign one of the remaining (6-n+1) vertical aspects and its rotation (direction) to the nth position in its layout.

步驟6:將最後剩餘的面分配給佈局中的最終位置。發送/接收一個標誌來定義第六個面的旋轉。 Step 6: Assign the last remaining faces to the final position in the layout. Send/receive a flag to define the rotation of the sixth face.

面的旋轉可以從集合{0度,90度}中選擇。在另一個實施例中,面的旋轉也可以從集合{0度,90度,180度和270度}或{0度,90度,-90度,180度}中選擇。例如,在步驟4中,發送或接收標誌,以將旋轉過的立方面# 1分配給佈局中的第四位置。在該示例中,在步驟1至3之後,剩餘的面是{面# 1,面# 2和面# 3}。基於以下截斷的一元表,使用代碼“0”發送索引0(非負)/索引1(嚴格為正),以將面# 1分配給表2所示的第四位置,以及將表3所示的旋轉代碼“10”(或表4所示的旋轉代碼“1”)分配給定義的-90度旋轉。 The rotation of the face can be selected from the set {0 degrees, 90 degrees}. In another embodiment, the rotation of the face can also be selected from the set {0 degrees, 90 degrees, 180 degrees and 270 degrees} or {0 degrees, 90 degrees, -90 degrees, 180 degrees}. For example, in step 4, a flag is sent or received to assign the rotated aspect #1 to the fourth position in the layout. In this example, after steps 1 to 3, the remaining faces are {face #1, face #2, and face #3}. Based on the following truncated unary table, index 0 (non-negative) / index 1 (strictly positive) is sent using code "0" to assign face #1 to the fourth position shown in Table 2, and as shown in Table 3. The rotation code "10" (or the rotation code "1" shown in Table 4) is assigned to the defined -90 degree rotation.

第9圖示出了將具有(-90度旋轉)的立方面# 1分配給佈局中的第四位置的示例。 Fig. 9 shows an example of assigning the aspect #1 having (-90 degree rotation) to the fourth position in the layout.

可以在位元流中以序列級別、視圖級別、圖像級別、切片(slice)級別、SPS(序列參數集)、VPS(視頻參數集)或者APS(適應參數集)級別發送上述提出的將等距長方格式標誌到佈局的方法。在另一個實施例中,如下所示,在序列級別、視圖級別、圖像級別、SPS、VPS或APS中標誌N組對輸出佈局的等距長方的映射格式: The proposed peers may be sent in the bitstream at the sequence level, view level, image level, slice level, SPS (sequence parameter set), VPS (video parameter set) or APS (adaptive parameter set) level. The method of mapping from the rectangular format to the layout. In another embodiment, as shown below, the mapping format of the N sets of equidistant rectangles to the output layout is marked in the sequence level, view level, image level, SPS, VPS or APS:

映射格式: Mapping format:

索引1:{面#1(旋轉角度),面#2(旋轉角度),面#3(旋轉角度),面#4(旋轉角度),面#5(旋轉角度),面#6(旋轉角度)} Index 1: {face #1 (rotation angle), face #2 (rotation angle), face #3 (rotation angle), face #4 (rotation angle), face #5 (rotation angle), face #6 (rotation angle) )}

索引2:{面#1(旋轉角度),面#2(旋轉角度),面#3(旋轉 角度),面#5(旋轉角度),面#4(旋轉角度),面#6(旋轉角度)} Index 2: {face #1 (rotation angle), face #2 (rotation angle), face #3 (rotation) Angle), face #5 (rotation angle), face #4 (rotation angle), face #6 (rotation angle)}

...... ......

索引N:{面#2(旋轉角度),面#3(旋轉角度),面#1(旋轉角度),面#5(旋轉角度),面#4(旋轉角度),面#6(旋轉角度)} Index N: {face #2 (rotation angle), face #3 (rotation angle), face #1 (rotation angle), face #5 (rotation angle), face #4 (rotation angle), face #6 (rotation angle) )}

然後在切片或圖像級別中進一步發送標誌,以從{索引1,索引2,...,索引N}中選擇一種映射格式。 The flag is then further transmitted in the slice or image level to select a mapping format from {index 1, index 2, ..., index N}.

在另一個實施例中,可以從另一映射格式預測映射格式。例如,可以從在序列級別、視圖級別或圖像級別中發送的N組映射格式中選擇隨後的參考映射格式。在又一個實施例中,來自先前圖片/切片/訊框的映射格式可以用作參考映射格式以預測當前的映射格式。參考映射格式{面# 1,面# 2,面# 3,面# 4,面# 5,面# 6}是已知的,用於預測當前的映射格式。 In another embodiment, the mapping format can be predicted from another mapping format. For example, a subsequent reference mapping format can be selected from N sets of mapping formats sent at the sequence level, view level, or image level. In yet another embodiment, the mapping format from the previous picture/slice/frame can be used as a reference mapping format to predict the current mapping format. The reference mapping format {face #1, face #2, face #3, face #4, face #5, face #6} is known for predicting the current mapping format.

假設目標映射格式為{面# 1,面# 2,面# 4,面# 3,面# 5,面# 6},為從{面# 1,面# 2,面# 3,面# 4,面# 5,面# 6)預測目標映射格式,預測演算法可以說明如下:步驟1:按順序的剩餘面是:{面# 1,面# 2,面# 3,面# 4,面# 5,面# 6}。索引0被傳送來決定佈局{面# 1,---,---,---,---,---}中的第一位置中的選定面# 1。 Suppose the target mapping format is {face #1, face#2, face#4, face#3, face#5, face#6}, from {face#1, face#2, face#3, face#4, Face # 5, face # 6) predict the target mapping format, the prediction algorithm can be explained as follows: Step 1: The remaining faces in order are: {face # 1, face # 2, face # 3, face # 4, face # 5 , face # 6}. Index 0 is transmitted to determine the selected face #1 in the first position in the layout {face # 1,---,---,---,---,---}.

步驟2:按順序的剩餘面是:{面# 2,面# 3,面# 4,面# 5,面# 6}。索引0被傳送來決定佈局{面# 1,面# 2,---,---,---,---}中的第二位置中的選定面# 2。 Step 2: The remaining faces in order are: {face # 2, face # 3, face # 4, face # 5, face # 6}. Index 0 is transmitted to determine the selected face #2 in the second position in the layout {face #1, face #2, ---, ---, ---, ---}.

步驟3:按順序的剩餘面是:{面# 3,面# 4,面# 5,面 # 6}。索引1被傳送來決定佈局{面# 1,面# 2,面# 4,---,---,---}中的第三位置中的選定面# 4。 Step 3: The remaining faces in order are: {face #3, face #4, face #5, face # 6}. Index 1 is transmitted to determine the selected face #4 in the third position in the layout {face #1, face #2, face #4, ---, ---, ---}.

步驟4:按順序的剩餘面是:{面# 3,面# 5,面# 6}。索引0被傳送來決定佈局{面# 1,面# 2,面# 4,面#3,---,---}中的第四位置中的選定面# 3。 Step 4: The remaining faces in order are: {face #3, face #5, face #6}. Index 0 is transmitted to determine the selected face #3 in the fourth position in the layout {face #1, face #2, face #4, face #3, ---, ---}.

步驟5:按順序的剩餘面是:{面# 5,面# 6}。索引0被傳送來決定佈局{面# 1,面# 2,面# 4,面#3,面#5,---}中的第五位置中的選定面# 5。 Step 5: The remaining faces in order are: {face #5, face #6}. Index 0 is transmitted to determine the selected face #5 in the fifth position in the layout {face #1, face #2, face #4, face #3, face #5, ---}.

步驟6:剩餘面為{面#6},其被選定為佈局{面# 1,面# 2,面# 4,面#3,面#5,面#6}的最終的面。 Step 6: The remaining face is {face #6}, which is selected as the final face of the layout {face #1, face #2, face #4, face #3, face #5, face #6}.

也可以從參考佈局來預測旋轉。第10圖示出了旋轉預測的示例,其中圖像1010對應於六個立方面,圖像1020對應於參考佈局,圖像1030對應於目標佈局。在第10圖中,旋轉目標佈局中的前三個位置。 It is also possible to predict the rotation from the reference layout. Figure 10 shows an example of rotation prediction, where image 1010 corresponds to six elevations, image 1020 corresponds to a reference layout, and image 1030 corresponds to a target layout. In Figure 10, the first three positions in the target layout are rotated.

在上述示例中,以下步驟可被用於從參考佈局預測目標佈局。在這個示例中,參考佈局是{面# 1(0),面# 2(0),面# 3(0),面# 4(0),面# 5(0),面# 6(0)},目標佈局為{面# 1(-90),面# 2(-90),面# 3(-90),面# 5(0),面# 4(0),面# 6(0))},其中(0)表示0度,(-90)表示-90度。 In the above example, the following steps can be used to predict the target layout from the reference layout. In this example, the reference layout is {face # 1 (0), face # 2 (0), face # 3 (0), face # 4 (0), face # 5 (0), face # 6 (0) }, the target layout is {face # 1 (-90), face # 2 (-90), face # 3 (-90), face # 5 (0), face # 4 (0), face # 6 (0) )}, where (0) represents 0 degrees and (-90) represents -90 degrees.

步驟1:按順序的剩餘面是:{面# 1(0),面# 2(0),面# 3(0),面# 4(0),面# 5(0),面# 6(0)}。索引0被傳送到佈局{面# 1,---,---,---,---,---}中的第一個位置中的選定面# 1。如表5所示,大小為6的截斷的一 元代碼可用於編碼索引0,其中所選代碼的行以粗斜體字體(bold Italic font)表示。進一步發送角度差(-90),最終角度重構為:參考角度+角度差=(0)+(-90)=(-90)。可以使用表6中的旋轉編碼類型1或表7中的旋轉編碼類型2(表4)對角度差(-90)進行編碼,其中所選代碼行以粗斜體字體表示。 Step 1: The remaining faces in order are: {face # 1 (0), face # 2 (0), face # 3 (0), face # 4 (0), face # 5 (0), face # 6 ( 0)}. Index 0 is passed to the selected face #1 in the first position in the layout {face # 1,---,---,---,---,---}. As shown in Table 5, a truncated one of size 6 The metacode can be used to encode index 0, where the lines of the selected code are represented in bold italic font. The angular difference (-90) is further transmitted, and the final angle is reconstructed as: reference angle + angle difference = (0) + (-90) = (-90). The angular difference (-90) can be encoded using Rotation Encoding Type 1 in Table 6 or Rotation Encoding Type 2 (Table 4) in Table 7, where the selected line of code is represented in bold italic font.

步驟2:按照順序的剩餘面的集合是:{面# 2(0),面# 3(0),面# 4(0),面# 5(0),面# 6(0)}。索引0被傳送到佈局{面# 1,面# 2,---,---,---,---}中的第二位置中的選定面#2。角度差(-90)進一步傳遞。在這種情況下,可以使用如表8所示的大小為5的截斷的一元代碼來編碼索引0,其中所選代碼的行以粗斜體字體表示。 Step 2: The set of remaining faces in order is: {face # 2 (0), face # 3 (0), face # 4 (0), face # 5 (0), face # 6 (0)}. Index 0 is passed to selected face #2 in the second position in the layout {face #1, face #2, ---, ---, ---, ---}. The angular difference (-90) is further passed. In this case, index 0 can be encoded using a truncated unary code of size 5 as shown in Table 8, where the lines of the selected code are represented in bold italic font.

步驟3:按順序排列的剩餘面的集合是:{面# 3(0),面# 4(0),面# 5(0),面# 6(0)}。索引0被傳送到佈局{面# 1,面# 2,面# 3,---,---,---}中的第3位置的選定面#3。進一步傳遞角度差(-90)。在這種情況下,可以使用如表9所示的大小為4的截斷的一元代碼來編碼索引0,其 中所選代碼行以粗斜體字體表示。 Step 3: The set of remaining faces arranged in order is: {face # 3 (0), face # 4 (0), face # 5 (0), face # 6 (0)}. The index 0 is transmitted to the selected face #3 of the third position in the layout {face #1, face #2, face #3, ---, ---, ---}. Further pass the angle difference (-90). In this case, index 0 can be encoded using a truncated unary code of size 4 as shown in Table 9, which The selected line of code is represented in bold italic font.

步驟4:按照順序的剩餘面的集合是:{面# 4(0),面# 5(0),面# 6(0)}。索引1被發送到佈局{面# 1,面# 2,面# 3,面# 5,---,---}中的第四位置的選定面# 5。進一步發送角度差(0)。在這種情況下,可以使用如表10所示的尺寸為3的截斷的一元代碼來編碼索引1,其中所選代碼行以粗斜體字體表示。 Step 4: The set of remaining faces in order is: {face # 4 (0), face # 5 (0), face # 6 (0)}. Index 1 is sent to the selected face #5 of the fourth position in the layout {face #1, face #2, face #3, face #5, ---, ---}. Further send the angle difference (0). In this case, index 1 can be encoded using a truncated unary code of size 3 as shown in Table 10, where the selected line of code is represented in bold italic font.

步驟5:按順序的剩餘面的集合是:{面# 4(0),面# 6(0)}。索引0被發送到佈局{面# 1,面# 2,面# 3,面# 5,面# 4,...}中的第5位置的選定面#4。進一步發送角度差(0)。在這種情況下,可以使用如表11所示的大小為2的截 斷的一元代碼來編碼索引0,其中所選代碼的行以粗斜體字體表示。 Step 5: The set of remaining faces in order is: {face # 4(0), face # 6(0)}. Index 0 is sent to the selected face #4 of the 5th position in the layout {face #1, face #2, face #3, face #5, face #4, ...}. Further send the angle difference (0). In this case, a cut of size 2 as shown in Table 11 can be used. The broken unary code encodes index 0, where the lines of the selected code are represented in bold italic font.

步驟6:剩下的面是{面# 6}並被選擇為佈局{面# 1,面# 2,面# 3,面# 5,面# 4,面# 6}中的最後位置。進一步發送角度差(0)。 Step 6: The remaining faces are {face #6} and are selected as the last position in the layout {face #1, face #2, face #3, face #5, face #4, face #6}. Further send the angle difference (0).

例如,可以首先在圖片/切片/訊框級別中發送索引,以選擇一個映射佈局作為參考佈局。然後,預測方法進一步用於從參考佈局預測當前圖片/切片/訊框的目標佈局。在另一示例中,參考佈局可以從先前編碼的圖像/切片/訊框繼承,並且然後應用預測方法來預測當前圖像/切片/訊框的目標佈局。 For example, you can first send an index in the picture/slice/frame level to select a mapping layout as a reference layout. The prediction method is then further used to predict the target layout of the current picture/slice/frame from the reference layout. In another example, the reference layout may be inherited from the previously encoded image/slice/frame and then the prediction method is applied to predict the target layout of the current image/slice/frame.

在上述提出的方法中,截斷的一元代碼也可以由其他熵編解碼方法代替。例如,以下熵編解碼方法之一可以應用於本發明。 In the above proposed method, the truncated unary code can also be replaced by other entropy coding and decoding methods. For example, one of the following entropy codec methods can be applied to the present invention.

- ae(v):上下文自適應算術熵編解碼語法元素。 - ae(v): context adaptive arithmetic entropy codec syntax element.

- b(8):具有任何位元串(8位元)模式的位元組(bit string)。該描述符的解析過程由函數read_bits(8)的返回值指定。 - b(8): A bit string with any bit string (8-bit) mode. The parsing process of this descriptor is specified by the return value of the function read_bits(8).

- f(n):使用具有最先寫入左邊位元(從左到右)的n位元寫入的固定模式位元串。該描述符的解析過程由函數read_bits(n)的返回值指定。 - f(n): Uses a fixed-mode bit string written with n-bits written to the left bit first (from left to right). The parsing process for this descriptor is specified by the return value of the function read_bits(n).

- se(v):帶符號整數的0階指數哥倫布編解碼語法元素,左邊位元是第一個。 - se(v): a 0th-order exponential Columbus codec syntax element with a signed integer, the left bit is the first.

- u(n):使用n位元的無符號整數。當n在語法表中為“v”時,位元數以取決於其他語法元素的值的方式變化。該描述符的解析過程由函數read_bits(n)的返回值指定,解釋為無符號整數的二進位表示,最高有效位元首先寫入。 - u(n): Use an unsigned integer of n bits. When n is "v" in the syntax table, the number of bits varies in a manner that depends on the values of other syntax elements. The parsing process of this descriptor is specified by the return value of the function read_bits(n), interpreted as a binary representation of the unsigned integer, with the most significant bit first written.

- ue(v):無符號整數0階指數哥倫布編解碼語法元素,左邊位元是第一個。 - ue(v): Unsigned integer 0th-order exponential Columbus codec syntax element, the left bit is the first.

根據本發明的實施例的用於佈局信令的示例性語法設計如下所示: An exemplary syntax design for layout signaling in accordance with an embodiment of the present invention is as follows:

在上述語法表中,語法num_of_layout_faces指定佈局中面的總數。語法layout_face[i]將剩餘(num_of_layout_faces-i)面的索引指定為佈局中第i個位置的 對應面,layout_face[i]的值應在0到(num_of_layout_faces-i-1)的範圍內,包括0和(num_of_layout_faces-i-1)。語法layout_face[num_of_layout_faces-1]被推斷為等於最後剩餘面。 In the above syntax table, the syntax num_of_layout_faces specifies the total number of faces in the layout. The syntax layout_face[i] specifies the index of the remaining (num_of_layout_faces-i) face as the corresponding face of the i-th position in the layout, and the value of layout_face[i] should be in the range of 0 to (num_of_layout_faces-i-1), including 0. And (num_of_layout_faces-i-1). The grammar layout_face[num_of_layout_faces-1] is inferred to be equal to the last remaining face.

根據本發明的實施例的用於具有旋轉的佈局信令的示例性語法設計如下所示: An exemplary syntax design for layout signaling with rotation in accordance with an embodiment of the present invention is as follows:

在上述語法表中,語法num_of_layout_faces指定剩餘num_of_layout_faces-i個面中的索引作為佈局中第i個位置的對應面,並且layout_face[i]的值應在0到(num_of_layout_faces-i-1)的範圍內,包括0和(num_of_layout_faces-i-1)。語法layout_face[num_of_layout_faces-1]被推斷為等於最後剩餘 面。語法layout_face[i]指定表14或表15中規定的佈局中第i個位置的相應面旋轉,並且layout_face[i]的值應在0到3(或1),,包括0和3(或1)。 In the above syntax table, the syntax num_of_layout_faces specifies the index in the remaining num_of_layout_faces-i faces as the corresponding face of the i-th position in the layout, and the value of layout_face[i] should be in the range of 0 to (num_of_layout_faces-i-1) , including 0 and (num_of_layout_faces-i-1). The grammar layout_face[num_of_layout_faces-1] is inferred to be equal to the last remaining face. The grammar layout_face[i] specifies the corresponding face rotation of the i-th position in the layout specified in Table 14 or Table 15, and the value of layout_face[i] should be 0 to 3 (or 1), including 0 and 3 (or 1 ).

根據另一個實施例,用於佈局信令的示例性語法設計在表16中示出,其中為這些六立方面的置換和旋轉的訊息標誌附加語法。默認立方面的相對位置。初始立方面陣列等於{Left,Front,Right,Rear,Top,Bottom}。立方面索引和相應的等距長方的標示如第11圖所示。 In accordance with another embodiment, an exemplary syntax design for layout signaling is shown in Table 16, with syntax added for these six-part replacement and rotated message flags. The relative position of the default aspect. The initial vertical aspect array is equal to {Left, Front, Right, Rear, Top, Bottom}. The vertical index and the corresponding equidistant rectangular are shown in Figure 11.

在上述語法表中,num_of_layout_face_minus1指定佈局中面的總數。語法num_of_layout_faces推斷為num_of_layout_face_rminus1+1。語法layout_face[i]指定剩餘(num_of_layout_faces-i)面中的索引作為在佈局中的第i個位置的對應face,layout_face[i]的值應在0到(num_of_layout_face_minus1-i)(含0和num_of_layout_face_minus1-i)的範圍內。語法layout_face[num_of_layout_face_minus1]被推定為最後的剩餘面。語法layout_rotation[i]指定表17中規定的佈局中第i個位置的相應面旋轉,並且layout_rotation[i]的值應在0到3的範圍內,包括0和3。 In the above syntax table, num_of_layout_face_minus1 specifies the total number of faces in the layout. The syntax num_of_layout_faces is inferred to be num_of_layout_face_rminus1+1. The syntax layout_face[i] specifies the index in the remaining (num_of_layout_faces-i) plane as the corresponding face of the i-th position in the layout, and the value of layout_face[i] should be 0 to (num_of_layout_face_minus1-i) (including 0 and num_of_layout_face_minus1- Within the scope of i). The grammar layout_face[num_of_layout_face_minus1] is presumed to be the last remaining face. The grammar layout_rotation[i] specifies the corresponding face rotation of the i-th position in the layout specified in Table 17, and the value of layout_rotation[i] should be in the range of 0 to 3, including 0 and 3.

第12圖中示出了用於1x6佈局(1210)、2x3佈局(1220)、3x2佈局(1230)和6x1佈局(1240)中的每個立方體佈局中的位置順序(即layout_face[i])。第13圖示出了六個立方面的默認佈局(1310)和目標3×2立方體佈局(1320)的示例。 The positional order (i.e., layout_face[i] ) in each of the cube layouts in the 1x6 layout (1210), the 2x3 layout (1220), the 3x2 layout (1230), and the 6x1 layout (1240) is shown in FIG. Figure 13 shows an example of a default layout (1310) and a target 3x2 cube layout (1320) for six aspects.

根據上述實施例的標誌立方面佈局的示例在下表中示出。對於i=0、1和2,前三個佈局面選擇對應於列表中的第一個位置(即0{Left},0{Front}和0{Right})。因此,對於i=0、1和2,選擇索引0。對於i=3,選擇列表中的第二位置(即,1{Top})。因此,標誌索引1。對於i=4,選擇列表中的第二位置(即0{Rear})。因此,標誌索引1。由於只剩下一個立方面,所以不需要為最後一個立方面佈局標誌索引。但是,最後一個立方面(即i=5)仍然需要layout_rotation[i]。 An example of the landmark aspect layout according to the above embodiment is shown in the following table. For i=0, 1 and 2, the first three layout face selections correspond to the first position in the list (ie 0{Left}, 0{Front} and 0{Right}). Therefore, for i=0, 1 and 2, index 0 is selected. For i=3, select the second position in the list (ie, 1{Top}). Therefore, the flag index is 1. For i=4, select the second position in the list (ie 0{Rear}). Therefore, the flag index is 1. Since there is only one aspect left, there is no need to index the flag for the last aspect. However, the last aspect (ie i=5) still requires layout_rotation[i].

在另一個實施例中,用於佈局信令的語法設計包括用於發送這六個立方面的置換和旋轉的訊息的附加語法。立方面的默認相對位置及其索引如第14圖所示,其中示出了立方面索引(1410)和對應的等距長方(1420)。 In another embodiment, the syntax design for layout signaling includes an additional syntax for transmitting the replacement and rotation of the six aspects. The default relative position of the aspect and its index are shown in Figure 14, which shows the aspect index (1410) and the corresponding equidistant rectangle (1420).

根據上述實施例的示例性語法設計在表19a和表19b中示出。在表19a中,mapping_extension()由斜體文本定義。mapping_extension()的詳細訊息如表19b所示。 Exemplary syntax designs in accordance with the above embodiments are shown in Tables 19a and 19b. In Table 19a, mapping_extension() is defined by italicized text. The detailed information of mapping_extension() is shown in Table 19b.

在上表中,語法num_of_layout_face_minus1指定佈局中面的總數。語法num_of_layout_face被推斷為num_of_layout_face_minus1+1。語法layout_face[i]指定立方面索引作為佈局中第i個位置的對應面,layout_face的值[i]的範圍為0到(num_of_layout_face_minus1)(含0和(num_of_layout_face_minusl))。語法layout_face[num_of_layout_face_minus1]被推定為最後的剩餘面。語法layout_rotation[i]指定表20中規定的佈局中第i個位置的相應面旋轉,並且layout_rotation[i]的值應在0到3的範圍內,包括0和3。 In the above table, the syntax num_of_layout_face_minus1 specifies the total number of faces in the layout. The syntax num_of_layout_face is inferred to be num_of_layout_face_minus1+1. The grammar layout_face[i] specifies the vertical aspect index as the corresponding face of the i-th position in the layout, and the value [i] of the layout_face ranges from 0 to (num_of_layout_face_minus1) (including 0 and (num_of_layout_face_minusl)). The grammar layout_face[num_of_layout_face_minus1] is presumed to be the last remaining face. The grammar layout_rotation[i] specifies the corresponding face rotation of the i-th position in the layout specified in Table 20, and the value of layout_rotation[i] should be in the range of 0 to 3, including 0 and 3.

第15圖中示出了1x6佈局(1510)、2x3佈局(1520 )、3x2佈局(1530)和6x1佈局(1540)中的每個立方體佈局中的位置(即layout_面[i])的另一示例性順序。 Figure 15 shows the 1x6 layout (1510), 2x3 layout (1520) Another exemplary order of positions in each cube layout (ie, layout_face [i]) in the 3x2 layout (1530) and the 6x1 layout (1540).

根據上述實施例的另一示例性語法設計在表21中示出。 Another exemplary syntax design according to the above embodiment is shown in Table 21.

在上表中,mapping_format指定如下表所示的360度視訊的輸入映射格式,其中mapping_format的值應在0到3的範圍內,包括0和3。 In the above table, mapping_format specifies the input mapping format of the 360-degree video as shown in the following table, where the value of mapping_format should be in the range of 0 to 3, including 0 and 3.

在表21中,cubic_type指定如表23所示的立方體格式的立方體類型,並且cubic_type的值應在0至3的範圍內,包括0和3。 In Table 21, cubic_type specifies the cube type of the cube format as shown in Table 23, and the value of cubic_type should be in the range of 0 to 3, including 0 and 3.

語法default_layout_flag等於1表示立方體佈局遵循表24中指定的默認佈局。語法default_layout_flag等於0表示默認立方體佈局不存在,並且顯式地標誌每個位置的相應面和旋轉。表24指定了默認佈局。根據第12圖所示的順序,立體佈局中每個位置的相應面和旋轉指定如下。第11圖中定義了從等距長方到立方面的映射,其中示出了立方面索引(1110)和對應的等距長方(1120)。 The syntax default_layout_flag equal to 1 indicates that the cube layout follows the default layout specified in Table 24. The syntax default_layout_flag equal to 0 indicates that the default cube layout does not exist and explicitly marks the corresponding face and rotation of each position. Table 24 specifies the default layout. According to the sequence shown in Fig. 12, the respective faces and rotations of each position in the three-dimensional layout are designated as follows. A map from equidistant rectangular to vertical aspects is defined in Fig. 11, showing the vertical aspect index (1110) and the corresponding equidistant rectangular (1120).

在表21中,layout_face[i]將剩餘面中的索引指定為佈局中第i個位置的對應面。根據第12圖所示的順序,每個位 置的面和旋轉(即,layout_face[i]和layout_rotation[i])如下迭代地標誌(signaled iteratively)。初始(面-選擇)陣列包括{Left,Front,Right,Rear,Top,Bottom}的順序的所有六個面。對於第一個位置,標誌layout_face[0],以指示陣列中所選面的索引(即{Left,Front,Right,Rear,Top,Bottom})作為位置0的對應面。layout_face[0]的值應在0到5的範圍內,包括0和5。所選擇的位置0的面隨後從剩餘候選者的陣列中移除。對於第i個位置,標誌layout_face[i],以指示由剩餘的未選擇面組成的更新陣列中的索引作為位置i的相應面。layout_face[i]的值應在0到5-i的範圍內,包括0和5-i。位置i的對應的選定面隨後從餘下的候補陣列中移除。對於最後的位置,它被推斷為最後剩下的未選擇的面。 In Table 21, layout_face[i] specifies the index in the remaining faces as the corresponding face of the i-th position in the layout. According to the order shown in Figure 12, each bit The faces and rotations (ie, layout_face[i] and layout_rotation[i]) are signaled iteratively as follows. The initial (face-select) array includes all six faces of the order of {Left, Front, Right, Rear, Top, Bottom}. For the first position, the layout_face[0] is flagged to indicate the index of the selected face in the array (ie {Left, Front, Right, Rear, Top, Bottom}) as the corresponding face of position 0. The value of layout_face[0] should be in the range 0 to 5, including 0 and 5. The face of the selected position 0 is then removed from the array of remaining candidates. For the ith position, the layout_face[i] is flagged to indicate the index in the updated array consisting of the remaining unselected faces as the corresponding face of position i. The value of layout_face[i] should be in the range 0 to 5-i, including 0 and 5-i. The corresponding selected face of position i is then removed from the remaining candidate array. For the last position, it is inferred to be the last remaining unselected face.

在表21中,layout_rotation[i]指定了表20中規定的佈局中第i個位置的相應面旋轉,並且layout_rotation[i]的值應在0到3的範圍內,包括0和3。 In Table 21, layout_rotation[i] specifies the corresponding face rotation of the i-th position in the layout specified in Table 20, and the value of layout_rotation[i] should be in the range of 0 to 3, including 0 and 3.

根據另一個實施例,表25中示出了用於佈局信令的示例性語法設計,其中針對這六個立方面的置換和旋轉的訊息標誌附加語法。 In accordance with another embodiment, an exemplary syntax design for layout signaling is shown in Table 25, with syntax added for the replacement and rotated message flags for the six aspects.

在本發明的另一個實施例中,如果選擇立方面作為佈局格式,則在位元流的序列級別、視圖級別、圖像級別、切片級別、瓦片(tile)級別、SPS、VPS或APS中發送第二標誌從一組立方體類型集合中選擇一種立方體類型,該立方體類型集合包括1×6立方體格式、2×3立方體格式、3×2立方體格式和6 ×1立方體格式中的至少兩種。 In another embodiment of the present invention, if the aspect is selected as the layout format, in the sequence level, view level, image level, slice level, tile level, SPS, VPS or APS of the bit stream Sending a second flag selects a cube type from a set of cube type sets including a 1 x 6 cube format, a 2 x 3 cube format, a 3 x 2 cube format, and 6 At least two of the ×1 cube formats.

在另一個實施例中,在位元流的序列級別,視圖級別、圖片級別、切片級別、瓦片級別、SPS、VPS或APS中發送標誌,以指示是否使用了立方體或其他格式的預定義默認佈局。如果不使用立方體或其他格式的預定義默認佈局,則佈局隨後被顯示地傳輸到位元流中。立方面的預定義佈局可以定義為表24。 In another embodiment, a flag is sent at the sequence level of the bitstream, view level, picture level, slice level, tile level, SPS, VPS or APS to indicate whether a predefined default of cube or other format is used. layout. If you do not use a predefined default layout of cubes or other formats, the layout is then displayed and transferred to the bitstream. The predefined layout of the aspect can be defined as Table 24.

也可以使用其他預定義的佈局。例如,也可以如第16a圖所示定義不同的6×1立方體佈局。第16b圖中顯示了不同的3x2立方體佈局。第16c圖中顯示了兩個不同的2x3立方體佈局(1610和1611)。第16d圖中顯示了六個不同的1x6立方體佈局(1620到1625)。 Other predefined layouts are also available. For example, a different 6x1 cube layout can also be defined as shown in Figure 16a. A different 3x2 cube layout is shown in Figure 16b. Two different 2x3 cube layouts (1610 and 1611) are shown in Figure 16c. Six different 1x6 cube layouts (1620 to 1625) are shown in Figure 16d.

為了明確地標誌佈局,在位元流中發送與每個相應面相關聯的對應面和/或旋轉。在一個示例中,初始(面-選擇)陣列包括具有給定預定義順序的{Left,Front,Right,Rear,Top,Bottom}中的至少兩個。 To explicitly mark the layout, corresponding faces and/or rotations associated with each respective face are transmitted in the bitstream. In one example, the initial (face-select) array includes at least two of {Left, Front, Right, Rear, Top, Bottom} with a given predefined order.

對於第一位置,標誌語法以將給定陣列中的選定面的索引指示為第一位置的對應面,並且語法的值應在0至(N-1),其中N表示總的面的數量。 For the first location, the flag syntax indicates the index of the selected face in the given array as the corresponding face of the first location, and the value of the grammar should be between 0 and (N-1), where N represents the total number of faces.

對於第i個位置,標誌另一個語法來指示由剩餘的未選擇面組成的更新的陣列中的索引作為位置i的對應面,並且該語法的值應在0至(N-1)的範圍內,包括0和(N-1)。對於最後的位置,它被推斷為最後剩下的未選擇的面。 For the ith position, another syntax is indicated to indicate the index in the updated array consisting of the remaining unselected faces as the corresponding face of position i, and the value of the grammar should be in the range of 0 to (N-1) , including 0 and (N-1). For the last position, it is inferred to be the last remaining unselected face.

在另一示例中,初始(面-選擇)陣列包括具有 給定預定義順序的{Left,Front,Right,Rear,Top,Bottom}中的至少兩個。 In another example, the initial (face-select) array includes Given at least two of {Left, Front, Right, Rear, Top, Bottom} in a predefined order.

對於第一位置,標誌語法,以將給定陣列中的選定面的索引指示為第一位置的對應面,並且語法的值應在0至(N-1)的範圍內,其中N表示總的面的數量。之後,第一個位置的選定面將從陣列中刪除。 For the first location, the flag syntax is to indicate the index of the selected face in the given array as the corresponding face of the first location, and the value of the grammar should be in the range of 0 to (N-1), where N represents the total The number of faces. The selected faces of the first position are then removed from the array.

對於第i個位置,標誌另一個語法來指示由剩餘未選擇面組成的更新陣列中的索引作為位置i的對應面,並且該語法的值應在0到(N-1-i)(含0和(N-1-i))的範圍內。隨後位置i的對應選定面隨後被從陣列中移除。對於最後的位置,它被推斷為最後剩下的未選擇的面。 For the ith position, another syntax is indicated to indicate the index in the updated array consisting of the remaining unselected faces as the corresponding face of position i, and the value of the grammar should be 0 to (N-1-i) (including 0) And (N-1-i)) within the scope. The corresponding selected face of position i is then removed from the array. For the last position, it is inferred to be the last remaining unselected face.

為了指定佈局中的每個位置的對應的面旋轉,發送每個位置的語法以指示從一組旋轉候選集合(其包括(-900,+900,+1800,00)中的至少兩個)中指示相應的面旋轉。旋轉索引的規格可以定義如表20所示。 To specify the corresponding face rotation for each position in the layout, the syntax for each position is sent to indicate at least one of the set of rotation candidates (which includes (-90 0 , +90 0 , +180 0 , 0 0 ) The corresponding face rotation is indicated in both). The specifications of the rotation index can be defined as shown in Table 20.

第17圖示出了根據本發明的實施例的系統處理全向圖像的示例性流程圖。流程圖中所示的步驟以及本公開的其他流程圖可以被實現為在編碼器側和/或解碼器側的一個或多個處理器(例如,一個或多個CPU)上可執行的程式代碼。流程圖中所示的步驟也可以基於諸如被配置為執行流程圖中的步驟的一個或多個電子設備或處理器的硬體來實現。根據該方法,在步驟1710中接收使用選擇的投影格式的360度全景視訊序列中的每個球面圖像轉換的當前全向圖像集合,其中所選擇的投影格式屬於包括立方面格式的投影格式組,而立方面格式 的當前組全方位圖像由六個立方面組成。在步驟1720中檢查所選擇的投影格式是否對應於立方面格式。如果所選擇的投影格式對應於立方面格式,則執行步驟1730和1740。否則(即非立方面格式的“否”路徑),步驟1730和1740被跳過。在步驟1730中,標誌將當前的全方向圖像集合映射到屬於輸出佈局格式集合的當前立方體貼圖(cubemap)圖像中的一個或多個映射語法元素,其中輸出佈局格式集合包括從包括立方體貼圖佈局和等距長方(equi-rectangular)格式的佈局組中選擇的至少兩個輸出佈局格式。在步驟1740中,為當前的全向圖像組提供包含該一個或多個映射語法元素的位元流中的編解碼資料。 Figure 17 shows an exemplary flow chart of a system processing an omnidirectional image in accordance with an embodiment of the present invention. The steps shown in the flowcharts and other flowcharts of the present disclosure may be implemented as program code executable on one or more processors (eg, one or more CPUs) on the encoder side and/or the decoder side. . The steps shown in the flowcharts can also be implemented based on hardware such as one or more electronic devices or processors configured to perform the steps in the flowcharts. According to the method, a current omnidirectional image set for each spherical image transformation in a 360 degree panoramic video sequence using the selected projection format is received in step 1710, wherein the selected projection format belongs to a projection format including an aspect format Group format The current set of omnidirectional images consists of six vertical aspects. It is checked in step 1720 whether the selected projection format corresponds to the aspect format. If the selected projection format corresponds to the aspect format, steps 1730 and 1740 are performed. Otherwise (ie, the "no" path of the non-formal format), steps 1730 and 1740 are skipped. In step 1730, the flag maps the current omnidirectional image set to one or more mapping syntax elements in a current cube map image belonging to the output layout format set, wherein the output layout format set includes from the including cube map At least two output layout formats selected in the layout and layout groups of the equi-rectangular format. In step 1740, the current omnidirectional image group is provided with codec material in the bitstream containing the one or more mapping syntax elements.

以上所示的流程圖旨在作為示例來說明本發明的實施例。本領域技術人員可以通過修改各個步驟,分離或組合步驟來實踐本發明,而不脫離本發明的精神。 The flowcharts shown above are intended to be illustrative of embodiments of the invention. The present invention can be practiced by a person skilled in the art by modifying various steps, separate or combining steps without departing from the spirit of the invention.

呈現上述描述以使得本領域具有習知技術者能夠在特定應用及其要求的上下文中實施本發明。對所描述的實施例的各種修改對於本領域具有習知技術者將是顯而易見的,並且本文定義的一般原理可以應用於其他實施例。因此,本發明並不限於所示出和描述的特定實施例,而是符合與本文所公開的原理和新穎特徵相一致的最寬範圍。在上述詳細描述中,示出了各種具體細節以便提供對本發明的透徹理解。然而,本領域具有習知技術者將理解,可以實施本發明。 The above description is presented to enable a person of ordinary skill in the art to practice the invention in the context of the particular application. Various modifications to the described embodiments will be apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the invention is not to be limited to the specific embodiments shown and described. In the above detailed description, various specific details are shown However, it will be understood by those skilled in the art that the present invention may be practiced.

如上所述的本發明的實施例可以以各種硬體、軟體代碼或兩者的組合來實現。例如,本發明的實施例可以是集成到視訊壓縮晶片中的一個或多個電子電路或集成到視頻壓 縮軟體中的程式代碼,以執行本文所述的處理。本發明的實施例也可以是要在數位信號處理器(DSP)上執行以執行本文所述的處理的程式代碼。本發明還可涉及由計算機處理器、數位信號處理器、微處理器或現場可程式化閘陣列(FPGA)執行的多個功能。可以通過執行定義本發明所體現的特定方法的機器可讀軟體代碼或韌體代碼來將這些處理器配置成執行根據本發明的特定任務。軟體代碼或韌體可以以不同的程式語言和不同的格式或風格開發。也可以為不同的目標平台編譯軟體代碼。然而,根據本發明的不同的代碼格式,軟體代碼的樣式和語言以及配置代碼的其他方式將不會脫離本發明的精神和範圍。 Embodiments of the invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, embodiments of the invention may be integrated into one or more electronic circuits in a video compression chip or integrated into video compression Program code in the software to perform the processing described herein. Embodiments of the invention may also be program code to be executed on a digital signal processor (DSP) to perform the processes described herein. The invention may also relate to a plurality of functions performed by a computer processor, a digital signal processor, a microprocessor or a field programmable gate array (FPGA). These processors may be configured to perform particular tasks in accordance with the present invention by executing machine readable software code or firmware code that defines a particular method embodied by the present invention. Software code or firmware can be developed in different programming languages and in different formats or styles. It is also possible to compile software code for different target platforms. However, the different code formats, the style and language of the software code, and other ways of configuring the code in accordance with the present invention will not depart from the spirit and scope of the present invention.

在不脫離本發明的精神或基本特徵的情況下,本發明可以以其他具體形式實施。所描述的例子僅在所有方面被認為是說明性的而不是限制性的。因此,本發明的範圍由申請 專利範圍而不是前面的描述來指示。屬於申請專利範圍請求項的等同物的含義和範圍內的所有變化將被包括在其範圍內。 The present invention may be embodied in other specific forms without departing from the spirit or essential characteristics of the invention. The described examples are to be considered in all respects illustrative illustrative Therefore, the scope of the invention is applied for The scope of the patent is not indicated by the foregoing description. All changes that come within the meaning and range of equivalents of the claims of the claims are intended to be included.

Claims (19)

一種全向圖像處理方法,所述方法包括:使用所選擇的投影格式接收從360度全景視訊序列中的每個球形圖像轉換的當前全向圖像集合,其中所選擇的投影格式屬於一投影格式組,該投影格式組包括一立方面格式,以及具有該立方面格式的該當前全方向圖像集合由六個立方面組成;以及如果該所選擇的投影格式對應於該立方面格式:標識一個或多個映射語法元素,以將該當前全方向圖像集合映射到一當前立方體貼圖圖像,該當前立方體貼圖圖像屬於一輸出佈局格式集合,該輸出佈局格式集合包括從包括一立方體貼圖佈局和一等距長方投影格式的一佈局組中選擇的至少兩個輸出佈局格式;以及提供一位元流中的編解碼資料,該位元流包括用於該當前全方向圖像集合的該一個或多個映射語法元素。 An omnidirectional image processing method, the method comprising: receiving, using a selected projection format, a current omnidirectional image set converted from each spherical image in a 360 degree panoramic video sequence, wherein the selected projection format belongs to a a projection format group comprising a facade format, and the current omnidirectional image collection having the facade format is composed of six aspects; and if the selected projection format corresponds to the aspect format: Identifying one or more mapping syntax elements to map the current omnidirectional image collection to a current cubemap image, the current cubemap image belonging to an output layout format set, the output layout format set including from a cube At least two output layout formats selected from a layout group and a layout group of an equidistant rectangular projection format; and providing codec data in a one-bit stream, the bitstream including the current omnidirectional image collection The one or more mapping syntax elements. 如申請專利範圍第1項所述之全向圖像處理方法,其中該投影格式組還包括180-3D格式、圓柱圖格式、cubemap_32佈局、cubemap_180佈局、plane_poles佈局、plane_poles_6佈局、plane_poles_cubemap佈局、plane_cubemap佈局、plane_cubemap_32佈局、flat_fixed佈局、cubemap_1x6佈局、cubemap_2x3佈局、cubemap_3x2佈局和cubmap_6x1佈局。 The omnidirectional image processing method according to claim 1, wherein the projection format group further comprises a 180-3D format, a cylinder format, a cubemap_32 layout, a cubemap_180 layout, a plane_poles layout, a plane_poles_6 layout, a plane_poles_cubemap layout, and a plane_cubemap layout. , plane_cubemap_32 layout, flat_fixed layout, cubemap_1x6 layout, cubemap_2x3 layout, cubemap_3x2 layout, and cubmap_6x1 layout. 如申請專利範圍第2項所述之全向圖像處理方法,其中如果該當前全方向圖像集合是該等距長方投影格式,則將該當 前全方向圖像集合轉換成該立方面格式,並且通過將轉換的當前全方向圖像集合視為具有該立方面格式,該一個或多個映射語法元素被標識用於該轉換的當前全方向圖像集合。 The omnidirectional image processing method of claim 2, wherein if the current omnidirectional image set is the equidistant rectangular projection format, The front omnidirectional image set is converted to the vertical aspect format, and by considering the converted current omnidirectional image set as having the vertical aspect format, the one or more mapping syntax elements are identified for the current omnidirectional direction of the transformation Image collection. 如申請專利範圍第1項所述之全向圖像處理方法,其中該一個或多個映射語法元素包括與該當前立方體貼圖圖像相關聯的當前立方體類型,並且該當前立方體類型屬於由1x6立方體貼圖佈局、2x3立方體貼圖佈局、3x2立方體貼圖佈局和6x1立方體貼圖佈局組成的當前輸出佈局格式集合。 The omnidirectional image processing method of claim 1, wherein the one or more mapping syntax elements include a current cube type associated with the current cubemap image, and the current cube type belongs to a 1x6 cube A collection of current output layout formats consisting of a texture layout, a 2x3 cubemap layout, a 3x2 cubemap layout, and a 6x1 cubemap layout. 如申請專利範圍第4項所述之全向圖像處理方法,其中該一個或多個映射語法元素還包括複數個佈局映射索引,其中每個佈局映射索引將該當前全方向圖像集合的一個立方面與該當前立方體貼圖圖像的一個位置相關聯。 The omnidirectional image processing method of claim 4, wherein the one or more mapping syntax elements further comprise a plurality of layout mapping indexes, wherein each layout mapping index is one of the current omnidirectional image collections The aspect is associated with a location of the current cubemap image. 如申請專利範圍第5項所述之全向圖像處理方法,其中每個佈局映射索引使用具有等於要映射的立方面數的條目數的代碼表進行編解碼。 The omnidirectional image processing method of claim 5, wherein each of the layout map indexes is coded using a code table having an entry number equal to the number of vertical points to be mapped. 如申請專利範圍第5項所述之全向圖像處理方法,其中,對於該當前全方向圖像集合的每個立方面,除了該當前全方向圖像集合的最後一個立方面之外,標識一個佈局映射索引。 The omnidirectional image processing method of claim 5, wherein for each aspect of the current omnidirectional image set, in addition to the last aspect of the current omnidirectional image set, the identification A layout map index. 如申請專利範圍第7項所述之全向圖像處理方法,其中每個佈局映射索引使用具有等於要映射的剩餘立方面數的條目數的代碼表進行編解碼。 An omnidirectional image processing method according to claim 7, wherein each layout map index is coded using a code table having an entry number equal to the number of remaining facades to be mapped. 如申請專利範圍第5項所述之全向圖像處理方法,其中該一 個或多個映射語法元素還包括複數個旋轉索引,其中每個旋轉索引指示該當前全方向圖像集合的一個立方面在該當前立方體貼圖圖像的所述一個位置處的旋轉角度。 An omnidirectional image processing method according to claim 5, wherein the one The one or more mapping syntax elements further include a plurality of rotation indices, wherein each rotation index indicates a rotation angle of an aspect of the current omnidirectional image collection at the one location of the current cubemap image. 如申請專利範圍第9項所述之全向圖像處理方法,其中,針對該當前全方向圖像集合的每個立方面,標識一個旋轉索引。 The omnidirectional image processing method of claim 9, wherein a rotation index is identified for each aspect of the current omnidirectional image collection. 如申請專利範圍第10項所述之全向圖像處理方法,其中每個旋轉索引使用代碼表進行編解碼,以指示從對應於{0°和90°},{0°,+90°,-90°和180°}或{0°,90°,180°和270°}選轉角度集合中選擇的一個旋轉角度。 An omnidirectional image processing method according to claim 10, wherein each of the rotation indexes is coded and decoded using a code table to indicate that the correspondence corresponds to {0° and 90°}, {0°, +90°, -90° and 180°} or {0°, 90°, 180° and 270°} select one of the rotation angles selected in the set of angles. 如申請專利範圍第9項所述之全向圖像處理方法,其中該一個或多個映射語法元素還包括一默認佈局標誌,用於指示具有該當前立方體類型的該當前全方向圖像集合是否使用默認立方體貼圖圖像,並且其中僅當該默認佈局標誌指示該默認立方體貼圖圖像未被用於該當前全方向圖像集合時,該佈局映射索引和該旋轉索引被標識用於該當前全方向圖像集合。 The omnidirectional image processing method of claim 9, wherein the one or more mapping syntax elements further comprise a default layout flag for indicating whether the current omnidirectional image set having the current cube type is Using a default cubemap image, and wherein the layout map index and the rotation index are identified for the current full only if the default layout flag indicates that the default cubemap image is not used for the current omnidirectional image collection Directional image collection. 如申請專利範圍第9項所述之全向圖像處理方法,其中該一個或多個映射語法元素還包括一默認佈局標誌,用於指示具有該當前立方體類型的該當前全方向圖像集合是否使用默認立方體貼圖圖像,並且其中若該默認佈局標誌指示該默認立方體貼圖圖像被用於該當前全方向圖像集合時,默認佈局映射索引和默認旋轉索引被用於該當前全方向圖像集合。 The omnidirectional image processing method of claim 9, wherein the one or more mapping syntax elements further comprise a default layout flag for indicating whether the current omnidirectional image set having the current cube type is Using a default cubemap image, and wherein if the default layout flag indicates that the default cubemap image is used for the current omnidirectional image collection, the default layout map index and the default spin index are used for the current omnidirectional image set. 如申請專利範圍第1項所述之全向圖像處理方法,其中用於該當前全方向圖像集合的該輸出佈局格式集合以序列級別、視圖級別、圖像級別、切片級別、序列參數集合、視訊參數集合、或應用參數集合標識在360度全景視訊序列的位元流中。 The omnidirectional image processing method of claim 1, wherein the output layout format set for the current omnidirectional image set is at a sequence level, a view level, an image level, a slice level, and a sequence parameter set. The set of video parameters, or the set of application parameters, is identified in the bitstream of the 360-degree panoramic video sequence. 如申請專利範圍第1項所述之全向圖像處理方法,其中該一個或多個映射語法元素以序列級別、視圖級別、圖像級別、切片級別、序列參數集、視訊參數集、或應用參數集標識在360度全景視訊序列的位元流中。 The omnidirectional image processing method of claim 1, wherein the one or more mapping syntax elements are at a sequence level, a view level, an image level, a slice level, a sequence parameter set, a video parameter set, or an application. The parameter set is identified in the bit stream of the 360 degree panoramic video sequence. 如申請專利範圍第1項所述之全向圖像處理方法,其中該一個或多個映射語法元素基於一個或多個參考映射語法元素被預測地標識。 The omnidirectional image processing method of claim 1, wherein the one or more mapping syntax elements are predictively identified based on one or more reference mapping syntax elements. 如申請專利範圍第16項所述之全向圖像處理方法,其中該一個或多個參考映射語法元素的多個集合以序列級別、視圖級別、或圖像級別標識在用於360度全景視訊序列的位元流中,並且在切片級別或圖像級別中標識標誌,以從該當前全方向圖像集合的一個或多個參考映射語法元素的該多個集合中選擇所述一個或多個映射語法元素。 The omnidirectional image processing method of claim 16, wherein the plurality of sets of one or more reference mapping syntax elements are identified at a sequence level, a view level, or an image level for use in 360 degree panoramic video. Identifying a flag in a stream of sequences and at a slice level or an image level to select the one or more of the plurality of sets of one or more reference mapping syntax elements of the current omnidirectional image set Map syntax elements. 如申請專利範圍第16項所述之全向圖像處理方法,其中,一個或多個參考映射語法元素由來自先前圖片、切片或訊框的一個或多個第一映射語法元素預測。 The omnidirectional image processing method of claim 16, wherein the one or more reference mapping syntax elements are predicted by one or more first mapping syntax elements from a previous picture, slice or frame. 一種全向圖像處理裝置,所述全向圖像處理裝置包括一個或多個電子電路或處理器,其被佈置為:使用所選擇的投影格式接收從360度全景視訊序列中的每 個球形圖像轉換的當前全向圖像集合,其中所選擇的投影格式屬於一投影格式組,該投影格式包括一立方面格式,以及具有該立方面格式的該當前全方向圖像集合由六個立方面組成;以及如果該所選擇的投影格式對應於該立方面格式:標識一個或多個映射語法元素以將該當前全方向圖像集合映射到一當前立方體貼圖圖像,該當前立方體貼圖圖像屬於一輸出佈局格式集合,該輸出佈局格式集合包括從包括立方體貼圖佈局、cubemap_32佈局、cubemap_180佈局、plane_poles佈局、plane_poles_6佈局、plane_poles_cubemap佈局、plane_cubemap佈局、plane_cubemap_32佈局、flat_fixed佈局、cubemap_1x6佈局、cubemap_2x3佈局、cubemap_3x2佈局,和cubmap_6x1佈局的佈局集合中選擇的至少兩個輸出佈局格式;以及提供一位元流中的編解碼資料,該位元流包括用於該當前全方向圖像集合的該一個或多個映射語法元素。 An omnidirectional image processing apparatus comprising one or more electronic circuits or processors arranged to receive each of a 360 degree panoramic video sequence using the selected projection format a current omnidirectional image set of spherical image transformations, wherein the selected projection format belongs to a projection format group, the projection format includes a vertical aspect format, and the current omnidirectional image set having the vertical aspect format is composed of six a personal aspect composition; and if the selected projection format corresponds to the aspect format: identifying one or more mapping syntax elements to map the current omnidirectional image collection to a current cubemap image, the current cubemap The image belongs to an output layout format set including from cube map layout, cubemap_32 layout, cubemap_180 layout, plane_poles layout, plane_poles_6 layout, plane_poles_cubemap layout, plane_cubemap layout, plane_cubemap_32 layout, flat_fixed layout, cubemap_1x6 layout, cubemap_2x3 layout At least two output layout formats selected from a layout set of cubemap_3x2, and a layout of the cubmap_6x1 layout; and a codec material in a one-bit stream, the bitstream including the one or for the current omnidirectional image collection Multiple images Syntax elements.
TW106121387A 2016-06-30 2017-06-27 Method and apparatus for mapping omnidirectional image to a layout output format TWI637356B (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201662356571P 2016-06-30 2016-06-30
US62/356,571 2016-06-30
US15/585,224 US10249019B2 (en) 2016-05-06 2017-05-03 Method and apparatus for mapping omnidirectional image to a layout output format
US15/585,224 2017-05-03

Publications (2)

Publication Number Publication Date
TW201802771A TW201802771A (en) 2018-01-16
TWI637356B true TWI637356B (en) 2018-10-01

Family

ID=61725159

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106121387A TWI637356B (en) 2016-06-30 2017-06-27 Method and apparatus for mapping omnidirectional image to a layout output format

Country Status (1)

Country Link
TW (1) TWI637356B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI690728B (en) * 2018-03-02 2020-04-11 聯發科技股份有限公司 Method for processing projection-based frame that includes projection faces packed in cube-based projection layout with padding

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1491403A (en) * 2001-10-29 2004-04-21 ���ṫ˾ Non-flat image processing apparatus and image processing method, and recording medium and computer program
WO2015128801A2 (en) * 2014-02-26 2015-09-03 Ecole Polytechnique Federale De Lausanne (Epfl) Large field of view multi-camera endoscopic apparatus with omni-directional illumination
TWI521953B (en) * 2012-09-21 2016-02-11 高通公司 Indication and activation of parameter sets for video coding
TWI535271B (en) * 2013-01-04 2016-05-21 高通公司 Signaling of spatial resolution of depth views in multiview coding file format

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1491403A (en) * 2001-10-29 2004-04-21 ���ṫ˾ Non-flat image processing apparatus and image processing method, and recording medium and computer program
TWI521953B (en) * 2012-09-21 2016-02-11 高通公司 Indication and activation of parameter sets for video coding
TWI535271B (en) * 2013-01-04 2016-05-21 高通公司 Signaling of spatial resolution of depth views in multiview coding file format
WO2015128801A2 (en) * 2014-02-26 2015-09-03 Ecole Polytechnique Federale De Lausanne (Epfl) Large field of view multi-camera endoscopic apparatus with omni-directional illumination

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
G. Tech, Y. Chen, K. Müller, J.-R. Ohm, A. Vetro, and Y.-K. Wang, Overview of the Multiview and 3D Extensions of High Efficiency Video Coding, IEEE Transactions on Circuits and Systems for Video Technology, vol. 26, no. 1, pp. 35-49, January 2016, doi: 10.1109/TCSVT.2015.2477935. *
P. Merkle, A. Smolic, K. Müller, and T. Wiegand, "Efficient Prediction Structures for Multiview Video Coding," IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 11, pp. 1461- 1473, November 2007. *

Also Published As

Publication number Publication date
TW201802771A (en) 2018-01-16

Similar Documents

Publication Publication Date Title
CN109076240B (en) Method and apparatus for mapping an omnidirectional image to a layout output format
US10264282B2 (en) Method and apparatus of inter coding for VR video using virtual reference frames
US20190191180A1 (en) Method for sub-pu motion information inheritance in 3d video coding
US10085041B2 (en) Method for depth lookup table signaling
CN106471807B (en) The coding method of three-dimensional or multi-angle video including visual angle synthesis prediction
US20190191170A1 (en) System and method for improving efficiency in encoding/decoding a curved view video
TWI655862B (en) Video encoding or decoding method and device
US11659206B2 (en) Video encoding method with syntax element signaling of guard band configuration of projection-based frame and associated video decoding method and apparatus
CN110022481A (en) Decoding, generation method and device, storage medium, the electronic device of video code flow
US20150071362A1 (en) Image encoding device, image decoding device, image encoding method, image decoding method and program
Chan et al. Overview of current development in depth map coding of 3D video and its future
TWI637356B (en) Method and apparatus for mapping omnidirectional image to a layout output format
US20180084276A1 (en) Method of Lookup Table Size Reduction for Depth Modelling Mode in Depth Coding
CN111726598B (en) Image processing method and device
WO2023221764A1 (en) Video encoding method, video decoding method, and related apparatus
EP3949420A1 (en) A method and apparatus for encoding and decoding volumetric video
US11736725B2 (en) Methods for encoding decoding of a data flow representing of an omnidirectional video
CN114930812B (en) Method and apparatus for decoding 3D video
KR20220066328A (en) Method and apparatus for encoding, transmitting and decoding volumetric video
EP3349460A1 (en) Method and apparatus for coding/decoding a picture of an omnidirectional video
EP3349462A1 (en) Method and apparatus for coding/decoding a picture of an omnidirectional video
US11190768B2 (en) Video encoding method with syntax element signaling of packing of projection faces derived from cube-based projection and associated video decoding method and apparatus
KR20220069040A (en) Method and apparatus for encoding, transmitting and decoding volumetric video
CN116235497A (en) Method and apparatus for signaling depth of volumetric video based on multi-planar images