TW201711477A - Video encoding methods and systems using adaptive color transform - Google Patents

Video encoding methods and systems using adaptive color transform Download PDF

Info

Publication number
TW201711477A
TW201711477A TW105114323A TW105114323A TW201711477A TW 201711477 A TW201711477 A TW 201711477A TW 105114323 A TW105114323 A TW 105114323A TW 105114323 A TW105114323 A TW 105114323A TW 201711477 A TW201711477 A TW 201711477A
Authority
TW
Taiwan
Prior art keywords
coding
size
threshold
act
unit
Prior art date
Application number
TW105114323A
Other languages
Chinese (zh)
Other versions
TWI597977B (en
Inventor
張耀仁
林俊隆
涂日昇
林敬傑
Original Assignee
財團法人工業技術研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US14/757,556 external-priority patent/US20160360205A1/en
Application filed by 財團法人工業技術研究院 filed Critical 財團法人工業技術研究院
Priority to CN201610357374.8A priority Critical patent/CN106254870B/en
Priority to US15/196,108 priority patent/US10390020B2/en
Publication of TW201711477A publication Critical patent/TW201711477A/en
Application granted granted Critical
Publication of TWI597977B publication Critical patent/TWI597977B/en

Links

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A video encoding method including the following step is provided. A source video frame is received. The source video frame is divided into a coding tree unit. A coding unit is determined from the coding tree unit. A coding mode of the coding unit is enabled or disabled. If the coding mode is enabled, whether to evaluate a size of a transform unit for the enabled coding mode is determined. A transform unit of the coding unit for the enabled coding mode is determined. The size of the coding unit is defined by a number (N) of samples.

Description

採用自適性色彩轉換之視訊編碼方法及系統 Video coding method and system using adaptive color conversion

本揭露係有關於視訊編碼與解碼方法及系統。 The disclosure relates to a video encoding and decoding method and system.

對於高品質影像的需求逐漸增加。隨著4K及8K等視訊規格的來臨,極需提升視訊編碼與解碼效率。此外,消費者期待能夠透過各種傳輸媒介來傳輸與接收高品質影像。舉例來說,消費者希望能夠透過網路於攜帶式裝置(如智慧型手機、平板電腦、筆記型電腦)及家用電視與電腦上觀看高品質影像。消費者也希望能夠在視訊會議及螢幕共享的過程中顯示高品質影像。 The demand for high quality images is gradually increasing. With the advent of video formats such as 4K and 8K, it is highly desirable to improve video encoding and decoding efficiency. In addition, consumers expect to be able to transmit and receive high quality images through a variety of transmission media. For example, consumers want to be able to view high-quality images on portable devices (such as smart phones, tablets, laptops) and home TVs and computers over the Internet. Consumers also want to be able to display high-quality images during video conferencing and screen sharing.

高效率視訊編碼標準(High Efficiency Video Coding,HEVC)H.265在改進視訊壓縮的編碼與解碼效能上提供了一個新的標準。相較於原先的AVC(Advanced Video Coding)標準,由ISO/IEC JTC 1/SC 29/WG 11 MPEG(Moving Picture Experts Group)及ITU-T SG16 VCEG(Video Coding Experts Group)所建立的 HEVC能夠降低壓縮高品質視訊之資料率。AVC標準亦稱為H.264。 High Efficiency Video Coding (HEVC) H.265 provides a new standard for improving the encoding and decoding performance of video compression. Compared with the original AVC (Advanced Video Coding) standard, it is established by ISO/IEC JTC 1/SC 29/WG 11 MPEG (Moving Picture Experts Group) and ITU-T SG16 VCEG (Video Coding Experts Group). HEVC can reduce the data rate of compressed high quality video. The AVC standard is also known as H.264.

HEVC利用幀間預測技術(Inter prediction)及幀內預測技術(Intra prediction)等各種編碼工具來壓縮視訊。幀間預測技術利用視訊串流之不同視訊畫面之間的時間冗餘(temporal redundancies)來壓縮視訊資料。舉例來說,含有相似內容之已編碼及已解碼之視訊畫面可用來編碼目前之視訊畫面。這些已編碼及已解碼之視訊畫面可以用來預測目前視訊畫面之編碼區域。相對地,幀內預測技術僅利用目前編碼視訊畫面之內部資料來壓縮視訊資料。幀內預測技術並不使用不同視訊畫面之時間冗餘。舉例來說,目前視訊畫面是利用同一畫面的另一部份來進行編碼。幀內預測技術包括35種幀內模式,包含平面模式(Planar mode)、DC模式及33種定向模式(directional modes)。 HEVC compresses video using various coding tools such as inter prediction and intra prediction. Inter-frame prediction techniques use temporal redundancies between different video frames of a video stream to compress video material. For example, an encoded and decoded video frame containing similar content can be used to encode the current video frame. These encoded and decoded video pictures can be used to predict the coding area of the current video picture. In contrast, intra prediction techniques only compress the video data using the internal data of the currently encoded video frame. Intra prediction techniques do not use temporal redundancy of different video frames. For example, current video pictures are encoded using another part of the same picture. Intra prediction techniques include 35 intra modes, including Planar mode, DC mode, and 33 directional modes.

相較於AVC標準,HEVC標準對各個輸入視訊畫面採用擴張分割技術(expansive partitioning and dividing)。AVC標準在編碼及解碼時僅利用輸入視訊畫面的大區塊(macroblock)進行分割。相反地,HEVC標準可以分割輸入視訊畫面成不同尺寸的資料單元及區塊,相關說明如後。相較於AVC標準,HEVC標準對動態、多細節及多邊緣之視訊畫面的編碼及解碼程序提供了更多的彈性。 Compared to the AVC standard, the HEVC standard employs expansive partitioning and dividing for each input video picture. The AVC standard uses only macroblocks of input video pictures for encoding and decoding. Conversely, the HEVC standard can divide the input video frames into different sizes of data units and blocks, as described later. Compared to the AVC standard, the HEVC standard provides more flexibility for encoding, decoding, and decoding of dynamic, multi-detail and multi-edge video images.

一些能夠改善視訊編碼程序的編碼工具亦列入於HEVC標準中。此些編碼工具被稱為編碼擴展(coding extensions)。 螢幕內容編碼擴展(Screen Content Coding extension,SCC extension)專注於改善HEVC標準下的視訊螢幕內容的處理效能。螢幕內容係為圖案、文字或動畫所成像(render)的視訊,而不是照相機所擷取的視訊場景。成像的圖案、文字或動畫可以是動態或靜態,且可以提供於照相機所擷取之視訊場景內的視訊。SCC的應用實例可以包含螢幕鏡射(Screen mirroring)、雲端遊戲(cloud gaming)、無線顯示內容(wireless display of content)、遠端電腦存取時的顯示(displays generated during remote computer desktop access)及螢幕共享(screen sharing)(例如是視訊會議之即時螢幕共享)。 Some coding tools that improve the video coding process are also included in the HEVC standard. Such coding tools are called coding extensions. The Screen Content Coding extension (SCC extension) focuses on improving the processing performance of video content under the HEVC standard. The screen content is the video that is rendered by the pattern, text, or animation, not the video scene captured by the camera. The imaged pattern, text or animation can be dynamic or static and can be provided for video within the video scene captured by the camera. Application examples of the SCC may include screen mirroring, cloud gaming, wireless display of content, displays generated during remote computer desktop access, and screens. Screen sharing (for example, instant screen sharing of video conferencing).

SCC內之一編碼工具係為自適性色彩轉換(adaptive color transform,ACT)。ACT係為應用於編碼單元(coding unit,CU)之殘差像素樣本(residue pixel samples)的色彩空間轉換。對特定的色彩空間而言,已存在一編碼單元(CU)之一畫素的色彩元素(color components)之相關性。當畫素之色彩元素的相關性高時,於畫素執行ACT可以透過去相關性(de-correlating)來幫助相關的色彩元素集中能量。這種集中能量的作法能夠提高編碼效率,並降低編碼成本。因此,ACT能夠在HEVC編碼過程中增進編碼效能。 One of the coding tools in the SCC is an adaptive color transform (ACT). The ACT is a color space conversion applied to a residual pixel sample of a coding unit (CU). For a particular color space, there is already a correlation of the color components of one of the coding units (CU). When the correlation of the color elements of the pixels is high, performing ACT on the pixels can help the related color elements to concentrate energy through de-correlating. This concentrated energy approach can increase coding efficiency and reduce coding costs. Therefore, ACT can improve coding efficiency in the HEVC encoding process.

然而,在編碼過程中,需要額外的碼率失真函數(rate distortion optimization,RDO)來評估是否啟用ACT。RDO用來評估碼率失真(rate distortion,RD)的成本。這些評估過程可能 會增加編碼複雜度及編碼時間。再者,當畫素之色彩元素已經去相關時,ACT可能就不是必須的。在這種情況下,由於執行ACT的成本高於編碼的效益,進一步對色彩元素進行的去相關性程序可能無法帶來任何好處。 However, in the encoding process, an additional rate distortion optimization (RDO) is needed to evaluate whether ACT is enabled. RDO is used to estimate the cost of rate distortion (RD). These assessment processes may Will increase the coding complexity and coding time. Furthermore, ACT may not be necessary when the color elements of the pixels have been correlated. In this case, since the cost of executing the ACT is higher than the benefit of the encoding, further de-correlation procedures for color elements may not bring any benefit.

根據本揭露之一方面,提供一種視訊編碼方法。視訊編碼方法包括以下步驟。接收一原始視訊畫面(source video frame)。分割原始視訊畫面為一編碼樹單元(coding tree unit)。從編碼樹單元決定一編碼單元(coding unit)。啟用或禁用編碼單元之一編碼模式(coding mode)。若啟用編碼模式,則於啟用該編碼模式判斷是否估計一轉換單元(transform unit)之尺寸。於啟用之編碼模式決定編碼單元之轉換單元。編碼單元之尺寸係為NxN。 According to an aspect of the disclosure, a video encoding method is provided. The video encoding method includes the following steps. Receive a source video frame. The original video picture is divided into a coding tree unit. A coding unit is determined from the coding tree unit. Enable or disable one of the coding units' coding modes. If the encoding mode is enabled, the encoding mode is enabled to determine whether to estimate the size of a transform unit. The coding mode enabled determines the conversion unit of the coding unit. The size of the coding unit is NxN.

根據本揭露之另一方面,提供一種視訊編碼系統。視訊編碼系統包括一記憶體及一處理器。記憶體用以儲存一組指令。處理器用以執行此組指令。此組指令包括以下步驟。接收一原始視訊畫面(source video frame)。分割原始視訊畫面為一編碼樹單元(coding tree unit)。從編碼樹單元決定一編碼單元(coding unit)。啟用或禁用編碼單元之一編碼模式(coding mode)。若啟用編碼模式,則於啟用之編碼模式判斷是否估計一轉換單元(transform unit)之尺寸。於啟用之編碼模式決定編碼單元之轉 換單元。編碼單元之尺寸係為NxN。 According to another aspect of the present disclosure, a video encoding system is provided. The video coding system includes a memory and a processor. Memory is used to store a set of instructions. The processor is configured to execute the set of instructions. This set of instructions includes the following steps. Receive a source video frame. The original video picture is divided into a coding tree unit. A coding unit is determined from the coding tree unit. Enable or disable one of the coding units' coding modes. If the encoding mode is enabled, it is determined in the enabled encoding mode whether to estimate the size of a transform unit. The coding mode is enabled to determine the rotation of the coding unit Change the unit. The size of the coding unit is NxN.

根據本揭露之另一方面,提供一種非暫態電腦可讀取記錄媒體。非暫態電腦可讀取記錄媒體用以儲存一組指令。此組指令由一或多個處理器執行,以執行一視訊編碼方法。此視訊編碼方法包括以下步驟。接收一原始視訊畫面(source video frame)。分割原始視訊畫面為一編碼樹單元(coding tree unit)。從編碼樹單元決定一編碼單元(coding unit)。啟用或禁用編碼單元之一編碼模式(coding mode)。若啟用編碼模式,則於啟用之編碼模式判斷是否估計一轉換單元(transform unit)之尺寸。於啟用之編碼模式決定編碼單元之轉換單元。編碼單元之尺寸係為NxN。 According to another aspect of the present disclosure, a non-transitory computer readable recording medium is provided. A non-transitory computer can read a recording medium for storing a set of instructions. The set of instructions is executed by one or more processors to perform a video encoding method. This video encoding method includes the following steps. Receive a source video frame. The original video picture is divided into a coding tree unit. A coding unit is determined from the coding tree unit. Enable or disable one of the coding units' coding modes. If the encoding mode is enabled, it is determined in the enabled encoding mode whether to estimate the size of a transform unit. The coding mode enabled determines the conversion unit of the coding unit. The size of the coding unit is NxN.

為了對本揭露之上述及其他方面有更佳的瞭解,下文特舉多個實施例,並配合所附圖式,作詳細說明如下: In order to better understand the above and other aspects of the present disclosure, various embodiments are described below, and in conjunction with the drawings, the detailed description is as follows:

101‧‧‧視訊畫面(video frame) 101‧‧‧Video video (video frame)

102‧‧‧編碼樹單元(coding tree unit,CTU) 102‧‧‧coding tree unit (CTU)

103‧‧‧亮度編碼樹區塊(luma coding tree block,luma CTB) 103‧‧‧Luma coding tree block (luma CTB)

104‧‧‧Cb CTB 104‧‧‧Cb CTB

105‧‧‧Cr CTB 105‧‧‧Cr CTB

106、111‧‧‧相關說明 106, 111‧‧‧Related notes

107-1、107-2、107-3、107-4‧‧‧亮度編碼區塊(luma coding block,luma CB) 107-1, 107-2, 107-3, 107-4‧‧‧ luma coding block (luma CB)

108‧‧‧編碼單元(Coding unit,CU) 108‧‧‧Coding unit (CU)

109‧‧‧Cb CB 109‧‧‧Cb CB

110‧‧‧Cr CB 110‧‧‧Cr CB

112‧‧‧亮度預測區塊(luma prediction block,PB) 112‧‧‧Luma prediction block (PB)

113-1、113-2、113-3、113-4‧‧‧轉換區塊(transform block,TB) 113-1, 113-2, 113-3, 113-4‧‧‧transform block (TB)

114‧‧‧轉換單元(Transform unit,TU) 114‧‧‧Transformation unit (TU)

200‧‧‧視訊編碼器 200‧‧‧Video Encoder

202‧‧‧畫面分割模組(Frame Dividing Module) 202‧‧‧Frame Dividing Module

204‧‧‧幀間預測啟用ACT模組(Inter Prediction enabling adaptive color transformation Module) 204‧‧‧Inter Prediction enabling adaptive color transformation Module

206‧‧‧幀間預測禁用ACT模組(Inter Prediction disabling ACT Module) 206‧‧‧Inter Prediction disabling ACT Module

208‧‧‧畫面暫存器(Frame Buffer) 208‧‧‧ Frame Buffer

210‧‧‧模式決定模組(Mode Decision Module) 210‧‧‧Mode Decision Module

212‧‧‧幀內預測啟用ACT模組(Intra Prediction enabling ACT Module) 212‧‧‧Intra Prediction enabling ACT Module

214‧‧‧幀內預測禁用ACT模組(Intra Prediction disabling ACT Module) 214‧‧‧Intra Prediction disabling ACT Module

216、218‧‧‧加總模組(Summing Module) 216, 218‧‧‧Summing Module

220‧‧‧切換器 220‧‧‧Switch

222‧‧‧自適性色彩轉換(ACT)模組 222‧‧‧Adaptive Color Conversion (ACT) Module

224‧‧‧CCP、轉換及量化模組(CCP,Transform,and Quantization Module) 224‧‧‧CCP, Conversion and Quantization Module (CCP, Transform, and Quantization Module)

226‧‧‧熵編碼模組(Entropy Coding Module) 226‧‧‧Entropy Coding Module

228‧‧‧反運算CCP、轉換及量化模組(Inverse CCP,Transform,and Quantization Module) 228‧‧‧Inverse CCP, Transform, and Quantization Module

230‧‧‧切換器 230‧‧‧Switch

232‧‧‧反運算ACT模組(Inverse ACT Module) 232‧‧‧Inverse ACT Module

300、400、500、600、700、800‧‧‧編碼方法 300, 400, 500, 600, 700, 800‧‧‧ encoding methods

304‧‧‧成分相關性分析(component correlation analysis) 304‧‧‧Component correlation analysis

306‧‧‧概略模式決定(Rough mode decision) 306‧‧‧ Rough mode decision

308‧‧‧結束 End of 308‧‧

310‧‧‧碼率失真函數模式決定(rate distortion optimization mode decision,RDO mode decision) 310‧‧‧rate distortion optimization mode decision (RDO mode decision)

311‧‧‧色度格式是否為非444(non-444)之判斷 311‧‧‧ Whether the chroma format is judged by non-444 (non-444)

312‧‧‧CU尺寸是否大於臨界值T1之判斷 312‧‧‧Determination of whether the CU size is greater than the critical value T1

314‧‧‧TU尺寸決定(TU size ecision) 314‧‧‧TU size decision (TU size ecision)

316‧‧‧色度模式決定(chroma mode decision) 316‧‧‧chroma mode decision

402‧‧‧CU尺寸是否小於臨界值T2之判斷 Whether the size of the 402‧‧‧ CU is less than the critical value T2

702‧‧‧非暫態電腦可讀取媒體 702‧‧‧ Non-transitory computer readable media

704‧‧‧處理器 704‧‧‧ processor

第1A~1J圖繪示視訊畫面及根據本揭露數個實施例之相關的分割。 Figures 1A-1J illustrate video screens and related segments in accordance with several embodiments of the present disclosure.

第2圖繪示本揭露之視訊編碼器。 Figure 2 illustrates the video encoder of the present disclosure.

第3圖說明根據本揭露一實施例之編碼方法。 Figure 3 illustrates an encoding method in accordance with an embodiment of the present disclosure.

第4圖說明根據本揭露另一實施例之編碼方法。 Figure 4 illustrates an encoding method in accordance with another embodiment of the present disclosure.

第5圖說明根據本揭露另一實施例之編碼方法。 Figure 5 illustrates an encoding method in accordance with another embodiment of the present disclosure.

第6圖說明根據本揭露另一實施例之編碼方法。 Figure 6 illustrates an encoding method in accordance with another embodiment of the present disclosure.

第7圖說明非444色度格式之IPM的演算流程。 Figure 7 illustrates the flow of the IPM in a non-444 chroma format.

第8圖繪示執行本揭露之編碼與解碼方法的系統。 Figure 8 illustrates a system for performing the encoding and decoding methods of the present disclosure.

以下將搭配附圖詳細地說明示例性的實施例。在下面描述的附圖中,除非另有說明,於不同圖式之相同標號代表相同或近似的元件。以下提出的實施例並非代表本揭露的所有實施情況。事實上,這些實施例僅僅是對應於申請專利範圍之系統與方法的一些實例。 The exemplary embodiments will be described in detail below with reference to the accompanying drawings. In the drawings described below, the same reference numerals are used to refer to the same or the The embodiments presented below are not representative of all implementations of the present disclosure. In fact, these embodiments are merely some examples of systems and methods that correspond to the scope of the patent application.

第1A~1J圖說明根據本揭露之實施例的視訊畫面及其相關的分割。 1A-1J illustrate a video screen and its associated segmentation in accordance with an embodiment of the present disclosure.

第1A圖繪示視訊畫面101。視訊畫面101包括數個畫素。視訊畫面101被分割為數個編碼樹單元(coding tree units,CTUs)102。每個CTU 102的尺寸是根據L個垂直樣本及L個水平樣本(LxL)來決定。每個樣本於CTU的不同畫素位置對應於一畫素值。舉例來說,L可以是16、32、或64。畫素位置可以是畫素於CTU所在的位置或畫素之間的位置。當畫素位置是畫素之間的位置,畫素值可以是畫素位置附近之一或多個畫素的內插值。各個CTU 102包括亮度編碼樹區塊(luma coding tree block,luma CTB)、色度編碼樹區塊(chroma CTB)及相關說明(associated syntax)。 FIG. 1A illustrates a video screen 101. The video screen 101 includes a number of pixels. The video screen 101 is divided into a number of coding tree units (CTUs) 102. The size of each CTU 102 is determined based on L vertical samples and L horizontal samples (LxL). Each sample corresponds to a pixel value at a different pixel location of the CTU. For example, L can be 16, 32, or 64. The pixel position can be the position where the pixel is located or the position between the pixels. When the pixel position is the position between the pixels, the pixel value may be an interpolation value of one or more pixels near the pixel position. Each CTU 102 includes a luma coding tree block (luma CTB), a chroma coding tree block (chroma CTB), and an associated syntax.

第1B圖繪示數個CTB可以被包含於第1A圖之一個CTU 102中。舉例來說,CTU 102可以包含亮度CTB(luma CTB)103、色度CTB(chroma CTB)(含Cb CTB 104 Cr CTB 105)。CTU 102也可以包括相關說明(associated syntax)106。Cb CTB 104係為藍色色差CTB(blue difference chroma component CTB),其表示CTB在藍色的變化。Cr CTB 105係為紅色色差CTB(red difference chroma component CTB),其表示CTB在紅色的變化。相關說明106包含亮度CTB 103、Cb CTB 104及Cr CTB 105如何被編碼的資訊、以及亮度CTB 103、Cb CTB 104及Cr CTB 105的進一步分割。CTB 103、Cb CTB 104及Cr CTB 105之尺寸可以相同於CTU 102之尺寸。或者,亮度CTB 103之尺寸可以相同於CTU 102之尺寸,但Cb CTB 104及Cr CTB 105之尺寸可以小於CTU 102之尺寸。 FIG. 1B illustrates that several CTBs can be included in one CTU 102 of FIG. 1A. For example, CTU 102 can include luminance CTB (luma CTB) 103, chroma CTB (chroma CTB) (including Cb CTB 104 Cr CTB 105). The CTU 102 may also include an associated syntax 106. Cb CTB 104 is a blue difference chroma component CTB, which indicates a change in CTB in blue. Cr CTB 105 is a red difference chroma component CTB, which indicates a change in CTB in red. The related description 106 includes information on how the luminance CTB 103, Cb CTB 104, and Cr CTB 105 are encoded, and further segmentation of the luminance CTB 103, Cb CTB 104, and Cr CTB 105. The CTB 103, Cb CTB 104, and Cr CTB 105 may be the same size as the CTU 102. Alternatively, the brightness CTB 103 may be the same size as the CTU 102, but the Cb CTB 104 and Cr CTB 105 may be smaller than the CTU 102.

幀內預測(intra prediction)、幀間預測(inter prediction)及其他等編碼工具運作於編碼區塊(coding blocks,CBs)上。為了決定編碼的程序是要採用幀內預測還是幀間預測,CTB可以被分割為一或多個CB。CTB分割為CB的程序是根據四分樹分割(quad-tree partitioning)技術。因此,CTB可以分割為四個CB,各個CB可以再分割為四個CB。根據CTB的尺寸,可以繼續進行這樣的分割程序。 Intra prediction, inter prediction, and other coding tools operate on coding blocks (CBs). To determine whether the coded program is to use intra prediction or inter prediction, the CTB can be partitioned into one or more CBs. The program for dividing CTB into CB is based on quad-tree partitioning. Therefore, the CTB can be divided into four CBs, and each CB can be further divided into four CBs. According to the size of the CTB, such a division procedure can be continued.

第1C圖繪示第1B圖之亮度CTB 103被分割為一或多個亮度CB 107-1、107-2、107-3或107-4。以64x64的亮度CTB 為例,對應的亮度CB 107-1、107-2、107-3或107-4可以是NxN的尺寸,例如是64x64、32x32、16x16或8x8。在第1C圖中,亮度CTB 103的尺寸為64x64。而亮度CTB 103的尺寸可以為32x32或16x16。 1C shows that the luminance CTB 103 of FIG. 1B is divided into one or more luminances CB 107-1, 107-2, 107-3 or 107-4. CTB with a brightness of 64x64 For example, the corresponding brightness CB 107-1, 107-2, 107-3 or 107-4 may be the size of NxN, for example 64x64, 32x32, 16x16 or 8x8. In the 1Cth picture, the size of the luminance CTB 103 is 64x64. The brightness CTB 103 can be 32x32 or 16x16 in size.

第1D圖繪示第1B圖之亮度CTB 103進行四分樹分割之一實例,其中亮度CTB 103分割為第1C圖之亮度CB 107-1、107-2、107-3或107-4。在第1D圖中,亮度CTB 103的尺寸係為64x64。然而,亮度CTB 103的尺寸亦可以是32x32或16x16。 FIG. 1D illustrates an example in which the luminance CTB 103 of FIG. 1B performs quadtree partitioning, in which the luminance CTB 103 is divided into luminances CB 107-1, 107-2, 107-3, or 107-4 of the 1Cth graph. In the 1D diagram, the size of the luminance CTB 103 is 64x64. However, the brightness CTB 103 may also be 32x32 or 16x16 in size.

在第1D圖中,亮度CTB 103分割為四個32x32的亮度CB 107-2。各個32x32的亮度CB可以更分割為四個16x16的亮度CB 107-3。各個16x16的亮度CB可以更分割為四個8x8的亮度CB 107-4。 In the 1D picture, the luminance CTB 103 is divided into four 32x32 luminances CB 107-2. The brightness CB of each 32x32 can be further divided into four 16x16 brightness CB 107-3. The brightness CB of each 16x16 can be further divided into four 8x8 brightness CB 107-4.

編碼單元(Coding unit,CU)用以編碼CB。CTB可以包括唯一一個CU、或者分割為數個CU。因此CU的尺寸也可以是NxN,例如是64x64、32x32、16x16或8x8。各個CU包括一個亮度CB、兩個色度CB及相關說明。於編碼及解碼程序中產生之殘差CU的尺寸可相同於其對應之CU的尺寸。 A coding unit (CU) is used to encode the CB. The CTB can include a single CU or be divided into several CUs. Therefore, the size of the CU can also be NxN, for example 64x64, 32x32, 16x16 or 8x8. Each CU includes a luminance CB, two chrominance CBs, and related descriptions. The size of the residual CU generated in the encoding and decoding process may be the same as the size of its corresponding CU.

第1E圖繪示CB(第1C圖之亮度CB 107-1)的示意圖,此些CB可以是CU 108的一部份。舉例來說,CU 108可以包括亮度CB 107-1及色度CB(Cb CB 109)及色度CB(Cr CB 110)。CU 108可以包括相關說明111。相關說明111包含如何對亮度CB 107-1、Cb CB 109及Cr CB 110進行編碼的資訊,例如 是四分樹資訊的說明(亮度CB及色度CB的尺寸、位置與進一步的分割)。各個CU 108可於亮度CB 107-1、Cb CB 109及Cr CB 110具有相關的預測區塊(prediction blocks,PBs)。預測區塊組合成預測單元(prediction units,PUs)。 FIG. 1E is a schematic diagram showing CB (brightness CB 107-1 of FIG. 1C), and such CBs may be part of CU 108. For example, CU 108 may include luminance CB 107-1 and chrominance CB (Cb CB 109) and chrominance CB (Cr CB 110). The CU 108 can include a related description 111. Related Description 111 contains information on how to encode luminances CB 107-1, Cb CB 109, and Cr CB 110, such as It is the description of the quadtree information (the size, position and further division of the luminance CB and the chrominance CB). Each CU 108 may have associated prediction blocks (PBs) at luminances CB 107-1, Cb CB 109, and Cr CB 110. The prediction blocks are combined into prediction units (PUs).

第1F圖繪示第1D圖之CB 107-1分割為亮度PB 112的各種可能情況。亮度CB 107-1例如是根據亮度CB 107-1之不同區域的可預測性來分割為亮度PB 112。舉例來說,亮度CB 107-1可以包含單一個亮度PB 112,其尺寸相同於亮度CB 107-1。或者,亮度CB 107-1可以垂直地或水平地分割為兩個偶數亮度PB 112。或者亮度CB 107-1可以垂直地或水平地分割為四個亮度PB 112。需說明的是第1F圖僅僅作為示例。在HEVC標準下的任何分割為PB的方式都屬於本揭露的範圍。第1F圖所繪示將亮度CB 107-1分割為亮度PB 112的方式是互斥的。舉例來說,在HEVC的幀內預測模式中,64x64、32x32及16x16的CB可能被分割為單一個PB,其尺寸相同於CB。然而,8x8的CB可能被分割為單一個8x8的PB或四個4x4的PB。 FIG. 1F illustrates various possible cases in which the CB 107-1 of the 1D map is divided into the luminance PB 112. The luminance CB 107-1 is divided into the luminance PB 112, for example, according to the predictability of different regions of the luminance CB 107-1. For example, the brightness CB 107-1 may include a single brightness PB 112 that is the same size as the brightness CB 107-1. Alternatively, the luminance CB 107-1 may be vertically or horizontally divided into two even luminances PB 112. Alternatively, the luminance CB 107-1 may be divided into four luminances PB 112 vertically or horizontally. It should be noted that the 1Fth drawing is merely an example. Any manner of segmentation into PB under the HEVC standard is within the scope of the present disclosure. The manner in which the luminance CB 107-1 is divided into the luminance PB 112 as shown in FIG. 1F is mutually exclusive. For example, in the intra prediction mode of HEVC, 64x64, 32x32, and 16x16 CBs may be split into a single PB, which is the same size as CB. However, an 8x8 CB may be split into a single 8x8 PB or four 4x4 PBs.

一但採用幀內預測或幀間預測,由預測區塊與來源視訊影像區塊之間不同處所產生之殘差訊號(residual signal)被轉換至另一域(domain),以進一步進行離散餘弦轉換(discrete cosine transform,DCT)或離散正弦變換(discrete sine transform,DST)的編碼。為了提供這些轉換,各個CU或各個CB需要利用一或多個轉換區塊(transform block,TB)。 Once intra prediction or inter prediction is used, the residual signal generated by the difference between the prediction block and the source video image block is converted to another domain for further discrete cosine conversion. (discrete cosine transform, DCT) or discrete sine transform (DST) encoding. In order to provide these conversions, each CU or individual CBs need to utilize one or more transform blocks (TBs).

第1G圖繪示第1E圖或第1F圖之亮度CB 107-1如何被分割為不同的TB 113-1、113-2、113-3及113-4。若亮度CB 107-1係為64x64的CB,TB 113-1係為32x32的TB,TB 113-2係為16x16的TB,TB 113-3係為8x8的TB,並且TB 113-4係為4x4的TB。亮度CB 107-1可以被分割為4個TB 113-1、16個TB 113-2、64個TB 113-3及256個TB 113-4。一個亮度CB 107-1可以被分割為相同尺寸的TB 113或不同尺寸的TB 113。 Fig. 1G shows how the luminance CB 107-1 of the 1E or 1F is divided into different TBs 113-1, 113-2, 113-3, and 113-4. If the brightness CB 107-1 is 64x64 CB, TB 113-1 is 32x32 TB, TB 113-2 is 16x16 TB, TB 113-3 is 8x8 TB, and TB 113-4 is 4x4 TB. The luminance CB 107-1 can be divided into 4 TBs 113-1, 16 TBs 113-2, 64 TBs 113-3, and 256 TBs 113-4. One luminance CB 107-1 can be divided into TB 113 of the same size or TB 113 of different sizes.

將CB分割為TB的程序係根據四分樹分割(quad-tree splitting)。因此,一個CB可以被分割為一或多個TB,其中各個TB可以更進一步被分割為4個TB。這樣的分割程序可以根據CB的尺寸來繼續進行。 The program for dividing CB into TB is based on quad-tree splitting. Therefore, one CB can be divided into one or more TBs, where each TB can be further divided into 4 TBs. Such a segmentation program can continue according to the size of the CB.

第1H圖繪示第1E圖或第1F圖之亮度CB 107-1之四分樹分割,其利用各種分割方式分割為第1G圖之TB 113-1、113-2、113-3或113-4。在第1H圖中,亮度CB 107-1之尺寸為64x64。然而,亮度CB 107-1之尺寸亦可以是32x32或16x16。 FIG. 1H is a diagram showing a quadtree division of luminance CB 107-1 of FIG. 1E or FIG. 1F, which is divided into TBs 113-1, 113-2, 113-3 or 113 of the 1G map by various division methods. 4. In the 1H picture, the size of the luminance CB 107-1 is 64x64. However, the size of the brightness CB 107-1 may also be 32x32 or 16x16.

在第1H圖中,亮度CB 107-1被分割為四個32x32的TB 113-1。各個32x32的TB可以更進一步被分割為4個16x16的TB 113-2。各個16x16的TB可以更進一步被分割為4個8x8的TB 113-3。各個8x8的TB可以更進一步被分割為4個4x4的TB 113-4。 In the 1Hth picture, the luminance CB 107-1 is divided into four 32x32 TBs 113-1. Each 32x32 TB can be further divided into four 16x16 TB 113-2. Each 16x16 TB can be further divided into four 8x8 TB 113-3s. Each 8x8 TB can be further divided into four 4x4 TBs 113-4.

TB 113接著以進行DCT或任何HEVC標準的轉換。轉換單元(Transform units,TUs)匯總TB 113。一或多個TB被 各個CB採用。CB形成各個CU。因此,轉換單元(TU)的結構於不同的CU 108是不同的,並且是由CU 108來決定的。 TB 113 is then subjected to conversion by DCT or any HEVC standard. Transform units (TUs) summarize TB 113. One or more TBs are Adopted by each CB. The CB forms each CU. Therefore, the structure of the conversion unit (TU) is different for different CUs 108 and is determined by the CU 108.

第1I圖繪示TU 114各種不同分割的TB 113-1、113-2、113-3及113-4。各個TU匯總第1G圖或第1H圖分割的TB。32x32的TU 114可以採用32x32的單一個TB 113-1、或一或多個16x16的TB 113-2、8x8的TB 113-3、或4x4的TB 113-4。對採用HEVC之幀間預測的CU而言,TU可以大於PU,使得TU可以包含PU邊界(boundaries)。然而,對採用HEVC之幀內預測的CU而言,TU可以不穿越(cross)PU邊界。 FIG. 1I illustrates various differently divided TBs 113-1, 113-2, 113-3, and 113-4 of the TU 114. Each TU summarizes the TB of the 1G map or the 1H map. The 32x32 TU 114 can be a single 32x32 TB 113-1, or one or more 16x16 TB 113-2, 8x8 TB 113-3, or 4x4 TB 113-4. For a CU employing inter-prediction of HEVC, the TU may be larger than the PU such that the TU may contain PU boundaries. However, for a CU that employs intra prediction of HEVC, the TU may not cross the PU boundary.

第1J圖繪示第1I圖之TU 114的四分樹分割,其利用第1I圖之各種TB 113-1、113-2、113-3或113-4。在第1J圖中,TU 114的尺寸係為32x32。然而,TU的尺寸可以是16x16、8x8、或4x4。 FIG. 1J illustrates a quadtree partition of the TU 114 of FIG. 1I, which utilizes various TBs 113-1, 113-2, 113-3, or 113-4 of FIG. In Figure 1J, the size of the TU 114 is 32x32. However, the size of the TU can be 16x16, 8x8, or 4x4.

在第1J圖中,TU 114被分割為一個32X32的TB 113-1及4個16x16的TB 113-2。各個16x16的TB可以更進一步分割為4個8x8的TB 113-3。各個8x8的TB可以更進一步分割為4個4x4的TB 113-4。 In Figure 1J, the TU 114 is split into a 32x32 TB 113-1 and four 16x16 TB 113-2. Each 16x16 TB can be further divided into four 8x8 TB 113-3s. Each 8x8 TB can be further divided into four 4x4 TB 113-4.

本揭露所述之CTU、CTB、CB、CU、PU、PB、TU或TB皆可以包括HEVC標準之任何特徵(feature)、尺寸(size)及特性(property)。第1C、1E及1F圖所述之分割也可以應用於色度CTB(Cb CTB 104)、色度CTB(Cr CTB 105)及色度CB(Cb CB 109)、色度CB(Cr CB 110)。 The CTU, CTB, CB, CU, PU, PB, TU or TB described in the disclosure may include any feature, size and property of the HEVC standard. The divisions described in Figures 1C, 1E, and 1F can also be applied to chroma CTB (Cb CTB 104), chroma CTB (Cr CTB 105), and chroma CB (Cb CB 109), and chroma CB (Cr CB 110). .

第2圖繪示執行本揭露之編碼方法的視訊編碼器200。視訊編碼器200可以包括一或多個附加元件,其提供HEVC-SCC之附加的編碼功能,如調色盤模式(palette mode)、樣本自適性偏移(sample adaptive offset)及去塊濾波(de-blocking filtering)。此外,本揭露考慮到ACT的幀內預測模式及其他編碼模式,例如是ACT的幀間預測模式。 FIG. 2 illustrates a video encoder 200 that performs the encoding method of the present disclosure. Video encoder 200 may include one or more additional components that provide additional encoding functions for HEVC-SCC, such as palette mode, sample adaptive offset, and deblocking filtering. -blocking filtering). In addition, the present disclosure considers the intra prediction mode of ACT and other coding modes, such as the inter prediction mode of ACT.

視訊編碼器200接收輸入之一原始視訊畫面(source video frame)。此輸入原始視訊畫面先輸入至畫面分割模組(Frame Dividing Module)202。畫面分割模組202分割原始視訊畫面為至少一原始CTU(source CTU)。原始CU(source CU)再由原始CTU來獲得。原始CTU的尺寸及原始CU的尺寸係由畫面分割模組202來決定。接著,以逐CU的方式進行編碼。原始CU由畫面分割模組202輸出後,輸入至幀間預測啟用ACT模組(Inter Prediction enabling adaptive color transformation Module)204、幀間預測禁用ACT模組(Inter Prediction disabling ACT Module)206、幀內預測啟用ACT模組(Intra Prediction enabling ACT Module)212及幀內預測禁用ACT模組(Intra Prediction disabling ACT Module)214。 Video encoder 200 receives one of the input source video frames. The input original video picture is first input to the Frame Dividing Module 202. The picture segmentation module 202 divides the original video picture into at least one original CTU (source CTU). The original CU (source CU) is then obtained by the original CTU. The size of the original CTU and the size of the original CU are determined by the picture segmentation module 202. Next, encoding is performed in a CU-by-CU manner. The original CU is outputted by the screen segmentation module 202, and then input to the Inter Prediction enabling adaptive color module 204, the Inter Prediction disabling ACT Module 206, and the intra prediction. The ACT module (Intra Prediction enabling ACT Module) 212 and the Intra Prediction disabling ACT Module 214 are enabled.

輸入畫面之原始CU被幀間預測啟用ACT模組204編碼,其利用幀間預測技術且啟用自適性色彩轉換(ACT)自輸入畫面決定一原始CU之預測。輸入畫面之原始CU也被幀間預測禁用ACT模組206編碼,其利用幀間預測技術且不啟用自適性 色彩轉換(ACT)自輸入畫面決定一原始CU之預測(即禁用ACT)。 The original CU of the input picture is encoded by the inter prediction enable ACT module 204, which uses inter prediction techniques and enables adaptive color conversion (ACT) to determine a prediction of the original CU from the input picture. The original CU of the input picture is also encoded by the inter prediction disable ACT module 206, which utilizes inter prediction techniques and does not enable self-adaptability. Color Conversion (ACT) determines the prediction of an original CU from the input screen (ie, disables ACT).

在幀間預測時可以使用儲存於畫面暫存器(Frame Buffer)208的參考CU。原始PU及PB也由原始CU來獲得,且使用於幀間預測啟用ACT模組204及幀間預測禁用ACT模組206的幀間預測程序。幀間預測利用不同時間之視訊畫面的區域來進行運動偵測。幀間預測啟用ACT模組204及幀間預測禁用ACT模組206之已編碼幀間預測CU預定為最高畫面品質。已編碼幀間預測CU接著被輸入至模式決定模組(Mode Decision Module)210。 A reference CU stored in a Frame Buffer 208 can be used for inter prediction. The original PU and PB are also obtained by the original CU, and are used in the inter prediction to enable the ACT module 204 and the inter prediction to disable the inter prediction program of the ACT module 206. Inter-frame prediction utilizes areas of video pictures at different times for motion detection. The interframe prediction enable ACT module 204 and the interframe prediction disable ACT module 206 have an encoded inter prediction CU that is predetermined to be the highest picture quality. The encoded inter prediction CU is then input to a Mode Decision Module 210.

輸入畫面之原始CU也藉由幀內預測啟用ACT模組212進行編碼,其利用幀內預測技術且啟用自適性色彩轉換(ACT)自輸入畫面決定一原始CU之預測。 The original CU of the input picture is also encoded by the intra prediction enable ACT module 212, which utilizes intra prediction techniques and enables adaptive color conversion (ACT) to determine a prediction of the original CU from the input picture.

輸入畫面之原始CU也藉由幀內預測禁用ACT模組214進行編碼,其利用幀內預測技術且不啟用自適性色彩轉換(ACT)自輸入畫面決定一原始CU之預測(即禁用ACT)。 The original CU of the input picture is also encoded by the intra prediction disable ACT module 214, which utilizes intra prediction techniques and does not enable adaptive color conversion (ACT) to determine a prediction of the original CU from the input picture (ie, disable ACT).

幀內預測啟用ACT模組212及幀內預測禁用ACT模組214進行幀內預測時,可使用儲存於畫面暫存器208之同一畫面的原始CU。原始PU及PB也由原始CU來獲得,且使用於幀內預測啟用ACT模組212及幀內預測禁用ACT模組214的幀內預測程序。已編碼之幀內預測CU預定為最高畫面品質。從幀內預測啟用ACT模組212及幀內預測禁用ACT模組214輸出之 已編碼幀內預測CU輸入至模式決定模組210。 When the intra prediction enable ACT module 212 and the intra prediction disable ACT module 214 for intra prediction, the original CU stored in the same picture of the picture buffer 208 can be used. The original PU and PB are also obtained by the original CU, and are used for intra prediction to enable the ACT module 212 and the intra prediction to disable the intra prediction program of the ACT module 214. The encoded intra prediction CU is predetermined to be the highest picture quality. Enable ACT module 212 and intra prediction from intra prediction to disable ACT module 214 output The encoded intra prediction CU is input to the mode decision module 210.

在模式決定模組210中,採用幀間預測啟用ACT、幀間預測禁用ACT、幀內預測啟用ACT及幀內預測禁用ACT等方式進行原始CU的編碼之成本搭配預測CU之品質進行比較。根據比較的結果,決定哪一編碼模式的預測CU(例如是幀間預測CU或幀內預測CU)。被選擇的預測CU接著被傳送至加總模組(Summing Module)216、218。 In the mode decision module 210, the cost of the original CU is compared with the quality of the predicted CU by using the inter prediction enable ACT, the inter prediction disable ACT, the intra prediction enable ACT, and the intra prediction disable ACT. Based on the result of the comparison, the prediction CU of which coding mode is determined (for example, an inter prediction CU or an intra prediction CU). The selected predicted CUs are then passed to the Summing Modules 216, 218.

在加總模組216中,原始CU減去已選擇的預測CU,以提供一剩餘CU(residual CU)。若已選擇的預測CU是來自於幀間預測啟用ACT模組204及幀內預測啟用ACT模組121之其中之一,切換器(switch)220切換至位置A。在位置A,剩餘CU輸入至ACT模組(ACT Module)222,並接著輸入至CCP、轉換及量化模組224(CCP,Transform,and Quantization Module)224。然而,若已選擇的預測CU是來自於幀間預測禁用ACT模組206及幀內預測禁用ACT模組214之其中之一,切換器220切換至位置B。在位置B,ACT模組222被跳過且在編碼過程中不被執行。剩餘CU從加總模組216直接被輸入至CCP、轉換及量化模組224。 In the summation module 216, the original CU subtracts the selected predicted CU to provide a residual CU (residual CU). If the selected predicted CU is from one of the inter prediction enabled ACT module 204 and the intra prediction enabled ACT module 121, the switch 220 switches to position A. At position A, the remaining CU is input to the ACT Module 222 and then to the CCP, Transform, and Quantization Module 224. However, if the selected predicted CU is from one of the inter prediction disabled ACT module 206 and the intra prediction disabled ACT module 214, the switch 220 switches to position B. At location B, the ACT module 222 is skipped and is not executed during the encoding process. The remaining CUs are directly input from the summation module 216 to the CCP, conversion and quantization module 224.

在ACT模組222,自適性色彩轉換(adaptive color transform)執行於剩餘CU。ACT模組222的輸出接入至CCP、轉換及量化模組224。 In the ACT module 222, an adaptive color transform is performed on the remaining CUs. The output of the ACT module 222 is coupled to the CCP, conversion and quantization module 224.

CCP、轉換及量化模組224執行跨組件預測(cross component prediction,CCP)、轉換(如離散續餘弦轉換(Discrete Cosine Transform,DCT)或離散正弦轉換(Discrete Sine Transform,DST)、及剩餘CU的量化。CCP、轉換及量化模組224的輸出接入至熵編碼模組(Entropy Coding Module)226及反運算CCP、轉換及量化模組(Inverse CCP,Transform,and Quantization Module)228。 CCP, conversion and quantization module 224 performs cross-component prediction (cross Component prediction, CCP), conversion (such as Discrete Cosine Transform (DCT) or Discrete Sine Transform (DST), and quantization of remaining CU. Output of CCP, conversion and quantization module 224 The Entropy Coding Module 226 and the Inverse CCP (Transform, and Quantization Module) 228.

熵編碼模組226執行剩餘熵編碼(entropy encoding)。舉例來說,全文自適應二進位算術編碼(Context Adaptive Binary Arithmetic Coding,CABAC)可以被執行以編碼剩餘CU。HEVC所提供之任何其他熵編碼程序皆可執行於熵編碼模組226中。 Entropy encoding module 226 performs entropy encoding. For example, Context Adaptive Binary Arithmetic Coding (CABAC) can be executed to encode the remaining CUs. Any other entropy encoding procedure provided by HEVC can be performed in the entropy encoding module 226.

在執行熵編碼之後,輸入視訊畫面之CU的已編碼位元流自視訊編碼器200輸出。輸出之已編碼位元流可以儲存於一記憶體、透過傳輸線廣播或網路、或提供至一顯示器等。 After performing entropy encoding, the encoded bit stream of the CU of the input video picture is output from video encoder 200. The output encoded bit stream can be stored in a memory, broadcast or networked through a transmission line, or provided to a display or the like.

在反運算CCP、轉換及量化模組228中,執行CCP、轉換及量化模組224之相反決定於剩餘CU,以提供一重建的剩餘CU。 In the inverse CCP, transform and quantization module 228, the inverse of the CCP, translation and quantization module 224 is determined by the remaining CUs to provide a reconstructed remaining CU.

若已選擇的預測CU是來自於幀間預測啟用ACT模組204或幀內預測啟用ACT模組212,切換器(switch)230切換至位置C。在位置C,重建的剩餘CU輸入至反運算ACT模組(Inverse ACT Module)232並接著輸入至加總模組(Summing Module)218。然而,若已選擇的預測CU是來自於幀間預測禁用ACT模組206或幀內預測禁用ACT模組214,切換器230切換至 位置D。在位置D,反運算ACT模組232被跳過而不被執行,且重建的剩餘CU被直接輸入至加總模組218。 If the selected predicted CU is from the inter prediction enabled ACT module 204 or the intra prediction enabled ACT module 212, the switch 230 switches to position C. At position C, the reconstructed remaining CU is input to the inverse ACT Module 232 and then to the summing module 218. However, if the selected predicted CU is from the inter prediction disable ACT module 206 or the intra prediction disable ACT module 214, the switch 230 switches to Position D. At position D, the inverse ACT module 232 is skipped and not executed, and the reconstructed remaining CUs are directly input to the summation module 218.

反運算ACT模組232對重建的剩餘CU執行ACT模組232之自適性色彩轉換的反運算。反運算ACT模組232的輸出輸入至加總模組218。 The inverse ACT module 232 performs an inverse of the adaptive color conversion of the ACT module 232 on the reconstructed remaining CUs. The output of the inverse ACT module 232 is input to the summation module 218.

在加總模組218中,來自模式決定模組210的已選擇的預測CU加上重建的剩餘CU,以提供重建的原始CU(reconstructed source CU)。重建的原始CU接著被儲存於畫面暫存器208,以供其他CU的幀間預測及幀內預測使用。 In the summation module 218, the selected predicted CU from the mode decision module 210 is added to the reconstructed remaining CU to provide a reconstructed original CU (reconstructed source CU). The reconstructed original CU is then stored in picture buffer 208 for inter prediction and intra prediction by other CUs.

以下說明的編碼方法300、400及500如何執行於幀內預測啟用ACT模組212內。編碼方法300、400及500能夠改善編碼效率與編碼時間。 How the encoding methods 300, 400, and 500 described below are performed within the intra prediction enable ACT module 212. The encoding methods 300, 400, and 500 can improve encoding efficiency and encoding time.

幀間預測啟用ACT模組204、幀間預測禁用ACT模組206、幀內預測啟用ACT模組212及幀內預測禁用ACT模組214並非侷限於以平行的方式排列。在一實施例中,幀間預測啟用ACT模組204、幀間預測禁用ACT模組206、幀內預測啟用ACT模組212及幀內預測禁用ACT模組214可以依序排列。幀間預測啟用ACT模組204、幀間預測禁用ACT模組206、幀內預測啟用ACT模組212及幀內預測禁用ACT模組214的排列方式可以變更。 The inter prediction enable ACT module 204, the inter prediction disable ACT module 206, the intra prediction enable ACT module 212, and the intra prediction disable ACT module 214 are not limited to being arranged in a parallel manner. In an embodiment, the inter prediction enable ACT module 204, the inter prediction disable ACT module 206, the intra prediction enable ACT module 212, and the intra prediction disable ACT module 214 may be sequentially arranged. The arrangement of the inter prediction enable ACT module 204, the inter prediction disable ACT module 206, the intra prediction enable ACT module 212, and the intra prediction disable ACT module 214 can be changed.

第3圖說明根據本揭露一實施例之編碼方法300,其決定TU尺寸的估計(TU size evaluation)是否需執行於啟用 ACT之幀內預測編碼程序(ACT enabled intra prediction encoding process)中。更具體來說,編碼方法300利用關於CU尺寸之臨界值計算(threshold calculation)來決定是否需執行TU尺寸的估計。 Figure 3 illustrates an encoding method 300 for determining whether a TU size evaluation needs to be performed for enabling, in accordance with an embodiment of the present disclosure. In the ACT enabled intra prediction encoding process. More specifically, encoding method 300 utilizes a threshold calculation for the CU size to determine whether an estimate of the TU size needs to be performed.

在步驟304中,成分相關性分析(component correlation analysis)執行於一原始CU,以決定CU之ACT的編碼模式是否需被啟用。CU內之各個畫素之色彩成分的相關性被分析出來。在各個畫素中,色彩成分之相關性與一畫素相關臨界值(pixel correlation threshold)進行比較,以分析出相關性是否高於、等於或低於畫素相關性臨界值。 In step 304, component correlation analysis is performed on an original CU to determine if the coding mode of the ACT of the CU needs to be enabled. The correlation of the color components of the various pixels in the CU is analyzed. In each pixel, the correlation of color components is compared to a pixel correlation threshold to analyze whether the correlation is above, equal to, or below the pixel correlation threshold.

於一CU中,計算出高於畫素相關性臨界值之畫素的總數量,其中等於畫素相關性臨界值之畫素也視為高於畫素相關性臨界值而被計算在內。畫素之總數量接著與一CU相關性臨界值(CU correlation threshold)進行比較。 In a CU, the total number of pixels above the pixel correlation threshold is calculated, and the pixel equal to the pixel correlation threshold is also considered to be higher than the pixel correlation threshold. The total number of pixels is then compared to a CU correlation threshold.

若畫素之總數量低於CU相關性臨界值,則判定CU之色彩成分具有低相關性。因此,CU並不需要ACT,故流程進入步驟308,而於CU的編碼禁用ACT。 If the total number of pixels is lower than the CU correlation threshold, it is determined that the color component of the CU has a low correlation. Therefore, the CU does not need ACT, so the flow proceeds to step 308, and the ACT is disabled in the encoding of the CU.

然而,若畫素之總數量高於CU相關性臨界值,則判定CU的色彩成分具有高相關性。在這種情況下,ACT是需要用來去除CU之各個畫素之成分相關性。當確認為高相關性,ACT被啟用且流程進入步驟306。於步驟306,於幀內預測啟用ACT之下,進行一概略模式決定。 However, if the total number of pixels is higher than the CU correlation threshold, it is determined that the color component of the CU has a high correlation. In this case, ACT is the component correlation that needs to be used to remove the various pixels of the CU. When it is confirmed as high correlation, ACT is enabled and the flow proceeds to step 306. In step 306, under the intra prediction enable ACT, a summary mode decision is made.

步驟304之相關性分析可以更進一步或可選擇性地根據CU之色彩空間(color space)來進行。舉例來說,在步驟304,CU內之畫素的色彩成分可以進行分析,且CU之色彩空間可以進行判定。色彩空間可以判定為紅色、綠色及藍色(RGB)空間或亮度與色度(luminance and chrominance,YUV)空間。 The correlation analysis of step 304 can be performed further or selectively according to the color space of the CU. For example, in step 304, the color components of the pixels in the CU can be analyzed, and the color space of the CU can be determined. The color space can be determined as red, green, and blue (RGB) space or luminance and chrominance (YUV) space.

當判定為色彩空間為RGB色彩空間,流程進入步驟306。於步驟306,於幀內預測啟用ACT之下,進行概略模式決定(Rough mode decision)。由於RGB畫素成分通常具有高相關性,需要進行ACT來去除CU內各個畫素之成分的相關性,以將畫素能量(pixel energy)隔離為單一成份(single component)。 When it is determined that the color space is the RGB color space, the flow proceeds to step 306. In step 306, under the intra prediction enable ACT, a rough mode decision is performed. Since the RGB pixel components usually have high correlation, ACT is required to remove the correlation of the components of the respective pixels in the CU to isolate the pixel energy into a single component.

相對地,當色彩空間判定為YUV色彩空間,流程進入步驟308,而禁用ACT。這是由於YUV畫素成分通常具有低相關性,且多數畫素能量(pixel energy)儲存於單一像素成分(single pixel component)。由於CU畫素成分的進一步去相關性動作(de-correlation)不會產生額外的編碼效益,故並不需要於YUV畫素成分啟用ACT。 In contrast, when the color space is determined to be the YUV color space, the flow proceeds to step 308, and ACT is disabled. This is because the YUV pixel component usually has a low correlation, and most of the pixel energy is stored in a single pixel component. Since the further de-correlation of the CU pixel component does not create additional coding benefits, it is not necessary to enable ACT in the YUV pixel component.

在幀內預測啟用ACT模組212中,於編碼方法300禁用ACT時,幀內預測啟用ACT的編碼模式被禁用,且在幀內預測啟用ACT模組212不會輸出預測至模式決定模組210。 In the intra prediction enable ACT module 212, when the encoding method 300 disables ACT, the intra prediction enable ACT encoding mode is disabled, and the intra prediction enable ACT module 212 does not output the prediction to mode decision module 210. .

在幀間預測啟用ACT模組204,於幀間預測編碼禁用ACT時,幀間預測啟用ACT的編碼模式被禁用,且幀間預測啟用ACT模組204不會輸出預測至模式決定模組210。 The ACT module 204 is enabled for inter prediction. When the inter prediction encoding disable ACT, the inter prediction enable ACT encoding mode is disabled, and the inter prediction enabled ACT module 204 does not output the prediction to mode decision module 210.

在步驟306中,幀內預測啟用ACT下進行概略模式決定。概略模式決定可以是一成本模式決定(cost-based mode decision)。舉例來說,於概略模式決定中,可以決定為低複雜度成本之已選擇編碼模式,以快速做出決定,其通常具有最高品質及最低編碼成本。 In step 306, the intra prediction is enabled to perform a summary mode decision under ACT. The summary mode decision can be a cost-based mode decision. For example, in a summary mode decision, a selected coding mode that is low complexity cost can be determined to make a quick decision, which typically has the highest quality and lowest coding cost.

在步驟310中,在啟用ACT之編碼模式下,進行碼率失真函數模式決定(rate distortion optimization mode decision,RDO mode decision)。在此,當ACT、CCP、轉換、量化及熵編碼執行時,計算原始視訊之變異(deviation)及編碼模式的位元成本。變異可以由錯誤計算(error calculation)來獲得,例如是均方差(mean squared error,MSE)。接著,ROD分析選擇出具有最低編碼成本及最高編碼品質之編碼模式由。 In step 310, a rate distortion optimization mode decision (RDO mode decision) is performed in the ACT-enabled encoding mode. Here, when ACT, CCP, conversion, quantization, and entropy coding are performed, the derivation of the original video and the bit cost of the coding mode are calculated. The variation can be obtained from an error calculation, such as a mean squared error (MSE). Next, the ROD analysis selects the coding mode with the lowest coding cost and the highest coding quality.

舉例來說,在幀內預測啟用ACT模組212中,35個幀內預測模式(intra prediction modes,IPMs)可供編碼。幀內預測啟用ACT模組212在步驟306之概略模式決定中,採用簡單、低複雜度編碼成本決定法來從這些幀內預測模式選擇出最低編碼成本及最高編碼品質者。舉例來說,絕對轉換誤差和(sum of absolute transform distortion,SATD)成本可用來決定出各個IPM的低複雜度編碼成本。舉例來說,最低編碼成本及最高偏碼品質的選擇可以是選擇3個IPM或選擇8個IPM。幀內預測啟用ACT模組212在步驟310之RDO模式決定中,對各個已選擇IPM進行RDO模式決定。當ACT、CCP、轉換、量化及熵編碼執行時, 計算各個已選擇IPM之原始視訊之變異及編碼的位元成本。變異可以由錯誤計算(error calculation)來獲得,例如是均方差(mean squared error,MSE)。接著,藉由ROD分析從已選擇IPM中選擇出具有最低編碼成本及最高編碼品質之IPM。 For example, in the intra prediction enable ACT module 212, 35 intra prediction modes (IPMs) are available for encoding. The intra prediction enable ACT module 212 uses the simple, low complexity coding cost decision method to select the lowest coding cost and the highest coding quality from the intra prediction modes in the summary mode decision of step 306. For example, the sum of absolute transform distortion (SATD) cost can be used to determine the low complexity coding cost of each IPM. For example, the lowest coding cost and the highest code quality can be selected by selecting 3 IPMs or selecting 8 IPMs. The intra prediction enable ACT module 212 performs an RDO mode decision for each selected IPM in the RDO mode decision of step 310. When ACT, CCP, conversion, quantization, and entropy coding are performed, Calculate the variance of the original video and the bit cost of the encoding for each selected IPM. The variation can be obtained from an error calculation, such as a mean squared error (MSE). Next, the IPM with the lowest coding cost and the highest coding quality is selected from the selected IPM by ROD analysis.

上述相關於幀內預測啟用ACT模組212之變化流程也可以執行於幀間預測啟用ACT模組204。舉例來說,當幀間預測啟用ACT模組204執行編碼方法300,於步驟306,進行時間相鄰之視訊畫面的最佳幀間預測的概略模式決定,其提供最低編碼成本及最高編碼品質。在步驟310,進行幀間預測之RDO模式決定。在此,當ACT、CCP、轉換、量化及熵編碼執行時,計算幀間預測之原始視訊的變異(deviation)及編碼位元成本。變異可以由錯誤計算(error calculation)來獲得,例如是均方差(mean squared error,MSE)。接著,ROD分析選擇出具有最低編碼成本及最高編碼品質之幀間預測。 The above-described change process related to the intra prediction enable ACT module 212 can also be performed on the inter prediction enable ACT module 204. For example, when the inter prediction enable ACT module 204 performs the encoding method 300, in step 306, a summary mode decision of the optimal inter prediction of the temporally adjacent video frames is performed, which provides the lowest encoding cost and the highest encoding quality. At step 310, an RDO mode decision for inter prediction is performed. Here, when ACT, CCP, conversion, quantization, and entropy coding are performed, the original video's deviation and coding bit cost of the inter prediction are calculated. The variation can be obtained from an error calculation, such as a mean squared error (MSE). Next, the ROD analysis selects the inter prediction with the lowest coding cost and the highest coding quality.

在步驟312,計算目前處理之CU的CU尺寸。CU的尺寸可以是NxN,其中N可以是4、8、16、32或64。CU的N值與臨界值T1比較。臨界值T1可以是4、8、16、32或64。根據比較結果,判定出CU尺寸是否小於臨界值T1,並藉此估計出欲啟用編碼模式之轉換單元的尺寸。若CU尺寸小於臨界值T1,流程進入步驟314,以進行TU尺寸的決定(TU size ecision)。然而,若CU尺寸等於或大於臨界值T1,流程進入步驟316,而跳過步驟314的TU尺寸決定步驟。在步驟312,當CU尺寸大於臨 界值T1,決定出TU。若CU尺寸CU等於或大於臨界值T1,TU四分樹結構(quadtree structure)可以決定為最大可能的TU尺寸。舉例來說,當CU尺寸等於或大於臨界值T1,對於尺寸為64x64的PU,可以決定出四個32x32的TU。在另一實施例中,當CU尺寸等於或大於臨界值T1,對於32x32、16x16、8x8或4x4的PU來說,TU可以與PU相同尺寸。舉例來說,若PU的尺寸為32x32,對應的PU尺寸可以是32x32。 At step 312, the CU size of the currently processed CU is calculated. The size of the CU may be NxN, where N may be 4, 8, 16, 32 or 64. The N value of the CU is compared to the threshold T1. The threshold T1 can be 4, 8, 16, 32 or 64. Based on the comparison result, it is determined whether the CU size is smaller than the threshold value T1, and thereby the size of the conversion unit to which the encoding mode is to be enabled is estimated. If the CU size is less than the threshold T1, the flow proceeds to step 314 for TU size ecision. However, if the CU size is equal to or greater than the threshold T1, the flow proceeds to step 316, and the TU size decision step of step 314 is skipped. At step 312, when the CU size is greater than The boundary value T1 determines the TU. If the CU size CU is equal to or greater than the threshold T1, the TU quadtree structure can be determined as the maximum possible TU size. For example, when the CU size is equal to or greater than the threshold T1, for a 64x64 PU, four 32x32 TUs can be determined. In another embodiment, when the CU size is equal to or greater than the threshold T1, the TU may be the same size as the PU for a 32x32, 16x16, 8x8 or 4x4 PU. For example, if the size of the PU is 32x32, the corresponding PU size may be 32x32.

由於TU尺寸的決定耗費時間且增加編碼成本,步驟312可改善編碼時間及效率。因此,若TU尺寸的決定能夠省略,則可解省編碼成本即時間。再者,CU尺寸等於或大於臨界值T1表示CU的內容並不複雜。舉例來說,CU尺寸大於臨界值T1可能表示視訊影像有大範圍區域沒有邊界、動態或複雜圖像。因此,TU尺寸的決定可以不需要進行,以有效率地進行高視訊品質之CU的編碼。 Since the decision of the TU size is time consuming and increases the coding cost, step 312 can improve coding time and efficiency. Therefore, if the decision of the TU size can be omitted, the coding cost, that is, the time can be solved. Furthermore, the CU size is equal to or greater than the threshold T1 indicating that the content of the CU is not complicated. For example, a CU size greater than a threshold T1 may indicate that the video image has a wide range of areas without borders, dynamics, or complex images. Therefore, the decision of the TU size may not be required to efficiently perform encoding of the CU of high video quality.

在步驟314中,若CU尺寸低於臨界值T1,則執行TU尺寸的決定。在此,決定了原始CU之TU。藉由步驟310之RDO成本估計,分析出TU尺寸,已獲得最高效率及高視訊品質之CU的ACT轉換。舉例來說,可分析出4x4、8x8、16x16及32x32的TU尺寸。當能夠獲得最高效率之ACT轉換的TU尺寸被決定出來,此TU尺寸被選擇用來作CU的ACT轉換並進入步驟316。已選擇的TU尺寸作為最佳的TU四分樹結構尺寸。 In step 314, if the CU size is below the threshold T1, the decision of the TU size is performed. Here, the TU of the original CU is determined. With the RDO cost estimate of step 310, the TU size is analyzed, and the ACT conversion of the CU with the highest efficiency and high video quality has been obtained. For example, TU sizes of 4x4, 8x8, 16x16, and 32x32 can be analyzed. When the TU size of the ACT conversion that can achieve the highest efficiency is determined, this TU size is selected for the ACT conversion of the CU and proceeds to step 316. The selected TU size is used as the optimal TU quadtree structure size.

在步驟316,進行色度模式決定(chroma mode decision)。色度模式的決定是依據步驟310之預測模式的決定來進行,且利用已決定的預測模式(determined prediction mode)來使色度預測(chroma prediction)產生色度PU(chroma PU)及對應的色度TU(chroma TU)。從步驟312或步驟314決定的TU也可用來產生色度TU。色度TU亦根據色度格式(chroma format)進行二次採樣(subsample)。因此,在一實施例中,當色度格式為4:2:0,且亮度TU的尺寸為32x32,決定之色度TU為16x16的色度TU。 At step 316, a chroma mode decision is made (chroma mode) Decision). The determination of the chrominance mode is performed according to the determination of the prediction mode of step 310, and the determined prediction mode is used to cause chroma prediction to generate chrominance PU (chroma PU) and corresponding color. Degree TU (chroma TU). The TU determined from step 312 or step 314 can also be used to generate a chroma TU. The chroma TU is also subsampled according to the chroma format. Thus, in one embodiment, when the chroma format is 4:2:0 and the size of the luma TU is 32x32, the determined chroma TU is a 16x16 chroma TU.

在步驟308,幀內預測啟用ACT模組之選擇最佳幀內預測模式及選擇最佳TU四分樹結構尺寸的程序已完成。預測及RDO成本已產生,且輸入至模式決定模組210,以與其他預測模組輸入至模式決定模組210之RDO成本進行比較。舉例來說,幀間預測啟用ACT模組204可能產生啟用ACT之CU的預測及RDO成本,並輸入預測CU及RDO成本至模式決定模組210。幀間預測禁用ACT模組206及幀內預測禁用ACT模組214也產生預測CU及RDO成本,並輸入其各自之預測CU及RDO成本至模式決定模組210。模式決定模組210比較幀間預測啟用ACT模組204、幀間預測禁用ACT模組206、幀內預測啟用ACT模組212及幀內預測禁用ACT模組214所輸入之預測CU及RDO成本,並決定將要輸入至加總模組216、218之預測CU。 At step 308, the intra prediction enable ACT module selects the optimal intra prediction mode and the procedure for selecting the optimal TU quadtree size is completed. The prediction and RDO costs have been generated and input to the mode decision module 210 for comparison with the RDO costs of the other prediction modules input to the mode decision module 210. For example, inter prediction enable ACT module 204 may generate prediction and RDO costs for ACT enabled CUs and input predicted CU and RDO costs to mode decision module 210. The inter prediction disable ACT module 206 and the intra prediction disable ACT module 214 also generate predicted CU and RDO costs and enter their respective predicted CU and RDO costs to the mode decision module 210. The mode determining module 210 compares the predicted CU and RDO costs input by the inter prediction enable ACT module 204, the inter predictive disable ACT module 206, the intra prediction enable ACT module 212, and the intra prediction disable ACT module 214. And the predicted CU to be input to the summation modules 216, 218 is determined.

第4圖說明根據本揭露另一實施例之編碼方法400,其根據本揭露之另一實施例決定ACT是否需要啟用。更具體來說, 編碼方法400利用了關於CU尺寸的臨界值計算(threshold calculation)及CU畫素之色彩成分的相關性的決定。根據臨界值計算,ACT可以啟用或禁用。相同標號之元件可參考前述相關說明。 FIG. 4 illustrates an encoding method 400 in accordance with another embodiment of the present disclosure that determines whether ACT needs to be enabled in accordance with another embodiment of the present disclosure. More specifically, The encoding method 400 utilizes a determination of the correlation between the threshold calculation of the CU size and the color component of the CU pixel. Depending on the threshold calculation, ACT can be enabled or disabled. For the same reference numerals, reference may be made to the aforementioned related description.

在步驟304,成分相關性分析(component correlation analysis)執行於原始CU,以決定ACT是否需啟用或禁用。步驟304如同編碼方法300之說明。若CU之色彩成分的相關性高,則啟用ACT且流程進入步驟306、310、314、316及308(同上述編碼步驟300)。然而,若相關性低,則流程進入步驟402。 At step 304, component correlation analysis is performed on the original CU to determine if ACT needs to be enabled or disabled. Step 304 is as described for encoding method 300. If the correlation of the color components of the CU is high, ACT is enabled and the flow proceeds to steps 306, 310, 314, 316, and 308 (same as encoding step 300 above). However, if the correlation is low, the flow proceeds to step 402.

在步驟402,決定目前處理之CU的尺寸。如前所述,CU尺寸為NxN,其中N可以是4、8、16、32或64。CU之N值與臨界值T2進行比較,以比較出CU尺寸是否小於臨界值T2。臨界值T2可以是4、8、16、32或64。若CU尺寸小於臨界值T2,則啟用ACT且流程進入步驟310,如同編碼方法300之步驟310的RDO模式決定。然而,若CU尺寸等於或大於臨界值T2,流程進入步驟308,而禁用ACT。 At step 402, the size of the currently processed CU is determined. As previously mentioned, the CU size is NxN, where N can be 4, 8, 16, 32 or 64. The N value of the CU is compared to a threshold T2 to compare whether the CU size is less than the threshold T2. The threshold T2 can be 4, 8, 16, 32 or 64. If the CU size is less than the threshold T2, ACT is enabled and the flow proceeds to step 310 as determined by the RDO mode of step 310 of the encoding method 300. However, if the CU size is equal to or greater than the threshold T2, the flow proceeds to step 308, and ACT is disabled.

於幀間預測啟用ACT模組204,當編碼方法400中ACT被禁用,幀間預測啟用ACT模組204之輸出為未應用ACT之幀間預測CU。因此,在這種情況下,幀間預測啟用ACT模組204輸出之CU相同於幀間預測禁用ACT模組206的輸出。同樣地,於幀內預測啟用ACT模組212,當編碼方法400中ACT被禁用,幀內預測啟用ACT模組212之輸出為未應用ACT之幀內 預測CU。因此,在這種情況下,幀內預測啟用ACT模組212的輸出CU相同於幀內預測禁用ACT模組214的輸出。 The ACT module 204 is enabled for inter prediction. When ACT is disabled in the encoding method 400, the output of the inter prediction enable ACT module 204 is an inter prediction CU to which ACT is not applied. Therefore, in this case, the CU outputted by the inter prediction enable ACT module 204 is the same as the output of the inter prediction disable ACT module 206. Similarly, the ACT module 212 is enabled in the intra prediction. When the ACT is disabled in the encoding method 400, the output of the intra prediction enable ACT module 212 is within the frame in which ACT is not applied. Predict CU. Therefore, in this case, the output prediction CU of the intra prediction enable ACT module 212 is the same as the output of the intra prediction disable ACT module 214.

由於CU尺寸相同或大於臨界值T2表示CU的內容不複雜,步驟402可改善編碼時間及編碼效率。CU尺寸大於臨界值T2可能表示視訊影像有大範圍區域沒有邊界、動態或複雜圖像。在組合已經充分去相關性之色彩成分下,為了有效率地編碼CU,可能不需要ACT。 Since the CU size is the same or larger than the threshold T2 indicating that the content of the CU is not complicated, step 402 can improve the encoding time and the encoding efficiency. A CU size greater than the threshold T2 may indicate that the video image has a wide range of areas without borders, dynamic or complex images. In order to efficiently encode a CU under the color component that has been sufficiently de-correlated, ACT may not be needed.

第5圖說明根據本揭露另一實施例之編碼方法500,其根據本揭露之另一實施例決定ACT是否需要啟用以及是否需要透過兩個臨界值計算來進行TU尺寸估計。更具體來說,編碼方法500使用關於CU尺寸之第一臨界值計算(first threshold calculation)以及用以判斷是否要啟用ACT之CU畫素色彩成分的相關性決定。編碼方法500也使用關於CU尺寸之第二臨界值計算(second threshold calculation),以決定TU尺寸的估計是否需執行。相同標號之元件可參考前述相關說明。 FIG. 5 illustrates an encoding method 500 according to another embodiment of the present disclosure, which determines whether ACT needs to be enabled and whether TU size estimation needs to be performed through two threshold calculations according to another embodiment of the present disclosure. More specifically, encoding method 500 uses a first threshold calculation for the CU size and a correlation decision to determine whether to enable the CU pixel color component of ACT. The encoding method 500 also uses a second threshold calculation for the CU size to determine if an estimate of the TU size needs to be performed. For the same reference numerals, reference may be made to the aforementioned related description.

在步驟304,成分相關性分析(component correlation analysis)執行於原始CU,以決定ACT是否需啟用或禁用。步驟304如同編碼方法300之說明。若CU之色彩成分的相關性高,則啟用ACT且流程進入步驟306,以進行概略模式決定及步驟310的RDO模式決定。步驟306及310如同前述編碼方法300之敘述。然而,若相關性低,則流程進入步驟402。 At step 304, component correlation analysis is performed on the original CU to determine if ACT needs to be enabled or disabled. Step 304 is as described for encoding method 300. If the correlation of the color components of the CU is high, ACT is enabled and the flow proceeds to step 306 to make a summary mode decision and the RDO mode decision of step 310. Steps 306 and 310 are as described in the aforementioned encoding method 300. However, if the correlation is low, the flow proceeds to step 402.

在步驟402,決定目前處理之CU的尺寸(如前述 第4圖之編碼方法400所述)。若CU尺寸小於臨界值T2,則啟用ACT,並進入步驟310,以進行RDO模式決定。然而,若CU尺寸等於或大於臨界值T2,則流程進入步驟308,而禁用ACT。 At step 402, the size of the currently processed CU is determined (as described above) The encoding method 400 of Figure 4). If the CU size is less than the threshold T2, ACT is enabled and proceeds to step 310 for RDO mode decision. However, if the CU size is equal to or greater than the threshold T2, the flow proceeds to step 308, and ACT is disabled.

於幀間預測啟用ACT模組204,當編碼方法500中ACT被禁用,幀間預測啟用ACT模組204之輸出為未應用ACT之幀間預測CU。因此,在這種情況下,幀間預測啟用ACT模組204輸出之CU相同於幀間預測禁用ACT模組206的輸出。 The ACT module 204 is enabled for inter prediction. When ACT is disabled in the encoding method 500, the output of the inter prediction enable ACT module 204 is an inter prediction CU to which ACT is not applied. Therefore, in this case, the CU outputted by the inter prediction enable ACT module 204 is the same as the output of the inter prediction disable ACT module 206.

同樣地,於幀內預測啟用ACT模組212,當編碼方法500中ACT被禁用,幀內預測啟用ACT模組212之輸出為未應用ACT之幀內預測CU。因此,在這種情況下,幀內預測啟用ACT模組212的輸出CU相同於幀內預測禁用ACT模組214的輸出。 Similarly, the ACT module 212 is enabled for intra prediction. When ACT is disabled in the encoding method 500, the output of the intra prediction enabled ACT module 212 is an intra prediction CU that does not apply ACT. Therefore, in this case, the output prediction CU of the intra prediction enable ACT module 212 is the same as the output of the intra prediction disable ACT module 214.

在步驟310,RDO模式決定如同前述編碼方法300所述之內容。 At step 310, the RDO mode determines the content as described in the aforementioned encoding method 300.

在步驟312,目前處理之CU尺寸的計算如同前述編碼方法300所述之內容,來決定CU尺寸是否小於臨界值T1。若CU尺寸小於臨界值T1,則流程進入步驟314,以進行TU尺寸決定。然而,若CU尺寸等於或大於臨界值T1,流程進入步驟316,而跳過步驟314的TU尺寸決定。步驟314、316的決定過程如同前述的編碼方法300。 At step 312, the current processed CU size is calculated as described in the foregoing encoding method 300 to determine if the CU size is less than the threshold T1. If the CU size is less than the threshold T1, the flow proceeds to step 314 to make a TU size decision. However, if the CU size is equal to or greater than the threshold T1, the flow proceeds to step 316, and the TU size decision of step 314 is skipped. The decision process of steps 314, 316 is as in the encoding method 300 described above.

臨界值T1及T2可以設定為相同或不同值。 The thresholds T1 and T2 can be set to the same or different values.

第5圖之編碼方法500結合臨界值計算來改善編碼 效率及時間。如上所述,CU尺寸等於或大於臨界值T2表示CU的內容不複雜,且可預期有大範圍區域的無邊界、動態或複雜圖樣。在組合已經充分去相關性之色彩成分下,為了有效率地編碼CU,可能不需要ACT。再者,步驟314的TU尺寸決定被省略後,能夠節省編碼成本。 The encoding method 500 of Figure 5 combines threshold calculation to improve encoding Efficiency and time. As described above, the CU size equal to or greater than the threshold T2 indicates that the content of the CU is not complicated, and a borderless, dynamic or complex pattern of a wide range of areas can be expected. In order to efficiently encode a CU under the color component that has been sufficiently de-correlated, ACT may not be needed. Furthermore, after the TU size decision of step 314 is omitted, the coding cost can be saved.

第6圖說明根據本揭露另一實施例之編碼方法600(類似於編碼方法300),其根據本揭露之另一實施例決定是否需要於啟用ACT之幀內預測程序中執行TU尺寸估計。更具體來說,編法方法600使用關於CU尺寸之臨界值計算(threshold calculation),並根據臨界值計算判斷是否需要執行TU尺寸估計。 FIG. 6 illustrates an encoding method 600 (similar to encoding method 300) in accordance with another embodiment of the present disclosure that determines whether TU size estimation needs to be performed in an intra-prediction program that enables ACT in accordance with another embodiment of the present disclosure. More specifically, the scribing method 600 uses a threshold calculation for the CU size and determines whether a TU size estimation needs to be performed based on the threshold calculation.

在步驟304,成分相關性分析(component correlation analysis)執行於原始CU,以決定ACT是否需啟用或禁用。步驟304如同編碼方法300之說明。若CU之色彩成分的相關性高,則啟用ACT且流程進入步驟306,以進行概略模式決定及步驟310的RDO模式決定。步驟306及310如同前述編碼方法300之敘述。然而,若於步驟304之相關性低,或色彩空間判斷為YUV色彩空間,則啟用ACT之編碼模式並直接進入步驟310,但不執行步驟306之概略模式決定。在此,對於低相關性畫素成分或YUV色彩空間,ACT仍然啟用,以確認畫素成分之去相關性可能會產生附加的編碼效益。 At step 304, component correlation analysis is performed on the original CU to determine if ACT needs to be enabled or disabled. Step 304 is as described for encoding method 300. If the correlation of the color components of the CU is high, ACT is enabled and the flow proceeds to step 306 to make a summary mode decision and the RDO mode decision of step 310. Steps 306 and 310 are as described in the aforementioned encoding method 300. However, if the correlation at step 304 is low, or the color space is determined to be a YUV color space, the ACT encoding mode is enabled and proceeds directly to step 310, but the summary mode decision of step 306 is not performed. Here, for low correlation pixel components or YUV color spaces, ACT is still enabled to confirm that the de-correlation of pixel components may result in additional coding benefits.

在步驟310,RDO模式決定的計算如同前述編碼方 法300。 At step 310, the RDO mode determines the calculation as in the aforementioned encoding side. Method 300.

在步驟312,目前處理之CU尺寸的計算如同前述編碼方法300所述之內容,來決定CU尺寸是否小於臨界值T1。若CU尺寸小於臨界值T1,則流程進入步驟314,以進行TU尺寸決定。然而,若CU尺寸等於或大於臨界值T1,流程進入步驟316,而跳過步驟314的TU尺寸決定。步驟314、316的決定過程如同前述的編碼方法300。 At step 312, the current processed CU size is calculated as described in the foregoing encoding method 300 to determine if the CU size is less than the threshold T1. If the CU size is less than the threshold T1, the flow proceeds to step 314 to make a TU size decision. However, if the CU size is equal to or greater than the threshold T1, the flow proceeds to step 316, and the TU size decision of step 314 is skipped. The decision process of steps 314, 316 is as in the encoding method 300 described above.

臨界值T1及T2可以設定為相同或不同值。 The thresholds T1 and T2 can be set to the same or different values.

執行編碼方法300、400、500、600之相反步驟的解碼方法可以有效率地對編碼方法300、400、500、600所編碼之視訊進行解碼。因此,本揭露的上述內容足以瞭解執行編碼方法300、400、500、600之相反步驟的解碼方法。本揭露上述內容亦足以瞭解對編碼方法300、400、500、600所編碼之視訊進行解碼所需之其他解碼程序。 The decoding method that performs the reverse steps of the encoding methods 300, 400, 500, 600 can efficiently decode the video encoded by the encoding methods 300, 400, 500, 600. Accordingly, the above disclosure of the present disclosure is sufficient to understand the decoding method of performing the reverse steps of the encoding methods 300, 400, 500, 600. The foregoing disclosure is also sufficient to understand other decoding procedures required to decode the video encoded by the encoding methods 300, 400, 500, 600.

如果大CU使用1PM作為螢幕虛擬內容(screen visual content),則可能表示該區域之內容並不複雜,且並不需要估計TU的尺寸。因此,非444色度格式之IPM被禁止部分大CU的TU分割。第7圖說明非444色度格式之IPM的演算流程。步驟306及310如同前述編碼方法300之敘述。在步驟310,RDO模式決定的計算如同前述編碼方法300。 If the large CU uses 1PM as the screen visual content, it may indicate that the content of the area is not complicated and does not need to estimate the size of the TU. Therefore, the IPM of the non-444 chroma format is prohibited from splitting the TU of some large CUs. Figure 7 illustrates the flow of the IPM in a non-444 chroma format. Steps 306 and 310 are as described in the aforementioned encoding method 300. At step 310, the RDO mode decision is calculated as the aforementioned encoding method 300.

在步驟311,判斷色度格式是否為非444。若色度格式為非444,則進入步驟312。若色度格不是非444,則進入步驟 314,以近行TU尺寸決定。 At step 311, it is determined whether the chroma format is non-444. If the chroma format is non-444, then step 312 is entered. If the chroma is not 444, go to the step 314, determined by the size of the near-row TU.

在步驟312,目前處理之CU尺寸的計算如同前述編碼方法300所述之內容,來決定CU尺寸是否小於臨界值T1。若CU尺寸小於臨界值T1,則流程進入步驟314,以進行TU尺寸決定。然而,若CU尺寸等於或大於臨界值T1,流程進入步驟316,而跳過步驟314的TU尺寸決定。步驟314、316的決定過程如同前述的編碼方法300。 At step 312, the current processed CU size is calculated as described in the foregoing encoding method 300 to determine if the CU size is less than the threshold T1. If the CU size is less than the threshold T1, the flow proceeds to step 314 to make a TU size decision. However, if the CU size is equal to or greater than the threshold T1, the flow proceeds to step 316, and the TU size decision of step 314 is skipped. The decision process of steps 314, 316 is as in the encoding method 300 described above.

臨界值T1及T2可以設定為相同或不同值。 The thresholds T1 and T2 can be set to the same or different values.

第8圖繪示執行本揭露之編碼與解碼方法的系統700。系統700包括一非暫態電腦可讀取媒體(non-transitory computer-readable medium)702,其可以是儲存數組指令的記憶體。此些指令可被處理器704執行。值得注意的是,一或多個非暫態電腦可讀取媒體702及/或一或多個處理器704可以選擇性地採用,以執行本揭露之編碼與解碼方法。 FIG. 8 illustrates a system 700 for performing the encoding and decoding methods of the present disclosure. System 700 includes a non-transitory computer-readable medium 702, which can be a memory that stores array instructions. Such instructions may be executed by processor 704. It is noted that one or more non-transitory computer readable media 702 and/or one or more processors 704 can be selectively employed to perform the encoding and decoding methods of the present disclosure.

非暫態電腦可讀取媒體702可以是任何類型的非暫態電腦可讀取記錄媒體(non-transitory computer-readable storage medium,non-transitory CRM)。非暫態電腦可讀取記錄媒體可以包括軟性磁碟(floppy disk)、可撓性碟片(flexible disk)、硬碟(hard disk)、硬碟機(hard drive)、固態硬碟(solid state drive)、磁帶(magnetic tape)、任何磁性資料儲存媒體(magnetic data storage medium)、光碟機(CD-ROM)、任何光學資料儲存媒體(optical data storage medium)、任何具有孔洞圖樣之物理性媒體、 動態隨機存取記憶體(RAM)、可程式唯讀記憶體(PROM)、可抹除可程式化唯讀記憶體(EPROM)、快閃可抹除可程式化唯讀記憶體(FLASH-EPROM)、任何快閃記憶體、非揮發性記憶體(NVRAM)、快取(cache)、暫存器(register)、記憶體晶片(memory chip)、膠卷(cartridge)及網路。電腦可讀取記錄媒體可儲存由至少一處理器執行之數組指令。此些指令包含令處理器去執行本揭露的編碼與解碼方法的步驟或階段。再者,一或多個電腦可讀取記錄媒體可以用來實現本揭露之編碼與解碼方法。「電腦可讀取記錄媒體」包含有形物體但不包含載體載波訊號和瞬態訊號。 The non-transitory computer readable storage medium 702 can be any type of non-transitory computer-readable storage medium (non-transitory CRM). The non-transitory computer readable recording medium may include a floppy disk, a flexible disk, a hard disk, a hard drive, and a solid state disk. Drive), magnetic tape, any magnetic data storage medium, CD-ROM, any optical data storage medium, any physical medium with a hole pattern, Dynamic Random Access Memory (RAM), Programmable Read Only Memory (PROM), Erasable Programmable Read Only Memory (EPROM), Flash Erasable Programmable Read Only Memory (FLASH-EPROM) ), any flash memory, non-volatile memory (NVRAM), cache, register, memory chip, cartridge, and network. The computer readable recording medium can store array instructions executed by at least one processor. Such instructions include steps or stages that cause the processor to perform the encoding and decoding methods of the present disclosure. Furthermore, one or more computer readable recording media can be used to implement the encoding and decoding methods of the present disclosure. "Computer-readable recording media" contains tangible objects but does not contain carrier carrier signals and transient signals.

處理器704可以是任何形式的數位訊號處理器(digital signal processor,DSP)、特定應用積體電路(application specific integrated circuit,ASIC)、數位訊號處理裝置(digital signal processing device,DSPD)、可程式化邏輯裝置(programmable logic device,PLD)、可程式邏輯陣列(field programmable gate arrays,FPGA)、控制器(controller)、微控制器(micro-controller)、微處理器(micro-processor)、電腦或任何其他能夠執行本揭露之編碼與解碼方法之電子元件。 The processor 704 can be any type of digital signal processor (DSP), an application specific integrated circuit (ASIC), a digital signal processing device (DSPD), and can be programmed. Logic device (PLD), field programmable gate arrays (FPGA), controller, micro-controller, micro-processor, computer or any Other electronic components capable of performing the encoding and decoding methods of the present disclosure.

實驗結果 Experimental result

以下說明本揭露之編碼方法的實驗結果。 The experimental results of the encoding method of the present disclosure are described below.

此處的實驗室採用HEVC SCC參考模型、一般測試條件(common test conditions,CTC)下的SCM 4.0。本揭露的編碼方法的編碼效能是與HEVC的參考模型進行比較。HEVC參考 模型花費了編碼時間A來進行編碼。本揭露之測試編碼方法花費了來進行編碼時間B來進行編碼。編碼時間百分比係為編碼時間B除以編碼時間A。實驗可採用HEVC一般測試流程。視訊可混合文字、圖像、動態畫面、混合內容、動畫、照相機擷取內容。視訊可以是具有720p、1080p、或1440p之畫質的RGB色彩空間及YUV色彩空間。實驗採用有損條件(lossy condition)下的全幀內預測、隨機存取及低B預測(low-B prediction)。全幀內預測採用目前正被壓縮之畫面內的資訊來壓縮視訊畫面,而隨機存取及低B預測採用先前已編碼之畫面及目前正被壓縮之畫面的資訊來壓縮視訊畫面。在以下的說明中,低B預測也可以是指低延遲B預測(low delay B prediction)。在每次的實驗中,編碼時間及解碼時間都被以百分比記錄下來,此些百分比表示相對於參考模型之編碼方法與解碼方法的比例。相對於原始視訊源,關於各個G/Y、B/U及R/V成分之正的百分比表示位元率編碼損失(bit rate coding loss),負的百分比表示位元率編碼增益(bit rate coding gain)。舉例來說,G/Y成分的0.1%數值表示已編碼視訊之G/Y成分相對於原始視訊之G/Y成分的編碼損失為0.1%。在另一實例中,G/Y成分的-0.1%數值表示已編碼視訊之G/Y成分相對於原始視訊之G/Y成分的編碼增益為0.1%。 The laboratory here uses the HEVC SCC reference model, SCM 4.0 under common test conditions (CTC). The coding performance of the coding method of the present disclosure is compared with the reference model of HEVC. HEVC reference The model takes the encoding time A for encoding. The test coding method of the present disclosure takes the coding time B to perform coding. The coding time percentage is the coding time B divided by the coding time A. The experiment can use the HEVC general test procedure. Video can mix text, images, motion pictures, mixed content, animations, and camera captures. The video can be an RGB color space with a picture quality of 720p, 1080p, or 1440p and a YUV color space. The experiment uses full intra prediction, random access, and low-B prediction under lossy conditions. Full intra prediction uses the information in the picture being compressed to compress the video picture, while random access and low B prediction use the previously encoded picture and the information of the picture currently being compressed to compress the video picture. In the following description, the low B prediction may also refer to a low delay B prediction. In each experiment, the encoding time and decoding time are recorded as a percentage, which represents the ratio of the encoding method to the decoding method relative to the reference model. Relative to the original video source, a positive percentage for each G/Y, B/U, and R/V component indicates a bit rate coding loss, and a negative percentage indicates a bit rate coding gain (bit rate coding). Gain). For example, a 0.1% value for the G/Y component indicates that the G/Y component of the encoded video has a coding loss of 0.1% relative to the G/Y component of the original video. In another example, a value of -0.1% of the G/Y component indicates that the coded gain of the G/Y component of the encoded video relative to the G/Y component of the original video is 0.1%.

請參考第5圖之編碼方法500及下表1。於編碼方法500,實驗室執行於以下三種設定之下。於設定一,臨界值T2及臨界值T1皆設定為64。於設定二,臨界值T2設定為64,臨 界值T1設定為32。於設定三,臨界值T2設定為64,臨界值T1設定為16。幀內預測係為預定的編碼模式。 Please refer to the coding method 500 of Figure 5 and Table 1 below. In the encoding method 500, the laboratory is executed under the following three settings. In the first setting, the threshold T2 and the threshold T1 are both set to 64. In setting 2, the threshold T2 is set to 64, Pro The threshold T1 is set to 32. In setting three, the threshold T2 is set to 64, and the threshold T1 is set to 16. The intra prediction is a predetermined coding mode.

於設定一,當畫素成分具有低相關性,尺寸大於或等於64x64的CU被以不啟用ACT的方式編碼。尺寸小於64x64的CU被以啟用ACT的方式編碼。再者,在CU尺寸大於64x64的情況下,省略TU尺寸決定的步驟314。對於CU尺寸小於64x64的情況下,執行TU尺寸決定的步驟314。 For setting one, when the pixel component has low correlation, a CU whose size is greater than or equal to 64x64 is encoded in a manner that does not enable ACT. CUs smaller than 64x64 are encoded in an ACT-enabled manner. Furthermore, in the case where the CU size is greater than 64x64, step 314 of TU size determination is omitted. In the case where the CU size is less than 64x64, step 314 of the TU size decision is performed.

於設定二,當畫素成分具有低相關性,尺寸大於或等於64x64的CU被以不啟用ACT的方式編碼。尺寸小於64x64的CU被以啟用ACT的方式編碼。再者,在CU尺寸大於32x32的情況下,省略TU尺寸決定的步驟314。對於CU尺寸小於32x32的情況下,執行TU尺寸決定的步驟314。 In the second setting, when the pixel component has low correlation, the CU whose size is greater than or equal to 64x64 is encoded in a manner that does not enable ACT. CUs smaller than 64x64 are encoded in an ACT-enabled manner. Furthermore, in the case where the CU size is larger than 32x32, step 314 of TU size determination is omitted. In the case where the CU size is less than 32x32, step 314 of the TU size decision is performed.

於設定三,當畫素成分具有低相關性,尺寸大於或等於64x64的CU被以不啟用ACT的方式編碼。尺寸小於64x64的CU被以啟用ACT的方式編碼。再者,在CU尺寸大於16x16的情況下,省略TU尺寸決定的步驟314。對於CU尺寸小於16x16的情況下,執行TU尺寸決定的步驟314。 In setting three, when the pixel component has low correlation, a CU whose size is greater than or equal to 64x64 is encoded in a manner that does not enable ACT. CUs smaller than 64x64 are encoded in an ACT-enabled manner. Furthermore, in the case where the CU size is larger than 16x16, step 314 of TU size determination is omitted. In the case where the CU size is less than 16x16, step 314 of the TU size decision is performed.

如表1所示,設定一、設定二及設定三的編碼效能均有改善。設定一降低了3%的編碼複雜度(encoding complexity),設定二降低了6%的編碼複雜度。設定三降低了9%的編碼複雜度(設定三降低最多)。因此,所有的設定都能夠改善編碼效率。各個設定在位元率的最小損失(minimal loss of bit rate)下,編碼時間及效率都有改善。 As shown in Table 1, the coding performance of setting 1, setting 2 and setting 3 is improved. Setting one reduced the coding complexity by 3%, and setting two reduced the coding complexity by 6%. Setting three reduces the encoding complexity by 9% (set three to reduce the most). Therefore, all settings can improve coding efficiency. Each setting has an improvement in coding time and efficiency at a minimum loss of bit rate.

請參照編碼方法500及下表2、3。在此,實驗是在全幀內、隨機存取及低延遲B(low delay B)之下進行。在實驗一,臨界值T2及臨界值T1皆設定為32。在實驗二,臨界值T2及臨界值T1皆設定為16。如同編碼方法500,於實驗一,尺寸 大於或等於32x32的CU禁用TU估計(TU evaluation),且尺寸大於或等於32x32以不啟用ACT的方式編碼。於實驗二,尺寸大於或等於16x16的CU禁用TU估計,且尺寸大於或等於16x16以不啟用ACT的方式編碼。尺寸小於16x16的CU於啟用ACT的方式編碼。實驗係在有損條件(lossy condition)及全畫面幀內區塊複製技術(full frame intra block copy)下進行。 Please refer to coding method 500 and Tables 2 and 3 below. Here, the experiment is performed under full frame, random access, and low delay B. In the first experiment, the critical value T2 and the critical value T1 are both set to 32. In the second experiment, the critical value T2 and the critical value T1 are both set to 16. As with encoding method 500, in experiment one, size A CU greater than or equal to 32x32 disables TU evaluation, and a size greater than or equal to 32x32 is encoded in a manner that does not enable ACT. In Experiment 2, a CU with a size greater than or equal to 16x16 disables TU estimation, and a size greater than or equal to 16x16 is encoded in a manner that does not enable ACT. CUs smaller than 16x16 are coded in an ACT-enabled mode. The experiment was performed under lossy conditions and full frame intra block copy.

如表2所述,於實驗一,全幀內模式(all intra mode)降低了5%的編碼複雜度。隨機存取及低延遲B各降低了1%的編碼複雜度。各個設定顯示出非常低的位元率損失,全幀內及隨機存取幾乎沒有改變位元率。 As shown in Table 2, in Experiment 1, the all intra mode reduced the coding complexity by 5%. Random access and low latency B each reduce the coding complexity by 1%. Each setting shows a very low bit rate loss, with almost no bit rate changes in full intraframe and random access.

於實驗二,全幀內模式降低了8%的編碼複雜度。隨機存取降低了1%的編碼複雜度。低延遲B沒有改變編碼複雜度。相較於實驗一,各個模式具有較多的位元率損失,但位元率損失仍然維持在最小(僅在百分比的小數範圍內)。相較於原始視訊,已編碼視訊僅略微降低位元率,故僅損失少部分的視訊品質。由於編碼方法500改善了編碼時間,故這樣的視訊品質在大部分的應用是可以接受的。 In Experiment 2, the full intra mode reduced the coding complexity by 8%. Random access reduces the coding complexity by 1%. Low latency B does not change the coding complexity. Compared to Experiment 1, each mode has more bit rate loss, but the bit rate loss is still kept to a minimum (only in the fractional range of percentages). Compared to the original video, the encoded video only slightly reduces the bit rate, so only a small portion of the video quality is lost. Since the encoding method 500 improves the encoding time, such video quality is acceptable for most applications.

如表3所述,於實驗一及實驗二,各個模式在全部或平均來看,對位元率都沒有改變。全幀內降低了最多比率的編碼複雜度(於各實驗均降低1%)。 As shown in Table 3, in Experiments 1 and 2, the mode did not change in all or on average. The coding complexity of the most ratio was reduced in the full frame (1% reduction for each experiment).

請參考第5圖之編碼方法500及下表4。在此,實驗是在有損條件(lossy condition)、4-CTU幀內區塊複製技術(Intra block copy)及4:4:4色度模式下進行。幀內區塊複製技術利用運動向量從先前已編碼CU複製一區塊至目前編碼視訊畫面。4-CTU指出運動向量能夠搜尋的範圍。 Please refer to coding method 500 and Figure 4 below in Figure 5. Here, the experiment was performed in a lossy condition, a 4-CTU intra block copy technique, and a 4:4:4 chroma mode. The intra block copy technique utilizes motion vectors to copy a block from a previously encoded CU to the current encoded video picture. 4-CTU indicates the range in which the motion vector can be searched.

於實驗一,臨界值T2及臨界值T1皆設定為32。於實驗二,臨界值T2及臨界值T1皆設定為16。如同編碼方法500,於實驗一,尺寸大於或等於32x32的CU禁用TU估計。於實驗二,尺寸大於或等於16x16的CU禁用TU估計。於實驗一,尺寸大於32x32的CU啟用ACT,尺寸大於或等於32x32的CU禁用ACT。於實驗二,尺寸小於16x16的CU啟用ACT,尺寸大於或等於16x16的CU禁用ACT。 In Experiment 1, the critical value T2 and the critical value T1 were both set to 32. In Experiment 2, the critical value T2 and the critical value T1 are both set to 16. As with encoding method 500, in experiment one, a CU with a size greater than or equal to 32x32 disables the TU estimate. In Experiment 2, a CU with a size greater than or equal to 16x16 disables the TU estimate. In Experiment 1, CUs with sizes larger than 32x32 enable ACT, and CUs with sizes greater than or equal to 32x32 disable ACT. In Experiment 2, CUs with sizes smaller than 16x16 enable ACT, and CUs with sizes greater than or equal to 16x16 disable ACT.

如表4所述,於實驗一及實驗二,全幀內、隨機存取或低延遲B模式皆為最小位元率改變。全幀內降低了最多的編碼複雜度,其於實驗一降低了5%,於實驗二降低了8%。 As shown in Table 4, in Experiment 1 and Experiment 2, the full intraframe, random access or low latency B mode is the minimum bit rate change. The maximum coding complexity was reduced in the full frame, which was reduced by 5% in Experiment 1 and by 8% in Experiment 2.

請參考第4圖之編碼方法400及以下的表5.1及表5.2。在此,臨界值T2設定為64。因此,當步驟304的成分相關性分析分析出CU的色彩成分具有低相關性時,執行步驟402,以判斷CU尺寸是否小於64x64。若CU尺寸小於64x64,則啟用ACT且執行步驟310之RDO模式決定。若CU尺寸大於或等於64x64,則禁用ACT且進入步驟308。實驗一係採用全畫面幀內區塊複製技術(full frame intra block copy)的有損全幀內編碼模式(lossy all intra encoding mode),實驗二採用4 CTU IBC技術的有損全幀內編碼模式。色度模式於各個實驗選擇為4:4:4。 Please refer to coding method 400 in Figure 4 and Tables 5.1 and 5.2 below. Here, the threshold T2 is set to 64. Therefore, when the component correlation analysis of step 304 analyzes that the color component of the CU has a low correlation, step 402 is performed to determine whether the CU size is less than 64x64. If the CU size is less than 64x64, ACT is enabled and the RDO mode decision of step 310 is performed. If the CU size is greater than or equal to 64x64, ACT is disabled and proceeds to step 308. Experiment 1 uses lossy all intra encoding mode of full frame intra block copy, and experiment 2 uses lossy full intra coding mode of 4 CTU IBC technique. . The chroma mode was chosen to be 4:4:4 for each experiment.

如表5.1所示,於YUV色彩空間且全幀內、有損、全畫面幀內區塊複製技術下,編碼方法400於最小位元率損失下,降低了1%到3%的編碼時間。如表5.2所示,在全幀內、有損、4CTU幀內區塊複製技術下,編碼方法400於最小位元損失下,降低編碼時間的比率近似於表5.1之實驗一。 As shown in Table 5.1, under the YUV color space and full-frame, lossy, full-picture intra-frame copying technique, the encoding method 400 reduces the encoding time by 1% to 3% with minimal bit rate loss. As shown in Table 5.2, under the full-frame, lossy, 4 CTU intra-frame copy technique, the coding method 400 reduces the coding time at a minimum bit loss ratio similar to Experiment 1 of Table 5.1.

請參考編碼方法400及下表6。在此,臨界值T2設定為64。於4:4:4的色度模式執行無損幀內編碼(Lossless intra encoding)。 Please refer to encoding method 400 and Table 6 below. Here, the threshold T2 is set to 64. Lossless intra encoding is performed in the 4:4:4 chroma mode.

於YUV色彩空間,編碼方法節省了0%到2%的編碼時間。 In the YUV color space, the encoding method saves 0% to 2% of the encoding time.

請參考第3圖之編碼方法300及下表7。在此,臨界值T1於實驗一設定為32,於實驗二設定為16。如同編碼方法300,於實驗一,CU尺寸大於或等於32x32時,將省略步驟314之TU尺寸決定;CU尺寸小於32x32時,則執行步驟314之TU尺寸決定。於實驗二,CU尺寸大於或等於16x16時,將省略步驟314之TU尺寸決定;CU尺寸小於16x16時,則執行步驟314之TU尺寸決定。實驗執行啟用ACT之有損全幀內編碼。 Please refer to the coding method 300 of Figure 3 and Table 7 below. Here, the critical value T1 is set to 32 in Experiment 1, and set to 16 in Experiment 2. As in the encoding method 300, in Experiment 1, when the CU size is greater than or equal to 32x32, the TU size decision of step 314 will be omitted; when the CU size is less than 32x32, the TU size decision of step 314 is performed. In experiment 2, when the CU size is greater than or equal to 16x16, the TU size decision of step 314 will be omitted; when the CU size is less than 16x16, the TU size decision of step 314 is performed. The experiment performs a lossy full intraframe encoding that enables ACT.

實驗一之編碼時間節省了3%到6%。實驗二之編碼時間節省了6%到10%。因此,僅在CU尺寸低於32x32或16x16 之下才允許進行TU尺寸決定,以幫助編碼效率。 The coding time of Experiment 1 was saved by 3% to 6%. The coding time of Experiment 2 was saved by 6% to 10%. Therefore, only in CU sizes below 32x32 or 16x16 The TU size decision is allowed below to help with coding efficiency.

上述內容係用以說明本揭露之技術,然其並非用以侷限本發明的內容。實施例的修改與調整均落於本揭露之範圍。舉例來說,所揭露的實施例包含軟體及硬體,但本揭露之系統與方法可以僅以硬體來實現。 The above description is used to illustrate the technology of the present disclosure, but it is not intended to limit the content of the present invention. Modifications and adjustments of the embodiments are within the scope of the disclosure. For example, the disclosed embodiments include both software and hardware, but the systems and methods of the present disclosure can be implemented only in hardware.

軟體開發者可基於本揭露之方法開發一電腦程式,其可採用各種電腦程式技術來開發。舉例來說,程式片段或程式模組可以藉由Java、C、C++、組合語言或任何其他程式語言來開發。一或多個軟體片段與模組可以安裝於一電腦系統、非暫態電腦可讀取媒體、或現存的通訊軟體。 The software developer can develop a computer program based on the method of the present disclosure, which can be developed using various computer program technologies. For example, a program fragment or a program module can be developed by Java, C, C++, a combination language, or any other programming language. One or more software segments and modules can be installed in a computer system, non-transitory computer readable media, or existing communication software.

再者,雖然上述以揭露各種實施例,然本揭露之範圍包含各種元件之均等、修改、省略、組合(例如不同實施例之間的組合)、應用、或選擇。申請專利範圍之元件係以最廣的範圍來作解釋,而不侷限於實施例的內容。此外,方法的步驟可以進行修改(包含調整順序、插入或刪除步驟)。雖然本揭露已以較佳實施例揭露如上,然其並非用以限定本揭露。本揭露之保護範圍當視後附之申請專利範圍所界定者為準。 Furthermore, the scope of the disclosure is to be construed as being limited, modified, omitted, and combinations (e.g., combinations of different embodiments), applications, or alternatives. The components of the patent application are to be interpreted in the broadest scope and not limited to the scope of the embodiments. In addition, the steps of the method can be modified (including the order of adjustment, insertion or deletion steps). Although the disclosure has been disclosed above in the preferred embodiments, it is not intended to limit the disclosure. The scope of protection of this disclosure is subject to the definition of the scope of the patent application.

本發明所屬技術領域中具有通常知識者亦可根據本揭露之說明瞭解其他的實施例。本揭露之範圍包含結合一般知識的各種變化、實施與應用。說明書與實施例僅僅作為示例,本揭露之保護範圍當視後附之申請專利範圍所界定者為準。 Other embodiments of the invention will be apparent to those skilled in the art of the invention. The scope of the disclosure encompasses various variations, implementations, and applications in conjunction with the general knowledge. The specification and the examples are intended to be illustrative only, and the scope of the disclosure is defined by the scope of the appended claims.

300‧‧‧編碼方法 300‧‧‧ coding method

304‧‧‧成分相關性分析 304‧‧‧ Component correlation analysis

306‧‧‧概略模式決定 306‧‧‧Summary mode decision

308‧‧‧結束 End of 308‧‧

310‧‧‧RDO模式決定 310‧‧‧RDO mode decision

312‧‧‧CU尺寸是否大於臨界值T1之判斷 312‧‧‧Determination of whether the CU size is greater than the critical value T1

314‧‧‧TU尺寸決定 314‧‧‧ TU size decision

316‧‧‧色度模式決定 316‧‧‧Chroma mode decision

Claims (17)

一種視訊編碼方法,包括:接收一原始視訊畫面(source video frame);分割該原始視訊畫面為一編碼樹單元(coding tree unit);從該編碼樹單元決定一編碼單元(coding unit);啟用或禁用該編碼單元之一編碼模式(coding mode);若啟用該編碼模式,則於啟用之該編碼模式判斷是否估計一轉換單元(transform unit)之尺寸;以及於啟用之該編碼模式決定該編碼單元之該轉換單元;其中該編碼單元之尺寸係為NxN。 A video encoding method includes: receiving an original video frame; dividing the original video frame into a coding tree unit; determining a coding unit from the coding tree unit; enabling or Disabling a coding mode of the coding unit; if the coding mode is enabled, determining whether to estimate a size of a transform unit in the coding mode enabled; and determining the coding unit when the coding mode is enabled The conversion unit; wherein the coding unit has a size of NxN. 如申請專利範圍第1項所述之方法,更包括:判斷該編碼單元之一色彩空間;其中判斷該編碼單元之該色彩空間包括判斷該色彩空間是否為一紅色、綠色及藍色(RGB)色彩空間或一亮度及色度(YUV)色彩空間。 The method of claim 1, further comprising: determining a color space of the coding unit; wherein determining the color space of the coding unit comprises determining whether the color space is a red, green, and blue (RGB) Color space or a luminance and chrominance (YUV) color space. 如申請專利範圍第2項所述之方法,更包括:若啟用之該編碼模式係為啟用一自適性色彩轉換(adaptive color transform,ACT)之一幀內預測模式(intra prediction mode),當該色彩空間被判斷為該RGB色彩空間時,執行一成本模式決定(cost-based mode decision)。 The method of claim 2, further comprising: if the encoding mode is enabled to enable an intra prediction mode of an adaptive color transform (ACT), when When the color space is judged as the RGB color space, a cost-based mode decision is performed. 如申請專利範圍第2項所述之方法,更包括:若啟用之該編碼模式係為啟用一自適性色彩轉換(adaptive color transform,ACT)之一幀內預測模式(intra prediction mode),當該色彩空間被判斷為一亮度及色度(YUV)色彩空間時,執行 一成本模式決定(cost-based mode decision)。 The method of claim 2, further comprising: if the encoding mode is enabled to enable an intra prediction mode of an adaptive color transform (ACT), when When the color space is judged as a luminance and chrominance (YUV) color space, execution A cost-based mode decision. 如申請專利範圍第2項所述之方法,更包括:當該色彩空間被判斷為該YUV色彩空間且N大於或等於一臨界值,禁用該編碼單元之該編碼模式。 The method of claim 2, further comprising: disabling the coding mode of the coding unit when the color space is determined to be the YUV color space and N is greater than or equal to a threshold. 如申請專利範圍第2項所述之方法,更包括:判斷N是否小於一臨界值;以及當該色彩空間被判斷為該YUV色彩空間且N小於該臨界值,啟用該編碼模式,該編碼模式啟用一自適性色彩轉換(adaptive color transform,ACT)。 The method of claim 2, further comprising: determining whether N is less than a threshold; and when the color space is determined to be the YUV color space and N is less than the threshold, enabling the encoding mode, the encoding mode Enable an adaptive color transform (ACT). 如申請專利範圍第2項所述之方法,更包括:判斷N是否小於一第一臨界值;判斷N是否小於一第二臨界值;當色彩空間被判斷為該YUV色彩空間且N小於該第一臨界值,啟用該編碼模式,該編碼模式啟用一自適性色彩轉換(adaptive color transform,ACT);以及當N小於該第二臨界值,於該編碼模式估計該轉換單元的尺寸,該編碼模式啟用該自適性色彩轉換。 The method of claim 2, further comprising: determining whether N is less than a first threshold; determining whether N is less than a second threshold; when the color space is determined to be the YUV color space and N is less than the first a threshold value, the encoding mode is enabled, the encoding mode enabling an adaptive color transform (ACT); and when N is less than the second threshold, estimating a size of the converting unit in the encoding mode, the encoding mode Enable this adaptive color conversion. 如申請專利範圍第2項所述之方法,更包括:判斷N是否小於一第一臨界值;判斷N是否大於或等於一第二臨界值;當該色彩空間被判斷為該YUV色彩空間且N小於該第一臨界值,啟用該編碼模式,該編碼模式啟用一自適性色彩轉換(adaptive color transform,ACT);以及當N大於或等於該第二臨界值,於該編碼模式不估計該轉換 單元之尺寸,該編碼模式啟用該自適性色彩轉換。 The method of claim 2, further comprising: determining whether N is less than a first threshold; determining whether N is greater than or equal to a second threshold; when the color space is determined to be the YUV color space and N Less than the first threshold, the encoding mode is enabled, the encoding mode enables an adaptive color transform (ACT); and when N is greater than or equal to the second threshold, the conversion is not estimated in the encoding mode. The size of the unit that enables this adaptive color conversion. 如申請專利範圍第1項所述之方法,更包括:判斷N是否大於或等於一臨界值;以及當N大於或等於該臨界值且啟用之該編碼模式啟用一自適性色彩轉換(adaptive color transform,ACT),不評估該轉換單元之尺寸。 The method of claim 1, further comprising: determining whether N is greater than or equal to a threshold; and when the N is greater than or equal to the threshold and enabling the encoding mode to enable an adaptive color transform (adaptive color transform) , ACT), the size of the conversion unit is not evaluated. 如申請專利範圍第1項所述之方法,更包括:判斷N是否小於一臨界值;當N小於該臨界值且啟用之該編碼模式啟用自適性色彩轉換(adaptive color transform,ACT),評估該轉換單元之尺寸;以及選擇該轉換單元之尺寸。 The method of claim 1, further comprising: determining whether N is less than a threshold; and when N is less than the threshold and enabling the encoding mode to enable adaptive color transform (ACT), evaluating the The size of the conversion unit; and the size of the conversion unit. 如申請專利範圍第1項所述之方法,更包括:若該原始視訊畫面係為非444(non-444)且該編碼單元之尺寸小於NxN,評估該轉換單元之尺寸。 The method of claim 1, further comprising: if the original video picture is non-444 (non-444) and the size of the coding unit is smaller than NxN, the size of the conversion unit is evaluated. 一種視訊編碼系統,包括:一記憶體,用以儲存一組指令;以及一處理器,用以執行該組指令,該組指令包括:接收一原始視訊畫面(source video frame);分割該原始視訊畫面為一編碼樹單元(coding tree unit);從該編碼樹單元決定一編碼單元(coding unit);啟用或禁用該編碼單元之一編碼模式(coding mode);若啟用該編碼模式,則於啟用之該編碼模式判斷是否估計一轉換單元(transform unit)之尺寸;以及 於啟用之該編碼模式決定該編碼單元之該轉換單元;其中該編碼單元之尺寸係為NxN。 A video encoding system includes: a memory for storing a set of instructions; and a processor for executing the set of instructions, the set of instructions comprising: receiving an original video frame; dividing the original video The picture is a coding tree unit; a coding unit is determined from the coding tree unit; one of the coding units is enabled or disabled; if the coding mode is enabled, the picture is enabled The encoding mode determines whether to estimate the size of a transform unit; The coding mode enabled to determine the conversion unit of the coding unit; wherein the coding unit has a size of NxN. 如申請專利範圍第12項所述之系統,其中該處理器用以執行之該組指令更包括:判斷該編碼單元之一色彩空間;其中該色彩空間係被判斷是否為一紅色、綠色及藍色(RGB)色彩空間或一亮度及色度(YUV)色彩空間。 The system of claim 12, wherein the processor is configured to execute the set of instructions further comprising: determining a color space of the coding unit; wherein the color space is determined to be a red, green, and blue (RGB) color space or a luminance and chrominance (YUV) color space. 如申請專利範圍第13項所述之系統,其中該處理器用以執行之該組指令更包括:判斷N是否小於一第一臨界值;判斷N是否小於一第二臨界值;當色彩空間被判斷為該YUV色彩空間且N小於該第一臨界值,啟用該編碼模式,該編碼模式啟用一自適性色彩轉換(adaptive color transform,ACT);以及當N小於該第二臨界值,於該編碼模式估計該轉換單元的尺寸,該編碼模式啟用該自適性色彩轉換。 The system of claim 13, wherein the processor is configured to: determine whether N is less than a first threshold; determine whether N is less than a second threshold; when the color space is determined For the YUV color space and N is less than the first threshold, the encoding mode is enabled, the encoding mode enabling an adaptive color transform (ACT); and when N is less than the second threshold, in the encoding mode The size of the conversion unit is estimated, which enables the adaptive color conversion. 如申請專利範圍第13項所述之系統,其中該處理器用以處理之該組指令更包括:判斷N是否小於一第一臨界值;判斷N是否大於或等於一第二臨界值;當該色彩空間被判斷為該YUV色彩空間且N小於該第一臨界值,啟用該編碼模式,該編碼模式啟用一自適性色彩轉換(adaptive color transform,ACT);以及當N大於或等於該第二臨界值,於該編碼模式不估計該轉換 單元之尺寸,該編碼模式啟用該自適性色彩轉換。 The system of claim 13, wherein the processor for processing the set of instructions further comprises: determining whether N is less than a first threshold; determining whether N is greater than or equal to a second threshold; The space is determined to be the YUV color space and N is less than the first threshold, the encoding mode is enabled, the encoding mode enables an adaptive color transform (ACT); and when N is greater than or equal to the second threshold , the conversion is not estimated in the encoding mode The size of the unit that enables this adaptive color conversion. 如申請專利範圍第12項所述之系統,其中該處理器用以處理之該組指令更包括:若該原始視訊畫面係為非444(non-444)且該編碼單元之尺寸小於NxN,評估該轉換單元之尺寸。 The system of claim 12, wherein the processor is configured to process the set of instructions further comprising: if the original video frame is non-444 (non-444) and the size of the coding unit is less than NxN, The size of the conversion unit. 一種非暫態電腦可讀取記錄媒體,用以儲存一組指令,該組指令由一或多個處理器執行,以執行一視訊編碼方法,其中該視訊編碼方法包括:接收一原始視訊畫面(source video frame);分割該原始視訊畫面為一編碼樹單元(coding tree unit);從該編碼樹單元決定一編碼單元(coding unit);啟用或禁用該編碼單元之一編碼模式(coding mode);若啟用該編碼模式,則於啟用之該編碼模式判斷是否估計一轉換單元(transform unit)之尺寸;以及於啟用之該編碼模式決定該編碼單元之該轉換單元;其中該編碼單元之尺寸係為NxN。 A non-transitory computer readable recording medium for storing a set of instructions executed by one or more processors to perform a video encoding method, wherein the video encoding method comprises: receiving an original video frame ( Source video frame); dividing the original video picture into a coding tree unit; determining a coding unit from the coding tree unit; enabling or disabling a coding mode of the coding unit; If the encoding mode is enabled, determining whether to optimize a size of a transform unit in the encoding mode enabled; and determining the converting unit of the encoding unit when the encoding mode is enabled; wherein the size of the encoding unit is NxN.
TW105114323A 2015-06-08 2016-05-09 Video encoding methods and systems using adaptive color transform TWI597977B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610357374.8A CN106254870B (en) 2015-06-08 2016-05-26 Video encoding method, system and computer-readable recording medium using adaptive color conversion
US15/196,108 US10390020B2 (en) 2015-06-08 2016-06-29 Video encoding methods and systems using adaptive color transform

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201562172256P 2015-06-08 2015-06-08
US14/757,556 US20160360205A1 (en) 2015-06-08 2015-12-24 Video encoding methods and systems using adaptive color transform
US201662290992P 2016-02-04 2016-02-04

Publications (2)

Publication Number Publication Date
TW201711477A true TW201711477A (en) 2017-03-16
TWI597977B TWI597977B (en) 2017-09-01

Family

ID=57752462

Family Applications (1)

Application Number Title Priority Date Filing Date
TW105114323A TWI597977B (en) 2015-06-08 2016-05-09 Video encoding methods and systems using adaptive color transform

Country Status (2)

Country Link
JP (1) JP6670670B2 (en)
TW (1) TWI597977B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020253864A1 (en) * 2019-06-21 2020-12-24 Beijing Bytedance Network Technology Co., Ltd. Selective enablement of adaptive in-loop color-space transform in video coding
US11445206B2 (en) * 2020-04-16 2022-09-13 Tencent America LLC Method and apparatus for video coding

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2688086T3 (en) * 2010-11-26 2018-10-30 Nec Corporation Video decoding device, video decoding method and program
US9380319B2 (en) * 2011-02-04 2016-06-28 Google Technology Holdings LLC Implicit transform unit representation
JP5719410B2 (en) * 2013-07-25 2015-05-20 日本電信電話株式会社 Image coding method, image coding apparatus, and image coding program
JP6176044B2 (en) * 2013-10-07 2017-08-09 日本電気株式会社 Block structure determination circuit and information compression circuit
JP2015076765A (en) * 2013-10-09 2015-04-20 キヤノン株式会社 Image processing device, control method thereof, and computer program

Also Published As

Publication number Publication date
TWI597977B (en) 2017-09-01
JP2017005688A (en) 2017-01-05
JP6670670B2 (en) 2020-03-25

Similar Documents

Publication Publication Date Title
CN106254870B (en) Video encoding method, system and computer-readable recording medium using adaptive color conversion
US10390020B2 (en) Video encoding methods and systems using adaptive color transform
US20160360205A1 (en) Video encoding methods and systems using adaptive color transform
EP3158751B1 (en) Encoder decisions based on results of hash-based block matching
CN105409221B (en) Encoder-side decision for sample adaptive offset filtering
TWI737549B (en) Video decoding method
US10735725B2 (en) Boundary-intersection-based deblock filtering
JP6355715B2 (en) Encoding device, decoding device, encoding method, decoding method, and program
KR102606414B1 (en) Encoder, decoder and corresponding method to derive edge strength of deblocking filter
EP3095239A1 (en) Intra block copy prediction with asymmetric partitions and encoder-side search patterns, search ranges and approaches to partitioning
WO2015139165A1 (en) Encoder-side decisions for screen content encoding
US9294765B2 (en) Video encoder with intra-prediction pre-processing and methods for use therewith
JP7463460B2 (en) Video decoding method and video decoder
CN111327904B (en) Image reconstruction method and device
JP7351908B2 (en) Encoder, decoder, and corresponding method of deblocking filter adaptation
EP3114837B1 (en) Flicker detection and mitigation in video coding
EP3104606B1 (en) Video encoding methods and systems using adaptive color transform
EP2890128A1 (en) Video encoder with block merging and methods for use therewith
TWI597977B (en) Video encoding methods and systems using adaptive color transform
CN112055211B (en) Video encoder and QP setting method
CN110771166A (en) Apparatus and method for video encoding
CN111327894B (en) Block division method, video coding and decoding method and video coder and decoder
EP2899975A1 (en) Video encoder with intra-prediction pre-processing and methods for use therewith
CN114598873B (en) Decoding method and device for quantization parameter