TWI482501B

TWI482501B - Fast mode decision method for scalable video coding multi-layer encoder control

Info

Publication number: TWI482501B
Application number: TW101100464A
Authority: TW
Inventors: Wen Hsiao Peng; Ti Hao Chiang; Ruen Shiu Tsai
Original assignee: Ambarella Taiwan Ltd; Univ Nat Chiao Tung
Priority date: 2012-01-05
Filing date: 2012-01-05
Publication date: 2015-04-21
Also published as: TW201330622A

Description

Adjustable video multi-layer coding control fast decision mode algorithm

本發明係有關於一種視訊編碼演算法，更特別指一種可調性視訊多層編碼控制快速決策模式演算法。藉由使用時間性編碼模式分數(Temporal Mode Score,簡稱TMS)之概念，以更準確的預測出增進層的編碼模式，並且準確的找出基礎層和增進層的編碼模式並可隨視訊序列的性質做不同的配對計算，具有較低編碼失真、減少編碼時間、可調性及具較佳位元率-失真效能(Rate-Distortion performance)之編碼模式等優點。The invention relates to a video coding algorithm, and more particularly to an adjustable video multi-layer coding control fast decision mode algorithm. By using the concept of Temporal Mode Score (TMS) to more accurately predict the coding mode of the enhancement layer, and accurately find the coding mode of the base layer and the enhancement layer and can follow the video sequence. The nature of the different pairing calculations, with lower coding distortion, reduced encoding time, adjustability and encoding mode with better bit rate-distortion performance (Rate-Distortion performance).

由於多變容量的不同設備以及網路上的視訊撥放服務越來越多，單一視訊編碼已不足以應付這些項目。基於這個理由，Joint video Team(JVT)以H.264/AVC為基礎，制定了一套新的標準-可調性視訊編碼(Scalable Video Coding,簡稱SVC)，其提供了單一位元串流擁有在空間性、時間性、品質性以及三種之間組合的可調性，利用這種可調性性質對單一位元串流去做編碼，可得到不同解析度以及不同品質的服務。Due to the increasing variety of devices with varying capacities and the increasing number of video playback services on the network, single video encoding is no longer sufficient for these projects. For this reason, the Joint Video Team (JVT) has developed a new standard, Scalable Video Coding (SVC) based on H.264/AVC, which provides a single bit stream. The spatial, temporal, quality, and tunability of the combination between the three, using this tunable property to encode a single bit stream, can get different resolution and different quality services.

可調性視訊編碼擁有單一基礎層和一個或多個增進層，其增進層盡可能的利用基礎層的資訊，且一次編碼多層可以達到減少位元率並且也可達到預期中的畫面品質。傳統的可調性視訊編碼在決定基礎層和增進層的編碼模式時，共用式可調性視訊編碼(Joint Scalable video Model,簡稱JSVM)是採取由下而上的編碼方法(Bottom up encoder control)。在這種編碼方法中，只考慮到兩層之間較低層的效能。故傳統的編碼方法，將會造成兩層之間較高層出現較嚴重的失真，造成在選擇基礎層(Base Layer,簡稱BL)的編碼模式時，完全沒有考慮到增進層(Enhancement Layer,簡稱EL)的編碼效能，因此會導致增進層有不可預期的失真錯誤產生。The tunable video coding has a single base layer and one or more enhancement layers, and the enhancement layer utilizes the information of the base layer as much as possible, and the coded multi-layer can achieve a reduced bit rate and can also achieve the expected picture quality. Traditional Adjustable Video Coding When determining the coding mode of the base layer and the enhancement layer, the Joint Scalable Video Model (JSVM) adopts a Bottom up encoder control. . In this encoding method, only the performance of the lower layer between the two layers is considered. Therefore, the traditional coding method will cause more serious distortion between the two layers, resulting in the selection of the base layer (BL) coding mode, without considering the enhancement layer (Enhancement Layer, EL for short). The coding efficiency, therefore, leads to unpredictable distortion errors in the enhancement layer.

Schwarz et al.在早期提供了一種可以在基礎層和增進層之間給予適當權重的方法，稱做多層式編碼控制(Multilayer Encoder Control)。在做編碼模式的決策時，基礎層使用一種權重式的Lagrangian成本決策方法。如圖1，在多層式編碼模式決策時，其基本單位是一對具觀區塊(Macroblock,MB)組合，分別來自基礎層和增進層相對位置的具觀區塊。對於兩層之間的編碼模式，其配對組合一共有56組，若計算這56組的編碼模式，並在其中找出一組最好的，比起傳統由下而上的編碼方法，明顯有很大的計算量，由此可見一個好的快速決策模式方法顯得非常重要。In the early days, Schwarz et al. provided a method for giving appropriate weights between the base layer and the promotion layer, called Multilayer Encoder Control. When making coding mode decisions, the base layer uses a weighted Lagrangian cost decision method. As shown in Fig. 1, in the multi-layer coding mode decision, the basic unit is a pair of viewing blocks (Macroblock, MB), which are derived from the base layer and the relative position of the promotion layer. For the coding mode between the two layers, there are a total of 56 pairs. If you calculate the coding mode of these 56 groups and find the best one among them, compared with the traditional bottom-up coding method, there is obviously A lot of calculations, it can be seen that a good fast decision mode approach is very important.

Schwarz et al.又提出了一個兩段式快速編碼模式決策演算法。其做法是當基礎層的具觀區塊做編碼模式決策時，每一個編碼模式都會有一個增進層的編碼模式做為配對，此種配對稱為主導性配對(Dominant Mode Pair)。如圖2所示，其配對方法是當基礎層選擇畫面間預測(Inter Prediciton)或畫面內預測(Intra Prediction)的編碼模式時，增進層對應選擇BL_skip或IntraBL做為預測，然後基礎層利用這組配對去計算位元率-失真效能成本(Rate-Distortion-cost,簡稱RD-cost)，並選擇RD-cost最小的編碼模式做為基礎層的編碼模式，最後在增進層中對應的具觀區塊再根據選定好的基礎層編碼模式去做完整搜尋，找出最小的RD-cost編碼模式做壓縮。但此方法存在一些嚴重的問題：Schwarz et al. proposed a two-stage fast coding mode decision algorithm. The approach is that when the base layer of the base layer makes the coding mode decision, each coding mode has a coding mode of the enhancement layer as a pairing. This pairing is called Dominant Mode Pair. As shown in FIG. 2, the pairing method is that when the base layer selects an inter prediction mode (Inter Prediciton) or an intra prediction (Intra Prediction) coding mode, the promotion layer correspondingly selects BL_skip or IntraBL as a prediction, and then the base layer utilizes this. Group pairing is used to calculate the bit rate-distortion-cost (Rate-Distortion-cost, RD-cost for short), and the coding mode with the smallest RD-cost is selected as the coding mode of the base layer, and finally the corresponding view in the promotion layer The block then performs a complete search based on the selected base layer coding mode, and finds the smallest RD-cost coding mode for compression. But this method has some serious problems:

(1)在選擇基礎層中每一個具觀區塊的編碼模式時，都是使用圖2中的7組配對組合去做計算，並不會依照視訊序列的特性做調整。這種配對方式太過於單調，不具任何可調性。(1) When selecting the coding mode of each block in the base layer, the 7 sets of pairing combinations in Fig. 2 are used for calculation, and the adjustment is not performed according to the characteristics of the video sequence. This pairing method is too monotonous and does not have any adjustability.

(2)此配對模式因為只是56組配對模式中的7組，所以此配對模式與實際的編碼模式可能並不相同。若我們利用此方法去預估增進層的編碼模式，可能與實際增進層的編碼模式相差太大，因此會導致基礎層在做編碼模式的選擇時出現錯誤，若再用此錯誤的基礎層編碼模式去找增進層中的編碼模式，將會造成不可預期的嚴重失真。(2) This pairing mode may be different from the actual encoding mode because it is only 7 of the 56 pairing modes. If we use this method to estimate the coding mode of the enhancement layer, it may be too different from the coding mode of the actual enhancement layer, so it will cause the base layer to make an error in the selection of the coding mode. If this error is used, the base layer coding is used again. The mode to find the coding mode in the enhancement layer will cause unpredictable severe distortion.

鑒於先前技術之缺點，有必要提出一種可調整性影像處理方法，藉由時間性之編碼模式分數之使用，可以減少其編碼時所需之運算量並可解決上述所提及之問題。In view of the shortcomings of the prior art, it is necessary to propose an adjustable image processing method. By using the temporal coding mode score, the amount of computation required for encoding can be reduced and the above mentioned problems can be solved.

鑒於上述先前技術之缺點，本發明之主要目的即在於提供一種可調性視訊多層編碼控制快速決策模式演算法，可以判斷小波轉換的次頻帶是否適合方向性拆解的方法，再根據所有方向次頻帶上的訊號特性，提出新的相鄰表，來達成更佳的壓縮效果。In view of the above disadvantages of the prior art, the main object of the present invention is to provide a scalable video multi-layer coding control fast decision mode algorithm, which can determine whether the sub-band of wavelet transform is suitable for directional disassembly, and then according to all directions. The signal characteristics on the frequency band propose a new adjacent table to achieve better compression.

為達上述目的，本發明提供一種可調性視訊多層編碼控制快速決策模式演算法，用於一影像訊號之壓縮與編碼，該方法至少包含下列步驟：步驟(1)：預測一假設之第一畫面於增進層之具觀區塊之編碼模式；步驟(2)：預測該假設之第一畫面於增進層之具觀區塊之編碼模式，利用一假設之第二畫面於增進層之具觀區塊之編碼模式或複數個假設之第二畫面於增進層之具觀區塊之編碼方式之周圍具觀區塊之編碼模式；步驟(3)：利用一時間性編碼模式之計算方法，對該假設之第二畫面於增進層之具觀區塊之編碼方式及該假設之複數個第二畫面於增進層之具觀區塊之編碼模式之周圍具觀區塊之編碼模式進行計算，可得一時間性之編碼模式分數；步驟(4)：根據該時間性之編碼模式分數及複數個視訊續列內容，決定一機率分佈；步驟(5)：根據該機率分佈，決定一第一畫面於增進層之具觀區塊之編碼模式與一第一畫面於基礎層之具觀區塊之編碼模式之可能組合；步驟(6)：針對該第一畫面於增進層之具觀區塊之編碼模式與該第一畫面於基礎層之具觀區塊之編碼模式之可能組合，進行位元率-失真成本之計算；步驟(7)：根據具有一最小位元率-失真成本之第一畫面於基礎層之具觀區塊之編碼模式，決定一第一畫面於基礎層之具觀區塊之編碼模式；步驟(8)：根據該第一畫面於基礎層之具觀區塊之編碼模式與所有可行之該第一畫面於增進層之具觀區塊之編碼模式，進行位元率-失真成本之計算；步驟(9)：根據具有一最小位元率-失真成本之第一畫面於增進層之具觀區塊之編碼模式，決定一第一畫面於增進層之具觀區塊之編碼模式。To achieve the above objective, the present invention provides an adjustable video multi-layer coding control fast decision mode algorithm for compression and encoding of an image signal, the method comprising at least the following steps: Step (1): predicting the first of a hypothesis The coding mode of the viewing block in the enhancement layer; step (2): predicting the coding mode of the first picture of the hypothesis in the enhancement layer, and using the second picture of the hypothesis in the enhancement layer The coding mode of the block or the second picture of the plurality of hypotheses has an encoding mode of the viewing block around the encoding mode of the viewing block of the enhancement layer; step (3): using a calculation method of a temporal coding mode, The second picture of the hypothesis is calculated by the encoding mode of the viewing block of the enhancement layer and the plurality of second pictures of the hypothesis, and the coding mode of the viewing block with the viewing block of the enhancement layer is calculated. Obtaining a temporal coding mode score; step (4): determining a probability distribution according to the temporal coding mode score and a plurality of video continuation contents; and step (5): determining a first according to the probability distribution a possible combination of the coding mode of the viewing block in the enhancement layer and the coding mode of the viewing block of the first picture in the base layer; step (6): viewing blocks in the enhancement layer for the first picture The coding mode is combined with the possible coding mode of the first picture in the base layer, and the bit rate-distortion cost is calculated; step (7): according to the first bit rate-distortion cost A coding mode of the viewing block in the base layer determines a coding mode of the first picture in the base layer; and step (8): according to the first picture, the view block in the base layer Encoding mode and all feasible coding modes of the first picture in the enhancement layer, performing bit rate-distortion cost calculation; step (9): according to having a minimum bit rate-distortion cost first The coding mode of the picture in the enhancement layer determines the coding mode of the first picture in the enhancement layer.

綜上所述，本發明之一種可調性視訊多層編碼控制快速決策模式演算法將具有以下之功效：In summary, the adjustable video multi-layer coding control fast decision mode algorithm of the present invention has the following effects:

1.利用時間(Temporal)的性質去預測增進層中的編碼模式，再將這個預測和基礎層的編碼模式做配對。先去預測增進層的編碼模式，因此沒有固定的Dominant Mode Pair概念，在選擇基礎層的編碼模式時，不同的具觀區塊可能會搭配不同的增進層編碼模式做計算，可減少編碼時間。1. Use the nature of Temporal to predict the coding mode in the enhancement layer, and then pair this prediction with the coding mode of the base layer. First, we predict the coding mode of the enhancement layer. Therefore, there is no fixed Dominant Mode Pair concept. When selecting the coding mode of the base layer, different viewing blocks may be calculated with different enhancement layer coding modes to reduce the coding time.

2.提供時間性編碼模式分數(Temporal Mode Score,簡稱TMS)的概念，TMS可以更準確的預測出增進層的編碼模式，並且準確的找出基礎層和增進層的編碼模式。2. Providing the concept of Temporal Mode Score (TMS), TMS can more accurately predict the coding mode of the enhancement layer, and accurately find the coding mode of the base layer and the enhancement layer.

3.根據視訊序列的性質做調整，每一個具觀區塊都先去增進層預測出機率最大的編碼模式做為基礎層的配對，因此具有可調性。3. According to the nature of the video sequence, each of the viewing blocks first adjusts the encoding mode with the highest probability of the layer prediction as the base layer pairing, so it is adjustable.

為讓本發明之上述和其他目的、特徵、和優點能更明顯易懂，下文特舉數個較佳實施例，並配合所附圖式，作詳細說明如下。The above and other objects, features, and advantages of the present invention will become more apparent and understood.

雖然本發明可表現為不同形式之實施例，但附圖所示者及於下文中說明者係為本發明可之較佳實施例，並請了解本文所揭示者係考量為本發明之一範例，且並非意圖用以將本發明限制於圖示及/或所描述之特定實施例中。While the invention may be embodied in various forms, the embodiments illustrated in the drawings It is not intended to limit the invention to the particular embodiments illustrated and/or described.

現請參考第3圖，其顯示本發明之快速決策模式方法之架構示意圖。圖中的A是基礎層中目前畫面的具觀區塊、B是增進層中相對於A位置的具觀區塊而C是在增進層中的前一張畫面相對於B的九個具觀區塊位置。Referring now to Figure 3, there is shown a schematic diagram of the architecture of the fast decision mode method of the present invention. In the figure, A is the viewing block of the current picture in the base layer, B is the viewing block relative to the A position in the promotion layer, and C is the nine pictures of the previous picture in the promotion layer relative to B. Block location.

現請搭配參考第4圖，其顯示本發明之可調性視訊多層編碼控制快速決策模式演算法之步驟流程圖，主要包含九個步驟。Referring now to Figure 4, there is shown a flow chart of the steps of the adjustable video multi-layer coding control fast decision mode algorithm of the present invention, which mainly comprises nine steps.

首先於步驟(1)中，其為預測一假設之第一畫面於增進層之具觀區塊之編碼模式，主要由於在尋找基礎層之具觀區塊之編碼模式時，必須先預測於增進層之具觀區塊之編碼模式。Firstly, in step (1), it is a coding mode for predicting the first picture of the hypothesis in the enhancement layer, mainly because it is necessary to predict the enhancement when searching for the coding mode of the observation layer of the base layer. The coding mode of the layer with the view block.

於步驟(2)中，預測該假設之第一畫面於增進層之具觀區塊之編碼模式，其利用一假設之第二畫面於增進層之具觀區塊之編碼模式或複數個假設之第二畫面於增進層之具觀區塊之編碼方式之周圍具觀區塊之編碼模式。In the step (2), the coding mode of the first picture of the hypothesis is predicted in the enhancement layer, and the second picture of the hypothesis is used in the coding mode of the enhancement layer or the plurality of hypotheses. The second picture has an encoding mode of the viewing block around the encoding mode of the viewing block of the enhancement layer.

於步驟(3)中，其利用一時間性編碼模式之計算方法，對該假設之第二畫面於增進層之具觀區塊之編碼方式及該假設之複數個第二畫面於增進層之具觀區塊之編碼模式之周圍具觀區塊之編碼模式進行計算，可得一時間性之編碼模式分數(Temporal Mode Score,簡稱TMS)。In the step (3), the calculation method of the temporal coding mode is used, and the second picture of the hypothesis is encoded in the enhancement layer and the second picture of the hypothesis is in the enhancement layer. The coding mode of the viewing block is calculated by the coding mode of the viewing block, and a Temporal Mode Score (TMS) can be obtained.

現請參考第5圖，其顯示本發明之增進層(EL)中之一個具觀區塊於第二畫面之所對應九格位置所選擇的編碼模式，該圖顯示第一畫面基礎層的一個具觀區塊在第一畫面增進層所對應到的具觀區塊位置為E。亦即，若要決策基礎層一個具觀區塊的編碼模式時，需要先找出此該E位置的具觀區塊所預測之編碼模式，作為第一畫面基礎層的一個配對。此外，第一畫面E位置假設之編碼模式為利用第二畫面於增進層之具觀區塊之編碼方式及該假設之複數個第二畫面於增進層之具觀區塊之編碼模式之周圍具觀區塊之編碼模式。亦即，圖中E所假設之編碼模式就是利用圖中中A ^' ~I ^' 位置之編碼模式，搭配圖6(a)之具觀區塊相對位置之時間性編碼模式之計算方法所計算出來，進而可得到圖6(b)之各種編碼模式之時間性之編碼模式分數示意圖。Referring now to FIG. 5, there is shown an encoding mode selected by a viewing block (EL) of the present invention in a corresponding nine-frame position of the second picture, the figure showing one of the first picture base layers. The position of the viewing block corresponding to the viewing layer in the first picture enhancement layer is E. That is, if a coding mode with a viewing block is to be determined in the base layer, it is necessary to first find the encoding mode predicted by the viewing block of the E position as a pairing of the first picture base layer. In addition, the coding mode of the first picture E position hypothesis is that the coding mode of the viewing block using the second picture in the enhancement layer and the plurality of second pictures of the hypothesis are around the coding mode of the viewing block of the enhancement layer. The coding mode of the block. That is, the coding mode assumed by E in the figure is calculated by using the coding mode of the A ^' ~ I ^' position in the figure, and the calculation method of the temporal coding mode with the relative position of the block in Fig. 6(a). Further, a schematic diagram of the temporal coding mode scores of the various coding modes of FIG. 6(b) can be obtained.

需注意的是，其中該時間性編碼模式之計算方法為TMS =A ^' ×1+B ^' ×2+C ^' ×1+D ^' ×2+E ^' ×4+F ^' ×2+G ^' ×1+H ^' ×2+I ^' ×1，此外，其A’、B’、C’、D’、E’、F’、G’、H’、I’ 為該假設之第二畫面於增進層之具觀區塊之編碼模式所對應九格位置所選擇的編碼模式。該時間性之編碼模式分數之範圍介於0至64。若A’、B’、C’、D’、E’、F’、G’、H’、I’ 出現畫面內預測(Intra Prediciton)之編碼模式，其值為0。It should be noted that the calculation method of the temporal coding mode is TMS = A ^' × 1 + B ^' × 2 C ^' × 1 + D ^' × 2 E ^' × 4 + F ^' × 2 G ^' × 1+ H ^' × 2 I ^' ×1, in addition, A', B', C', D', E', F', G', H', I' are the second picture of this hypothesis The coding mode selected by the nine-grid position corresponding to the coding mode of the layer. The temporal coding mode score ranges from 0 to 64. If A', B', C', D', E', F', G', H', I' appear in the intra-prediction mode, the value is 0.

於步驟(4)中，其根據該時間性之編碼模式分數及複數個視訊續列內容，決定一機率分佈。該複數個視訊續列內容係包含Akiyo、Carphone、City、Coastguard、Highway、Stefan及Walk視訊續列。In step (4), the probability distribution is determined according to the temporal coding mode score and the plurality of video continuation contents. The plurality of video continuation contents include Akiyo, Carphone, City, Coastguard, Highway, Stefan, and Walk video.

於步驟(5)中，其根據該機率分佈，決定一第一畫面於增進層之具觀區塊之編碼模式與一第一畫面於基礎層之具觀區塊之編碼模式之可能組合。In the step (5), according to the probability distribution, a possible combination of the coding mode of the viewing block of the first picture in the enhancement layer and the coding mode of the viewing block of the first picture in the base layer is determined.

於步驟(6)中，其針對該第一畫面於增進層之具觀區塊之編碼模式與該第一畫面於基礎層之具觀區塊之編碼模式之可能組合，進行位元率-失真成本之計算。In step (6), the bit rate-distortion is performed for a possible combination of the encoding mode of the viewing block of the first picture in the enhancement layer and the encoding mode of the viewing layer of the first picture in the base layer. Calculation of cost.

於步驟(7)中，其根據具有一最小位元率-失真成本之第一畫面於基礎層之具觀區塊之編碼模式，決定一第一畫面於基礎層之具觀區塊之編碼模式。In the step (7), the coding mode of the viewing block of the first picture in the base layer is determined according to the coding mode of the first picture in the base layer with a minimum bit rate-distortion cost. .

於步驟(8)中，其根據該第一畫面於基礎層之具觀區塊之編碼模式與所有可行之該第一畫面於增進層之具觀區塊之編碼模式，進行位元率-失真成本之計算。In step (8), the bit rate-distortion is performed according to the coding mode of the first picture in the base layer and the coding mode of the viewable block in the enhancement layer of the first picture. Calculation of cost.

於步驟(9)中，其根據具有一最小位元率-失真成本之第一畫面於增進層之具觀區塊之編碼模式，決定一第一畫面於增進層之具觀區塊之編碼模式。In the step (9), the coding mode of the viewing block of the first picture in the enhancement layer is determined according to the coding mode of the first picture in the enhancement layer of the first picture having a minimum bit rate-distortion cost. .

為更了解本發明之精神，此處以一第一實施例進行快速決策模式方法之說明。若假設之第一畫面於增進層之具觀區塊之編碼模式為Inter 8x8，請同時參考第6(a)圖及第7圖，並計算出假設之第二畫面於增進層之具觀區塊之編碼方式及該假設之複數個第二畫面於增進層之具觀區塊之編碼模式之周圍具觀區塊之編碼模式之TMS，計算方式如下：In order to better understand the spirit of the present invention, a description of the fast decision mode method is made herein with a first embodiment. If the coding mode of the first picture in the enhancement layer is Inter 8x8, please refer to the 6th (a) and 7th pictures, and calculate the hypothetical second picture in the enhancement layer. The encoding mode of the block and the TMS of the plurality of second pictures of the hypothesis in the coding mode of the viewing block of the enhancement layer with the viewing block are calculated as follows:

TMS =0×1+1×2+1×1+1×2+4×4+2×2+2×1+2×2+1×1=32 TMS =0×1+1×2+1×1+1×2+4×4+2×2+2×1+2×2+1×1=32

接著，請參考第8圖依據TMS=32所對應的機率分佈，可找出機率最高的編碼模式做為第一畫面於增進層之具觀區塊之編碼模式之可能組合。此處則以BL_skip做為第一畫面於增進層之具觀區塊之編碼模式之可能組合。Next, please refer to Figure 8 according to the probability distribution corresponding to TMS=32, and find the most probable coding mode as the possible combination of the coding mode of the first picture in the enhancement layer. Here, BL_skip is used as the possible combination of the coding mode of the viewing block of the first layer in the enhancement layer.

現請參考第9圖，根據第一畫面於增進層之具觀區塊之編碼模式之可能組合為BL_skip，如圖中實線框所示。接著，針對實線框中之各組可能之編碼模式進行位元率-失真成本之計算，並以具有最小之位元率-失真成本做為第一畫面於基礎層之具觀區塊之編碼模式。Referring now to FIG. 9, the possible combination of the coding modes of the viewing blocks in the enhancement layer according to the first picture is BL_skip, as shown by the solid line in the figure. Then, the bit rate-distortion cost is calculated for each possible coding mode of the solid line frame, and the code with the smallest bit rate-distortion cost is used as the first picture in the base layer. mode.

請再次參考第9圖，根據第一畫面於基礎層之具觀區塊之編碼模式進行位元率-失真成本之計算，此處以16×16做為第一畫面於基礎層之具觀區塊之編碼模式，如虛線框所示，針對虛線框中之各組可能之編碼模式進行位元率-失真成本之計算，並以具有最小之位元率-失真成本做為第一畫面於增進層之具觀區塊之編碼模式。Please refer to FIG. 9 again, and calculate the bit rate-distortion cost according to the coding mode of the first layer in the base layer. Here, 16×16 is used as the first picture in the base layer. The coding mode, as indicated by the dashed box, calculates the bit rate-distortion cost for each possible coding mode in the dashed box, and uses the minimum bit rate-distortion cost as the first picture in the promotion layer. The coding mode of the block.

現請再次參考第7圖，現請參考圖中括號之部分，亦即出現畫面內預測的情況下，其TMS的計算方式如下：Please refer to Figure 7 again. Please refer to the brackets in the figure, that is, in the case of intra-picture prediction, the TMS is calculated as follows:

TMS =0×1+1×2+1×1+0×2+4×4+2×2+2×1+0×2+0×1=25 TMS =0×1+1×2+1×1+0×2+4×4+2×2+2×1+0×2+0×1=25

由於有畫面內預測的編碼模式存在，故須進行正規化的動作。其方法為：Since there is an intra-picture prediction coding mode, it is necessary to perform normalization. The method is:

Round [(25/11)×16]=36，其中Round 為四捨五入之運算，除以11是因其於九格的位置中，畫面間預測編碼模式位置的分數加總為11。故此處以TMS=36所對應的機率分布，找出機率最高的編碼模式做為基礎層的配對。 Round [(25/11)×16]=36, where Round is the rounding operation, and dividing by 11 is due to its position in the nine grids, and the score of the inter-picture predictive coding mode position is increased to 11. Therefore, with the probability distribution corresponding to TMS=36, the coding mode with the highest probability is found as the base layer pairing.

綜上所述，本發明之一種可調性視訊編碼之多層編碼控制之快速決策模式方將具有以下之功效：In summary, the fast decision mode of the multi-layer coding control of the adjustable video coding of the present invention has the following effects:

1.　利用時間(Temporal)的性質去預測增進層中的編碼模式，再將這個預測和基礎層的編碼模式做配對。先去預測增進層的編碼模式，因此沒有固定的Dominant Mode Pair概念，在選擇基礎層的編碼模式時，不同的具觀區塊可能會搭配不同的增進層編碼模式做計算，可減少編碼時間。1. Use the nature of Temporal to predict the coding mode in the enhancement layer, and then pair this prediction with the coding mode of the base layer. First, we predict the coding mode of the enhancement layer. Therefore, there is no fixed Dominant Mode Pair concept. When selecting the coding mode of the base layer, different viewing blocks may be calculated with different enhancement layer coding modes to reduce the coding time.

2.　提供時間性編碼模式分數(Temporal Mode Score,TMS)的概念，TMS可以更準確的預測出增進層的編碼模式，並且準確的找出基礎層和增進層的編碼模式。2. Provide the concept of Temporal Mode Score (TMS). TMS can more accurately predict the coding mode of the enhancement layer and accurately find the coding mode of the base layer and the enhancement layer.

3.　根據視訊序列的性質做調整，每一個具觀區塊都先去增進層預測出機率最大的編碼模式做為基礎層的配對，因此具有可調性。3. According to the nature of the video sequence, each of the viewing blocks first adjusts the encoding mode with the highest probability of the layer prediction as the base layer pairing, so it is adjustable.

雖然本發明已以前述較佳實施例揭示，然其並非用以限定本發明，任何熟習此技藝者，在不脫離本發明之精神和範圍內，當可作各種之更動與修改。如上述的解釋，都可以作各型式的修正與變化，而不會破壞此創作的精神。因此本發明之保護範圍當視後附之申請專利範圍所界定者為準。While the present invention has been described in its preferred embodiments, it is not intended to limit the scope of the invention, and various modifications and changes can be made without departing from the spirit and scope of the invention. As explained above, all types of corrections and changes can be made without destroying the spirit of this creation. Therefore, the scope of the invention is defined by the scope of the appended claims.

第1圖係為習之技術之基礎層(BL)及增進層(EL)之編碼模式；Figure 1 is the coding mode of the base layer (BL) and the enhancement layer (EL) of the technology of the prior art;

第2圖係為習之技術之由Schwarz et al.提出之演算法編碼模式之配對規則；Figure 2 is the pairing rule of the coding mode of the algorithm proposed by Schwarz et al.

第3圖係為本發明之可調性視訊多層編碼控制快速決策模式演算法之架構示意圖；Figure 3 is a schematic diagram showing the architecture of the adjustable video multi-layer coding control fast decision mode algorithm of the present invention;

第4圖係為本發明之快速決策模式方法之步驟流程圖；Figure 4 is a flow chart showing the steps of the fast decision mode method of the present invention;

第5圖係為本發明之增進層(EL)中之一個具觀區塊於第二畫面之所對應九格位置所選擇的編碼模式；Figure 5 is a coding mode selected by a viewing block in the enhancement layer (EL) of the present invention at a position corresponding to nine locations of the second picture;

第6圖係為本發明之(a)具觀區塊相對位置之時間性編碼模式之計算分數及(b)各種編碼模式的時間性之編碼模式分數示意圖；Figure 6 is a schematic diagram of the (a) calculated fraction of the temporal coding mode of the relative position of the viewing block and (b) the temporal coding pattern score of the various coding modes;

第7圖係為本發明之第一實施例之九格對應編碼模式；Figure 7 is a nine-frame corresponding coding mode of the first embodiment of the present invention;

第8圖係為本發明之第一實施例之不同時間性之編碼模式分數對應複數個視訊續列之機率分佈圖；以及Figure 8 is a diagram showing the probability distribution of the plurality of video continuation sequences corresponding to the different temporal coding mode scores of the first embodiment of the present invention;

第9圖係為本發明之第一實施例之基礎層(BL)及增進層(EL)之編碼模式之組合示意圖。Figure 9 is a combination diagram of the coding modes of the base layer (BL) and the enhancement layer (EL) of the first embodiment of the present invention.

Claims

A tunable video multi-layer coding control fast decision mode algorithm for compressing and encoding an image signal, the method comprising at least the following steps: Step (1): predicting a first picture of a hypothesis in a viewing area of the enhancement layer Encoding mode of the block; step (2): using the second picture of the hypothesis in the coding mode of the viewing block of the enhancement layer or the second picture of the plurality of hypotheses in the surrounding of the coding mode of the enhancement layer Observing the coding mode of the block; step (3): using a calculation method of the temporal coding mode, the second picture of the hypothesis is encoded in the enhancement layer and the second picture of the hypothesis The coding mode of the viewing block of the enhancement layer is calculated by the coding mode of the viewing block, and a time-coded coding mode score is obtained; step (4): according to the temporal coding mode score and a plurality of video continuations Column content, determining a probability distribution; step (5): determining, according to the probability distribution, a coding mode of the first picture in the enhancement layer and a coding mode of the first picture in the base layer Possible combination; step (6): performing bit rate-distortion for the possible combination of the encoding mode of the viewing block of the first picture in the enhancement layer and the encoding mode of the first picture in the base layer Cost calculation; step (7): determining the encoding of the viewing block of the first picture in the base layer according to the coding mode of the first picture in the base layer with a minimum bit rate-distortion cost Mode; step (8): performing bit rate-distortion cost according to the coding mode of the first picture in the base layer and the coding mode of the viewable block of the first picture in the enhancement layer The calculation (step) (9): determining the coding mode of the viewing block of the first picture in the enhancement layer according to the coding mode of the first picture in the enhancement layer of the first picture with a minimum bit rate-distortion cost .

For example, the tunable video multi-layer coding control fast decision mode algorithm of the first application patent scope, wherein the time coding mode is calculated as TMS = A ^' × 1 + B ^' × 2 C ^' × 1 + D ^' ×2+ E ^' ×4+ F ^' ×2+ G ^' ×1+ H ^' ×2+ I ^' ×1; where A', B', C', D', E', F', G'H',I' is the coding mode selected by the second picture of the hypothesis in the nine-grid position corresponding to the coding mode of the viewing block of the enhancement layer.

For example, the tunable video multi-layer coding control fast decision mode algorithm of claim 2, wherein the action of step (1) is performed before searching for the coding mode of the base layer.

For example, the adjustable video multi-layer coding control fast decision mode algorithm of the second application of the patent scope 'A',B',C',D',E', F',G',H',I' is on the screen The value of Inter prediction (Inter Prediciton) or intra-prediction (Intra Prediciton) is 0.

For example, the tunable video multi-layer coding control fast decision mode algorithm of claim 1 of the patent scope, wherein the temporal coding mode score ranges from 0 to 64.

For example, the tunable video multi-layer coding control fast decision mode algorithm of claim 1 of the patent scope includes the Akiyo, Carphone, City, Coastguard, Highway, Stefan, and Walk video continuation.