TW200529650A

TW200529650A - Video coding method and apparatus thereof

Info

Publication number: TW200529650A
Application number: TW93104201A
Authority: TW
Inventors: Ming-Chieh Chi; Mei-Juan Chen
Original assignee: Leadtek Research Inc
Priority date: 2004-02-20
Filing date: 2004-02-20
Publication date: 2005-09-01
Also published as: TWI241130B

Abstract

A region-of-interest (ROI) video-coding method and apparatus based on fuzzy logic control for a video encoder is provided. Providing an image having a plurality of region-of-interest regions and a plurality of non-region-of-interest regions, the first step is to separate the region-of-interest regions and the non-region-of-interest regions from the image. Then by sending the region-of-interest regions from an input image to a fuzzy logic controller, in which the fuzzy logic controller performs fuzzy manipulations that enhance the quality of the region-of-interest regions, and therefore the region-of-interest quality of an output image will be improved. The method and apparatus are particularly useful in videophone and videoconferencing.

Description

200529650 五、發明說明α) 發明所屬之技術領域本發明是有關於一種改善影像品質的方法與裝置，且特別是，有關於一種用於影像編碼器的模糊邏輯控制的目標區影像編碼方法與其裝置。 1 先前技術近來，使用視訊會議與視訊電話的數位影像通訊的應用需求與曰倶增。然而，因為網路傳輸率有限，所以作為這些應用的極低位元率影像編碼，是' 種可用來降低圖像序列（picture sequence)的資料傳輸率（data rate)，而又不會降低其品質的重要技術。這些標準的大部分實施方式都是對每個方塊（b 1 oc k )的重要程度視為均等。雖然在相同圖像之内的不同方塊可能是以不同模式編碼，但沒有任何一方塊比另一方塊更重要。這種傳統模型並不適用於在影像序列（video sequence)的任何目標區（region-in-interest， R0I)應用。在H· 263 + 標準中，會調整在大方塊（macro-block, MB)層的失真加權參數（distortion weight parameter)與訊號變化，藉以控制不同區的品質。對應於相同重點區（focus area)的方塊會比在背景或其他不必要區中的方塊還更重要。雖然會犧牲背景或非重點區的品質，然而可以對使用者重視所在的區配置更多頻寬。對於像是視訊會議的影像序列而言，是一個好的編碼策略。除了 R〇 I具有較高品質之外，也可能會忽略部分背景資訊，藉以提高編碼速度。如最大位元傳輸（maximum bit transfer, MBT) —樣，背200529650 V. Description of the invention α) Technical field to which the invention belongs The present invention relates to a method and device for improving image quality, and in particular, to a method and device for image coding of a target area for fuzzy logic control of an image encoder . 1 Prior Technology Recently, the demand for digital video communication using video conferences and video phones has increased. However, because the network transmission rate is limited, as a very low bit rate image coding for these applications, it is a kind of data rate that can be used to reduce the picture sequence without reducing its data rate. Important technology of quality. Most implementations of these standards are considered equal for each block (b 1 oc k). Although different blocks within the same image may be coded in different modes, no one block is more important than the other. This traditional model is not suitable for application in any region-in-interest (ROI) of a video sequence. In the H · 263 + standard, the distortion weight parameter and signal change at the macro-block (MB) layer are adjusted to control the quality of different regions. Blocks that correspond to the same focus area are more important than blocks in the background or other unnecessary areas. Although it sacrifices the quality of the background or non-emphasis area, it is possible to allocate more bandwidth to the area where the user cares. It is a good coding strategy for video sequences like video conferences. In addition to the high quality of ROI, some background information may also be ignored to improve encoding speed. For example, maximum bit transfer (MBT)

12429TWF.PTD 第6頁 200529650 五、發明說明（2) 景總是以其中最粗糙的量化位準（(1113111：12以丨〇1116乂61) 編碼。對此，傳統上一般採用一種以區域為主 (region-based)的模糊法則（blurring algorithm) ’ 以降低極低位元率影像編石馬的位元率。另一種方法是使用可提昇R 0 I品質，並且降低編碼背景位元的對每一 R 0 1 Μ B 與非ROI MB的三個固定因數（fixed factors)，以大量改善R 0 I品質。本發明可根據模糊邏輯率控制而調適性地改善R〇 I品質，而且適用於即時性的視訊會議。模糊邏輯（fuzzy logic)首先疋由在柏克來 (Berkeley)工作的L· A· Zadeh在1 9 6 5年提出，而且是在自然人群達成三點解決方案之後才確定模型。第一點：對相同問題使用不同法則的解決方案。第二點：對相同問題同時使用一個以上的規則。第三點：接受特定程度的不確定性（imprecision)，這對達到可接受的解決方案而言是有相當助益的。很明顯的，在例*ΤΜΝ5、TMN8、等等的不同標準測试模型中所用的正常率控制法則是符合這二點的。在每個測試模型中，都有特定的數學解決方案’用以決定每個MB的量化參數（quantizati〇rl parameter)，而且可接受適當的不確定性 (inaccuracies)，藉以估算下一個〇的位元率。看起來模糊邏輯控制對於解決影像編碼的率控制是相當適當的。第1 a圖是一個習知的回饋控制系統丨〇〇的方塊示意圖。圖中的控制器根據一個處理的數學模型，或是數學12429TWF.PTD Page 6 200529650 V. Description of the invention (2) The scene is always coded at the roughest quantization level ((1113111: 12 to 〇〇11616 乂 61). For this reason, traditionally, a region is used Region-based blurring algorithm 'to reduce the bit rate of the very low bit-rate video editing stone horse. Another method is to improve the quality of R 0 I and reduce the encoding background bit. Three fixed factors of each R 0 1 MB and non-ROI MB to improve the quality of R 0 I. The present invention can adaptively improve the quality of R 0I according to the fuzzy logic rate control, and is applicable to Immediate video conference. Fuzzy logic was first proposed by AZ Zadeh, who worked at Berkeley in 1965, and was determined after the natural population reached a three-point solution Model. The first point: solutions using different rules for the same problem. The second point: using more than one rule for the same problem at the same time. The third point: accepting a certain degree of uncertainty (imprecision). The solution is quite helpful. Obviously, the normal rate control rules used in different standard test models such as * TMN5, TMN8, etc. are in line with these two points. In each test model There are specific mathematical solutions' to determine the quantizati rl parameter of each MB, and accept the appropriate inaccuracies to estimate the bit rate of the next 0. It seems Fuzzy logic control is quite suitable for solving the rate control of image coding. Figure 1a is a block diagram of a conventional feedback control system. The controller in the figure is based on a mathematical model of processing, or mathematics.

12429TWF.PTD 第7頁 200529650 五、發明說明（3) 關係的固定集合，決定接下來如何處理。第1 b圖是一個模糊邏輯控制系統1 5 0的方塊示意圖。模糊邏輯控制器1 5 0將由具相關經驗的操作者或系統工程師所制定的一組反應規則，當成其操作指南（g u i d e )。請參考第lb圖所示，量化器（quantizer)152從感測器157取得資料，並且將資料轉換成可為模糊邏輯控制器1 5 3所用的格式。模糊邏輯控制器1 5 3接下來執行計算，藉以決定特定資料的模糊狀態（f u z z y s i t u a t i ο η )。綜合上述說明，當資訊高速公路（information highway )以有限傳輸率開始開展時，就需要一種改善影像的方法。近來，已經有一種可改善影像品質的目標區 (R 0 I )方法。然而，目前的R 0 I方法的解決方案仍具有其性能上的障礙。因此，相當需要一種可獲得高品質視訊影像的方法或法則。發明内容有鑑於此，本發明之目的之一是，提供一種可用於改善，例如說，視訊電話及視訊會議應用中的影像品質需求的方法與裝置。為達成本發明上述及其他目的，本發明提供一種根據目標區（R0 I )與模糊邏輯控制的新方法與裝置，並且在此以實施例詳細說明。首先，該方法將一個影像（image)的複數個目標區與複數個非目標區分離。接下來，來自目標區的輸入會送到一個模糊邏輯控制，其中模糊邏輯控制是用來改善目12429TWF.PTD Page 7 200529650 V. Description of the invention (3) A fixed set of relationships determines how to proceed next. Figure 1b is a block diagram of a fuzzy logic control system 150. The fuzzy logic controller 150 uses a set of reaction rules formulated by an operator or system engineer with relevant experience as its operation guide (g u i d e). Referring to FIG. 1b, the quantizer 152 obtains data from the sensor 157 and converts the data into a format that can be used by the fuzzy logic controller 153. The fuzzy logic controller 1 5 3 then performs calculations to determine the fuzzy state of the particular data (f u z z y s i t u a t i ο η). Based on the above description, when the information highway begins to develop with a limited transmission rate, a method for improving the image is needed. Recently, there has been a target area (R 0 I) method that can improve image quality. However, the current R 0 I solution still has its performance obstacles. Therefore, there is a great need for a method or rule for obtaining high-quality video images. SUMMARY OF THE INVENTION In view of this, one object of the present invention is to provide a method and device that can be used to improve image quality requirements in, for example, video phone and video conference applications. In order to achieve the above and other objectives of the present invention, the present invention provides a new method and device based on the target area (R0 I) and fuzzy logic control, and will be described in detail with embodiments herein. First, the method separates a plurality of target regions of an image from a plurality of non-target regions. Next, the input from the target area is sent to a fuzzy logic control, where the fuzzy logic control is used to improve the objective.

12429TWF.PTD 第8頁 200529650 五、發明說明（4) 標區的品質，以及改善輸出影像的整體品質。在本發明一較佳實施例中，來自目標區的輸入是從來自目標區的第一控制輸入與第二控制輸入所計算而得。其中，第一控制輸入與第二控制輸入分別包括一個來自一個目前的第i個大方塊的第一變異數（first variance)與一個變異數差（variance difference) ° 變異數差是由將第一變異數減去前一個的第i-Ι個大方塊的第二變異數（second variance)，並且再除以第一變異數所得。第i個大方塊與第i - 1個大方塊代表在其中一個目標區之内的大方塊的序列，而且第i - 1個大方塊是第i個大方塊的前一個大方塊。在本發明另一較佳實施例中，模糊邏輯控制包括一個用來將控制輸入轉換成模糊判定（fuzzy predicates) 的法則。在本發明另一較佳實施例中，模糊邏輯控制包括一個控制功能’藉以計算一個用來決定主控制輸入的模糊狀態的语a從屬功能（linguistic membership function)。該控制功能使用一種中央面積區（center 0f a r e a， C 0 A )方法，來決定語言從屬功能之歸屬。在本發明另一實施例中，模糊邏輯控制包括用來設疋一個决桌位準（decisional level)與產生一個加權因數（weigh ted factor)的複數個探查表（1〇〇kup tables)，藉以加重其中—個目標區的品質。在本發明再另一實施例中，該些探查表包括複數個12429TWF.PTD Page 8 200529650 V. Description of the invention (4) The quality of the target area and the improvement of the overall quality of the output image. In a preferred embodiment of the present invention, the input from the target area is calculated from the first control input and the second control input from the target area. The first control input and the second control input respectively include a first variance and a variance difference from a current i-th large block. The variance of the variance is determined by the first The number of mutations is obtained by subtracting the second variance of the previous (i-1) th large square, and dividing by the first variance. The i-th large block and the i-1 large block represent a sequence of large blocks within one of the target areas, and the i-1 large block is the previous large block of the i-th large block. In another preferred embodiment of the present invention, the fuzzy logic control includes a rule for converting control inputs into fuzzy predicates. In another preferred embodiment of the present invention, the fuzzy logic control includes a control function 'for calculating a linguistic membership function for determining the fuzzy state of the main control input. This control function uses a central area (center 0f a r e a, C 0 A) method to determine the affiliation of the language subordinate function. In another embodiment of the present invention, the fuzzy logic control includes a plurality of probe tables (100kup tables) for setting a decisional level and generating a weighted factor. Aggravate the quality of one of the target areas. In still another embodiment of the present invention, the probe tables include a plurality of

12429TWF.PTD 第9頁 200529650 五、發明說明（5) 用來對其中一個其縮放探查表（scaled lookup tables) 目標區提供一種類似優先權（priority-like)品質中’縮放探查表是使用一個one-fixed與one- various從屬功能成形。綜合上述說明，本發明提供一種模糊控制的R 〇 I影像編碼。模糊控制的R〇 I影像編碼可適應性地調整影像的輸出品質:該方法可輕易地改善R〇I品質，保持固定位元器過溢（buffer 〇verfl〇w)，並且較習知技蓺更此以較低位凡率，輕易提供更佳品^ 碼可不需複雜運算，就沪女I并关— 夕$ κυ 1〜像編質。异就此大罝改善母一個R0I的輸出品為讓本發明之上述和其他明顯易懂，下文牿以钤处每Α 将徵、和優點能更詳細說明如下：、乂佳實轭例，並配合所附圖式，作實施方式以下將參考所附繪圖，例。口序、、、田說明本發明的較佳實施雖然在此以該〇tb會絲/丨抑限於該些實施例，；可以2 “ ：：：眚本發明並不受詳細内容。在下文中，目、 ^二者熟習本發明範疇及件。相冋的參考號碼代表相同的元首先，藉由模糊控制組成，包括（1)目標區It t衫像編碼可由兩部分與(2)模糊控制。請參考第2圖所麵 12429TWF.1 第10頁 200529650 五、發明說明（6) 示，一個目標區包括一個切割單元（segmentation)302。一個模糊邏輯控制器3 2 0包括一個計算微分變異單元 (calculate differential variance)303 、一個量化器 (quantizer)304 、模糊子集合（fuzzy subsets)305 、一個模糊控制器306、一個模糊變異數運算器（fuzzy variance operator ) 3 0 7、一個加權解模糊器（weighted deiuzzifier)308、以及一個模糊探查表（fuzzy lookup table ) 3 0 9。此外，整個編碼系統還包括一個Η· 2 6 3 +影像編碼器（video encoder)與一個虛擬緩衝器（virtual buffer) 〇請參考第2圖所示，模糊邏輯控制器3 2 0根據一個變異數Ji332與一個變異數差^(7^34，改善目標區品質。在輸入一個訊框（f r a m e ) 3 0 1之後，如外觀偵測與移動偵測的切割單元3 0 2，會被用來將訊框3 0 1切割成目標區 (R0 1)330與非目標區331。在非目標區331中的大區塊會不經調整任何參數，以位元率控制直接送至一個QP選擇器310°ROI 330的第i個大區塊中的變異數差△σί334，是從aJ32與（7/333計算而得，其中σi332與σi’333分別是目前與前一個第i個大區塊的變異數。變異數差Δσ i334與目前大區塊的變異數口332，是使用模糊邏輯方法的兩個輸入，而且(7135是一個即將當成輸入加權因數的模糊輸出。第3圖與第4圖分別繪示代表0^332與△ σί334的圖形。請參考第3圖與第4圖所示，語言組（linguistic12429TWF.PTD Page 9 200529650 V. Description of the invention (5) It is used to provide a priority-like quality to one of its scaled lookup tables. The 'scaled lookup table' uses a one -Fixed and one- various slave functions formed. To sum up the above description, the present invention provides a fuzzy controlled ROI image coding. The fuzzy-controlled R0I image coding can adaptively adjust the output quality of the image: This method can easily improve the R0I quality and maintain a fixed bit device overflow (buffer 〇verfl0w), which is more familiar than conventional techniques. In addition, at a lower rate, it is easy to provide better products. ^ Codes can be used without complicated calculations, and the Shanghai Girls I will be closed together. Evening $ κυ 1 ~ Image quality. In order to make the above and other aspects of the present invention clearly understandable, the following features and advantages can be explained in more detail as follows: The attached drawings and embodiments will be described below with reference to the accompanying drawings and examples. Oral, ,, and field descriptions of the preferred implementation of the present invention are limited to these embodiments with the 0tb will be described here; may 2 "::: 眚 The present invention is not subject to the details. In the following, Both of them are familiar with the scope and components of the present invention. The corresponding reference numbers represent the same elements. First, they are composed of fuzzy control, including (1) the target area It t-shirt image coding can be controlled by two parts and (2) fuzzy control. Please refer to Figure 12429TWF.1 Page 10 200529650 V. Description of Invention (6) shows that a target area includes a segmentation unit 302. A fuzzy logic controller 3 2 0 includes a computational differential mutation unit ( calculate differential variance 303, a quantizer 304, fuzzy subsets 305, a fuzzy controller 306, a fuzzy variance operator 3 0 7, a weighted defuzzifier ( weighted deiuzzifier) 308, and a fuzzy lookup table 3 0 9. In addition, the entire encoding system also includes a Η · 2 6 3 + video encoder (video encoder) and a virtual buffer (refer to Figure 2), the fuzzy logic controller 3 2 0 improves the quality of the target area based on the difference between a variation number Ji332 and a variation number ^ (7 ^ 34). After a frame 3 0 1, the cutting unit 3 0 2 for appearance detection and motion detection will be used to cut the frame 3 0 1 into a target area (R0 1) 330 and a non-target area 331. The large block in the non-target area 331 will be directly sent to a QP selector 310 ° ROI 330 with a bit rate control without adjusting any parameters. The variation number difference Δσί334 in the i-th large block of 310 330 Calculated from aJ32 and (7/333, where σi332 and σi'333 are the variation numbers of the current and previous i-th large block respectively. The difference between the variation number Δσ i334 and the current large block variation number 332 is Use two inputs of the fuzzy logic method, and (7135 is a fuzzy output that will be used as the input weighting factor. Figures 3 and 4 show graphs representing 0 ^ 332 and △ σί334 respectively. Please refer to Figures 3 and As shown in Figure 4, the language group (linguistic

12429TWF.PTD 第11頁 200529650 五、發明說明（7) sets)的符號，LN351 與401 、LN352 與402 、LN353 與403 、 LN354與404、以及LN355與405，分別為π大正（Large Positive)"、丨丨小正（Smal 1 Positive”、丨，零（Zero)11、小負（Small Negative)” 、以及”大負(Large Negative)"。除了所有的σι332都為正值，以及在統計上大部分每一大區塊的變異數σί334都在ΖΕ 3〇3的中心之外’第3圖的符號與第4圖的符號完全相同。第4圖繪示以Δ^=( CTi- (7/)/ (Ji定義的變異數差次的子集合。請參考第4圖所示，在統計上大部分的△ σ〖3 3 4都是 3人在[一 1〇 ’ +1〇]的區間中。接下來’量化器304將0^ 么^\ΤΔ 34輸入模糊子集合30 5，並且將其程度轉換 i 糊剌…351 f SN 3 5 2、ZE 3 5 3、LP 3 54、以及SP 3 5 5 的 .3 34，模，控制器3 0 6接下來藉由量化~ 3 3 2與△ σ 方法，其語言從屬功能’並且使用中央面積區（C0 A ) △ σ .對丄1二模糊狀態。在完成計算之後，每一個σ i/ 所示的—籍、一對應主控制輸入值。決策表是以第5圖模糊器3 π 一 ^u ν、坷仔隹汜m體甲。加櫂厍態；二根Λ模糊探查表309，考慮㈧/“的兩種狀 W加權因數ω ai3 3 5，以加重RCH 3 3 0大區塊。^、;月探查表3 〇 9形式儲存在記憶體中。加權解的品質先權，可雜士一實施例中’為使不同㈧1 330具有不同優出模糊表。曰笛原始輸出模糊，縮放（s c a 1 e ) —組不同的輪個R0I優先權，是用來運用與分辨不同R0I 3 3 0的每一、個0ne-flxed與one-various從屬功能範12429TWF.PTD Page 11 200529650 V. Description of the invention (7) The symbols of sets), LN351 and 401, LN352 and 402, LN353 and 403, LN354 and 404, and LN355 and 405, respectively, are "Large Positive" " , 丨丨 Small Positive (Smal 1 Positive), 丨, Zero (Zero) 11, Small Negative (Small Negative), and "Large Negative" ". Except all σι332 are positive values, and in statistics The variation number σί334 of each large block in most of the above is outside the center of ZE3 03. The symbol in Figure 3 is exactly the same as that in Figure 4. Figure 4 shows that Δ ^ = (CTi- ( 7 /) / (Ji-defined sub-set of the number of variants. Please refer to Figure 4. As shown in Figure 4, most of the statistics △ σ 〖3 3 4 are 3 people in [一１０ '+ 1〇] Next, the 'quantizer 304 inputs 0 ^ Mod ^ \ ΤΔ 34 into the fuzzy sub-set 30 5 and converts its degree to i ... 351 f SN 3 5 2, ZE 3 5 3, LP 3 54, And SP 3 5 5 .3 34, module, controller 3 0 6 Next by quantification ~ 3 3 2 and △ σ method, its language subordinate function 'and use the central area (C0 A ) △ σ. Two fuzzy states for 丄 1. After the calculation is completed, each of σ i /-shown in Figure 1 corresponds to the main control input value. The decision table is based on the fuzzer in Figure 5 3 π a ^ u ν,坷仔隹汜 m 体甲. Add 棹厍 state; two Λ fuzzy probe table 309, consider ㈧ / "two kinds of W weighting factors ω ai3 3 5 to increase the RCH 3 3 0 large block. ^ ,; The monthly lookup table 3 is stored in the memory. The quality of the weighted solution is prioritized, but in one embodiment, 'in order to make different ㈧1 330 have different excellent fuzzy tables. That is, the original output of the flute is blurred and scaled (sca 1 e) — different sets of R0I priorities are used to distinguish and distinguish each and every 0ne-flxed and one-various subordinate functions of different R0I 3 3 0

200529650 五、發明說明（8) 例。加權因數是使用模糊規則在H. 2 6 3 +影像編碼器31 1 中，針對給定的每一大區塊計算而得。在本發明之一實施例的實驗結果中，可以驗証本發明實施例具有較其他既有習知法則為佳之性能。該實驗測試Carphone 、Claire 、以及Foreman三種序歹1J 。為定義在一訊框中的R0I ，臉部偵測被用來自動選擇R0I。在測試序列中比較四種不同方法。該四種不同方法為：不用 R0I編碼訊框（WR)、乘上一個加權因數（WA) α編碼R0I、以三個因素（TF)編碼R0I、以及本發明（模糊）。這四種方法都設成相似的平均位元率。對目標位元率為6 4每秒千位元的I -訊框與Ρ-訊框而言，QP設定成5與3，而對目標位元率為32每秒千位元的I -訊框與Ρ-訊框而言，QP則設定成15與13。在WA中，加權因數設定成450。在TF中，三個因素分別設定為4 5 0、2、以及1 0。為以類似加權比較另兩種方法，ΖΕ13設定為450，而且LP卜LN25設定為 350〜550 ° 如第7圖到第1 0圖所示，相較於其他方法，在類似位元率之下，本發明實施例具有較佳的ROI PSNR。因為WA 與TF都是以固定參數改善R0I品質，所以當每一大區塊複雜度大量變化時，這兩種方法無法調整其加權因數。綜合上述說明，本發明實施例可獲得較佳的R 0 I品質，並且即使是以較低位元率工作時，遺漏訊框（s k i ρ p i n g frame)的現象也會較少發生。本發明可適用於任何影像處理工作，特別是用於即200529650 V. Description of Invention (8) Example. The weighting factor is calculated in H. 2 6 3 + image encoder 31 1 using fuzzy rules for each given large block. In the experimental results of one embodiment of the present invention, it can be verified that the embodiment of the present invention has better performance than other conventionally known rules. This experiment tests Carphone, Claire, and Foreman. To define R0I in a frame, face detection is used to automatically select R0I. Compare four different methods in the test sequence. The four different methods are: encoding the frame (WR) without R0I, encoding R0I by multiplying by a weighting factor (WA), encoding R0I with three factors (TF), and the present invention (fuzzy). All four methods are set to similar average bit rates. For I-frames and P-frames with a target bit rate of 64 kbits per second, QP is set to 5 and 3, while for target I-frames with an I-signal of 32 kbits per second. For the frame and P-frame, the QP is set to 15 and 13. In WA, the weighting factor is set to 450. In TF, the three factors are set to 450, 2, and 10 respectively. In order to compare the other two methods with similar weighting, ZE13 is set to 450, and LP and LN25 are set to 350 ~ 550 °. As shown in Figure 7 to Figure 10, compared with other methods, at a similar bit rate The embodiments of the present invention have better ROI PSNR. Because both WA and TF improve the quality of ROI with fixed parameters, when the complexity of each large block changes a lot, these two methods cannot adjust their weighting factors. In summary, the embodiment of the present invention can obtain better R 0 I quality, and even when working at a lower bit rate, the phenomenon of missing frame (ski i ρ p i n g frame) will rarely occur. The invention can be applied to any image processing work, especially for immediate use.

12429TWF.PTD 第13頁 200529650 五、發明說明（9) 時影像編碼。因此，本發明可輕易改善R01品質，並且保持位元率，以避免緩衝器過溢。相較於習知技藝而言，本發明可以以較少位元率，輕易改善該晝面品質。此外，多重R 0 I影像編碼亦可大量改善每一個R 0 I品質，而不需複雜運算。雖然本發明已以較佳實施例揭露如上，然其並非用以限定本發明，任何熟習此技藝者，在不脫離本發明之精神和範圍内，當可作各種之更動與潤飾，因此本發明之保護範圍當視後附之申請專利範圍所界定者為準。12429TWF.PTD Page 13 200529650 V. Description of the invention (9) Image coding. Therefore, the present invention can easily improve the quality of R01 and maintain the bit rate to avoid buffer overflow. Compared with the conventional techniques, the present invention can easily improve the quality of the daylight surface with a lower bit rate. In addition, multiple R 0 I image coding can also greatly improve the quality of each R 0 I without the need for complicated operations. Although the present invention has been disclosed as above with preferred embodiments, it is not intended to limit the present invention. Any person skilled in the art can make various modifications and retouches without departing from the spirit and scope of the present invention. Therefore, the present invention The scope of protection shall be determined by the scope of the attached patent application.

12429TWF.PTD 第14頁 200529650 圖式簡單說明第1 a圖是一個習知的回饋控制法則的方塊示意圖。第1 b圖是一個習知的模糊邏輯控制法則的方塊示意圖。第2圖是一個根據本發明一實施例，由模糊邏輯控制法則執行目標區影像編碼的方塊示意圖。第3圖是第2圖中所示的模糊邏輯控制裝置中的變異數i的子集合範例。第4圖是第2圖中所示的模糊邏輯控制裝置中的變異數變動△ i的子集合範例。第5圖是第2圖中所示的模糊邏輯控制裝置中的模糊輸出探查表範例。第6圖是一個one-fixed與one-various從屬功能範例。第7圖是針對64每秒千位元的100個訊框的Carphone 序列的各種不同方法比較表。第8圖是針對3 2每秒千位元的1 5 0個訊框的C 1 a i r e序列的各種不同方法比較表。第9圖是針對64每秒千位元的150個訊框的Foreman序列的各種不同方法比較表。第1 0圖是針對6 4每秒千位元的1 5 0個訊框的N e w s序列的多重目標區比較表。圖式標記說明： 1 0 0 :回饋控制系統 1 0 1 :設定點12429TWF.PTD Page 14 200529650 Brief description of the diagram Figure 1a is a block diagram of a conventional feedback control rule. Figure 1b is a block diagram of a conventional fuzzy logic control law. Fig. 2 is a block diagram of image coding of a target area performed by a fuzzy logic control rule according to an embodiment of the present invention. Fig. 3 is an example of a subset of the variation number i in the fuzzy logic control device shown in Fig. 2. Fig. 4 is an example of a subset of the variation Δi in the fuzzy logic control device shown in Fig. 2. Fig. 5 is an example of a fuzzy output lookup table in the fuzzy logic control device shown in Fig. 2. Figure 6 is an example of one-fixed and one-various slave functions. Figure 7 is a comparison table of various methods for a Carphone sequence of 100 frames of 64 kbits per second. Figure 8 is a comparison table of various methods for the C 1 a i r e sequence of 150 frames of 32 kilobits per second. Figure 9 is a comparison table of the various methods of the Foreman sequence of 150 frames of 64 kbits per second. Figure 10 is a multi-target region comparison table for a NeW s sequence of 150 frames of 64 kilobits per second. Graphical label description: 1 0 0: feedback control system 1 0 1: set point

12429TWF.PTD 第15頁 200529650 圖式簡單說明 102 控制器 103 處理 104 系統數學模型 105 感測器 150 模糊邏輯控制系統 15 1 ri-rL δ又定點 152 量化器 153 模糊邏輯控制器 154 解模糊器 155 根據人性的規則組 156 處理 157 感測器 30 1 訊框 m 入 302 切割單元 303 計算微分差異單元 304 量化器 305 模糊子集合 306 模糊控制器 307 模糊變異數運算器 308 加權解模糊器 309 模糊探查表 310 加權QP 選擇器 31 1 Η. 2 6 3 + 影像編碼器 312 虛擬緩衝器12429TWF.PTD Page 15 200529650 Simple description of the diagram 102 controller 103 processing 104 system mathematical model 105 sensor 150 fuzzy logic control system 15 1 ri-rL δ and fixed point 152 quantizer 153 fuzzy logic controller 154 defuzzifier 155 Group of rules according to human nature 156 processing 157 sensor 30 1 frame m input 302 cutting unit 303 calculation differential difference unit 304 quantizer 305 fuzzy subset 306 fuzzy controller 307 fuzzy variation number operator 308 weighted defuzzifier 309 fuzzy exploration Table 310 Weighted QP selector 31 1 Η. 2 6 3 + image encoder 312 virtual buffer

12429TWF.PTD 第16頁 20052965012429TWF.PTD Page 16 200529650

12429TWF.PTD 第17頁12429TWF.PTD Page 17

Claims

200529650 6. Scope of patent application 1. An image coding method suitable for video calls and video conferences, including: separating a plurality of target areas of an image from a plurality of non-target areas; and an input from the target areas, Send to a fuzzy logic control, where the fuzzy logic control is used to improve the quality of one of the target areas and to improve the overall quality of an output image. 2. The image coding method described in item 1 of the scope of patent application, wherein the input from the target areas is calculated from a first control input and a second control input from the target areas. 3. The image coding method as described in item 2 of the scope of patent application, wherein the first control input and the second control input respectively include a first variation number and a variation number difference from a current i-th large block, The variation number difference is obtained by subtracting a first variation number from a second variation number of a previous i_l large block, and then dividing the first variation number by the first variation number. The I large block represents a sequence of the large blocks within one of the target areas, and the i-1 large block is a previous large block of the i th large block. 4. The image coding method described in item 1 of the patent application scope, wherein the fuzzy logic control includes a rule for converting the input from the target areas into a plurality of fuzzy decisions. 5. The image coding method as described in item 1 of the scope of patent application, wherein the fuzzy logic control includes a control function to calculate a language subordinate function for determining a fuzzy state.

12429TWF.PTD Page 18 200529650 6. Patent application scope 6. The image coding method described in item 5 of the patent application scope, wherein the control function includes a central area (C 0 A) method to determine the language subordinate function 〇7. The image coding method described in item 1 of the patent application scope, wherein the fuzzy logic control includes a plurality of probe tables for setting a decision level and generating a weighting factor to aggravate one of the target areas The quality of the image coding method described in item 7 of the scope of the patent application, wherein the look-up tables include a plurality of zoom look-up tables to provide a similar priority quality to one of the target areas. 9. The image coding method described in item 8 of the scope of patent application, wherein the zoom lookup tables are formed using a one-fixed and one-various subordinate function. 10. The image coding described in item 1 of the scope of patent application In the method, the fuzzy logic control further includes: · converting an input from the giant 1 marks into a plurality of fuzzy judgments;--used to determine-• each of the fuzzy judgment control functions 1 in the fuzzy state. Language subordinate function, and from this fuzzy state used to set-decision level and generate a weighting factor, a plurality of probe tables are generated to aggravate the quality of one of the target areas. 0 1 1 The image coding method according to item 10, wherein the input from the target areas is from the target areas

12429TWF.PTD Page 19 200529650 6. Scope of patent application Calculated by the first control input and a second control input. 12. The image coding method as described in item 11 of the scope of patent application, wherein the first control input and the second control input respectively include a difference between a first variation number and a variation number from an i-th block The difference between the i-th large block and the i-th large block is obtained by subtracting a second one from the first i-1 block and dividing it by the first one. The i-1 large block represents a sequence of the large blocks within one of the target areas, and the i-1 large block is a previous large block of the i-th large block. 13. The image coding method described in item 10 of the scope of patent application, wherein the control function uses a central area (CO A) method to determine the language dependent function. 14. The image coding method as described in item 10 of the scope of the patent application, wherein the lookup tables include a plurality of zoom lookup tables to provide a similar priority quality to one of the target areas. 15. The image coding method described in item 14 of the scope of the patent application, wherein the zoom lookup tables are formed using one-fixed and one-various dependent functions. 16. An image encoding device suitable for video telephone and video conference, including: an encoder having an input end and an output end, wherein the input end of the encoder is electrically coupled to an input frame; a cutting The device has an input terminal, a first output terminal and a second output terminal, wherein the input terminal of the cutting device is electrically coupled to the output terminal.

12429TWF.PTD Page 20 200529650 VI. Patent application scope; and-fuzzy logic control device with--input terminal and an output terminal, wherein the input terminal of the fuzzy logic control device is electrically coupled to the cutting device The first output terminal of the encoder, and the output terminal of the fuzzy logic control device is electrically coupled to the input terminal of the encoder. 1 7. According to the scope of patent application: the video editing described in item 16; horse device, wherein the fuzzy logic control device further includes a quantizer, which has an input terminal and an output terminal, wherein the quantizer's The input terminal is electrically coupled to the first output terminal of the cutting device, and the quantizer converts a signal 'from the first output terminal of the cutting device into a fuzzy decision;-the--controller has--the input terminal And an output terminal, wherein the input terminal of the first controller is electrically connected to the output terminal of the quantizer, the second controller converts the fuzzy decision into a fuzzy state, and the second controller, Having an input terminal and a round output terminal: wherein, the input terminal and the output terminal of the first,-'controller are electrically coupled to the output terminal of the first controller and the input terminal of the encoder, respectively; The second controller converts the logic state into an output of the fuzzy logic control device. 1 8. The image coding device described in item 17 of the scope of patent application, further includes a differential device, having an input terminal and an output terminal, wherein the input terminal and the output terminal of the differential device are electrically coupled to The first output terminal of the cutting device and the input terminal of the quantizer.

12429TWF.PTD Page 21 200529650 VI. Patent application scope 19. The image coding device described in item 18 of the patent application scope, wherein the input end of the encoder is electrically coupled to the first output end of the cutting device . 2 0. The image coding device described in item 19 of the scope of patent application, further comprising a buffer having an input end and an output end, wherein the input end and the output end of the buffer are electrically coupled respectively. To the output terminal of the encoder and the first output terminal of the cutting device.

12429TWF.PTD Page 22