TW490634B - Method for performing two-dimensional discrete cosine transform and its inverse involving a reduced number of multiplication operations - Google Patents
Method for performing two-dimensional discrete cosine transform and its inverse involving a reduced number of multiplication operations Download PDFInfo
- Publication number
- TW490634B TW490634B TW87115327A TW87115327A TW490634B TW 490634 B TW490634 B TW 490634B TW 87115327 A TW87115327 A TW 87115327A TW 87115327 A TW87115327 A TW 87115327A TW 490634 B TW490634 B TW 490634B
- Authority
- TW
- Taiwan
- Prior art keywords
- data
- dct
- unit
- dimensional
- level
- Prior art date
Links
Landscapes
- Complex Calculations (AREA)
- Image Processing (AREA)
Abstract
Description
經濟部中央標準局負工消費合作社印製 490634 A7 _ B7 五、發明説明(、) ^^~^ 本發明係有關於一種執行離散餘弦轉換(DCT)及其逆轉 的方法,更特別地,係有關於一種可減少乘法運算之執行 二維DCT/IDCT的方法。 本案申請人的美國專利第mi,412號一案揭露—種 5 使用六級運算DCT/IDCT迅速演算法來處理一 8x8資料塊 之連續輸入資料的離散餘弦轉換(DCT)和逆轉離散餘弦轉 換(工DCT)方法及裝置。該DCT/IDCT迅速演算法的該六级 運算通常係包含交換插入的蝴蝶型運算級與乘法運算、級。 該等乘法運算級包括内部乘法運算、相加後相乘運算、及 10 相乘後相減運算。在前述的美國專利中,一單一蝴蝶型運 算單元執行該等蝴蝶型運算級,而一單一乘法運算單元執 行該等乘法運算級。該蝴蝶型運算單元和該乘法運算單元 係以循環及平行處理形式運作,因此DCT和工DCT能夠在 相當低的硬體成本下有效率地達成。 15 第一和三圖分別描繪在前述之美國專利案中所使用之六 級運算DCT和IDCT迅速演算法的流向圖。該DCT迅速演 算法使用三種算術運算:蝴蝶型、内部乘法、及相加後相 乘,如在第二A至二C圖所顯示般。該IDCT迅速演算法 亦使用三種算術運算:蝴蝶型、内部乘法、及相乘後相減 20 ,如在第二A、二B和二D圖所顯示般。 請參閱第一圖所示,該DCT迅速演算法的六級運算包 括一個包含四個蝴蝶型運算的第/级運算、一個包含雨個 相加後相乘運算的第二級運算、/個包含四個蝴蝶蜇運算 的第三級運算、一個包含三個相加後相乘運算的第四级運 第4頁 本紙張尺度適用中國國家標準(CNS ) A4規格(210 X 297公釐) C请先聞讀背N%之注意事項存填寫本貢) .tr 490634 A7 B7 五、發明説明(2 ) , 算、一個包含四個蝴蝶型運算的第五級運算、及一個包含 八個内部乘法運算的第六級運算。 請參閱第三圖所示,該工DCT迅速演算法的六級運算包 括一個包含八個内部乘法運算的第一級運算、一個包含四 5 個蝴蝶型運算的第二級運算、一個包含三個相乘後相減運 算的第三級運算、一個包含四個蝴蝶型運算的第四級運算 、一個包含兩個相乘後相減運算的第五級運算、及一個包 含四個蝴蝶型運算的第六級運算。 通常,DCT/IDCT的乘法運算係相當耗時且需要相當複 10 雜的硬體。雖然前述之美屬專利案使用一個對於一維轉換 僅包含十三個乘法運算,或者對於一 8x8資料塊之二維轉 換僅包含總數208個(2x8x13)乘法運算的迅速演算法, 然而,依然係進一步希望減少乘法運算的數目俾可實現更 高的處理速度。 15 因此,本發明之目的是為提供一種可減少乘法運算之執 行二維DCT/IDCT的方法。 經濟部中央標準局員工消費合作社印製 (請先閲讀背面之注意事項再填寫本頁) 根據本發明的一特徵,一種包含連續之第一和第二輪一 維離散餘弦轉換(DCT)運算的二維DCT方法係被提供。每 個第一和第二輪一維DCT運算使用一六級運算DCT迅速演 2 0 算法來處理一 8x8資料塊的連續輸入資料俾可產生連續的 轉換資料。該DCT迅速演算法包括包含數個蝴蝶型運算的 第一、第三和第五級,包含數個相加後相乘運算的第二和 第四級 '、及包含數個内部乘法運算的第六級。該二維DCT 方法包含如下之步驟: 第5頁 本紙張尺度適用中國國家標準(CNS ) A4規格(210X297公釐) 490634 經濟部中央標準局員工消費合作社印製 A7 B7 五、發明説明(3 ) (a) 提供一輸入單元接收輸入資料; (b) 控制該輸入單元來提供該輸入資料到一蝴蝶型運算 單元俾可致能該蝴蝶型運算單元執行該第一輪一維DCT運 算之DCT迅速演算法的第一級運算; 5 (c)控制一資料暫存器單元儲存來自該蝴蝶型運算單元 的第一級輸出資料; (d) 控制該資料暫存器單元提供預定的一個第一級輸出 資料到一乘法運算單元俾可在該預定的一個第一級輸出資 料業已儲存於該資料暫存器單元内時致能該乘法運算單元 10 執行該DCT迅速演算法的第二級運算; (e) 控制該資料暫存器單元儲存來自該乘法運算單元的 第二級輸出資料; (f) 控制該資料暫存器單元以一預定順序提供該第一級 和第二級輸出資料到該蝴蝶型運算單元俾可在該蝴蝶型運 15 算單元已完成執行該DCT迅速演算法的第一級運算之後致 能該蝴蝶型運算單元執行該DCT迅速演算法的第三級運算 (g) 控制該資料暫存器單元儲存來自該蝴蝶型運算單元 的第三級輸出資料; 2 0 (h)控制該資料暫存器單元提供預定的一個第三級輸出 資料到該乘法運算單元俾可在該預定的一個第三級輸出資 料業已儲存於該資料暫存器單元時致能該乘法運算單元執 行該DCT迅速演算法的第四級運算; (i)控制該資料暫存器單元儲存來自該乘法運算單元的 第6頁 本紙張尺度適用中國國家標準(CNS ) A4規格(210X 297公釐) (請先閱讀背面之注意事項再填寫本頁) 490634Printed by the Central Bureau of Standards, Ministry of Economic Affairs and Consumer Cooperatives 490634 A7 _ B7 V. Description of the Invention (,) ^^ ~ ^ The present invention relates to a method for performing discrete cosine transform (DCT) and its inversion. More specifically, the system It relates to a method for performing two-dimensional DCT / IDCT which can reduce multiplication. The applicant's U.S. Patent No. mi, 412 discloses-a type 5 discrete cosine transform (DCT) and inverse discrete cosine transform (DCT) of continuous input data of an 8x8 data block using a six-stage arithmetic DCT / IDCT rapid algorithm Industrial DCT) method and device. The six-level operation of the DCT / IDCT rapid algorithm usually includes butterfly-type operation stages, multiplication operations, and stages that are interchangeably inserted. These multiplication stages include internal multiplication, multiplication after addition, and subtraction after multiplication by 10. In the aforementioned U.S. patent, a single butterfly operation unit performs the butterfly operation stages, and a single multiplication operation unit performs the multiplication stages. The butterfly operation unit and the multiplication operation unit operate in a cyclic and parallel processing form, so DCT and industrial DCT can be efficiently achieved at a relatively low hardware cost. 15 The first and third figures respectively depict the flow diagrams of the six-stage operation DCT and IDCT rapid algorithms used in the aforementioned US patent case. The DCT rapid algorithm uses three arithmetic operations: butterfly, internal multiplication, and multiplication after addition, as shown in the second A to C diagrams. The IDCT rapid algorithm also uses three arithmetic operations: butterfly, internal multiplication, and subtraction 20 after multiplication, as shown in the second A, two B, and two D diagrams. Please refer to the first figure. The six-level operation of the DCT rapid algorithm includes a first-level operation including four butterfly-type operations, a second-level operation including multiplication operations after rain, and / The third-level operation of the four butterfly pupa operations, and the fourth-level operation including three multiplication operations after addition. Page 4 This paper size applies to the Chinese National Standard (CNS) A4 specification (210 X 297 mm). C Please First read and read the N% notes and fill in this tribute.) Tr 490634 A7 B7 V. Description of the invention (2), arithmetic, a fifth-level operation including four butterfly-type operations, and one including eight internal multiplication operations Sixth-level operation. Please refer to the third figure, the six-level operation of the DCT rapid algorithm includes a first-level operation including eight internal multiplication operations, a second-level operation including four or five butterfly operations, and one including three A third-level operation including multiplication and subtraction, a fourth-level operation including four butterfly operations, a fifth-level operation including two multiplication-subtraction operations, and a four- butterfly operation Sixth level operation. In general, the DCT / IDCT multiplication system is time consuming and requires quite complex hardware. Although the aforementioned US patent case uses a rapid algorithm that only contains thirteen multiplication operations for one-dimensional transformation, or only 208 (2x8x13) multiplication operations for two-dimensional transformation of an 8x8 data block, it is still a further step It is desirable to reduce the number of multiplication operations and achieve higher processing speed. 15 Therefore, an object of the present invention is to provide a method for performing two-dimensional DCT / IDCT which can reduce multiplication operations. Printed by the Consumer Standards Cooperative of the Central Bureau of Standards of the Ministry of Economic Affairs (please read the notes on the back before filling this page). According to a feature of the present invention, a continuous one-dimensional discrete cosine transform (DCT) operation including first and second rounds A two-dimensional DCT method is provided. Each of the first and second rounds of one-dimensional DCT operations uses a six-stage operation DCT to quickly perform a 20 algorithm to process continuous input data of an 8x8 data block, which can generate continuous conversion data. The DCT rapid algorithm includes the first, third, and fifth stages including several butterfly operations, the second and fourth stages including several multiplication operations after addition, and the first stage including several internal multiplication operations. Six levels. The two-dimensional DCT method includes the following steps: Page 5 This paper size applies the Chinese National Standard (CNS) A4 specification (210X297 mm) 490634 Printed by the Consumers Cooperative of the Central Standards Bureau of the Ministry of Economic Affairs A7 B7 V. Invention Description (3) (a) Provide an input unit to receive input data; (b) Control the input unit to provide the input data to a butterfly-type computing unit; enable the butterfly-type computing unit to perform the DCT of the first round of one-dimensional DCT operations quickly First-level operation of the algorithm; 5 (c) Control a data register unit to store the first-level output data from the butterfly-type arithmetic unit; (d) Control the data register unit to provide a predetermined first-level unit Output data to a multiplication unit. The multiplication unit 10 can be enabled to perform the second-stage operation of the DCT rapid algorithm when the predetermined first-stage output data has been stored in the data register unit; ( e) controlling the data register unit to store second-level output data from the multiplication unit; (f) controlling the data register unit to provide the first stage and The second stage outputs data to the butterfly-type arithmetic unit. After the butterfly-type arithmetic unit has completed the first-stage operation of the DCT rapid algorithm, the butterfly-type arithmetic unit can be enabled to execute the first stage of the DCT rapid algorithm. Three-level operation (g) Control the data register unit to store the third-level output data from the butterfly-type arithmetic unit; 2 0 (h) Control the data register unit to provide a predetermined third-level output data to the The multiplication operation unit may enable the multiplication operation unit to perform the fourth-level operation of the DCT rapid algorithm when the predetermined third-level output data has been stored in the data register unit; (i) control the data temporarily The register unit stores page 6 from the multiplication unit. The paper size is applicable to the Chinese National Standard (CNS) A4 specification (210X 297 mm) (Please read the precautions on the back before filling this page) 490634
、發明説明(江) 5 經濟部中央標準局員工消費合作社印製 第四級輸出資料; (>j)控制該資料暫存器單元以一預定的順序提供該第三 、級t第四級輸出資料到該蝴蝶型運算單元俾可在該蝴蝶型 運,單70已完成執行該DCT迅速演算法的第三級運算之後 致能該蝴蝶型運算單元執行該DCT迅速演算法的第五級運 算; 户(k)控制該資料暫存器單元儲存來自該蝴蝶型運算單元 的第五級輸出資料,該第五級輸出資料作用如倍率轉換一 維轉換資料; (l) 控制該資料暫存器單元以另一個方向提供該倍率轉 換一維轉換資料到該蝴蝶型運算單元俾可致能該蝴蝶型運 异單元執行該第二輪一維DCT運算之DCT迅速演算法的第 一級運算; (m) ·重覆步驟(C)到(j)來執行該第二輪一維DCT運算 之DCT迅速演算法的第二至第五級運算; (η)控制該資料暫存器單元儲存來自該蝴蝶型運算單元 的第五級輸出資料,該第五級輸出資料作用如倍率轉換二 維轉換資.料; (〇)控制該資料暫存器單元提供該倍率轉換二維轉換資 料到該乘法運算單元俾可致能該乘法運算單元根據一組儲 存於該乘法運异早元之係數R〇jy[内的倍率轉換加權係數執 行該第二輪一維DCT運算之DCT迅速演算法的第六級運算 ,藉此獲得對應於該輸入資料的二維轉換資料,該等倍率 轉換加權係數是為該第一與第二輪/維DCT運算之DCT迅 第7頁 本紙張尺度適用中國國家標準(CNS )八4規格(210X 297公釐) (請先閱讀背面之注意事項再填寫本頁) 、11 經濟部中央標準局員工消費合作社印製 490634 A7 ^__B7 _________ 五、發明説明(S ^ 速演算法之第六級加權係數的乘積;及 (р) 控制一輸出單元從該乘法運算單元接收該二維轉換 資料。 根據本發明的另一特徵,一種包含連績之弟一和第一輪 5 —維逆轉難散餘弦轉換(工DCT)運算的二維IDCT方法係被 提供。每個第一和第二輪一維工DCT運算使用一六級運算 工DCT迅速肩异法來處理一 8x8資料塊的連續輸入資料俾 可產生連續的轉換資料。該IDCT迅速演算法包括包含數個 内部乘法運算的第一級運算、包含數個蝴蝶型運算的第二 10 、第四和第六級運算、及包含數個相乘後相減運算的第三 和第五級運算。該二維IDCT方法包含如下之步驟: (a) 提供一輸入單元接收輸入資料; (b) 控制該輸入單元提供該輸入資料到一乘法.運算單元 俾可致能該乘法運算單元根據一組儲存於該乘法運算單元 15 之係數R〇M内的倍率轉換加權係數執行該第一輪一維 工DCT運算之IDCT迅速演算法的第〆級運算,該等倍率轉 換加權係數是為該第一與第二輪一維IDCT運算之IDCT迅 速演算法之第一級加權係數的乘積; (с) 控制一資料暫存器單元儲存來自該乘法運算單元的 20 倍率轉換第一級輸出資料; (d)控制該資料暫存器單元提供該倍率轉換第一級輸出 資料到一蝴蝶型運算單元俾可致能該蝴蝶型運算單元執行 該第一輪一維IDCT運算之IDCT迅速演算法的第二級運算 第8頁 本紙張尺度適用中國國家標準(CNS ) Μ規格(210X297公釐) (請先閱讀背面之注意事項再填寫本頁) 訂 490634 A7 B7 五、發明説明 5 10 15 經於部中央«.冬而lhc-τ.消於合作私卬製 20 (e) 控制該資料暫存器單元儲存來自該蝴蝶型運算單元 的第二級輸出資料, (f) 控制該資料暫存器單元提供預定的一個第二級輸出 資料到該乘法運算單元俾可在該預定的一個第二級輸出資 料業已儲存於該資料暫存器單元時致能該乘法運算單元執 行該IDCT迅速演算法的第三級運算; (g) 控制該資料暫存器單元儲存來自該乘法運算單元的 第三級輸出資料; (h) 控制該資料暫存器單元以一預定順序提供該第二級 與第三級輸出資料到該蝴蝶型運算單元俾可在該蝴蝶型運 算單元已完成執行該IDCT迅速演算法的第二級運算之後致 能該蝴蝶型運算單元執行該工;DCT迅速演算法的第四級運算 j (i) 控制該資料暫存器單元儲存來自該蝴蝶型運算單元 的第四級輸出資料; (j) 控制該資料暫存器單元提供預定的_個第四級輸出 二貝料到該乘法運算單元俾可在該預定的一個第四級輸出資 料業已儲存於該資料暫存器單元内時致能該乘法運算單元 執行該IDCT迅速演算法的第五級運算; 斤 0〇控制該資料暫存器單元儲存來自該 ^運异早 盘第料暫存11科p料料提供該第四$ 算單心成 蝴蝶1 運以 疋战執仃該IDCT迅速凟异法的第四級運算之後】 元的 本紙張尺度適用 中國國家標準 第9買 (2K)X 297公釐) (請先閱讀背面之注意事項再填寫本頁)5. Description of the Invention (Jiang) 5 The level 4 output data is printed by the Consumer Cooperatives of the Central Standards Bureau of the Ministry of Economic Affairs; (> j) Control the data register unit to provide the level 3 and level 4 in a predetermined order. Output data to the butterfly-type arithmetic unit. After the butterfly-type operation, single 70 has completed executing the third-level operation of the DCT rapid algorithm, enable the butterfly-type arithmetic unit to perform the fifth-level operation of the DCT rapid algorithm. ; User (k) controls the data register unit to store fifth-level output data from the butterfly-type arithmetic unit, and the fifth-level output data functions as one-dimensional conversion data of rate conversion; (l) controls the data register The unit provides the magnification conversion one-dimensional conversion data to the butterfly-type operation unit in another direction, and can enable the butterfly-type operation unit to perform the first-level operation of the DCT rapid algorithm of the second round of one-dimensional DCT operation; ( m) · Repeat steps (C) to (j) to perform the second to fifth stage operations of the DCT rapid algorithm of the second round of one-dimensional DCT operation; (η) control the data register unit to store data from the Butterfly type arithmetic unit The fifth-level output data, the fifth-level output data functions as two-dimensional conversion data. (0) Controls the data register unit to provide the two-dimensional conversion data to the multiplication unit. The multiplication operation unit can perform the sixth-stage operation of the DCT rapid algorithm of the second round of one-dimensional DCT operation based on a set of multiplication factor weighting coefficients stored in the multiplication early difference element Rojy [ Obtain the two-dimensional conversion data corresponding to the input data. The ratio conversion weighting coefficients are DCT for the first and second round / dimensional DCT operations. Page 7 This paper applies Chinese National Standard (CNS) 8-4 specifications. (210X 297mm) (Please read the notes on the back before filling out this page), 11 Printed by the Employees' Cooperative of the Central Standards Bureau of the Ministry of Economic Affairs 490634 A7 ^ __ B7 _________ V. Description of the invention (S ^ Speed algorithm 6th level A product of the weighting coefficients; and (р) controlling an output unit to receive the two-dimensional conversion data from the multiplication unit. According to another feature of the present invention, a first-round and first-round 5-dimensional inverse are included. A two-dimensional IDCT method for transforming difficult-to-scatter cosine transform (ICD) operations is provided. Each of the first and second rounds of one-dimensional DCT operations uses a six-stage operator DCT to quickly process the 8x8 data block Continuous input data can generate continuous conversion data. The IDCT rapid algorithm includes a first-level operation including several internal multiplication operations, a second 10, fourth and sixth-level operations including several butterfly operations, and The third and fifth stage operations of a number of subtraction operations after multiplication. The two-dimensional IDCT method includes the following steps: (a) providing an input unit to receive input data; (b) controlling the input unit to provide the input data to A multiplication. Operation unit: enables the multiplication operation unit to perform the IDCT rapid algorithm of the first round of one-dimensional DCT operation according to a set of multiplication factor weighting coefficients stored in the coefficients ROM of the multiplication operation unit 15 The first-stage operation, the ratio conversion weighting coefficients are the product of the first-stage weighting coefficients of the IDCT rapid algorithm of the first and second rounds of one-dimensional IDCT operations; (с) controlling the storage of a data register unit The first-stage output data is converted from the multiplying unit at a rate of 20; (d) the data register unit is controlled to provide the first-stage output data to the rate-converting unit to a butterfly-type arithmetic unit; This first round of one-dimensional IDCT calculation is based on the second-level operation of the rapid IDCT algorithm. Page 8 This paper size is applicable to China National Standard (CNS) M specifications (210X297 mm) (Please read the precautions on the back before filling this page ) Order 490634 A7 B7 V. Description of the invention 5 10 15 The Ministry of Finance «.Winter lhc-τ. Eliminated by the cooperative private system 20 (e) Control the data register unit to store the data from the butterfly-type arithmetic unit. Second-level output data, (f) controlling the data register unit to provide a predetermined second-level output data to the multiplication unit, so that the predetermined second-level output data has been stored in the data register unit Enable the multiplication unit to perform the third-level operation of the IDCT rapid algorithm; (g) control the data register unit to store the third-level output data from the multiplication unit; (h) control the data The material register unit provides the second-level and third-level output data to the butterfly-type arithmetic unit in a predetermined order, and can be enabled after the butterfly-type arithmetic unit has completed executing the second-stage operation of the IDCT rapid algorithm. The butterfly-type arithmetic unit performs the job; the fourth-level operation of the DCT rapid algorithm j (i) controls the data register unit to store the fourth-level output data from the butterfly-type arithmetic unit; (j) controls the data temporarily The register unit provides a predetermined _ fourth level output to the multiplication unit. The multiplication unit can be enabled when the predetermined fourth level output data has been stored in the data register unit. The fifth-level operation of the IDCT's rapid algorithm; 00〇 controls the data register unit to store the 11 items of p material from the ^ Yunyi early plate temporarily to provide the fourth $ to calculate the single heart into a butterfly 1 After the fourth stage of the IDCT's rapid surprise method, the paper size of Yuan applies to the Chinese National Standard No. 9 Buy (2K) X 297 mm) (Please read the precautions on the back before filling this page)
4^U634 A7 B7 5 五、發明説明 此該蝴蝶型運算單元執行該IDCT迅速演算法的第六級運算 , (m)控制該資料暫存器單元俾可儲存由該蝴蝶型運算單 元於步驟(1)產生的倍率轉換一維轉換資料; (η)控制該資料暫存器單元以另一個·方向提供該倍率轉 換一維轉換資料到該蝴蝶型運算單元俾可致能該蝴蝶型運 算單元執行該第二輪一維工DCT運算之工DCT迅速演算法的 第二級運算; (〇)重覆步驟(㊀)到(1)來執行該第二輪一維IDCT運 算之IDCT迅速演算法的第三至第六級運算;及 (ρ)在步驟(〇)之後,控制一輸出單元從該蝴蝶型運算 單元接收該第六級輸出資料作為對應於該輸入資料的二維 轉換資料。 本發明之其他特徵和優點在下面配合附圖之較佳實施例 的詳細描述下將會變得更明白,其中: 第一圖係在習知技術中於一 DCT裝置之一維轉換運算 中所使用之DCT迅速演算法的流向圖; 第二Α圖描繪在第一圖之流向圖中的蝴蝶型運算; 第二B圖描繪在第一圖之流向圖中的内部乘法運算; 第二C圖描繪在第一圖之流向圖中的相加後相乘運算 第二D圖描繪在一1DCT迅速演算法中所使用的相乘後 相減運算; 第三圖係在習知技術中於〆IDCT裝置之一維轉換運算 第10頁 關家標率(CNS)A4^(21〇X 297^t / (請先閱讀背面之注意事項再填寫本頁) 訂 490634 五 5 10 A7 B7 、發明説明(¾ ) 中所使用之工DCT迅速演算法的流向圖; 第四圖係本發明之DCT方法之較佳實施例的流程圖; 第五圖係本發明之工DCT方法之較佳實施例的流程圖; 第六A圖描繪一組在本發明較佳實施例之DCT方法中 所使用的倍率轉換加權係數; 第六B圖描繪一組在本發明較佳實施例之IDCT方法中 所使用的倍率轉換加權係數; 第七圖係用於執行本發明較佳實施例之dct/idct方 法之DCT/IDCT裝置之例子的示意電路方塊圖; 第八圖係描繪第七圖之裝置在執行二維DCT時之運算 的時序圖; 第九圖係描綠第七圖之裝置在執行二維工DCT時夕、富a 的時序圖; (諳先閱讀背面之注意事項再填寫本頁} 15 第十圖係用於本發明較佳實施例之dct/idct DCT/IDCT裝置之另一例子的示意電路方塊圖;及 第十一圖係描緣第十圖之裝置之運作的時序圖。 元件標號對照表 方法之 訂 2 0 1 3 21 31 33 35 輸入單元 乘法運算單元 多工器 輸入選擇多工器 乘法器電路 輸出選擇多工器 2 4 22 32 34 WP1寫入埠 蝴蝶型運算單元 資料暫存器 蝴蝶型電路 加法/減法電路 係數ROM Dln輪入資料 WP2寫入埠 泉 第11頁 本纸張尺度適中國國家標準(CNS ) 乂 Μ規格(210X297公釐) 490634 A7 B7 五、發明説明 5 10 RP1讀取埠 RP2讀取埠 101 DCT/IDCT處理單元 102 DCT/IDCT 處理單一 103控制單元 1011 輸入單元 1012 蝴蝶型運算單元 10121 多工器 10122 蝴蝶型電路 1013 乘法運算單元 10131 輸入選擇多工器 10132 加法/減法電敗 10133 乘法器電路 10134 係數ROM 10135 輸出選擇多工器 1014 資料暫存器單# 1021 蝴蝶型運算單元 10211 多工器 10212 蝴蝶型電路 1022 乘法運算單元 10221 輸入選擇多工器 10222 加法/減法電路 10223 乘法器電路 10224 係數ROM 10225 輸出選擇多工器 1023 資料暫存器單元 1024 輸出單元 f靖先閑讀背面之注意事项再植'寫本頁j 15 經济部中夾«:^而0,J.消贽合作拉印繁 20 請參閱第四圖所示,本發明之二維DCT方法的較佳實 施例係被顯示包含連續的第一和第二輪一維DCT運算。該 第一輪一維DCT運算包括在第一圖中所顯示之DCT迅速演 算法的首五級運算,而且係被執行來從一輸入原始塊資料 得到一倍率轉換一維DCT塊資料。該第二輪一維DCT運算 包括在第一圖中所顯示之DCT迅速演算法的該六級運算, 而且係被執行來從該倍率轉換〆雉DGT塊資料得到該二維 DCT塊資料。, 在每個第-和第二輪-維DCT運算之第二和第四级運 … a2、、 异之相加後相乘運算中所使用的你數刀別為 al 第12頁 本紙張尺度適州中圈國家標準(CNS ) A4規格(210'〆297公釐) 490634 A7 B7 5 五、發明就明(乂 b2、和b3,如在第一圖中所顯示般。然而,因為該DCT 迅速演算法的第六級運算在本發明之DCT方法的第一輪一 雉DCT運算中被省略’一組倍率轉換加權係數係被使用於' 該第二輪一維DCT運算之DCT迅速演算法之第六級運算的 内部乘法運算。第六A圖顯示該第二輪一維DCT運算使用 真係於對應於一資料塊之8x8矩陣之處理中使用的倍率轉 換加權係數。如圖所示,在該第二輪一維DCT運算 之DCT迅速演算法之第六級運算之内部乘法運算中使用的 原始加權係數係由一個每行(或者每列)不同的倍率轉換器 進行倍率轉換。如果= 〇至7)是為在第一圖中所 顯示之DCT迅速演算法之内部乘法運算中所使用的原始加 權係數的話,該第一至第八行(或列)的倍率轉換器(ci)分 別為 cO、cl、c2、C3、c4、C5、C6 和 C7。即,第 i 行( 或列)之内部乘法運算的原始加權係數(cj )係被倍率轉換 到成為ci , j,其係相等於ci與cj的乘積’其中,i和j ==0到7,而每行(或列)的第j個一維轉換資料係如在第 一圖中所顯示的F (j )。 請參閱第五圖所示’本發明之二維工DCT方法的較佳實 施例係被顯示包含連續的第一和第二輪一維工DCT運算。該 第一輪一維工DCT運算包括在第三圖中所顯示之工DCT迅速 演算法的該六級運算,而且係被執行來從一輸入原始塊資 料獲得一倍率轉換一維工DCT塊資料。該第二輪一維IDCT 運算包括在第三圖中所顯示之IDCT迅速演算法的後五級運 算,而且係被執行來從該倍率轉換一維IDCT塊資料獲得該 第13頁 本紙张尺度適州中國國家標苹(CNS ) A4規格(210X 297公釐) 衣— (請先閱讀背面之注意事項再填寫本頁)4 ^ U634 A7 B7 5 V. INTRODUCTION OF THE INVENTION The butterfly-type arithmetic unit performs the sixth-level operation of the IDCT rapid algorithm. (M) Controls the data register unit. The butterfly-type arithmetic unit can be stored in steps ( 1) Generated one-dimensional conversion data of magnification conversion; (η) Control the data register unit to provide the magnification-converted one-dimensional conversion data to the butterfly-type arithmetic unit in another direction, enabling the butterfly-type arithmetic unit to execute The second-level operation of the second round of one-dimensional DCT calculations. The second-level operation of the fast DCT algorithm; (0) Repeat steps (i) to (1) to perform the second round of one-dimensional IDCT calculations. Third to sixth stage operations; and (ρ) after step (0), controlling an output unit to receive the sixth stage output data from the butterfly-type arithmetic unit as two-dimensional conversion data corresponding to the input data. Other features and advantages of the present invention will become clearer in the following detailed description of the preferred embodiments with reference to the accompanying drawings, in which: The first diagram is used in the conventional technique for one-dimensional transformation operation of a DCT device. Flow chart of the DCT rapid algorithm used; Figure A shows the butterfly operation in the flow chart of the first picture; Figure B shows the internal multiplication operation in the flow chart of the first picture; Figure C Draw the multiplication operation after the addition in the flow diagram of the first picture. The second D diagram describes the multiplication and subtraction operation used in a 1DCT rapid algorithm. The third picture is in the conventional technique of IDCT. One-dimensional transformation operation of the device. Page 10 Guan Family Standard Rate (CNS) A4 ^ (21〇X 297 ^ t / (Please read the precautions on the back before filling out this page) Order 490634 5 5 10 A7 B7 、 Description of invention (¾) The flow chart of the rapid DCT algorithm used in the process; the fourth diagram is a flowchart of the preferred embodiment of the DCT method of the present invention; the fifth diagram is the flowchart of the preferred embodiment of the DCT method of the present invention; FIG. 6A depicts a group used in the DCT method of the preferred embodiment of the present invention. Figure 6B depicts a set of ratio conversion weighting coefficients used in the IDCT method of the preferred embodiment of the invention; Figure 7 is used to implement the dct / idct method of the preferred embodiment of the invention Figure 8 is a schematic circuit block diagram of an example of a DCT / IDCT device; Figure 8 is a timing diagram depicting the operation of the device in Figure 7 when performing a two-dimensional DCT; Figure 9 is a device in which the device in Figure 7 is green Timing diagram of DCT time and rich a; (谙 Read the precautions on the back before filling out this page} 15 The tenth diagram is another example of a DCT / IDCT DCT / IDCT device used in the preferred embodiment of the present invention The schematic circuit block diagram; and the eleventh diagram is a timing diagram of the operation of the device drawing the tenth diagram. The component number comparison table method order 2 0 1 3 21 31 33 35 input unit multiplication unit multiplexer input selection Multiplexer circuit output selection multiplexer 2 4 22 32 34 WP1 write to the port butterfly type arithmetic unit data register butterfly type circuit addition / subtraction circuit coefficient ROM Dln rotation data WP2 write to port spring page 11 paper Zhang scales suitable for China Standard (CNS) 乂 specifications (210X297 mm) 490634 A7 B7 V. Invention description 5 10 RP1 read port RP2 read port 101 DCT / IDCT processing unit 102 DCT / IDCT processing single 103 control unit 1011 input unit 1012 butterfly type Operation unit 10121 Multiplexer 10122 Butterfly circuit 1013 Multiplication unit 10131 Input selection multiplexer 10132 Addition / subtraction electric defeat 10133 Multiplier circuit 10134 Coefficient ROM 10135 Output selection multiplexer 1014 Data register single # 1021 Butterfly operation Unit 10211 Multiplexer 10212 Butterfly circuit 1022 Multiplication unit 10221 Input selection multiplexer 10222 Addition / subtraction circuit 10223 Multiplier circuit 10224 Coefficient ROM 10225 Output selection multiplexer 1023 Data register unit 1024 Output unit Read the notes on the back to replant the 'write this page j 15 in the Ministry of Economic Affairs «: ^ , 0, J. Elimination of cooperation and printing of India 20 Please refer to the fourth figure. The two-dimensional DCT method of the present invention is better The embodiment is shown to include successive first and second rounds of one-dimensional DCT operations. The first round of one-dimensional DCT operation includes the first five stages of the DCT rapid algorithm shown in the first figure, and is executed to obtain a one-time conversion of one-dimensional DCT block data from an input original block data. The second round of one-dimensional DCT operation includes the six-level operation of the DCT rapid algorithm shown in the first figure, and is executed to obtain the two-dimensional DCT block data from the magnification conversion 〆 雉 DGT block data. In the second and fourth stages of each of the first- and second-dimensional DCT operations ... a2, the number of multipliers used in the multiplication operation after the addition of the difference is al. Page 12 of this paper Shizhou Central Circle National Standard (CNS) A4 specification (210'〆297 mm) 490634 A7 B7 5 5. The invention is clear (乂 b2 and b3, as shown in the first picture. However, because of the DCT The sixth-level operation of the rapid algorithm is omitted in the first round of a DCT operation of the DCT method of the present invention. 'A set of magnification conversion weighting coefficients is used.' The second round of one-dimensional DCT operation of the DCT rapid algorithm The sixth stage of the internal multiplication operation. Figure 6A shows that the second round of one-dimensional DCT operation uses the scaling factor used in the processing corresponding to the 8x8 matrix of a data block. As shown in the figure, The original weighting factor used in the internal multiplication of the sixth-level operation of the DCT rapid algorithm of the second one-dimensional DCT operation is converted by a different ratio converter for each row (or each column). If = 〇 to 7) is a quick calculation for the DCT shown in the first figure For the original weighting coefficients used in the internal multiplication of the method, the ratio converters (ci) of the first to eighth rows (or columns) are respectively cO, cl, c2, C3, c4, C5, C6, and C7. That is, the original weighting coefficient (cj) of the internal multiplication of the i-th row (or column) is converted to ci, j by the magnification, which is equal to the product of ci and cj ', where i and j == 0 to 7 , And the j-th one-dimensional transformed data of each row (or column) is F (j) as shown in the first figure. Please refer to the fifth figure. A preferred embodiment of the two-dimensional DCT method of the present invention is shown to include successive first and second rounds of one-dimensional DCT operations. The first round of one-dimensional DCT operation includes the six-level operation of the fast DCT algorithm shown in the third figure, and is executed to obtain one-time conversion of one-dimensional DCT block data from an input original block data. . The second round of one-dimensional IDCT operations includes the last five stages of the IDCT rapid algorithm shown in the third figure, and is executed to convert the one-dimensional IDCT block data from the magnification to obtain the appropriate paper size on page 13. State China National Standard Apple (CNS) A4 Specification (210X 297mm) Clothing — (Please read the precautions on the back before filling this page)
、1T 490634 A7 ___________ B7__ 五、發明説明) 二維工DCT塊資料。 在每個第一和第二輪一維工DCT運算之第三和第五級運 鼻之相乘後相減運鼻中所使用的係數分別為qi、q2、q3、 rl和r2,如在第三圖中所顯示般。然而’因為該工DCT迅 5 速演算法的第一級運算在本發明之IDCT方法中的第二輪一 維工DCT運算中被省略,一組倍率轉換加權係數係被使用於 該第一輪一維IDCT運算之IDCT迅速演算法之第一級運算 的内部乘法運算中。第六B圖顯示該第一輪一維工DCT運 异使用且於對應於一資料塊之8x8矩陣之處理中使用的倍 10 率轉換加權係數Pi , j。如圖所示,在該第一輪一維:[DCT 運算之IDCT迅速演算法之第一級運算之内部乘法運算中所 使用的原始加權係數係由一個每行(或每列)不同的倍率轉 換器進行倍率轉換。如果pj(j = ◦至7)是為在第三圖 中所顯示之工DCT迅速演算法之内部乘法運算中所使用的原 15 始加權係數的話,該第一至第八行(或列)的倍率轉換器 (Pi)分另,J 為 p〇、pl 、p2 、p3 、p4 、p5 、pe 和 P7 ° gp , 第i行(或列)之内部乘法運异的原始加權係數p (j )係被 倍率轉換到成為pi,j,其係相等於Pi與pj的乘積,其 中,i和j = 〇到7,而每行(或列)的第j個一維轉換資 20 料係如在第三圖中所顯示的F(j)。 第七圖描繪用於執行本發明較佳實施例之DCT/IDCT 方法的DCT/IDCT裝置。該DCT/IDCT裝置大致上於結構 上係與在美國專利第5,471,4 I2號一案中所描述的相同, 而且係被顯示包含一輸入單元1、一蝴蝶型運算單元2、一 第14頁 本紙張尺度適州巧國家標準(CNs ) /U規格(2丨〇'〆297公釐) 〜 '' ~ (請先閱讀背面之注意事項再填寫本頁)1T 490634 A7 ___________ B7__ V. Description of the invention) 2D DCT block data. In the first and second rounds of one-dimensional DCT calculations, the coefficients used in subtraction are multiplied by the third and fifth stage noses. The coefficients used in noses are qi, q2, q3, rl, and r2, as in As shown in the third picture. However, 'because the first-level operation of the DCT 5 speed calculation algorithm of this industry is omitted in the second round of one-dimensional DCT operation of the IDCT method of the present invention, a set of ratio conversion weighting coefficients is used in this first round. One-dimensional IDCT operation IDCT rapid algorithm in the first stage of the internal multiplication operation. Figure 6B shows the multiplier 10 conversion weighting coefficients Pi, j used in the first round of one-dimensional DCT operations and corresponding to the processing of an 8x8 matrix corresponding to a data block. As shown in the figure, in the first round of one-dimensional: [DCT operation IDCT rapid algorithm of the first stage of the internal multiplication operation of the original weighting coefficient used by a different magnification for each row (or each column) The converter performs rate conversion. The first to eighth rows (or columns) of pj (j = ◦ to 7) are the original 15 weighting coefficients used in the internal multiplication of the fast DCT algorithm shown in the third figure. The multiplier (Pi) is divided into two parts, J is p0, pl, p2, p3, p4, p5, pe, and P7 ° gp, and the original weighting coefficient p (j) of the inner multiplication of row i (or column) is different. ) Is scaled to become pi, j, which is equal to the product of Pi and pj, where i and j = 0 to 7, and the j-th one-dimensional conversion data of each row (or column) is 20 F (j) shown in the third figure. The seventh figure depicts a DCT / IDCT apparatus for performing the DCT / IDCT method of the preferred embodiment of the present invention. The DCT / IDCT device is substantially the same in structure as described in US Patent No. 5,471,4 I2, and is shown to include an input unit 1, a butterfly-type arithmetic unit 2, and a page 14 This paper is suitable for National Standards (CNs) / U specifications (2 丨 〇'〆297mm) ~ '' ~ (Please read the precautions on the back before filling this page)
5 10 15 怒免部中央榀半.^nc-T.消於合作,社卬則·?·: 20 ~---... 發明説明('孓 5去運异單元3、一資料 單元6。 °'4、一輪出單元5和一控制 該輪入單元i為一 示)接收-_資料塊的益鲁、從—外部裝置(圖中未 早元1係可運作俾可祀棱^輪入賁料(Din)。該輸入 ⑽)發送到該㈣型運管Λ的轉換運算來將該輸入資料 3。 开早凡2或者到該乘法運算單元 該蝴螺型運算單元2包括—多工哭 22。該蝴蝶型電路22係產生兩個輸二和一蝴蝶型電路 工器21具有—個被 ^貝料的和與差。該多 連接至該輪人單元丄與㈣;^^6的選擇輸入端和被 。該控制單元6㈣該多巧21 4的資料輸入端 輸入資料⑽η)或者來自„料暫^自該輸人料1的 且將兮、登捏次η 貝了寸货存态早兀4的資料,並 將该選擇資料提供到該蝴蝶型電路22俾致能 路22執行蝴蝶型運算。該” 4鄂尘電路22的輸出係被儲存 该貝枓暫存H單it 4或者被發送_輸出單元5。 該乘法運算料3包括—輪人選擇多H __加法 /法電路32、-乘法器電路33、一係數咖Μ、及一 】哕:擇多工器35。該係數職34包含數個作用為輸入 二^法器電路33之其中-個運算數輸人的加權係數,包 ^六A與六B圖中所顯示的倍率轉換加權係數。該乘 :運异早兀3能夠執行内部乘法運算、相加後相乘運营及 =乘後相減運算。來自該輸入單元工的輸入資料如二或 者來自該資料暫存器單元4的資料被發送到該加法/減法電 第15頁 囤國家標準(CNS ) A4規格(210X297公釐)5 10 15 The central part of the Department of Anger and Freedom. ^ Nc-T. Elimination of cooperation, social rules ...?: 20 ~ ---... Description of the invention ('孓 5 去 运 异 单元 3, 一 资料 料 单元 6 ° '4, one round-out unit 5 and one controlling the round-in unit i are shown) receiving -_ data blocks from Yilu, slave-external devices (not shown in the figure earlier than Yuan 1 series can be operated, can be worshipped ^ wheel The input data (Din) is sent to the conversion operation of the 运 -type transport pipe Λ to input the input data 3. Open Early Fan 2 or go to the multiplication unit. The butterfly type arithmetic unit 2 includes-multiplexing cry 22. The butterfly circuit 22 generates two input two and one butterfly circuit. The worker 21 has a sum and a difference of the materials. The multi-connected to the round unit 丄 and ㈣; ^^ 6's selection input and be. The control unit 6 inputs data from the data input terminal of the Duoqiao 21 4) or data from the material input ^ from the input material 1 and will be Xi, Deng pin times η, the inventory state is early 4, The selection data is provided to the butterfly circuit 22 and the enabling circuit 22 executes the butterfly operation. The output of the "4" dust circuit 22 is stored in the bezel temporary H list it 4 or sent to the output unit 5 . The multiplying material 3 includes a round-selection multi-H__addition / addition circuit 32, a multiplier circuit 33, a coefficient CM, and a] 哕: a multiplexer 35. The coefficient function 34 includes a plurality of weighting coefficients for inputting one of the operands 33 of the multiplexer circuit 33, including the weighting coefficients for the magnification conversion shown in the six A and six B figures. The multiplication: Yunyi early 3 can perform internal multiplication operations, multiplication operations after addition, and subtraction operations after multiplication. Input data from the input unit worker such as two or data from the data register unit 4 is sent to the addition / subtraction page 15 National Standard (CNS) A4 specification (210X297 mm)
490634 A7 B7 部 中 率 消 ii 合 作 卬 t 20 五、發明説明(o) 路32或者到該輸入選擇多工器31,俾可致能該乘法運算 單兀3執行希望的算術運算。該加法/減法電路32和該乘 法器電路33的輪出被發送到該輸出選擇多工器35俾可被 儲存在該資料暫存器單元4。該乘法器電路的輸出亦能 5 夠被直接發送到該輸出單元5。 至於該乘法運算單元;3如何執行内部乘法運算、相加 後相乘運算、或者相乘後相減運算,由於業已在前述的美 國專利案中描述,為了簡潔起見,於此恕不再重覆描述。 該資料暫存器單元4為一個具有兩寫入埠(wpl,wp2) 1〇和兩躓取埠(RP1'RP2)的四埠暫存器。第一組讀取與寫入 ,(RPl'WPl)係被連接至該蝴蝶型運算單元2,而第二組 讀取與寫入埠(RP2,WP2)係被連接至該乘法運算單元3。 該資料暫存器單元4作用來儲存來自該蝴蝶型運算單元2 彳該乘法運异單I 3的資料,並且進—步作用來提供資料 15 到它們那裡。 該輸出早& 5為—多卫器,其端視是DCT抑或是 執行來轉綱蝶型電路22或者縣法器電路 33的輸出。 最後’該控制單元6負責#告丨丨兮么 斜暫在哭留—^ ^ 貝控制該係數ROM 34和該資 枓暫存态早兀4的:|買取/寫入運作, 多工器35。該控制單而且亦負責控制不同的 DCT/取T裝置之餘下組件之運作二更胃責控制該 第八和九圖為描繪該裝置在 Μ?和IDCT方法時之運作的本發明較佳實施例之 ^ 圖。在本發明中,就一 第16頁 本紙張尺㈣财關糾样(cm) A4j^m ( 210^29¾ 490634 A7 B7 五、發明説明(\\V ) 5 10 15 經浐部中夾扰準ΛΒ-t.消处合作社卬% 20 資料塊(N)遭遇二維DCT/IDCT而言,倍率轉換一維轉換 ·-.....------------------ 資料係在第一輪一維DCT/IDCT運算之後被獲得。該倍率 ......-............ . ........... 一.............. .....… 轉換一維轉換資料然後遭遇第二輪一維DCT/IDCT運算俾 可獲得該二維DCT/IDCT資料。在第七圖的DCT/IDCT裝 置中,該第一輪一維DCT/IDCT運算的第一級運算係在接 收該輸入資料塊(N)時馬上被執行。該二維DCT/ IDCT資 料係從該第二輪一維DCT/IDCT運算之第六級運算的輸出 獲得。在二維DCT/IDCT期間,資料塊(N)之第一輪一維 DCT/IDCT運算的結果係被儲存於該資料暫存器單元4内 。由於如果原始資料塊係以行的方式輸入的話,從該第一 輪維DCT/IDCT運算獲得的該轉換資料塊必須轉置俾可 以列的方式處理,資料塊(N)之第一輪一維dct/idct運 算的結果被儲存於該資料暫存器單元4内係必須的。該資 料暫存器單元4的結構允許該蝴蝶型運算單元2和該乘法 運3同日悄取和寫人資料於其内,藉此致能該蝴蝶 ,運算單it 2和録法運算料3達絲雜平行管線式 處理。 下面係第七圖之裝置如何執行本發明較佳實施例之 DCT和工DCT方法的詳細描述。 μ請再次參閱第七和八圖所示,在該第—輪一維dct運 8x8像素塊的六十四個像素資料係依序以行 的方式(或者以列的方式)提供到嗜銓 一 j巧輸入早7L 1。該控制單 兀6控制該輸入單元;]_發送該輪 蕈瞀留-。^ ^别入像素資料(Din)到該蝴 螺i運^r單7〇 2,俾可致能該蝴焊刑曾 刀’坦運异早元2執行包含 第17頁 本紙張尺度剌巾_緖準(CNS ) A4規格 (誚先閱讀背面之注意事項再填寫本頁)490634 A7 B7 Intermediate rate ii Cooperation 五 t 20 V. Description of the invention (o) The circuit 32 or the multiplexer 31 is selected at the input, and the multiplication unit 3 can be enabled to perform the desired arithmetic operation. The rounds of the addition / subtraction circuit 32 and the multiplier circuit 33 are sent to the output selection multiplexer 35, which can be stored in the data register unit 4. The output of the multiplier circuit can also be sent directly to the output unit 5. As for the multiplication unit; 3 how to perform internal multiplication, multiplication after addition, or subtraction after multiplication, as already described in the aforementioned US patent case, for the sake of brevity, it will not be repeated Overlay description. The data register unit 4 is a four-port register with two write ports (wpl, wp2) 10 and two capture ports (RP1'RP2). The first group of read and write (RPl'WPl) is connected to the butterfly type arithmetic unit 2 and the second group of read and write ports (RP2, WP2) is connected to the multiplication unit 3. The data register unit 4 functions to store the data from the butterfly-type arithmetic unit 2 彳 the multiplication operation I3, and further functions to provide data 15 to them. This output is early & 5 is a multi-smart device, whose end view is DCT or is executed to transfer the output of the butterfly circuit 22 or the county circuit 33. Finally, the control unit 6 is responsible for the report. Xixi is temporarily crying — ^ ^ Bay controls the coefficient ROM 34 and the temporary storage state of the asset 4: | Buy / write operation, multiplexer 35 . The control sheet is also responsible for controlling the operation of the remaining components of different DCT / T-removing devices. Second, it is responsible for controlling the eighth and ninth figures. The eighth and ninth figures are the preferred embodiments of the present invention depicting the operation of the device in the M and IDCT methods. The ^ figure. In the present invention, on the 16th page of this paper, the paper size of the paper size (cm) A4j ^ m (210 ^ 29¾ 490634 A7 B7 V. Description of the invention (\\ V) 5 10 15 ΛΒ-t. Consumer Cooperative Cooperative 20% 20 When the data block (N) encounters two-dimensional DCT / IDCT, the magnification conversion is one-dimensional conversion · -.....-------------- ---- The data was obtained after the first round of one-dimensional DCT / IDCT operation. The magnification ......-............ ........ ... One .......................... Converting one-dimensional conversion data and then encountering the second round of one-dimensional DCT / IDCT operation. The two-dimensional DCT / IDCT data can be obtained In the DCT / IDCT device of the seventh figure, the first stage of the first round of one-dimensional DCT / IDCT operation is executed immediately upon receiving the input data block (N). The two-dimensional DCT / IDCT data system Obtained from the output of the sixth stage of the second round of one-dimensional DCT / IDCT operation. During the two-dimensional DCT / IDCT, the result of the first round of one-dimensional DCT / IDCT operation of the data block (N) is stored in the Data register unit 4. Since the original data block is input in a row, the transformation obtained from the first round of dimensional DCT / IDCT operations The material block must be transposed and can be processed in a row. The result of the first round of one-dimensional dct / idct operation of the data block (N) is required to be stored in the data register unit 4. The data register unit The structure of 4 allows the butterfly-type arithmetic unit 2 and the multiplication operation 3 to fetch and write personal data in it on the same day, thereby enabling the butterfly, operation list it 2 and recording operation material 3 to be processed in parallel pipelines. The following is a detailed description of how the device in Figure 7 performs the DCT and industrial DCT methods of the preferred embodiment of the present invention. Μ Please refer to Figures 7 and 8 again. In this first round of one-dimensional dct transport of 8x8 pixel blocks, The sixty-four pixel data are sequentially provided in a row manner (or in a column manner) to the enthusiasm input as early as 7L 1. The control unit 6 controls the input unit;] _ sends the round mushroom retention -. ^ ^ Do not enter the pixel data (Din) to the butterfly i ^ r single 702, 俾 can enable the butterfly torture swordsmanship 'Tan Yun Yi early Yuan 2 implementation including page 17 of this paper size 剌Towel _ (准 (CNS) A4 Specification (诮 Please read the precautions on the back before filling in this page)
、1T m--—-- 490634 A7 B7 五、發明説明(\$ ) 5 10 15 20 每行(或者列)之四個蝴蝶型運算之DCT迅速演算法的第一 級運算。該控制單元6然後控制該資料暫存器單元4,以 致於來自該蝴蝶型運算單元2的第一級輸出資料係經由該 寫入埠(WP1)儲存在該資料暫存器單元4内。當預定的第 一級輸出資料業已儲存於該資料暫存器單元4時,該控制 單元6控制該資料暫存器單元4經由該讀取埠(RP2)將該 等預定的第一級輸出資料供應到該乘法運算單元3,俾可致 月b該乘法運异單元3開始執行包含每行(或者列)之兩個相 加後相乘運算之DCT迅速演算法的第二級運算,而該蝴蝶 型運算單元2繼續執行該DCT迅速演算法的第一級運算。 該控制單元6再次控制該資料暫存器單元4經由該寫入埠 (WP2)把來自該乘法運算單元3的第二級輸出資料儲存在 該資料暫存器單元4内。在該蝴蝶型運算單元2已完成執 行該DCT迅速演算法的第一級運算之後,該控制單元6控 制該資料暫存器單元4以預定的順序透過該讀取埠(Rpi) 將該第一級和第二級輸出資料供應到該蝴蝶型運算單元2, 俾可致能該蝴蝶型運算單元2執行包含每行(或者列)之再 四個蝴蝶型運算之DCT迅速演算法的第三級運算。該控制 單元6再次控制該資料暫存器單元4透過該寫入埠(w^) 把來自該蝴蝶型運算單元2的第三級輸出資料儲存於該資 料暫存器單元4内。當預定的第三級輸出資料業已儲存於 5亥賣料暫存器單元4内時,該控制單元6控制該資料暫存 =單凡4透過該讀取埠(RP2)將該等預定的第三級輪由^ 剩i、應到該乘法運算單元3,俾可致能該乘法運算單元3 第18頁 -裳! (請先閱讀背面之注意事項兩填寫本頁) 、νφ 本紙張尺度適 CNS ) A4規格(210X297公釐) -—__ 490634 A7 B7 五、發明説明Ο 5 10 15 20 開始執行包含每行(或者列)之三個相加後相乘運算之dct 迅速演算法的第四級運算,而該蝴蝶型運算單元2繼於 行該DCT迅速演算法的第三級運算。該控制單元6再^ 制該資料暫存器單元4透過該寫人埠(wp2)把來自該^ 運算單元3的第四級輸出資料儲存在該資料暫存器單_ 内。在該蝴_運算單元2已完魏行該奶迅速演^ 的第二級運异之後,該控制單元6控制該資料暫存器單_ 2透過該讀取埠(RP1)將該第三級和第四級輪出資料^預= =順序供應到該蝴蝶型運算料2,俾可致能該蝴蝶型 早疋2執行包含每行(或者列)之另外四個蝴蝶型運 =二算法的第五級運算。該控制單元6再次控制該 ft暫存$早兀4透過該寫人埠(WP1)把來自該蝴蝶型運 ^单元2的弟五級輸出資料儲存於該資料暫存器單元4 該第五級輸出資料作用為該第—輪—維DCT運算的倍率植 換一維轉換資料。 ^ ^與在前述之美國專利案中所揭露的二維㈣方法不同 的疋,該DCT迅速演算法的第六級運算在該第—輪 = =='略。因此,當該倍率轉換—維轉換資料Ϊ :巧料暫存器單元4内時,該控制單元6控制該資 該俨=凡4透過讀取璋(RP1)依序以另-個方向輸出 率轉換-維轉換資料到該蝴蝶型運算單元2,例如,以 ^的方式(或者行),藉此致能該蝴蝶型運算單元2執行第 一輪一維DCT運算之DCT迅速演算法的笫一級運算。第二 -維DCT運异之DCT迅速演算法的第二至第五級運算然 本 第19頁 (CNS ) Am^ ( 210X 297^ ) (請先閱讀背面之注意事項再填寫本頁) :裴、 1T m ----- 490634 A7 B7 V. Description of the invention (\ $) 5 10 15 20 The first-level operation of the DCT rapid algorithm of four butterfly operations per row (or column). The control unit 6 then controls the data register unit 4 so that the first-stage output data from the butterfly-type arithmetic unit 2 is stored in the data register unit 4 via the write port (WP1). When the predetermined first-level output data has been stored in the data register unit 4, the control unit 6 controls the data register unit 4 to transmit the predetermined first-level output data through the read port (RP2). Supplied to the multiplication unit 3, the multiplication operation unit 3 may start to perform the second-level operation of the DCT rapid algorithm including the multiplication operation of two additions of each row (or column), and the The butterfly-type operation unit 2 continues to perform the first-level operation of the DCT rapid algorithm. The control unit 6 again controls the data register unit 4 to store the second-stage output data from the multiplication unit 3 in the data register unit 4 via the write port (WP2). After the butterfly-type arithmetic unit 2 has completed the first-level operation of the DCT rapid algorithm, the control unit 6 controls the data register unit 4 to pass the first port through the read port (Rpi) in a predetermined order. The second-stage and second-stage output data are supplied to the butterfly-type arithmetic unit 2. The third stage of the DCT rapid algorithm that enables the butterfly-type arithmetic unit 2 to perform four butterfly-type operations per row (or column) is enabled. Operation. The control unit 6 again controls the data register unit 4 to store the third-level output data from the butterfly-type arithmetic unit 2 in the data register unit 4 through the write port (w ^). When the predetermined third-level output data has been stored in the 5Hai material storage register unit 4, the control unit 6 controls the data to be temporarily stored = Shan Fan 4 uses the reading port (RP2) to store the predetermined first-stage data. The third round consists of ^ left i, should go to the multiplication unit 3, you can enable the multiplication unit 3 Page 18-Sang! (Please read the two notes on the back page and fill in this page first), νφ This paper is suitable for CNS) A4 size (210X297mm) -__ 490634 A7 B7 V. Description of the invention 〇 5 10 15 20 Start to include each line (or Column) of the three operations of the DCT rapid algorithm of the multiplication operation after the three additions, and the butterfly-type arithmetic unit 2 continues the third-level operation of the DCT rapid algorithm. The control unit 6 controls the data register unit 4 to store the fourth-level output data from the computing unit 3 in the data register list _ through the writing port (wp2). After the butterfly operation unit 2 has completed the second-level operation of the milk rapid play ^, the control unit 6 controls the data register list_ 2 through the read port (RP1) to the third level And the fourth-stage rotation data ^ pre == sequentially supplied to the butterfly-type computing material 2, which can enable the butterfly-type early 2 to execute the other four butterfly-type operations that include each row (or column). Fifth level operation. The control unit 6 again controls the ft temporary storage $ zaowu4 to store the fifth-level output data from the butterfly-type operation unit 2 in the data register unit 4 the fifth level through the writing port (WP1). The output data is used as the first-round-dimensional DCT operation to multiply the one-dimensional conversion data. ^ ^ Unlike the two-dimensional ㈣ method disclosed in the aforementioned US patent case, the sixth-stage operation of the DCT rapid algorithm is in the first round = == 'omitted. Therefore, when the magnification conversion-dimensional conversion data Ϊ: coincidence register unit 4 is controlled, the control unit 6 controls the data 俨 = Where 4 through reading 璋 (RP1) sequentially outputs the rate in another direction Conversion-dimensional conversion data to the butterfly-type arithmetic unit 2, for example, ^ (or row), thereby enabling the butterfly-type arithmetic unit 2 to perform the first-order operation of the DCT rapid algorithm of the first round of one-dimensional DCT operation. . Second-dimensional DCT operation of the second to fifth-level operations of the DCT rapid algorithm. Page 19 (CNS) Am ^ (210X 297 ^) (Please read the precautions on the back before filling this page): Pei
、1T — 4^0634 A7 B7 ’* ·· ~ ——-——-五、發明説明() 5 10 15 經濟部中^標^-.^5^.消贽合作社印繁 20 ,係以與第—輪—維DCT運算之DCT迅速演算法之第二至 第五級運算相同的形式被執行。其後,該控制單元6 暫存器單元4透過該寫入埠(wpi)把來自該蝴蝶型 連异早7L 2的第五级輸出資料儲存於該資料暫存器單元4 。該第五級輪出資料作用為該第二輪一維DCT 4 轉換二維轉換資料。 4 — ^接著,該控制單元6控制該資料暫存器單元4透過該 Γ取埠(RP2 )供應該倍率轉換二維轉換資料到該乘法運算 單元3,俾可致能該乘法運算單元3根據該組在第六a圖 中示且被儲存於該係數R〇M 34内之加權係數來執行 包含每,(或者行)之八個内部乘法運算之該第二輪一: …運斤之DCT迅速演异法的第六級運算,藉此得到對廣 =該輪入像素資料(Din)的二維轉換資料。最後,該控g 單元6控制該輸出單元5從該乘法運算單元3之乘電 路3:3接收二維轉換資料,俾可提供該二維轉換資料二 部裝置(圖中未示)。 ^請再次參閱第七和九圖所示,在該第一輪一維IDCT運 异期間,一 8XS轉換資料塊的六十四個轉換資料係依序以 列的方式(或者行)來被供應到該輸入單元1。 w 一 ,制該輪入單元i發送該輸入轉換資料(Din)到該乘法運 算單元3,俾可致能該乘法運算單元3執行包含每列(或者 行)之八個内部乘法運算之工DCT迅速演算法的第一級運瞀 在别'述之美國專利案中所揭露的^一維I dct方法·不门的 是,該第一輪一維工DCT運算之IDCT迅速演算法的第 第20頁 本纸A尺度適则’i]g]家標準(⑽)A4規格(21()>< 297公酱) (請先閱讀背面之注意事項再填寫本頁) 一裴- 、11 -1¾ "+^u〇J4 發明説明(A) A7 B7 5 10 15 經淤部中λ榀挲而員.T消贽合作社卬製 20 根據該組在第六B 11巾所顯*且_存於該係數 IDCT 1内的加權係數來被執行,俾可允許該第二輪一維 法運;!算之1DCT迅速演算法之第—級運算的省略。該乘 π早π3於此時輸出倍率轉換第一級輸出資料。 該㈣單s 6控制該:#料暫存科S 該乘法運算單元3的㈣缝⑨〜f 錢於木目 入德,早x的U轉換弟-級輸出資料係透過該寫 士 :2)儲存在該資料暫存器單元4内。在該倍率轉換 級輪出資料被儲存於該資料暫存器單元4㈣,該控 控制該資料暫存器單元4透過該讀取埠(则將 該可得到之倍率轉換第一級輸出資料供應到該蝴蝶型運算 2,俾可致能該蝴蝶型運算單元2執行包含每列(或者 ^丁)之四個蝴蝶型運算之IDCr迅速演算法的第二級運算。 该控制單元6再次控制該資料暫存器單元4透過該寫入埠 (WP1)把來自該蝴蝶型運算單元2的第二級輸出資料儲存 於該資料暫存器單元4内。當預定的第二級輸出資料業已 儲存於該資料暫存器單元4内時,該控制單元6控制該資 料暫存器單元4透過該讀取埠(RP2)將該等預定的第二級 輸出資料供應到該乘法運算單元3,俾可致能該乘法運算單 元;3開始執行包含每列(或者行)之三個相乘後相減運算之 工DCT迅速/貝异法的弟二級運算,而該蝴蝶型運算單元2繼 續執行該:EDCT迅速演算法的第二級運算。該控制單元6 再-人控制該 > 料暫存器單元4透過該寫入埠(Wp2)把來自 該乘法運异單元3的第三級輸出資料儲存於該資料暫存器 單元4内。在該蝴蝶型運算單元2已完成執行該工DCT迅 第21頁 本纸張尺度適州中國國•標準(CNS ) A4規格(21Q X 297公遵;) (請先閱讀背面之注意事項再填寫本頁) 、τ Γ fa+^u〇J4 A7 五 -----— 、發明説明 5 10 15 ¾¾部中央榀芈杓卩Η消货合作妇印製 20 存琴:苐 异之後,該控制單元6控制該資料暫=早7" 4透過料料(则以敢的順序將該第二級 第二級輸出資料供應到該蝴蝶型運算算單无:執行包含每列(或者行)之再二Ξ=之工町迅速演算法的第四級運算。該控制單元加4運,早π 2㈣四級輪出資料儲存於該資料暫存器 =二乂内。當預定的第四級輸出資料業已儲存於該資料暫 ^早兀4㈣,該控制單元6㈣該資料暫存器4透過= ㈣第四級輸出資料供應到該乘 早兀3’俾可致能該乘法運算單元3開始執行包含 =列(或者行)之兩個相乘後相減運算之idct迅速演算法 的弟五級運算,㈣_型運算單元2繼續執行該IDCT 法❹讀運算。魅料元6再故制該資料 存料兀4透過該寫入蜂(wp2)把來自該乘法運算單元 3的第五級輸出資料儲存於該資料暫存器單元4内。在該 ,蝶型運算單it 2己完成執行該idct迅速演算法的第四 、及j算之後’該控制單元6控制該資料暫存器單元4透過 該讀取埠(RP1)以預定的料將該第四級和第五級輸出資 料供應到該蝴蝶型運异單7C 2,俾可致能該蝴蝶型運算單元 2執行包含每列(或者行)之另外四個蝴蝶型運算之工岭 迅速演算法的第六級運算。該控制單元6控制該資料暫存 器單元4儲存該第六級輪出資料,該第六級輸出資料作用 為該第一輪_維IDCT運算的倍率轉換—維轉換資料。 第22頁 本紙張尺度適❿ (讀先閱讀背面之注意事項再填寫本頁) 4 五 、發明説明 A? Β7 5 10 15 中 Jk il 卑 Λ 消 合 作 社 卬 t 20 管、如先刖所述’該第二輪一維IDCT運算之IDCT迅速演 二法的第級運异在本發明較佳實施例的idgt方法中係被 f1此,當該倍率轉換一維轉換資料被儲存於該資料 、子斋單70 4内時,該控制單元6控制該資料暫存器單元 4透過該讀取埠(RP1)依序以另_個方向將該倍率轉換一維 轉換資料輸出到該蝴蝶型運算單元2,例如,以行的方式( 或者列),藉此致能該蝴蝶型運算單元2執行該第二輪一維 工DCT運异之工DCT迅速演算法的第二級運算。該第二輪〆 維工DCT運算之10〇]?迅速演算法的第三至第六級運算然後 係以與該第一輪一維工DCT運算之工DCT迅速演算法之第三 至第六級運算相同的形式被執行。其後,該控制單元6控 制該輸出單元5從該蝴蝶型運算單元2的蝴蝶型電路22接 收第六級輸出資料作為該二維轉換資料,俾可提供該第六 級輸出資料到一外部裝置(圖中未示)。 第十圖描繪另一種用於執行本發明較佳實施例之二維 DCT/IDCT方法妁DCT/IDCT裝置。該DCT/工DCT裝置的 結構係以在美國專利第5,471,412號一案中所述的為基礎 。第十圖的DCT/IDCT裝置能夠執行一 8χ8資料塊之二維 DCT/ IDCT循ί衣與平行管線式處理,而且係被顯示包含兩 個第一與第二輪一維DCT/IDCT處理單元ι〇1,102及一控 制單元1〇3。該第一輪一維DCT/IDCT處理單元1〇1包含 :一輸入單元1011,像一解多工器般;一蝴蝶型運算單元 1〇12 ’其包括多工^§ 1012 1和—蝴蝶剖電路1〇12 2 ; 一乘法運算單元1013,其包括一輪入選擇多工器1〇13工、 第23頁 本纸張尺度適川中國國家標準(CMS ) A4規格(210X 297公釐) (請先閱讀背面之注意事項再填寫本筲) 經浐部屮夾«.if^h.t.消资告作拉卬W未 490634 Λ7 -------—------------ --B7 五、發明説明(2\ ) ~S ~^~ --- 一加法/減法電路10132、一乘法器電路1〇133、一係數 R二則4、和一輸出選擇多工器1〇135;及_資料暫存 ,單兀(A)1014,其負責儲存該第一輪一維dct/工π?運 异之不同級的結果並且亦作用為一用於供應輸入資料至談 5第二輪一維禮/工町處理單元工〇2的轉置記憶體 ,輪一維DCT/工DCT處理單元1〇2包含:一蝴蝶型運算單 兀1〇21,其包括一多工器1〇21;ί和一蝴蝶型電路 ,一乘法運算單兀10¾2,其包括一輸入選擇多工器1〇221 、一加法/減法電路1〇222、一乘法器電路1〇223、一係 10數ROM 1〇224、和一輸出選擇多工器1〇225 ; 一資料暫存 器單元(B) 1023 ;及一輸出單元1〇24,像一多工器般。由 於該DCT迅速演算法的第六級運算係在該第一輪二維Xdct 運算中被省略,且由於該工DCT迅速演算法的第一級運算係 在該第二輪一維工DCT運算中被省略,該等乘法器電路 15 10133,10223係需要個別的係數r〇m 10134,1〇224。特 別地,該係數ROM 1〇 I3 4在DCT處理期間包含 al,a2,bl,b2 和 b3,而在 IDCT處理期間包含 ql,q2,q3,rl,r2,和 Pi, j (i, j = 〇 至 7)。該係數 R〇M 10224在DCT處理期間包含ai,a2,bl/b2,b3和 20 CiAj(i,j = 0到 7),而在工DCT處理期間包含 ql,q2,q3,rl和r2。該控制單元1〇3負責控制該等係數 ROM 10134,10224和該等資料暫存器單元ι〇14,1〇23 的讀取/寫入運作,而且亦係負責控制不同的多工器 10121,10131,10135,10211,10221,1024 ° 該控制單 第24頁 度適W中國0?:標準—(〔阳)乂4規格(21〇:><1^釐) " ' (請先閱讀背面之注意事項再填寫本頁)、 1T — 4 ^ 0634 A7 B7 '* ·· ~ ——————- V. Description of the invention () 5 10 15 In the Ministry of Economic Affairs ^ standard ^-. ^ 5 ^. Consumer Cooperative Co., Ltd. Yin Fan 20, related to The second to fifth stage operations of the DCT rapid algorithm of the first-round-dimensional DCT operation are performed in the same form. Thereafter, the control unit 6 register unit 4 stores the fifth-level output data from the butterfly-type Lianyi 7L 2 in the data register unit 4 through the write port (wpi). The fifth-stage rotation data is used for the second-round one-dimensional DCT 4 conversion and two-dimensional conversion data. 4 — ^ Then, the control unit 6 controls the data register unit 4 to supply the magnification conversion two-dimensional conversion data to the multiplication unit 3 through the Γ access port (RP2), and the multiplication unit 3 can be enabled according to This group is shown in the sixth a diagram and stored in the coefficient ROM 34 with a weighting coefficient to perform the second round of one including eight internal multiplication operations per (or row): ... The sixth-level operation of the different method is quickly performed, so as to obtain the two-dimensional conversion data of the pixel data (Din). Finally, the g-controlling unit 6 controls the output unit 5 to receive the two-dimensional conversion data from the multiplication circuit 3: 3 of the multiplication unit 3, and can provide two devices for the two-dimensional conversion data (not shown). ^ Please refer to Figures 7 and 9 again. During the first round of one-dimensional IDCT operation, sixty-four transformation data of an 8XS transformation data block were sequentially supplied in columns (or rows). Go to the input unit 1. w One, the round-robin unit i sends the input conversion data (Din) to the multiplication unit 3, and can enable the multiplication unit 3 to perform a DCT including eight internal multiplication operations per column (or row). The first-level operation of the rapid algorithm is the one-dimensional I dct method disclosed in the U.S. patent case mentioned above. What's more, the first round of the IDCT rapid algorithm of the first one-dimensional DCT operation is the first 20 pages of paper A-size rule 'i] g] Home standard (⑽) A4 size (21 () > < 297 male sauce) (Please read the precautions on the back before filling out this page) Yi Pei-、 11 -1¾ " + ^ u〇J4 Description of the invention (A) A7 B7 5 10 15 λ 榀 挲 in the sludge department. T elimination cooperative cooperative system 20 According to the group in the sixth B 11 towels * and _ The weighting coefficients stored in the coefficient IDCT 1 are executed, and the second round of one-dimensional method operation is allowed; the omission of the first-stage operation of the 1DCT rapid algorithm is calculated. The multiplication π is earlier than π3 at this time, and the output magnification converts the first-stage output data. The list s 6 controls this: # 料 暂存 科 S The quilting of the multiplication unit 3 ~ f Qian Yumu enters Germany, the U-converter of the early x-level output data is through the writer: 2) storage In the data register unit 4. At the magnification conversion stage, the data is stored in the data register unit 4㈣, and the control controls the data register unit 4 through the reading port (then the available magnification conversion first-stage output data is supplied to The butterfly-type operation 2 can enable the butterfly-type operation unit 2 to perform a second-level operation of the IDCr rapid algorithm including four butterfly-type operations per column (or Ding). The control unit 6 controls the data again. The register unit 4 stores the second-stage output data from the butterfly-type arithmetic unit 2 in the data register unit 4 through the write port (WP1). When the predetermined second-stage output data has been stored in the When in the data register unit 4, the control unit 6 controls the data register unit 4 to supply the predetermined second-level output data to the multiplication unit 3 through the read port (RP2). The multiplication unit can be executed; 3 starts to perform the DCT fast / beijing second-level operation including the three multiplication and subtraction operations of each column (or row), and the butterfly-type arithmetic unit 2 continues to perform the: EDCT rapid algorithm second The control unit 6 controls the > material register unit 4 and stores the third-level output data from the multiplication and transport unit 3 in the data register unit 4 through the write port (Wp2). The butterfly-type computing unit 2 has completed the execution of the DCT operation. Page 21 The paper size is suitable for China • Standard (CNS) A4 (21Q X 297); (Please read the notes on the back first (Fill in this page again), τ Γ fa + ^ u〇J4 A7 five --------, invention description 5 10 15 ¾ ¾ printed by the central government's consumer cooperation cooperation 20 save the piano: after surprise, the The control unit 6 controls the data temporarily = early 7 " 4 through the material (then supply the second-level and second-level output data to the butterfly-type calculation sheet in a dare order) None: execute the data containing each column (or row). The second-level operation of the second-level algorithm = the rapid calculation algorithm of Gongmachi. The control unit is added to the 4th stage, and the data of the fourth-level rotation is stored in the data register = the second level. When the predetermined fourth-level output is output, The data has been stored in the data temporary ^ early Wu 4㈣, the control unit 6㈣ the data register 4 through = ㈣ fourth-level output data If the material is supplied to the multiplication unit 3 '俾, the multiplication unit 3 can start to perform the fifth-level operation of the idct rapid algorithm including two columns (or rows) which are multiplied and subtracted. ㈣_ type The operation unit 2 continues to perform the IDCT reading operation. The charm element 6 then makes the data stock 4 and stores the fifth-level output data from the multiplication operation unit 3 in the data through the write bee (wp2). In the register unit 4. After the butterfly operation unit it 2 has completed the fourth and j calculations of the idct rapid algorithm, the control unit 6 controls the data register unit 4 through the reading port. (RP1) Supply the fourth-level and fifth-level output data to the butterfly-type transport order 7C 2 with predetermined materials, and enable the butterfly-type arithmetic unit 2 to execute the other four including each column (or row). The sixth-level operation of the Gongling rapid algorithm of a butterfly-type operation. The control unit 6 controls the data register unit 4 to store the sixth-stage rotation data, and the sixth-stage output data serves as the first-dimensional IDCT operation of the magnification conversion-dimensional conversion data. Page 22 The paper size is appropriate (read the precautions on the back and then fill out this page) 4 V. Description of the invention A? Β7 5 10 15 Jk il Λ Λ Consumer Cooperatives 卬 t 20 tubes, as described in the previous section ' The second-stage IDCT of the second round of one-dimensional IDCT calculations is performed in the idgt method of the preferred embodiment of the present invention by f1. When the magnification conversion one-dimensional conversion data is stored in the data, the When within Zhaidan 70 4, the control unit 6 controls the data register unit 4 to sequentially output the magnification-converted one-dimensional conversion data to the butterfly-type arithmetic unit 2 through the reading port (RP1) in another direction. For example, in a row manner (or a column), thereby enabling the butterfly-type arithmetic unit 2 to perform the second-stage operation of the second round of the one-dimensional DCT operation different algorithm DCT rapid algorithm. The second round of dimensional maintenance DCT calculations is performed in the third round. The third to sixth stages of the rapid algorithm are then compared with the first round of one-dimensional DCT calculations. Stage operations are performed in the same form. Thereafter, the control unit 6 controls the output unit 5 to receive the sixth-level output data from the butterfly-type circuit 22 of the butterfly-type arithmetic unit 2 as the two-dimensional conversion data, and may provide the sixth-level output data to an external device. (Not shown). The tenth figure depicts another two-dimensional DCT / IDCT method / DCT / IDCT device for implementing a preferred embodiment of the present invention. The structure of the DCT / DCT device is based on the one described in U.S. Patent No. 5,471,412. The DCT / IDCT device of the tenth figure can perform a two-dimensional DCT / IDCT cycle and parallel pipeline processing of an 8x8 data block, and is shown to include two first and second rounds of one-dimensional DCT / IDCT processing units. 〇1,102 and a control unit 103. The first round of one-dimensional DCT / IDCT processing unit 101 includes: an input unit 1011, like a demultiplexer; a butterfly-type computing unit 1012 'which includes multiplexing ^ § 1012 1 and-butterfly profile Circuit 1012 2; a multiplication unit 1013, which includes a round-in selection multiplexer 1013, page 23 This paper is compliant with Sichuan National Standard (CMS) A4 specification (210X 297 mm) (please Please read the notes on the back before filling in this card)) The Ministry of Economics and Economics folder «.if ^ ht Consumption Report is not included 490634 Λ7 ----------------------- ---B7 V. Description of the invention (2 \) ~ S ~ ^ ~ --- An addition / subtraction circuit 10132, a multiplier circuit 1〇133, a coefficient R two then 4, and an output selection multiplexer 1 〇135; and _ temporary storage of data, unit (A) 1014, which is responsible for storing the results of the first round of one-dimensional dct / work π different levels of results and also serves as a supply of input data to discuss 5 The second round of one-dimensional Liyi / Gongmachi processing unit worker 02 transposed memory, the one-dimensional DCT / industrial DCT processing unit 102 includes: a butterfly-type computing unit 1021, which includes a multiplexer 1〇21; ί and a butterfly circuit, The multiplication unit 10¾2 includes an input selection multiplexer 10221, an addition / subtraction circuit 1022, a multiplier circuit 1023, a series of 10-digit ROM 11042, and an output selection multiplexer. Device 1022; a data register unit (B) 1023; and an output unit 1024, like a multiplexer. Because the sixth-level operation of the DCT rapid algorithm is omitted in the first two-dimensional Xdct operation, and because the first-level operation of the rapid DCT algorithm is in the second one-dimensional DCT operation Omitted, the multiplier circuits 15 10133, 10223 need individual coefficients r0m 10134, 10224. In particular, the coefficient ROM 10I3 4 contains al, a2, bl, b2, and b3 during the DCT process, and contains ql, q2, q3, rl, r2, and Pi, j (i, j = 〇 to 7). The coefficient ROM 10224 includes ai, a2, bl / b2, b3, and 20 CiAj (i, j = 0 to 7) during the DCT process, and includes ql, q2, q3, rl, and r2 during the DCT process. The control unit 103 is responsible for controlling the read / write operations of the coefficient ROMs 10134, 10224 and the data register units ι14, 1023, and it is also responsible for controlling different multiplexers 10121, 10131,10135,10211,10221,1024 ° The 24th page of this control sheet is suitable for China 0 ?: Standard — ([阳] 乂 4 Specifications (21〇: > < 1 ^ ali) " '(Please first (Read the notes on the back and fill out this page)
490634 經济部中央樣準而以工消贽合作ii-印繁 Λ7 B7 五、發明説明(# ) 元103更負責控制该第一和第二輪一維DCT/IDCT處理單 元101,1〇2之餘下組件之運作的時序。 請參閱第十和十一圖所示,當該裝置執行二維DCT處 理時,該第一處理單元101執行第一圖之DCT迅速演算法 5 的首五級運算’以獲得被供應到該第二處理單元102的倍 率轉換一維轉換資料。該第二處理單元102然後執行第一 圖之DCT迅速演算法的該六級運算,俾得到由該輸出單元 1024傳輸到一外部裝置(圖中未示)的二維轉換資料。當該 裝置執行二維工DCT處理時,該第一處理單元101執行第 10 三圖之IDCT迅速演算法的所有六級運算,以獲得被供應到 該第二處理單元1〇2的倍率轉換一維轉換資料。該第二處 理單元1〇2然後執行第三圖之IDCT迅速演算法的後五級 運算,俾得到由該輸出單元1〇24傳輸到該外部裝置(圖中 未示)的二維轉換資料。 15 第十圖的裝置具有一個是為第七圖,之裝置之處理速度 兩倍的處理速度,藉此允許較高的輪出位元速率。 因此,業已被顯示的是,藉著在第二輪一維DCT運算 之DCT迅速演算法之内部乘法運算中,即,第六級運算, 使用一組倍率轉換加權係數,該第一輪一維DCT運算之 2 〇 DCT迅速演算法的第六級運算可以被省略。同樣’藉著在 該第一輪一維工DCT運算之IDCT迅速演算法之内部乘法運 算中,即,第一級運算,使用一組倍率轉換加權係數,該 第二輪一維IDCT運算之IDCT迅速演算法的第一級運算可 以被省略。因此,二維DCT/IDC丁處理係利用一個具有該 第25頁 本紙張尺度適;1]中國國家操準() 乂4規格(21〇 x 297公釐) (請先閱讀背面之注意事項再填寫本頁) 、11 ------ A7 _________________ B7 五、發明説明) 迅速演异法之六級運算的一維DCT/IDCT運算,以及另一 個具有該迅速演算法之五級運算的一維dct/jdct運算來 執行。據此,其中一個一維DCT/IDCT運算包含十三個乘 法運算,而另一個一維DCT/IDCT運算僅包含五個乘法運 5算。因此,就一個8x8資料塊而言,在本發明之二維 DCT/IDCT方法中之乘法運算的總數係被減少到144個 (Sxl3 + Sx5),與前述的美國專利案比較起來,其到達 31%的減少,藉此得到處理速度上明顯的提升。 雖然本發明業已配合較佳實施例作描述,要了解的是 10 ,本發明並不受限於所揭露的實施例而且係傾向於涵蓋包 括在本發明之精神與範圍内之各種改變及等效的實施例。 I.! . 0! (請先閱讀背面之注意事項再填寫本頁) 訂 ^.>,:-/-|部.中-3);对 ΛΥΊ" $ '令 'tl.^-w 策 第26頁 :;、纸張尺CNS 210 x 297公釐)490634 The central government of the Ministry of Economic Affairs has cooperated with the government to cooperate with the industry. Ii- 印 繁 Λ7 B7 V. Description of invention (#) Yuan 103 is also responsible for controlling the first and second rounds of one-dimensional DCT / IDCT processing units 101, 102. Timing of operation of the remaining components. Please refer to the tenth and eleventh figures. When the device performs two-dimensional DCT processing, the first processing unit 101 executes the first five stages of the DCT rapid algorithm 5 of the first figure 'to be supplied to the first The magnification conversion of the two processing units 102 converts the one-dimensional conversion data. The second processing unit 102 then executes the six-level operation of the DCT rapid algorithm of the first figure to obtain the two-dimensional conversion data transmitted from the output unit 1024 to an external device (not shown). When the device performs two-dimensional DCT processing, the first processing unit 101 executes all six-level operations of the IDCT rapid algorithm of the tenth and third images to obtain a magnification conversion supplied to the second processing unit 102. Dimension conversion data. The second processing unit 102 then executes the last five stages of the IDCT rapid algorithm of the third figure to obtain the two-dimensional conversion data transmitted by the output unit 1024 to the external device (not shown). 15 The device of the tenth figure has a processing speed that is twice that of the device of the seventh figure, thereby allowing a higher bit rate for rounds. Therefore, it has been shown that by using a set of magnification conversion weighting coefficients in the internal multiplication of the DCT rapid algorithm of the second round of one-dimensional DCT operations, that is, the sixth-level operation, the first round of one-dimensional DCT operation 2 DCT rapid algorithm sixth-level operation can be omitted. Similarly, 'in the internal multiplication of the IDCT rapid algorithm of the first round of one-dimensional DCT operation, that is, the first level operation, using a set of ratio conversion weighting coefficients, the IDCT of the second round of one-dimensional IDCT operation The first-level operation of the rapid algorithm can be omitted. Therefore, the two-dimensional DCT / IDC process uses a paper with the appropriate size for the 25th page; 1] China National Standards () 规格 4 specifications (21 × x297 mm) (Please read the precautions on the back before (Fill in this page), 11 ------ A7 _________________ B7 V. Description of the invention) One-dimensional DCT / IDCT operation that quickly performs six-level operations of different methods, and another one that has five-level operations of the rapid algorithm. Dimension dct / jdct operations to perform. Accordingly, one of the one-dimensional DCT / IDCT operations includes thirteen multiplication operations, and the other one-dimensional DCT / IDCT operation includes only five multiplication operations. Therefore, for an 8x8 data block, the total number of multiplication operations in the two-dimensional DCT / IDCT method of the present invention is reduced to 144 (Sxl3 + Sx5). Compared with the aforementioned US patent case, it reaches 31 % Reduction, which results in a significant increase in processing speed. Although the present invention has been described in conjunction with a preferred embodiment, it is understood that the invention is not limited to the disclosed embodiments and is intended to cover various changes and equivalents which are included in the spirit and scope of the invention The examples. I.!. 0! (Please read the notes on the back before filling this page) Order ^. ≫,:-/-| 部. 中 -3); For ΛΥΊ " $ '令' tl. ^-W Policy Page 26:;, Paper ruler CNS 210 x 297 mm)
Claims (1)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/153,055 US6189021B1 (en) | 1998-09-15 | 1998-09-15 | Method for forming two-dimensional discrete cosine transform and its inverse involving a reduced number of multiplication operations |
JP6901999A JP3852895B2 (en) | 1998-09-15 | 1999-03-15 | Method of performing two-dimensional discrete cosine transform capable of reducing multiplicative operation and its inverse transform |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/153,055 US6189021B1 (en) | 1998-09-15 | 1998-09-15 | Method for forming two-dimensional discrete cosine transform and its inverse involving a reduced number of multiplication operations |
Publications (1)
Publication Number | Publication Date |
---|---|
TW490634B true TW490634B (en) | 2002-06-11 |
Family
ID=22545597
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW87115327A TW490634B (en) | 1998-09-15 | 1998-09-15 | Method for performing two-dimensional discrete cosine transform and its inverse involving a reduced number of multiplication operations |
Country Status (1)
Country | Link |
---|---|
TW (1) | TW490634B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7587093B2 (en) | 2004-07-07 | 2009-09-08 | Mediatek Inc. | Method and apparatus for implementing DCT/IDCT based video/image processing |
-
1998
- 1998-09-15 TW TW87115327A patent/TW490634B/en not_active IP Right Cessation
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7587093B2 (en) | 2004-07-07 | 2009-09-08 | Mediatek Inc. | Method and apparatus for implementing DCT/IDCT based video/image processing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Turnbull et al. | An introduction to the theory of canonical matrices | |
Mathai et al. | Bilinear forms and zonal polynomials | |
Billey | Kostant polynomials and the cohomology ring for G/B | |
JP2866754B2 (en) | Arithmetic processing unit | |
Grafakos et al. | On multilinear Fourier multipliers of limited smoothness | |
TW594502B (en) | Length-scalable fast Fourier transformation digital signal processing architecture | |
JPH03211604A (en) | Digital signal processor | |
TWI295787B (en) | A method and system for discrete cosine transforms / inverse discrete cosine transforms based on pipeline architecture | |
US6189021B1 (en) | Method for forming two-dimensional discrete cosine transform and its inverse involving a reduced number of multiplication operations | |
Lessard et al. | Rigorous continuation of bifurcation points in the diblock copolymer equation | |
Bolten et al. | Multigrid methods for tensor structured Markov chains with low rank approximation | |
Rodrı et al. | An algorithm for the Cartan–Dieudonné theorem on generalized scalar product spaces | |
TW490634B (en) | Method for performing two-dimensional discrete cosine transform and its inverse involving a reduced number of multiplication operations | |
Corless | Gröbner bases and matrix eigenproblems | |
Conley et al. | Shadows of rationals and irrationals: supersymmetric continued fractions and the super modular group | |
Abłamowicz | Computations with Clifford and Grassmann algebras | |
Kim et al. | Characterization of inert actions on periodic points. Part I | |
CN115859003A (en) | Method, device and equipment for executing FFT | |
TW410302B (en) | Pipelined fast fourier transform processor | |
Raihen | Convergence Rates for Hestenes' Gram-Schmidt Conjugate Direction Method without Derivatives in Numerical Optimization | |
JPH02273867A (en) | Matrix arithmetic unit | |
Jing et al. | Generating functions for symmetric and shifted symmetric functions | |
JP4248027B2 (en) | Device for converting a sequence of data elements | |
JP3088472B2 (en) | Fourier transform device | |
Brussel | The division algebras and Brauer group of a strictly Henselian field |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
GD4A | Issue of patent certificate for granted invention patent | ||
MM4A | Annulment or lapse of patent due to non-payment of fees |