WO2012142777A1 - 一种视频编码8x8整数变换方法 - Google Patents
一种视频编码8x8整数变换方法 Download PDFInfo
- Publication number
- WO2012142777A1 WO2012142777A1 PCT/CN2011/073586 CN2011073586W WO2012142777A1 WO 2012142777 A1 WO2012142777 A1 WO 2012142777A1 CN 2011073586 W CN2011073586 W CN 2011073586W WO 2012142777 A1 WO2012142777 A1 WO 2012142777A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- transform
- integer
- matrix
- integer transform
- transformation
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/12—Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
- H04N19/122—Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
Definitions
- the present invention relates to the field of video coding technologies, and in particular, to a video coding 8x8 integer transform method.
- a complete video coding system consists of a series of algorithm modules such as prediction, transform, quantization, and entropy coding.
- intra-frame interframe prediction is compression data using spatial and temporal correlation of video data.
- the transform module transforms the image residual data outputted by the prediction module from the time domain to another space, so that the data in the time domain is uniformly concentrated in the transform space, and the energy of the data is mostly concentrated in the low frequency region of the space. .
- the transformed data distribution features are effectively utilized to further compress the data.
- the transform module is an important part of the video coding system, and the performance of the transform directly affects the performance of the video coding system.
- the DCT Discrete Cosine Transform
- the energy concentration of the transform is very good, which can greatly eliminate the correlation of video data in the transform space.
- this algorithm is insufficient.
- the DCT transformation matrix is represented by floating point. A large number of floating-point multiplications and additions are used in the calculation process, which takes up a lot of hardware resources and is computationally complex.
- the floating-point operation has a truncation error, and the floating-point calculation accuracy is not high.
- the DCT forward-reverse transform cannot be completely matched, and the transform is irreversible. Summary of the invention
- the purpose of the embodiment of the present invention is to provide a video coding 8x8 integer transform method, which aims to solve the problem that the prior art DCT transform matrix adopts floating point multiplication and addition, which occupies many hardware resources, is computationally complex, and has low calculation precision, and the DCT is positive.
- the inverse transformation cannot be completely matched, and the transformation is irreversible.
- a video coding 8x8 integer transform method includes:
- the 8x8 general integer transformation matrix is: kO kO kO kO kO kO kO kO kO kO kO kO kO kO
- kO, k5, k6 take values 1, 2, 1, kl k3 4 respectively [ is an integer transform base; positive transform, integer transform of 8x8 image residual data , the transformation formula is yo 3 ⁇ 4P ; wherein P is the transformation matrix; is the image residual data;
- the integer transform base is one of the following values: [15, 15, 10, 3], [6, 7, 5, 1].
- step of "selecting an integer transform base and constructing the transform matrix P" may further include the following steps:
- step of "positive transformation” performs integer transformation on 4x4 image residual data
- transformation formula may further include steps:
- ⁇ is the point multiplication operation of the matrix
- Y is the data after 8x8 positive transformation
- PF is the scaling matrix, which is the output value after the scaling process, representing the information of the video data transformed into the frequency domain
- the scaling matrix is:
- the transform method of the embodiment of the present invention has excellent energy concentration performance, and the energy concentration is close to 8x8DCT, which can greatly eliminate the transformation of video data. Spatial correlation. Low computational complexity, can be added And shift to achieve, easy to implement hardware, and high precision of operation, there is no problem of positive and negative transform mismatch.
- FIG. 1 is a flowchart of a method of a preferred embodiment of a video coding 8x8 integer transform method of the present invention
- FIG. 2 is a diagram of an integer transform base selection of a preferred embodiment of the video coding 8x8 integer transform method of the present invention [15, 15, 10, 3] a schematic diagram of the corresponding integer transformation calculation method;
- FIG. 3 is a calculation method diagram of the calculation module M1 in the schematic diagram of the integer transformation calculation method of FIG. 2;
- FIG. 4 is a selection of integer transformation bases for the preferred embodiment of the video coding 8x8 integer transformation method of the present invention [6, 7, 5, 1] a schematic diagram of the corresponding integer transformation calculation method;
- FIG. 5 is a diagram showing a calculation method of the calculation module M2 in the schematic diagram of the integer transformation calculation method of FIG. 4.
- FIG. 1 is a video coding 8x8 integer transform method according to Embodiment 1 of the present invention, where the method includes the following steps:
- S101 Receive 8 ⁇ 8 image residual data output by the prediction module in the encoder.
- S102 selecting an integer transform base, constructing a transform matrix P;
- the 8x8 general integer transformation matrix is:
- kO, k5, and k6 take values 1, 2, and 1, respectively.
- the definition k2 k3 is constructed as a transformation matrix P for an integer transform base.
- the integer transform base may select one of the following values:
- r [l/
- m . is the row vector of the transformation matrix P, i takes a value of 1-8;
- ⁇ is the vector norm of the row vector;
- ® is the point multiplication of the matrix
- Y is the 8x8 positive transformed data
- PF is the scaling matrix
- output value after the scaling process is the information of the video data transformed into the frequency domain.
- the transform method of the embodiment of the invention has excellent energy concentration performance, and the energy concentration is close to 8x8DCT, which can greatly eliminate the correlation of video data in the transform space; the computational complexity is low, and can be realized by addition and shifting, and is easy to implement in hardware. Moreover, the operation precision is high, and there is no problem that the forward and reverse transforms do not match.
- Embodiment 2
- Embodiment 2 of the present invention is an integer transform base of an 8x8 general integer transform matrix, and the method includes the following steps:
- the general integer transformation matrix of 8x8 is: k0 kO kO kO kO kO kO kO kO kO kO kO
- k0, k5, and k6 take values 1, 2, and 1, respectively.
- the definition k2 k3 is constructed as a transformation matrix P for an integer transform base.
- [15, 15, 10, 3] is selected as an integer transform base, and the transform matrix Pc is constructed.
- the transform matrix P is:
- Change matrix P construct: scaling matrix PF ;
- PF Z z ⁇ l ⁇ m2 ⁇ l/
- j [yl y2 y3 y4 y5 y6 yl yS] T ⁇ xl xl x3 x4 x5 x6 xl x8 represents the input eight-point value of the one-dimensional forward transform of the integer transform; ⁇ 2 3 4 5 ⁇ is the eight-point output value of the forward transform ;
- xl x2 x3 x4 x5 x6 xl x8 represents the input eight-point value of the one-dimensional forward transform of the integer transform
- yl y3 y4 y5 y6 yl y8 is the eight-point output value of the positive transform
- the data processing direction is from left to right, intersecting
- the value of the intermediate variable of a dot is equal to the data represented by the line on the left side of the dot; the square means the data is multiplied by a coefficient;
- the "-” means taking a negative value;
- the "2” means multiplying by 2, that is, one bit to the left;
- "4" represents multiplying by 4, that is, shifting two digits to the left;
- “8” means multiplying by 8, that is, shifting 3 digits to the left, and
- "16" means multiplying by 16, that is, shifting 4 digits to the left.
- ® is the point multiplication of the matrix
- Y is the 8x8 positive transformed data
- PF is the scaling matrix
- output value after the scaling process is the information of the video data transformed into the frequency domain.
- the transform method of the embodiment of the invention has excellent energy concentration performance, and the energy concentration is close to 8x8DCT, which can greatly eliminate the correlation of video data in the transform space; the computational complexity is low, and can be realized by addition and shifting, and is easy to implement in hardware. Moreover, the operation precision is high, and there is no problem that the forward and reverse transforms do not match.
- Embodiment 3
- Embodiment 3 of the present invention is an integer transform base of an 8x8 general integer transform matrix, and the method includes the following steps: 5301. Receive 8 ⁇ 8 image residual data X output by the prediction module in the encoder.
- the general integer transformation matrix of 8x8 is: kO kO kO kO kO kO kO kO kO kO kO kO kO
- kO, k5, and k6 take values 1, 2, and 1, respectively.
- k2 k3 as an integer 3 to construct a transformation matrix? .
- [6, 7, 5, 1] is selected as an integer transform base, and is constructed into a transform matrix P.
- the forward transform fast calculation method of this transform base is shown in FIG. 4, wherein the calculation module M2 is as shown in FIG. 5.
- xl x2 x3 x4 x5 x6 xl x8 represents the input four-point value of the one-dimensional forward transformation of the integer transformation
- yl y3 y4 y5 y6 yl y8 is the four-point output value of the positive transformation
- the data processing direction is from left to right, intersecting
- the value of the intermediate variable of a dot is equal to the data represented by the line on the left side of the dot; the square means the data is multiplied by a coefficient;
- the "-” means taking a negative value;
- the "2” means multiplying by 2, that is, one bit to the left;
- "4" Indicates multiplied by 4 that is, shifted to the left by two.
- ® is the point multiplication of the matrix
- Y is the 8x8 positive transformed data
- PF is the scaling matrix
- output value after the scaling process is the information of the video data transformed into the frequency domain.
- the transform method of the embodiment of the invention has excellent energy concentration performance, and the energy concentration is close to 8x8DCT, which can greatly eliminate the correlation of video data in the transform space; the computational complexity is low, and the addition can be performed by adding It is realized by shifting, easy to implement in hardware, and has high precision of operation, and there is no problem that the forward and reverse transforms do not match.
- all or part of the steps of the foregoing embodiments may be implemented by a program instruction related hardware, and the program may be stored in a computer readable storage medium.
- the storage medium may be a ROM, a RAM, a magnetic disk, an optical disk, or the like.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Discrete Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201110101417 CN102227135A (zh) | 2011-04-20 | 2011-04-20 | 一种视频编码8x8整数变换方法 |
CN201110101417.3 | 2011-04-20 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2012142777A1 true WO2012142777A1 (zh) | 2012-10-26 |
Family
ID=44808088
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2011/073586 WO2012142777A1 (zh) | 2011-04-20 | 2011-04-29 | 一种视频编码8x8整数变换方法 |
Country Status (2)
Country | Link |
---|---|
CN (2) | CN102227135A (zh) |
WO (1) | WO2012142777A1 (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104486629B (zh) * | 2014-12-16 | 2017-10-20 | 北京金山云网络技术有限公司 | H.265视频编码中变换与反变换的快速实施方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1564602A (zh) * | 2004-03-18 | 2005-01-12 | 华中科技大学 | 视频编码的整数变换矩阵选择方法及相关的整数变换方法 |
CN101931817A (zh) * | 2010-04-12 | 2010-12-29 | 中山大学 | 一种变换基自适应的变换编码方法 |
CN101977321A (zh) * | 2010-11-04 | 2011-02-16 | 深圳市融创天下科技发展有限公司 | 一种视频编码的整数变换方法 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3129392B2 (ja) * | 1996-02-02 | 2001-01-29 | 日本電気株式会社 | 2次元idct回路 |
CN1209926C (zh) * | 2003-03-26 | 2005-07-06 | 浙江大学 | 运用于图像编码和视频编码的整数变换方法和装置 |
CN1210960C (zh) * | 2003-07-03 | 2005-07-13 | 浙江大学 | 运用于图像编码和视频编码的8×8整数变换方法和装置 |
US8069201B2 (en) * | 2003-11-25 | 2011-11-29 | Texas Instruments Incorporated | 8×8 transform and quantization |
CN100477796C (zh) * | 2005-12-27 | 2009-04-08 | 中国科学院计算技术研究所 | 用于视频转换的变换系数块的转换方法 |
CN100563337C (zh) * | 2007-10-24 | 2009-11-25 | 上海广电(集团)有限公司中央研究院 | 基于avs的整数变换量化和整数反变换反量化方法 |
-
2011
- 2011-04-20 CN CN 201110101417 patent/CN102227135A/zh active Pending
- 2011-04-29 WO PCT/CN2011/073586 patent/WO2012142777A1/zh active Application Filing
- 2011-09-19 CN CN2011102783360A patent/CN102281445A/zh active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1564602A (zh) * | 2004-03-18 | 2005-01-12 | 华中科技大学 | 视频编码的整数变换矩阵选择方法及相关的整数变换方法 |
CN101931817A (zh) * | 2010-04-12 | 2010-12-29 | 中山大学 | 一种变换基自适应的变换编码方法 |
CN101977321A (zh) * | 2010-11-04 | 2011-02-16 | 深圳市融创天下科技发展有限公司 | 一种视频编码的整数变换方法 |
Also Published As
Publication number | Publication date |
---|---|
CN102227135A (zh) | 2011-10-26 |
CN102281445A (zh) | 2011-12-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102149828B1 (ko) | 비디오의 부호화 방법 및 장치, 복호화 방법 및 장치 | |
AU2005237142B2 (en) | Reversible transform for lossy and lossless 2-D data compression | |
CN108200439B (zh) | 提高数字信号变换性能的方法及数字信号变换方法和装置 | |
AU2002259268C1 (en) | Apparatus and method for encoding and computing a discrete cosine transform using a butterfly processor | |
JP2005513588A5 (zh) | ||
WO2002093750A2 (en) | Apparatus and method for decoding and computing an inverse discrete cosine transform using a butterfly processor | |
Brahimi et al. | An efficient fast integer DCT transform for images compression with 16 additions only | |
WO2012142777A1 (zh) | 一种视频编码8x8整数变换方法 | |
JP2010081618A5 (zh) | ||
CN113592966B (zh) | 图像处理方法及装置、电子设备和存储介质 | |
KR101722215B1 (ko) | 이산 여현 변환 장치 및 방법 | |
TWI468015B (zh) | 視訊解碼的平行處理裝置及方法 | |
WO2012142776A1 (zh) | 一种视频编码4x4整数变换方法 | |
KR101395143B1 (ko) | 영상처리의 정수변환 방법 및 정수변환 장치 | |
WO2012175033A1 (zh) | 图像变换处理方法、设备和系统 | |
JP4740992B2 (ja) | オーバーラップフィルタリングとコア変換を行うための方法及び装置 | |
CN206962992U (zh) | 用于数字视频编解码的3乘3整数dct变换量化器 | |
JP2006065678A5 (zh) | ||
CN113286150B (zh) | 用于视频编解码的变换编码硬件实现方法、装置及设备 | |
WO2012142792A1 (zh) | 一种视频编码16×16整数变换方法 | |
KR100790846B1 (ko) | 영상처리의 정수 변환 방법 및 그의 프로세서 | |
Saleh | Efficient mid-band exchange coefficient watermarking system | |
Li et al. | Efficient SIMD acceleration of DCT and IDCT for high efficiency video coding | |
WO2014060637A1 (en) | Image processing method, devices and system | |
Udomsiri et al. | Comparative study on recent integer DCTs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11864077 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 11864077 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 14/03/14) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 11864077 Country of ref document: EP Kind code of ref document: A1 |