WO2016043417A1 - 분리 가능한 변환에 기초하여 적응적으로 비디오 신호를 인코딩 및 디코딩하는 방법 및 장치 - Google Patents
분리 가능한 변환에 기초하여 적응적으로 비디오 신호를 인코딩 및 디코딩하는 방법 및 장치 Download PDFInfo
- Publication number
- WO2016043417A1 WO2016043417A1 PCT/KR2015/007312 KR2015007312W WO2016043417A1 WO 2016043417 A1 WO2016043417 A1 WO 2016043417A1 KR 2015007312 W KR2015007312 W KR 2015007312W WO 2016043417 A1 WO2016043417 A1 WO 2016043417A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- transform
- optimal
- subset
- group index
- unit
- Prior art date
Links
- 230000009466 transformation Effects 0.000 title claims abstract description 64
- 238000000034 method Methods 0.000 title claims abstract description 46
- 230000003044 adaptive effect Effects 0.000 claims abstract description 8
- 238000013139 quantization Methods 0.000 claims description 14
- 208000037170 Delayed Emergence from Anesthesia Diseases 0.000 claims description 13
- 238000000844 transformation Methods 0.000 abstract description 11
- 239000011159 matrix material Substances 0.000 description 25
- 238000006243 chemical reaction Methods 0.000 description 10
- 238000012545 processing Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 8
- 230000006835 compression Effects 0.000 description 7
- 238000007906 compression Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 230000005540 biological transmission Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 230000006978 adaptation Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 238000009826 distribution Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/12—Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
- H04N19/122—Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/649—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding the transform being applied to non rectangular image segments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present invention relates to a method and apparatus for processing a video signal, and more particularly, to a method and apparatus for adaptively encoding and decoding a video signal based on a separable transform.
- Compression coding refers to a set of signal processing techniques for transmitting digitized information over a communication line or for storing the digitized information in a suitable form on a storage medium.
- Media such as video, images and voice may be subject to compression coding.
- a technique for performing compression coding on video is called video compression.
- Next-generation video content is expected to provide high spatial resolution, high frame rates, and high-dimensional video representations. Processing such content requires a significant amount of memory storage capacity, memory access speed, and processing power.
- the present invention provides a method of encoding transform matrix data and reducing bitrate overhead for encoding which transform to use in each 2N lines.
- the present invention can encode a set of line transforms using graph-based signal representations for video segments (blocks, frames, etc.). Then, the transform set can be formed by adding other basic transforms such as null transform and DCT.
- the transform set may be encoded, and each transform in the transform set may be defined as an index.
- the present invention can select an optimal transform set from among transform sets for each video segment, and encode the selected optimal transform set as side information.
- the present invention ensures the flexibility to adaptively change the transform, can reduce the computational complexity, and can also complement the coding transform coefficients.
- the present invention enables faster adaptation to changing statistical characteristics in different video segments and can provide variability in performing transformations.
- the present invention can reduce the computational complexity for coding a video signal by using a fixed separable transform, and can significantly reduce the overhead in transmission and transform selection of the transform matrix.
- FIG. 1 and 2 illustrate a schematic block diagram of an encoder and a decoder for processing a video signal as one embodiment to which the present invention is applied.
- FIG. 3 is an embodiment to which the present invention is applied; A diagram illustrating sample variation of residual pixel values in a transform block.
- FIG. 4 illustrates a row transformation and a column transformation for describing a separable transform according to an embodiment to which the present invention is applied.
- FIG. 5 illustrates a row transform and a column transform for describing a separable transform to which a different transform type is applied to each row and column according to an embodiment to which the present invention is applied.
- FIG. 6 shows an example of a transform type applicable to each row and column of a separable transform in an embodiment to which the present invention is applied.
- FIG. 7 illustrates a schematic block diagram of a transform unit combining the selection of a separable transform and zero signaling in one embodiment to which the present invention is applied.
- FIGS. 8 and 9 are flowcharts illustrating a method of coding a video signal based on a separable transform selection and zero signal according to embodiments of the present invention.
- the present invention provides a method of performing adaptive video coding, comprising: group index and And Determining a transform subset that includes linear transforms of the dimension; Selecting an optimal transform subset for a transform unit from the determined transform subsets; And encoding the optimal transform subset, wherein the linear transforms correspond to at least one of a null transform and a predefined transform, wherein each row and column of the transform unit is a different linear transform. It provides a method that can have a.
- the present invention also includes calculating a transform coefficient of a residual block based on the optimal transform subset; Quantizing the transform coefficients; And encoding indices of the quantized transform coefficients.
- the optimal transform subset is selected for each of the transform blocks.
- the transform blocks are characterized by including blocks of variable size or non-square blocks.
- the method is characterized in that it is performed repeatedly for the video segment.
- the present invention also provides a method of adaptively decoding a video signal, comprising: receiving a video signal comprising an index; Extracting an index from the video signal; And performing inverse transform of the residual block based on an optimal inverse transform subset corresponding to the index.
- the present invention in the apparatus for performing adaptive video coding, the group index and And Determining a transform subset that includes linear transforms of the dimension, selecting an optimal transform subset for the transform unit of the determined transform subsets, and coding the optimal transform subset, wherein the linear transforms are null A device corresponding to at least one of a (null) transformation and a predefined transformation, wherein each of the rows and columns of the transformation unit may have a different linear transformation.
- the present invention also includes a quantization unit for quantizing transform coefficients of a residual block calculated based on the optimal transform subset; And an entropy encoding unit for encoding the group index of the quantized transform coefficients.
- the present invention also provides an apparatus for adaptively decoding a video signal, the method comprising: receiving a video signal including a group index, extracting the group index from the video signal, and an optimal inverse transform subset corresponding to the group index And an inverse transform unit for performing inverse transform of the residual block on the basis of.
- signals, data, samples, pictures, frames, blocks, etc. may be appropriately replaced and interpreted in each coding process.
- FIG. 1 and 2 illustrate a schematic block diagram of an encoder and a decoder for processing a video signal as one embodiment to which the present invention is applied.
- the encoder 100 of FIG. 1 includes a transform unit 110, a quantization unit 120, an inverse quantization unit 130, an inverse transform unit 140, a buffer 150, a prediction unit 160, and an entropy encoding unit ( 170).
- the encoder 100 receives a video signal and generates a prediction error by subtracting the predicted signal output from the prediction unit 160 from the video signal.
- the generated prediction error is transmitted to the transform unit 110.
- the transform unit 110 generates a transform coefficient by applying a transform scheme to the prediction error.
- the present invention can be applied to a conventional form of video coding combining prediction and linear transformation.
- the traditional conversion process involves a block of pixels having the same size as a square (e.g., Pixel blocks).
- Pixel blocks e.g., Pixel blocks
- the present invention not only expands the selection of pixel blocks to be transformed, but also allows for blocks of variable size other than square.
- Equation 1 Consider a case of processing a block of residual signal values (that is, a value obtained by subtracting a prediction pixel value) from a matrix.
- the linear transform of the matrix R of Equation 1 may be defined in a fixed separable form as in Equation 2.
- C represents a transform coefficient matrix
- U and V are each And Represents an orthogonal transform of a dimension.
- the transform coefficient matrix Prior to coding, the transform coefficient matrix is quantized to Can be generated.
- the residual matrix reconstructed by the decoder may be calculated using an inverse transform as shown in Equation 3 below.
- the transform coefficient matrix C is Can be calculated by conference operation (multiplication and multiplication).
- U and V correspond to Discrete Cosine Transform (DCT)
- DCT Discrete Cosine Transform
- the quantization unit 120 quantizes the transform coefficients and transmits the quantized coefficients to entropy encoding unit 170.
- the entropy encoding unit 170 performs entropy coding on the quantized coefficients and outputs an entropy coded signal.
- the quantized signal output by the quantization unit 120 may be used to generate a prediction signal.
- the inverse quantization unit 130 and the inverse transform unit 140 in the encoder 100 loop may perform inverse quantization and inverse transformation on the quantized signal so that the quantized signal is restored to a prediction error. have.
- the reconstructed signal may be generated by adding the reconstructed prediction error to the prediction signal output by the prediction unit 160.
- the buffer 150 may store the reconstructed signal for future reference of the prediction unit 160.
- the prediction unit 160 may generate a prediction signal using a signal previously restored and stored in the buffer 150.
- the decoder 200 of FIG. 2 includes an entropy decoding unit 210, an inverse quantization unit 220, an inverse transform unit 230, a buffer 240, and a prediction unit 250.
- the decoder 200 of FIG. 2 receives a signal output by the encoder 100 of FIG. 1.
- the entropy decoding unit 210 performs entropy decoding on the received signal.
- the inverse quantization unit 220 obtains a transform coefficient from the entropy decoded signal based on the information on the quantization step size.
- the inverse transform unit 230 obtains a prediction error by performing an inverse transform on the transform coefficients.
- the reconstructed signal is generated by adding the obtained prediction error to the prediction signal output by the prediction unit 250.
- the buffer 240 stores the reconstructed signal for future reference of the prediction unit 250.
- the prediction unit 250 generates a prediction signal using a signal previously restored and stored in the buffer 240.
- the prediction method to which the present invention is applied will be used for both the encoder 100 and the decoder 200.
- FIG. 3 is an embodiment to which the present invention is applied; A diagram illustrating sample variation of residual pixel values in a transform block.
- Equation 2 The main problem in the definition of fixed and separable block linear transformation, such as Equation 2, is that all residual blocks can have the same isotropic statistical properties. In practice, however, quite different distributions are observed, depending on the video type as shown in FIG. 3, or depending on the prediction used for that pixel block.
- One way to take advantage of distribution variations for residual blocks and to obtain better compression is to use a different linear transform for each block, ie adaptively apply a linear transform.
- the residual blocks are divided into a certain number of classes (remaining block classification)
- the statistics for the blocks of each class are collected and the Karhunen-Lo'eve Transform for the class is collected.
- KLT can be calculated and a transform corresponding to the classification can be applied to each block.
- the present invention can change the display to indicate its general form. While the present invention scans the matrices R and C with row centers, it may be considered that p and f are defined as MN dimensional vectors as shown in Equation 4.
- Equation 5 the present invention can be expressed as shown in Equation 5.
- the present invention uses the non-separable transforms of Equation 5 to calculate C from R. May require operation The complexity of this operation can be significantly greater than in the separable embodiment of equation (2).
- the following method can be proposed as a method of implementing the adaptive transformation to which the present invention is applied.
- the first embodiment uses different transforms ⁇ using only information available at the encoder and decoder ⁇ ⁇ Is calculated and selected.
- the encoder has different transforms ⁇ ⁇ , Calculate and select, and send the decoder all the transformation matrices and information about which transform to use for each block.
- a third embodiment is a mixture of the two approaches, in which the encoder makes a decision about the transform, but the encoder and decoder share information to minimize the overhead required for coding transform data.
- the first embodiment may be more suitable for data with consistent statistical characteristics, while the second embodiment may have the overhead of encoding the entire dense matrix over the low bitrate required for the coding set of the dense residual signal. It is very large and can be applied in simple cases.
- the present invention may provide another embodiment to overcome the problems of the above embodiments.
- FIG. 4 illustrates a row transformation and a column transformation for describing a separable transform according to an embodiment to which the present invention is applied.
- the present invention can be designed as follows to solve the problems of the above embodiments.
- the overhead used for transmission and transform selection of transform matrix data should be small in order to benefit from overall coding.
- a separable transform may be defined as follows.
- Figure 4 (a) is 4 shows a row transform applied to the block
- FIG. Represents a column transform applied to a block.
- FIG. 4A it can be seen that the DCT transformation matrix is applied to each row in the same manner, and in FIG. 4B, the DCT transformation matrix is equally applied to each column.
- FIG. 5 illustrates a row transform and a column transform for describing a separable transform to which a different transform type is applied to each row and column according to an embodiment to which the present invention is applied.
- the present invention procession Instead of using, as in Equations 6 and 7, And Orthogonal matrix can be used.
- Matrix sets can be used sequentially to transform the rows and columns of R, thereby obtaining C.
- the entire process at the encoder can be defined as the sequence of the following operations, as in Equations 8-12.
- Equation 8 indicates obtaining a vector from a matrix row
- Equation 9 indicates performing a horizontal transformation
- Equation 10 indicates obtaining a vector from a transformed column.
- Equation 11 vertical transformation is performed.
- Equation 12 shows that the matrix column from the vector is calculated.
- the decoder And Inverse transform can be performed using.
- the maximum number of operations for the inverse transform is This can be
- the matrix of may be composed of a large number of zeros. Alternatively, every element may have a block equal to zero. Accordingly, the present invention seeks to provide a more general method.
- the null transform is not used during the actual transform, but is instead used to send a signal to the decoder whose signal is treated as 0, and thus not affected by any linear transform.
- another embodiment of the present invention may define a separable transform to which a different transform type is applied to each row and column.
- Figure 5 (a) is Row transform applied to the block, and FIG. Represents a column transform applied to a block.
- FIG. 5A it can be seen that different transformation matrices are applied to each row, and in FIG. 5B, different transformation matrices are applied to each column.
- the DCT transform is in the first row
- the null transform is in the second row
- the DST transform is in the third row
- the DCT transform is in the fourth row
- the KLT is in the i row.
- the transformation can be applied.
- the DCT transform is performed in the first column
- the null transform is in the second column
- the DST transform is in the third column
- the DCT transform is in the fourth column
- the KLT transform is in the i column. Can be applied.
- FIG. 6 shows an example of a transform type applicable to each row and column of a separable transform in an embodiment to which the present invention is applied.
- the present invention defines a separable transform to which a different transform type is applied to each row and column, wherein the other transform type may be defined by a transform type identifier.
- the other transform type may include at least one of a null transform and a predefined transform.
- the predefined transform may be a discrete cosine transform (DCT), an asymmetric disc sine transform (ADST), a discrete sine transform (DST), a discrete fourier transform (DFT), a Karhunen-Lo'eve Transform (KLT), or the like. It may include.
- the present invention is to identify the type of transformation to be applied to each row and column Can be defined.
- Is a null transform 1 is a discrete cosine transform (DCT)
- 2 is a discrete sine transform (DST)
- 3 is a Karhunen-Lo'eve Transform (KLT)
- 4 is a Discrete Fourier (DFT) Transform). It is also possible to define a reserved area for adding other transformation types.
- FIG. 7 illustrates a schematic block diagram of a transform unit combining the selection of a separable transform and zero signaling in one embodiment to which the present invention is applied.
- the transform unit 110 to which the present invention is applied includes a transform encoding unit 111, a transform adding unit 112, a transform selecting unit 113, and an index generating unit 114.
- the present invention provides a progressive coding scheme that is repeated for video segments (blocks, frames, etc.).
- the transform encoding unit 111 may have a size based on, for example, a graph Laplacian. And (or In this case, a set of orthogonal line transforms (one size) may be encoded.
- the transform adding unit 112 may form two transform sets as shown in Equation 13 below by adding a null transform and a predefined transform.
- Equation 13 G represents a set of transforms for a row, and H represents a set of transforms for a column.
- the row transformation set G is Conversions May include, and the thermal conversion set H is Conversions It may include, the Transformation elements may be different transformation matrices.
- the row transform set G and the column transform set H may already be stored in at least one of an encoder and a decoder, or may be inferred from other coding information.
- the row transform set G and the column transform set H may be encoded and transmitted to a decoder.
- the index information corresponding to the conversion table stored in at least one of the encoder and the decoder may be transmitted to the decoder, and the decoder may generate the row transformation set G and the column transformation set H based on the received index information.
- a transform set may be defined by encoding an index array of transmitted transforms.
- the index array may be expressed as an index set as shown in Equation 14 below.
- Equation 14 Represents an index set corresponding to a row transform, Denotes a set of indices corresponding to a column transform. And, ego, Indicates the index corresponding to each row conversion, Denotes an index corresponding to each column transformation, and k denotes a group index.
- Equation 15 The relationship between the index set for each row and column and the corresponding transform set may be defined as in Equation 15 below.
- M sets of row transformations can be defined
- N sets of thermal transformations can be defined.
- each of the M row transformations may correspond to any one of a predefined row transformation set.
- each of the M row transformations may be included in the row transformation set G of Equation 13. Conversions It may correspond to either.
- each of the N thermal transformations may correspond to any one of a predefined thermal transformation set.
- each of the N column transformations may be represented by Conversions It may correspond to either.
- the transform selection unit 113 sends the An optimal row / column transformation set may be selected from among row / column transformation sets, and the index generation unit 114 may encode a group index k corresponding to the optimal row / column transformation set.
- the optimal row / column transformation set may be selected based on a rate-distortion (RD) cost function.
- RD rate-distortion
- the difference between the transmitted pattern and the actual pattern e.g., more use of the null transform
- the difference between the transmitted pattern and the actual pattern can be coded immediately after encoding the group index k.
- transform C of the residual block R can be calculated using the sequence of operations of equation (8).
- the quantization unit 120 then quantizes transform C Obtain and quantize to an integer You can encode the group index of.
- the decoder can be defined by simply executing the encoder operation in reverse except for searching for the optimal group index k.
- the decoding process will be described in more detail with reference to FIG. 9.
- FIGS. 8 and 9 are flowcharts illustrating a method of coding a video signal based on a combination of separable transform selection and zero signal processing according to an embodiment to which the present invention is applied.
- a method for performing adaptive video encoding based on a combination of separable transform selection and zero signal processing.
- the encoder And An orthogonal transform of the dimension may be encoded (S810).
- the And Orthogonal transformation of dimensions can be based on graph laplacian.
- the encoder may generate a separate orthogonal transform set by adding at least one of a null transform and a predefined transform (S820).
- the null transform and the predefined transform may be a transform type identifier, ), And the encoder can increase transmission efficiency by coding and transmitting a transform type identifier.
- the encoder may select an optimal transform set that minimizes the rate-distortion (RD) cost (S830).
- the optimal transform set may be selected for each transform block.
- the transform blocks may include blocks of variable size or non-square blocks.
- the encoder may encode a group index corresponding to the optimal transform set (S840).
- the group index may be defined as in Equation 14.
- the orthogonal transforms And The size of the group index Index arrays are encoded.
- the process can be performed repeatedly for the video segment.
- a method for performing adaptive video decoding based on separable transform selection and zero signaling processing.
- the decoder may receive a video signal including a group index (S910), and extract a group index from the video signal (S920).
- the decoder may acquire an inverse transform set corresponding to the extracted group index.
- the inverse transform set may correspond to an optimal transform set selected by an encoder.
- the inverse transform set may already be stored in at least one of an encoder and a decoder, in which case the inverse transform set may be retrieved from where it is stored in the decoder using the group index.
- the decoder may perform entropy decoding and inverse quantization on the received video signal to obtain inverse quantized transform coefficients.
- the dequantized transform coefficients may mean coefficients transformed based on an optimal transform set selected by the encoder.
- the decoder may perform inverse transformation on a residual signal based on the inverse-transform set (S930).
- the residual signal may mean the dequantized transform coefficient.
- the inverse transform set may correspond to any one of a null transform and a separate transform set to which a predefined transform is added.
- the residual signal inversely transformed as described above may be added to the prediction signal to generate a reconstruction signal.
- the embodiments described herein may be implemented and performed on a processor, microprocessor, controller, or chip.
- the functional units illustrated in FIGS. 1, 2, and 7 may be implemented and performed on a computer, a processor, a microprocessor, a controller, or a chip.
- the decoder and encoder to which the present invention is applied include a multimedia broadcasting transmitting and receiving device, a mobile communication terminal, a home cinema video device, a digital cinema video device, a surveillance camera, a video chat device, a real time communication device such as video communication, a mobile streaming device, a storage medium.
- Camcorders video on demand (VoD) service providing devices, internet streaming service providing devices, three-dimensional (3D) video devices, video telephony video devices, and medical video devices, and the like, which may be used to process video signals and data signals. Can be.
- the decoding / encoding method to which the present invention is applied can be produced in the form of a program executed by a computer, and stored in a computer-readable recording medium.
- Multimedia data having a data structure according to the present invention can also be stored in a computer-readable recording medium.
- the computer readable recording medium includes all kinds of storage devices for storing computer readable data.
- the computer-readable recording medium may include, for example, a Blu-ray disc (BD), a universal serial bus (USB), a ROM, a RAM, a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device. Can be.
- the computer-readable recording medium also includes media embodied in the form of a carrier wave (eg, transmission over the Internet).
- the bit stream generated by the encoding method may be stored in a computer-readable recording medium or transmitted through a wired or wireless communication network.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Discrete Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (15)
- 적응적 비디오 코딩을 수행하는 방법에 있어서,그룹 인덱스와 MxM 및 NxN 차원의 선형 변환들을 포함하는 변환 서브셋을 결정하는 단계;상기 결정된 변환 서브셋들 중 변환 유닛에 대한 최적의 변환 서브셋을 선택하는 단계; 및상기 최적의 변환 서브셋을 인코딩하는 단계를 포함하되,상기 선형 변환들은 널(null) 변환과 사전에 정의된 변환들 중 적어도 하나에 대응되고,상기 변환 유닛의 행과 열 각각은 서로 다른 선형 변환을 가질 수 있는 것을 특징으로 하는 방법.
- 제1항에 있어서,상기 최적의 변환 서브셋에 기초하여 잔여 블록(residual block)의 변환 계수를 산출하는 단계;상기 변환 계수를 양자화하는 단계; 및상기 양자화된 변환 계수의 그룹 인덱스를 인코딩하는 단계를 더 포함하는 것을 특징으로 하는 방법.
- 제1항에 있어서,상기 최적의 변환 서브셋은 변환 블록들 각각에 대해 선택되는 것을 특징으로 하는 방법.
- 제3항에 있어서,상기 변환 블록들은 가변 크기의 블록들 또는 정사각형이 아닌 블록들을 포함하는 것을 특징으로 하는 방법.
- 제1항에 있어서,상기 방법은 비디오 세그먼트에 대해 반복적으로 수행되는 것을 특징으로 하는 방법.
- 비디오 신호를 적응적으로 디코딩하는 방법에 있어서,그룹 인덱스를 포함하는 비디오 신호를 수신하는 단계;상기 비디오 신호로부터 상기 그룹 인덱스를 추출하는 단계; 및상기 그룹 인덱스에 대응되는 최적의 역변환 서브셋에 기초하여 잔여 블록의 역변환을 수행하는 단계를 포함하는 것을 특징으로 하는 방법.
- 제6항에 있어서,상기 최적의 역변환 서브셋은 변환 블록들 각각에 대응되는 것을 특징으로 하는 방법.
- 제7항에 있어서,상기 변환 블록은 가변 크기의 블록 또는 정사각형이 아닌 블록을 포함하는 것을 특징으로 하는 방법.
- 적응적 비디오 코딩을 수행하는 장치에 있어서,그룹 인덱스와 MxM 및 NxN 차원의 선형 변환들을 포함하는 변환 서브셋을 결정하고, 상기 결정된 변환 서브셋들 중 변환 유닛에 대한 최적의 변환 서브셋을 선택하며, 상기 최적의 변환 서브셋을 코딩하는 변환 유닛을 포함하되,상기 선형 변환들은 널(null) 변환과 사전에 정의된 변환들 중 적어도 하나에 대응되고,상기 변환 유닛의 행과 열 각각은 서로 다른 선형 변환을 가질 수 있는 것을 특징으로 하는 장치.
- 제9항에 있어서,상기 최적의 변환 서브셋에 기초하여 산출된 잔여 블록(residual block)의 변환 계수를 양자화하는 양자화 유닛; 및상기 양자화된 변환 계수의 그룹 인덱스를 인코딩하는 엔트로피 인코딩 유닛을 더 포함하는 것을 특징으로 하는 장치.
- 제9항에 있어서,상기 최적의 변환 서브셋은 변환 블록들 각각에 대해 선택되는 것을 특징으로 하는 장치.
- 제11항에 있어서,상기 변환 블록들은 가변 크기의 블록들 또는 정사각형이 아닌 블록들을 포함하는 것을 특징으로 하는 장치.
- 비디오 신호를 적응적으로 디코딩하는 장치에 있어서,그룹 인덱스를 포함하는 비디오 신호를 수신하고, 상기 비디오 신호로부터 상기 그룹 인덱스를 추출하며, 상기 그룹 인덱스에 대응되는 최적의 역변환 서브셋에 기초하여 잔여 블록의 역변환을 수행하는 역변환 유닛을 포함하는 것을 특징으로 하는 장치.
- 제13항에 있어서,상기 최적의 역변환 서브셋은 변환 블록들 각각에 대응되는 것을 특징으로 하는 장치.
- 제14항에 있어서,상기 변환 블록은 가변 크기의 블록 또는 정사각형이 아닌 블록을 포함하는 것을 특징으로 하는 장치.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/512,428 US20170280140A1 (en) | 2014-09-19 | 2015-07-14 | Method and apparatus for adaptively encoding, decoding a video signal based on separable transform |
KR1020167036420A KR20170058335A (ko) | 2014-09-19 | 2015-07-14 | 분리 가능한 변환에 기초하여 적응적으로 비디오 신호를 인코딩 및 디코딩하는 방법 및 장치 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462052469P | 2014-09-19 | 2014-09-19 | |
US62/052,469 | 2014-09-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016043417A1 true WO2016043417A1 (ko) | 2016-03-24 |
Family
ID=55533432
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2015/007312 WO2016043417A1 (ko) | 2014-09-19 | 2015-07-14 | 분리 가능한 변환에 기초하여 적응적으로 비디오 신호를 인코딩 및 디코딩하는 방법 및 장치 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20170280140A1 (ko) |
KR (1) | KR20170058335A (ko) |
WO (1) | WO2016043417A1 (ko) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA3088026C (en) * | 2017-12-15 | 2023-10-03 | Lg Electronics Inc. | Image coding method on basis of transformation and device therefor |
CN112106373A (zh) * | 2018-03-28 | 2020-12-18 | 韩国电子通信研究院 | 用于图像编/解码的方法和装置及存储比特流的记录介质 |
WO2020013514A1 (ko) * | 2018-07-08 | 2020-01-16 | 엘지전자 주식회사 | 비디오 신호를 처리하기 위한 방법 및 장치 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20110115986A (ko) * | 2010-04-16 | 2011-10-24 | 에스케이 텔레콤주식회사 | 영상 부호화/복호화 장치 및 방법 |
US20120201303A1 (en) * | 2009-10-23 | 2012-08-09 | Huawei Technologies Co., Ltd. | Method and device for encoding and decoding videos |
US20130114730A1 (en) * | 2011-11-07 | 2013-05-09 | Qualcomm Incorporated | Coding significant coefficient information in transform skip mode |
KR20130098360A (ko) * | 2010-09-08 | 2013-09-04 | 삼성전자주식회사 | 내부 예측을 위한 적응적 이산 코사인 변환 및 이산 사인 변환을 이용한 낮은 복잡도의 변환 코딩 |
KR101403338B1 (ko) * | 2007-03-23 | 2014-06-09 | 삼성전자주식회사 | 영상의 부호화, 복호화 방법 및 장치 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5825422A (en) * | 1995-12-29 | 1998-10-20 | Daewoo Electronics Co. Ltd. | Method and apparatus for encoding a video signal based on inter-block redundancies |
US8488672B2 (en) * | 2007-04-17 | 2013-07-16 | Qualcomm Incorporated | Mode uniformity signaling for intra-coding |
US8929455B2 (en) * | 2011-07-01 | 2015-01-06 | Mitsubishi Electric Research Laboratories, Inc. | Method for selecting transform types from mapping table for prediction modes |
US10142642B2 (en) * | 2014-06-04 | 2018-11-27 | Qualcomm Incorporated | Block adaptive color-space conversion coding |
-
2015
- 2015-07-14 US US15/512,428 patent/US20170280140A1/en not_active Abandoned
- 2015-07-14 KR KR1020167036420A patent/KR20170058335A/ko not_active Application Discontinuation
- 2015-07-14 WO PCT/KR2015/007312 patent/WO2016043417A1/ko active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101403338B1 (ko) * | 2007-03-23 | 2014-06-09 | 삼성전자주식회사 | 영상의 부호화, 복호화 방법 및 장치 |
US20120201303A1 (en) * | 2009-10-23 | 2012-08-09 | Huawei Technologies Co., Ltd. | Method and device for encoding and decoding videos |
KR20110115986A (ko) * | 2010-04-16 | 2011-10-24 | 에스케이 텔레콤주식회사 | 영상 부호화/복호화 장치 및 방법 |
KR20130098360A (ko) * | 2010-09-08 | 2013-09-04 | 삼성전자주식회사 | 내부 예측을 위한 적응적 이산 코사인 변환 및 이산 사인 변환을 이용한 낮은 복잡도의 변환 코딩 |
US20130114730A1 (en) * | 2011-11-07 | 2013-05-09 | Qualcomm Incorporated | Coding significant coefficient information in transform skip mode |
Also Published As
Publication number | Publication date |
---|---|
KR20170058335A (ko) | 2017-05-26 |
US20170280140A1 (en) | 2017-09-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020036417A1 (ko) | 히스토리 기반 움직임 벡터에 기반한 인터 예측 방법 및 그 장치 | |
WO2012002785A2 (ko) | 화면내 예측 부호화를 위한 영상 부호화/복호화 장치 및 방법 | |
WO2018174402A1 (ko) | 영상 코딩 시스템에서 변환 방법 및 그 장치 | |
WO2010038961A2 (ko) | 복수 개의 움직임 벡터 추정을 이용한 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2011031030A2 (ko) | 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2017057953A1 (ko) | 비디오 코딩 시스템에서 레지듀얼 신호 코딩 방법 및 장치 | |
WO2020017840A1 (ko) | Dmvr에 기반하여 인터 예측을 수행하는 방법 및 장치 | |
WO2013109039A1 (ko) | 가중치예측을 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2017043766A1 (ko) | 비디오 부호화, 복호화 방법 및 장치 | |
WO2015057033A1 (ko) | 3d 비디오 부호화/복호화 방법 및 장치 | |
WO2020141928A1 (ko) | 영상 코딩 시스템에서 mmvd 에 따른 예측에 기반한 영상 디코딩 방법 및 장치 | |
WO2020262951A1 (ko) | 동영상 데이터의 인트라 예측 코딩을 위한 방법 및 장치 | |
WO2020009390A1 (ko) | 영상 코딩 시스템에서 인터 예측에 따른 영상 처리 방법 및 장치 | |
WO2016140439A1 (ko) | 향상된 예측 필터를 이용하여 비디오 신호를 인코딩, 디코딩하는 방법 및 장치 | |
WO2011111954A2 (ko) | 움직임 벡터 해상도 조합을 이용한 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2016064242A1 (ko) | 그래프 템플릿으로부터 유도된 변환을 이용하여 비디오 신호를 디코딩/인코딩하는 방법 및 장치 | |
WO2013133648A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2016043417A1 (ko) | 분리 가능한 변환에 기초하여 적응적으로 비디오 신호를 인코딩 및 디코딩하는 방법 및 장치 | |
WO2020149630A1 (ko) | 영상 코딩 시스템에서 cclm 예측 기반 영상 디코딩 방법 및 그 장치 | |
WO2019059721A1 (ko) | 해상도 향상 기법을 이용한 영상의 부호화 및 복호화 | |
WO2011071316A2 (ko) | 영상 부호화 장치 및 방법, 및 거기에 이용되는 변환 부호화 장치 및 방법, 변환기저 생성장치 및 방법, 및 영상 복호화 장치 및 방법 | |
WO2019212230A1 (ko) | 영상 코딩 시스템에서 블록 사이즈에 따른 변환을 사용하는 영상 디코딩 방법 및 그 장치 | |
WO2019209026A1 (ko) | 비디오 코딩 시스템에서 인터 예측 방법 및 장치 | |
WO2017030418A1 (ko) | 다중 그래프 기반 모델에 따라 최적화된 변환을 이용하여 비디오 신호를 인코딩/ 디코딩하는 방법 및 장치 | |
WO2014073877A1 (ko) | 다시점 비디오 신호의 처리 방법 및 이에 대한 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15841165 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 20167036420 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 15512428 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15841165 Country of ref document: EP Kind code of ref document: A1 |